[ 
https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17314413#comment-17314413
 ] 

Julie Tibshirani commented on LUCENE-9855:
------------------------------------------

Thank you [~tomoko] for continuing to move this forward! I hope this doesn't 
distract from our focus, just wanted to clarify an aspect. I think we may be 
discussing two separate questions at once:
 1. What should be the name of the single new format interface we're 
introducing (currently called {{VectorFormat}})? Based on LUCENE-9322 and other 
discussions, there was consensus for making the API is general enough to allow 
different approaches to NN 'under the hood'. So it seems good to not mention a 
specific NN algorithm in the interface name. For me this is the focus of this 
current issue, finding a better name for the format interface.
 2. What should be our strategy to support multiple NN implementations? The 
current design is to have a single format implementation like 
{{Lucene90VectorsFormat}} that can handle multiple approaches. But perhaps each 
approach should have its own format implementation instead, like 
{{Lucene90HnswVectorsFormat}} or {{Lucene90ClusterVectorsFormat}}, etc. This 
would affect the name of the current HNSW format implementation (but not the 
interface). Based on recent emails, there doesn't seem to be consensus on this 
question. Maybe we could spin off a new issue to discuss further?


> Reconsider codec name VectorFormat
> ----------------------------------
>
>                 Key: LUCENE-9855
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9855
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/codecs
>    Affects Versions: main (9.0)
>            Reporter: Tomoko Uchida
>            Assignee: Tomoko Uchida
>            Priority: Blocker
>
> There is some discussion about the codec name for ann search.
> https://lists.apache.org/thread.html/r3a6fa29810a1e85779de72562169e72d927d5a5dd2f9ea97705b8b2e%40%3Cdev.lucene.apache.org%3E
> Main points here are 1) use plural form for consistency, and 2) use more 
> specific name for ann search (second point could be optional).
> A few alternatives were proposed:
> - VectorsFormat
> - VectorValuesFormat
> - NeighborsFormat
> - DenseVectorsFormat



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to