-
Notifications
You must be signed in to change notification settings - Fork 25.8k
Investigate various implementations of ann search for vector fields #42326
Copy link
Copy link
Closed
Labels
:Search Relevance/VectorsVector searchVector search>featureTeam:Search RelevanceMeta label for the Search Relevance team in ElasticsearchMeta label for the Search Relevance team in Elasticsearchfeedback_neededstalled
Description
ann (approximate nearest neighbours) will be a licensed feature of Elasticsearch (not OSS).
We plan to implement prototypes of various algorithms for ann for different distance metrics:
- LSH and multiprobe LSH for euclidean distance
- partition trees for euclidean/cosine distance
- clustering-based approaches, including product quantization
We are interested in users' feedback about:
- application domains
- what distance metrics are used (euclidean vs cosine vs Hamming etc).
- what are data types in vectors (vectors of integers, bits, longs, floats etc).
We have decided to adopt Lucene implementations on ann search, so development of ann search is moved here. Relevant Lucene issues: https://issues.apache.org/jira/browse/LUCENE-9004, https://issues.apache.org/jira/browse/LUCENE-9322, https://issues.apache.org/jira/browse/LUCENE-9136
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
:Search Relevance/VectorsVector searchVector search>featureTeam:Search RelevanceMeta label for the Search Relevance team in ElasticsearchMeta label for the Search Relevance team in Elasticsearchfeedback_neededstalled
Type
Fields
Give feedbackNo fields configured for issues without a type.