Skip to content

IVFSparse Improvements #943

@davidmezzetti

Description

@davidmezzetti

Update IVFSparse with the following improvements for speed and accuracy.

  • Build k-means clusters using top n most important features from L2 norm vector
  • Prune small clusters and merge them into larger clusters
  • Calculate block summary vectors using max values per column (block max summary vector)
  • Add threaded batch search (majority of code is executed outside of the GIL with SciPy/NumPy, someday won't matter with GIL-less Python)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions