Meta and NVIDIA collaborated to accelerate vector search on GPUs by integrating NVIDIA cuVS into Faiss v1.10.
NVIDIA cuVS outperforms classic GPU-accelerated search for IVF indexing, reducing build times by up to 4.7x and search latency by up to 8.1x.
For graph indexing, CUDA ANN Graph (CAGRA) outperforms CPU HNSW build times by up to 12.3x and reduces search latency by up to 4.7x.
Faiss 1.10.0 includes NVIDIA cuVS algorithms, offering users the choice between Faiss classic GPU implementations and newer cuVS algorithms for efficient vector search.