How Alibaba Cloud Milvus Achieves 20× Faster Billion‑Scale Vector Search with DiskANN and RaBitQ

Alibaba Cloud Milvus combines DiskANN graph indexing with the RaBitQ quantization algorithm, delivering over 20× higher QPS, P99 latency below 10% of the baseline, 29% lower memory usage and more than 98% recall on a 100 million‑vector, 768‑dimensional benchmark, while also cutting index build time from 20 h to about 6 h.

DiskANN, Milvus, RaBitQ

7 min read

DataFunSummit

May 26, 2026 · Artificial Intelligence

Building an Evolvable Context Layer for Agents with ContextSearch

The article explains how ContextSearch transforms enterprise search from simple document retrieval into an agentic, multi‑source, runtime‑driven context layer that can understand constraints, gather evidence, verify results, and continuously evolve through trace‑backed optimization.

ContextSearch, DiskANN, OpenSearch

14 min read

AI2ML AI to Machine Learning

Feb 27, 2026 · Artificial Intelligence

Why No Single Algorithm Dominates Vector Search: A Deep Dive into Modern Vector DBs

The article surveys emerging vector databases, explains how various vector‑search algorithms such as FLAT, IVF, HNSW, DiskANN and ScaNN differ in accuracy, speed, memory use and build time, and provides practical guidance for choosing the right index based on data size, latency and resource constraints.

DiskANN, HNSW, ScaNN

9 min read

Volcano Engine Developer Services

Oct 20, 2025 · Artificial Intelligence

How DiskANN + RaBitQ Supercharges Milvus: 5× Faster, 90% Cheaper Vector Search

This article explains how integrating the disk‑based DiskANN index with the ultra‑compact RaBitQ quantization dramatically boosts Milvus's vector search performance and cuts costs, delivering over five times higher QPS and more than 90% cost reduction for billion‑scale AI workloads.

AI, Cost Reduction, DiskANN

11 min read

Volcano Engine Developer Services

Aug 20, 2024 · Databases

How Vector Databases Power RAG: Scaling, Algorithms, and Real‑World Trade‑offs

RAG technology leverages vector databases to provide context‑aware answers without updating model parameters, and this article explores how cloud search teams integrate multiple vector algorithms, balance cost, stability and latency, and adopt open‑source solutions like OpenSearch to build scalable, enterprise‑grade retrieval systems.

AI, DiskANN, OpenSearch

21 min read
