Neural Approximate Nearest Neighbor (NANN): Open‑Source Large‑Scale Retrieval with Arbitrary Complex Models
Alibaba’s open‑source Neural Approximate Nearest Neighbor (NANN) library decouples index learning from model training, so any TensorFlow‑based deep model can serve high‑throughput, high‑accuracy HNSW‑based retrieval. It adds GPU multi‑streaming, XLA acceleration, graph‑level optimizations, and adversarial training that mitigates the mismatch between L2 distance and neural similarity scores, and it ships with ready‑to‑use benchmarks and demos.
Background: Large‑scale information retrieval is a core problem in advertising. Alibaba’s NANN (Neural Approximate Nearest Neighbor) project provides an open‑source solution that decouples index learning from model training, enabling arbitrarily complex models to be used for candidate recall.
Core functions: NANN is built on native TensorFlow and offers high‑performance HNSW‑based search, GPU multi‑streaming, XLA acceleration, and graph‑level optimizations. It decouples index construction from model training, applies adversarial training to preserve retrieval quality, and provides ready‑to‑use benchmarks and demo pipelines.
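The decoupling described above can be pictured as a two‑stage pipeline: an ANN index built once over item embeddings generates candidates, and an arbitrary neural scorer reranks them. The following is a minimal sketch of that idea, not NANN's actual code: NANN uses HNSW and TensorFlow, while this toy uses exact L2 search and a bilinear scorer as stand‑ins, and all names (`neural_score`, `retrieve`, the corpus sizes) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy corpus of item embeddings. In NANN the candidate-generation index
# is HNSW; exact L2 search is used here only to keep the sketch short.
n_items, dim = 10_000, 32
items = rng.standard_normal((n_items, dim)).astype(np.float32)

# Hypothetical "arbitrarily complex model": any scorer f(user, item)
# works, because the index only depends on the item embeddings
# (this is the decoupling of index construction from model training).
W = rng.standard_normal((dim, dim)).astype(np.float32) * 0.1

def neural_score(user, cands):
    # A tiny bilinear scorer standing in for a deep network.
    return cands @ (W @ user)

def retrieve(user, k_ann=200, k_final=10):
    # Stage 1: candidate generation by L2 distance (HNSW in NANN).
    d2 = ((items - user) ** 2).sum(axis=1)
    cand_idx = np.argpartition(d2, k_ann)[:k_ann]
    # Stage 2: rerank the candidates with the neural model.
    scores = neural_score(user, items[cand_idx])
    order = np.argsort(-scores)[:k_final]
    return cand_idx[order]

user = rng.standard_normal(dim).astype(np.float32)
top = retrieve(user)
```

Because stage 2 is an opaque function of the candidates, the model can be retrained or swapped without rebuilding the index.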
User‑friendly aspects: The library works entirely within the TensorFlow ecosystem, separates model inference from retrieval, and includes a simple performance testing tool.
Algorithm updates: Recent work adds adversarial training to mitigate the mismatch between the L2 distance used in index construction and the neural similarity scores used for ranking, significantly reducing recall loss for complex models, as demonstrated on internal and public datasets (CIKM 2022 paper).
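The intuition behind such adversarial training is to make the scorer stable under small L2 moves of the embedding, so that neighbors under the index metric also score similarly under the model. A generic FGSM‑style sketch of that idea follows; this is not the procedure from the CIKM 2022 paper, and the linear scorer, `eps` value, and function names are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
dim = 16

# Toy linear scorer standing in for a deep model's item tower.
w = rng.standard_normal(dim)

def score(x):
    return x @ w

def grad_score(x):
    # Gradient of the linear scorer w.r.t. the item embedding;
    # for a real network this would come from autodiff.
    return np.broadcast_to(w, x.shape)

def adversarial_examples(x, eps=0.05):
    # FGSM-style step: perturb each embedding inside a small box in the
    # direction that changes the score most. Training the model on these
    # perturbed points encourages scores to vary smoothly within an L2
    # neighborhood, aligning neural similarity with the index metric.
    return x + eps * np.sign(grad_score(x))

x = rng.standard_normal((4, dim))
x_adv = adversarial_examples(x)
```

During training, the loss would be evaluated on `x_adv` in addition to (or instead of) `x`; only the perturbation step is sketched here.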
Conclusion: NANN delivers high‑accuracy, high‑throughput KNN search for any deep model and is released under an open‑source license (https://github.com/alibaba/nann). The associated paper is available on arXiv.
Alimama Tech
Official Alimama tech channel, showcasing all of Alimama's technical innovations.