Tagged articles

8 articles

Page 1 of 1

May 21, 2026 · Artificial Intelligence

Essential Ranking Techniques Every RAG Engineer Must Know

This article explains why ranking is the decisive factor behind successful Retrieval‑Augmented Generation (RAG) pipelines, walks through pointwise, pairwise, and listwise learning‑to‑rank paradigms, details key algorithms such as LambdaMART, compares cross‑encoders with bi‑encoders, and provides practical guidance on metrics, production‑grade rerankers, model fine‑tuning, and framework integration.

Bi-EncoderCross-EncoderLLM

0 likes · 22 min read

Essential Ranking Techniques Every RAG Engineer Must Know

James' Growth Diary

May 20, 2026 · Artificial Intelligence

Boosting RAG Retrieval Quality with Cohere Rerank and Cross‑Encoder

After achieving high recall with hybrid Elasticsearch and vector search, the article shows how inserting a reranker—either Cohere's cloud API or a local Cross‑Encoder—compresses the top‑20 candidates to the most relevant three to five, dramatically improving answer accuracy, cutting token costs, and detailing a dual‑track implementation for production and development environments.

CohereCross-EncoderLangChain

0 likes · 22 min read

Boosting RAG Retrieval Quality with Cohere Rerank and Cross‑Encoder

Su San Talks Tech

May 15, 2026 · Artificial Intelligence

Understanding Rerank in Retrieval‑Augmented Generation (RAG)

The article explains why a reranking step is essential in RAG pipelines, describes how it refines the initial vector‑search results, compares mainstream rerank techniques, discusses practical engineering choices such as candidate set size and model selection, and outlines how to evaluate and tune rerank performance.

Cross-EncoderLLMModel selection

0 likes · 15 min read

Understanding Rerank in Retrieval‑Augmented Generation (RAG)

AI Engineer Programming

May 14, 2026 · Artificial Intelligence

RAG Retrieval: Comparing Bi-encoder and Cross-encoder Architectures

The article reviews the three‑step RAG pipeline, explains why retrieval quality hinges on fast, accurate semantic matching, contrasts Bi-encoder’s offline vector indexing and speed with Cross-encoder’s token‑level interaction and higher precision, and discusses hybrid solutions such as ColBERT and LLM rerankers with practical engineering guidelines.

Bi-EncoderColBERTCross-Encoder

0 likes · 10 min read

RAG Retrieval: Comparing Bi-encoder and Cross-encoder Architectures

IT Services Circle

Apr 6, 2026 · Artificial Intelligence

Mastering RAG Interview Questions: A Complete Retrieval Optimization Blueprint

This article breaks down the full RAG retrieval pipeline—from query understanding and rewriting, through hybrid retrieval and reranking, to chunking, context compression, and dynamic routing—providing concrete techniques, formulas, and performance metrics to help candidates ace interview questions on RAG systems.

Context CompressionCross-EncoderHard Negative Mining

0 likes · 16 min read

Mastering RAG Interview Questions: A Complete Retrieval Optimization Blueprint

Wu Shixiong's Large Model Academy

Apr 6, 2026 · Artificial Intelligence

Why Rerank Beats Simple Retrieval in RAG: Practical Tips & Code

This article explains the limitations of Bi‑Encoder retrieval, introduces Cross‑Encoder rerankers, shows how a cascade of recall‑rerank‑generation improves answer quality, and provides concrete code, threshold‑filtering strategies, and domain‑specific fine‑tuning techniques for industrial RAG systems.

AI RetrievalBi-EncoderCross-Encoder

0 likes · 20 min read

Why Rerank Beats Simple Retrieval in RAG: Practical Tips & Code

Wu Shixiong's Large Model Academy

Mar 21, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Implementing a Hybrid Retrieval Function with RRF Fusion

This article breaks down the end‑to‑end retrieval function used in a RAG system, detailing each of the five stages—from request construction, hybrid vector + BM25 search, RRF fusion, cross‑encoder reranking, to threshold filtering—and provides concrete Python code, parameter choices, and performance insights.

Cross-EncoderElasticsearchHybrid Retrieval

0 likes · 13 min read

Step‑by‑Step Guide to Implementing a Hybrid Retrieval Function with RRF Fusion

Data STUDIO

Sep 28, 2025 · Artificial Intelligence

Top Reranker Models for RAG in 2025: A Comparative Review

This article explains why initial retrieval in Retrieval‑Augmented Generation often yields noisy results, describes how rerankers act as quality filters to improve relevance, compares the leading 2025 reranker models—including Cohere, bge‑reranker, Voyage, Jina, FlashRank, and MixedBread—and provides code snippets, evaluation metrics, and guidance for selecting the right model for specific use cases.

AICross-EncoderLLM

0 likes · 31 min read

Top Reranker Models for RAG in 2025: A Comparative Review