Tagged articles

148 articles

Page 1 of 2

May 28, 2026 · Artificial Intelligence

AgenticRAG Delivers 5.9× Recall Boost in Enterprise Retrieval – Real‑World Pre‑Production Results

The article analyzes Microsoft’s AgenticRAG, a tool‑based RAG framework that lets LLMs control retrieval, showing up to a 5.9× recall improvement over standard methods, reduced need for fine‑tuning, and practical design insights from pre‑production deployment.

AgenticRAGClaudeGPT-5-mini

0 likes · 12 min read

AgenticRAG Delivers 5.9× Recall Boost in Enterprise Retrieval – Real‑World Pre‑Production Results

DataFunTalk

May 24, 2026 · Artificial Intelligence

Engineering and Algorithm Innovations for RAG Engines in Office Scenarios

The article analyzes the challenges of deploying large language models in enterprise settings and presents a modular Retrieval‑Augmented Generation (RAG) solution that combines document parsing, multi‑turn query rewriting, hybrid vector‑plus‑BM25 retrieval, two‑stage ranking (RRF, ColBERT, cross‑encoder) and knowledge‑filtered prompt engineering to achieve more comprehensive search, better ranking and more accurate answers.

Document ParsingHybrid RetrievalKnowledge Filtering

0 likes · 22 min read

Engineering and Algorithm Innovations for RAG Engines in Office Scenarios

Tencent Tech

May 20, 2026 · Artificial Intelligence

The Three Evolutions of AI Engineering: Prompt, Context, and Harness

This article analyzes the progressive stages of AI‑driven software engineering—Prompt Engineering, Context Engineering, and Harness Engineering—illustrating how each addresses specific challenges, presenting real‑world experiments from OpenAI and Anthropic, and outlining a roadmap for engineers to master the new paradigm.

AI agentsContext EngineeringHarness Engineering

0 likes · 19 min read

The Three Evolutions of AI Engineering: Prompt, Context, and Harness

SuanNi

May 20, 2026 · Artificial Intelligence

AI‑Powered Research Workflow: When to Trust the Tools and When to Supervise

The article surveys AI‑assisted research across the full lifecycle—creation, writing, validation, and dissemination—detailing the capabilities of prompt engineering, retrieval‑augmented generation, training‑free agents and hybrid methods, reporting benchmark numbers, failure modes, and governance challenges that dictate when human oversight remains essential.

AI research automationGovernanceRetrieval-Augmented Generation

0 likes · 17 min read

AI‑Powered Research Workflow: When to Trust the Tools and When to Supervise

Tech Minimalism

May 16, 2026 · Artificial Intelligence

One‑page guide to the three RAG architectures: Classic, Graph, and Agentic

The article explains why plain large language models cannot answer internal company questions, introduces Retrieval‑Augmented Generation (RAG) as a solution, and compares three RAG variants—Classic, Graph, and Agentic—detailing their workflows, strengths, limitations, and how to choose the right one for a given problem.

Agentic RAGClassic RAGGraph RAG

0 likes · 17 min read

One‑page guide to the three RAG architectures: Classic, Graph, and Agentic

Lao Guo's Learning Space

May 12, 2026 · Artificial Intelligence

Demystifying the Core Technologies Behind ChatGPT, GPT‑4, and DeepSeek

This article breaks down the key algorithms that power large‑language models—Transformer, Mixture‑of‑Experts, Flash Attention, KV‑Cache, Multi‑Token Prediction, quantization, Chain‑of‑Thought and Retrieval‑Augmented Generation—explaining how each contributes to the performance of ChatGPT, GPT‑4 and DeepSeek.

Flash AttentionKV CacheMixture of Experts

0 likes · 10 min read

Demystifying the Core Technologies Behind ChatGPT, GPT‑4, and DeepSeek

James' Growth Diary

May 12, 2026 · Artificial Intelligence

GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs

This article explains why traditional vector RAG loses relational information, how GraphRAG reconstructs entity‑relationship triples into a knowledge graph, and provides step‑by‑step code, performance benchmarks, retrieval modes, and practical tips that raise multi‑hop reasoning accuracy from around 50% to 85%.

GraphRAGLangChainMulti-hop reasoning

0 likes · 14 min read

GraphRAG Deep Dive: Boost Multi‑Hop Reasoning Accuracy from 50% to 85% with Knowledge Graphs

James' Growth Diary

May 9, 2026 · Artificial Intelligence

Agentic RAG Deep Dive: Letting the Agent Decide When and How Often to Retrieve

The article analyzes the shortcomings of traditional one‑shot RAG pipelines, introduces four Agentic RAG patterns that let an LLM‑driven agent control retrieval strategy, source selection, query rewriting and retry limits, and provides concrete TypeScript implementations with LangGraph, code snippets, and practical pitfalls.

Agentic RAGLLMLangGraph

0 likes · 16 min read

Agentic RAG Deep Dive: Letting the Agent Decide When and How Often to Retrieve

Lao Guo's Learning Space

May 6, 2026 · Artificial Intelligence

Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide

This article examines why Retrieval‑Augmented Generation systems that work in demos often fail in production, detailing common pitfalls—from chunking and vector‑database selection to hybrid retrieval and re‑ranking—and offers concrete strategies, configuration tips, and a decision tree to build reliable enterprise‑grade RAG solutions.

ChunkingHybrid RetrievalRAG

0 likes · 12 min read

Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide

DataFunSummit

May 4, 2026 · Artificial Intelligence

Inside Alibaba Cloud AI Search: Agentic RAG Architecture and Multi‑Agent Techniques

Alibaba Cloud AI Search tackles high‑concurrency, multimodal, and multi‑hop queries by evolving its Agentic RAG architecture from a single agent to a coordinated multi‑agent system that integrates planning, retrieval, and generation, leverages hybrid vector‑text‑DB‑graph recall, GPU‑accelerated indexing, quantization, NL2SQL, and multimodal search, with performance data and real‑world case studies.

AI SearchAgentic RAGAlibaba Cloud

0 likes · 6 min read

Inside Alibaba Cloud AI Search: Agentic RAG Architecture and Multi‑Agent Techniques

DataFunTalk

May 4, 2026 · Artificial Intelligence

Engineering and Algorithm Innovations for RAG Engines in Office Applications

This article analyzes the challenges and practical solutions of building a Retrieval‑Augmented Generation (RAG) system for office scenarios, covering background issues, modular architecture, offline and online pipelines, hybrid retrieval, ranking models, knowledge filtering, prompt design, and two‑stage generation techniques.

AIDocument ParsingHybrid Retrieval

0 likes · 22 min read

Engineering and Algorithm Innovations for RAG Engines in Office Applications

DataFunSummit

May 3, 2026 · Artificial Intelligence

From Flawed to Production-Ready: Deep Dive into Building Enterprise-Grade RAG Systems

The article analyzes why early RAG deployments often fall short, dissects the most common technical pain points—from document parsing to vector overload—and presents a systematic roadmap that includes hybrid search, reranking, GraphRAG, Agentic RAG, model selection, scalability tricks, and security controls for robust B‑side production.

Agentic RAGGraphRAGHybrid Search

0 likes · 20 min read

From Flawed to Production-Ready: Deep Dive into Building Enterprise-Grade RAG Systems

AI Engineer Programming

May 3, 2026 · Artificial Intelligence

From Single Retrieval to Autonomous Reasoning: Understanding Agentic RAG

The article analyzes why traditional Retrieval‑Augmented Generation fails on multi‑hop, vague, or multi‑source queries and explains how Agentic RAG uses an LLM‑driven agent loop to make dynamic retrieval decisions, outlining its architecture, suitable scenarios, and limitations.

AI reasoningAgentic RAGLLM agents

0 likes · 7 min read

From Single Retrieval to Autonomous Reasoning: Understanding Agentic RAG

Spring Full-Stack Practical Cases

May 3, 2026 · Artificial Intelligence

9 Advanced Retrieval‑Augmented Generation (RAG) Architectures Explained

This article introduces Retrieval‑Augmented Generation (RAG) and systematically details nine distinct RAG architectures—standard, conversational with memory, corrective (CRAG), adaptive, self‑RAG, fusion, HyDE, agentic, and Graph RAG—highlighting their workflows, real‑world examples, advantages, and trade‑offs.

AI ArchitectureGraphRAGLLM

0 likes · 17 min read

9 Advanced Retrieval‑Augmented Generation (RAG) Architectures Explained

MaGe Linux Operations

Apr 28, 2026 · Artificial Intelligence

Why Your RAG Performance Is Poor: Common Issues and Optimization Strategies

This article systematically analyzes why Retrieval‑Augmented Generation pipelines often underperform—covering embedding model selection, chunking strategies, hybrid retrieval, reranking, context window waste, evaluation metrics, and a detailed troubleshooting checklist—while providing concrete code examples and best‑practice recommendations for engineers.

ChunkingEmbeddingHybrid Retrieval

0 likes · 19 min read

Why Your RAG Performance Is Poor: Common Issues and Optimization Strategies

PMTalk Product Manager Community

Apr 28, 2026 · Artificial Intelligence

First Principle for Agent Product Managers: Choosing Between Single Agent, Multi‑Agent Collaboration, and Workflow

The article presents a decision framework for AI product managers, mapping workflow determinism and context certainty to four technical patterns—traditional RPA + AI, single Agent + RAG/knowledge graph, end‑to‑end RL Agent, and multi‑Agent collaboration—each with concrete use‑case examples and selection guidelines.

AI agentsRPARetrieval-Augmented Generation

0 likes · 6 min read

First Principle for Agent Product Managers: Choosing Between Single Agent, Multi‑Agent Collaboration, and Workflow

AI Illustrated Series

Apr 27, 2026 · Artificial Intelligence

Comprehensive RAG Interview Q&A: 22 In-Depth Questions and Answers

This extensive interview guide covers 22 core RAG questions, detailing the definition, workflow, embedding selection, vector database choices, retrieval optimization, multi‑turn handling, context compression, evaluation metrics, knowledge‑graph integration, operational challenges, Agentic and hybrid RAG, document update strategies, similarity algorithms, and hallucination mitigation, providing concrete examples and practical advice for AI interview preparation.

AI InterviewEmbeddingKnowledge retrieval

0 likes · 29 min read

Comprehensive RAG Interview Q&A: 22 In-Depth Questions and Answers

DataFunTalk

Apr 26, 2026 · Artificial Intelligence

Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Best Practices

This article analyses the practical construction of an enterprise‑level Retrieval‑Augmented Generation (RAG) 2.0 system, covering background issues of large models, a modular architecture, layered offline/online pipelines, hybrid retrieval, ranking strategies, prompt engineering, and deployment insights drawn from China Mobile’s production experience.

Hybrid RetrievalRAGRanking Models

0 likes · 22 min read

Building an Enterprise‑Grade RAG 2.0 System: Architecture, Challenges, and Best Practices

DataFunSummit

Apr 22, 2026 · Artificial Intelligence

From Flawed RAG to Production‑Ready: Deep Dive into Scaling Retrieval‑Augmented Generation

This expert roundtable dissects why RAG often fails in production—low recall, hallucinations, cost overruns—and walks through concrete diagnostics, hybrid search designs, knowledge‑engineering tricks, GraphRAG and Agentic RAG advances, plus practical deployment, security, and cost‑optimization guidelines.

AI deploymentAgentic RAGHybrid Search

0 likes · 20 min read

From Flawed RAG to Production‑Ready: Deep Dive into Scaling Retrieval‑Augmented Generation

MeowKitty Programming

Apr 21, 2026 · Backend Development

2026 AI Priorities for Java Developers: Structured Output, RAG, and Observability

While many Java teams chase flashy AI demos and agents, the real 2026 focus has shifted to engineering concerns—ensuring model outputs reliably map to Java objects, integrating Retrieval‑Augmented Generation into robust data pipelines, and adding observability so AI services can be monitored and debugged like traditional back‑end components.

AILangChain4jObservability

0 likes · 7 min read

2026 AI Priorities for Java Developers: Structured Output, RAG, and Observability

AI Architect Hub

Apr 20, 2026 · Artificial Intelligence

Why LLMs Need RAG: Overcoming Core Limitations and Building Scalable AI Solutions

This article analyzes the fundamental shortcomings of large language models for enterprise use, explains how Retrieval‑Augmented Generation (RAG) bridges those gaps through a detailed offline‑online workflow, and explores emerging trends that will shape the next generation of intelligent AI architectures.

AI ArchitectureFuture AILLM

0 likes · 10 min read

Why LLMs Need RAG: Overcoming Core Limitations and Building Scalable AI Solutions

Big Data and Microservices

Apr 20, 2026 · Artificial Intelligence

Why AI Hallucinates and How RAG Turns It into an Open‑Book Test

The article explains why large language models often fabricate facts, introduces Retrieval‑Augmented Generation (RAG) as a way to ground responses with external data, walks through its four‑step workflow, showcases practical use cases, and highlights the limitations and best practices for deploying RAG.

AIKnowledge BaseLLM

0 likes · 12 min read

Why AI Hallucinates and How RAG Turns It into an Open‑Book Test

AI Architect Hub

Apr 19, 2026 · Artificial Intelligence

Mastering RAG: From Data Cleaning to Vector DBs in AI Applications

This article introduces the second stage of a large‑model application series, detailing the value of Retrieval‑Augmented Generation (RAG), its architecture, and a step‑by‑step outline covering data cleaning, text chunking, vectorization, vector‑DB selection, recall strategies, reranking, and prompt construction.

AIData cleaningLLM

0 likes · 4 min read

Mastering RAG: From Data Cleaning to Vector DBs in AI Applications

Su San Talks Tech

Apr 19, 2026 · Artificial Intelligence

Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank

To make Retrieval‑Augmented Generation reliable in production, the article outlines five key engineering tactics—semantic chunking with metadata, hybrid vector‑keyword search, two‑stage retrieval with reranking, query rewriting and expansion, and dynamic result evaluation—each illustrated with concrete examples and code snippets.

AI EngineeringHybrid SearchQuery Rewriting

0 likes · 10 min read

Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank

Machine Learning Algorithms & Natural Language Processing

Apr 17, 2026 · Artificial Intelligence

When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)

The article explains why conventional Retrieval‑Augmented Generation often produces incorrect answers despite retrieving relevant documents, introduces the Disco‑RAG framework that adds a structured reading step using argument trees and relation graphs, and shows how this three‑step approach dramatically improves performance on long‑document and ambiguous‑question benchmarks without any model training.

Disco-RAGRAGRetrieval-Augmented Generation

0 likes · 13 min read

When RAG Retrieves the Right Docs but Still Answers Wrong: Insights from Saarland University (ACL 2026)

DataFunSummit

Apr 17, 2026 · Artificial Intelligence

Why RAG Projects Fail: Real‑World Pitfalls and Proven Solutions

This article dissects the hype‑versus‑reality gap of Retrieval‑Augmented Generation in enterprises, exposing low recall, hallucinations, and cost overruns, then offers a systematic diagnosis, hybrid search, reranking, security controls, and advanced GraphRAG and Agentic RAG strategies to achieve reliable production deployments.

Best PracticesLLMRAG

0 likes · 17 min read

Why RAG Projects Fail: Real‑World Pitfalls and Proven Solutions

DataFunTalk

Apr 15, 2026 · Artificial Intelligence

Building a Production‑Ready RAG System for Enterprise Knowledge Work

This article analyzes the challenges and practical solutions of deploying Retrieval‑Augmented Generation (RAG) in an enterprise office setting, covering background problems, modular architecture, offline and online pipelines, hybrid retrieval, multi‑stage ranking, knowledge filtering, prompt engineering, and model selection to achieve accurate, reliable answers.

Hybrid RetrievalRAGRanking Models

0 likes · 21 min read

Building a Production‑Ready RAG System for Enterprise Knowledge Work

IT Services Circle

Apr 14, 2026 · Artificial Intelligence

What Is RAG? A Complete Guide to Retrieval‑Augmented Generation for AI Engineers

This article explains Retrieval‑Augmented Generation (RAG), covering why large language models need external knowledge, the full offline‑and‑online workflow, document chunking, embedding evolution, vector database choices, multi‑path retrieval, evaluation metrics, hallucination types, and practical strategies to mitigate them.

AI evaluationEmbeddingRAG

0 likes · 55 min read

What Is RAG? A Complete Guide to Retrieval‑Augmented Generation for AI Engineers

Spring Full-Stack Practical Cases

Apr 11, 2026 · Artificial Intelligence

Master AI Fundamentals: Tokens, Context Windows, Temperature, Hallucinations & RAG

This article breaks down five essential AI concepts—tokens, context windows, temperature settings, hallucinations, and retrieval‑augmented generation—explaining how they work, why they matter, and how to apply them effectively when building or using large language model applications.

AI fundamentalsRetrieval-Augmented GenerationTemperature

0 likes · 12 min read

Master AI Fundamentals: Tokens, Context Windows, Temperature, Hallucinations & RAG

Bighead's Algorithm Notes

Apr 7, 2026 · Artificial Intelligence

AutoHypo-Fin: Tsinghua's Web-Mining Method to Auto-Generate and Backtest Market Hypotheses

AutoHypo‑Fin is an end‑to‑end framework that harvests large‑scale web financial data, extracts entities via large language models, builds a temporal knowledge graph, uses retrieval‑augmented generation and statistical backtesting to automatically create, test, and iteratively optimize trading hypotheses, achieving superior risk‑adjusted returns compared with baseline strategies in experiments from 2019‑2024.

AutoHypo-FinLLMQuantitative Finance

0 likes · 11 min read

AutoHypo-Fin: Tsinghua's Web-Mining Method to Auto-Generate and Backtest Market Hypotheses

IT Services Circle

Apr 6, 2026 · Artificial Intelligence

Mastering RAG Interview Questions: A Complete Retrieval Optimization Blueprint

This article breaks down the full RAG retrieval pipeline—from query understanding and rewriting, through hybrid retrieval and reranking, to chunking, context compression, and dynamic routing—providing concrete techniques, formulas, and performance metrics to help candidates ace interview questions on RAG systems.

Context CompressionCross-EncoderHard Negative Mining

0 likes · 16 min read

Mastering RAG Interview Questions: A Complete Retrieval Optimization Blueprint

DataFunSummit

Apr 1, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article analyzes why Retrieval‑Augmented Generation (RAG) often underperforms in enterprise production, identifies eight common pitfalls—from document parsing to token costs—and offers a systematic roadmap of diagnostics, hybrid search, reranking, and deployment strategies presented by leading AI experts.

AIBest PracticesRAG

0 likes · 18 min read

Why RAG Fails in Production and How to Fix It: Expert Insights

AI Step-by-Step

Mar 29, 2026 · Artificial Intelligence

How RAG Quickly Gives Your Agent Real Business Knowledge

The article explains why agents often lack business understanding, describes Retrieval‑Augmented Generation (RAG) as the fastest way to provide correct, up‑to‑date business context, outlines eight practical RAG patterns, and offers a step‑by‑step checklist for building enterprise‑ready agents.

AgentGraphRAGKnowledge retrieval

0 likes · 10 min read

How RAG Quickly Gives Your Agent Real Business Knowledge

Data Party THU

Mar 23, 2026 · Artificial Intelligence

Boosting RAG Performance: Query Translation & Decomposition Techniques

The article explains two emerging RAG query‑optimization approaches—query translation and query decomposition—detailing fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and shows how combining them can improve relevance and latency in LLM‑augmented systems.

LLMRAGRetrieval-Augmented Generation

0 likes · 9 min read

Boosting RAG Performance: Query Translation & Decomposition Techniques

Woodpecker Software Testing

Mar 22, 2026 · Artificial Intelligence

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

This article explains why traditional API, assertion, and UI testing fail for Retrieval‑Augmented Generation (RAG) systems, and presents a four‑step, evidence‑driven testing framework—including golden test sets, dual‑track validation, chaos engineering, and continuous trust dashboards—to ensure factual reliability and operational robustness in real‑world deployments.

Fact CheckingLLMOpenTelemetry

0 likes · 8 min read

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

Data Party THU

Mar 21, 2026 · Artificial Intelligence

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

The article explains that expanding LLM context windows leads to attention dilution and retrieval collapse, degrading answer quality, and argues that Retrieval‑Augmented Generation remains essential because it preserves signal density through focused retrieval and selective prompting.

AI ArchitectureAttention DilutionLLM

0 likes · 8 min read

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

PaperAgent

Mar 19, 2026 · Artificial Intelligence

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

The article presents the MDER‑DR two‑stage framework that tackles semantic loss in knowledge‑graph triple indexing by generating context‑aware entity summaries and using an LLM‑driven decompose‑parse retrieval loop, achieving up to 66% performance gains on multi‑hop question answering benchmarks.

Entity SummarizationKG QALLM

0 likes · 5 min read

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

Tech Freedom Circle

Mar 19, 2026 · Artificial Intelligence

Failed Alibaba Interview: The 4 RAG Modules and 6 Design Principles You Need

The article dissects a failed Alibaba second‑round interview where the candidate answered only “vector‑search‑enhanced” for a RAG design, and then presents a systematic, four‑module RAG architecture together with six design principles, detailed indexing, query understanding, multi‑path recall, and context generation techniques to help candidates demonstrate comprehensive technical depth.

AI ArchitectureDesign PrinciplesMulti‑Path Recall

0 likes · 22 min read

Failed Alibaba Interview: The 4 RAG Modules and 6 Design Principles You Need

Woodpecker Software Testing

Mar 17, 2026 · Artificial Intelligence

Why Direct Prompting Beats LLM Knowledge‑Base Agents for Test‑Case Generation

The article explains that feeding requirements directly in the prompt yields far better test‑case designs than using an LLM‑powered knowledge‑base agent, because the model processes the full context without the loss and fragmentation introduced by retrieval‑augmented generation.

AI agentsKnowledge BaseLLM

0 likes · 6 min read

Why Direct Prompting Beats LLM Knowledge‑Base Agents for Test‑Case Generation

Data STUDIO

Mar 9, 2026 · Artificial Intelligence

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

This article dissects why naive Retrieval‑Augmented Generation (RAG) often yields only 60% accuracy, then presents eleven concrete ingestion, query, and hybrid techniques—complete with code samples, performance trade‑offs, and real‑world case studies—that together can raise RAG accuracy to 94% while outlining practical implementation roadmaps and common pitfalls.

EmbeddingLLMRAG

0 likes · 31 min read

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

DataFunSummit

Feb 25, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article summarizes a DataFun‑hosted roundtable where leading AI experts dissect the gap between RAG’s promise and real‑world deployment, exposing low recall, hallucinations, and cost overruns, then present systematic diagnostics, evaluation metrics, hybrid search, and engineering best practices to reliably operationalize RAG in enterprise settings.

Hybrid SearchLLMRAG

0 likes · 18 min read

Qborfy AI

Feb 18, 2026 · Artificial Intelligence

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

This article explains Retrieval‑Augmented Generation (RAG), detailing its offline knowledge‑base construction and online retrieval‑enhanced generation workflow, comparing it with traditional and fine‑tuned models, and providing step‑by‑step LangChain implementations, advanced techniques, and practical use‑case demos.

Hybrid SearchLangChainRAG

0 likes · 16 min read

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

DataFunTalk

Feb 11, 2026 · Artificial Intelligence

Why Most RAG Deployments Fail and How to Build a Production‑Ready RAG System

This round‑table dissects the gap between RAG’s hype and real‑world production, exposing common pitfalls such as low recall, hallucinations and cost overruns, and then delivers a systematic diagnostic framework, hybrid search strategies, fine‑tuning rules, and practical best‑practice roadmaps for building reliable enterprise RAG solutions.

Agentic RAGHybrid SearchLLM

0 likes · 20 min read

Why Most RAG Deployments Fail and How to Build a Production‑Ready RAG System

AI2ML AI to Machine Learning

Feb 7, 2026 · Artificial Intelligence

Why the ‘Skills’ Approach Is the Third Major Compromise Shaping Enterprise AI in 2026

The article argues that embracing the Skills paradigm— a lightweight, low‑cost alternative to large‑scale model training—represents the third major compromise in the large‑model era, balancing reduced emergence and planning hallucinations against increased stability and engineering efficiency for enterprise AI deployments.

Mixture of ExpertsRetrieval-Augmented GenerationSkills technology

0 likes · 8 min read

Why the ‘Skills’ Approach Is the Third Major Compromise Shaping Enterprise AI in 2026

Data STUDIO

Jan 27, 2026 · Artificial Intelligence

How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs

This article explains why large‑language‑model hallucinations are risky, introduces Retrieval‑Augmented Generation (RAG) as a remedy, and walks through nine Python‑based RAG architectures—standard, conversational, corrective, adaptive, fusion, HyDE, self‑RAG, agentic, and graph RAG—detailing their workflows, code examples, strengths, weaknesses, and a decision‑making map for selecting the right design.

AI HallucinationLangChainPython

0 likes · 29 min read

How Python RAG Architectures Can Tame Large‑Model Hallucinations: A Complete Guide to 9 Designs

PaperAgent

Jan 13, 2026 · Artificial Intelligence

How C2LLM Redefines Code Retrieval with Attention‑Based Pooling

Introducing C2LLM, a contrastive code LLM series that replaces mean and EOS pooling with a multi‑head attention pooling module, achieving top scores on the MTEB‑Code benchmark across 12 tasks and demonstrating cost‑effective, high‑precision code retrieval for both production and AI agent applications.

Large Language ModelMTEB-CodeRetrieval-Augmented Generation

0 likes · 8 min read

How C2LLM Redefines Code Retrieval with Attention‑Based Pooling

Sohu Tech Products

Jan 7, 2026 · Artificial Intelligence

Master Retrieval-Augmented Generation (RAG): Concepts, Benefits, Implementation

This article explains Retrieval‑Augmented Generation (RAG), its dual‑stage architecture that combines parametric LLM knowledge with external non‑parametric data, outlines its technical evolution, discusses why it outperforms pure LLMs, and provides a step‑by‑step guide with toolchain choices, evaluation metrics, and future challenges.

AIKnowledge BaseLLM

0 likes · 14 min read

Master Retrieval-Augmented Generation (RAG): Concepts, Benefits, Implementation

PaperAgent

Jan 5, 2026 · Artificial Intelligence

How QuCo‑RAG Replaces Model Confidence with Objective Evidence to Cut Hallucinations

QuCo‑RAG introduces a dynamic retrieval‑augmented generation framework that quantifies uncertainty using pre‑training corpus statistics, replacing unreliable model confidence with objective frequency and co‑occurrence evidence, achieving millisecond‑level hallucination detection, superior multi‑hop QA performance, and cross‑model transferability across various LLMs.

Dynamic RetrievalLLMRetrieval-Augmented Generation

0 likes · 9 min read

How QuCo‑RAG Replaces Model Confidence with Objective Evidence to Cut Hallucinations

Mingyi World Elasticsearch

Dec 28, 2025 · Artificial Intelligence

Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough

This article walks through the principles of Retrieval‑Augmented Generation (RAG) and provides a complete Python implementation using Elasticsearch, covering document chunking, semantic embedding, bulk indexing, hybrid BM25‑vector search, RRF result fusion, prompt design, LLM invocation, and a practical demo.

ElasticsearchHybrid SearchPython

0 likes · 9 min read

Building an Elasticsearch‑Powered RAG Q&A System: Theory and Full Code Walkthrough

PaperAgent

Dec 12, 2025 · Artificial Intelligence

How BookRAG Redefines Long-Document Retrieval with Hierarchical Indexing

BookRAG introduces a hierarchical, structure‑aware indexing method that combines tree‑based document representation with graph‑based entity linking and an agent‑driven retrieval pipeline, achieving up to 71.2% recall improvement on multimodal long‑document benchmarks while cutting token usage and latency dramatically.

Agent RetrievalHierarchical IndexingLLM

0 likes · 7 min read

How BookRAG Redefines Long-Document Retrieval with Hierarchical Indexing

Open Source Tech Hub

Dec 5, 2025 · Artificial Intelligence

From Neurons to GPT: A Complete Timeline of AI Evolution and Future Trends

This comprehensive article traces AI from its biological roots and early computers through the birth of artificial intelligence, the rise of machine learning, the emergence of large language models, multimodal agents, and finally explores current breakthroughs, practical applications, and future directions.

Artificial IntelligenceMachine LearningRetrieval-Augmented Generation

0 likes · 39 min read

From Neurons to GPT: A Complete Timeline of AI Evolution and Future Trends

Architect's Guide

Nov 24, 2025 · Artificial Intelligence

Building Java LLM Applications with LangChain4j: A Hands‑On Guide

This tutorial walks through the fundamentals of large language models, prompt engineering, and word embeddings, then shows how to set up a LangChain‑based LLM stack in Java using LangChain4j, covering core modules, memory, retrieval, chains, agents, and complete code examples.

AI agentsJavaLLM

0 likes · 15 min read

Building Java LLM Applications with LangChain4j: A Hands‑On Guide

JD Tech Talk

Nov 21, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Retrieval‑Augmented Generation

This article explains why effective chunking is crucial for RAG performance, compares seven major chunking strategies—including fixed‑size, semantic, recursive, document‑structure, agent‑driven, sentence, and paragraph methods—and offers practical guidance on selecting and optimizing chunks for real‑world AI applications.

AIChunkingRAG

0 likes · 10 min read

Mastering Chunking Strategies for Retrieval‑Augmented Generation

Wu Shixiong's Large Model Academy

Nov 20, 2025 · Artificial Intelligence

How to Build a Quantifiable Data Quality Framework for Dynamic Incremental RAG

This article explains why static RAG metrics don’t apply to dynamic pipelines, introduces five essential dimensions—Parseability, Deduplication, Relevance, Chunk Quality, and Freshness—and shows how to combine them into a weighted score that enables monitoring, alerts, and continuous improvement of dynamic RAG systems.

Dynamic RAGMetricsMonitoring

0 likes · 10 min read

How to Build a Quantifiable Data Quality Framework for Dynamic Incremental RAG

Data Thinking Notes

Nov 16, 2025 · Artificial Intelligence

How AI Agents Transform Automation: Architecture, Challenges & Future Trends

This comprehensive overview examines AI agents powered by large language models, detailing their definition, core components, architectural patterns, key technologies such as prompt engineering and retrieval‑augmented generation, diverse application domains, current challenges, security solutions, and emerging research directions.

Retrieval-Augmented Generationarchitecturelarge language models

0 likes · 81 min read

How AI Agents Transform Automation: Architecture, Challenges & Future Trends

Alibaba Cloud Big Data AI Platform

Nov 4, 2025 · Artificial Intelligence

How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025

EMNLP 2025 in Suzhou will feature Alibaba Cloud’s AI platform PAI presenting four accepted papers on knowledge distillation, small‑model reasoning, distilled reasoning models, and an automated RAG benchmark framework, alongside exhibition demos, networking events, and recruitment opportunities for AI talent.

AI PlatformEMNLP 2025Knowledge Distillation

0 likes · 10 min read

How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025

Wu Shixiong's Large Model Academy

Nov 2, 2025 · Artificial Intelligence

Why Document Parsing Is the Real Bottleneck in RAG Projects (And How to Fix It)

The article explains that in Retrieval‑Augmented Generation projects the hardest challenge lies in robust document parsing—handling PDFs, PPTs, scanned contracts, OCR errors, and preserving structure—to ensure high‑quality retrieval and avoid hallucinations.

AIOCRRAG

0 likes · 10 min read

Why Document Parsing Is the Real Bottleneck in RAG Projects (And How to Fix It)

DataFunSummit

Oct 30, 2025 · Artificial Intelligence

How Multimodal Large Models Are Revolutionizing Document Processing and OCR

This article explores how the explosion of unstructured data exposes the limits of traditional OCR and shows how emerging multimodal large language models provide end‑to‑end document understanding, reduce pipeline complexity, cut training costs, enable hybrid retrieval‑augmented generation, and drive real‑world industry deployments.

AIDocument ProcessingLarge Language Model

0 likes · 28 min read

How Multimodal Large Models Are Revolutionizing Document Processing and OCR

Xuanwu Backend Tech Stack

Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

Artificial IntelligenceRAGRerank

0 likes · 10 min read

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

JD Tech Talk

Oct 21, 2025 · Backend Development

How Backend Engineers Are Breaking Through AI with RAG Architectures

This article details a backend developer's two‑year AI journey, the challenges of rapid model advances, and how applying microservice principles to Retrieval‑Augmented Generation (RAG) creates a scalable, multi‑agent platform for insurance knowledge, memory, and intelligent agents.

Backend AIKnowledge BaseRAG

0 likes · 11 min read

How Backend Engineers Are Breaking Through AI with RAG Architectures

AI Large Model Application Practice

Oct 13, 2025 · Artificial Intelligence

How to Tame LLM Agents: Proven Strategies to Reduce Uncertainty and Boost Reliability

This article outlines practical techniques—including prompt engineering, domain fine‑tuning, retrieval‑augmented generation, structured outputs, workflow constraints, model parameter control, behavior rules, risk‑based AI participation, and comprehensive governance—to curb the unpredictability of large language model agents in enterprise settings.

AI agentAI governanceLLM

0 likes · 18 min read

How to Tame LLM Agents: Proven Strategies to Reduce Uncertainty and Boost Reliability

DataFunSummit

Oct 9, 2025 · Artificial Intelligence

Why AI Coding Agents Still Struggle: Context Limits, Knowledge Gaps, and the Road to Human‑Like Assistants

This talk examines the core challenges facing AI coding agents—limited context windows, knowledge accumulation, and software‑engineering complexity—while outlining practical solutions such as context providing, RAG, fine‑tuning, online learning, feedback loops, and multi‑agent collaboration to move toward truly human‑like, continuously learning coding assistants.

AI codingCoding AgentOnline Learning

0 likes · 24 min read

Why AI Coding Agents Still Struggle: Context Limits, Knowledge Gaps, and the Road to Human‑Like Assistants

JD Cloud Developers

Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, covering its core workflow—from knowledge preparation, data cleaning, and metadata extraction to query preprocessing, vector retrieval, reranking, information integration, and final LLM generation, while also reviewing common embedding models and vector databases.

Artificial IntelligenceLLMRAG

0 likes · 13 min read

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

How RAGFlow’s Agent Engine Turns Retrieval into a Problem‑Solving AI

This article explains how RAGFlow upgrades a traditional RAG system from a passive question‑answer engine to an active problem‑solving agent by integrating the ReAct reasoning‑action‑observation loop, a visual canvas workflow, and a modular component‑tool ecosystem, with concrete Python implementations and code examples.

AI agentsPythonRAGFlow

0 likes · 16 min read

How RAGFlow’s Agent Engine Turns Retrieval into a Problem‑Solving AI

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

RAGFlow Primer Part 1: Introduction and Concept Deep Dive

This article provides a comprehensive technical overview of RAGFlow, an industrial‑grade Retrieval‑Augmented Generation platform, detailing its architecture, core components such as DeepDoc, intelligent chunking, embedding integration, multi‑stage retrieval, and agent workflow, while comparing it with traditional RAG shortcomings.

Agent workflowDeepDocIntelligent Chunking

0 likes · 32 min read

RAGFlow Primer Part 1: Introduction and Concept Deep Dive

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

Inside RAGFlow: How Its Microservice Architecture Powers an Enterprise‑Grade Retrieval‑Augmented Generation Platform

This article provides a detailed technical walkthrough of RAGFlow's architecture, covering its microservice design, directory layout, layered structure, cloud‑native deployment, core modules such as DeepDoc, RAG engine, Agent system, and web UI, as well as multi‑tenant isolation, streaming responses, asynchronous task handling, concurrency controls, scalability strategies, and a complete request‑lifecycle example for document upload.

AI ArchitectureDeepDocDocker-Compose

0 likes · 26 min read

Inside RAGFlow: How Its Microservice Architecture Powers an Enterprise‑Grade Retrieval‑Augmented Generation Platform

Huolala Tech

Sep 24, 2025 · Artificial Intelligence

How CID-GraphRAG Boosts Multi‑Turn AI Customer Service with Dual‑Layer Retrieval

The article introduces CID-GraphRAG, a novel framework that combines intent‑driven graphs with semantic similarity search to improve multi‑turn intelligent customer service, detailing its architecture, dual‑layer retrieval mechanism, evaluation against baseline models, and future research directions.

AICustomer ServiceDialogue Systems

0 likes · 14 min read

How CID-GraphRAG Boosts Multi‑Turn AI Customer Service with Dual‑Layer Retrieval

Data Thinking Notes

Sep 21, 2025 · Artificial Intelligence

From RAG to DeepSearch & DeepResearch: How AI Is Mastering Knowledge Retrieval

Amid the rapid rise of generative AI, this article examines the limitations of large language models and explains how Retrieval‑Augmented Generation (RAG), followed by the advanced paradigms DeepSearch and DeepResearch, progressively enhance knowledge handling through dynamic retrieval, multi‑agent reasoning, and autonomous research capabilities.

AI Knowledge ManagementDeepResearchDeepSearch

0 likes · 16 min read

From RAG to DeepSearch & DeepResearch: How AI Is Mastering Knowledge Retrieval

DataFunTalk

Sep 19, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, GraphRAG, and autonomous agents boost model intelligence, improve user experience, and enable advanced content generation, understanding, and multi‑step reasoning.

Artificial IntelligenceGraphRAGRetrieval-Augmented Generation

0 likes · 4 min read

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

Architecture & Thinking

Sep 12, 2025 · Artificial Intelligence

How Knowledge Graphs Turn Large Language Models into Trustworthy Experts

Integrating structured knowledge graphs with generative AI provides traceable, explainable, and high‑precision reasoning across domains such as medicine, finance, and law, through techniques like Retrieval‑Augmented Generation, graph neural networks, and adaptive planning, dramatically reducing hallucinations and boosting expert‑level performance.

AI HallucinationGraph Neural NetworkRetrieval-Augmented Generation

0 likes · 12 min read

How Knowledge Graphs Turn Large Language Models into Trustworthy Experts

Architects Research Society

Sep 10, 2025 · Artificial Intelligence

From Vectors to Graphs to Hybrids: The Evolution of AI Knowledge Representation

This article explores the three stages of AI knowledge representation—vector embeddings, graph‑based structures, and the emerging hybrid approach that combines vectors, graphs, and large language models—to illustrate how modern Retrieval‑Augmented Generation systems achieve both semantic similarity and precise relational reasoning.

AIRetrieval-Augmented Generationgraph databases

0 likes · 3 min read

From Vectors to Graphs to Hybrids: The Evolution of AI Knowledge Representation

Instant Consumer Technology Team

Sep 5, 2025 · Artificial Intelligence

Why Context Engineering Is the Next Frontier for Large Language Models

This article surveys over 1,400 papers to define context engineering as a systematic discipline that structures retrieval, memory, tools, and multi‑agent coordination for LLMs, highlighting the critical asymmetry between understanding long contexts and generating equally complex outputs.

Context EngineeringLLM evaluationMemory Management

0 likes · 8 min read

Why Context Engineering Is the Next Frontier for Large Language Models

Alibaba Cloud Developer

Sep 1, 2025 · Artificial Intelligence

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

This article delves into the implementation details and optimization strategies of Retrieval‑Augmented Generation (RAG), covering document chunking, index enhancement, embedding, hybrid search, and re‑ranking, and provides practical code examples to help developers move from quick deployment to deep performance tuning.

AIChunkingEmbedding

0 likes · 19 min read

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

Tencent Technical Engineering

Aug 29, 2025 · Artificial Intelligence

How Retrieval‑Augmented Generation Evolves into Autonomous AI Agents

This article examines the limitations of large language models' internal knowledge, explains how retrieval‑augmented generation (RAG) and tool‑augmented generation address these limits, and traces the evolution from simple retrieve‑then‑generate pipelines to autonomous, multi‑modal AI agents.

AI agentsLLMRAG

0 likes · 20 min read

How Retrieval‑Augmented Generation Evolves into Autonomous AI Agents

Tech Freedom Circle

Aug 26, 2025 · Artificial Intelligence

How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained

This article provides a step‑by‑step technical guide to optimizing Retrieval‑Augmented Generation (RAG) for interview scenarios, covering query rewriting, HyDE, fallback strategies, routing and prompt routing, multi‑representation indexing, hybrid retrieval, re‑ranking, self‑RAG, generation control, performance benchmarking, and a practical checklist with concrete code examples and metrics.

AI InterviewHybrid RetrievalIndex Optimization

0 likes · 30 min read

How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained

DaTaobao Tech

Aug 25, 2025 · Artificial Intelligence

Mastering RAG: From Quick Start to Deep Optimization Strategies

This article dives into the practical implementation of Retrieval‑Augmented Generation (RAG), covering document chunking, semantic and reverse HyDE indexing, embedding, hybrid search, and re‑ranking techniques, and provides concrete code examples and optimization tips for building high‑performance AI applications.

Artificial IntelligenceChunkingEmbedding

0 likes · 18 min read

Mastering RAG: From Quick Start to Deep Optimization Strategies

Volcano Engine Developer Services

Aug 21, 2025 · Artificial Intelligence

Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG

Since last year, the debate over “Prompt Engineering” has split between practitioners who favor “Context Engineering” for building scalable agent systems and scholars who treat Prompt Engineering as a broad umbrella term, highlighting the need to dynamically construct and manage context for reliable, extensible AI applications.

AI agentsLLMRAG

0 likes · 33 min read

Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG

Data Party THU

Aug 17, 2025 · Artificial Intelligence

Why Do Large Language Models Hallucinate? Unpacking the Probabilistic Roots and Fixes

Large language models often generate confident but false statements—a phenomenon called hallucination—because they predict the next token based on statistical patterns rather than factual understanding, and this article explains the underlying mechanisms and practical mitigation strategies.

Knowledge DistillationLLMRLHF

0 likes · 11 min read

Why Do Large Language Models Hallucinate? Unpacking the Probabilistic Roots and Fixes

AsiaInfo Technology: New Tech Exploration

Jul 30, 2025 · Artificial Intelligence

How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls

This article analyzes the prompt‑inflation bottleneck that arises when large language models (LLMs) must handle thousands of Model Context Protocol (MCP) services, and introduces the MCP‑RAG architecture—a retrieval‑augmented generation solution that builds a metadata knowledge base and intelligent retrieval layer to enable precise, efficient MCP service discovery at scale.

AILLMMCP

0 likes · 21 min read

How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls

JD Tech

Jul 29, 2025 · Artificial Intelligence

How Causal Inference Meets Large Language Models to Revolutionize E‑commerce Pricing

This article describes a QCon talk that combines causal inference with large language models to build a retrieval‑augmented generation pricing system for e‑commerce, detailing the three‑step algorithm, LLM‑driven modeling challenges, process‑reward tree search, reinforcement‑learning fine‑tuning, and experimental gains in accuracy and speed.

Retrieval-Augmented Generationcausal inferencee‑commerce pricing

0 likes · 17 min read

How Causal Inference Meets Large Language Models to Revolutionize E‑commerce Pricing

AI2ML AI to Machine Learning

Jul 24, 2025 · Artificial Intelligence

Exploring Recent Large‑Model Agent Papers: Insights and Analyses

This article reviews a series of recent research papers on large‑model agents, covering topics such as reinforcement‑learning‑driven ML agents, premise‑critique ability of LLMs, long‑term tool‑augmented LLM evaluation, agentic RAG, set‑based retrieval for multi‑hop QA, mobile VLM agents, and broader surveys of LLM applications, summarizing each work’s problem statement, prior approaches, novel contributions, experimental results, limitations, and future directions.

LLM evaluationRetrieval-Augmented Generationagentic AI

0 likes · 46 min read

Exploring Recent Large‑Model Agent Papers: Insights and Analyses

DataFunTalk

Jul 21, 2025 · Artificial Intelligence

From Prompt Engineering to Context Engineering: Transforming LLM Interactions

This article traces the evolution from prompt engineering to context engineering, detailing technical milestones, core concepts, practical strategies, and future trends that together reshape large language model applications and enable sophisticated AI agents across diverse domains.

Memory ManagementRetrieval-Augmented Generationlarge language models

0 likes · 35 min read

From Prompt Engineering to Context Engineering: Transforming LLM Interactions

Instant Consumer Technology Team

Jul 14, 2025 · Artificial Intelligence

9 Essential Technologies for Building Scalable AI Agents

An in‑depth guide outlines the nine core technologies—ranging from autonomous agent fundamentals and multi‑agent collaboration to workflow orchestration, retrieval‑augmented generation, fine‑tuning, function calling, model context protocols, agent‑to‑agent communication, and AI‑driven UI—required to design, deploy, and scale enterprise‑grade AI agents.

AI agentsAgent ArchitectureFunction Calling

0 likes · 9 min read

9 Essential Technologies for Building Scalable AI Agents

Architect's Alchemy Furnace

Jul 12, 2025 · Artificial Intelligence

Why GraphRAG Is the Future of Retrieval‑Augmented Generation

This article explains how GraphRAG combines knowledge graphs with retrieval‑augmented generation to overcome the limitations of vector‑only RAG, delivering higher accuracy, better explainability, easier development, and stronger governance for generative AI applications across various domains.

AIGraphRAGLLM

0 likes · 23 min read

Why GraphRAG Is the Future of Retrieval‑Augmented Generation

Fun with Large Models

Jul 4, 2025 · Artificial Intelligence

Boost Answer Accuracy with GraphRAG: End‑to‑End Microsoft GraphRAG Code Walkthrough

This article walks through the complete GraphRAG workflow—from environment setup and indexing to command‑line and Python API queries—demonstrating how to build a knowledge graph, tune prompts, and retrieve high‑quality answers using Microsoft’s GraphRAG implementation.

GraphRAGMicrosoftPrompt Tuning

0 likes · 25 min read

Boost Answer Accuracy with GraphRAG: End‑to‑End Microsoft GraphRAG Code Walkthrough

AI Algorithm Path

Jul 3, 2025 · Artificial Intelligence

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

This article examines how Retrieval‑Augmented Generation (RAG) has progressed from simple keyword‑based retrieval to advanced semantic methods, modular architectures, graph‑enhanced reasoning, and autonomous agentic systems, highlighting each approach's workflow, benefits, limitations, and the shift toward dynamic AI decision‑making.

AIAgentic RAGGraph RAG

0 likes · 7 min read

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

AIWalker

Jun 30, 2025 · Artificial Intelligence

Chinese Team Builds First AI That Understands Film, Using 440K Shot Library for Director‑Level Camera Moves

FilMaster is a pioneering AI system that learns cinematic principles from a 440,000‑shot movie database, combines multimodal LLMs, RAG, and audience‑centric rhythm control to generate editable, high‑quality films, and outperforms prior methods by over 50% on the new FilmEval benchmark.

AI film generationFilmEval benchmarkRetrieval-Augmented Generation

0 likes · 18 min read

Chinese Team Builds First AI That Understands Film, Using 440K Shot Library for Director‑Level Camera Moves

AI Algorithm Path

Jun 26, 2025 · Artificial Intelligence

The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System

This guide breaks down the ten core building blocks of a production‑ready RAG pipeline—from input handling and vector stores to prompt engineering, LLM inference, observability, and evaluation—showing why each piece matters, common pitfalls, and practical best‑practice recommendations.

LLMObservabilityRAG

0 likes · 9 min read

The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System

AI2ML AI to Machine Learning

Jun 6, 2025 · Artificial Intelligence

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

The article enumerates common pitfalls of Retrieval‑Augmented Generation—such as missing content, low‑rank document misses, context limits, format errors, incomplete answers, scalability bottlenecks, complex PDF extraction, data‑quality issues, domain adaptation gaps, hallucinations, and feedback‑loop deficiencies—and offers concrete mitigation strategies ranging from data cleaning and prompt design to hybrid search, hierarchical retrieval, document compression, and automated evaluation.

Hybrid SearchLLMRAG

0 likes · 9 min read

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

IT Services Circle

Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AIKnowledge retrievalRAG

0 likes · 12 min read

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

Instant Consumer Technology Team

Jun 5, 2025 · Artificial Intelligence

How DeepSeek AI Transforms Government Search with Smarter, Faster Answers

This article explains how DeepSeek's large‑model‑driven search system overcomes traditional keyword‑matching limits, improves long‑tail query coverage, and delivers personalized, accurate government service results through intent parsing, knowledge‑graph retrieval, and generative optimization.

Artificial IntelligenceRetrieval-Augmented Generationgovernment services

0 likes · 9 min read

How DeepSeek AI Transforms Government Search with Smarter, Faster Answers

Alibaba Cloud Developer

Jun 5, 2025 · Artificial Intelligence

How Deep (Re)Search Transforms Code Search and AI-Powered Knowledge Retrieval

This article systematically explains the concepts of Deep Search and Deep Research, contrasts them with traditional Retrieval‑Augmented Generation, reviews leading commercial and open‑source solutions, details their architecture for code retrieval, and outlines future plans for specialized code‑search agents.

AI researchCode searchKnowledge retrieval

0 likes · 13 min read

How Deep (Re)Search Transforms Code Search and AI-Powered Knowledge Retrieval

Satori Komeiji's Programming Classroom

Jun 3, 2025 · Artificial Intelligence

Everything You Need to Know About Retrieval‑Augmented Generation (RAG)

The article explains Retrieval‑Augmented Generation (RAG) by describing how a programmer, frustrated with oversized prompts for a large language model, discovers that retrieving relevant document fragments, embedding them, and feeding the augmented context to the model yields accurate, fact‑based answers.

AIChunkingEmbedding

0 likes · 6 min read

Everything You Need to Know About Retrieval‑Augmented Generation (RAG)

Architect

May 7, 2025 · Artificial Intelligence

RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval‑Augmented Generation

The article reviews the RAG-MCP framework, which combines Retrieval‑Augmented Generation with Model Context Protocol to reduce prompt bloat and improve tool‑selection accuracy for large language models by first retrieving the most relevant tools before feeding them to the LLM.

LLMPrompt BloatRAG-MCP

0 likes · 11 min read

RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval‑Augmented Generation

Architect

May 5, 2025 · Artificial Intelligence

How Agentic RAG‑R1 Turns Retrieval‑Augmented Generation into an Autonomous AI Agent

Agentic RAG‑R1, an open‑source project from Peking University, combines Retrieval‑Augmented Generation with an agentic AI loop, introduces the GRPO reinforcement‑learning optimizer, supports LoRA‑based fine‑tuning, quantization and multimodal tool calls, and demonstrates significant accuracy gains on the MedQA benchmark across both Chinese and English test sets.

LLM Tool UseOpen SourceRetrieval-Augmented Generation

0 likes · 8 min read

How Agentic RAG‑R1 Turns Retrieval‑Augmented Generation into an Autonomous AI Agent

DataFunSummit

Apr 21, 2025 · Artificial Intelligence

Deep Integration of Knowledge Graphs and Large Language Models: Methods, Applications, and Future Directions

This article explores how knowledge graphs can be tightly integrated with large language models through prompt engineering, fine‑tuning, retrieval‑augmented generation, reasoning collaboration, and knowledge agents, outlining technical pathways, practical implementations, and future research directions across AI domains.

AIRetrieval-Augmented Generationknowledge graph

0 likes · 23 min read

Deep Integration of Knowledge Graphs and Large Language Models: Methods, Applications, and Future Directions

Data Thinking Notes

Apr 13, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation Knowledge Base with DeepSeek and RAGFlow

This guide walks you through the fundamentals of Retrieval‑Augmented Generation, introduces the open‑source RAGFlow framework, details installation steps, shows how to integrate DeepSeek LLMs, and explores practical application scenarios such as intelligent customer service and enterprise document QA.

AIDeepSeekLLM

0 likes · 11 min read

How to Build a Retrieval‑Augmented Generation Knowledge Base with DeepSeek and RAGFlow

Sohu Tech Products

Apr 9, 2025 · Artificial Intelligence

Boost LLM Retrieval Accuracy with MCP: A Step‑by‑Step Guide

This tutorial explains how to overcome the limitations of Retrieval‑Augmented Generation by using the Model Context Protocol (MCP) together with a MongoDB database, providing detailed setup steps, configuration examples, and performance comparisons that demonstrate significantly higher query precision for large language models.

AI agentsMCPMongoDB

0 likes · 24 min read

Boost LLM Retrieval Accuracy with MCP: A Step‑by‑Step Guide

Architect's Alchemy Furnace

Apr 8, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), its three‑step workflow of retrieval, augmentation, and generation, its key advantages such as improved accuracy and explainability, and compares RAG with traditional pre‑trained models, fine‑tuned models, hybrid models, knowledge‑distillation methods, and RLHF, while also covering vector, full‑text, and hybrid retrieval modes and the role of rerank models.

AIKnowledge retrievalRAG

0 likes · 18 min read

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

Tencent Cloud Developer

Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Retrieval‑Augmented Generation (RAG) enhances large language models by fetching up‑to‑date external knowledge before generation, mitigating knowledge‑cutoff limits and hallucinations through a retrieval step (using text, vector, or graph methods) and a generation step, evolving from naive single‑method approaches to advanced, modular, graph‑based, and agentic systems that enable adaptive, multi‑hop reasoning and future intelligent, multimodal pipelines.

AIHallucination MitigationKnowledge retrieval

0 likes · 9 min read

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development