Tag

Retrieval-Augmented Generation

1 views collected around this technical thread.

Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 13, 2025 · Artificial Intelligence

Unlocking RAGFlow: How Retrieval‑Augmented Generation & Flow Transform AI Applications

RAGFlow is an AI architecture that merges Retrieval‑Augmented Generation with a dynamic Flow control mechanism, offering real‑time knowledge retrieval, high‑quality text generation, and flexible deployment across content creation, intelligent QA, and enterprise solutions while outlining its technical principles, advantages, challenges, and installation steps.

AIChatbotKnowledge Base
0 likes · 25 min read
Unlocking RAGFlow: How Retrieval‑Augmented Generation & Flow Transform AI Applications
IT Services Circle
IT Services Circle
Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AILarge Language ModelsRAG
0 likes · 12 min read
Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices
Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 5, 2025 · Artificial Intelligence

How DeepSeek AI Transforms Government Search with Smarter, Faster Answers

This article explains how DeepSeek's large‑model‑driven search system overcomes traditional keyword‑matching limits, improves long‑tail query coverage, and delivers personalized, accurate government service results through intent parsing, knowledge‑graph retrieval, and generative optimization.

Artificial IntelligenceGovernment ServicesLarge Language Models
0 likes · 9 min read
How DeepSeek AI Transforms Government Search with Smarter, Faster Answers
Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 4, 2025 · Artificial Intelligence

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

This comprehensive article examines Retrieval‑Augmented Generation (RAG), covering its historical evolution, core theory, implementation variants, practical code examples, diverse applications, current controversies, and future research directions within the AI and NLP landscape.

Artificial IntelligenceGenerative ModelsNatural Language Processing
0 likes · 21 min read
Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends
Architect
Architect
May 7, 2025 · Artificial Intelligence

RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval‑Augmented Generation

The article reviews the RAG-MCP framework, which combines Retrieval‑Augmented Generation with Model Context Protocol to reduce prompt bloat and improve tool‑selection accuracy for large language models by first retrieving the most relevant tools before feeding them to the LLM.

Artificial IntelligenceLLMPrompt Bloat
0 likes · 11 min read
RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval‑Augmented Generation
DevOps
DevOps
Apr 27, 2025 · Artificial Intelligence

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

This article examines how Retrieval‑Augmented Generation (RAG), AI agents, and multimodal large‑model techniques are reshaping AI‑industry integration, discusses their technical challenges and practical implementations, and outlines future development directions across algorithms, products, and domain‑specific applications.

AI agentsArtificial IntelligenceLarge Models
0 likes · 14 min read
Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends
DataFunTalk
DataFunTalk
Apr 24, 2025 · Artificial Intelligence

Is Retrieval‑Augmented Generation (RAG) Dead Yet?

This article explains the original purpose of Retrieval‑Augmented Generation, why it remains essential despite advances in large‑context LLMs, and how combining RAG with fine‑tuning, longer context windows, and model‑context protocols yields more scalable, accurate, and privacy‑preserving AI systems.

AIContext WindowLarge Language Models
0 likes · 9 min read
Is Retrieval‑Augmented Generation (RAG) Dead Yet?
DataFunSummit
DataFunSummit
Apr 21, 2025 · Artificial Intelligence

Deep Integration of Knowledge Graphs and Large Language Models: Methods, Applications, and Future Directions

This article explores how knowledge graphs can be tightly integrated with large language models through prompt engineering, fine‑tuning, retrieval‑augmented generation, reasoning collaboration, and knowledge agents, outlining technical pathways, practical implementations, and future research directions across AI domains.

AIRetrieval-Augmented Generationknowledge graph
0 likes · 23 min read
Deep Integration of Knowledge Graphs and Large Language Models: Methods, Applications, and Future Directions
DevOps
DevOps
Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval-Augmented Generation (RAG): Concepts, Evolution, and Types

This article explains Retrieval‑Augmented Generation (RAG), its role in mitigating large language model knowledge cutoff and hallucination, outlines the evolution from naive to advanced, modular, graph, and agentic RAG, and discusses future directions such as intelligent and multi‑modal RAG systems.

Artificial IntelligenceLLMRAG
0 likes · 10 min read
Understanding Retrieval-Augmented Generation (RAG): Concepts, Evolution, and Types
Tencent Cloud Developer
Tencent Cloud Developer
Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Retrieval‑Augmented Generation (RAG) enhances large language models by fetching up‑to‑date external knowledge before generation, mitigating knowledge‑cutoff limits and hallucinations through a retrieval step (using text, vector, or graph methods) and a generation step, evolving from naive single‑method approaches to advanced, modular, graph‑based, and agentic systems that enable adaptive, multi‑hop reasoning and future intelligent, multimodal pipelines.

AIAgentic AIHallucination Mitigation
0 likes · 9 min read
Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development
Architecture Digest
Architecture Digest
Mar 26, 2025 · Artificial Intelligence

Getting Started with LangChain in Java: Building Large Language Model Applications

This tutorial introduces the fundamentals of LangChain, explains large language models, prompt engineering, word embeddings, and demonstrates how to use the Java implementation LangChain4j with Maven dependencies, model I/O, memory, retrieval, chains, and agents to build sophisticated LLM‑driven applications.

AIJavaLLM
0 likes · 18 min read
Getting Started with LangChain in Java: Building Large Language Model Applications
Architect
Architect
Mar 22, 2025 · Artificial Intelligence

Understanding and Mitigating Failures in Retrieval‑Augmented Generation (RAG) Systems

Retrieval‑augmented generation (RAG) combines external knowledge retrieval with large language models to improve answer accuracy, but it often suffers from retrieval mismatches, algorithmic flaws, chunking issues, embedding biases, inefficiencies, generation errors, reasoning limits, formatting problems, system‑level failures, and high resource costs, which this article analyzes and offers solutions for.

AI ReliabilityLLMRAG
0 likes · 32 min read
Understanding and Mitigating Failures in Retrieval‑Augmented Generation (RAG) Systems
DaTaobao Tech
DaTaobao Tech
Mar 19, 2025 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Principles, Challenges, and Implementation Techniques

Retrieval‑augmented generation (RAG) enhances large language models by integrating a preprocessing pipeline—cleaning, chunking, embedding, and vector storage—with a query‑driven retrieval and prompt‑injection workflow, leveraging vector databases, multi‑stage recall, advanced prompting, and comprehensive evaluation metrics to mitigate knowledge cut‑off, hallucinations, and security issues.

LLMRAGRetrieval-Augmented Generation
0 likes · 27 min read
Retrieval Augmented Generation (RAG): Principles, Challenges, and Implementation Techniques
Tencent Technical Engineering
Tencent Technical Engineering
Feb 17, 2025 · Artificial Intelligence

Prompt Engineering: Definitions, Frameworks, Principles, and Advanced Techniques

The guide defines prompts as structured queries that unlock large‑language‑model abilities, outlines five core frameworks (RTF, Chain‑of‑Thought, RISEN, RODES, Density‑Chain), presents two key principles—clear, delimited instructions and explicit reasoning steps—to reduce hallucinations, and surveys advanced techniques such as zero‑shot, few‑shot, RAG, Tree‑of‑Thought and automatic prompt engineering.

AIChain-of-ThoughtFew-Shot Prompting
0 likes · 29 min read
Prompt Engineering: Definitions, Frameworks, Principles, and Advanced Techniques
DataFunSummit
DataFunSummit
Jan 22, 2025 · Artificial Intelligence

RAG2.0 Engine Design Challenges and Implementation

This article presents a comprehensive overview of the RAG2.0 engine design, covering RAG1.0 limitations, effective chunking methods, accurate retrieval techniques, advanced multimodal processing, hybrid search strategies, database indexing choices, and future directions such as agentic RAG and memory‑enhanced models.

ChunkingHybrid SearchRAG
0 likes · 23 min read
RAG2.0 Engine Design Challenges and Implementation
Sohu Tech Products
Sohu Tech Products
Jan 8, 2025 · Artificial Intelligence

Multimodal RAG: Implementation Paths and Development Prospects

The talk outlines Multimodal RAG implementation routes, comparing OCR‑based object recognition, transformer encoder‑decoder encoding, and Visual Language Model processing, explains the ColPali late‑interaction method for multi‑dimensional vector matching, addresses scaling tensors with binarization and reranking, and recommends a hybrid long‑term strategy where VLM excels on abstract imagery while traditional OCR remains valuable.

ColPaliMultimodal RAGOCR
0 likes · 10 min read
Multimodal RAG: Implementation Paths and Development Prospects
DataFunTalk
DataFunTalk
Jan 6, 2025 · Artificial Intelligence

Building and Applying NIO's Enterprise Knowledge Platform: Architecture, Challenges, and Future Directions

This article presents a comprehensive overview of NIO's company‑wide knowledge platform, detailing its background, layered architecture, retrieval‑augmented generation workflow, challenges such as accuracy, permission control and high concurrency, and future plans for AI‑assisted understanding, creation, multimodal capabilities, and expanded knowledge types.

AIRAGRetrieval-Augmented Generation
0 likes · 18 min read
Building and Applying NIO's Enterprise Knowledge Platform: Architecture, Challenges, and Future Directions
Baidu Geek Talk
Baidu Geek Talk
Dec 16, 2024 · Artificial Intelligence

AIAPI: Baidu's AI-Native Retrieval System for Large Language Model Applications

AIAPI, Baidu’s AI‑native retrieval platform for large language models, tackles hallucination, slow domain updates, and output opacity by delivering authoritative, timely, full‑content data through a dual‑channel architecture that combines traditional search and RAG, employs reusable ranking, graph‑enhanced data layers, dynamic caching that cuts storage by 70 %, and QueryPlan‑based QoS, achieving markedly higher retrieval quality and a 34 % speed gain with Wenxin 4.0.

AI-Native SystemsAIAPICost Optimization
0 likes · 12 min read
AIAPI: Baidu's AI-Native Retrieval System for Large Language Model Applications
Sohu Tech Products
Sohu Tech Products
Nov 27, 2024 · Artificial Intelligence

RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search

The article explains how Retrieval‑Augmented Generation (RAG) outperforms direct LLM inference by enabling real‑time knowledge updates and lower costs, and demonstrates a practical multi‑modal RAG pipeline that uses Chinese‑CLIP for vector encoding, various chunking strategies, and Redis Search for fast vector storage and retrieval.

Chinese CLIPChunkingLLM
0 likes · 17 min read
RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search
Tencent Docs Tech Team
Tencent Docs Tech Team
Nov 13, 2024 · Artificial Intelligence

Technical Architecture and Practices of the AI Document Assistant

This article explores the challenges large language models bring to efficiency tools, outlines the AI document assistant's technical thinking and architecture, and details both application‑side and model‑side practices such as retrieval‑augmented generation, intent recognition, and code‑driven table handling, concluding with key lessons.

AIAI architectureDocument Automation
0 likes · 16 min read
Technical Architecture and Practices of the AI Document Assistant