Tagged articles
924 articles
Page 5 of 10
DataFunSummit
DataFunSummit
Nov 7, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs

This article examines Tencent’s large language model deployments across content generation, intelligent customer service, and game role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—highlighting how they enhance performance, explainability, and multi‑step reasoning in real‑world business scenarios.

AIAgentLLM
0 likes · 4 min read
How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 6, 2025 · Artificial Intelligence

How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval

This article explains why building a high‑quality RAG knowledge base is critical, outlines offline parsing techniques for multi‑format documents, presents semantic chunking strategies that preserve structure and context, and shows how to answer interview questions with a robust, production‑ready pipeline.

AI InterviewChunkingKnowledge Base
0 likes · 8 min read
How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 5, 2025 · Artificial Intelligence

Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo

Building a Retrieval‑Augmented Generation (RAG) system may be straightforward in code, but making it reliable, accurate, and scalable in production involves challenges across data preparation, vector retrieval, query rewriting, generation control, and system integration, turning a demo into a truly useful AI service.

AILLMPrompt Engineering
0 likes · 8 min read
Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo
DataFunSummit
DataFunSummit
Nov 4, 2025 · Artificial Intelligence

How Tencent Leverages RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article explores Tencent's large language model deployments across various business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—that enable these applications.

AIAgentRAG
0 likes · 4 min read
How Tencent Leverages RAG, GraphRAG, and Agents to Power Large Language Model Applications
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 4, 2025 · Artificial Intelligence

Why Financial RAG Fails and How to Solve Its Core Challenges

This article explains why Retrieval‑Augmented Generation (RAG) projects in the financial sector often underperform, highlighting data‑structure complexities, document‑parsing hurdles, chunking strategies, compliance constraints, evaluation metrics, and engineering requirements, and offers practical solutions and code examples.

ChunkingComplianceEngineering
0 likes · 10 min read
Why Financial RAG Fails and How to Solve Its Core Challenges
dbaplus Community
dbaplus Community
Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data PlatformLLMRAG
0 likes · 17 min read
How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms
Instant Consumer Technology Team
Instant Consumer Technology Team
Nov 3, 2025 · Artificial Intelligence

Large Language Models Power Big Data SRE Knowledge & Root‑Cause Automation

Facing the growing complexity of big‑data platforms, the SRE team adopted large‑language‑model agents to automate knowledge management and root‑cause analysis, employing Retrieval‑Augmented Generation, a vector store, and the Model Context Protocol to enable intelligent, scalable, and efficient incident diagnosis and resolution.

AIMCPRAG
0 likes · 12 min read
Large Language Models Power Big Data SRE Knowledge & Root‑Cause Automation
DataFunSummit
DataFunSummit
Nov 3, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World AI: From RAG to Agents

This article examines Tencent's large language model applications across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the three key technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agents—that enable these capabilities.

AI applicationsAgentLLM
0 likes · 4 min read
How Tencent’s LLM Powers Real‑World AI: From RAG to Agents
Data Party THU
Data Party THU
Nov 1, 2025 · Artificial Intelligence

How to Blend Process‑Oriented and Agent‑Centric AI into a Hybrid Intelligent Pipeline

This article analyzes two contrasting AI agent design paradigms—process‑driven workflow orchestration and autonomous agent intelligence—examines their strengths and limitations, and proposes a hybrid architecture that fuses deterministic pipelines with dynamic planning, tool use, and memory mechanisms to achieve both reliability and adaptability.

AIAgentHybrid
0 likes · 15 min read
How to Blend Process‑Oriented and Agent‑Centric AI into a Hybrid Intelligent Pipeline
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 1, 2025 · Artificial Intelligence

Turn a Basic RAG Demo into a High‑Impact Interview Project

This guide shows how to evolve a simple Retrieval‑Augmented Generation prototype into a production‑grade system by strengthening data ingestion, optimizing retrieval with hybrid and reranking techniques, adding query rewriting, long‑context handling, reinforcement learning, and multimodal support, so candidates can demonstrate real engineering depth in interviews.

AILLMRAG
0 likes · 7 min read
Turn a Basic RAG Demo into a High‑Impact Interview Project
BirdNest Tech Talk
BirdNest Tech Talk
Oct 30, 2025 · Artificial Intelligence

Master LangChain Chains with LCEL: From Simple Jokes to RAG and Agent Pipelines

This guide explains how LangChain’s Expression Language (LCEL) lets you declaratively connect prompts, models, and output parsers into chains, walks through environment setup, dependency installation, and detailed code examples ranging from a basic joke generator to retrieval‑augmented generation and memory‑enabled agents.

AgentLCELLangChain
0 likes · 5 min read
Master LangChain Chains with LCEL: From Simple Jokes to RAG and Agent Pipelines
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 30, 2025 · Artificial Intelligence

Why AI Agents Aren’t As Simple As They Appear: Engineering Challenges and Solutions

Building AI agents may seem straightforward with frameworks like LangChain, but hidden complexities in orchestration, memory management, reproducibility, and scalability turn simple demos into fragile systems, requiring systematic engineering, observability, and robust design to achieve reliable, production‑grade intelligent agents.

AI agentsAgent DesignLangChain
0 likes · 21 min read
Why AI Agents Aren’t As Simple As They Appear: Engineering Challenges and Solutions
DeWu Technology
DeWu Technology
Oct 29, 2025 · Artificial Intelligence

Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code

This article explains how proper document chunking—choosing the right chunk size, overlap, and structure‑aware boundaries—directly impacts the relevance, factuality, and efficiency of Retrieval‑Augmented Generation pipelines, and provides multiple Python implementations ranging from simple fixed‑length splits to semantic and hybrid approaches.

ChunkingEmbeddingLLM
0 likes · 29 min read
Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code
Bilibili Tech
Bilibili Tech
Oct 27, 2025 · Artificial Intelligence

How Bilibili’s LLM-Powered System Cuts Game Localization Costs by 80%

Bilibili’s game algorithm team built a four‑layer, LLM‑based translation platform that automates terminology extraction, retrieval‑augmented generation, and quality assessment, dramatically reducing localization cycles by over 85% and costs by up to 80% while supporting ten languages and ensuring consistent, culturally‑accurate game text.

LLMRAGgame localization
0 likes · 20 min read
How Bilibili’s LLM-Powered System Cuts Game Localization Costs by 80%
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Oct 27, 2025 · Artificial Intelligence

Designing Effective Generation Modules for RAG: Prompt Engineering, Multi‑Document Fusion, and Hallucination Control

This article explains how to design and optimize the generation module of Retrieval‑Augmented Generation systems by building robust prompts, merging multi‑source information, controlling answer formats, and applying post‑generation verification to reduce hallucinations and improve enterprise‑grade performance.

AIGeneration ModuleHallucination Control
0 likes · 9 min read
Designing Effective Generation Modules for RAG: Prompt Engineering, Multi‑Document Fusion, and Hallucination Control
BirdNest Tech Talk
BirdNest Tech Talk
Oct 27, 2025 · Artificial Intelligence

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

This article explains how LangChain's Indexing API adds state management and synchronization to the classic load‑split‑embed‑store RAG pipeline, detailing the RecordManager component, the index function workflow, key parameters, implementation considerations, and best‑practice code examples for production‑grade vector stores.

FAISSIndexing APILangChain
0 likes · 12 min read
How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Oct 27, 2025 · Artificial Intelligence

Master AI Agents and MCP: A Complete 4‑Month Learning Roadmap

This article presents a structured, step‑by‑step learning path that guides beginners from Python fundamentals through AI API mastery, Retrieval‑Augmented Generation, deep MCP protocol knowledge, and advanced multi‑agent development, complete with practical code examples and performance‑monitoring techniques.

AI agentsLangChainMCP protocol
0 likes · 14 min read
Master AI Agents and MCP: A Complete 4‑Month Learning Roadmap
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Oct 24, 2025 · Artificial Intelligence

Beyond RAG: Three Emerging Knowledge‑Engineering Strategies (ICL, Online Learning, SLM)

The article outlines three post‑RAG knowledge‑engineering approaches—In‑Context Learning with dynamic few‑shot selection, Online Learning encompassing Meta‑Learning and Lifelong Learning to quickly adapt to new tasks, and the Small Language Model path that combines fine‑tuned task‑specific experts with LLM‑SLM collaboration for efficient, privacy‑preserving inference.

In-Context LearningKnowledge EngineeringLLM
0 likes · 4 min read
Beyond RAG: Three Emerging Knowledge‑Engineering Strategies (ICL, Online Learning, SLM)
DataFunTalk
DataFunTalk
Oct 23, 2025 · Artificial Intelligence

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, Supervised Fine‑Tuning, and autonomous agents boost model intelligence, reduce hallucinations, and enable sophisticated content creation, understanding, and interactive applications.

AI agentsLarge Language ModelRAG
0 likes · 4 min read
How Tencent Leverages RAG and Agents to Supercharge Large Language Models
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Oct 23, 2025 · Artificial Intelligence

Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained

This article walks developers through three essential upgrades for Retrieval‑Augmented Generation systems—hybrid search combining vector and keyword retrieval, query rewriting to clarify conversational inputs, and re‑ranking with a cross‑encoder—providing step‑by‑step code examples using LangChain to dramatically improve answer quality.

AIHybrid SearchLangChain
0 likes · 9 min read
Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained
Xuanwu Backend Tech Stack
Xuanwu Backend Tech Stack
Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

Artificial IntelligenceRAGRerank
0 likes · 10 min read
How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers
JD Tech Talk
JD Tech Talk
Oct 21, 2025 · Backend Development

How Backend Engineers Are Breaking Through AI with RAG Architectures

This article details a backend developer's two‑year AI journey, the challenges of rapid model advances, and how applying microservice principles to Retrieval‑Augmented Generation (RAG) creates a scalable, multi‑agent platform for insurance knowledge, memory, and intelligent agents.

Backend AIKnowledge BaseRAG
0 likes · 11 min read
How Backend Engineers Are Breaking Through AI with RAG Architectures
BirdNest Tech Talk
BirdNest Tech Talk
Oct 21, 2025 · Artificial Intelligence

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

This article explains what vector stores are, outlines their core workflow of adding, querying, and searching embeddings, compares popular back‑ends like FAISS, Chroma, and Pinecone, and walks through a complete Chinese‑language example using LangChain’s FAISS integration with detailed code and result analysis.

AIEmbeddingsFAISS
0 likes · 10 min read
How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain
BirdNest Tech Talk
BirdNest Tech Talk
Oct 16, 2025 · Artificial Intelligence

Mastering Text Splitting in LangChain: From Theory to Code

This guide explains why large documents must be broken into semantic chunks for LLMs, introduces core parameters like chunk_size and chunk_overlap, compares LangChain's various splitters, and walks through a complete Python example that loads a long text, configures a RecursiveCharacterTextSplitter, and inspects the resulting chunks.

EmbeddingLangChainRAG
0 likes · 9 min read
Mastering Text Splitting in LangChain: From Theory to Code
DataFunSummit
DataFunSummit
Oct 16, 2025 · Artificial Intelligence

How Chat BI Transforms Data Warehousing with AI: Unlock Real‑Time Insights

This presentation by iQIYI’s Technical Director Zhang Xiaoming details the evolution of BI systems, introduces the Chat BI framework, explains its three‑step implementation, outlines architectural design, data‑warehouse integration, performance optimizations, and user‑operation strategies, revealing how AI and RAG empower smarter data analytics.

AIBIChatBI
0 likes · 18 min read
How Chat BI Transforms Data Warehousing with AI: Unlock Real‑Time Insights
Alibaba Cloud Native
Alibaba Cloud Native
Oct 15, 2025 · Cloud Native

What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes

The Higress 2.0 release introduces 30 changes—13 new features such as a RAG MCP server and ECDS‑based configuration refactor, 7 bug fixes, 5 refactorings, documentation updates and a test improvement—providing developers with enhanced knowledge‑management capabilities, more stable routing, and clearer documentation for cloud‑native service‑mesh environments.

Bug FixMCPRAG
0 likes · 20 min read
What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes
Xiaohe Frontend Team
Xiaohe Frontend Team
Oct 15, 2025 · Artificial Intelligence

REFRAG: Using Tiny Models to Compress RAG for Faster, Smarter AI

Meta’s new REFRAG framework lets a lightweight encoder compress retrieved text into semantic tags, enabling large language models to answer queries with far fewer tokens, lower latency, and higher throughput, while preserving core meaning and allowing flexible placement of compressed information within prompts.

LLM efficiencyModel CompressionRAG
0 likes · 8 min read
REFRAG: Using Tiny Models to Compress RAG for Faster, Smarter AI
Practical DevOps Architecture
Practical DevOps Architecture
Oct 14, 2025 · Artificial Intelligence

Master AI Agents: From Basics to Advanced Multi-Model Development

This comprehensive AI agent development course covers 18 chapters, ranging from fundamental concepts and architecture to large‑model integration, tool and browser control, memory, RAG self‑learning, sandboxing, database manipulation, multi‑agent architectures, code assistance, and a real‑world frontend automation project, complete with source code and documentation.

AI agentsLangChainLarge Language Models
0 likes · 3 min read
Master AI Agents: From Basics to Advanced Multi-Model Development
DataFunTalk
DataFunTalk
Oct 13, 2025 · Artificial Intelligence

How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while explaining the underlying technologies of Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems.

AI applicationsAgentLarge Language Model
0 likes · 4 min read
How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications
DataFunTalk
DataFunTalk
Oct 11, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World Apps with RAG, GraphRAG & Agents

This article explores Tencent’s large language model deployments across diverse business scenarios—content generation, intelligent customer service, and role‑playing—detailing the underlying RAG, GraphRAG, and Agent technologies, their principles, practical implementations, and the advantages they bring to enterprise AI solutions.

AIAgentLLM
0 likes · 5 min read
How Tencent’s LLM Powers Real‑World Apps with RAG, GraphRAG & Agents
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Oct 10, 2025 · Artificial Intelligence

How Ontologies Boost Large Language Models: A Comprehensive Review

This review examines how formal knowledge representations (ontologies) can be integrated with large language models to enhance reasoning, reduce hallucinations, and improve factual reliability, outlining three roles—information provider, reasoner, validator—while analyzing recent frameworks, open‑source projects, and future research challenges.

AIOntologyRAG
0 likes · 29 min read
How Ontologies Boost Large Language Models: A Comprehensive Review
JD Tech
JD Tech
Oct 9, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines external knowledge retrieval with large language models, covering its motivations, data preparation, chunking strategies, vectorization, storage, query processing, retrieval, reranking, prompt engineering, and LLM generation, plus practical optimization tips.

ChunkingLLMRAG
0 likes · 14 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?
DataFunSummit
DataFunSummit
Sep 28, 2025 · Artificial Intelligence

Unlocking Enterprise Knowledge: Building Multimodal AI Systems with LLMs

This article examines the challenges of processing massive multimodal data in enterprises and presents a knowledge‑augmentation framework that leverages Retrieval‑Augmented Generation, memory‑inspired architecture, and feedback loops to enable reliable, scalable AI‑driven decision making across diverse business scenarios.

Enterprise KnowledgeKnowledge GraphLLM
0 likes · 29 min read
Unlocking Enterprise Knowledge: Building Multimodal AI Systems with LLMs
Volcano Engine Developer Services
Volcano Engine Developer Services
Sep 28, 2025 · Artificial Intelligence

Demystifying AI Jargon: A Beginner’s Guide to Large Language Models

This guide breaks down the complex terminology of large language models—explaining tokens, transformers, self‑attention, RAG, scaling laws, dense vs. sparse architectures, and training stages—using clear analogies and step‑by‑step explanations so readers can confidently understand and work with modern AI systems.

AI fundamentalsLarge Language ModelsRAG
0 likes · 35 min read
Demystifying AI Jargon: A Beginner’s Guide to Large Language Models
Data STUDIO
Data STUDIO
Sep 28, 2025 · Artificial Intelligence

Top Reranker Models for RAG in 2025: A Comparative Review

This article explains why initial retrieval in Retrieval‑Augmented Generation often yields noisy results, describes how rerankers act as quality filters to improve relevance, compares the leading 2025 reranker models—including Cohere, bge‑reranker, Voyage, Jina, FlashRank, and MixedBread—and provides code snippets, evaluation metrics, and guidance for selecting the right model for specific use cases.

AICross-EncoderLLM
0 likes · 31 min read
Top Reranker Models for RAG in 2025: A Comparative Review
JD Tech Talk
JD Tech Talk
Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, detailing its core workflow—from knowledge preparation, chunking, and embedding to vector database storage and the question‑answering stage—while highlighting key challenges, tools, and optimization strategies.

AIChunkingEmbedding
0 likes · 15 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?
JD Cloud Developers
JD Cloud Developers
Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, covering its core workflow—from knowledge preparation, data cleaning, and metadata extraction to query preprocessing, vector retrieval, reranking, information integration, and final LLM generation, while also reviewing common embedding models and vector databases.

Artificial IntelligenceLLMRAG
0 likes · 13 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?
Tencent Advertising Technology
Tencent Advertising Technology
Sep 27, 2025 · Artificial Intelligence

How AI‑Generated Test Cases Transformed Tencent Ads R&D Workflow

This article details how Tencent's advertising R&D team tackled lengthy, experience‑driven test case creation by deploying AIGC‑powered demand analysis, Prompt + RAG knowledge retrieval, and multi‑stage automated validation, ultimately boosting test case adoption from under 20% to nearly 60% while reducing manual effort and iteration time.

AI testingAIGCAutomation
0 likes · 14 min read
How AI‑Generated Test Cases Transformed Tencent Ads R&D Workflow
Bilibili Tech
Bilibili Tech
Sep 26, 2025 · Artificial Intelligence

How RAG Transforms Natural Language Queries into Accurate SQL for Business Users

This article explains how Retrieval‑Augmented Generation (RAG) combines large language models with vector databases to let non‑technical staff query massive membership data using plain language, detailing the workflow, technical architecture, optimization challenges, and real‑world impact on data‑driven decision making.

AIData PlatformLLM
0 likes · 17 min read
How RAG Transforms Natural Language Queries into Accurate SQL for Business Users
BirdNest Tech Talk
BirdNest Tech Talk
Sep 25, 2025 · Artificial Intelligence

Mastering LangChain: A Hands‑On Guide to Building LLM Applications

This repository offers a comprehensive, step‑by‑step LangChain tutorial series that walks developers through installation, the LangChain Expression Language, streaming, parallel execution, callbacks, serialization, model customization, prompt templates, memory, multimodal support, and advanced tools like LangGraph and LangSmith, enabling the creation of sophisticated AI applications.

AI developmentLLMLangChain
0 likes · 9 min read
Mastering LangChain: A Hands‑On Guide to Building LLM Applications
DataFunSummit
DataFunSummit
Sep 24, 2025 · Artificial Intelligence

Taming LLM Hallucinations: Strategies and Solutions from 360

This article explores the problem of large‑model hallucinations, explains its definitions and classifications, analyzes root causes in data, algorithms and inference, and presents detection methods and practical mitigation techniques such as RAG, decoding strategies, and model‑enhancement approaches, illustrated with real‑world 360 use cases and future research directions.

AI safetyLLMRAG
0 likes · 22 min read
Taming LLM Hallucinations: Strategies and Solutions from 360
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Sep 23, 2025 · Artificial Intelligence

How Front‑End Developers Can Build Powerful AI Agents with LangChain.js

This article guides front‑end developers through the evolution of AI agents—from early chatbots to modern multimodal agents—covering LLM fundamentals, prompt engineering, LangChain.js workflow creation, Retrieval‑Augmented Generation, model context protocols, and future multi‑agent technologies.

AI AgentFrontend DevelopmentLLM
0 likes · 11 min read
How Front‑End Developers Can Build Powerful AI Agents with LangChain.js
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 21, 2025 · Artificial Intelligence

FinKario: Event‑Enhanced Financial Knowledge Graphs Boost A‑Share Sharpe Ratio to 4.9

This article reviews the FinKario paper, which introduces an event‑augmented financial knowledge graph and a two‑stage RAG retrieval strategy that together enable real‑time knowledge updates and efficient integration of long‑form research reports, yielding a Sharpe ratio of 4.9 and outperforming baseline LLMs and institutional strategies in back‑testing.

FinKarioLLMRAG
0 likes · 10 min read
FinKario: Event‑Enhanced Financial Knowledge Graphs Boost A‑Share Sharpe Ratio to 4.9
DataFunSummit
DataFunSummit
Sep 19, 2025 · Artificial Intelligence

How Tencent Leverages LLMs: RAG, GraphRAG, and Agents in Real‑World Apps

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑play, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and intelligent agents—that enable these applications.

AIAgentLLM
0 likes · 4 min read
How Tencent Leverages LLMs: RAG, GraphRAG, and Agents in Real‑World Apps
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Sep 18, 2025 · Artificial Intelligence

How to Diagnose and Optimize RAG Systems When 30% Answers Miss the Mark

This guide explains why RAG systems often produce off‑topic answers, outlines how to measure hit‑rate, retrieval, reranking and generation metrics, provides step‑by‑step evaluation pipelines, code examples, real‑world case studies, and interview‑ready templates for diagnosing and optimizing each stage of the pipeline.

AIRAGgeneration
0 likes · 18 min read
How to Diagnose and Optimize RAG Systems When 30% Answers Miss the Mark
Data STUDIO
Data STUDIO
Sep 18, 2025 · Artificial Intelligence

Build a RAG App from Scratch: Master Text Chunking, Vector Retrieval, and Coreference Resolution

This tutorial walks through building a Retrieval‑Augmented Generation (RAG) system from the ground up, covering document parsing, text chunking strategies, vector store creation with ChromaDB, semantic search, prompt engineering for LLMs, conversation memory, coreference handling, and practical optimization tips, all illustrated with complete Python code.

ChromaDBPythonRAG
0 likes · 19 min read
Build a RAG App from Scratch: Master Text Chunking, Vector Retrieval, and Coreference Resolution
Zhuanzhuan Tech
Zhuanzhuan Tech
Sep 17, 2025 · Artificial Intelligence

LLM‑Powered Intent Understanding, RAG QA, and Knowledge Base Maintenance for Recycling

This article details how Zhuanzhuan leverages large language models to enhance on‑site device inspection through a three‑stage pipeline—intent understanding, retrieval‑augmented generation QA, and automated knowledge‑base upkeep—highlighting technical innovations, workflow integration, and the resulting operational benefits.

AIIntent UnderstandingKnowledge Base
0 likes · 14 min read
LLM‑Powered Intent Understanding, RAG QA, and Knowledge Base Maintenance for Recycling
DataFunSummit
DataFunSummit
Sep 17, 2025 · Artificial Intelligence

How Tencent’s Large Language Model Powers Real-World AI Applications

This article explores Tencent’s large language model across diverse business scenarios—content generation, intelligent customer service, role‑playing, and more—detailing the principles and practical uses of Retrieval‑Augmented Generation (RAG), GraphRAG, and Agent technologies, and how they enhance model intelligence and user experience.

AIAgentKnowledge Graph
0 likes · 4 min read
How Tencent’s Large Language Model Powers Real-World AI Applications
Architecture & Thinking
Architecture & Thinking
Sep 17, 2025 · Artificial Intelligence

How the 32B ‘Zhiyu’ Model is Revolutionizing Intelligent Operations

The Zhiyu model, a 32‑billion‑parameter SRE‑focused LLM, combines extensive domain knowledge, enhanced professional skills, and deterministic RAG to deliver precise, actionable insights for intelligent operations, backed by a robust multi‑source training pipeline, staged training, and flexible deployment options.

AI OperationsRAGSRE
0 likes · 7 min read
How the 32B ‘Zhiyu’ Model is Revolutionizing Intelligent Operations
DataFunTalk
DataFunTalk
Sep 15, 2025 · Artificial Intelligence

How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap

In an interview, Di Xingxing of Autohome details their AI+Data framework—unified lake‑warehouse, intelligent engine, and agent services—that breaks data silos, blends traditional models with LLMs, leverages causal inference and RAG knowledge bases, and uses continuous feedback to build explainable, evolving data agents for accurate sales forecasting, competitive analysis, and end‑to‑end business automation in the automotive industry.

AIAutomotiveData Engineering
0 likes · 10 min read
How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap
AI Cyberspace
AI Cyberspace
Sep 15, 2025 · Artificial Intelligence

What Is Agentic AI? From LLM Limits to Autonomous AI Agents

Agentic AI transforms static large language models into autonomous agents by adding perception, goal orientation, planning, action, interaction, and iterative loops, tracing its evolution from early chatbots through Prompt Engineering, ReAct, AutoGPT, OpenAI Function Calling, to modern multi‑agent frameworks, while addressing challenges like memory, hallucinations, and scalability.

RAGReActagentic AI
0 likes · 38 min read
What Is Agentic AI? From LLM Limits to Autonomous AI Agents
DataFunSummit
DataFunSummit
Sep 14, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents

This article examines Tencent's large language model deployments across various business scenarios, detailing the use of Retrieval‑Augmented Generation, GraphRAG for role‑playing, and Agent technologies, while also outlining core application areas and the three main technical approaches—SFT, RAG, and Agents.

AI agentsAI applicationsGraphRAG
0 likes · 4 min read
How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Sep 13, 2025 · Artificial Intelligence

Choosing the Right Path for RAG Development: Low‑Code Platforms vs Open‑Source Frameworks

This article compares low‑code development platforms with open‑source large‑model frameworks such as LangChain and LlamaIndex, outlining their features, advantages, limitations, and suitability for building retrieval‑augmented generation (RAG) applications in various enterprise scenarios.

AI developmentLangChainLlamaIndex
0 likes · 13 min read
Choosing the Right Path for RAG Development: Low‑Code Platforms vs Open‑Source Frameworks
Fun with Large Models
Fun with Large Models
Sep 12, 2025 · Artificial Intelligence

When to Choose Model Fine‑Tuning vs RAG for Large‑Model Engineering Interviews

The article explains the technical background and suitable scenarios for Retrieval‑Augmented Generation (RAG) and model fine‑tuning, compares their strengths, discusses how they can be combined, and provides interview‑style Q&A on their capabilities, risks, and differences from model distillation.

AI InterviewFine‑TuningLarge Language Models
0 likes · 7 min read
When to Choose Model Fine‑Tuning vs RAG for Large‑Model Engineering Interviews
Data Party THU
Data Party THU
Sep 11, 2025 · Artificial Intelligence

How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores

ComRAG tackles the static‑knowledge gaps, uneven QA quality, and storage explosion of community question‑answer platforms by integrating a static documentation vector store with dual dynamic CQA stores managed via a centroid‑based memory, delivering higher accuracy, lower latency, and scalable storage for industrial retrieval‑augmented generation.

Artificial IntelligenceCommunity QADynamic Retrieval
0 likes · 7 min read
How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 11, 2025 · Artificial Intelligence

How REFRAG Cuts LLM Decoding Time by 30×: A New Efficient RAG Framework

REFRAG (REpresentation For RAG) introduces a novel decoding framework that compresses, senses, and expands context using precomputed chunk embeddings, achieving up to 30.85× faster first-token generation and 16× larger context windows without sacrificing perplexity, as validated across diverse long‑context tasks.

LLMRAGchunk embeddings
0 likes · 18 min read
How REFRAG Cuts LLM Decoding Time by 30×: A New Efficient RAG Framework
Continuous Delivery 2.0
Continuous Delivery 2.0
Sep 11, 2025 · Artificial Intelligence

Building Scalable Enterprise RAG: Lessons, Pitfalls, and Proven Solutions

This article shares practical lessons from building a large‑scale enterprise RAG system, covering imperfect data, document quality scoring, hierarchical chunking, metadata design, semantic‑search failures, open‑source model choices, and table handling to achieve reliable AI‑driven search.

Enterprise AIOpen-source modelsRAG
0 likes · 13 min read
Building Scalable Enterprise RAG: Lessons, Pitfalls, and Proven Solutions
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Sep 11, 2025 · Artificial Intelligence

How AST Boosts LLM‑Powered Code Question Answering: Theory, Practice, and Future Directions

This article explores how abstract syntax trees (AST) can enrich large language model (LLM) based code question‑answering by providing precise structural context, detailing LLM strengths and limits, describing AST‑LLM collaboration, RAG integration, cutting‑edge models, practical tooling, challenges, standardisation efforts, and future research avenues.

ASTLLMRAG
0 likes · 30 min read
How AST Boosts LLM‑Powered Code Question Answering: Theory, Practice, and Future Directions
DaTaobao Tech
DaTaobao Tech
Sep 10, 2025 · Frontend Development

How AI Cut Front‑End Development Time by 60% in Alibaba’s Giraffe Search

This article details how the author transformed a constrained Weex/Muise front‑end project for the “giraffe” search page into an AI‑driven workflow, building a structured knowledge base, defining project‑level rules, and using RAG techniques to accelerate component, tracking, and payment integration, ultimately reducing development time by 60% and proposing a new “AI programming as context engineering” paradigm.

AIFrontend DevelopmentKnowledge Base
0 likes · 14 min read
How AI Cut Front‑End Development Time by 60% in Alibaba’s Giraffe Search
DataFunTalk
DataFunTalk
Sep 10, 2025 · Artificial Intelligence

Why RAG is Evolving: From Retrieval to Integrated Reasoning, Memory, and Multimodal AI

This article explores how Retrieval‑Augmented Generation (RAG) is transitioning from basic retrieve‑and‑generate pipelines to a unified architecture that incorporates reasoning chains, agent layers, knowledge graphs, Monte‑Carlo Tree Search, reinforcement learning, sophisticated memory management, and multimodal tensor‑based retrieval, while addressing engineering challenges such as storage expansion, re‑ranking, and index dimensionality.

AI reasoningRAGRetrieval-Augmented Generation
0 likes · 19 min read
Why RAG is Evolving: From Retrieval to Integrated Reasoning, Memory, and Multimodal AI
Architecture Breakthrough
Architecture Breakthrough
Sep 7, 2025 · Industry Insights

Why Arrogance Blocks You From Riding the AI Wave—and How to Overcome It

The article argues that arrogance, not lack of knowledge, hinders individuals from seizing AI opportunities, outlines four psychological barriers—unseen, undervalued, incomprehensible, and too late—and provides practical steps such as prompt engineering, RAG, fine‑tuning, and AI agents to actively engage with the AI wave.

AIIndustry InsightsPrompt Engineering
0 likes · 11 min read
Why Arrogance Blocks You From Riding the AI Wave—and How to Overcome It
Data Party THU
Data Party THU
Sep 5, 2025 · Artificial Intelligence

What a PRISMA Review Uncovers About Retrieval‑Augmented Generation (RAG)

This systematic PRISMA review analyzes 128 highly‑cited RAG papers, covering five major databases, 343 datasets, a detailed technical roadmap, evaluation metrics from EM to LLM‑as‑Judge, and future research directions, showing that RAG has evolved into a complex, programmable, and auditable distributed system.

AIRAGSystematic Review
0 likes · 5 min read
What a PRISMA Review Uncovers About Retrieval‑Augmented Generation (RAG)
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 5, 2025 · Artificial Intelligence

How Context Engineering Transforms Dify Agents: Boost Efficiency by 10×

This article explains how Context Engineering (CE) extends Prompt Engineering by integrating seven core elements—system prompts, user input, short‑term memory, long‑term memory, retrieval, tools, and structured output—using the open‑source Dify platform to build dynamic, multimodal agents that cut inference costs tenfold and raise complex‑task success rates by 40%.

AI Agent DevelopmentDifyLLM
0 likes · 16 min read
How Context Engineering Transforms Dify Agents: Boost Efficiency by 10×
DataFunSummit
DataFunSummit
Sep 4, 2025 · Artificial Intelligence

Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation

This article explores the evolving search demands of the intelligent era, explains dense and sparse vector concepts, details Elasticsearch's vector search capabilities and recent performance breakthroughs, introduces hybrid and relevance‑tuning techniques, and demonstrates RAG principles and real‑world enterprise use cases.

AIElasticsearchHybrid Search
0 likes · 14 min read
Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation
Data Party THU
Data Party THU
Sep 3, 2025 · Artificial Intelligence

Unlocking Large Model Secrets: Transformers, MoE, Fine‑Tuning, RAG & KV Caching

This article provides a comprehensive technical overview of today’s large‑model ecosystem, covering the Transformer architecture, Mixture‑of‑Experts extensions, five fine‑tuning methods, the evolution from traditional RAG to agentic RAG, classic agent design patterns, diverse text‑chunking strategies, and the KV‑cache optimization that accelerates inference.

Fine‑tuningKV CacheMixture of Experts
0 likes · 13 min read
Unlocking Large Model Secrets: Transformers, MoE, Fine‑Tuning, RAG & KV Caching
Efficient Ops
Efficient Ops
Sep 2, 2025 · Artificial Intelligence

How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations

At the 27th GOPS Global Operations Conference in Shanghai (Oct 17‑18, 2025), Professor Wang Peng of Fudan University will reveal how large language models can extract and structure heterogeneous operational data into high‑quality knowledge bases, and how RAG‑driven Q&A enhances fault diagnosis, SOP generation, and automated decision‑making.

Artificial IntelligenceIntelligent OperationsKnowledge Base
0 likes · 3 min read
How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Sep 2, 2025 · Artificial Intelligence

Why Enterprise Large‑Model Digitalization Is So Hard: Key Challenges and Capabilities

The article analyzes why enterprise‑wide large‑model AI projects face steep hurdles, outlining required human capabilities, historical labor shifts, current hot technologies such as RAG, Agent, CoT and multimodal, their limits, a three‑stage implementation roadmap, typical case pitfalls, and the key success factors for sustainable digital transformation.

AgentCoTEnterprise AI
0 likes · 15 min read
Why Enterprise Large‑Model Digitalization Is So Hard: Key Challenges and Capabilities
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 2, 2025 · Artificial Intelligence

Why RAG Is Dead: Jeff Huber’s 5 Retrieval Secrets and Context Engineering

Jeff Huber, founder of Chroma, argues that traditional RAG is obsolete, introduces context engineering as the new paradigm, and shares five practical retrieval strategies, a complete pipeline, and insights on handling context rot, memory, and generative benchmarking to build production‑grade AI applications.

AIContext EngineeringGenerative Benchmarking
0 likes · 11 min read
Why RAG Is Dead: Jeff Huber’s 5 Retrieval Secrets and Context Engineering
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 1, 2025 · Artificial Intelligence

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

This article delves into the implementation details and optimization strategies of Retrieval‑Augmented Generation (RAG), covering document chunking, index enhancement, embedding, hybrid search, and re‑ranking, and provides practical code examples to help developers move from quick deployment to deep performance tuning.

AIChunkingEmbedding
0 likes · 19 min read
Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval
Data Thinking Notes
Data Thinking Notes
Aug 31, 2025 · Artificial Intelligence

Embedding's Role in Retrieval‑Augmented Generation: Basics, Challenges & Future

This article explains how embedding technology converts unstructured data into vector representations, powers precise retrieval in Retrieval‑Augmented Generation (RAG), outlines the evolution of embedding models, discusses current challenges such as long‑text handling and domain adaptation, and highlights emerging solutions.

AIEmbeddingRAG
0 likes · 12 min read
Embedding's Role in Retrieval‑Augmented Generation: Basics, Challenges & Future
Xiaolei Talks DB
Xiaolei Talks DB
Aug 28, 2025 · Databases

How AI Is Transforming Databases: Highlights from China’s DTCC2025

At DTCC2025 in Beijing, industry leaders showcased AI-driven innovations, vector database advances, RAG techniques, and distributed database performance breakthroughs, illustrating how databases are evolving from passive data stores into intelligent, autonomous systems that boost efficiency, scalability, and business value across sectors.

AIDistributed SystemsRAG
0 likes · 10 min read
How AI Is Transforming Databases: Highlights from China’s DTCC2025
Data Thinking Notes
Data Thinking Notes
Aug 26, 2025 · Artificial Intelligence

From Prompt to Context: How AI Agents Evolve into Proactive Intelligence

This article explores the rapid growth of large language models and explains how AI agents transform passive, single‑turn responses into proactive, continuous intelligence by leveraging a core “Prompt→Context→Action” loop, detailing their architecture, key components, challenges, and future directions.

AI AgentContext ManagementLLM architecture
0 likes · 20 min read
From Prompt to Context: How AI Agents Evolve into Proactive Intelligence
Tech Freedom Circle
Tech Freedom Circle
Aug 26, 2025 · Artificial Intelligence

How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained

This article provides a step‑by‑step technical guide to optimizing Retrieval‑Augmented Generation (RAG) for interview scenarios, covering query rewriting, HyDE, fallback strategies, routing and prompt routing, multi‑representation indexing, hybrid retrieval, re‑ranking, self‑RAG, generation control, performance benchmarking, and a practical checklist with concrete code examples and metrics.

AI InterviewHybrid RetrievalIndex Optimization
0 likes · 30 min read
How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained
Alibaba Cloud Native
Alibaba Cloud Native
Aug 26, 2025 · Artificial Intelligence

Boost Dify’s RAG Performance with Higress AI Gateway: Two Integration Strategies

This guide explains how to overcome Dify's built‑in RAG limitations by using Higress AI Gateway to connect external RAG services, detailing two integration patterns—RAG Retrieval Agent and Automatic Retrieval Injection—along with step‑by‑step configuration, validation, and the resulting benefits for enterprise AI applications.

DifyKnowledge retrievalRAG
0 likes · 13 min read
Boost Dify’s RAG Performance with Higress AI Gateway: Two Integration Strategies
DataFunSummit
DataFunSummit
Aug 25, 2025 · Artificial Intelligence

Building Xiaomi’s Vertical Domain QA Agent: From RAG to Real‑World Deployment

This article explains how Xiaomi designed and deployed a vertical‑domain question‑answering assistant for product and car queries, covering business background, a four‑module RAG‑plus‑LLM architecture, knowledge‑base construction, custom chunking strategies, dynamic signal handling, and the challenges overcome to achieve reliable real‑time voice interactions.

Agent ArchitectureLLMRAG
0 likes · 22 min read
Building Xiaomi’s Vertical Domain QA Agent: From RAG to Real‑World Deployment
DaTaobao Tech
DaTaobao Tech
Aug 25, 2025 · Artificial Intelligence

Mastering RAG: From Quick Start to Deep Optimization Strategies

This article dives into the practical implementation of Retrieval‑Augmented Generation (RAG), covering document chunking, semantic and reverse HyDE indexing, embedding, hybrid search, and re‑ranking techniques, and provides concrete code examples and optimization tips for building high‑performance AI applications.

Artificial IntelligenceChunkingEmbedding
0 likes · 18 min read
Mastering RAG: From Quick Start to Deep Optimization Strategies
Fun with Large Models
Fun with Large Models
Aug 22, 2025 · Artificial Intelligence

Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek

This tutorial shows how to create a lightweight Retrieval‑Augmented Generation (RAG) system that indexes multiple PDF files, stores their embeddings in a FAISS vector database, and answers user queries through a LangChain agent powered by DashScope embeddings and the DeepSeek‑Chat model, all wrapped in a Streamlit UI.

DashScopeDeepSeekFAISS
0 likes · 13 min read
Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek
Volcano Engine Developer Services
Volcano Engine Developer Services
Aug 21, 2025 · Artificial Intelligence

Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG

Since last year, the debate over “Prompt Engineering” has split between practitioners who favor “Context Engineering” for building scalable agent systems and scholars who treat Prompt Engineering as a broad umbrella term, highlighting the need to dynamically construct and manage context for reliable, extensible AI applications.

AI agentsLLMPrompt Engineering
0 likes · 33 min read
Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 21, 2025 · Artificial Intelligence

Why Your AI Defect Deduplication Returns Mixed Data and How to Fix It

This article details the challenges of building an AI‑powered defect deduplication system using Retrieval‑Augmented Generation, explains why LLMs produce composite (spliced) results, diagnoses the root cause as information loss in the RAG pipeline, and presents a step‑by‑step solution that restores atomicity of records for reliable duplicate detection.

AI debuggingKnowledge BaseLLM
0 likes · 14 min read
Why Your AI Defect Deduplication Returns Mixed Data and How to Fix It
JD Retail Technology
JD Retail Technology
Aug 20, 2025 · Artificial Intelligence

Launch Multi-Agent AI Systems with OxyGent in Just 20 Lines of Code

Learn how to quickly set up OxyGent, a flexible AI agent framework, by installing Python, Node.js, and the MCP tools, configuring environment variables, and using just 20 lines of code to build, debug, and deploy multi‑agent applications with features like RAG, tool integration, and distributed execution.

DeploymentMCPOxyGent
0 likes · 5 min read
Launch Multi-Agent AI Systems with OxyGent in Just 20 Lines of Code
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 20, 2025 · Artificial Intelligence

How DeepSearch Elevates RAG: From RAG 1.0 to a Multi‑Agent AI Search Engine

This article explains how Alibaba Cloud OpenSearch LLM version evolved from RAG 1.0 to RAG 2.0, introducing the DeepSearch multi‑agent architecture that combines offline data processing, online query handling, planning, clarification, search, and summarization agents to deliver more accurate and complex AI‑driven answers.

AI SearchDeepSearchLLM
0 likes · 10 min read
How DeepSearch Elevates RAG: From RAG 1.0 to a Multi‑Agent AI Search Engine
Instant Consumer Technology Team
Instant Consumer Technology Team
Aug 20, 2025 · Backend Development

How I Built a Production‑Ready RAG Service in 3 Weeks Using AI Coding Tools

In just three weeks, I single‑handedly created a production‑grade Retrieval‑Augmented Generation (RAG) API with FastAPI, leveraging Cursor and Claude Code to automate coding, testing, and deployment, and I share practical insights on AI‑assisted development, high cohesion‑low coupling design, TDD, git worktree parallelism, and agent orchestration.

AI codingFastAPIGit Worktree
0 likes · 19 min read
How I Built a Production‑Ready RAG Service in 3 Weeks Using AI Coding Tools
Instant Consumer Technology Team
Instant Consumer Technology Team
Aug 19, 2025 · Artificial Intelligence

Mastering Document Chunking for RAG: Strategies, Code & Best Practices

This article explores why proper document chunking is crucial for Retrieval‑Augmented Generation, explains core concepts like context windows and signal‑to‑noise, compares various chunking strategies—from simple fixed‑size splits to semantic and hybrid approaches—and provides practical Python code examples to help you build more effective RAG pipelines.

LLMRAGText Splitting
0 likes · 24 min read
Mastering Document Chunking for RAG: Strategies, Code & Best Practices