Tagged articles

924 articles

Page 5 of 10

Nov 7, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs

This article examines Tencent’s large language model deployments across content generation, intelligent customer service, and game role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—highlighting how they enhance performance, explainability, and multi‑step reasoning in real‑world business scenarios.

AIAgentLLM

0 likes · 4 min read

How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs

Wu Shixiong's Large Model Academy

Nov 6, 2025 · Artificial Intelligence

How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval

This article explains why building a high‑quality RAG knowledge base is critical, outlines offline parsing techniques for multi‑format documents, presents semantic chunking strategies that preserve structure and context, and shows how to answer interview questions with a robust, production‑ready pipeline.

AI InterviewChunkingKnowledge Base

0 likes · 8 min read

How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval

Wu Shixiong's Large Model Academy

Nov 5, 2025 · Artificial Intelligence

Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo

Building a Retrieval‑Augmented Generation (RAG) system may be straightforward in code, but making it reliable, accurate, and scalable in production involves challenges across data preparation, vector retrieval, query rewriting, generation control, and system integration, turning a demo into a truly useful AI service.

AILLMPrompt Engineering

0 likes · 8 min read

Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo

DataFunSummit

Nov 4, 2025 · Artificial Intelligence

How Tencent Leverages RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article explores Tencent's large language model deployments across various business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—that enable these applications.

AIAgentRAG

0 likes · 4 min read

How Tencent Leverages RAG, GraphRAG, and Agents to Power Large Language Model Applications

Wu Shixiong's Large Model Academy

Nov 4, 2025 · Artificial Intelligence

Why Financial RAG Fails and How to Solve Its Core Challenges

This article explains why Retrieval‑Augmented Generation (RAG) projects in the financial sector often underperform, highlighting data‑structure complexities, document‑parsing hurdles, chunking strategies, compliance constraints, evaluation metrics, and engineering requirements, and offers practical solutions and code examples.

ChunkingComplianceEngineering

0 likes · 10 min read

Why Financial RAG Fails and How to Solve Its Core Challenges

dbaplus Community

Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data PlatformLLMRAG

0 likes · 17 min read

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

Instant Consumer Technology Team

Nov 3, 2025 · Artificial Intelligence

Large Language Models Power Big Data SRE Knowledge & Root‑Cause Automation

Facing the growing complexity of big‑data platforms, the SRE team adopted large‑language‑model agents to automate knowledge management and root‑cause analysis, employing Retrieval‑Augmented Generation, a vector store, and the Model Context Protocol to enable intelligent, scalable, and efficient incident diagnosis and resolution.

AIMCPRAG

0 likes · 12 min read

Large Language Models Power Big Data SRE Knowledge & Root‑Cause Automation

DataFunSummit

Nov 3, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World AI: From RAG to Agents

This article examines Tencent's large language model applications across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the three key technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agents—that enable these capabilities.

AI applicationsAgentLLM

0 likes · 4 min read

How Tencent’s LLM Powers Real‑World AI: From RAG to Agents

Wu Shixiong's Large Model Academy

Nov 2, 2025 · Artificial Intelligence

Why Document Parsing Is the Real Bottleneck in RAG Projects (And How to Fix It)

The article explains that in Retrieval‑Augmented Generation projects the hardest challenge lies in robust document parsing—handling PDFs, PPTs, scanned contracts, OCR errors, and preserving structure—to ensure high‑quality retrieval and avoid hallucinations.

AIOCRRAG

0 likes · 10 min read

Why Document Parsing Is the Real Bottleneck in RAG Projects (And How to Fix It)

Data Party THU

Nov 1, 2025 · Artificial Intelligence

How to Blend Process‑Oriented and Agent‑Centric AI into a Hybrid Intelligent Pipeline

This article analyzes two contrasting AI agent design paradigms—process‑driven workflow orchestration and autonomous agent intelligence—examines their strengths and limitations, and proposes a hybrid architecture that fuses deterministic pipelines with dynamic planning, tool use, and memory mechanisms to achieve both reliability and adaptability.

AIAgentHybrid

0 likes · 15 min read

How to Blend Process‑Oriented and Agent‑Centric AI into a Hybrid Intelligent Pipeline

Wu Shixiong's Large Model Academy

Nov 1, 2025 · Artificial Intelligence

Turn a Basic RAG Demo into a High‑Impact Interview Project

This guide shows how to evolve a simple Retrieval‑Augmented Generation prototype into a production‑grade system by strengthening data ingestion, optimizing retrieval with hybrid and reranking techniques, adding query rewriting, long‑context handling, reinforcement learning, and multimodal support, so candidates can demonstrate real engineering depth in interviews.

AILLMRAG

0 likes · 7 min read

Turn a Basic RAG Demo into a High‑Impact Interview Project

Rare Earth Juejin Tech Community

Oct 31, 2025 · Artificial Intelligence

Build a Private AI Knowledge Base with Ollama and FastGPT

This guide walks you through setting up a locally deployed AI system using Ollama and FastGPT, covering model selection, Docker deployment, configuration, knowledge‑base creation, and testing so your team can query internal documents securely and efficiently.

AIDockerFastGPT

0 likes · 25 min read

Build a Private AI Knowledge Base with Ollama and FastGPT

Wu Shixiong's Large Model Academy

Oct 30, 2025 · Interview Experience

How to Write a RAG Project Experience That Impresses Interviewers

This guide explains why typical RAG résumé entries fall flat and provides a step‑by‑step framework—including motivation, architecture, optimization, and impact metrics—to craft a compelling, interview‑ready description of Retrieval‑Augmented Generation projects.

AI ProjectInterview TipsRAG

0 likes · 10 min read

How to Write a RAG Project Experience That Impresses Interviewers

BirdNest Tech Talk

Oct 30, 2025 · Artificial Intelligence

Master LangChain Chains with LCEL: From Simple Jokes to RAG and Agent Pipelines

This guide explains how LangChain’s Expression Language (LCEL) lets you declaratively connect prompts, models, and output parsers into chains, walks through environment setup, dependency installation, and detailed code examples ranging from a basic joke generator to retrieval‑augmented generation and memory‑enabled agents.

AgentLCELLangChain

0 likes · 5 min read

Master LangChain Chains with LCEL: From Simple Jokes to RAG and Agent Pipelines

Alibaba Cloud Developer

Oct 30, 2025 · Artificial Intelligence

Why AI Agents Aren’t As Simple As They Appear: Engineering Challenges and Solutions

Building AI agents may seem straightforward with frameworks like LangChain, but hidden complexities in orchestration, memory management, reproducibility, and scalability turn simple demos into fragile systems, requiring systematic engineering, observability, and robust design to achieve reliable, production‑grade intelligent agents.

AI agentsAgent DesignLangChain

0 likes · 21 min read

Why AI Agents Aren’t As Simple As They Appear: Engineering Challenges and Solutions

DeWu Technology

Oct 29, 2025 · Artificial Intelligence

Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code

This article explains how proper document chunking—choosing the right chunk size, overlap, and structure‑aware boundaries—directly impacts the relevance, factuality, and efficiency of Retrieval‑Augmented Generation pipelines, and provides multiple Python implementations ranging from simple fixed‑length splits to semantic and hybrid approaches.

ChunkingEmbeddingLLM

0 likes · 29 min read

Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code

Wu Shixiong's Large Model Academy

Oct 29, 2025 · Artificial Intelligence

How to Build a Robust RAG Evaluation Framework for Finance Q&A

This guide outlines a five‑dimensional evaluation system—accuracy, credibility, latency, scalability, and user experience—providing concrete metrics, code examples, and practical steps to assess Retrieval‑Augmented Generation models in financial insurance question‑answering scenarios.

AIFinanceRAG

0 likes · 11 min read

How to Build a Robust RAG Evaluation Framework for Finance Q&A

Alibaba Cloud Developer

Oct 28, 2025 · Artificial Intelligence

Automating Research with Spring AI: Inside the DeepResearch Java RAG System

The article introduces Spring AI Alibaba DeepResearch, a Java‑based system that automates end‑to‑end research by integrating multi‑agent workflows, configurable RAG pipelines, data ingestion, observability, and dynamic report generation, and provides detailed deployment and extension guidance.

AI workflowDeepResearchJava

0 likes · 20 min read

Automating Research with Spring AI: Inside the DeepResearch Java RAG System

AI Large Model Application Practice

Oct 27, 2025 · Artificial Intelligence

Why Context Engineering Is the Next Evolution Beyond Prompt Engineering

The article explains how traditional prompt engineering is giving way to Context Engineering and the Agentic Context Engineering (ACE) framework, which lets large language model agents continuously learn and improve through evolving, well‑structured context without fine‑tuning.

AI ArchitectureContext EngineeringLLM

0 likes · 12 min read

Why Context Engineering Is the Next Evolution Beyond Prompt Engineering

Bilibili Tech

Oct 27, 2025 · Artificial Intelligence

How Bilibili’s LLM-Powered System Cuts Game Localization Costs by 80%

Bilibili’s game algorithm team built a four‑layer, LLM‑based translation platform that automates terminology extraction, retrieval‑augmented generation, and quality assessment, dramatically reducing localization cycles by over 85% and costs by up to 80% while supporting ten languages and ensuring consistent, culturally‑accurate game text.

LLMRAGgame localization

0 likes · 20 min read

How Bilibili’s LLM-Powered System Cuts Game Localization Costs by 80%

Wu Shixiong's Large Model Academy

Oct 27, 2025 · Artificial Intelligence

Designing Effective Generation Modules for RAG: Prompt Engineering, Multi‑Document Fusion, and Hallucination Control

This article explains how to design and optimize the generation module of Retrieval‑Augmented Generation systems by building robust prompts, merging multi‑source information, controlling answer formats, and applying post‑generation verification to reduce hallucinations and improve enterprise‑grade performance.

AIGeneration ModuleHallucination Control

0 likes · 9 min read

Designing Effective Generation Modules for RAG: Prompt Engineering, Multi‑Document Fusion, and Hallucination Control

BirdNest Tech Talk

Oct 27, 2025 · Artificial Intelligence

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

This article explains how LangChain's Indexing API adds state management and synchronization to the classic load‑split‑embed‑store RAG pipeline, detailing the RecordManager component, the index function workflow, key parameters, implementation considerations, and best‑practice code examples for production‑grade vector stores.

FAISSIndexing APILangChain

0 likes · 12 min read

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

BirdNest Tech Talk

Oct 27, 2025 · Artificial Intelligence

Mastering LangChain Retrievers: From Basics to Multi‑Query and Ensemble Techniques

This article explains the concept of LangChain retrievers, compares them with vector stores, and provides step‑by‑step code examples for MultiQueryRetriever and EnsembleRetriever, showing how to boost recall and combine semantic and keyword search for robust RAG pipelines.

AIEnsembleRetrieverLangChain

0 likes · 9 min read

Mastering LangChain Retrievers: From Basics to Multi‑Query and Ensemble Techniques

Huawei Cloud Developer Alliance

Oct 27, 2025 · Artificial Intelligence

Master AI Agents and MCP: A Complete 4‑Month Learning Roadmap

This article presents a structured, step‑by‑step learning path that guides beginners from Python fundamentals through AI API mastery, Retrieval‑Augmented Generation, deep MCP protocol knowledge, and advanced multi‑agent development, complete with practical code examples and performance‑monitoring techniques.

AI agentsLangChainMCP protocol

0 likes · 14 min read

Master AI Agents and MCP: A Complete 4‑Month Learning Roadmap

Wu Shixiong's Large Model Academy

Oct 25, 2025 · Artificial Intelligence

How to Build a High‑Quality RAG Knowledge Base: A Step‑by‑Step Guide

This article breaks down the end‑to‑end engineering pipeline for constructing a Retrieval‑Augmented Generation (RAG) knowledge base, covering document parsing, data cleaning, semantic chunking, embedding, and index creation, plus practical optimization tips and a concise interview answer framework.

LLMRAGvector indexing

0 likes · 10 min read

How to Build a High‑Quality RAG Knowledge Base: A Step‑by‑Step Guide

AI2ML AI to Machine Learning

Oct 24, 2025 · Artificial Intelligence

Beyond RAG: Three Emerging Knowledge‑Engineering Strategies (ICL, Online Learning, SLM)

The article outlines three post‑RAG knowledge‑engineering approaches—In‑Context Learning with dynamic few‑shot selection, Online Learning encompassing Meta‑Learning and Lifelong Learning to quickly adapt to new tasks, and the Small Language Model path that combines fine‑tuned task‑specific experts with LLM‑SLM collaboration for efficient, privacy‑preserving inference.

In-Context LearningKnowledge EngineeringLLM

0 likes · 4 min read

Beyond RAG: Three Emerging Knowledge‑Engineering Strategies (ICL, Online Learning, SLM)

DataFunTalk

Oct 23, 2025 · Artificial Intelligence

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, Supervised Fine‑Tuning, and autonomous agents boost model intelligence, reduce hallucinations, and enable sophisticated content creation, understanding, and interactive applications.

AI agentsLarge Language ModelRAG

0 likes · 4 min read

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

Huawei Cloud Developer Alliance

Oct 23, 2025 · Artificial Intelligence

Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained

This article walks developers through three essential upgrades for Retrieval‑Augmented Generation systems—hybrid search combining vector and keyword retrieval, query rewriting to clarify conversational inputs, and re‑ranking with a cross‑encoder—providing step‑by‑step code examples using LangChain to dramatically improve answer quality.

AIHybrid SearchLangChain

0 likes · 9 min read

Boost Your RAG Bot’s Accuracy: Hybrid Search, Query Rewriting, and Re‑ranking Explained

Xuanwu Backend Tech Stack

Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

Artificial IntelligenceRAGRerank

0 likes · 10 min read

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

JD Tech Talk

Oct 21, 2025 · Backend Development

How Backend Engineers Are Breaking Through AI with RAG Architectures

This article details a backend developer's two‑year AI journey, the challenges of rapid model advances, and how applying microservice principles to Retrieval‑Augmented Generation (RAG) creates a scalable, multi‑agent platform for insurance knowledge, memory, and intelligent agents.

Backend AIKnowledge BaseRAG

0 likes · 11 min read

How Backend Engineers Are Breaking Through AI with RAG Architectures

BirdNest Tech Talk

Oct 21, 2025 · Artificial Intelligence

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

This article explains what vector stores are, outlines their core workflow of adding, querying, and searching embeddings, compares popular back‑ends like FAISS, Chroma, and Pinecone, and walks through a complete Chinese‑language example using LangChain’s FAISS integration with detailed code and result analysis.

AIEmbeddingsFAISS

0 likes · 10 min read

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

Alibaba Cloud Developer

Oct 20, 2025 · Artificial Intelligence

How LLM-Powered Agents Transform Secure Code Review in Enterprise Repositories

This article details the implementation of an LLM‑based code‑review agent in a C3‑level secure repository, describing its RAG‑enhanced knowledge base, CI pipeline integration, real‑world results, prompt engineering, and ongoing optimization to boost review efficiency and defect detection.

AI agentsCode ReviewLLM

0 likes · 18 min read

How LLM-Powered Agents Transform Secure Code Review in Enterprise Repositories

BirdNest Tech Talk

Oct 16, 2025 · Artificial Intelligence

Mastering Text Splitting in LangChain: From Theory to Code

This guide explains why large documents must be broken into semantic chunks for LLMs, introduces core parameters like chunk_size and chunk_overlap, compares LangChain's various splitters, and walks through a complete Python example that loads a long text, configures a RecursiveCharacterTextSplitter, and inspects the resulting chunks.

EmbeddingLangChainRAG

0 likes · 9 min read

Mastering Text Splitting in LangChain: From Theory to Code

DataFunSummit

Oct 16, 2025 · Artificial Intelligence

How Chat BI Transforms Data Warehousing with AI: Unlock Real‑Time Insights

This presentation by iQIYI’s Technical Director Zhang Xiaoming details the evolution of BI systems, introduces the Chat BI framework, explains its three‑step implementation, outlines architectural design, data‑warehouse integration, performance optimizations, and user‑operation strategies, revealing how AI and RAG empower smarter data analytics.

AIBIChatBI

0 likes · 18 min read

How Chat BI Transforms Data Warehousing with AI: Unlock Real‑Time Insights

Alibaba Cloud Native

Oct 15, 2025 · Cloud Native

What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes

The Higress 2.0 release introduces 30 changes—13 new features such as a RAG MCP server and ECDS‑based configuration refactor, 7 bug fixes, 5 refactorings, documentation updates and a test improvement—providing developers with enhanced knowledge‑management capabilities, more stable routing, and clearer documentation for cloud‑native service‑mesh environments.

Bug FixMCPRAG

0 likes · 20 min read

What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes

Xiaohe Frontend Team

Oct 15, 2025 · Artificial Intelligence

REFRAG: Using Tiny Models to Compress RAG for Faster, Smarter AI

Meta’s new REFRAG framework lets a lightweight encoder compress retrieved text into semantic tags, enabling large language models to answer queries with far fewer tokens, lower latency, and higher throughput, while preserving core meaning and allowing flexible placement of compressed information within prompts.

LLM efficiencyModel CompressionRAG

0 likes · 8 min read

REFRAG: Using Tiny Models to Compress RAG for Faster, Smarter AI

Practical DevOps Architecture

Oct 14, 2025 · Artificial Intelligence

Master AI Agents: From Basics to Advanced Multi-Model Development

This comprehensive AI agent development course covers 18 chapters, ranging from fundamental concepts and architecture to large‑model integration, tool and browser control, memory, RAG self‑learning, sandboxing, database manipulation, multi‑agent architectures, code assistance, and a real‑world frontend automation project, complete with source code and documentation.

AI agentsLangChainLarge Language Models

0 likes · 3 min read

Master AI Agents: From Basics to Advanced Multi-Model Development

Instant Consumer Technology Team

Oct 13, 2025 · Artificial Intelligence

Mastering AI Agents: Building Knowledge Bases, Workflows, and Prompt Engineering

This article explains how to design a high‑performing AI Agent by constructing a robust knowledge base, orchestrating efficient workflows, and crafting precise prompts, covering vector storage, graph databases, retrieval strategies, and practical prompt‑engineering techniques.

AI AgentKnowledge BaseLLM

0 likes · 15 min read

Mastering AI Agents: Building Knowledge Bases, Workflows, and Prompt Engineering

Volcano Engine Developer Services

Oct 13, 2025 · Artificial Intelligence

How veDB-Search Enables Scalable AI Hybrid Retrieval with a Single SQL

This article explains how veDB-Search provides a one‑stop hybrid search service that combines vector, full‑text, and scalar queries using only SQL, reducing development complexity and operational overhead for AI agents and large‑scale AI applications.

AIDatabaseHybrid Retrieval

0 likes · 15 min read

How veDB-Search Enables Scalable AI Hybrid Retrieval with a Single SQL

DataFunTalk

Oct 13, 2025 · Artificial Intelligence

How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while explaining the underlying technologies of Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems.

AI applicationsAgentLarge Language Model

0 likes · 4 min read

How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications

DataFunTalk

Oct 11, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World Apps with RAG, GraphRAG & Agents

This article explores Tencent’s large language model deployments across diverse business scenarios—content generation, intelligent customer service, and role‑playing—detailing the underlying RAG, GraphRAG, and Agent technologies, their principles, practical implementations, and the advantages they bring to enterprise AI solutions.

AIAgentLLM

0 likes · 5 min read

How Tencent’s LLM Powers Real‑World Apps with RAG, GraphRAG & Agents

AsiaInfo Technology: New Tech Exploration

Oct 10, 2025 · Artificial Intelligence

How Ontologies Boost Large Language Models: A Comprehensive Review

This review examines how formal knowledge representations (ontologies) can be integrated with large language models to enhance reasoning, reduce hallucinations, and improve factual reliability, outlining three roles—information provider, reasoner, validator—while analyzing recent frameworks, open‑source projects, and future research challenges.

AIOntologyRAG

0 likes · 29 min read

How Ontologies Boost Large Language Models: A Comprehensive Review

JD Tech

Oct 9, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines external knowledge retrieval with large language models, covering its motivations, data preparation, chunking strategies, vectorization, storage, query processing, retrieval, reranking, prompt engineering, and LLM generation, plus practical optimization tips.

ChunkingLLMRAG

0 likes · 14 min read

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

DataFunSummit

Sep 28, 2025 · Artificial Intelligence

Unlocking Enterprise Knowledge: Building Multimodal AI Systems with LLMs

This article examines the challenges of processing massive multimodal data in enterprises and presents a knowledge‑augmentation framework that leverages Retrieval‑Augmented Generation, memory‑inspired architecture, and feedback loops to enable reliable, scalable AI‑driven decision making across diverse business scenarios.

Enterprise KnowledgeKnowledge GraphLLM

0 likes · 29 min read

Unlocking Enterprise Knowledge: Building Multimodal AI Systems with LLMs

Volcano Engine Developer Services

Sep 28, 2025 · Artificial Intelligence

Demystifying AI Jargon: A Beginner’s Guide to Large Language Models

This guide breaks down the complex terminology of large language models—explaining tokens, transformers, self‑attention, RAG, scaling laws, dense vs. sparse architectures, and training stages—using clear analogies and step‑by‑step explanations so readers can confidently understand and work with modern AI systems.

AI fundamentalsLarge Language ModelsRAG

0 likes · 35 min read

Demystifying AI Jargon: A Beginner’s Guide to Large Language Models

Data STUDIO

Sep 28, 2025 · Artificial Intelligence

Top Reranker Models for RAG in 2025: A Comparative Review

This article explains why initial retrieval in Retrieval‑Augmented Generation often yields noisy results, describes how rerankers act as quality filters to improve relevance, compares the leading 2025 reranker models—including Cohere, bge‑reranker, Voyage, Jina, FlashRank, and MixedBread—and provides code snippets, evaluation metrics, and guidance for selecting the right model for specific use cases.

AICross-EncoderLLM

0 likes · 31 min read

Top Reranker Models for RAG in 2025: A Comparative Review

JD Tech Talk

Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, detailing its core workflow—from knowledge preparation, chunking, and embedding to vector database storage and the question‑answering stage—while highlighting key challenges, tools, and optimization strategies.

AIChunkingEmbedding

0 likes · 15 min read

What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?

JD Cloud Developers

Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, covering its core workflow—from knowledge preparation, data cleaning, and metadata extraction to query preprocessing, vector retrieval, reranking, information integration, and final LLM generation, while also reviewing common embedding models and vector databases.

Artificial IntelligenceLLMRAG

0 likes · 13 min read

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

AI Large Model Application Practice

Sep 28, 2025 · Artificial Intelligence

Unlock MindsDB Knowledge Base: Build RAG Pipelines and Data Agents with SQL

This article walks through MindsDB’s Knowledge Base feature, showing how to map data sources, create vector indexes, perform semantic search, combine multiple sources with SQL joins, automate updates via jobs, and construct powerful RAG pipelines and Data Agents for AI‑driven query answering.

AIData AgentMindsDB

0 likes · 14 min read

Unlock MindsDB Knowledge Base: Build RAG Pipelines and Data Agents with SQL

Tencent Advertising Technology

Sep 27, 2025 · Artificial Intelligence

How AI‑Generated Test Cases Transformed Tencent Ads R&D Workflow

This article details how Tencent's advertising R&D team tackled lengthy, experience‑driven test case creation by deploying AIGC‑powered demand analysis, Prompt + RAG knowledge retrieval, and multi‑stage automated validation, ultimately boosting test case adoption from under 20% to nearly 60% while reducing manual effort and iteration time.

AI testingAIGCAutomation

0 likes · 14 min read

How AI‑Generated Test Cases Transformed Tencent Ads R&D Workflow

Bilibili Tech

Sep 26, 2025 · Artificial Intelligence

How RAG Transforms Natural Language Queries into Accurate SQL for Business Users

This article explains how Retrieval‑Augmented Generation (RAG) combines large language models with vector databases to let non‑technical staff query massive membership data using plain language, detailing the workflow, technical architecture, optimization challenges, and real‑world impact on data‑driven decision making.

AIData PlatformLLM

0 likes · 17 min read

How RAG Transforms Natural Language Queries into Accurate SQL for Business Users

BirdNest Tech Talk

Sep 25, 2025 · Artificial Intelligence

Mastering LangChain: A Hands‑On Guide to Building LLM Applications

This repository offers a comprehensive, step‑by‑step LangChain tutorial series that walks developers through installation, the LangChain Expression Language, streaming, parallel execution, callbacks, serialization, model customization, prompt templates, memory, multimodal support, and advanced tools like LangGraph and LangSmith, enabling the creation of sophisticated AI applications.

AI developmentLLMLangChain

0 likes · 9 min read

Mastering LangChain: A Hands‑On Guide to Building LLM Applications

DataFunSummit

Sep 24, 2025 · Artificial Intelligence

Taming LLM Hallucinations: Strategies and Solutions from 360

This article explores the problem of large‑model hallucinations, explains its definitions and classifications, analyzes root causes in data, algorithms and inference, and presents detection methods and practical mitigation techniques such as RAG, decoding strategies, and model‑enhancement approaches, illustrated with real‑world 360 use cases and future research directions.

AI safetyLLMRAG

0 likes · 22 min read

Taming LLM Hallucinations: Strategies and Solutions from 360

Rare Earth Juejin Tech Community

Sep 23, 2025 · Artificial Intelligence

How Front‑End Developers Can Build Powerful AI Agents with LangChain.js

This article guides front‑end developers through the evolution of AI agents—from early chatbots to modern multimodal agents—covering LLM fundamentals, prompt engineering, LangChain.js workflow creation, Retrieval‑Augmented Generation, model context protocols, and future multi‑agent technologies.

AI AgentFrontend DevelopmentLLM

0 likes · 11 min read

How Front‑End Developers Can Build Powerful AI Agents with LangChain.js

Bighead's Algorithm Notes

Sep 21, 2025 · Artificial Intelligence

FinKario: Event‑Enhanced Financial Knowledge Graphs Boost A‑Share Sharpe Ratio to 4.9

This article reviews the FinKario paper, which introduces an event‑augmented financial knowledge graph and a two‑stage RAG retrieval strategy that together enable real‑time knowledge updates and efficient integration of long‑form research reports, yielding a Sharpe ratio of 4.9 and outperforming baseline LLMs and institutional strategies in back‑testing.

FinKarioLLMRAG

0 likes · 10 min read

FinKario: Event‑Enhanced Financial Knowledge Graphs Boost A‑Share Sharpe Ratio to 4.9

DataFunTalk

Sep 20, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

This article examines Tencent's large language model deployments across content generation, intelligent customer service, and role‑playing, detailing the underlying SFT, Retrieval‑Augmented Generation, GraphRAG, and Agent technologies that enable smarter, more reliable AI solutions.

AgentKnowledge GraphLarge Language Model

0 likes · 4 min read

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

DataFunSummit

Sep 19, 2025 · Artificial Intelligence

How Tencent Leverages LLMs: RAG, GraphRAG, and Agents in Real‑World Apps

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑play, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and intelligent agents—that enable these applications.

AIAgentLLM

0 likes · 4 min read

How Tencent Leverages LLMs: RAG, GraphRAG, and Agents in Real‑World Apps

Wu Shixiong's Large Model Academy

Sep 18, 2025 · Artificial Intelligence

How to Diagnose and Optimize RAG Systems When 30% Answers Miss the Mark

This guide explains why RAG systems often produce off‑topic answers, outlines how to measure hit‑rate, retrieval, reranking and generation metrics, provides step‑by‑step evaluation pipelines, code examples, real‑world case studies, and interview‑ready templates for diagnosing and optimizing each stage of the pipeline.

AIRAGgeneration

0 likes · 18 min read

How to Diagnose and Optimize RAG Systems When 30% Answers Miss the Mark

Data STUDIO

Sep 18, 2025 · Artificial Intelligence

Build a RAG App from Scratch: Master Text Chunking, Vector Retrieval, and Coreference Resolution

This tutorial walks through building a Retrieval‑Augmented Generation (RAG) system from the ground up, covering document parsing, text chunking strategies, vector store creation with ChromaDB, semantic search, prompt engineering for LLMs, conversation memory, coreference handling, and practical optimization tips, all illustrated with complete Python code.

ChromaDBPythonRAG

0 likes · 19 min read

Build a RAG App from Scratch: Master Text Chunking, Vector Retrieval, and Coreference Resolution

Zhuanzhuan Tech

Sep 17, 2025 · Artificial Intelligence

LLM‑Powered Intent Understanding, RAG QA, and Knowledge Base Maintenance for Recycling

This article details how Zhuanzhuan leverages large language models to enhance on‑site device inspection through a three‑stage pipeline—intent understanding, retrieval‑augmented generation QA, and automated knowledge‑base upkeep—highlighting technical innovations, workflow integration, and the resulting operational benefits.

AIIntent UnderstandingKnowledge Base

0 likes · 14 min read

LLM‑Powered Intent Understanding, RAG QA, and Knowledge Base Maintenance for Recycling

DataFunSummit

Sep 17, 2025 · Artificial Intelligence

How Tencent’s Large Language Model Powers Real-World AI Applications

This article explores Tencent’s large language model across diverse business scenarios—content generation, intelligent customer service, role‑playing, and more—detailing the principles and practical uses of Retrieval‑Augmented Generation (RAG), GraphRAG, and Agent technologies, and how they enhance model intelligence and user experience.

AIAgentKnowledge Graph

0 likes · 4 min read

How Tencent’s Large Language Model Powers Real-World AI Applications

Architecture & Thinking

Sep 17, 2025 · Artificial Intelligence

How the 32B ‘Zhiyu’ Model is Revolutionizing Intelligent Operations

The Zhiyu model, a 32‑billion‑parameter SRE‑focused LLM, combines extensive domain knowledge, enhanced professional skills, and deterministic RAG to deliver precise, actionable insights for intelligent operations, backed by a robust multi‑source training pipeline, staged training, and flexible deployment options.

AI OperationsRAGSRE

0 likes · 7 min read

How the 32B ‘Zhiyu’ Model is Revolutionizing Intelligent Operations

Lao Guo's Learning Space

Sep 16, 2025 · Artificial Intelligence

Why RAG Rewrites User Prompts Before Vector Similarity Retrieval

Rewriting user prompts before vector similarity retrieval enhances semantic clarity, aligns language with document style, eliminates ambiguity, and guides the embedding model to focus on relevant information, thereby improving retrieval effectiveness.

AIRAGVector Retrieval

0 likes · 1 min read

Why RAG Rewrites User Prompts Before Vector Similarity Retrieval

DataFunTalk

Sep 15, 2025 · Artificial Intelligence

How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap

In an interview, Di Xingxing of Autohome details their AI+Data framework—unified lake‑warehouse, intelligent engine, and agent services—that breaks data silos, blends traditional models with LLMs, leverages causal inference and RAG knowledge bases, and uses continuous feedback to build explainable, evolving data agents for accurate sales forecasting, competitive analysis, and end‑to‑end business automation in the automotive industry.

AIAutomotiveData Engineering

0 likes · 10 min read

How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap

AI Cyberspace

Sep 15, 2025 · Artificial Intelligence

What Is Agentic AI? From LLM Limits to Autonomous AI Agents

Agentic AI transforms static large language models into autonomous agents by adding perception, goal orientation, planning, action, interaction, and iterative loops, tracing its evolution from early chatbots through Prompt Engineering, ReAct, AutoGPT, OpenAI Function Calling, to modern multi‑agent frameworks, while addressing challenges like memory, hallucinations, and scalability.

RAGReActagentic AI

0 likes · 38 min read

What Is Agentic AI? From LLM Limits to Autonomous AI Agents

DataFunSummit

Sep 14, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents

This article examines Tencent's large language model deployments across various business scenarios, detailing the use of Retrieval‑Augmented Generation, GraphRAG for role‑playing, and Agent technologies, while also outlining core application areas and the three main technical approaches—SFT, RAG, and Agents.

AI agentsAI applicationsGraphRAG

0 likes · 4 min read

How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents

Architect's Alchemy Furnace

Sep 13, 2025 · Artificial Intelligence

Choosing the Right Path for RAG Development: Low‑Code Platforms vs Open‑Source Frameworks

This article compares low‑code development platforms with open‑source large‑model frameworks such as LangChain and LlamaIndex, outlining their features, advantages, limitations, and suitability for building retrieval‑augmented generation (RAG) applications in various enterprise scenarios.

AI developmentLangChainLlamaIndex

0 likes · 13 min read

Choosing the Right Path for RAG Development: Low‑Code Platforms vs Open‑Source Frameworks

Fun with Large Models

Sep 12, 2025 · Artificial Intelligence

When to Choose Model Fine‑Tuning vs RAG for Large‑Model Engineering Interviews

The article explains the technical background and suitable scenarios for Retrieval‑Augmented Generation (RAG) and model fine‑tuning, compares their strengths, discusses how they can be combined, and provides interview‑style Q&A on their capabilities, risks, and differences from model distillation.

AI InterviewFine‑TuningLarge Language Models

0 likes · 7 min read

When to Choose Model Fine‑Tuning vs RAG for Large‑Model Engineering Interviews

Data Party THU

Sep 11, 2025 · Artificial Intelligence

How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores

ComRAG tackles the static‑knowledge gaps, uneven QA quality, and storage explosion of community question‑answer platforms by integrating a static documentation vector store with dual dynamic CQA stores managed via a centroid‑based memory, delivering higher accuracy, lower latency, and scalable storage for industrial retrieval‑augmented generation.

Artificial IntelligenceCommunity QADynamic Retrieval

0 likes · 7 min read

How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores

Instant Consumer Technology Team

Sep 11, 2025 · Artificial Intelligence

How REFRAG Cuts LLM Decoding Time by 30×: A New Efficient RAG Framework

REFRAG (REpresentation For RAG) introduces a novel decoding framework that compresses, senses, and expands context using precomputed chunk embeddings, achieving up to 30.85× faster first-token generation and 16× larger context windows without sacrificing perplexity, as validated across diverse long‑context tasks.

LLMRAGchunk embeddings

0 likes · 18 min read

How REFRAG Cuts LLM Decoding Time by 30×: A New Efficient RAG Framework

Continuous Delivery 2.0

Sep 11, 2025 · Artificial Intelligence

Building Scalable Enterprise RAG: Lessons, Pitfalls, and Proven Solutions

This article shares practical lessons from building a large‑scale enterprise RAG system, covering imperfect data, document quality scoring, hierarchical chunking, metadata design, semantic‑search failures, open‑source model choices, and table handling to achieve reliable AI‑driven search.

Enterprise AIOpen-source modelsRAG

0 likes · 13 min read

Building Scalable Enterprise RAG: Lessons, Pitfalls, and Proven Solutions

AsiaInfo Technology: New Tech Exploration

Sep 11, 2025 · Artificial Intelligence

How AST Boosts LLM‑Powered Code Question Answering: Theory, Practice, and Future Directions

This article explores how abstract syntax trees (AST) can enrich large language model (LLM) based code question‑answering by providing precise structural context, detailing LLM strengths and limits, describing AST‑LLM collaboration, RAG integration, cutting‑edge models, practical tooling, challenges, standardisation efforts, and future research avenues.

ASTLLMRAG

0 likes · 30 min read

How AST Boosts LLM‑Powered Code Question Answering: Theory, Practice, and Future Directions

Architect's Alchemy Furnace

Sep 10, 2025 · Artificial Intelligence

Mastering RAG: Classic Architecture, Challenges, and Evolution Explained

This article outlines the fundamental RAG workflow—from data indexing and querying to advanced modular designs—highlights key challenges such as retrieval accuracy, model robustness, context limits, and performance, and traces the evolution from naive to modular RAG systems.

AI ArchitectureRAGretrieval-augmented-generation

0 likes · 12 min read

Mastering RAG: Classic Architecture, Challenges, and Evolution Explained

DaTaobao Tech

Sep 10, 2025 · Frontend Development

How AI Cut Front‑End Development Time by 60% in Alibaba’s Giraffe Search

This article details how the author transformed a constrained Weex/Muise front‑end project for the “giraffe” search page into an AI‑driven workflow, building a structured knowledge base, defining project‑level rules, and using RAG techniques to accelerate component, tracking, and payment integration, ultimately reducing development time by 60% and proposing a new “AI programming as context engineering” paradigm.

AIFrontend DevelopmentKnowledge Base

0 likes · 14 min read

How AI Cut Front‑End Development Time by 60% in Alibaba’s Giraffe Search

DataFunTalk

Sep 10, 2025 · Artificial Intelligence

Why RAG is Evolving: From Retrieval to Integrated Reasoning, Memory, and Multimodal AI

This article explores how Retrieval‑Augmented Generation (RAG) is transitioning from basic retrieve‑and‑generate pipelines to a unified architecture that incorporates reasoning chains, agent layers, knowledge graphs, Monte‑Carlo Tree Search, reinforcement learning, sophisticated memory management, and multimodal tensor‑based retrieval, while addressing engineering challenges such as storage expansion, re‑ranking, and index dimensionality.

AI reasoningRAGRetrieval-Augmented Generation

0 likes · 19 min read

Why RAG is Evolving: From Retrieval to Integrated Reasoning, Memory, and Multimodal AI

Architecture Breakthrough

Sep 7, 2025 · Industry Insights

Why Arrogance Blocks You From Riding the AI Wave—and How to Overcome It

The article argues that arrogance, not lack of knowledge, hinders individuals from seizing AI opportunities, outlines four psychological barriers—unseen, undervalued, incomprehensible, and too late—and provides practical steps such as prompt engineering, RAG, fine‑tuning, and AI agents to actively engage with the AI wave.

AIIndustry InsightsPrompt Engineering

0 likes · 11 min read

Why Arrogance Blocks You From Riding the AI Wave—and How to Overcome It

Data Party THU

Sep 5, 2025 · Artificial Intelligence

What a PRISMA Review Uncovers About Retrieval‑Augmented Generation (RAG)

This systematic PRISMA review analyzes 128 highly‑cited RAG papers, covering five major databases, 343 datasets, a detailed technical roadmap, evaluation metrics from EM to LLM‑as‑Judge, and future research directions, showing that RAG has evolved into a complex, programmable, and auditable distributed system.

AIRAGSystematic Review

0 likes · 5 min read

What a PRISMA Review Uncovers About Retrieval‑Augmented Generation (RAG)

Instant Consumer Technology Team

Sep 5, 2025 · Artificial Intelligence

How Context Engineering Transforms Dify Agents: Boost Efficiency by 10×

This article explains how Context Engineering (CE) extends Prompt Engineering by integrating seven core elements—system prompts, user input, short‑term memory, long‑term memory, retrieval, tools, and structured output—using the open‑source Dify platform to build dynamic, multimodal agents that cut inference costs tenfold and raise complex‑task success rates by 40%.

AI Agent DevelopmentDifyLLM

0 likes · 16 min read

How Context Engineering Transforms Dify Agents: Boost Efficiency by 10×

Liangxu Linux

Sep 4, 2025 · Artificial Intelligence

Boost Your Knowledge Base with RAGFlow – Open‑Source RAG Engine with 60K Stars

RAGFlow is an open‑source Retrieval‑Augmented Generation engine that lets large language models query diverse internal documents, provides source citations, supports many file formats, and can be quickly deployed via Docker following a step‑by‑step guide.

AIDeploymentKnowledge Base

0 likes · 5 min read

Boost Your Knowledge Base with RAGFlow – Open‑Source RAG Engine with 60K Stars

DataFunSummit

Sep 4, 2025 · Artificial Intelligence

Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation

This article explores the evolving search demands of the intelligent era, explains dense and sparse vector concepts, details Elasticsearch's vector search capabilities and recent performance breakthroughs, introduces hybrid and relevance‑tuning techniques, and demonstrates RAG principles and real‑world enterprise use cases.

AIElasticsearchHybrid Search

0 likes · 14 min read

Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation

Data Party THU

Sep 3, 2025 · Artificial Intelligence

Unlocking Large Model Secrets: Transformers, MoE, Fine‑Tuning, RAG & KV Caching

This article provides a comprehensive technical overview of today’s large‑model ecosystem, covering the Transformer architecture, Mixture‑of‑Experts extensions, five fine‑tuning methods, the evolution from traditional RAG to agentic RAG, classic agent design patterns, diverse text‑chunking strategies, and the KV‑cache optimization that accelerates inference.

Fine‑tuningKV CacheMixture of Experts

0 likes · 13 min read

Unlocking Large Model Secrets: Transformers, MoE, Fine‑Tuning, RAG & KV Caching

Efficient Ops

Sep 2, 2025 · Artificial Intelligence

How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations

At the 27th GOPS Global Operations Conference in Shanghai (Oct 17‑18, 2025), Professor Wang Peng of Fudan University will reveal how large language models can extract and structure heterogeneous operational data into high‑quality knowledge bases, and how RAG‑driven Q&A enhances fault diagnosis, SOP generation, and automated decision‑making.

Artificial IntelligenceIntelligent OperationsKnowledge Base

0 likes · 3 min read

How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations

AI2ML AI to Machine Learning

Sep 2, 2025 · Artificial Intelligence

Why Enterprise Large‑Model Digitalization Is So Hard: Key Challenges and Capabilities

The article analyzes why enterprise‑wide large‑model AI projects face steep hurdles, outlining required human capabilities, historical labor shifts, current hot technologies such as RAG, Agent, CoT and multimodal, their limits, a three‑stage implementation roadmap, typical case pitfalls, and the key success factors for sustainable digital transformation.

AgentCoTEnterprise AI

0 likes · 15 min read

Why Enterprise Large‑Model Digitalization Is So Hard: Key Challenges and Capabilities

Instant Consumer Technology Team

Sep 2, 2025 · Artificial Intelligence

Why RAG Is Dead: Jeff Huber’s 5 Retrieval Secrets and Context Engineering

Jeff Huber, founder of Chroma, argues that traditional RAG is obsolete, introduces context engineering as the new paradigm, and shares five practical retrieval strategies, a complete pipeline, and insights on handling context rot, memory, and generative benchmarking to build production‑grade AI applications.

AIContext EngineeringGenerative Benchmarking

0 likes · 11 min read

Why RAG Is Dead: Jeff Huber’s 5 Retrieval Secrets and Context Engineering

Alibaba Cloud Developer

Sep 1, 2025 · Artificial Intelligence

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

This article delves into the implementation details and optimization strategies of Retrieval‑Augmented Generation (RAG), covering document chunking, index enhancement, embedding, hybrid search, and re‑ranking, and provides practical code examples to help developers move from quick deployment to deep performance tuning.

AIChunkingEmbedding

0 likes · 19 min read

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

Data Thinking Notes

Aug 31, 2025 · Artificial Intelligence

Embedding's Role in Retrieval‑Augmented Generation: Basics, Challenges & Future

This article explains how embedding technology converts unstructured data into vector representations, powers precise retrieval in Retrieval‑Augmented Generation (RAG), outlines the evolution of embedding models, discusses current challenges such as long‑text handling and domain adaptation, and highlights emerging solutions.

AIEmbeddingRAG

0 likes · 12 min read

Embedding's Role in Retrieval‑Augmented Generation: Basics, Challenges & Future

Tencent Technical Engineering

Aug 29, 2025 · Artificial Intelligence

How Retrieval‑Augmented Generation Evolves into Autonomous AI Agents

This article examines the limitations of large language models' internal knowledge, explains how retrieval‑augmented generation (RAG) and tool‑augmented generation address these limits, and traces the evolution from simple retrieve‑then‑generate pipelines to autonomous, multi‑modal AI agents.

AI agentsLLMRAG

0 likes · 20 min read

How Retrieval‑Augmented Generation Evolves into Autonomous AI Agents

Xiaolei Talks DB

Aug 28, 2025 · Databases

How AI Is Transforming Databases: Highlights from China’s DTCC2025

At DTCC2025 in Beijing, industry leaders showcased AI-driven innovations, vector database advances, RAG techniques, and distributed database performance breakthroughs, illustrating how databases are evolving from passive data stores into intelligent, autonomous systems that boost efficiency, scalability, and business value across sectors.

AIDistributed SystemsRAG

0 likes · 10 min read

How AI Is Transforming Databases: Highlights from China’s DTCC2025

Data Thinking Notes

Aug 26, 2025 · Artificial Intelligence

From Prompt to Context: How AI Agents Evolve into Proactive Intelligence

This article explores the rapid growth of large language models and explains how AI agents transform passive, single‑turn responses into proactive, continuous intelligence by leveraging a core “Prompt→Context→Action” loop, detailing their architecture, key components, challenges, and future directions.

AI AgentContext ManagementLLM architecture

0 likes · 20 min read

From Prompt to Context: How AI Agents Evolve into Proactive Intelligence

Tech Freedom Circle

Aug 26, 2025 · Artificial Intelligence

How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained

This article provides a step‑by‑step technical guide to optimizing Retrieval‑Augmented Generation (RAG) for interview scenarios, covering query rewriting, HyDE, fallback strategies, routing and prompt routing, multi‑representation indexing, hybrid retrieval, re‑ranking, self‑RAG, generation control, performance benchmarking, and a practical checklist with concrete code examples and metrics.

AI InterviewHybrid RetrievalIndex Optimization

0 likes · 30 min read

How to Optimize RAG for Alibaba Interviews? 7 Golden Rules Explained

Alibaba Cloud Native

Aug 26, 2025 · Artificial Intelligence

Boost Dify’s RAG Performance with Higress AI Gateway: Two Integration Strategies

This guide explains how to overcome Dify's built‑in RAG limitations by using Higress AI Gateway to connect external RAG services, detailing two integration patterns—RAG Retrieval Agent and Automatic Retrieval Injection—along with step‑by‑step configuration, validation, and the resulting benefits for enterprise AI applications.

DifyKnowledge retrievalRAG

0 likes · 13 min read

Boost Dify’s RAG Performance with Higress AI Gateway: Two Integration Strategies

DataFunSummit

Aug 25, 2025 · Artificial Intelligence

Building Xiaomi’s Vertical Domain QA Agent: From RAG to Real‑World Deployment

This article explains how Xiaomi designed and deployed a vertical‑domain question‑answering assistant for product and car queries, covering business background, a four‑module RAG‑plus‑LLM architecture, knowledge‑base construction, custom chunking strategies, dynamic signal handling, and the challenges overcome to achieve reliable real‑time voice interactions.

Agent ArchitectureLLMRAG

0 likes · 22 min read

Building Xiaomi’s Vertical Domain QA Agent: From RAG to Real‑World Deployment

DaTaobao Tech

Aug 25, 2025 · Artificial Intelligence

Mastering RAG: From Quick Start to Deep Optimization Strategies

This article dives into the practical implementation of Retrieval‑Augmented Generation (RAG), covering document chunking, semantic and reverse HyDE indexing, embedding, hybrid search, and re‑ranking techniques, and provides concrete code examples and optimization tips for building high‑performance AI applications.

Artificial IntelligenceChunkingEmbedding

0 likes · 18 min read

Mastering RAG: From Quick Start to Deep Optimization Strategies

Fun with Large Models

Aug 22, 2025 · Artificial Intelligence

Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek

This tutorial shows how to create a lightweight Retrieval‑Augmented Generation (RAG) system that indexes multiple PDF files, stores their embeddings in a FAISS vector database, and answers user queries through a LangChain agent powered by DashScope embeddings and the DeepSeek‑Chat model, all wrapped in a Streamlit UI.

DashScopeDeepSeekFAISS

0 likes · 13 min read

Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek

Volcano Engine Developer Services

Aug 21, 2025 · Artificial Intelligence

Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG

Since last year, the debate over “Prompt Engineering” has split between practitioners who favor “Context Engineering” for building scalable agent systems and scholars who treat Prompt Engineering as a broad umbrella term, highlighting the need to dynamically construct and manage context for reliable, extensible AI applications.

AI agentsLLMPrompt Engineering

0 likes · 33 min read

Why Prompt Engineering Isn’t Enough: The Rise of Context Engineering and RAG

Alibaba Cloud Developer

Aug 21, 2025 · Artificial Intelligence

Why Your AI Defect Deduplication Returns Mixed Data and How to Fix It

This article details the challenges of building an AI‑powered defect deduplication system using Retrieval‑Augmented Generation, explains why LLMs produce composite (spliced) results, diagnoses the root cause as information loss in the RAG pipeline, and presents a step‑by‑step solution that restores atomicity of records for reliable duplicate detection.

AI debuggingKnowledge BaseLLM

0 likes · 14 min read

Why Your AI Defect Deduplication Returns Mixed Data and How to Fix It

JD Retail Technology

Aug 20, 2025 · Artificial Intelligence

Launch Multi-Agent AI Systems with OxyGent in Just 20 Lines of Code

Learn how to quickly set up OxyGent, a flexible AI agent framework, by installing Python, Node.js, and the MCP tools, configuring environment variables, and using just 20 lines of code to build, debug, and deploy multi‑agent applications with features like RAG, tool integration, and distributed execution.

DeploymentMCPOxyGent

0 likes · 5 min read

Launch Multi-Agent AI Systems with OxyGent in Just 20 Lines of Code

Alibaba Cloud Big Data AI Platform

Aug 20, 2025 · Artificial Intelligence

How DeepSearch Elevates RAG: From RAG 1.0 to a Multi‑Agent AI Search Engine

This article explains how Alibaba Cloud OpenSearch LLM version evolved from RAG 1.0 to RAG 2.0, introducing the DeepSearch multi‑agent architecture that combines offline data processing, online query handling, planning, clarification, search, and summarization agents to deliver more accurate and complex AI‑driven answers.

AI SearchDeepSearchLLM

0 likes · 10 min read

How DeepSearch Elevates RAG: From RAG 1.0 to a Multi‑Agent AI Search Engine

Instant Consumer Technology Team

Aug 20, 2025 · Backend Development

How I Built a Production‑Ready RAG Service in 3 Weeks Using AI Coding Tools

In just three weeks, I single‑handedly created a production‑grade Retrieval‑Augmented Generation (RAG) API with FastAPI, leveraging Cursor and Claude Code to automate coding, testing, and deployment, and I share practical insights on AI‑assisted development, high cohesion‑low coupling design, TDD, git worktree parallelism, and agent orchestration.

AI codingFastAPIGit Worktree

0 likes · 19 min read

How I Built a Production‑Ready RAG Service in 3 Weeks Using AI Coding Tools

Instant Consumer Technology Team

Aug 19, 2025 · Artificial Intelligence

Mastering Document Chunking for RAG: Strategies, Code & Best Practices

This article explores why proper document chunking is crucial for Retrieval‑Augmented Generation, explains core concepts like context windows and signal‑to‑noise, compares various chunking strategies—from simple fixed‑size splits to semantic and hybrid approaches—and provides practical Python code examples to help you build more effective RAG pipelines.

LLMRAGText Splitting

0 likes · 24 min read

Mastering Document Chunking for RAG: Strategies, Code & Best Practices