Tagged articles
924 articles
Page 6 of 10
Data Thinking Notes
Data Thinking Notes
Aug 17, 2025 · Artificial Intelligence

Unlocking AI Agents: From Basics to Real-World Development

This article provides a comprehensive overview of AI Agents, covering their fundamental concepts, core features, technical evolution, work cycle, architectural modules, key technologies such as prompt engineering and RAG, practical development steps, a data‑analysis agent case study, and typical industry applications.

AI AgentAgent ArchitectureArtificial Intelligence
0 likes · 13 min read
Unlocking AI Agents: From Basics to Real-World Development
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 15, 2025 · Artificial Intelligence

Mastering AI Agents: Prompt Engineering, Workflows, and RAG Strategies

This article systematically explains how to build reliable, high‑performance AI agents by focusing on the core components—LLM, prompts, workflows, RAG, and tools—while covering prompt engineering techniques, DSL‑based workflow design, vector‑database knowledge bases, security against prompt injection, and practical project planning.

AI AgentLLMRAG
0 likes · 15 min read
Mastering AI Agents: Prompt Engineering, Workflows, and RAG Strategies
Tencent Technical Engineering
Tencent Technical Engineering
Aug 14, 2025 · Artificial Intelligence

Why Do Large Language Models Hallucinate? Causes, Risks, and Multi‑Dimensional Solutions

This article systematically examines the root causes of hallucinations in large language models, evaluates their pros and cons, and presents a comprehensive set of optimization techniques—including prompt engineering, RAG, sampling tweaks, supervised fine‑tuning, LoRA, RLHF, chain‑of‑thought reasoning, and agent/workflow designs—to build more reliable and trustworthy AI applications.

AILLMLoRA
0 likes · 29 min read
Why Do Large Language Models Hallucinate? Causes, Risks, and Multi‑Dimensional Solutions
DaTaobao Tech
DaTaobao Tech
Aug 13, 2025 · Artificial Intelligence

Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques

This article explores the emerging field of prompt engineering, detailing its fundamentals, advanced strategies such as chain‑of‑thought, ReAct, and structured frameworks, and demonstrates practical applications in AI agents for data retrieval, SQL generation, and market insight, offering actionable guidance for developers and business users alike.

AI agentsData RetrievalRAG
0 likes · 42 min read
Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 11, 2025 · Artificial Intelligence

How Fine‑Tuning Large Models Solves Code Upgrade Challenges and Boosts Stable Module Matching

This article details an innovative approach that uses large‑model supervised fine‑tuning to overcome the instability of code RAG and code agents during open‑source repository upgrades, addressing domain‑specific terminology, code style differences, and improving recall, accuracy, and deployment efficiency.

AI agentsLLMRAG
0 likes · 11 min read
How Fine‑Tuning Large Models Solves Code Upgrade Challenges and Boosts Stable Module Matching
AI Large Model Application Practice
AI Large Model Application Practice
Aug 11, 2025 · Artificial Intelligence

How to Build an LLM-Powered Smart Resume Screening System

This article presents a detailed design and implementation of an LLM‑based intelligent resume matching system that combines semantic vector retrieval, structured rule filtering, multi‑dimensional weighted scoring, and natural‑language interaction to create a fast, quantifiable, and explainable hiring pipeline.

AI RecruitmentLLMRAG
0 likes · 18 min read
How to Build an LLM-Powered Smart Resume Screening System
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 8, 2025 · Artificial Intelligence

Can GitOps Power Low‑Cost LLM Agents? A Hands‑On Exploration

This article examines how the Manus sandbox and CodeAct mechanisms inspire a GitOps‑based approach to building LLM agents, detailing the design of planner and executor components, workflow steps, advantages such as RAG and observability, and the potential for low‑cost, scalable intelligent agent development.

AI agentsGitOpsIntelligent agents
0 likes · 12 min read
Can GitOps Power Low‑Cost LLM Agents? A Hands‑On Exploration
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 8, 2025 · Artificial Intelligence

Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design

This article summarizes a Cloud Summit talk where Alibaba Cloud’s AI expert Zhang Yingying explains how large language models can enhance big‑data intelligent operations, covering opportunities, challenges, RAG‑based Q&A, multi‑agent diagnostics, and the engineering architecture needed for reliable, scalable deployment.

AI EngineeringBig Data OperationsLarge Language Models
0 likes · 20 min read
Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 8, 2025 · Artificial Intelligence

What Von Neumann’s Brain Theory Reveals About Prompt Engineering for LLMs

The article explores how Von Neumann’s insights on the brain‑computer analogy illuminate modern large‑language‑model prompt engineering, comparing logical reasoning chains, memory mechanisms, and DSL‑driven computation to improve accuracy, reduce hallucinations, and balance reasoning depth with precise calculation.

DSLLarge Language ModelsPrompt Engineering
0 likes · 14 min read
What Von Neumann’s Brain Theory Reveals About Prompt Engineering for LLMs
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Aug 8, 2025 · Industry Insights

How CodeRAG Reinvents Large‑Scale Code Repository Knowledge Extraction and Hierarchical Retrieval

CodeRAG leverages AST‑centric parsing and a hierarchical knowledge graph to overcome text‑only retrieval limits in large code repositories, offering multi‑language analysis, incremental parsing, hybrid indexing, and intelligent context selection for tasks such as code completion, Q&A, documentation generation, and impact analysis.

ASTCodeRAGLarge-Scale Repos
0 likes · 15 min read
How CodeRAG Reinvents Large‑Scale Code Repository Knowledge Extraction and Hierarchical Retrieval
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 5, 2025 · Databases

How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine

This article explains how PolarDB IMCI integrates vector indexing and embedding directly into the database kernel, offering a unified, transactional, and real‑time vector lifecycle management service that lets developers build RAG knowledge bases and AI applications using only standard SQL, dramatically reducing development and operational complexity.

AIPolardbRAG
0 likes · 11 min read
How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 5, 2025 · Artificial Intelligence

Mastering Intent Detection & Slot Filling: Proven Strategies and Code Samples

This article shares reusable AI development techniques for intent detection and slot filling, comparing four solution tiers—from simple prompt engineering to advanced RAG‑enhanced architectures—complete with practical code snippets, performance trade‑offs, and guidance on selecting the optimal approach for reliable conversational agents.

Intent DetectionNLUPrompt Engineering
0 likes · 27 min read
Mastering Intent Detection & Slot Filling: Proven Strategies and Code Samples
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Aug 4, 2025 · Artificial Intelligence

How RAG and Long‑Term Memory Turn AI into a Truly Remembering Assistant

This article explains how Retrieval‑Augmented Generation (RAG) and long‑term memory systems like MenoBase enable large language models to overcome short‑term memory limits, dynamically retrieve up‑to‑date knowledge, and personalize interactions, with practical Dify implementation steps and real‑world use cases across industries.

AIDifyKnowledge Base
0 likes · 18 min read
How RAG and Long‑Term Memory Turn AI into a Truly Remembering Assistant
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Jul 30, 2025 · Artificial Intelligence

How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls

This article analyzes the prompt‑inflation bottleneck that arises when large language models (LLMs) must handle thousands of Model Context Protocol (MCP) services, and introduces the MCP‑RAG architecture—a retrieval‑augmented generation solution that builds a metadata knowledge base and intelligent retrieval layer to enable precise, efficient MCP service discovery at scale.

AILLMMCP
0 likes · 21 min read
How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls
Ops Development Stories
Ops Development Stories
Jul 29, 2025 · Artificial Intelligence

Master AI Agents with LangGraph: Build Adaptive RAG, Translation, and ReAct Agents

This comprehensive guide explains what an AI Agent is, its core capabilities and design patterns, and walks through step‑by‑step implementations of RAG, Translation, and ReAct agents using LangGraph, complete with code samples, workflow diagrams, and practical tips for building personal ops knowledge‑base agents.

LLMLangGraphRAG
0 likes · 64 min read
Master AI Agents with LangGraph: Build Adaptive RAG, Translation, and ReAct Agents
SF Technology Team
SF Technology Team
Jul 29, 2025 · Artificial Intelligence

How SF Tech’s Proprietary Large Models Revolutionize Logistics and AI Operations

The DA Data Intelligence Conference in Shenzhen showcased SF Tech’s breakthroughs in large‑model AI, revealing how their proprietary multimodal models, RAG innovations, and agent platforms dramatically improve logistics decision‑making, resource scheduling, and customer service across multiple industries.

AI OperationsAgent PlatformLarge Models
0 likes · 11 min read
How SF Tech’s Proprietary Large Models Revolutionize Logistics and AI Operations
Architecture and Beyond
Architecture and Beyond
Jul 27, 2025 · Artificial Intelligence

What Makes an AI Agent Tick? From Expert Systems to Modern Architectures

This article traces the evolution of AI agents from early expert systems to today’s multimodal, memory‑rich agents, explains their perception, reasoning, memory and action modules, discusses model selection, prompt engineering, RAG techniques, and highlights current limitations such as hallucinations, reliability, cost, and security.

AI AgentFunction CallingLarge Language Model
0 likes · 28 min read
What Makes an AI Agent Tick? From Expert Systems to Modern Architectures
JD Tech Talk
JD Tech Talk
Jul 23, 2025 · Artificial Intelligence

Causal Inference + LLMs: Transforming E‑Commerce Pricing Strategies

This article describes how integrating causal inference with large language models and Retrieval‑Augmented Generation can automate and explain e‑commerce product pricing, detailing the three‑step workflow, reinforcement‑learning rewards, experimental results, and future directions for end‑to‑end RAG‑LLM training.

RAGcausal inferencee‑commerce pricing
0 likes · 15 min read
Causal Inference + LLMs: Transforming E‑Commerce Pricing Strategies
Zhuanzhuan Tech
Zhuanzhuan Tech
Jul 23, 2025 · Artificial Intelligence

Why AI‑Generated Code Often Misses the Mark and How a Code Knowledge Base Fixes It

AI‑generated code frequently fails to match project conventions due to lack of contextual memory, but building a dynamic code knowledge base combined with Retrieval‑Augmented Generation (RAG) enables precise, compliant code output, reduces errors, accelerates development, and transforms AI into a project‑specific assistant.

AIKnowledge BaseRAG
0 likes · 13 min read
Why AI‑Generated Code Often Misses the Mark and How a Code Knowledge Base Fixes It
Tencent Cloud Developer
Tencent Cloud Developer
Jul 23, 2025 · Artificial Intelligence

Why Retrieval‑Augmented Generation Is Evolving Into Agentic AI Search

This article explains how the inherent knowledge limits of large language models drive the rise of Retrieval‑Augmented Generation (RAG), outlines its three evolutionary stages, introduces Agentic RAG and DeepSearch, and discusses the knowledge and ability boundaries that shape future AI search systems.

AI SearchDeepSearchKnowledge retrieval
0 likes · 19 min read
Why Retrieval‑Augmented Generation Is Evolving Into Agentic AI Search
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Jul 18, 2025 · Artificial Intelligence

Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)

The video walks through the differences between traditional keyword search and vector search, explains the core concept of Retrieval‑Augmented Generation, and demonstrates how to construct a knowledge‑base Q&A system using a large language model integrated with Elasticsearch.

ElasticsearchKnowledge BaseLarge Language Model
0 likes · 1 min read
Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)
DataFunSummit
DataFunSummit
Jul 16, 2025 · Artificial Intelligence

How Tencent Cloud ES Powers RAG with Hybrid Search and Massive Vector Optimizations

This article explores how Tencent Cloud Elasticsearch combines decades of text search expertise with cutting‑edge vector retrieval and large language models to deliver a one‑stop Retrieval‑Augmented Generation solution, detailing the underlying models, hybrid search architecture, performance tricks, and real‑world case studies.

ElasticsearchHybrid SearchLLM
0 likes · 24 min read
How Tencent Cloud ES Powers RAG with Hybrid Search and Massive Vector Optimizations
DataFunSummit
DataFunSummit
Jul 15, 2025 · Artificial Intelligence

Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications

This article explains why traditional keyword search falls short, introduces Elasticsearch's vector search and hybrid retrieval capabilities, and shows how combining it with large language models enables Retrieval‑Augmented Generation (RAG) for more accurate, context‑aware AI-driven search across text and multimedia data.

AIElasticsearchLarge Language Models
0 likes · 5 min read
Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications
Tencent Cloud Developer
Tencent Cloud Developer
Jul 15, 2025 · Artificial Intelligence

How RAG Evolved: From Naive to Agentic – A Complete Guide

This article systematically outlines the evolution of Retrieval‑Augmented Generation (RAG) from its naive three‑step pipeline to advanced, modular, and agentic architectures, highlighting each generation's motivations, core features, advantages, drawbacks, and practical implementation details for large language model applications.

Agentic RAGArtificial IntelligenceLLM
0 likes · 20 min read
How RAG Evolved: From Naive to Agentic – A Complete Guide
Ops Development Stories
Ops Development Stories
Jul 14, 2025 · Artificial Intelligence

Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment

This comprehensive guide explores AIOps techniques such as prompt engineering, chat completions, memory management, function calling, fine‑tuning, retrieval‑augmented generation (RAG), graph‑based RAG, and practical steps for deploying open‑source large language models locally, providing code examples and best‑practice recommendations for modern DevOps environments.

Function CallingGraph RAGRAG
0 likes · 47 min read
Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment
Tencent Technical Engineering
Tencent Technical Engineering
Jul 14, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, and MCP: Core Concepts and How They Interact

This article provides a concise overview of the latest AI concepts—including AIGC, Retrieval‑Augmented Generation, Function‑Calling models, intelligent agents, and the Model Context Protocol—explaining their principles, differences, and how they can be combined to build more powerful AI applications for developers outside the AI field.

AIGCAgentFunction Calling
0 likes · 15 min read
Demystifying AIGC, Agents, and MCP: Core Concepts and How They Interact
DaTaobao Tech
DaTaobao Tech
Jul 14, 2025 · Artificial Intelligence

Mastering AI Application Modes: Embedding, Copilot, and Agents Explained

This article explores practical AI engineering strategies, detailing the three AI application modes—Embedding, Copilot, and Agents—along with prompt engineering, model selection, function calling, RAG, workflow design, and multi‑agent architectures to boost business efficiency and user experience.

AIPrompt EngineeringRAG
0 likes · 25 min read
Mastering AI Application Modes: Embedding, Copilot, and Agents Explained
Data Thinking Notes
Data Thinking Notes
Jul 13, 2025 · Artificial Intelligence

How to Build an Enterprise Knowledge Base with Dify: Full Setup Guide

This article walks developers through the entire process of deploying Dify locally, configuring model providers, creating and segmenting a knowledge base with RAG, choosing indexing methods, and integrating the knowledge base into a chatbot application, complete with code snippets and visual guides.

AI deploymentDifyKnowledge Base
0 likes · 11 min read
How to Build an Enterprise Knowledge Base with Dify: Full Setup Guide
Architecture and Beyond
Architecture and Beyond
Jul 12, 2025 · Artificial Intelligence

What Exactly Is an AI Agent? History, Architecture, and Future Challenges

This article traces the evolution of AI agents from early expert systems to modern large‑language‑model‑driven assistants, explains their core perception, reasoning, memory, and action modules, compares thinking and execution models, and discusses current limitations such as hallucinations, reliability, cost, and security.

AI AgentLarge Language ModelMemory Architecture
0 likes · 20 min read
What Exactly Is an AI Agent? History, Architecture, and Future Challenges
Architect
Architect
Jul 11, 2025 · Artificial Intelligence

How OpenAI’s Zero‑Vector Agentic RAG Redefines AI Knowledge Retrieval

OpenAI’s new non‑vectorized Agentic RAG approach replaces traditional vector search with a hierarchical, multi‑round content selection process, leveraging large‑context models like GPT‑4.1‑mini for efficient document loading, dynamic navigation, and accurate answer generation, while outlining model selection strategies, cost trade‑offs, and production considerations.

AI ArchitectureModel selectionRAG
0 likes · 15 min read
How OpenAI’s Zero‑Vector Agentic RAG Redefines AI Knowledge Retrieval
Sanyou's Java Diary
Sanyou's Java Diary
Jul 10, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, RAG, and MCP: Core AI Concepts Explained

This article provides a concise overview of the latest AI breakthroughs—including AIGC, multimodal technology, Retrieval‑Augmented Generation (RAG), intelligent agents with function‑calling models, and the Model Context Protocol (MCP)—explaining their principles, relationships, and practical implications for developers outside the AI field.

AIAIGCFunction Calling
0 likes · 16 min read
Demystifying AIGC, Agents, RAG, and MCP: Core AI Concepts Explained
DataFunSummit
DataFunSummit
Jul 10, 2025 · Artificial Intelligence

How Large Language Model AI Agents Transform Intelligent Operations and On‑Call Support

This article details the design and implementation of a large‑model‑driven intelligent operations dialogue system, covering intent recognition, routing, multi‑agent planning, RAG, workflow, ReAct, reflection, tree‑search techniques, evaluation challenges, and future multi‑agent collaboration for on‑call support.

AI agentsIntelligent OperationsRAG
0 likes · 23 min read
How Large Language Model AI Agents Transform Intelligent Operations and On‑Call Support
Tencent Cloud Developer
Tencent Cloud Developer
Jul 10, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, and MCP: Essential AI Concepts for Developers

This article provides a concise, developer‑focused overview of emerging AI concepts—including AIGC, multimodal models, Retrieval‑Augmented Generation, intelligent agents, Function‑Calling, and the Model Context Protocol (MCP)—explaining their core principles, differences, and how they interrelate to enable advanced AI applications.

AIAIGCAgent
0 likes · 16 min read
Demystifying AIGC, Agents, and MCP: Essential AI Concepts for Developers
JD Tech Talk
JD Tech Talk
Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Highway for New Developers

New developer Li Ming’s frustrating onboarding experience highlights hidden business rules, undocumented code, and poor knowledge transfer, prompting him to build an AI‑driven knowledge base that links code changes, requirements, and operational docs, ultimately streamlining troubleshooting, accelerating feature development, and improving knowledge retention across teams.

AILarge Language ModelRAG
0 likes · 18 min read
How AI Can Turn a Code Maze into a Knowledge Highway for New Developers
JD Cloud Developers
JD Cloud Developers
Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Hub for New Developers

This article follows a new developer named Li Ming as he confronts undocumented code, hidden business rules, and fragmented knowledge, then demonstrates how leveraging large‑language models to index, associate, and retrieve code, requirements, and operational data can create an intelligent knowledge base that streamlines onboarding, reduces errors, and enhances collaboration across development, testing, and product teams.

AIRAGsoftware development
0 likes · 19 min read
How AI Can Turn a Code Maze into a Knowledge Hub for New Developers
AI Algorithm Path
AI Algorithm Path
Jul 3, 2025 · Artificial Intelligence

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

This article examines how Retrieval‑Augmented Generation (RAG) has progressed from simple keyword‑based retrieval to advanced semantic methods, modular architectures, graph‑enhanced reasoning, and autonomous agentic systems, highlighting each approach's workflow, benefits, limitations, and the shift toward dynamic AI decision‑making.

AIAgentic RAGGraph RAG
0 likes · 7 min read
Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation
Instant Consumer Technology Team
Instant Consumer Technology Team
Jul 3, 2025 · Artificial Intelligence

Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises

Enterprises rushing to purchase DeepSeek AI appliances and smart‑agent platforms often face hidden technical, data, and organizational challenges that turn promised "plug‑and‑play" solutions into costly missteps, highlighting the need for realistic strategy, robust data governance, and continuous capability building.

AI capability buildingAI deploymentData Governance
0 likes · 28 min read
Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises
AI Large Model Application Practice
AI Large Model Application Practice
Jul 2, 2025 · Artificial Intelligence

Build a PPT‑Powered RAG Engine with Visual Models and MCP Server

This article explains how to construct a Retrieval‑Augmented Generation (RAG) pipeline for multi‑page PPT documents by converting slides to images, extracting content with a vision model, indexing with LlamaIndex and Chroma, and exposing the functionality through an MCP Server with tools for adding, querying, and managing PPTs.

LlamaIndexMCP ServerPPT
0 likes · 13 min read
Build a PPT‑Powered RAG Engine with Visual Models and MCP Server
Ops Development Stories
Ops Development Stories
Jul 1, 2025 · Artificial Intelligence

From Lean to AIOps: How AI is Transforming Modern Operations

This comprehensive guide walks through the evolution from Lean and Agile practices to DevOps and finally AIOps, explaining core concepts, key algorithms, the role of large language models, RAG‑based root‑cause analysis, and practical implementation steps for intelligent operations.

Large Language ModelsLeanRAG
0 likes · 19 min read
From Lean to AIOps: How AI is Transforming Modern Operations
DataFunTalk
DataFunTalk
Jun 29, 2025 · Artificial Intelligence

Large Models Boost Douyin User Experience: Expert Insights

In an interview at the DA Digital Intelligence Conference, ByteDance AI specialist Cai Conghuai explains how large language models, combined with techniques like SFT, DPO, and RAG, are reshaping Douyin's user‑experience signal detection, root‑cause analysis, and evaluation, while outlining future AI‑agent breakthroughs.

AIDPOLarge Language Models
0 likes · 12 min read
Large Models Boost Douyin User Experience: Expert Insights
Architect
Architect
Jun 28, 2025 · Artificial Intelligence

How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough

This article examines the MultiAgentPPT project, detailing its multi‑agent workflow, the four core agents that generate outlines, split topics, conduct research, and summarize results, and explains how the system retrieves data via a WeChat crawler and constructs prompts for LLM‑driven PPT creation.

AI agentsMultiAgentPPTPPT generation
0 likes · 6 min read
How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jun 27, 2025 · Artificial Intelligence

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

This guide walks you through using the PAI‑LangStudio platform together with the Qwen3 large language model and Elasticsearch to create a full‑stack AI Search RAG solution, covering prerequisites, step‑by‑step configuration of model services, database connections, runtimes, knowledge bases, workflow creation, testing, and deployment for production use.

AI SearchElasticsearchLarge Language Model
0 likes · 11 min read
Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Jun 27, 2025 · Operations

How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers

This article details the design, technology choices, functional modules, core implementation, performance optimizations, and future roadmap of Ops‑Nexus, an AI‑driven intelligent operations platform that streamlines alarm analysis, log processing, and host health checks for large‑scale monitoring environments.

AI OpsIntelligent OperationsLLM
0 likes · 12 min read
How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers
AI Algorithm Path
AI Algorithm Path
Jun 26, 2025 · Artificial Intelligence

The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System

This guide breaks down the ten core building blocks of a production‑ready RAG pipeline—from input handling and vector stores to prompt engineering, LLM inference, observability, and evaluation—showing why each piece matters, common pitfalls, and practical best‑practice recommendations.

LLMObservabilityPrompt Engineering
0 likes · 9 min read
The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System
Su San Talks Tech
Su San Talks Tech
Jun 26, 2025 · Artificial Intelligence

Master Spring AI Alibaba 1.0: Upgrade Guide, New Features & Real‑World Code

This article walks you through what Spring AI Alibaba 1.0 offers, highlights its major updates such as the Graph multi‑agent framework and ecosystem integrations, and provides a step‑by‑step upgrade path with Maven dependency changes, code fixes, and configuration adjustments for Java developers.

AI FrameworkMCPRAG
0 likes · 20 min read
Master Spring AI Alibaba 1.0: Upgrade Guide, New Features & Real‑World Code
DeWu Technology
DeWu Technology
Jun 25, 2025 · Artificial Intelligence

Engineering Large Language Models with Spring AI: From Basics to RAG and Function Calls

This article walks through the fundamentals of large language models, their stateless and structured-output nature, explains how Spring‑AI provides a Java‑friendly API for model integration, covers RAG architecture, the MCP protocol, and demonstrates end‑to‑end code examples for building intelligent agents.

AI integrationFunction CallingJava
0 likes · 15 min read
Engineering Large Language Models with Spring AI: From Basics to RAG and Function Calls
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jun 24, 2025 · Artificial Intelligence

How Transformers and Mixture-of-Experts Power Large Language Models

This article explores the role of Transformers and Mixture‑of‑Experts in large models, outlines five fine‑tuning methods, compares traditional and agentic RAG, presents classic agent design patterns, text‑chunking strategies, levels of intelligent agent systems, and explains KV‑caching techniques.

Large Language ModelsMixture of ExpertsRAG
0 likes · 2 min read
How Transformers and Mixture-of-Experts Power Large Language Models
Fun with Large Models
Fun with Large Models
Jun 23, 2025 · Artificial Intelligence

Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction

This article walks through the complete GraphRAG knowledge‑graph building pipeline—text splitting, entity extraction, relation mining, community clustering, and report generation—using a concrete example from the book “The Age of Big Data,” and explains why each step improves retrieval and answer quality.

GraphRAGKnowledge GraphRAG
0 likes · 20 min read
Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction
Tech Freedom Circle
Tech Freedom Circle
Jun 21, 2025 · Artificial Intelligence

How MCP + LLM + Agent Architecture Becomes the AI Agent’s Neural Hub and New Infrastructure

The article explains the Model Context Protocol (MCP) as a zero‑code bridge that lets large language models seamlessly access databases, external APIs, and execute code, detailing its benefits for developers and everyday users, its core components, step‑by‑step workflow, real‑world examples, and how it outperforms traditional APIs in modern AI agent systems.

AI AgentLLMMCP
0 likes · 37 min read
How MCP + LLM + Agent Architecture Becomes the AI Agent’s Neural Hub and New Infrastructure
Data Thinking Notes
Data Thinking Notes
Jun 19, 2025 · Artificial Intelligence

Andrew Ng on Building Agentic AI Systems: Tools, MCP, and Practical Insights

In a candid conversation, Andrew Ng and Harrison Chase explore the evolving landscape of AI agents, discussing modular toolchains, the emerging MCP standard, challenges of agent‑to‑agent communication, voice interaction latency, and the importance of rapid, technically skilled execution for successful AI product development.

AI agentsLangChainMCP
0 likes · 19 min read
Andrew Ng on Building Agentic AI Systems: Tools, MCP, and Practical Insights
DataFunSummit
DataFunSummit
Jun 19, 2025 · Artificial Intelligence

How Large Models Are Revolutionizing Douyin’s User Experience – Expert Insights

In a detailed interview, ByteDance AI specialist Cai Conghuai explains how large‑model techniques such as SFT, DPO and RAG address Douyin’s multimodal user‑experience challenges, improve signal detection, root‑cause analysis, and outline future AI‑agent breakthroughs for content platforms.

AI AlgorithmsMultimodal LearningRAG
0 likes · 11 min read
How Large Models Are Revolutionizing Douyin’s User Experience – Expert Insights
Fun with Large Models
Fun with Large Models
Jun 19, 2025 · Artificial Intelligence

How GraphRAG Boosts Answer Accuracy with Knowledge Graphs (Part 1)

This article explains GraphRAG’s architecture, compares it with traditional RAG, and presents experimental results showing that GraphRAG’s knowledge‑graph‑driven retrieval markedly improves answer accuracy, especially on low‑match, multi‑paragraph queries.

GraphRAGKnowledge GraphLarge Language Models
0 likes · 11 min read
How GraphRAG Boosts Answer Accuracy with Knowledge Graphs (Part 1)
Tencent Technical Engineering
Tencent Technical Engineering
Jun 16, 2025 · Artificial Intelligence

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

This comprehensive guide walks you through the fundamentals of Retrieval‑Augmented Generation (RAG) and AI agents, explains their inner workings, shares optimization tricks, provides ready‑to‑run code snippets, and demonstrates how to evaluate performance with metrics such as recall, faithfulness, and answer relevance.

AI agentsLLMPrompt Engineering
0 likes · 36 min read
Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies
ITPUB
ITPUB
Jun 15, 2025 · Artificial Intelligence

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

This article presents a step‑by‑step guide for constructing a scalable enterprise Retrieval‑Augmented Generation (RAG) solution using the Model Context Protocol (MCP), covering architecture comparison, system design, Milvus‑backed knowledge store, Python client implementation, deployment scripts, code examples, and best‑practice recommendations.

KnowledgeBaseLLMMCP
0 likes · 22 min read
How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)
TAL Education Technology
TAL Education Technology
Jun 13, 2025 · Operations

How Large Language Models Are Revolutionizing Fault Localization

This article explores how the rapid rise of large language models and techniques like Retrieval‑Augmented Generation, Chain‑of‑Thought prompting, and multi‑agent architectures can dramatically improve the speed, accuracy, and automation of fault localization in modern operations environments.

Agent ArchitectureCoTFault Localization
0 likes · 14 min read
How Large Language Models Are Revolutionizing Fault Localization
Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

EmbeddingLLMMilvus
0 likes · 19 min read
How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models
DataFunSummit
DataFunSummit
Jun 12, 2025 · Artificial Intelligence

How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations

This article details Alibaba Cloud AI Search’s development journey, covering its dual product lines, the evolution of Agentic RAG technology, multi‑agent architectures, vector retrieval breakthroughs, GPU‑accelerated indexing, NL2SQL capabilities, deployment models, and future directions for AI‑driven search solutions.

AI SearchGPU AccelerationLarge Models
0 likes · 33 min read
How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations
Zuoyebang Tech Team
Zuoyebang Tech Team
Jun 12, 2025 · Information Security

How AI‑Powered RAG and Agents Are Revolutionizing Enterprise Security Operations

This article explains how the rise of AI large‑model technology and Retrieval‑Augmented Generation (RAG) combined with autonomous AI agents enable a three‑layer network‑boundary defense, address deep operational challenges such as alert overload and response latency, and dramatically improve incident‑response efficiency in large‑scale enterprises.

AI agentsAI securityIncident Response
0 likes · 16 min read
How AI‑Powered RAG and Agents Are Revolutionizing Enterprise Security Operations
Data Thinking Notes
Data Thinking Notes
Jun 11, 2025 · Artificial Intelligence

How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×

This case study details how a government‑focused AI system using retrieval‑augmented generation (RAG) and advanced preprocessing algorithms increased data labeling speed by up to five times, raised accuracy above 95%, and produced high‑quality enterprise, spatial, and economic datasets.

AIAutomationGovernment
0 likes · 5 min read
How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×
Sohu Tech Products
Sohu Tech Products
Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AIDatabaseDeepSeek
0 likes · 15 min read
How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 10, 2025 · Artificial Intelligence

How AI Application Architectures Evolve: From Simple LLM Calls to Guardrails, Routing, and Agents

This article traces the evolution of AI application architectures—from the earliest minimal user‑LLM interaction to advanced designs featuring context enhancement, input/output guardrails, intent routing, model gateways, caching strategies, agent capabilities, monitoring, and inference performance optimizations—providing practical insights and references for developers.

AI ArchitectureAgentCaching
0 likes · 21 min read
How AI Application Architectures Evolve: From Simple LLM Calls to Guardrails, Routing, and Agents
Data Thinking Notes
Data Thinking Notes
Jun 8, 2025 · Artificial Intelligence

Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries

This article presents a panoramic view of AI large‑model technologies, showcasing a series of architecture diagrams that illustrate general model frameworks, RAG knowledge‑base structures, agricultural and retail applications, IoT integration, compliance and risk‑management setups, agent platforms, and CRM‑enhanced solutions.

AILarge ModelsRAG
0 likes · 3 min read
Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries
Qborfy AI
Qborfy AI
Jun 7, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) Chatbot with LangChain and Streamlit

This guide walks through the complete process of creating a RAG‑powered question‑answering bot using LangChain, Streamlit, and vector‑store embeddings, covering theory, architecture, data loading, chunking, vector indexing, retrieval, LLM integration, and full code implementation with practical examples.

ChatbotEmbeddingsLangChain
0 likes · 13 min read
Build a Retrieval‑Augmented Generation (RAG) Chatbot with LangChain and Streamlit
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Jun 6, 2025 · Artificial Intelligence

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

The article enumerates common pitfalls of Retrieval‑Augmented Generation—such as missing content, low‑rank document misses, context limits, format errors, incomplete answers, scalability bottlenecks, complex PDF extraction, data‑quality issues, domain adaptation gaps, hallucinations, and feedback‑loop deficiencies—and offers concrete mitigation strategies ranging from data cleaning and prompt design to hybrid search, hierarchical retrieval, document compression, and automated evaluation.

Hybrid SearchLLMPrompt Engineering
0 likes · 9 min read
Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)
IT Services Circle
IT Services Circle
Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AIKnowledge retrievalRAG
0 likes · 12 min read
Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices
Didi Tech
Didi Tech
Jun 5, 2025 · Artificial Intelligence

Unlocking Modern AI Application Architecture: From RAG to Agents and MCP

This article surveys the evolution of AI applications, explains large language model fundamentals, outlines architectural challenges, and introduces three core patterns—Retrieval‑Augmented Generation (RAG), autonomous Agents, and Model Context Protocol (MCP)—while providing practical LangChain code snippets and integration guidance.

AIAgentLLM
0 likes · 28 min read
Unlocking Modern AI Application Architecture: From RAG to Agents and MCP
Fighter's World
Fighter's World
Jun 2, 2025 · Artificial Intelligence

Why Is Context King for Large Language Models?

This article provides a comprehensive technical analysis of LLM context, covering its definition, types, tokenization, window‑size evolution, diminishing returns, management techniques such as RAG, CoT, memory‑as‑a‑service, and future challenges like multimodal fusion, privacy, and autonomous agent memory.

Agent MemoryContext ManagementLLM
0 likes · 48 min read
Why Is Context King for Large Language Models?
dbaplus Community
dbaplus Community
May 31, 2025 · Artificial Intelligence

How RAG is Shaping the Future of AI-Powered User Experience

Amid the rapid rise of large language models, this article examines RAG’s development, technical hurdles, core strategies, and future outlook, illustrating how Alibaba’s Chatbot and Copilot projects boost retrieval accuracy to 90% and generation precision to 85% while tackling data quality, heterogeneous retrieval, and evaluation challenges.

AI SearchRAGevaluation metrics
0 likes · 27 min read
How RAG is Shaping the Future of AI-Powered User Experience
ITFLY8 Architecture Home
ITFLY8 Architecture Home
May 30, 2025 · Artificial Intelligence

Explore the Full Spectrum of AI Large Model Architectures

This article presents a comprehensive visual collection of AI large‑model architecture diagrams, covering general frameworks, RAG knowledge‑base systems, agriculture, e‑commerce recommendation, IoT, compliance risk management, agent platforms, and CRM integration, offering a panoramic view of modern AI infrastructure.

AIComplianceIoT
0 likes · 3 min read
Explore the Full Spectrum of AI Large Model Architectures
Instant Consumer Technology Team
Instant Consumer Technology Team
May 29, 2025 · Artificial Intelligence

API vs GUI Agents: How to Choose the Right LLM Automation Approach

This article examines the evolution of large language model agents, contrasting API‑based agents that use predefined function calls with GUI‑based agents that interact with visual interfaces, and explores hybrid strategies, orchestration tools, RAG techniques, and practical guidelines for selecting the optimal paradigm.

API vs GUIHybrid automationLLM Agents
0 likes · 34 min read
API vs GUI Agents: How to Choose the Right LLM Automation Approach
DevOps
DevOps
May 28, 2025 · Artificial Intelligence

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Google researchers introduce a “sufficient context” framework that classifies retrieved passages as adequate or inadequate for answering a query, enabling large language models in enterprise RAG systems to decide when to answer, refuse, or request more information, thereby improving accuracy and reducing hallucinations.

AI reliabilityEnterprise AILarge Language Models
0 likes · 9 min read
Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems
phodal
phodal
May 28, 2025 · Artificial Intelligence

Boost Code Retrieval with AutoDev’s Pre‑Generated Context Worker

The article explains how AutoDev’s Context Worker pre‑generates semantic code context to improve RAG performance, outlines the limitations of vector‑based retrieval, describes the tool’s multi‑language AST analysis, knowledge‑graph construction, and provides command‑line usage examples for integrating the generated context into AI‑driven development workflows.

AIASTCLI
0 likes · 8 min read
Boost Code Retrieval with AutoDev’s Pre‑Generated Context Worker
Coder Circle
Coder Circle
May 28, 2025 · Artificial Intelligence

Core AI Concepts Every Spring AI Developer Should Know

This article explains fundamental AI concepts—including models, prompts, prompt templates, embeddings, tokens, structured output, data integration, RAG, and tool calling—and shows how Spring AI simplifies their use for Java developers building intelligent applications.

AI modelsEmbeddingsPrompt Engineering
0 likes · 13 min read
Core AI Concepts Every Spring AI Developer Should Know
Programmer DD
Programmer DD
May 21, 2025 · Artificial Intelligence

What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features

Spring AI 1.0 GA introduces a comprehensive suite of AI capabilities for Java developers, including a ChatClient supporting 20 models, vector‑store integrations, RAG pipelines, advanced chat memory, @Tool function calling, model evaluation, observability, Model Context Protocol, and autonomous agents, with examples for major cloud providers.

AI modelsJavaMCP
0 likes · 6 min read
What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features
Java Architecture Diary
Java Architecture Diary
May 21, 2025 · Artificial Intelligence

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Spring AI 1.0, the first production‑grade Java AI framework, introduces ready‑to‑use APIs, seamless model integration, enterprise‑level RAG engine, smart tool calling, and three development modes, empowering developers to rapidly build, customize, and fully control AI applications with major model providers like OpenAI, Anthropic, DeepSeek.

AI FrameworkDeepSeekJava AI
0 likes · 13 min read
Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled
DeWu Technology
DeWu Technology
May 19, 2025 · Artificial Intelligence

AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans

This article presents a comprehensive AI-driven solution for automatically generating functional test cases, detailing the AI background, design scheme, core components such as PRD parsing, test‑point generation, test‑case creation, knowledge‑base construction, implementation results, and future development directions.

AIKnowledge BaseLLM
0 likes · 7 min read
AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans
Tencent Technical Engineering
Tencent Technical Engineering
May 19, 2025 · Artificial Intelligence

RAG, Agents, and Multimodal Large Models: Evolution, Challenges, and Future Trends

This article examines the evolution of large model technologies—including Retrieval‑Augmented Generation, AI agents, and multimodal models—detailing their technical foundations, practical challenges, industry applications, and future development trends, offering a comprehensive perspective for AI practitioners and researchers.

AI AgentKnowledge retrievalMultimodal
0 likes · 14 min read
RAG, Agents, and Multimodal Large Models: Evolution, Challenges, and Future Trends
DataFunSummit
DataFunSummit
May 13, 2025 · Artificial Intelligence

Integrating Large Language Models and Knowledge Graphs for Financial Applications: Challenges, Solutions, and Future Directions

This talk explores the technical challenges of applying large language models and knowledge graphs in finance, discusses solutions such as RAG enhancements, graph‑guided retrieval, multimodal extensions, and presents future research directions including multimodal graph integration, agentic systems, and decision‑making applications.

AIAgentic SystemsFinance
0 likes · 33 min read
Integrating Large Language Models and Knowledge Graphs for Financial Applications: Challenges, Solutions, and Future Directions
DataFunSummit
DataFunSummit
May 9, 2025 · Artificial Intelligence

Practical Experience Building Zhihu Direct Answer: An AI‑Powered Search Product

This article presents a comprehensive overview of Zhihu Direct Answer, describing its AI‑driven search architecture, RAG framework, query understanding, retrieval, chunking, reranking, generation, evaluation mechanisms, engineering optimizations, and the professional edition, while sharing concrete performance‑boosting practices and future development plans.

AIProduct DevelopmentRAG
0 likes · 14 min read
Practical Experience Building Zhihu Direct Answer: An AI‑Powered Search Product
Alibaba Cloud Native
Alibaba Cloud Native
May 9, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch

This tutorial walks through building a Retrieval‑Augmented Generation (RAG) system by combining LangChain for document processing, Elasticsearch’s vector store with the ELSER v2 model for semantic search, and Higress as a cloud‑native AI gateway, complete with deployment scripts, code examples, and query testing.

AIHigressLangChain
0 likes · 15 min read
Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch
phodal
phodal
May 9, 2025 · Artificial Intelligence

Why Pre‑Generated Context Is the Key to Faster, More Accurate AI Code Retrieval

The article examines how pre‑generating structured context for codebases can overcome the uncertainty and quality issues of traditional Retrieval‑Augmented Generation, outlines the technical and business challenges of RAG, compares existing code‑search tools, and introduces AutoDev’s Context Worker as a practical solution.

AILLMRAG
0 likes · 11 min read
Why Pre‑Generated Context Is the Key to Faster, More Accurate AI Code Retrieval