Tagged articles

924 articles

Page 6 of 10

Aug 17, 2025 · Artificial Intelligence

Unlocking AI Agents: From Basics to Real-World Development

This article provides a comprehensive overview of AI Agents, covering their fundamental concepts, core features, technical evolution, work cycle, architectural modules, key technologies such as prompt engineering and RAG, practical development steps, a data‑analysis agent case study, and typical industry applications.

AI AgentAgent ArchitectureArtificial Intelligence

0 likes · 13 min read

Unlocking AI Agents: From Basics to Real-World Development

Alibaba Cloud Developer

Aug 15, 2025 · Artificial Intelligence

Mastering AI Agents: Prompt Engineering, Workflows, and RAG Strategies

This article systematically explains how to build reliable, high‑performance AI agents by focusing on the core components—LLM, prompts, workflows, RAG, and tools—while covering prompt engineering techniques, DSL‑based workflow design, vector‑database knowledge bases, security against prompt injection, and practical project planning.

AI AgentLLMRAG

0 likes · 15 min read

Mastering AI Agents: Prompt Engineering, Workflows, and RAG Strategies

Tencent Technical Engineering

Aug 14, 2025 · Artificial Intelligence

Why Do Large Language Models Hallucinate? Causes, Risks, and Multi‑Dimensional Solutions

This article systematically examines the root causes of hallucinations in large language models, evaluates their pros and cons, and presents a comprehensive set of optimization techniques—including prompt engineering, RAG, sampling tweaks, supervised fine‑tuning, LoRA, RLHF, chain‑of‑thought reasoning, and agent/workflow designs—to build more reliable and trustworthy AI applications.

AILLMLoRA

0 likes · 29 min read

Why Do Large Language Models Hallucinate? Causes, Risks, and Multi‑Dimensional Solutions

Zhuanzhuan Tech

Aug 13, 2025 · Artificial Intelligence

From Third‑Party to Self‑Developed: Zhaozhuan’s AI Chatbot Evolution

This article details how Zhaozhuan transformed its customer‑service robot from relying on external NLP providers to a fully self‑developed AI solution, covering technical choices, model training, RAG integration, deployment strategies, and future directions.

AIChatbotCustomer Service

0 likes · 10 min read

From Third‑Party to Self‑Developed: Zhaozhuan’s AI Chatbot Evolution

Mingyi World Elasticsearch

Aug 13, 2025 · Artificial Intelligence

Rapidly Build a Labor Law AI Q&A Assistant Using Claude and Coco AI

The article details how a legal team tackled inefficient labor law document search by creating a RAG‑based intelligent Q&A assistant with Claude and the open‑source Coco AI toolkit, covering solution comparison, environment setup, data preparation, testing, and optimization insights.

ClaudeCoco AILabor Law

0 likes · 9 min read

Rapidly Build a Labor Law AI Q&A Assistant Using Claude and Coco AI

DaTaobao Tech

Aug 13, 2025 · Artificial Intelligence

Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques

This article explores the emerging field of prompt engineering, detailing its fundamentals, advanced strategies such as chain‑of‑thought, ReAct, and structured frameworks, and demonstrates practical applications in AI agents for data retrieval, SQL generation, and market insight, offering actionable guidance for developers and business users alike.

AI agentsData RetrievalRAG

0 likes · 42 min read

Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques

Alibaba Cloud Developer

Aug 11, 2025 · Artificial Intelligence

How Fine‑Tuning Large Models Solves Code Upgrade Challenges and Boosts Stable Module Matching

This article details an innovative approach that uses large‑model supervised fine‑tuning to overcome the instability of code RAG and code agents during open‑source repository upgrades, addressing domain‑specific terminology, code style differences, and improving recall, accuracy, and deployment efficiency.

AI agentsLLMRAG

0 likes · 11 min read

How Fine‑Tuning Large Models Solves Code Upgrade Challenges and Boosts Stable Module Matching

AI Large Model Application Practice

Aug 11, 2025 · Artificial Intelligence

How to Build an LLM-Powered Smart Resume Screening System

This article presents a detailed design and implementation of an LLM‑based intelligent resume matching system that combines semantic vector retrieval, structured rule filtering, multi‑dimensional weighted scoring, and natural‑language interaction to create a fast, quantifiable, and explainable hiring pipeline.

AI RecruitmentLLMRAG

0 likes · 18 min read

How to Build an LLM-Powered Smart Resume Screening System

Alibaba Cloud Big Data AI Platform

Aug 8, 2025 · Artificial Intelligence

Can GitOps Power Low‑Cost LLM Agents? A Hands‑On Exploration

This article examines how the Manus sandbox and CodeAct mechanisms inspire a GitOps‑based approach to building LLM agents, detailing the design of planner and executor components, workflow steps, advantages such as RAG and observability, and the potential for low‑cost, scalable intelligent agent development.

AI agentsGitOpsIntelligent agents

0 likes · 12 min read

Can GitOps Power Low‑Cost LLM Agents? A Hands‑On Exploration

Alibaba Cloud Big Data AI Platform

Aug 8, 2025 · Artificial Intelligence

Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design

This article summarizes a Cloud Summit talk where Alibaba Cloud’s AI expert Zhang Yingying explains how large language models can enhance big‑data intelligent operations, covering opportunities, challenges, RAG‑based Q&A, multi‑agent diagnostics, and the engineering architecture needed for reliable, scalable deployment.

AI EngineeringBig Data OperationsLarge Language Models

0 likes · 20 min read

Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design

Alibaba Cloud Big Data AI Platform

Aug 8, 2025 · Artificial Intelligence

What Von Neumann’s Brain Theory Reveals About Prompt Engineering for LLMs

The article explores how Von Neumann’s insights on the brain‑computer analogy illuminate modern large‑language‑model prompt engineering, comparing logical reasoning chains, memory mechanisms, and DSL‑driven computation to improve accuracy, reduce hallucinations, and balance reasoning depth with precise calculation.

DSLLarge Language ModelsPrompt Engineering

0 likes · 14 min read

What Von Neumann’s Brain Theory Reveals About Prompt Engineering for LLMs

AsiaInfo Technology: New Tech Exploration

Aug 8, 2025 · Industry Insights

How CodeRAG Reinvents Large‑Scale Code Repository Knowledge Extraction and Hierarchical Retrieval

CodeRAG leverages AST‑centric parsing and a hierarchical knowledge graph to overcome text‑only retrieval limits in large code repositories, offering multi‑language analysis, incremental parsing, hybrid indexing, and intelligent context selection for tasks such as code completion, Q&A, documentation generation, and impact analysis.

ASTCodeRAGLarge-Scale Repos

0 likes · 15 min read

How CodeRAG Reinvents Large‑Scale Code Repository Knowledge Extraction and Hierarchical Retrieval

Alibaba Cloud Big Data AI Platform

Aug 7, 2025 · Databases

Unlock Powerful Hybrid Search with Milvus 2.5: Full-Text, BM25, and RAG Guide

This tutorial explains how to use Milvus 2.5's new full‑text, BM25 keyword matching, and hybrid search capabilities—including step‑by‑step setup, schema design, code examples, and RAG integration—to achieve high recall and precision in large‑scale AI vector retrieval scenarios.

Full-Text SearchHybrid SearchMilvus

0 likes · 13 min read

Unlock Powerful Hybrid Search with Milvus 2.5: Full-Text, BM25, and RAG Guide

Alibaba Cloud Developer

Aug 5, 2025 · Databases

How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine

This article explains how PolarDB IMCI integrates vector indexing and embedding directly into the database kernel, offering a unified, transactional, and real‑time vector lifecycle management service that lets developers build RAG knowledge bases and AI applications using only standard SQL, dramatically reducing development and operational complexity.

AIPolardbRAG

0 likes · 11 min read

How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine

Alibaba Cloud Developer

Aug 5, 2025 · Artificial Intelligence

Mastering Intent Detection & Slot Filling: Proven Strategies and Code Samples

This article shares reusable AI development techniques for intent detection and slot filling, comparing four solution tiers—from simple prompt engineering to advanced RAG‑enhanced architectures—complete with practical code snippets, performance trade‑offs, and guidance on selecting the optimal approach for reliable conversational agents.

Intent DetectionNLUPrompt Engineering

0 likes · 27 min read

Mastering Intent Detection & Slot Filling: Proven Strategies and Code Samples

Architect's Alchemy Furnace

Aug 4, 2025 · Artificial Intelligence

How RAG and Long‑Term Memory Turn AI into a Truly Remembering Assistant

This article explains how Retrieval‑Augmented Generation (RAG) and long‑term memory systems like MenoBase enable large language models to overcome short‑term memory limits, dynamically retrieve up‑to‑date knowledge, and personalize interactions, with practical Dify implementation steps and real‑world use cases across industries.

AIDifyKnowledge Base

0 likes · 18 min read

How RAG and Long‑Term Memory Turn AI into a Truly Remembering Assistant

Alibaba Cloud Big Data AI Platform

Jul 30, 2025 · Artificial Intelligence

Build Enterprise‑Grade RAG Apps with Milvus and Dify: A Step‑by‑Step Guide

This guide explains how to combine Alibaba Cloud Milvus, a high‑performance vector database, with the low‑code AI platform Dify to create an enterprise‑level Retrieval‑Augmented Generation (RAG) application, covering architecture, installation, data ingestion, and verification steps.

AI ApplicationDifyLow‑code

0 likes · 9 min read

Build Enterprise‑Grade RAG Apps with Milvus and Dify: A Step‑by‑Step Guide

AsiaInfo Technology: New Tech Exploration

Jul 30, 2025 · Artificial Intelligence

How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls

This article analyzes the prompt‑inflation bottleneck that arises when large language models (LLMs) must handle thousands of Model Context Protocol (MCP) services, and introduces the MCP‑RAG architecture—a retrieval‑augmented generation solution that builds a metadata knowledge base and intelligent retrieval layer to enable precise, efficient MCP service discovery at scale.

AILLMMCP

0 likes · 21 min read

How MCP‑RAG Overcomes Prompt Inflation for Massive LLM Service Calls

Ops Development Stories

Jul 29, 2025 · Artificial Intelligence

Master AI Agents with LangGraph: Build Adaptive RAG, Translation, and ReAct Agents

This comprehensive guide explains what an AI Agent is, its core capabilities and design patterns, and walks through step‑by‑step implementations of RAG, Translation, and ReAct agents using LangGraph, complete with code samples, workflow diagrams, and practical tips for building personal ops knowledge‑base agents.

LLMLangGraphRAG

0 likes · 64 min read

Master AI Agents with LangGraph: Build Adaptive RAG, Translation, and ReAct Agents

SF Technology Team

Jul 29, 2025 · Artificial Intelligence

How SF Tech’s Proprietary Large Models Revolutionize Logistics and AI Operations

The DA Data Intelligence Conference in Shenzhen showcased SF Tech’s breakthroughs in large‑model AI, revealing how their proprietary multimodal models, RAG innovations, and agent platforms dramatically improve logistics decision‑making, resource scheduling, and customer service across multiple industries.

AI OperationsAgent PlatformLarge Models

0 likes · 11 min read

How SF Tech’s Proprietary Large Models Revolutionize Logistics and AI Operations

Ops Development & AI Practice

Jul 29, 2025 · Artificial Intelligence

Building a Retrieval‑Augmented Generation QA Bot to Keep LLMs Up‑to‑Date

This article explains how to create a RAG‑based intelligent QA system that fetches the latest documentation (e.g., PlantUML) before querying Gemini, detailing knowledge‑base creation, embedding, vector store management, LangChain integration, and deployment tips.

AI AssistantEmbeddingGemini

0 likes · 8 min read

Building a Retrieval‑Augmented Generation QA Bot to Keep LLMs Up‑to‑Date

Architecture and Beyond

Jul 27, 2025 · Artificial Intelligence

What Makes an AI Agent Tick? From Expert Systems to Modern Architectures

This article traces the evolution of AI agents from early expert systems to today’s multimodal, memory‑rich agents, explains their perception, reasoning, memory and action modules, discusses model selection, prompt engineering, RAG techniques, and highlights current limitations such as hallucinations, reliability, cost, and security.

AI AgentFunction CallingLarge Language Model

0 likes · 28 min read

What Makes an AI Agent Tick? From Expert Systems to Modern Architectures

Mingyi World Elasticsearch

Jul 23, 2025 · Artificial Intelligence

How to Build an Enhanced RAG Retrieval and AI Assistant for Youdao Cloud Notes

This article walks through retrieving ten years of Youdao Cloud Notes, selecting a RAG implementation (self‑built or using Coco AI), configuring cookies, loading the notes locally, and integrating a large language model to enable full‑text search and intelligent question‑answering.

AI AssistantCoco AIRAG

0 likes · 8 min read

How to Build an Enhanced RAG Retrieval and AI Assistant for Youdao Cloud Notes

JD Tech Talk

Jul 23, 2025 · Artificial Intelligence

Causal Inference + LLMs: Transforming E‑Commerce Pricing Strategies

This article describes how integrating causal inference with large language models and Retrieval‑Augmented Generation can automate and explain e‑commerce product pricing, detailing the three‑step workflow, reinforcement‑learning rewards, experimental results, and future directions for end‑to‑end RAG‑LLM training.

RAGcausal inferencee‑commerce pricing

0 likes · 15 min read

Causal Inference + LLMs: Transforming E‑Commerce Pricing Strategies

Zhuanzhuan Tech

Jul 23, 2025 · Artificial Intelligence

Why AI‑Generated Code Often Misses the Mark and How a Code Knowledge Base Fixes It

AI‑generated code frequently fails to match project conventions due to lack of contextual memory, but building a dynamic code knowledge base combined with Retrieval‑Augmented Generation (RAG) enables precise, compliant code output, reduces errors, accelerates development, and transforms AI into a project‑specific assistant.

AIKnowledge BaseRAG

0 likes · 13 min read

Why AI‑Generated Code Often Misses the Mark and How a Code Knowledge Base Fixes It

Tencent Cloud Developer

Jul 23, 2025 · Artificial Intelligence

Why Retrieval‑Augmented Generation Is Evolving Into Agentic AI Search

This article explains how the inherent knowledge limits of large language models drive the rise of Retrieval‑Augmented Generation (RAG), outlines its three evolutionary stages, introduces Agentic RAG and DeepSearch, and discusses the knowledge and ability boundaries that shape future AI search systems.

AI SearchDeepSearchKnowledge retrieval

0 likes · 19 min read

Why Retrieval‑Augmented Generation Is Evolving Into Agentic AI Search

Mingyi World Elasticsearch

Jul 18, 2025 · Artificial Intelligence

Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)

The video walks through the differences between traditional keyword search and vector search, explains the core concept of Retrieval‑Augmented Generation, and demonstrates how to construct a knowledge‑base Q&A system using a large language model integrated with Elasticsearch.

ElasticsearchKnowledge BaseLarge Language Model

0 likes · 1 min read

Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)

DataFunSummit

Jul 16, 2025 · Artificial Intelligence

How Tencent Cloud ES Powers RAG with Hybrid Search and Massive Vector Optimizations

This article explores how Tencent Cloud Elasticsearch combines decades of text search expertise with cutting‑edge vector retrieval and large language models to deliver a one‑stop Retrieval‑Augmented Generation solution, detailing the underlying models, hybrid search architecture, performance tricks, and real‑world case studies.

ElasticsearchHybrid SearchLLM

0 likes · 24 min read

How Tencent Cloud ES Powers RAG with Hybrid Search and Massive Vector Optimizations

DataFunSummit

Jul 15, 2025 · Artificial Intelligence

Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications

This article explains why traditional keyword search falls short, introduces Elasticsearch's vector search and hybrid retrieval capabilities, and shows how combining it with large language models enables Retrieval‑Augmented Generation (RAG) for more accurate, context‑aware AI-driven search across text and multimedia data.

AIElasticsearchLarge Language Models

0 likes · 5 min read

Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications

Tencent Cloud Developer

Jul 15, 2025 · Artificial Intelligence

How RAG Evolved: From Naive to Agentic – A Complete Guide

This article systematically outlines the evolution of Retrieval‑Augmented Generation (RAG) from its naive three‑step pipeline to advanced, modular, and agentic architectures, highlighting each generation's motivations, core features, advantages, drawbacks, and practical implementation details for large language model applications.

Agentic RAGArtificial IntelligenceLLM

0 likes · 20 min read

How RAG Evolved: From Naive to Agentic – A Complete Guide

Fun with Large Models

Jul 15, 2025 · Artificial Intelligence

Getting Started with LangChain & LangGraph: Core Concepts of AI Agents

This article introduces AI Agents and explains why LangChain is the leading framework, detailing its core concepts, three‑layer architecture, key features, comparison with other agent frameworks, and showcasing popular projects built with LangChain and LangGraph.

AI AgentLLMLangChain

0 likes · 10 min read

Getting Started with LangChain & LangGraph: Core Concepts of AI Agents

Ops Development Stories

Jul 14, 2025 · Artificial Intelligence

Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment

This comprehensive guide explores AIOps techniques such as prompt engineering, chat completions, memory management, function calling, fine‑tuning, retrieval‑augmented generation (RAG), graph‑based RAG, and practical steps for deploying open‑source large language models locally, providing code examples and best‑practice recommendations for modern DevOps environments.

Function CallingGraph RAGRAG

0 likes · 47 min read

Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment

Tencent Technical Engineering

Jul 14, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, and MCP: Core Concepts and How They Interact

This article provides a concise overview of the latest AI concepts—including AIGC, Retrieval‑Augmented Generation, Function‑Calling models, intelligent agents, and the Model Context Protocol—explaining their principles, differences, and how they can be combined to build more powerful AI applications for developers outside the AI field.

AIGCAgentFunction Calling

0 likes · 15 min read

Demystifying AIGC, Agents, and MCP: Core Concepts and How They Interact

DaTaobao Tech

Jul 14, 2025 · Artificial Intelligence

Mastering AI Application Modes: Embedding, Copilot, and Agents Explained

This article explores practical AI engineering strategies, detailing the three AI application modes—Embedding, Copilot, and Agents—along with prompt engineering, model selection, function calling, RAG, workflow design, and multi‑agent architectures to boost business efficiency and user experience.

AIPrompt EngineeringRAG

0 likes · 25 min read

Mastering AI Application Modes: Embedding, Copilot, and Agents Explained

Data Thinking Notes

Jul 13, 2025 · Artificial Intelligence

How to Build an Enterprise Knowledge Base with Dify: Full Setup Guide

This article walks developers through the entire process of deploying Dify locally, configuring model providers, creating and segmenting a knowledge base with RAG, choosing indexing methods, and integrating the knowledge base into a chatbot application, complete with code snippets and visual guides.

AI deploymentDifyKnowledge Base

0 likes · 11 min read

How to Build an Enterprise Knowledge Base with Dify: Full Setup Guide

Architecture and Beyond

Jul 12, 2025 · Artificial Intelligence

What Exactly Is an AI Agent? History, Architecture, and Future Challenges

This article traces the evolution of AI agents from early expert systems to modern large‑language‑model‑driven assistants, explains their core perception, reasoning, memory, and action modules, compares thinking and execution models, and discusses current limitations such as hallucinations, reliability, cost, and security.

AI AgentLarge Language ModelMemory Architecture

0 likes · 20 min read

Architect

Jul 11, 2025 · Artificial Intelligence

How OpenAI’s Zero‑Vector Agentic RAG Redefines AI Knowledge Retrieval

OpenAI’s new non‑vectorized Agentic RAG approach replaces traditional vector search with a hierarchical, multi‑round content selection process, leveraging large‑context models like GPT‑4.1‑mini for efficient document loading, dynamic navigation, and accurate answer generation, while outlining model selection strategies, cost trade‑offs, and production considerations.

AI ArchitectureModel selectionRAG

0 likes · 15 min read

How OpenAI’s Zero‑Vector Agentic RAG Redefines AI Knowledge Retrieval

Sanyou's Java Diary

Jul 10, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, RAG, and MCP: Core AI Concepts Explained

This article provides a concise overview of the latest AI breakthroughs—including AIGC, multimodal technology, Retrieval‑Augmented Generation (RAG), intelligent agents with function‑calling models, and the Model Context Protocol (MCP)—explaining their principles, relationships, and practical implications for developers outside the AI field.

AIAIGCFunction Calling

0 likes · 16 min read

Demystifying AIGC, Agents, RAG, and MCP: Core AI Concepts Explained

DataFunSummit

Jul 10, 2025 · Artificial Intelligence

How Large Language Model AI Agents Transform Intelligent Operations and On‑Call Support

This article details the design and implementation of a large‑model‑driven intelligent operations dialogue system, covering intent recognition, routing, multi‑agent planning, RAG, workflow, ReAct, reflection, tree‑search techniques, evaluation challenges, and future multi‑agent collaboration for on‑call support.

AI agentsIntelligent OperationsRAG

0 likes · 23 min read

How Large Language Model AI Agents Transform Intelligent Operations and On‑Call Support

Tencent Cloud Developer

Jul 10, 2025 · Artificial Intelligence

Demystifying AIGC, Agents, and MCP: Essential AI Concepts for Developers

This article provides a concise, developer‑focused overview of emerging AI concepts—including AIGC, multimodal models, Retrieval‑Augmented Generation, intelligent agents, Function‑Calling, and the Model Context Protocol (MCP)—explaining their core principles, differences, and how they interrelate to enable advanced AI applications.

AIAIGCAgent

0 likes · 16 min read

Demystifying AIGC, Agents, and MCP: Essential AI Concepts for Developers

Huolala Tech

Jul 9, 2025 · Artificial Intelligence

How HuoLala Overcomes Large‑Model Limits with Retrieval‑Augmented Vector Databases

This article details HuoLala's journey from confronting large‑model knowledge gaps and latency issues to adopting Retrieval‑Augmented Generation (RAG) with vector databases, outlining their architecture challenges, selection criteria, deployment scenarios, and future plans for AI‑driven logistics.

AIElasticsearchMilvus

0 likes · 14 min read

How HuoLala Overcomes Large‑Model Limits with Retrieval‑Augmented Vector Databases

Fun with Large Models

Jul 9, 2025 · Artificial Intelligence

Boosting RAG Accuracy with GraphRAG: Large‑Scale KG Build and Mode Comparison

This article demonstrates how to construct a knowledge graph from a 58,099‑character document using GraphRAG, measures the indexing time on an M2 PRO machine, and compares Global, Local, and Drift query modes, highlighting three scenarios where Local mode is unsuitable.

GraphRAGKnowledge GraphRAG

0 likes · 11 min read

Boosting RAG Accuracy with GraphRAG: Large‑Scale KG Build and Mode Comparison

JD Tech Talk

Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Highway for New Developers

New developer Li Ming’s frustrating onboarding experience highlights hidden business rules, undocumented code, and poor knowledge transfer, prompting him to build an AI‑driven knowledge base that links code changes, requirements, and operational docs, ultimately streamlining troubleshooting, accelerating feature development, and improving knowledge retention across teams.

AILarge Language ModelRAG

0 likes · 18 min read

How AI Can Turn a Code Maze into a Knowledge Highway for New Developers

JD Cloud Developers

Jul 8, 2025 · Artificial Intelligence

How AI Can Turn a Code Maze into a Knowledge Hub for New Developers

This article follows a new developer named Li Ming as he confronts undocumented code, hidden business rules, and fragmented knowledge, then demonstrates how leveraging large‑language models to index, associate, and retrieve code, requirements, and operational data can create an intelligent knowledge base that streamlines onboarding, reduces errors, and enhances collaboration across development, testing, and product teams.

AIRAGsoftware development

0 likes · 19 min read

How AI Can Turn a Code Maze into a Knowledge Hub for New Developers

Architect's Alchemy Furnace

Jul 7, 2025 · Artificial Intelligence

How to Build an AI‑Powered Knowledge Retrieval Workflow with Dify and Embedding Models

This guide explains how to organize a knowledge base, select embedding and rerank models, clean user queries, and construct a Dify DSL workflow that iteratively retrieves and merges information before handing it to a large language model for answer generation.

AI workflowDifyKnowledge retrieval

0 likes · 23 min read

How to Build an AI‑Powered Knowledge Retrieval Workflow with Dify and Embedding Models

AI Algorithm Path

Jul 3, 2025 · Artificial Intelligence

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

This article examines how Retrieval‑Augmented Generation (RAG) has progressed from simple keyword‑based retrieval to advanced semantic methods, modular architectures, graph‑enhanced reasoning, and autonomous agentic systems, highlighting each approach's workflow, benefits, limitations, and the shift toward dynamic AI decision‑making.

AIAgentic RAGGraph RAG

0 likes · 7 min read

Exploring Advanced, Graph, and Agentic RAG: The Evolution of Retrieval‑Augmented Generation

Instant Consumer Technology Team

Jul 3, 2025 · Artificial Intelligence

Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises

Enterprises rushing to purchase DeepSeek AI appliances and smart‑agent platforms often face hidden technical, data, and organizational challenges that turn promised "plug‑and‑play" solutions into costly missteps, highlighting the need for realistic strategy, robust data governance, and continuous capability building.

AI capability buildingAI deploymentData Governance

0 likes · 28 min read

Why Buying an AI Appliance Is a Strategic Pitfall for Enterprises

AI Large Model Application Practice

Jul 2, 2025 · Artificial Intelligence

Build a PPT‑Powered RAG Engine with Visual Models and MCP Server

This article explains how to construct a Retrieval‑Augmented Generation (RAG) pipeline for multi‑page PPT documents by converting slides to images, extracting content with a vision model, indexing with LlamaIndex and Chroma, and exposing the functionality through an MCP Server with tools for adding, querying, and managing PPTs.

LlamaIndexMCP ServerPPT

0 likes · 13 min read

Build a PPT‑Powered RAG Engine with Visual Models and MCP Server

Instant Consumer Technology Team

Jul 1, 2025 · Artificial Intelligence

Choosing Between Gemini CLI and Claude Code: A Deep AI Coding Tool Comparison

This article examines the rise of terminal AI coding assistants, comparing Google’s Gemini CLI and Anthropic’s Claude Code across features, context length, cost, performance, platform support, and usage tips, helping developers decide which tool best fits their productivity and project needs.

AI codingClaude CodeGemini CLI

0 likes · 14 min read

Choosing Between Gemini CLI and Claude Code: A Deep AI Coding Tool Comparison

Ops Development Stories

Jul 1, 2025 · Artificial Intelligence

From Lean to AIOps: How AI is Transforming Modern Operations

This comprehensive guide walks through the evolution from Lean and Agile practices to DevOps and finally AIOps, explaining core concepts, key algorithms, the role of large language models, RAG‑based root‑cause analysis, and practical implementation steps for intelligent operations.

Large Language ModelsLeanRAG

0 likes · 19 min read

From Lean to AIOps: How AI is Transforming Modern Operations

Ops Development Stories

Jun 30, 2025 · Artificial Intelligence

Build a Private AI Knowledge Assistant with n8n: Zero‑Code RAG in 30 Minutes

This guide shows how to create a fully local Retrieval‑Augmented Generation (RAG) system using n8n, Docker, Ollama and the free Qwen3 embedding model, enabling secure, up‑to‑date AI assistants that answer enterprise questions without exposing any proprietary data.

AI AssistantDockerEmbedding

0 likes · 17 min read

Build a Private AI Knowledge Assistant with n8n: Zero‑Code RAG in 30 Minutes

DataFunTalk

Jun 29, 2025 · Artificial Intelligence

Large Models Boost Douyin User Experience: Expert Insights

In an interview at the DA Digital Intelligence Conference, ByteDance AI specialist Cai Conghuai explains how large language models, combined with techniques like SFT, DPO, and RAG, are reshaping Douyin's user‑experience signal detection, root‑cause analysis, and evaluation, while outlining future AI‑agent breakthroughs.

AIDPOLarge Language Models

0 likes · 12 min read

Large Models Boost Douyin User Experience: Expert Insights

Architect

Jun 28, 2025 · Artificial Intelligence

How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough

This article examines the MultiAgentPPT project, detailing its multi‑agent workflow, the four core agents that generate outlines, split topics, conduct research, and summarize results, and explains how the system retrieves data via a WeChat crawler and constructs prompts for LLM‑driven PPT creation.

AI agentsMultiAgentPPTPPT generation

0 likes · 6 min read

How MultiAgentPPT Generates Slides with AI Agents: Architecture and Code Walkthrough

Alibaba Cloud Big Data AI Platform

Jun 27, 2025 · Artificial Intelligence

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

This guide walks you through using the PAI‑LangStudio platform together with the Qwen3 large language model and Elasticsearch to create a full‑stack AI Search RAG solution, covering prerequisites, step‑by‑step configuration of model services, database connections, runtimes, knowledge bases, workflow creation, testing, and deployment for production use.

AI SearchElasticsearchLarge Language Model

0 likes · 11 min read

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

360 Zhihui Cloud Developer

Jun 27, 2025 · Operations

How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers

This article details the design, technology choices, functional modules, core implementation, performance optimizations, and future roadmap of Ops‑Nexus, an AI‑driven intelligent operations platform that streamlines alarm analysis, log processing, and host health checks for large‑scale monitoring environments.

AI OpsIntelligent OperationsLLM

0 likes · 12 min read

How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers

AI Algorithm Path

Jun 26, 2025 · Artificial Intelligence

The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System

This guide breaks down the ten core building blocks of a production‑ready RAG pipeline—from input handling and vector stores to prompt engineering, LLM inference, observability, and evaluation—showing why each piece matters, common pitfalls, and practical best‑practice recommendations.

LLMObservabilityPrompt Engineering

0 likes · 9 min read

The 10 Essential Components of a Retrieval‑Augmented Generation (RAG) System

Su San Talks Tech

Jun 26, 2025 · Artificial Intelligence

Master Spring AI Alibaba 1.0: Upgrade Guide, New Features & Real‑World Code

This article walks you through what Spring AI Alibaba 1.0 offers, highlights its major updates such as the Graph multi‑agent framework and ecosystem integrations, and provides a step‑by‑step upgrade path with Maven dependency changes, code fixes, and configuration adjustments for Java developers.

AI FrameworkMCPRAG

0 likes · 20 min read

Master Spring AI Alibaba 1.0: Upgrade Guide, New Features & Real‑World Code

DeWu Technology

Jun 25, 2025 · Artificial Intelligence

Engineering Large Language Models with Spring AI: From Basics to RAG and Function Calls

This article walks through the fundamentals of large language models, their stateless and structured-output nature, explains how Spring‑AI provides a Java‑friendly API for model integration, covers RAG architecture, the MCP protocol, and demonstrates end‑to‑end code examples for building intelligent agents.

AI integrationFunction CallingJava

0 likes · 15 min read

Engineering Large Language Models with Spring AI: From Basics to RAG and Function Calls

ITFLY8 Architecture Home

Jun 24, 2025 · Artificial Intelligence

How Transformers and Mixture-of-Experts Power Large Language Models

This article explores the role of Transformers and Mixture‑of‑Experts in large models, outlines five fine‑tuning methods, compares traditional and agentic RAG, presents classic agent design patterns, text‑chunking strategies, levels of intelligent agent systems, and explains KV‑caching techniques.

Large Language ModelsMixture of ExpertsRAG

0 likes · 2 min read

How Transformers and Mixture-of-Experts Power Large Language Models

Fun with Large Models

Jun 23, 2025 · Artificial Intelligence

Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction

This article walks through the complete GraphRAG knowledge‑graph building pipeline—text splitting, entity extraction, relation mining, community clustering, and report generation—using a concrete example from the book “The Age of Big Data,” and explains why each step improves retrieval and answer quality.

GraphRAGKnowledge GraphRAG

0 likes · 20 min read

Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction

Tech Freedom Circle

Jun 21, 2025 · Artificial Intelligence

How MCP + LLM + Agent Architecture Becomes the AI Agent’s Neural Hub and New Infrastructure

The article explains the Model Context Protocol (MCP) as a zero‑code bridge that lets large language models seamlessly access databases, external APIs, and execute code, detailing its benefits for developers and everyday users, its core components, step‑by‑step workflow, real‑world examples, and how it outperforms traditional APIs in modern AI agent systems.

AI AgentLLMMCP

0 likes · 37 min read

How MCP + LLM + Agent Architecture Becomes the AI Agent’s Neural Hub and New Infrastructure

Data Thinking Notes

Jun 19, 2025 · Artificial Intelligence

Andrew Ng on Building Agentic AI Systems: Tools, MCP, and Practical Insights

In a candid conversation, Andrew Ng and Harrison Chase explore the evolving landscape of AI agents, discussing modular toolchains, the emerging MCP standard, challenges of agent‑to‑agent communication, voice interaction latency, and the importance of rapid, technically skilled execution for successful AI product development.

AI agentsLangChainMCP

0 likes · 19 min read

Andrew Ng on Building Agentic AI Systems: Tools, MCP, and Practical Insights

DataFunSummit

Jun 19, 2025 · Artificial Intelligence

How Large Models Are Revolutionizing Douyin’s User Experience – Expert Insights

In a detailed interview, ByteDance AI specialist Cai Conghuai explains how large‑model techniques such as SFT, DPO and RAG address Douyin’s multimodal user‑experience challenges, improve signal detection, root‑cause analysis, and outline future AI‑agent breakthroughs for content platforms.

AI AlgorithmsMultimodal LearningRAG

0 likes · 11 min read

How Large Models Are Revolutionizing Douyin’s User Experience – Expert Insights

Fun with Large Models

Jun 19, 2025 · Artificial Intelligence

How GraphRAG Boosts Answer Accuracy with Knowledge Graphs (Part 1)

This article explains GraphRAG’s architecture, compares it with traditional RAG, and presents experimental results showing that GraphRAG’s knowledge‑graph‑driven retrieval markedly improves answer accuracy, especially on low‑match, multi‑paragraph queries.

GraphRAGKnowledge GraphLarge Language Models

0 likes · 11 min read

How GraphRAG Boosts Answer Accuracy with Knowledge Graphs (Part 1)

Tencent Technical Engineering

Jun 16, 2025 · Artificial Intelligence

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

This comprehensive guide walks you through the fundamentals of Retrieval‑Augmented Generation (RAG) and AI agents, explains their inner workings, shares optimization tricks, provides ready‑to‑run code snippets, and demonstrates how to evaluate performance with metrics such as recall, faithfulness, and answer relevance.

AI agentsLLMPrompt Engineering

0 likes · 36 min read

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

ITPUB

Jun 15, 2025 · Artificial Intelligence

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

This article presents a step‑by‑step guide for constructing a scalable enterprise Retrieval‑Augmented Generation (RAG) solution using the Model Context Protocol (MCP), covering architecture comparison, system design, Milvus‑backed knowledge store, Python client implementation, deployment scripts, code examples, and best‑practice recommendations.

KnowledgeBaseLLMMCP

0 likes · 22 min read

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

TAL Education Technology

Jun 13, 2025 · Operations

How Large Language Models Are Revolutionizing Fault Localization

This article explores how the rapid rise of large language models and techniques like Retrieval‑Augmented Generation, Chain‑of‑Thought prompting, and multi‑agent architectures can dramatically improve the speed, accuracy, and automation of fault localization in modern operations environments.

Agent ArchitectureCoTFault Localization

0 likes · 14 min read

How Large Language Models Are Revolutionizing Fault Localization

Instant Consumer Technology Team

Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

EmbeddingLLMMilvus

0 likes · 19 min read

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

DataFunSummit

Jun 12, 2025 · Artificial Intelligence

How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations

This article details Alibaba Cloud AI Search’s development journey, covering its dual product lines, the evolution of Agentic RAG technology, multi‑agent architectures, vector retrieval breakthroughs, GPU‑accelerated indexing, NL2SQL capabilities, deployment models, and future directions for AI‑driven search solutions.

AI SearchGPU AccelerationLarge Models

0 likes · 33 min read

How Alibaba Cloud’s AI Search Evolves with Agentic RAG and Multi‑Model Innovations

Zuoyebang Tech Team

Jun 12, 2025 · Information Security

How AI‑Powered RAG and Agents Are Revolutionizing Enterprise Security Operations

This article explains how the rise of AI large‑model technology and Retrieval‑Augmented Generation (RAG) combined with autonomous AI agents enable a three‑layer network‑boundary defense, address deep operational challenges such as alert overload and response latency, and dramatically improve incident‑response efficiency in large‑scale enterprises.

AI agentsAI securityIncident Response

0 likes · 16 min read

How AI‑Powered RAG and Agents Are Revolutionizing Enterprise Security Operations

Data Thinking Notes

Jun 11, 2025 · Artificial Intelligence

How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×

This case study details how a government‑focused AI system using retrieval‑augmented generation (RAG) and advanced preprocessing algorithms increased data labeling speed by up to five times, raised accuracy above 95%, and produced high‑quality enterprise, spatial, and economic datasets.

AIAutomationGovernment

0 likes · 5 min read

How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×

Sohu Tech Products

Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AIDatabaseDeepSeek

0 likes · 15 min read

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

Alibaba Cloud Developer

Jun 10, 2025 · Artificial Intelligence

How AI Application Architectures Evolve: From Simple LLM Calls to Guardrails, Routing, and Agents

This article traces the evolution of AI application architectures—from the earliest minimal user‑LLM interaction to advanced designs featuring context enhancement, input/output guardrails, intent routing, model gateways, caching strategies, agent capabilities, monitoring, and inference performance optimizations—providing practical insights and references for developers.

AI ArchitectureAgentCaching

0 likes · 21 min read

How AI Application Architectures Evolve: From Simple LLM Calls to Guardrails, Routing, and Agents

Data Thinking Notes

Jun 8, 2025 · Artificial Intelligence

Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries

This article presents a panoramic view of AI large‑model technologies, showcasing a series of architecture diagrams that illustrate general model frameworks, RAG knowledge‑base structures, agricultural and retail applications, IoT integration, compliance and risk‑management setups, agent platforms, and CRM‑enhanced solutions.

AILarge ModelsRAG

0 likes · 3 min read

Explore the Complete AI Large Model Technology Landscape: Architecture Diagrams Across Industries

Qborfy AI

Jun 7, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) Chatbot with LangChain and Streamlit

This guide walks through the complete process of creating a RAG‑powered question‑answering bot using LangChain, Streamlit, and vector‑store embeddings, covering theory, architecture, data loading, chunking, vector indexing, retrieval, LLM integration, and full code implementation with practical examples.

ChatbotEmbeddingsLangChain

0 likes · 13 min read

Build a Retrieval‑Augmented Generation (RAG) Chatbot with LangChain and Streamlit

AI2ML AI to Machine Learning

Jun 6, 2025 · Artificial Intelligence

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

The article enumerates common pitfalls of Retrieval‑Augmented Generation—such as missing content, low‑rank document misses, context limits, format errors, incomplete answers, scalability bottlenecks, complex PDF extraction, data‑quality issues, domain adaptation gaps, hallucinations, and feedback‑loop deficiencies—and offers concrete mitigation strategies ranging from data cleaning and prompt design to hybrid search, hierarchical retrieval, document compression, and automated evaluation.

Hybrid SearchLLMPrompt Engineering

0 likes · 9 min read

Tackling the Top Challenges of Retrieval‑Augmented Generation (RAG)

IT Services Circle

Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AIKnowledge retrievalRAG

0 likes · 12 min read

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

Didi Tech

Jun 5, 2025 · Artificial Intelligence

Unlocking Modern AI Application Architecture: From RAG to Agents and MCP

This article surveys the evolution of AI applications, explains large language model fundamentals, outlines architectural challenges, and introduces three core patterns—Retrieval‑Augmented Generation (RAG), autonomous Agents, and Model Context Protocol (MCP)—while providing practical LangChain code snippets and integration guidance.

AIAgentLLM

0 likes · 28 min read

Unlocking Modern AI Application Architecture: From RAG to Agents and MCP

Instant Consumer Technology Team

Jun 4, 2025 · Artificial Intelligence

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

This comprehensive article examines Retrieval‑Augmented Generation (RAG), covering its historical evolution, core theory, implementation variants, practical code examples, diverse applications, current controversies, and future research directions within the AI and NLP landscape.

Artificial IntelligenceGenerative ModelsRAG

0 likes · 21 min read

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

Full-Stack Internet Architecture

Jun 4, 2025 · Artificial Intelligence

Key AI Concepts for Spring AI: Models, Prompts, Embeddings, Tokens, Structured Output, and RAG

This article introduces essential AI concepts—including models, prompts and prompt templates, embeddings, tokens, structured output, and Retrieval‑Augmented Generation—explaining their meanings and relevance for effectively using Spring AI in real‑world applications.

AIEmbeddingsMachine Learning

0 likes · 7 min read

Key AI Concepts for Spring AI: Models, Prompts, Embeddings, Tokens, Structured Output, and RAG

Satori Komeiji's Programming Classroom

Jun 3, 2025 · Artificial Intelligence

Everything You Need to Know About Retrieval‑Augmented Generation (RAG)

The article explains Retrieval‑Augmented Generation (RAG) by describing how a programmer, frustrated with oversized prompts for a large language model, discovers that retrieving relevant document fragments, embedding them, and feeding the augmented context to the model yields accurate, fact‑based answers.

AIChunkingEmbedding

0 likes · 6 min read

Everything You Need to Know About Retrieval‑Augmented Generation (RAG)

Fighter's World

Jun 2, 2025 · Artificial Intelligence

Why Is Context King for Large Language Models?

This article provides a comprehensive technical analysis of LLM context, covering its definition, types, tokenization, window‑size evolution, diminishing returns, management techniques such as RAG, CoT, memory‑as‑a‑service, and future challenges like multimodal fusion, privacy, and autonomous agent memory.

Agent MemoryContext ManagementLLM

0 likes · 48 min read

Why Is Context King for Large Language Models?

dbaplus Community

May 31, 2025 · Artificial Intelligence

How RAG is Shaping the Future of AI-Powered User Experience

Amid the rapid rise of large language models, this article examines RAG’s development, technical hurdles, core strategies, and future outlook, illustrating how Alibaba’s Chatbot and Copilot projects boost retrieval accuracy to 90% and generation precision to 85% while tackling data quality, heterogeneous retrieval, and evaluation challenges.

AI SearchRAGevaluation metrics

0 likes · 27 min read

How RAG is Shaping the Future of AI-Powered User Experience

ITFLY8 Architecture Home

May 30, 2025 · Artificial Intelligence

Explore the Full Spectrum of AI Large Model Architectures

This article presents a comprehensive visual collection of AI large‑model architecture diagrams, covering general frameworks, RAG knowledge‑base systems, agriculture, e‑commerce recommendation, IoT, compliance risk management, agent platforms, and CRM integration, offering a panoramic view of modern AI infrastructure.

AIComplianceIoT

0 likes · 3 min read

Explore the Full Spectrum of AI Large Model Architectures

Instant Consumer Technology Team

May 29, 2025 · Artificial Intelligence

API vs GUI Agents: How to Choose the Right LLM Automation Approach

This article examines the evolution of large language model agents, contrasting API‑based agents that use predefined function calls with GUI‑based agents that interact with visual interfaces, and explores hybrid strategies, orchestration tools, RAG techniques, and practical guidelines for selecting the optimal paradigm.

API vs GUIHybrid automationLLM Agents

0 likes · 34 min read

API vs GUI Agents: How to Choose the Right LLM Automation Approach

DevOps

May 28, 2025 · Artificial Intelligence

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Google researchers introduce a “sufficient context” framework that classifies retrieved passages as adequate or inadequate for answering a query, enabling large language models in enterprise RAG systems to decide when to answer, refuse, or request more information, thereby improving accuracy and reducing hallucinations.

AI reliabilityEnterprise AILarge Language Models

0 likes · 9 min read

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Sohu Tech Products

May 28, 2025 · Artificial Intelligence

Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform

AIFlowy is a Java‑powered, open‑source, enterprise‑grade AI platform that offers a bot for natural‑language interaction, extensible plugins, a knowledge‑base with RAG support, and visual workflow automation, enabling developers to quickly build and customize AI applications for domestic B2B scenarios.

AIBotJava

0 likes · 10 min read

Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform

phodal

May 28, 2025 · Artificial Intelligence

Boost Code Retrieval with AutoDev’s Pre‑Generated Context Worker

The article explains how AutoDev’s Context Worker pre‑generates semantic code context to improve RAG performance, outlines the limitations of vector‑based retrieval, describes the tool’s multi‑language AST analysis, knowledge‑graph construction, and provides command‑line usage examples for integrating the generated context into AI‑driven development workflows.

AIASTCLI

0 likes · 8 min read

Boost Code Retrieval with AutoDev’s Pre‑Generated Context Worker

Coder Circle

May 28, 2025 · Artificial Intelligence

Core AI Concepts Every Spring AI Developer Should Know

This article explains fundamental AI concepts—including models, prompts, prompt templates, embeddings, tokens, structured output, data integration, RAG, and tool calling—and shows how Spring AI simplifies their use for Java developers building intelligent applications.

AI modelsEmbeddingsPrompt Engineering

0 likes · 13 min read

Core AI Concepts Every Spring AI Developer Should Know

Architect

May 23, 2025 · Artificial Intelligence

How We Won the RAG Challenge: Multi‑Router & Dynamic Knowledge Base Techniques Revealed

This article details the end‑to‑end design, parsing tricks, vector database setup, retrieval strategies, prompt engineering, and LLM reranking that powered the winning solution in a company‑annual‑report question‑answering competition.

FAISSLLMPDF parsing

0 likes · 37 min read

How We Won the RAG Challenge: Multi‑Router & Dynamic Knowledge Base Techniques Revealed

Eric Tech Circle

May 22, 2025 · Artificial Intelligence

Build a Fast, Zero‑Cost Local AI Knowledge Base with Ollama, Cherry Studio, and Qwen‑3

This guide walks you through building a high‑performance local AI knowledge base using Ollama, Cherry Studio, and the Qwen‑3 model, covering RAG fundamentals, model selection, document preparation, system configuration, and step‑by‑step UI operations for non‑programmers.

AIKnowledge BaseOllama

0 likes · 13 min read

Build a Fast, Zero‑Cost Local AI Knowledge Base with Ollama, Cherry Studio, and Qwen‑3

Programmer DD

May 21, 2025 · Artificial Intelligence

What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features

Spring AI 1.0 GA introduces a comprehensive suite of AI capabilities for Java developers, including a ChatClient supporting 20 models, vector‑store integrations, RAG pipelines, advanced chat memory, @Tool function calling, model evaluation, observability, Model Context Protocol, and autonomous agents, with examples for major cloud providers.

AI modelsJavaMCP

0 likes · 6 min read

What’s New in Spring AI 1.0 GA? A Deep Dive into Java AI Features

Java Tech Enthusiast

May 21, 2025 · Artificial Intelligence

How ChatGPT's New Memory Feature Works: Technical Analysis and Implementation Details

The article provides a detailed technical breakdown of OpenAI's new ChatGPT memory feature, explaining its two memory modes, underlying sub‑systems, possible implementation approaches using vector stores and scheduled jobs, and early user feedback highlighting both benefits and bugs.

AIChatGPTRAG

0 likes · 8 min read

How ChatGPT's New Memory Feature Works: Technical Analysis and Implementation Details

Java Architecture Diary

May 21, 2025 · Artificial Intelligence

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Spring AI 1.0, the first production‑grade Java AI framework, introduces ready‑to‑use APIs, seamless model integration, enterprise‑level RAG engine, smart tool calling, and three development modes, empowering developers to rapidly build, customize, and fully control AI applications with major model providers like OpenAI, Anthropic, DeepSeek.

AI FrameworkDeepSeekJava AI

0 likes · 13 min read

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

DeWu Technology

May 19, 2025 · Artificial Intelligence

AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans

This article presents a comprehensive AI-driven solution for automatically generating functional test cases, detailing the AI background, design scheme, core components such as PRD parsing, test‑point generation, test‑case creation, knowledge‑base construction, implementation results, and future development directions.

AIKnowledge BaseLLM

0 likes · 7 min read

AI-Powered Automated Test Case Generation: Design, Implementation, and Future Plans

Tencent Technical Engineering

May 19, 2025 · Artificial Intelligence

RAG, Agents, and Multimodal Large Models: Evolution, Challenges, and Future Trends

This article examines the evolution of large model technologies—including Retrieval‑Augmented Generation, AI agents, and multimodal models—detailing their technical foundations, practical challenges, industry applications, and future development trends, offering a comprehensive perspective for AI practitioners and researchers.

AI AgentKnowledge retrievalMultimodal

0 likes · 14 min read

RAG, Agents, and Multimodal Large Models: Evolution, Challenges, and Future Trends

DataFunSummit

May 13, 2025 · Artificial Intelligence

Integrating Large Language Models and Knowledge Graphs for Financial Applications: Challenges, Solutions, and Future Directions

This talk explores the technical challenges of applying large language models and knowledge graphs in finance, discusses solutions such as RAG enhancements, graph‑guided retrieval, multimodal extensions, and presents future research directions including multimodal graph integration, agentic systems, and decision‑making applications.

AIAgentic SystemsFinance

0 likes · 33 min read

Integrating Large Language Models and Knowledge Graphs for Financial Applications: Challenges, Solutions, and Future Directions

DataFunSummit

May 9, 2025 · Artificial Intelligence

Practical Experience Building Zhihu Direct Answer: An AI‑Powered Search Product

This article presents a comprehensive overview of Zhihu Direct Answer, describing its AI‑driven search architecture, RAG framework, query understanding, retrieval, chunking, reranking, generation, evaluation mechanisms, engineering optimizations, and the professional edition, while sharing concrete performance‑boosting practices and future development plans.

AIProduct DevelopmentRAG

0 likes · 14 min read

Practical Experience Building Zhihu Direct Answer: An AI‑Powered Search Product

Alibaba Cloud Native

May 9, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch

This tutorial walks through building a Retrieval‑Augmented Generation (RAG) system by combining LangChain for document processing, Elasticsearch’s vector store with the ELSER v2 model for semantic search, and Higress as a cloud‑native AI gateway, complete with deployment scripts, code examples, and query testing.

AIHigressLangChain

0 likes · 15 min read

Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch

phodal

May 9, 2025 · Artificial Intelligence

Why Pre‑Generated Context Is the Key to Faster, More Accurate AI Code Retrieval

The article examines how pre‑generating structured context for codebases can overcome the uncertainty and quality issues of traditional Retrieval‑Augmented Generation, outlines the technical and business challenges of RAG, compares existing code‑search tools, and introduces AutoDev’s Context Worker as a practical solution.

AILLMRAG

0 likes · 11 min read

Why Pre‑Generated Context Is the Key to Faster, More Accurate AI Code Retrieval