Topic

RAG

Collection size
167 articles
Page 6 of 9
Sohu Tech Products
Sohu Tech Products
May 28, 2025 · Artificial Intelligence

Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform

AIFlowy is a Java‑powered, open‑source, enterprise‑grade AI platform that offers a bot for natural‑language interaction, extensible plugins, a knowledge‑base with RAG support, and visual workflow automation, enabling developers to quickly build and customize AI applications for domestic B2B scenarios.

AIBotJava
0 likes · 10 min read
Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform
DevOps
DevOps
May 28, 2025 · Artificial Intelligence

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Google researchers introduce a “sufficient context” framework that classifies retrieved passages as adequate or inadequate for answering a query, enabling large language models in enterprise RAG systems to decide when to answer, refuse, or request more information, thereby improving accuracy and reducing hallucinations.

AI ReliabilityContext EvaluationEnterprise AI
0 likes · 9 min read
Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems
Selected Java Interview Questions
Selected Java Interview Questions
Aug 18, 2024 · Backend Development

Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

Redis has launched a multi‑threaded query engine that vertically scales its in‑memory database, dramatically increasing query throughput and lowering latency for vector similarity searches, thereby addressing the performance demands of real‑time retrieval‑augmented generation in generative AI applications.

BackendGenerative AIMultithreading
0 likes · 9 min read
Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI
IT Services Circle
IT Services Circle
Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AIRAGRetrieval-Augmented Generation
0 likes · 12 min read
Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices
Code Mala Tang
Code Mala Tang
Sep 12, 2024 · Artificial Intelligence

Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js

This article explains the core concepts of Retrieval‑Augmented Generation (RAG), walks through its implementation steps with LangChain.js—including text chunking, embedding, storage, retrieval, and generation—and showcases practical use cases, challenges, and best practices for building reliable AI‑powered applications.

AI applicationsLLMLangChain
0 likes · 16 min read
Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js
Architecture & Thinking
Architecture & Thinking
Jun 19, 2024 · Artificial Intelligence

Build AI‑Native Apps Quickly with Spring AI: From Chat Models to RAG

This guide explains what an AI‑native application is, compares AI‑native and AI‑based approaches, and walks through Spring AI’s core features—including chat models, prompt templates, function calling, structured output, image generation, embedding, and vector stores—showing step‑by‑step code examples and how to assemble a complete AI‑native app with RAG support.

AI native applicationJavaPrompt Engineering
0 likes · 43 min read
Build AI‑Native Apps Quickly with Spring AI: From Chat Models to RAG
Spring Full-Stack Practical Cases
Spring Full-Stack Practical Cases
Apr 10, 2025 · Artificial Intelligence

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

This guide walks through creating a Retrieval‑Augmented Generation (RAG) system using Spring Boot 3.4.2, Milvus vector database, and the bge‑m3 embedding model via Ollama, covering environment setup, dependency configuration, vector store operations, and integration with a large language model to deliver refined, similarity‑based answers.

LLMMilvusRAG
0 likes · 11 min read
Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama
Java Architecture Diary
Java Architecture Diary
May 21, 2025 · Artificial Intelligence

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Spring AI 1.0, the first production‑grade Java AI framework, introduces ready‑to‑use APIs, seamless model integration, enterprise‑level RAG engine, smart tool calling, and three development modes, empowering developers to rapidly build, customize, and fully control AI applications with major model providers like OpenAI, Anthropic, DeepSeek.

AI FrameworkDeepSeekJava AI
0 likes · 13 min read
Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled
Java Architecture Diary
Java Architecture Diary
Apr 14, 2025 · Artificial Intelligence

How to Empower LLMs with a Private SearXNG Search Engine for Real‑Time Knowledge

This guide explains why large language models need private search capabilities, outlines the benefits of a self‑hosted SearXNG engine, provides step‑by‑step Docker deployment, and demonstrates Java integration using LangChain4j for both basic queries and retrieval‑augmented generation (RAG).

DockerLLMLangChain4j
0 likes · 6 min read
How to Empower LLMs with a Private SearXNG Search Engine for Real‑Time Knowledge
Sohu Tech Products
Sohu Tech Products
Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AIDatabaseDeepSeek
0 likes · 15 min read
How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era
macrozheng
macrozheng
Feb 17, 2025 · Artificial Intelligence

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

This guide explains why DeepSeek4j is needed, its core features, and provides step‑by‑step instructions—including dependency setup, configuration, code examples, and a complete RAG pipeline using Milvus—to help developers quickly create a private AI knowledge base with Spring Boot.

AIDeepSeek4jJava
0 likes · 12 min read
Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot
macrozheng
macrozheng
Jan 20, 2025 · Artificial Intelligence

How Redis’s New Multithreaded Query Engine Boosts Vector Search for Real‑Time AI Apps

Redis has introduced a multithreaded query engine that dramatically lowers latency and multiplies throughput for vector‑based retrieval, enabling real‑time RAG applications to approach the 100 ms response target while scaling vertically to billions of documents.

AI performanceMultithreadingRAG
0 likes · 6 min read
How Redis’s New Multithreaded Query Engine Boosts Vector Search for Real‑Time AI Apps
Tencent Tech
Tencent Tech
Dec 11, 2024 · Artificial Intelligence

Inside Tencent LeYong AI: Solving Enterprise RAG with Knowledge, Engineering & Algorithms

This article explores how Tencent's LeYong AI assistant leverages Retrieval‑Augmented Generation to empower enterprise knowledge retrieval, detailing three capability dimensions—knowledge management, engineering, and algorithmic—along with eight sub‑areas such as knowledge boundaries, quality, permissions, multimodal handling, long‑context span, and complex reasoning.

AI assistantsEnterprise AIRAG
0 likes · 18 min read
Inside Tencent LeYong AI: Solving Enterprise RAG with Knowledge, Engineering & Algorithms
Tencent Technical Engineering
Tencent Technical Engineering
Jun 16, 2025 · Artificial Intelligence

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

This comprehensive guide walks you through the fundamentals of Retrieval‑Augmented Generation (RAG) and AI agents, explains their inner workings, shares optimization tricks, provides ready‑to‑run code snippets, and demonstrates how to evaluate performance with metrics such as recall, faithfulness, and answer relevance.

AI agentsLLMPrompt Engineering
0 likes · 36 min read
Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies
Java Tech Enthusiast
Java Tech Enthusiast
Oct 8, 2024 · Artificial Intelligence

Spring AI Framework for Java Developers

Spring AI is a Java‑centric framework that unifies access to chat, text‑to‑image, embedding and retrieval‑augmented generation models—including OpenAI, Anthropic and Alibaba’s Tongyi Qianwen—through synchronous or asynchronous APIs, POJO mapping, function calling, vector‑store integration and fluent tooling for rapid AI agent development.

AI frameworksJava developmentRAG
0 likes · 5 min read
Spring AI Framework for Java Developers
DaTaobao Tech
DaTaobao Tech
Mar 19, 2025 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Principles, Challenges, and Implementation Techniques

Retrieval‑augmented generation (RAG) enhances large language models by integrating a preprocessing pipeline—cleaning, chunking, embedding, and vector storage—with a query‑driven retrieval and prompt‑injection workflow, leveraging vector databases, multi‑stage recall, advanced prompting, and comprehensive evaluation metrics to mitigate knowledge cut‑off, hallucinations, and security issues.

LLMRAGRetrieval-Augmented Generation
0 likes · 27 min read
Retrieval Augmented Generation (RAG): Principles, Challenges, and Implementation Techniques
DaTaobao Tech
DaTaobao Tech
Aug 12, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

Deploying large language models faces domain gaps, hallucinations, and high barriers, so Retrieval‑Augmented Generation (RAG) combines retrieval with generation, and advanced optimizations—such as RAPTOR’s hierarchical clustering, Self‑RAG’s self‑reflective retrieval, CRAG’s corrective evaluator, proposition‑level Dense X Retrieval, sophisticated chunking, query rewriting, and hybrid sparse‑dense methods—are essential for improving accuracy, reducing hallucinations, and achieving efficient, scalable performance.

AIOptimizationRAG
0 likes · 22 min read
Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)
DaTaobao Tech
DaTaobao Tech
Dec 27, 2023 · Artificial Intelligence

Deploying a Private LLM Knowledge Base on a MacBook

The guide walks through installing and quantizing the open‑source ChatGLM3‑6B model and the m3e‑base embedder on a MacBook, wrapping them with a FastAPI OpenAI‑compatible service, routing requests through a One‑API gateway, storing metadata in MongoDB and vectors in PostgreSQL pgvector, deploying FastGPT for RAG, ingesting data, and demonstrating 5‑7 second response times, while outlining future improvements.

ChatGLM3LLMMacBook
0 likes · 23 min read
Deploying a Private LLM Knowledge Base on a MacBook
37 Interactive Technology Team
37 Interactive Technology Team
Aug 12, 2024 · Backend Development

Intelligent Backend Menu Search with OpenAI Embeddings, LangChain, and DIFY

The article demonstrates how to improve backend menu navigation by building a knowledge base of menu metadata, generating concise Chinese descriptions with OpenAI embeddings, and implementing RAG retrieval using both LangChain code orchestration and DIFY’s visual workflow, highlighting each approach’s flexibility and ease of use.

Backend SearchDIFYLangChain
0 likes · 9 min read
Intelligent Backend Menu Search with OpenAI Embeddings, LangChain, and DIFY
37 Interactive Technology Team
37 Interactive Technology Team
Aug 5, 2024 · Artificial Intelligence

Case Study: Applying AIGC to Component Activity Business with Dify

This case study shows how AIGC, implemented through Dify’s low‑code platform, enables a natural‑language AI assistant to recommend and insert the optimal components from a 200‑plus library, streamlining selection, building an embedding‑based knowledge base, exposing a RAG‑driven agent via API, and demonstrating rapid AI‑business validation compared with custom frameworks.

AI AgentAIGCDify
0 likes · 8 min read
Case Study: Applying AIGC to Component Activity Business with Dify