RAG | BestHub

Collection size

167 articles

Page 6 of 9

Sohu Tech Products

May 28, 2025 · Artificial Intelligence

Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform

AIFlowy is a Java‑powered, open‑source, enterprise‑grade AI platform that offers a bot for natural‑language interaction, extensible plugins, a knowledge‑base with RAG support, and visual workflow automation, enabling developers to quickly build and customize AI applications for domestic B2B scenarios.

AIBotJava

0 likes · 10 min read

Introducing AIFlowy: An Open‑Source Java‑Based One‑Stop AI Application Development Platform

DevOps

May 28, 2025 · Artificial Intelligence

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Google researchers introduce a “sufficient context” framework that classifies retrieved passages as adequate or inadequate for answering a query, enabling large language models in enterprise RAG systems to decide when to answer, refuse, or request more information, thereby improving accuracy and reducing hallucinations.

AI ReliabilityContext EvaluationEnterprise AI

0 likes · 9 min read

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Selected Java Interview Questions

Aug 18, 2024 · Backend Development

Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

Redis has launched a multi‑threaded query engine that vertically scales its in‑memory database, dramatically increasing query throughput and lowering latency for vector similarity searches, thereby addressing the performance demands of real‑time retrieval‑augmented generation in generative AI applications.

BackendGenerative AIMultithreading

0 likes · 9 min read

Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

IT Services Circle

Jun 6, 2025 · Artificial Intelligence

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

This article introduces Retrieval‑Augmented Generation (RAG), explains its core components—knowledge embedding, retriever, and generator—covers practical system construction, optimization techniques, evaluation metrics, and advanced paradigms such as GraphRAG and Multi‑Modal RAG, while highlighting a comprehensive guidebook for hands‑on implementation.

AIRAGRetrieval-Augmented Generation

0 likes · 12 min read

Master Retrieval‑Augmented Generation (RAG): From Basics to Advanced Practices

Code Mala Tang

Sep 12, 2024 · Artificial Intelligence

Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js

This article explains the core concepts of Retrieval‑Augmented Generation (RAG), walks through its implementation steps with LangChain.js—including text chunking, embedding, storage, retrieval, and generation—and showcases practical use cases, challenges, and best practices for building reliable AI‑powered applications.

AI applicationsLLMLangChain

0 likes · 16 min read

Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js

Architecture & Thinking

Jun 19, 2024 · Artificial Intelligence

Build AI‑Native Apps Quickly with Spring AI: From Chat Models to RAG

This guide explains what an AI‑native application is, compares AI‑native and AI‑based approaches, and walks through Spring AI’s core features—including chat models, prompt templates, function calling, structured output, image generation, embedding, and vector stores—showing step‑by‑step code examples and how to assemble a complete AI‑native app with RAG support.

AI native applicationJavaPrompt Engineering

0 likes · 43 min read

Build AI‑Native Apps Quickly with Spring AI: From Chat Models to RAG

Spring Full-Stack Practical Cases

Apr 10, 2025 · Artificial Intelligence

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

This guide walks through creating a Retrieval‑Augmented Generation (RAG) system using Spring Boot 3.4.2, Milvus vector database, and the bge‑m3 embedding model via Ollama, covering environment setup, dependency configuration, vector store operations, and integration with a large language model to deliver refined, similarity‑based answers.

LLMMilvusRAG

0 likes · 11 min read

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

Java Architecture Diary

May 21, 2025 · Artificial Intelligence

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Spring AI 1.0, the first production‑grade Java AI framework, introduces ready‑to‑use APIs, seamless model integration, enterprise‑level RAG engine, smart tool calling, and three development modes, empowering developers to rapidly build, customize, and fully control AI applications with major model providers like OpenAI, Anthropic, DeepSeek.

AI FrameworkDeepSeekJava AI

0 likes · 13 min read

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Java Architecture Diary

Apr 14, 2025 · Artificial Intelligence

How to Empower LLMs with a Private SearXNG Search Engine for Real‑Time Knowledge

This guide explains why large language models need private search capabilities, outlines the benefits of a self‑hosted SearXNG engine, provides step‑by‑step Docker deployment, and demonstrates Java integration using LangChain4j for both basic queries and retrieval‑augmented generation (RAG).

DockerLLMLangChain4j

0 likes · 6 min read

How to Empower LLMs with a Private SearXNG Search Engine for Real‑Time Knowledge

Sohu Tech Products

Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AIDatabaseDeepSeek

0 likes · 15 min read

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

macrozheng

Feb 17, 2025 · Artificial Intelligence

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

This guide explains why DeepSeek4j is needed, its core features, and provides step‑by‑step instructions—including dependency setup, configuration, code examples, and a complete RAG pipeline using Milvus—to help developers quickly create a private AI knowledge base with Spring Boot.

AIDeepSeek4jJava

0 likes · 12 min read

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

macrozheng

Jan 20, 2025 · Artificial Intelligence

How Redis’s New Multithreaded Query Engine Boosts Vector Search for Real‑Time AI Apps

Redis has introduced a multithreaded query engine that dramatically lowers latency and multiplies throughput for vector‑based retrieval, enabling real‑time RAG applications to approach the 100 ms response target while scaling vertically to billions of documents.

AI performanceMultithreadingRAG

0 likes · 6 min read

How Redis’s New Multithreaded Query Engine Boosts Vector Search for Real‑Time AI Apps

Tencent Tech

Dec 11, 2024 · Artificial Intelligence

Inside Tencent LeYong AI: Solving Enterprise RAG with Knowledge, Engineering & Algorithms

This article explores how Tencent's LeYong AI assistant leverages Retrieval‑Augmented Generation to empower enterprise knowledge retrieval, detailing three capability dimensions—knowledge management, engineering, and algorithmic—along with eight sub‑areas such as knowledge boundaries, quality, permissions, multimodal handling, long‑context span, and complex reasoning.

AI assistantsEnterprise AIRAG

0 likes · 18 min read

Inside Tencent LeYong AI: Solving Enterprise RAG with Knowledge, Engineering & Algorithms

Tencent Technical Engineering

Jun 16, 2025 · Artificial Intelligence

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

This comprehensive guide walks you through the fundamentals of Retrieval‑Augmented Generation (RAG) and AI agents, explains their inner workings, shares optimization tricks, provides ready‑to‑run code snippets, and demonstrates how to evaluate performance with metrics such as recall, faithfulness, and answer relevance.

AI agentsLLMPrompt Engineering

0 likes · 36 min read

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

Java Tech Enthusiast

Oct 8, 2024 · Artificial Intelligence

Spring AI Framework for Java Developers

Spring AI is a Java‑centric framework that unifies access to chat, text‑to‑image, embedding and retrieval‑augmented generation models—including OpenAI, Anthropic and Alibaba’s Tongyi Qianwen—through synchronous or asynchronous APIs, POJO mapping, function calling, vector‑store integration and fluent tooling for rapid AI agent development.

AI frameworksJava developmentRAG

0 likes · 5 min read

DaTaobao Tech

Mar 19, 2025 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Principles, Challenges, and Implementation Techniques

Retrieval‑augmented generation (RAG) enhances large language models by integrating a preprocessing pipeline—cleaning, chunking, embedding, and vector storage—with a query‑driven retrieval and prompt‑injection workflow, leveraging vector databases, multi‑stage recall, advanced prompting, and comprehensive evaluation metrics to mitigate knowledge cut‑off, hallucinations, and security issues.

LLMRAGRetrieval-Augmented Generation

0 likes · 27 min read

Retrieval Augmented Generation (RAG): Principles, Challenges, and Implementation Techniques

DaTaobao Tech

Aug 12, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

Deploying large language models faces domain gaps, hallucinations, and high barriers, so Retrieval‑Augmented Generation (RAG) combines retrieval with generation, and advanced optimizations—such as RAPTOR’s hierarchical clustering, Self‑RAG’s self‑reflective retrieval, CRAG’s corrective evaluator, proposition‑level Dense X Retrieval, sophisticated chunking, query rewriting, and hybrid sparse‑dense methods—are essential for improving accuracy, reducing hallucinations, and achieving efficient, scalable performance.

AIOptimizationRAG

0 likes · 22 min read

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

DaTaobao Tech

Dec 27, 2023 · Artificial Intelligence

Deploying a Private LLM Knowledge Base on a MacBook

The guide walks through installing and quantizing the open‑source ChatGLM3‑6B model and the m3e‑base embedder on a MacBook, wrapping them with a FastAPI OpenAI‑compatible service, routing requests through a One‑API gateway, storing metadata in MongoDB and vectors in PostgreSQL pgvector, deploying FastGPT for RAG, ingesting data, and demonstrating 5‑7 second response times, while outlining future improvements.

ChatGLM3LLMMacBook

0 likes · 23 min read

Deploying a Private LLM Knowledge Base on a MacBook

37 Interactive Technology Team

Aug 12, 2024 · Backend Development

Intelligent Backend Menu Search with OpenAI Embeddings, LangChain, and DIFY

The article demonstrates how to improve backend menu navigation by building a knowledge base of menu metadata, generating concise Chinese descriptions with OpenAI embeddings, and implementing RAG retrieval using both LangChain code orchestration and DIFY’s visual workflow, highlighting each approach’s flexibility and ease of use.

Backend SearchDIFYLangChain

0 likes · 9 min read

Intelligent Backend Menu Search with OpenAI Embeddings, LangChain, and DIFY

37 Interactive Technology Team

Aug 5, 2024 · Artificial Intelligence

Case Study: Applying AIGC to Component Activity Business with Dify

This case study shows how AIGC, implemented through Dify’s low‑code platform, enables a natural‑language AI assistant to recommend and insert the optimal components from a 200‑plus library, streamlining selection, building an embedding‑based knowledge base, exposing a RAG‑driven agent via API, and demonstrating rapid AI‑business validation compared with custom frameworks.

AI AgentAIGCDify

0 likes · 8 min read

Case Study: Applying AIGC to Component Activity Business with Dify