Tagged articles
924 articles
Page 8 of 10
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 15, 2025 · Artificial Intelligence

Build an Education‑Focused RAG Solution Using Alibaba PAI

This guide explains how to create a Retrieval‑Augmented Generation (RAG) solution for education on Alibaba PAI, covering knowledge‑base construction with PAI‑Designer, model deployment, connection setup in LangStudio, workflow configuration, online deployment, and a legal‑domain case comparison that highlights RAG's accuracy benefits.

Alibaba PAIEmbeddingKnowledge Base
0 likes · 14 min read
Build an Education‑Focused RAG Solution Using Alibaba PAI
DataFunSummit
DataFunSummit
Jan 11, 2025 · Artificial Intelligence

Generative AI Applications, MLOps, and LLMOps: A Comprehensive Overview

This article presents a detailed overview of generative AI lifecycle management, covering practical use cases such as email summarization, the roles of providers, fine‑tuners and consumers, MLOps/LLMOps processes, retrieval‑augmented generation, efficient fine‑tuning methods like PEFT, and Amazon Bedrock services for model deployment and monitoring.

Amazon BedrockLLMOpsMLOps
0 likes · 14 min read
Generative AI Applications, MLOps, and LLMOps: A Comprehensive Overview
JD Tech Talk
JD Tech Talk
Jan 9, 2025 · Artificial Intelligence

Practical Guide to Building Retrieval‑Augmented Generation (RAG) Applications with LangChain4j in Java

This article provides a step‑by‑step tutorial for Java engineers on using the LangChain4j framework to implement Retrieval‑Augmented Generation (RAG) with large language models, covering concepts, environment setup, code integration, document splitting, embedding, vector‑store operations, and prompt engineering.

EmbeddingJavaLangChain4j
0 likes · 35 min read
Practical Guide to Building Retrieval‑Augmented Generation (RAG) Applications with LangChain4j in Java
JD Cloud Developers
JD Cloud Developers
Jan 9, 2025 · Artificial Intelligence

Boost Your Java Apps with LangChain4j: A Hands‑On RAG Guide

This article walks Java developers through the fundamentals of Retrieval‑Augmented Generation (RAG), explains the LangChain4j framework, compares large‑model development with traditional Java coding, and provides step‑by‑step code examples for environment setup, document splitting, embedding, vector‑store operations, and LLM interaction.

EmbeddingJavaLangChain4j
0 likes · 34 min read
Boost Your Java Apps with LangChain4j: A Hands‑On RAG Guide
DevOps
DevOps
Jan 8, 2025 · Artificial Intelligence

Designing Generative AI Agents: Models, Tools, Extensions, Function Calls, and Data Storage

The article explains how generative AI agents combine language models, tool integration, self‑guided planning, prompt‑engineering frameworks, extensions, function calls, and vector‑based data storage to create adaptable, retrieval‑augmented systems that can interact with real‑world APIs and perform complex tasks.

RAGdata storageextensions
0 likes · 12 min read
Designing Generative AI Agents: Models, Tools, Extensions, Function Calls, and Data Storage
DeWu Technology
DeWu Technology
Jan 6, 2025 · Artificial Intelligence

Design and Implementation of a Retrieval‑Augmented Generation (RAG) Answering Assistant for the Dewu Open Platform

The paper describes building a Retrieval‑Augmented Generation assistant for the Dewu Open Platform that leverages GPT‑4o‑mini, OpenAI embeddings, Milvus vector store, and LangChain.js to semantically retrieve API documentation, structure user queries, and generate accurate, JSON‑formatted answers, thereby reducing manual support and hallucinations.

AILLMLangChain
0 likes · 28 min read
Design and Implementation of a Retrieval‑Augmented Generation (RAG) Answering Assistant for the Dewu Open Platform
DataFunTalk
DataFunTalk
Jan 6, 2025 · Artificial Intelligence

Building and Applying NIO's Enterprise Knowledge Platform: Architecture, Challenges, and Future Directions

This article presents a comprehensive overview of NIO's company‑wide knowledge platform, detailing its background, layered architecture, retrieval‑augmented generation workflow, challenges such as accuracy, permission control and high concurrency, and future plans for AI‑assisted understanding, creation, multimodal capabilities, and expanded knowledge types.

AIEnterprise ArchitectureRAG
0 likes · 18 min read
Building and Applying NIO's Enterprise Knowledge Platform: Architecture, Challenges, and Future Directions
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 3, 2025 · Artificial Intelligence

Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI

This guide walks you through creating a RAG‑enhanced AI solution for education using Alibaba PAI, covering prerequisite setup, knowledge‑base construction with PAI‑Designer, model deployment, connection configuration, workflow assembly, and a side‑by‑side comparison of RAG versus non‑RAG answers.

AI PlatformLLMMilvus
0 likes · 16 min read
Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI
DeWu Technology
DeWu Technology
Dec 25, 2024 · Artificial Intelligence

AI-Powered Intelligent Coding: Product Evolution, Technical Advances, and Future Outlook

AI‑powered coding tools—from JetBrains’ free IDEs to VSCode extensions like Cursor and end‑to‑end web platforms—are rapidly evolving, offering code continuation, AI‑driven Q&A, multi‑file editing, and chat interfaces, while advances in context handling, caching, LLM fine‑tuning, and speculative decoding promise faster, more integrated development workflows and a future where IDEs become chat‑centric assistants that streamline debugging, deployment, and junior developer support.

AI codingIDE integrationIntelligent code completion
0 likes · 18 min read
AI-Powered Intelligent Coding: Product Evolution, Technical Advances, and Future Outlook
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 24, 2024 · Artificial Intelligence

Build a Medical RAG Solution with Alibaba PAI: Step-by-Step Guide

Learn how to create a Retrieval‑Augmented Generation (RAG) system for medical applications using Alibaba's PAI platform, covering knowledge‑base construction with PAI‑Designer, template setup in PAI‑LangStudio, deployment of LLM and embedding models, vector database integration, and end‑to‑end workflow configuration.

EmbeddingLLMMilvus
0 likes · 18 min read
Build a Medical RAG Solution with Alibaba PAI: Step-by-Step Guide
DataFunSummit
DataFunSummit
Dec 23, 2024 · Artificial Intelligence

Huolala's Large Model Evaluation Framework (LaLaEval) and Application Practices

This article presents Huolala's comprehensive LaLaEval framework for evaluating large language models, detailing the challenges of model deployment, the five‑step assessment process, two real‑world case studies in freight and driver invitation, and future directions toward more automated, product‑driven evaluation.

AIFrameworkLogistics
0 likes · 24 min read
Huolala's Large Model Evaluation Framework (LaLaEval) and Application Practices
AI Large Model Application Practice
AI Large Model Application Practice
Dec 23, 2024 · Artificial Intelligence

Master LlamaIndex Workflows: Build Multi‑Agent RAG Applications Step‑by‑Step

This article introduces LlamaIndex Workflows, explains its event‑driven design, walks through a multi‑agent demo that combines weather search and email sending, provides complete Python code for defining events, steps, and the orchestrator, and compares its strengths and limitations against similar frameworks.

AILlamaIndexPython
0 likes · 13 min read
Master LlamaIndex Workflows: Build Multi‑Agent RAG Applications Step‑by‑Step
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 20, 2024 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with Alibaba Cloud Milvus and PAI

This guide walks you through setting up Alibaba Cloud Milvus, configuring public access, deploying a RAG system via PAI, uploading a knowledge base, interacting with the model through the Web UI, and inspecting vector collections with Attu, all with step‑by‑step instructions and configuration details.

AIAlibaba CloudMilvus
0 likes · 10 min read
How to Build a Retrieval‑Augmented Generation (RAG) System with Alibaba Cloud Milvus and PAI
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 19, 2024 · Artificial Intelligence

How to Build a Full-Stack RAG Knowledge QA App with Alibaba Cloud Low-Code Platform

This guide walks you through creating a complete retrieval‑augmented generation (RAG) knowledge‑question‑answer system on Alibaba Cloud, covering AI model integration, cloud‑native low‑code development, database setup, UI customization, session persistence, analytics dashboards, and multi‑channel deployment.

AIChatbotCloud Native
0 likes · 22 min read
How to Build a Full-Stack RAG Knowledge QA App with Alibaba Cloud Low-Code Platform
iQIYI Technical Product Team
iQIYI Technical Product Team
Dec 19, 2024 · Artificial Intelligence

Project BaixiaoSheng: An AI‑Powered Project Management Assistant – iQIYI Case Study

Project BaixiaoSheng, iQIYI’s AI‑powered project management assistant unveiled at the 13th TOP 100 Global Software Case Study Summit, uses a Retrieval‑Augmented Generation framework with static knowledge Q&A, dynamic data consulting, and scenario‑assistant automation to cut context‑switching, streamline data flow, and boost cross‑system efficiency, while future plans target fine‑tuned LLMs, multi‑model fusion, and AI‑agent orchestration.

AIKnowledge BaseLarge Language Model
0 likes · 11 min read
Project BaixiaoSheng: An AI‑Powered Project Management Assistant – iQIYI Case Study
Baidu Geek Talk
Baidu Geek Talk
Dec 16, 2024 · Artificial Intelligence

AIAPI: Baidu's AI-Native Retrieval System for Large Language Model Applications

AIAPI, Baidu’s AI‑native retrieval platform for large language models, tackles hallucination, slow domain updates, and output opacity by delivering authoritative, timely, full‑content data through a dual‑channel architecture that combines traditional search and RAG, employs reusable ranking, graph‑enhanced data layers, dynamic caching that cuts storage by 70 %, and QueryPlan‑based QoS, achieving markedly higher retrieval quality and a 34 % speed gain with Wenxin 4.0.

AI-Native SystemsAIAPILarge Language Models
0 likes · 12 min read
AIAPI: Baidu's AI-Native Retrieval System for Large Language Model Applications
NewBeeNLP
NewBeeNLP
Dec 16, 2024 · Artificial Intelligence

How Tencent Boosts LLM Power with RAG, GraphRAG, and Agent Technologies

This article examines Tencent's large language model deployments across content generation, intelligent customer service, and role‑playing scenarios, detailing the principles and practical implementations of Retrieval‑Augmented Generation (RAG), GraphRAG, and Agent techniques, and discusses challenges, optimization strategies, and real‑world use cases.

AIAgentGraphRAG
0 likes · 18 min read
How Tencent Boosts LLM Power with RAG, GraphRAG, and Agent Technologies
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 15, 2024 · Artificial Intelligence

What Are the Best Practices for Retrieval‑Augmented Generation (RAG)?

This comprehensive study evaluates various components of Retrieval‑Augmented Generation pipelines—including query classification, chunking, embedding models, vector databases, retrieval, re‑ranking, summarization, and generator fine‑tuning—identifies optimal configurations, and proposes best‑practice guidelines for both performance‑maximizing and efficiency‑balanced RAG systems.

Best PracticesLLMRAG
0 likes · 17 min read
What Are the Best Practices for Retrieval‑Augmented Generation (RAG)?
Alibaba Cloud Native
Alibaba Cloud Native
Dec 13, 2024 · Artificial Intelligence

Build a 24/7 AI Customer Assistant for Your Website in 10 Minutes

This guide shows how to create a zero‑code AI chatbot using Alibaba Cloud Function Compute and the Bailei large‑model platform, configure API keys, deploy a sample site, embed the assistant, and enhance it with a private knowledge base for accurate product support.

AI AssistantCustomer ServiceFunction Compute
0 likes · 9 min read
Build a 24/7 AI Customer Assistant for Your Website in 10 Minutes
Alimama Tech
Alimama Tech
Dec 11, 2024 · Artificial Intelligence

Engineering Architecture of Alibaba's AI Digital Employee "AI XiaoWan"

Alibaba’s AI digital employee “AI XiaoWan” uses a native multi‑agent architecture where a Controller Agent interprets intent, plans tasks, and orchestrates execution while an Executable Agent performs domain‑specific operations, communicating via a standardized Agent Communication Protocol, leveraging a centralized Tool Center, a retrieval‑augmented knowledge base, and a data‑flywheel feedback loop to continuously improve and evolve toward memory‑based reasoning and self‑learning.

AIKnowledge BaseLarge Language Model
0 likes · 14 min read
Engineering Architecture of Alibaba's AI Digital Employee "AI XiaoWan"
Tencent Tech
Tencent Tech
Dec 11, 2024 · Artificial Intelligence

Inside Tencent LeYong AI: Solving Enterprise RAG with Knowledge, Engineering & Algorithms

This article explores how Tencent's LeYong AI assistant leverages Retrieval‑Augmented Generation to empower enterprise knowledge retrieval, detailing three capability dimensions—knowledge management, engineering, and algorithmic—along with eight sub‑areas such as knowledge boundaries, quality, permissions, multimodal handling, long‑context span, and complex reasoning.

AI assistantsEnterprise AILarge Language Models
0 likes · 18 min read
Inside Tencent LeYong AI: Solving Enterprise RAG with Knowledge, Engineering & Algorithms
AI Large Model Application Practice
AI Large Model Application Practice
Dec 11, 2024 · Artificial Intelligence

What Are Vectors and Why They Power Modern AI

This article explains vectors as numeric representations of data, how they enable similarity comparison, the role of embedding models and vector databases, their use in semantic search and RAG applications, and discusses their advantages and limitations in modern AI systems.

AI fundamentalsEmbeddingRAG
0 likes · 10 min read
What Are Vectors and Why They Power Modern AI
DataFunTalk
DataFunTalk
Dec 10, 2024 · Artificial Intelligence

Tencent Large Language Model Applications: RAG, GraphRAG, and Agent Technologies

This article explores Tencent's large language model deployments across various business scenarios, detailing the principles and practical implementations of Retrieval‑Augmented Generation (RAG), GraphRAG for role‑playing, and Agent technologies, while also covering model fine‑tuning, knowledge‑base construction, and evaluation methods.

AI applicationsAgentGraphRAG
0 likes · 15 min read
Tencent Large Language Model Applications: RAG, GraphRAG, and Agent Technologies
21CTO
21CTO
Dec 9, 2024 · Artificial Intelligence

Unlock AI Mastery: 5 Open-Source Tools to Learn by Doing

This guide introduces five open-source AI projects—SWIRL, Postiz, OpenBB, Open WebUI, and Auto Jobs Applier AI Agent—explaining how each can be used to practice AI concepts, from retrieval-augmented generation and AI-driven scheduling to financial analysis, model integration, and automated job applications, while highlighting the learning benefits of hands-on experimentation.

AIRAGtools
0 likes · 9 min read
Unlock AI Mastery: 5 Open-Source Tools to Learn by Doing
Tencent Cloud Developer
Tencent Cloud Developer
Dec 5, 2024 · Industry Insights

Why Most RAG Projects Fail and How Tencent’s LeXiang AI Assistant Overcomes Them

The article analyses the rapid growth of Retrieval‑Augmented Generation (RAG) in enterprises, explains why self‑built RAG solutions often collapse under cost and maintenance pressures, and demonstrates how Tencent LeXiang AI Assistant addresses these issues through a robust knowledge‑management core, extensive industry experience, scalable resources, and advanced multimodal capabilities.

AI AssistantEnterprise AILarge Language Model
0 likes · 16 min read
Why Most RAG Projects Fail and How Tencent’s LeXiang AI Assistant Overcomes Them
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 5, 2024 · Artificial Intelligence

How to Build a Financial RAG Solution with Alibaba PAI: Step-by-Step Guide

Learn how to create a Retrieval‑Augmented Generation (RAG) system for financial scenarios using Alibaba’s PAI platform—covering knowledge‑base construction with PAI‑Designer, template creation in PAI‑LangStudio, deployment of LLM and embedding models, and linking vector stores for accurate, context‑aware answers.

EmbeddingFinancial AIPAI
0 likes · 17 min read
How to Build a Financial RAG Solution with Alibaba PAI: Step-by-Step Guide
Sohu Tech Products
Sohu Tech Products
Nov 27, 2024 · Artificial Intelligence

RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search

The article explains how Retrieval‑Augmented Generation (RAG) outperforms direct LLM inference by enabling real‑time knowledge updates and lower costs, and demonstrates a practical multi‑modal RAG pipeline that uses Chinese‑CLIP for vector encoding, various chunking strategies, and Redis Search for fast vector storage and retrieval.

Chinese-CLIPChunkingLLM
0 likes · 17 min read
RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search
System Architect Go
System Architect Go
Nov 19, 2024 · Artificial Intelligence

Retrieval Augmented Generation (RAG) System Overview and Implementation with LangChain, Redis, and llama.cpp

This article explains the concept, architecture, and step‑by‑step implementation of Retrieval Augmented Generation (RAG), covering indexing, retrieval & generation processes, a practical LangChain‑Redis‑llama.cpp example on Kubernetes, code snippets, test results, challenges, and references.

AIEmbeddingLLM
0 likes · 6 min read
Retrieval Augmented Generation (RAG) System Overview and Implementation with LangChain, Redis, and llama.cpp
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 18, 2024 · Artificial Intelligence

Solving Knowledge Challenges in Retrieval‑Augmented Generation: Practical Optimizations

This article shares a half‑year of hands‑on experience with Retrieval‑Augmented Generation, analyzing why simple RAG setups often feel unintelligent, identifying three core knowledge issues, and presenting concrete optimization strategies—including chunking, knowledge expansion, and tag‑based conflict resolution—to improve retrieval and generation performance in low‑resource environments.

AILarge Language ModelsRAG
0 likes · 25 min read
Solving Knowledge Challenges in Retrieval‑Augmented Generation: Practical Optimizations
dbaplus Community
dbaplus Community
Nov 16, 2024 · Artificial Intelligence

Are LLM Frameworks Overhyped? A Critical Look at RAG and Reusability

The article critiques LLM frameworks, comparing them to early ORM tools, explains how Retrieval Augmented Generation works, warns against premature optimization, and advises developers to favor simple, visible practices over complex, abstracted frameworks for better control and understanding.

AILLMModelEvaluation
0 likes · 7 min read
Are LLM Frameworks Overhyped? A Critical Look at RAG and Reusability
ITPUB
ITPUB
Nov 15, 2024 · Databases

Why Vector Databases Matter: Deploying PgVector on PostgreSQL for Scalable AI Retrieval

This article explains the need for vector databases in the AI era, reviews PostgreSQL's extensible ecosystem, compares vector‑database options, provides step‑by‑step PgVector installation and usage, shares operational best practices, performance tuning tips, and real‑world Qunar & Tujia case studies.

AIPerformance tuningPostgreSQL
0 likes · 27 min read
Why Vector Databases Matter: Deploying PgVector on PostgreSQL for Scalable AI Retrieval
Architects' Tech Alliance
Architects' Tech Alliance
Nov 12, 2024 · Artificial Intelligence

How Retrieval‑Augmented Generation Boosts Enterprise AI with Intel Optimizations

This article explains the fundamentals of Retrieval‑Augmented Generation (RAG), its four‑step workflow, architecture, and how Intel’s hardware and software optimizations—including vector search, quantized embeddings, and advanced inference extensions—enhance performance, security, and scalability for enterprise LLM applications.

AI inferenceEmbedding QuantizationIntel Optimization
0 likes · 14 min read
How Retrieval‑Augmented Generation Boosts Enterprise AI with Intel Optimizations
Data Thinking Notes
Data Thinking Notes
Nov 12, 2024 · Artificial Intelligence

Unlock Data Power with DB‑GPT: An Open‑Source AI Framework for Data Development

DB‑GPT is an open‑source AI‑native data application framework that unifies multi‑model management, RAG, agents, and workflow orchestration to simplify building large‑model‑driven data solutions, offering features such as private Q&A, multi‑source analytics, automated fine‑tuning, and robust privacy security.

AIData FrameworkOpen Source
0 likes · 13 min read
Unlock Data Power with DB‑GPT: An Open‑Source AI Framework for Data Development
Aikesheng Open Source Community
Aikesheng Open Source Community
Nov 12, 2024 · Artificial Intelligence

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Large Language Models

ChatDBA is a conversational AI system built by Shanghai Aikesheng that employs large language models and Retrieval‑Augmented Generation to help database administrators diagnose faults, learn domain knowledge, and generate or optimize SQL, with a redesigned architecture that addresses early‑stage shortcomings and outlines future enhancements.

ChatDBAFault DiagnosisKnowledge Base
0 likes · 10 min read
ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Large Language Models
Baidu Tech Salon
Baidu Tech Salon
Nov 11, 2024 · Cloud Native

Baidu Cloud Native Data Platform: Empowering Enterprise AI in the LLM Era

To empower enterprise AI in the LLM era, Baidu Cloud unveils a cloud‑native data platform featuring upgraded databases—PegaDB, GaiaDB 5.0, Vector DB 2.0, Palo 2.0—and integrated services like DBSC 2.0, EDAP 2.0, and DBStack, delivering high‑performance, cost‑effective handling of structured, unstructured, and vector data for fine‑tuning and Enterprise RAG.

DBStackData LakehouseEDAP
0 likes · 10 min read
Baidu Cloud Native Data Platform: Empowering Enterprise AI in the LLM Era
JavaEdge
JavaEdge
Nov 9, 2024 · Artificial Intelligence

Build an AI‑Powered Airline Ticket Agent with Spring AI Alibaba

This tutorial walks through creating an intelligent airline‑ticket customer‑service agent using Spring AI Alibaba, covering requirements, architecture, RAG integration, function calling, chat memory, core capabilities, code implementation with ChatClient, and a sample running result.

AI AgentAlibabaChat Memory
0 likes · 9 min read
Build an AI‑Powered Airline Ticket Agent with Spring AI Alibaba
DataFunSummit
DataFunSummit
Nov 8, 2024 · Artificial Intelligence

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Retrieval‑Augmented Generation

ChatDBA, developed by Shanghai Aikesheng, is an AI-driven database operation assistant that leverages large language models and Retrieval‑Augmented Generation to provide fault diagnosis, knowledge learning, SQL generation and optimization, addressing challenges such as vague outputs, complex troubleshooting logic, and memory management through a structured architecture and multi‑modal retrieval strategies.

AIDatabaseFault Diagnosis
0 likes · 10 min read
ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Retrieval‑Augmented Generation
NewBeeNLP
NewBeeNLP
Nov 7, 2024 · Artificial Intelligence

Tackling Large Model Hallucinations: Causes, Detection, and Mitigation Strategies

This article provides a comprehensive analysis of large language model hallucinations, detailing their definitions, classifications, root causes, detection techniques, and a wide range of mitigation approaches—including RAG pipelines, decoding strategies, and model‑enhancement methods—to improve reliability and safety in real‑world AI applications.

AI safetyLarge Language ModelsPrompt Engineering
0 likes · 22 min read
Tackling Large Model Hallucinations: Causes, Detection, and Mitigation Strategies
Sohu Tech Products
Sohu Tech Products
Nov 6, 2024 · Artificial Intelligence

RAG2.0 Engine Design Challenges and Implementation

The talk outlines RAG2.0’s design challenges—low vector recall, complex documents, semantic gaps—and presents a two‑stage architecture using deep multimodal understanding and knowledge‑graph‑enhanced retrieval, detailing advanced chunking, multi‑index and multi‑path retrieval, efficient sorting models like ColBERT, and future multi‑modal and memory‑augmented agent directions.

ColBERTDelayed InteractionEnterprise AI
0 likes · 23 min read
RAG2.0 Engine Design Challenges and Implementation
37 Interactive Technology Team
37 Interactive Technology Team
Nov 4, 2024 · Artificial Intelligence

Developing RAG and Agent Applications with LangChain: A Case Study of an AI Assistant for Activity Components

The article outlines a step‑by‑step methodology for creating Retrieval‑Augmented Generation and custom Agent applications with LangChain, illustrated by an AI assistant for activity components that evolves from a rapid Dify prototype to a LangChain‑based RAG system and finally a hand‑crafted ReAct‑style agent, detailing LCEL chain composition, vector‑search integration, model performance trade‑offs, and a unified routing layer.

AI AssistantAgentCloud-native
0 likes · 6 min read
Developing RAG and Agent Applications with LangChain: A Case Study of an AI Assistant for Activity Components
DataFunTalk
DataFunTalk
Oct 31, 2024 · Artificial Intelligence

Tencent OlaChat: An LLM‑Powered Intelligent Business Intelligence Platform – Architecture, Capabilities, and Practice

This article presents the evolution from traditional to intelligent BI, explores how large language models enable natural‑language data analysis, details the OlaChat platform’s architecture, metadata‑enhanced retrieval methods, Text2SQL pipeline, multi‑turn dialogue system, and shares practical deployment insights and Q&A.

Intelligent AnalyticsLLMMetadata Retrieval
0 likes · 20 min read
Tencent OlaChat: An LLM‑Powered Intelligent Business Intelligence Platform – Architecture, Capabilities, and Practice
JD Tech
JD Tech
Oct 31, 2024 · Artificial Intelligence

Design and Implementation of the Logistics Intelligent Robot “Yunli XiaoZhi” Powered by Large Language Models

The article details the development of Yunli XiaoZhi, an AI‑driven logistics chatbot that combines knowledge‑base Q&A, data‑analysis, proactive alerts and report‑pushing to streamline SOP access, reduce manual query effort, and improve operational efficiency for operators, carriers and drivers.

AI chatbotData AnalysisKnowledge Base
0 likes · 22 min read
Design and Implementation of the Logistics Intelligent Robot “Yunli XiaoZhi” Powered by Large Language Models
AI Large Model Application Practice
AI Large Model Application Practice
Oct 30, 2024 · Artificial Intelligence

How to Efficiently Incrementally Update Knowledge in RAG Applications

Incremental knowledge updates in Retrieval‑Augmented Generation (RAG) systems can be achieved by using document‑level or chunk‑level strategies, leveraging hash fingerprints, record managers, and framework‑specific APIs such as LangChain’s index() with cleanup modes or LlamaIndex’s ingestion pipeline, reducing redundant computation and cost.

LangChainLlamaIndexRAG
0 likes · 12 min read
How to Efficiently Incrementally Update Knowledge in RAG Applications
Baobao Algorithm Notes
Baobao Algorithm Notes
Oct 29, 2024 · Industry Insights

Inside Perplexity AI: How RAG Powers the Next‑Gen Search Engine

In this interview, Perplexity AI CEO Aravind Srinivas explains the company’s retrieval‑augmented generation architecture, multi‑model strategy, vector‑database use, competitive positioning against Google, monetization plans, and future product road‑map, offering a deep industry perspective on AI‑driven search.

AI startupLLMPerplexity AI
0 likes · 38 min read
Inside Perplexity AI: How RAG Powers the Next‑Gen Search Engine
Baidu Geek Talk
Baidu Geek Talk
Oct 28, 2024 · Artificial Intelligence

Baidu Intelligent Cloud Qianfan AppBuilder: Enterprise-Level Large Model Application Development Platform

Baidu Intelligent Cloud’s Qianfan AppBuilder 3.0 offers an enterprise‑grade platform that simplifies large‑model application development by providing high‑accuracy RAG, robust agent scheduling, extensive integration, secure private‑or‑hybrid deployment, and a guided methodology, enabling industries to transform processes, add AI copilots, and create novel capabilities.

AI integrationBaidu Intelligent CloudEnterprise AI
0 likes · 12 min read
Baidu Intelligent Cloud Qianfan AppBuilder: Enterprise-Level Large Model Application Development Platform
DevOps
DevOps
Oct 27, 2024 · Artificial Intelligence

Best Practices for Building Efficient Retrieval‑Augmented Generation (RAG) Systems

This article reviews Wang et al.'s 2024 research on Retrieval‑Augmented Generation, outlining optimal practices such as query classification, chunk sizing, hybrid metadata search, embedding selection, vector databases, query transformation, reranking, document repacking, summarization, fine‑tuning, and multimodal retrieval to guide developers in constructing high‑performance RAG pipelines.

LLMQuery ClassificationRAG
0 likes · 11 min read
Best Practices for Building Efficient Retrieval‑Augmented Generation (RAG) Systems
DataFunSummit
DataFunSummit
Oct 27, 2024 · Artificial Intelligence

How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”

This article describes Siemens' journey in applying generative AI and Retrieval‑Augmented Generation to create an internal knowledge chatbot, detailing the business challenges, technical architecture, data integration, multi‑modal capabilities, deployment outcomes, and strategic lessons for enterprise AI adoption.

AI chatbotEnterprise Knowledge ManagementGenerative AI
0 likes · 21 min read
How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”
Alibaba Cloud Native
Alibaba Cloud Native
Oct 26, 2024 · Artificial Intelligence

Build a Real‑Time Semantic Search with EventBridge, DashVector, and FunctionCompute

This tutorial walks through constructing a zero‑to‑one RAG pipeline that ingests OSS text files via EventBridge, transforms them into embeddings with DashScope, stores vectors in DashVector, and performs semantic search using FunctionCompute and a Qwen‑Turbo LLM, complete with code samples and configuration steps.

DashVectorEmbeddingEventBridge
0 likes · 10 min read
Build a Real‑Time Semantic Search with EventBridge, DashVector, and FunctionCompute
DataFunSummit
DataFunSummit
Oct 25, 2024 · Artificial Intelligence

Progress and Standardization of Large Model + Data Intelligence Applications by the China Academy of Information and Communications Technology

This article reviews the China Academy of Information and Communications Technology's advancements in large‑model‑driven data intelligence, covering development trends, key deployment technologies such as prompt engineering, fine‑tuning and RAG, emerging application paradigms, challenges, and a series of newly drafted standards to guide industry adoption.

AIData IntelligenceKnowledge Graph
0 likes · 13 min read
Progress and Standardization of Large Model + Data Intelligence Applications by the China Academy of Information and Communications Technology
DataFunSummit
DataFunSummit
Oct 24, 2024 · Big Data

Bilibili’s Large Language Model‑Based Intelligent Assistant for the Big Data Platform: Architecture, Principles, and Deployment

This article details Bilibili’s implementation of a large‑language‑model‑driven intelligent assistant for its massive big‑data platform, covering background, problem analysis, architectural design, knowledge‑base construction, precision and recall challenges, deployment across offline and real‑time Spark/Flink diagnostics, and future outlooks.

AgentBig DataFlink
0 likes · 23 min read
Bilibili’s Large Language Model‑Based Intelligent Assistant for the Big Data Platform: Architecture, Principles, and Deployment
21CTO
21CTO
Oct 23, 2024 · Artificial Intelligence

IBM Unveils Granite 3.0 LLMs: Open‑Source, Secure, and Cost‑Effective AI Models

IBM introduced the Granite 3.0 series, an open‑source family of large language models that combine cutting‑edge performance with enhanced security, multi‑language support, and cost‑efficiency, while offering a variety of base, instruct, and specialist variants for enterprise use.

AI modelsGraniteIBM
0 likes · 4 min read
IBM Unveils Granite 3.0 LLMs: Open‑Source, Secure, and Cost‑Effective AI Models
DaTaobao Tech
DaTaobao Tech
Oct 23, 2024 · Artificial Intelligence

Retrieval-Augmented Generation (RAG): Principles, Applications, Limitations and Challenges

Retrieval-Augmented Generation (RAG) combines a retriever that fetches relevant external documents and a generator that uses them, improving LLM accuracy, relevance, privacy, and up-to-date information, but faces challenges such as retrieval latency, computational cost, chunking strategies, embedding selection, and system integration complexity.

AIKnowledge retrievalLLM
0 likes · 13 min read
Retrieval-Augmented Generation (RAG): Principles, Applications, Limitations and Challenges
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 22, 2024 · Artificial Intelligence

How Alibaba Cloud Optimizes Enterprise RAG: Key Techniques for AI Search

At the 2024 Alibaba Cloud Yúnxī Conference, senior AI Search expert Xing Shaomin detailed the enterprise‑grade Retrieval‑Augmented Generation (RAG) pipeline, covering critical link architecture, effectiveness, performance, and cost optimizations, as well as practical applications, vector store enhancements, LLM agents, and deployment strategies.

AI SearchEnterprise AILLM
0 likes · 16 min read
How Alibaba Cloud Optimizes Enterprise RAG: Key Techniques for AI Search
DataFunSummit
DataFunSummit
Oct 21, 2024 · Artificial Intelligence

Retrieval‑Augmented Generation (RAG) for Office Applications: Architecture, Challenges, and Practical Practices

This article introduces Retrieval‑Augmented Generation (RAG) as a solution to the hallucination, freshness, and data‑privacy issues of large language models, details its modular architecture, explains the layered system design and hybrid retrieval pipeline, and shares the practical challenges and engineering tricks encountered when deploying RAG in enterprise office scenarios.

AIHybrid RetrievalLarge Language Model
0 likes · 19 min read
Retrieval‑Augmented Generation (RAG) for Office Applications: Architecture, Challenges, and Practical Practices
Alibaba Cloud Native
Alibaba Cloud Native
Oct 18, 2024 · Artificial Intelligence

How Spring AI Alibaba Simplifies Java AI Application Development

This article introduces the open‑source Spring AI Alibaba framework, explains its background, core features such as chat model abstraction, prompt templates, structured output, function calling, RAG and chat memory, and walks through a complete smart‑ticket‑assistant example with code snippets and deployment guidance.

AI FrameworkChat MemoryFunction Calling
0 likes · 17 min read
How Spring AI Alibaba Simplifies Java AI Application Development
DataFunSummit
DataFunSummit
Oct 18, 2024 · Artificial Intelligence

Building Efficient RAG Applications with a Small Team: Insights from PingCAP AI Lab

This article details how PingCAP's three‑person AI Lab leveraged Retrieval‑Augmented Generation (RAG) techniques—including basic RAG, fine‑tuned embeddings, re‑ranking, graph RAG, and agent‑based RAG—to create scalable, multilingual document‑question answering services while addressing large‑scale documentation challenges, model limitations, and user feedback loops.

AgentEmbeddingLLM
0 likes · 14 min read
Building Efficient RAG Applications with a Small Team: Insights from PingCAP AI Lab
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 18, 2024 · Artificial Intelligence

Integrate Alibaba Cloud AI Search with Elasticsearch: A Step‑by‑Step Guide

This tutorial walks you through configuring Elasticsearch’s Open Inference API to connect with Alibaba Cloud AI Search, covering setup of text generation, rerank, sparse and dense vector services, and demonstrates end‑to‑end requests with code examples for building RAG and semantic search applications.

Alibaba Cloud AI SearchElasticsearchInference API
0 likes · 11 min read
Integrate Alibaba Cloud AI Search with Elasticsearch: A Step‑by‑Step Guide
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 17, 2024 · Artificial Intelligence

Build AI-Powered Java Apps Fast with Spring AI Alibaba: Features & Demo

Spring AI Alibaba is an open‑source Java framework that integrates Alibaba Cloud's large‑model services with Spring AI, offering high‑level abstractions for chat models, prompts, function calling, RAG, and conversation memory, and includes a complete ticket‑assistant example with code snippets.

AI FrameworkChatbotFunction Calling
0 likes · 17 min read
Build AI-Powered Java Apps Fast with Spring AI Alibaba: Features & Demo
AntData
AntData
Oct 16, 2024 · Artificial Intelligence

Building a Data Assistant Application with DB‑GPT V0.6.0

This tutorial walks through the end‑to‑end process of creating a data‑assistant application using DB‑GPT V0.6.0, covering prerequisite deployment, knowledge‑base construction, sub‑agent creation, RAG‑based QA, AWEL workflow installation, intent‑recognition knowledge base, and unified multi‑agent orchestration.

AIDB-GPTData Assistant
0 likes · 12 min read
Building a Data Assistant Application with DB‑GPT V0.6.0
Baobao Algorithm Notes
Baobao Algorithm Notes
Oct 16, 2024 · Artificial Intelligence

How the DB3 Team Won the Meta CRAG RAG Challenge: Prompts, Retrieval, and LoRA Fine‑Tuning

This article analyzes the Meta Comprehensive RAG (CRAG) benchmark, detailing its three tasks, evaluation metrics, and the champion DB3 team's end‑to‑end solution that combines data preprocessing, dual‑stage retrieval, prompt engineering, LoRA‑based fine‑tuning, and public data augmentation to achieve top scores across all tasks.

BenchmarkKnowledge GraphLLM
0 likes · 17 min read
How the DB3 Team Won the Meta CRAG RAG Challenge: Prompts, Retrieval, and LoRA Fine‑Tuning
21CTO
21CTO
Oct 10, 2024 · Artificial Intelligence

5 Practical AI Projects to Build Your Skills with Python

This article presents five hands‑on AI project ideas—from resume optimization to multimodal search—complete with step‑by‑step instructions, required Python libraries, and code snippets, helping beginners and intermediate developers quickly build valuable AI applications.

AIAutomationLLM
0 likes · 12 min read
5 Practical AI Projects to Build Your Skills with Python
DaTaobao Tech
DaTaobao Tech
Oct 9, 2024 · Artificial Intelligence

Building a Vertical Domain QA Bot with Vector Search, RAG, and SFT

This guide walks entry‑level developers through building a logistics‑focused QA bot by first embedding documents for vector similarity search, then adding retrieval‑augmented generation, fine‑tuning a small model, integrating hybrid checks, and optimizing deployment with feedback loops to achieve fast, accurate, out‑of‑scope‑aware answers.

AIChatbotRAG
0 likes · 15 min read
Building a Vertical Domain QA Bot with Vector Search, RAG, and SFT
DataFunTalk
DataFunTalk
Oct 9, 2024 · Artificial Intelligence

Interview on Data Fabric, Data Virtualization, and AI Integration with Denodo Leaders

In this interview, Denodo executives discuss the origins, challenges, and future of data fabric and data virtualization, explore how generative AI and retrieval‑augmented generation enhance data management, share customer success stories, and offer strategic insights for enterprises navigating digital transformation.

Data FabricDenodoEnterprise Data Management
0 likes · 19 min read
Interview on Data Fabric, Data Virtualization, and AI Integration with Denodo Leaders
DevOps
DevOps
Oct 8, 2024 · Artificial Intelligence

Top 20+ Retrieval‑Augmented Generation (RAG) Interview Questions and Answers

This article presents over twenty essential Retrieval‑Augmented Generation (RAG) interview questions with detailed answers, covering fundamentals, applications, architecture, training, limitations, ethical considerations, and integration, offering AI enthusiasts and job candidates a comprehensive guide to mastering RAG concepts.

AI InterviewMachine LearningNLP
0 likes · 15 min read
Top 20+ Retrieval‑Augmented Generation (RAG) Interview Questions and Answers
Java Tech Enthusiast
Java Tech Enthusiast
Oct 8, 2024 · Artificial Intelligence

Spring AI Framework for Java Developers

Spring AI is a Java‑centric framework that unifies access to chat, text‑to‑image, embedding and retrieval‑augmented generation models—including OpenAI, Anthropic and Alibaba’s Tongyi Qianwen—through synchronous or asynchronous APIs, POJO mapping, function calling, vector‑store integration and fluent tooling for rapid AI agent development.

AI frameworksCloud ComputingFunction Calling
0 likes · 5 min read
Spring AI Framework for Java Developers
JD Tech Talk
JD Tech Talk
Oct 8, 2024 · Artificial Intelligence

Building a Retrieval‑Augmented Generation (RAG) System with Rust and Qdrant

This article explains how to construct a Retrieval‑Augmented Generation pipeline in Rust, covering knowledge‑base creation with Qdrant, model loading and embedding using the candle library, data ingestion, and integration of a Rust‑based inference service based on mistral.rs, while also discussing resource usage and common pitfalls.

AIEmbeddingLLM
0 likes · 16 min read
Building a Retrieval‑Augmented Generation (RAG) System with Rust and Qdrant
Architect
Architect
Oct 7, 2024 · Artificial Intelligence

Master Prompt Engineering: A Universal Framework for Building Effective LLM Prompts

This article presents a systematic, four‑part Prompt engineering framework—role definition, problem description, goal setting, and requirement specification—augmented with RAG, few‑shot examples, memory handling, and model‑parameter tuning, enabling developers to craft high‑quality prompts for large language models across diverse tasks.

Large Language ModelsModel ParametersPrompt Engineering
0 likes · 28 min read
Master Prompt Engineering: A Universal Framework for Building Effective LLM Prompts
JavaEdge
JavaEdge
Oct 2, 2024 · Artificial Intelligence

Boost RAG Retrieval Accuracy with Contextual Embeddings and BM25

This article presents a contextual retrieval technique that combines contextual embeddings and contextual BM25 to reduce RAG miss rates by up to 67%, explains the underlying methods, implementation steps, cost considerations, experimental results, and practical deployment guidance.

AIBM25Contextual Retrieval
0 likes · 17 min read
Boost RAG Retrieval Accuracy with Contextual Embeddings and BM25
DataFunSummit
DataFunSummit
Oct 2, 2024 · Artificial Intelligence

NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

This article explains NVIDIA’s end‑to‑end stack for large language models, covering the NeMo Framework for data processing, training, and deployment, the open‑source TensorRT‑LLM inference accelerator, and the Retrieval‑Augmented Generation (RAG) technique that enriches model outputs with external knowledge.

Large Language ModelsNVIDIANeMo
0 likes · 17 min read
NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation
JD Cloud Developers
JD Cloud Developers
Sep 30, 2024 · Artificial Intelligence

How a Large‑Model Powered Bot Boosts Logistics Ops with Smart Q&A and Data Insights

This article describes the design, implementation, and impact of a large‑model‑driven logistics chatbot that unifies knowledge Q&A, data analysis, proactive alerts, and report pushing to streamline operations for functional staff, frontline workers, and managers, dramatically reducing query time and improving decision efficiency.

AI chatbotData AnalysisEnterprise AI
0 likes · 20 min read
How a Large‑Model Powered Bot Boosts Logistics Ops with Smart Q&A and Data Insights
JD Tech Talk
JD Tech Talk
Sep 30, 2024 · Artificial Intelligence

Yunli XiaoZhi: An AI‑Powered Intelligent Assistant for Knowledge Q&A and Data Analysis in Logistics Operations

The document describes the design, implementation, and operational results of Yunli XiaoZhi, an AI‑driven portable knowledge‑base and data‑analysis chatbot that consolidates SOPs, manuals, and real‑time information for logistics staff, using LangChain‑based RAG, vector databases, and large‑model prompting to improve query efficiency, proactive alerts, and reporting across multiple user groups.

AIChatbotData Analysis
0 likes · 19 min read
Yunli XiaoZhi: An AI‑Powered Intelligent Assistant for Knowledge Q&A and Data Analysis in Logistics Operations
JD Cloud Developers
JD Cloud Developers
Sep 29, 2024 · Artificial Intelligence

Build a Local AI Q&A System with Java, Ollama, and LangChain4J

This article walks through building a local AI question‑answer system using Java, Ollama, LangChain4J, embeddings, and a Chroma vector database, covering LLM fundamentals, embedding techniques, RAG architecture, setup steps, Maven dependencies, and sample code to retrieve and answer queries.

AIEmbeddingJava
0 likes · 19 min read
Build a Local AI Q&A System with Java, Ollama, and LangChain4J
21CTO
21CTO
Sep 28, 2024 · Artificial Intelligence

How Digital Twins and Generative AI Are Transforming Real‑Time Monitoring

This article explores how digital twins evolve from design tools to real‑time monitoring platforms, how integrating generative AI and retrieval‑augmented generation (RAG) boosts AI accuracy and situational awareness, and why software teams must adopt these combined technologies to stay ahead in modern operations.

Digital TwinGenerative AIRAG
0 likes · 11 min read
How Digital Twins and Generative AI Are Transforming Real‑Time Monitoring
Tencent Cloud Developer
Tencent Cloud Developer
Sep 27, 2024 · Artificial Intelligence

A Comprehensive Prompt Engineering Framework: Universal Templates, RAG, Few‑Shot, Memory, and Automated Optimization

The article presents a universal four‑part prompt template—role, problem description, goal, and requirements—augmented with role definitions, RAG‑based knowledge retrieval, few‑shot examples, memory handling, temperature/top‑p tuning, and automated optimization techniques such as APE, APO, and OPRO, enabling developers to reliably craft high‑quality prompts for LLMs.

AI Prompt OptimizationLarge Language ModelsPrompt Engineering
0 likes · 26 min read
A Comprehensive Prompt Engineering Framework: Universal Templates, RAG, Few‑Shot, Memory, and Automated Optimization
iQIYI Technical Product Team
iQIYI Technical Product Team
Sep 26, 2024 · Artificial Intelligence

AI-Powered Search in iQIYI: Techniques, Architecture, and Implementation

iQIYI’s AI‑powered search expands beyond title‑only queries by handling fuzzy role, plot, star, award, and semantic searches, using Chain‑of‑Thought‑generated TIPS, Retrieval‑Augmented Generation with sophisticated indexing, chunking, embedding, reranking, and prompt‑engineering to deliver personalized, accurate video recommendations that boost user engagement.

AI SearchEmbeddingQuery Guidance
0 likes · 15 min read
AI-Powered Search in iQIYI: Techniques, Architecture, and Implementation
AntData
AntData
Sep 26, 2024 · Artificial Intelligence

DB-GPT: Open-Source AI-Native Data Application Development Framework

DB‑GPT is an open‑source AI‑native data‑application framework that provides multi‑model management, Text‑to‑SQL optimization, RAG, multi‑agent collaboration, and intelligent workflow orchestration, enabling developers to build scalable large‑model database applications, with proven enterprise adoption, community growth, and academic publications.

AIData EngineeringLarge Language Models
0 likes · 6 min read
DB-GPT: Open-Source AI-Native Data Application Development Framework
JavaEdge
JavaEdge
Sep 24, 2024 · Artificial Intelligence

Mastering RAG with LangChain4j: From Simple Setup to Advanced Retrieval‑Augmented Generation

This article explains how to extend large language models with domain‑specific knowledge using Retrieval‑Augmented Generation (RAG) in LangChain4j, covering the concepts of RAG, its indexing and retrieval stages, simple RAG setup, detailed API usage, and advanced customization options such as query transformers and content injectors.

EmbeddingJavaLLM
0 likes · 24 min read
Mastering RAG with LangChain4j: From Simple Setup to Advanced Retrieval‑Augmented Generation
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 23, 2024 · Artificial Intelligence

Boosting Aviator Script Development with AI—No Model Training Required

This article details an engineering‑focused practice that uses large language models, RAG, prompt engineering, and reranking to automatically generate, review, and refine Aviator scripts for decision‑center policies without any model pre‑training, offering practical insights and code examples for developers.

AI Code GenerationAviator scriptLLM
0 likes · 29 min read
Boosting Aviator Script Development with AI—No Model Training Required
Fighter's World
Fighter's World
Sep 22, 2024 · Artificial Intelligence

How Large-Model AI Transforms Smart Customer Service – Alibaba Cloud Insights

The talk outlines the evolution of intelligent customer service over three decades, explains how generative large-model AI like ChatGPT has raised service expectations, and presents Alibaba Cloud’s four-stage implementation—experience, efficiency, capability, and insight—through three concrete cases and a roadmap for SMEs to build their own smart service systems.

AI agentsAlibaba-CloudCustomer Service
0 likes · 12 min read
How Large-Model AI Transforms Smart Customer Service – Alibaba Cloud Insights
Senior Brother's Insights
Senior Brother's Insights
Sep 19, 2024 · Artificial Intelligence

Rule Engines vs AI Models: Choosing the Right Approach for Product Logic

The article compares traditional rule‑engine architectures with AI‑driven models, explains their differing characteristics, outlines when deterministic rule matching is preferable over flexible AI inference, and recommends practical technologies such as Drools for rule‑based solutions and LLM‑based RAG/Agent frameworks for AI‑centric scenarios.

AIDroolsLLM
0 likes · 9 min read
Rule Engines vs AI Models: Choosing the Right Approach for Product Logic
JavaEdge
JavaEdge
Sep 19, 2024 · Artificial Intelligence

Unlock Java LLM Power: A Deep Dive into LangChain4j Features and Architecture

LangChain4j streamlines the integration of large language models into Java applications by offering a standardized API, extensive support for over a dozen LLM providers and vector stores, a rich toolbox for RAG, chat memory, and tool calling, plus two abstraction layers that cater to both low‑level control and high‑level convenience.

AIJavaLLM
0 likes · 10 min read
Unlock Java LLM Power: A Deep Dive into LangChain4j Features and Architecture
DevOps
DevOps
Sep 13, 2024 · Artificial Intelligence

15 Advanced Retrieval‑Augmented Generation (RAG) Techniques for Production‑Ready AI Solutions

The article outlines fifteen advanced Retrieval‑Augmented Generation (RAG) techniques—from hierarchical indexing and context caching to multimodal alignment and microservice orchestration—explaining how they help transform AI prototypes into scalable, reliable production systems while highlighting common pitfalls and a concluding call to action.

AI productionLLMRAG
0 likes · 8 min read
15 Advanced Retrieval‑Augmented Generation (RAG) Techniques for Production‑Ready AI Solutions
Code Mala Tang
Code Mala Tang
Sep 12, 2024 · Artificial Intelligence

Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js

This article explains the core concepts of Retrieval‑Augmented Generation (RAG), walks through its implementation steps with LangChain.js—including text chunking, embedding, storage, retrieval, and generation—and showcases practical use cases, challenges, and best practices for building reliable AI‑powered applications.

AI applicationsEmbeddingLLM
0 likes · 16 min read
Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js
Baidu Geek Talk
Baidu Geek Talk
Sep 11, 2024 · Databases

Why Vector Databases Are the Next Big Thing in AI: A Deep Dive into RAG and Baidu’s VectorDB

This article examines the 70‑year evolution of databases, explains how large‑model AI drives the rise of vector databases and Retrieval‑Augmented Generation (RAG), outlines the four‑stage RAG workflow, compares Baidu’s self‑built VectorDB with open‑source alternatives, and showcases real‑world deployments that highlight performance, scalability, and enterprise benefits.

AIDatabase ArchitectureIndustry Insights
0 likes · 16 min read
Why Vector Databases Are the Next Big Thing in AI: A Deep Dive into RAG and Baidu’s VectorDB