Tagged articles

924 articles

Page 8 of 10

Jan 15, 2025 · Artificial Intelligence

Build an Education‑Focused RAG Solution Using Alibaba PAI

This guide explains how to create a Retrieval‑Augmented Generation (RAG) solution for education on Alibaba PAI, covering knowledge‑base construction with PAI‑Designer, model deployment, connection setup in LangStudio, workflow configuration, online deployment, and a legal‑domain case comparison that highlights RAG's accuracy benefits.

Alibaba PAIEmbeddingKnowledge Base

0 likes · 14 min read

Build an Education‑Focused RAG Solution Using Alibaba PAI

DataFunSummit

Jan 11, 2025 · Artificial Intelligence

Generative AI Applications, MLOps, and LLMOps: A Comprehensive Overview

This article presents a detailed overview of generative AI lifecycle management, covering practical use cases such as email summarization, the roles of providers, fine‑tuners and consumers, MLOps/LLMOps processes, retrieval‑augmented generation, efficient fine‑tuning methods like PEFT, and Amazon Bedrock services for model deployment and monitoring.

Amazon BedrockLLMOpsMLOps

0 likes · 14 min read

Generative AI Applications, MLOps, and LLMOps: A Comprehensive Overview

JD Tech Talk

Jan 9, 2025 · Artificial Intelligence

Practical Guide to Building Retrieval‑Augmented Generation (RAG) Applications with LangChain4j in Java

This article provides a step‑by‑step tutorial for Java engineers on using the LangChain4j framework to implement Retrieval‑Augmented Generation (RAG) with large language models, covering concepts, environment setup, code integration, document splitting, embedding, vector‑store operations, and prompt engineering.

EmbeddingJavaLangChain4j

0 likes · 35 min read

Practical Guide to Building Retrieval‑Augmented Generation (RAG) Applications with LangChain4j in Java

JD Cloud Developers

Jan 9, 2025 · Artificial Intelligence

Boost Your Java Apps with LangChain4j: A Hands‑On RAG Guide

This article walks Java developers through the fundamentals of Retrieval‑Augmented Generation (RAG), explains the LangChain4j framework, compares large‑model development with traditional Java coding, and provides step‑by‑step code examples for environment setup, document splitting, embedding, vector‑store operations, and LLM interaction.

EmbeddingJavaLangChain4j

0 likes · 34 min read

Boost Your Java Apps with LangChain4j: A Hands‑On RAG Guide

DevOps

Jan 8, 2025 · Artificial Intelligence

Designing Generative AI Agents: Models, Tools, Extensions, Function Calls, and Data Storage

The article explains how generative AI agents combine language models, tool integration, self‑guided planning, prompt‑engineering frameworks, extensions, function calls, and vector‑based data storage to create adaptable, retrieval‑augmented systems that can interact with real‑world APIs and perform complex tasks.

RAGdata storageextensions

0 likes · 12 min read

Designing Generative AI Agents: Models, Tools, Extensions, Function Calls, and Data Storage

DataFunSummit

Jan 8, 2025 · Artificial Intelligence

Ximalaya's ChatBI: Applying Large‑Model AI to Build an Intelligent Business Intelligence Platform

This article presents Ximalaya's practical exploration of a large‑model‑driven BI system called ChatBI, detailing the background challenges, product architecture, implementation workflow, model‑optimization techniques, launch results, and future directions for data‑intelligent operations.

AI agentsData IntelligenceRAG

0 likes · 9 min read

Ximalaya's ChatBI: Applying Large‑Model AI to Build an Intelligent Business Intelligence Platform

DeWu Technology

Jan 6, 2025 · Artificial Intelligence

Design and Implementation of a Retrieval‑Augmented Generation (RAG) Answering Assistant for the Dewu Open Platform

The paper describes building a Retrieval‑Augmented Generation assistant for the Dewu Open Platform that leverages GPT‑4o‑mini, OpenAI embeddings, Milvus vector store, and LangChain.js to semantically retrieve API documentation, structure user queries, and generate accurate, JSON‑formatted answers, thereby reducing manual support and hallucinations.

AILLMLangChain

0 likes · 28 min read

Design and Implementation of a Retrieval‑Augmented Generation (RAG) Answering Assistant for the Dewu Open Platform

DataFunTalk

Jan 6, 2025 · Artificial Intelligence

Building and Applying NIO's Enterprise Knowledge Platform: Architecture, Challenges, and Future Directions

This article presents a comprehensive overview of NIO's company‑wide knowledge platform, detailing its background, layered architecture, retrieval‑augmented generation workflow, challenges such as accuracy, permission control and high concurrency, and future plans for AI‑assisted understanding, creation, multimodal capabilities, and expanded knowledge types.

AIEnterprise ArchitectureRAG

0 likes · 18 min read

Building and Applying NIO's Enterprise Knowledge Platform: Architecture, Challenges, and Future Directions

Alibaba Cloud Big Data AI Platform

Jan 3, 2025 · Artificial Intelligence

Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI

This guide walks you through creating a RAG‑enhanced AI solution for education using Alibaba PAI, covering prerequisite setup, knowledge‑base construction with PAI‑Designer, model deployment, connection configuration, workflow assembly, and a side‑by‑side comparison of RAG versus non‑RAG answers.

AI PlatformLLMMilvus

0 likes · 16 min read

Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI

DeWu Technology

Dec 25, 2024 · Artificial Intelligence

AI-Powered Intelligent Coding: Product Evolution, Technical Advances, and Future Outlook

AI‑powered coding tools—from JetBrains’ free IDEs to VSCode extensions like Cursor and end‑to‑end web platforms—are rapidly evolving, offering code continuation, AI‑driven Q&A, multi‑file editing, and chat interfaces, while advances in context handling, caching, LLM fine‑tuning, and speculative decoding promise faster, more integrated development workflows and a future where IDEs become chat‑centric assistants that streamline debugging, deployment, and junior developer support.

AI codingIDE integrationIntelligent code completion

0 likes · 18 min read

AI-Powered Intelligent Coding: Product Evolution, Technical Advances, and Future Outlook

Alibaba Cloud Big Data AI Platform

Dec 24, 2024 · Artificial Intelligence

Build a Medical RAG Solution with Alibaba PAI: Step-by-Step Guide

Learn how to create a Retrieval‑Augmented Generation (RAG) system for medical applications using Alibaba's PAI platform, covering knowledge‑base construction with PAI‑Designer, template setup in PAI‑LangStudio, deployment of LLM and embedding models, vector database integration, and end‑to‑end workflow configuration.

EmbeddingLLMMilvus

0 likes · 18 min read

Build a Medical RAG Solution with Alibaba PAI: Step-by-Step Guide

DataFunSummit

Dec 23, 2024 · Artificial Intelligence

Huolala's Large Model Evaluation Framework (LaLaEval) and Application Practices

This article presents Huolala's comprehensive LaLaEval framework for evaluating large language models, detailing the challenges of model deployment, the five‑step assessment process, two real‑world case studies in freight and driver invitation, and future directions toward more automated, product‑driven evaluation.

AIFrameworkLogistics

0 likes · 24 min read

Huolala's Large Model Evaluation Framework (LaLaEval) and Application Practices

AI Large Model Application Practice

Dec 23, 2024 · Artificial Intelligence

Master LlamaIndex Workflows: Build Multi‑Agent RAG Applications Step‑by‑Step

This article introduces LlamaIndex Workflows, explains its event‑driven design, walks through a multi‑agent demo that combines weather search and email sending, provides complete Python code for defining events, steps, and the orchestrator, and compares its strengths and limitations against similar frameworks.

AILlamaIndexPython

0 likes · 13 min read

Master LlamaIndex Workflows: Build Multi‑Agent RAG Applications Step‑by‑Step

Alibaba Cloud Big Data AI Platform

Dec 20, 2024 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with Alibaba Cloud Milvus and PAI

This guide walks you through setting up Alibaba Cloud Milvus, configuring public access, deploying a RAG system via PAI, uploading a knowledge base, interacting with the model through the Web UI, and inspecting vector collections with Attu, all with step‑by‑step instructions and configuration details.

AIAlibaba CloudMilvus

0 likes · 10 min read

How to Build a Retrieval‑Augmented Generation (RAG) System with Alibaba Cloud Milvus and PAI

Alibaba Cloud Developer

Dec 19, 2024 · Artificial Intelligence

How to Build a Full-Stack RAG Knowledge QA App with Alibaba Cloud Low-Code Platform

This guide walks you through creating a complete retrieval‑augmented generation (RAG) knowledge‑question‑answer system on Alibaba Cloud, covering AI model integration, cloud‑native low‑code development, database setup, UI customization, session persistence, analytics dashboards, and multi‑channel deployment.

AIChatbotCloud Native

0 likes · 22 min read

How to Build a Full-Stack RAG Knowledge QA App with Alibaba Cloud Low-Code Platform

iQIYI Technical Product Team

Dec 19, 2024 · Artificial Intelligence

Project BaixiaoSheng: An AI‑Powered Project Management Assistant – iQIYI Case Study

Project BaixiaoSheng, iQIYI’s AI‑powered project management assistant unveiled at the 13th TOP 100 Global Software Case Study Summit, uses a Retrieval‑Augmented Generation framework with static knowledge Q&A, dynamic data consulting, and scenario‑assistant automation to cut context‑switching, streamline data flow, and boost cross‑system efficiency, while future plans target fine‑tuned LLMs, multi‑model fusion, and AI‑agent orchestration.

AIKnowledge BaseLarge Language Model

0 likes · 11 min read

Project BaixiaoSheng: An AI‑Powered Project Management Assistant – iQIYI Case Study

Alibaba Cloud Big Data AI Platform

Dec 17, 2024 · Artificial Intelligence

Build Chinese Vector Search with Alibaba Cloud AI and Elasticsearch Inference APIs

This guide walks you through creating sparse and dense vector inference endpoints on Elasticsearch using Alibaba Cloud AI services, demonstrates how to generate embeddings, perform completion, rerank results, and integrate RAG workflows for accurate Chinese‑language search.

AI SearchCompletionDense Embedding

0 likes · 14 min read

Build Chinese Vector Search with Alibaba Cloud AI and Elasticsearch Inference APIs

Baidu Geek Talk

Dec 16, 2024 · Artificial Intelligence

AIAPI: Baidu's AI-Native Retrieval System for Large Language Model Applications

AIAPI, Baidu’s AI‑native retrieval platform for large language models, tackles hallucination, slow domain updates, and output opacity by delivering authoritative, timely, full‑content data through a dual‑channel architecture that combines traditional search and RAG, employs reusable ranking, graph‑enhanced data layers, dynamic caching that cuts storage by 70 %, and QueryPlan‑based QoS, achieving markedly higher retrieval quality and a 34 % speed gain with Wenxin 4.0.

AI-Native SystemsAIAPILarge Language Models

0 likes · 12 min read

AIAPI: Baidu's AI-Native Retrieval System for Large Language Model Applications

NewBeeNLP

Dec 16, 2024 · Artificial Intelligence

How Tencent Boosts LLM Power with RAG, GraphRAG, and Agent Technologies

This article examines Tencent's large language model deployments across content generation, intelligent customer service, and role‑playing scenarios, detailing the principles and practical implementations of Retrieval‑Augmented Generation (RAG), GraphRAG, and Agent techniques, and discusses challenges, optimization strategies, and real‑world use cases.

AIAgentGraphRAG

0 likes · 18 min read

How Tencent Boosts LLM Power with RAG, GraphRAG, and Agent Technologies

Alibaba Cloud Big Data AI Platform

Dec 16, 2024 · Artificial Intelligence

Build a RAG-Powered Q&A App with Alibaba Cloud Milvus, DashScope & PAI

This guide walks you through creating a Retrieval‑Augmented Generation (RAG) question‑answering application by integrating Alibaba Cloud Milvus vector search, DashScope embedding models, and PAI EAS LLM services, covering prerequisites, service deployment, configuration, Python code setup, and execution steps.

LLMLangChainMilvus

0 likes · 12 min read

Build a RAG-Powered Q&A App with Alibaba Cloud Milvus, DashScope & PAI

Baobao Algorithm Notes

Dec 15, 2024 · Artificial Intelligence

What Are the Best Practices for Retrieval‑Augmented Generation (RAG)?

This comprehensive study evaluates various components of Retrieval‑Augmented Generation pipelines—including query classification, chunking, embedding models, vector databases, retrieval, re‑ranking, summarization, and generator fine‑tuning—identifies optimal configurations, and proposes best‑practice guidelines for both performance‑maximizing and efficiency‑balanced RAG systems.

Best PracticesLLMRAG

0 likes · 17 min read

What Are the Best Practices for Retrieval‑Augmented Generation (RAG)?

Alibaba Cloud Native

Dec 13, 2024 · Artificial Intelligence

Build a 24/7 AI Customer Assistant for Your Website in 10 Minutes

This guide shows how to create a zero‑code AI chatbot using Alibaba Cloud Function Compute and the Bailei large‑model platform, configure API keys, deploy a sample site, embed the assistant, and enhance it with a private knowledge base for accurate product support.

AI AssistantCustomer ServiceFunction Compute

0 likes · 9 min read

Build a 24/7 AI Customer Assistant for Your Website in 10 Minutes

Alimama Tech

Dec 11, 2024 · Artificial Intelligence

Engineering Architecture of Alibaba's AI Digital Employee "AI XiaoWan"

Alibaba’s AI digital employee “AI XiaoWan” uses a native multi‑agent architecture where a Controller Agent interprets intent, plans tasks, and orchestrates execution while an Executable Agent performs domain‑specific operations, communicating via a standardized Agent Communication Protocol, leveraging a centralized Tool Center, a retrieval‑augmented knowledge base, and a data‑flywheel feedback loop to continuously improve and evolve toward memory‑based reasoning and self‑learning.

AIKnowledge BaseLarge Language Model

0 likes · 14 min read

Engineering Architecture of Alibaba's AI Digital Employee "AI XiaoWan"

Tencent Tech

Dec 11, 2024 · Artificial Intelligence

Inside Tencent LeYong AI: Solving Enterprise RAG with Knowledge, Engineering & Algorithms

This article explores how Tencent's LeYong AI assistant leverages Retrieval‑Augmented Generation to empower enterprise knowledge retrieval, detailing three capability dimensions—knowledge management, engineering, and algorithmic—along with eight sub‑areas such as knowledge boundaries, quality, permissions, multimodal handling, long‑context span, and complex reasoning.

AI assistantsEnterprise AILarge Language Models

0 likes · 18 min read

Inside Tencent LeYong AI: Solving Enterprise RAG with Knowledge, Engineering & Algorithms

AI Large Model Application Practice

Dec 11, 2024 · Artificial Intelligence

What Are Vectors and Why They Power Modern AI

This article explains vectors as numeric representations of data, how they enable similarity comparison, the role of embedding models and vector databases, their use in semantic search and RAG applications, and discusses their advantages and limitations in modern AI systems.

AI fundamentalsEmbeddingRAG

0 likes · 10 min read

What Are Vectors and Why They Power Modern AI

DataFunTalk

Dec 10, 2024 · Artificial Intelligence

Tencent Large Language Model Applications: RAG, GraphRAG, and Agent Technologies

This article explores Tencent's large language model deployments across various business scenarios, detailing the principles and practical implementations of Retrieval‑Augmented Generation (RAG), GraphRAG for role‑playing, and Agent technologies, while also covering model fine‑tuning, knowledge‑base construction, and evaluation methods.

AI applicationsAgentGraphRAG

0 likes · 15 min read

Tencent Large Language Model Applications: RAG, GraphRAG, and Agent Technologies

21CTO

Dec 9, 2024 · Artificial Intelligence

Unlock AI Mastery: 5 Open-Source Tools to Learn by Doing

This guide introduces five open-source AI projects—SWIRL, Postiz, OpenBB, Open WebUI, and Auto Jobs Applier AI Agent—explaining how each can be used to practice AI concepts, from retrieval-augmented generation and AI-driven scheduling to financial analysis, model integration, and automated job applications, while highlighting the learning benefits of hands-on experimentation.

AIRAGtools

0 likes · 9 min read

Unlock AI Mastery: 5 Open-Source Tools to Learn by Doing

Zhihu Tech Column

Dec 9, 2024 · Artificial Intelligence

Large Model Application Engineering: ZhiLight Inference Framework and Zhihu Direct Answer System

The article details Zhihu's technical salon on large‑model engineering, covering the RAG‑based Zhihu Direct Answer system, the open‑source ZhiLight inference framework, prompt engineering, agent research, and future plans for integrating AI into product and community workflows.

AI EngineeringAgentInference Framework

0 likes · 8 min read

Large Model Application Engineering: ZhiLight Inference Framework and Zhihu Direct Answer System

Tencent Cloud Developer

Dec 5, 2024 · Industry Insights

Why Most RAG Projects Fail and How Tencent’s LeXiang AI Assistant Overcomes Them

The article analyses the rapid growth of Retrieval‑Augmented Generation (RAG) in enterprises, explains why self‑built RAG solutions often collapse under cost and maintenance pressures, and demonstrates how Tencent LeXiang AI Assistant addresses these issues through a robust knowledge‑management core, extensive industry experience, scalable resources, and advanced multimodal capabilities.

AI AssistantEnterprise AILarge Language Model

0 likes · 16 min read

Why Most RAG Projects Fail and How Tencent’s LeXiang AI Assistant Overcomes Them

Alibaba Cloud Big Data AI Platform

Dec 5, 2024 · Artificial Intelligence

How to Build a Financial RAG Solution with Alibaba PAI: Step-by-Step Guide

Learn how to create a Retrieval‑Augmented Generation (RAG) system for financial scenarios using Alibaba’s PAI platform—covering knowledge‑base construction with PAI‑Designer, template creation in PAI‑LangStudio, deployment of LLM and embedding models, and linking vector stores for accurate, context‑aware answers.

EmbeddingFinancial AIPAI

0 likes · 17 min read

How to Build a Financial RAG Solution with Alibaba PAI: Step-by-Step Guide

AI Large Model Application Practice

Nov 29, 2024 · Artificial Intelligence

Understanding RAG: How Retrieval‑Augmented Generation Reduces Large‑Model Hallucinations

This article explains the hallucination problem of large language models, introduces Retrieval‑Augmented Generation (RAG) as a solution, compares RAG with model fine‑tuning, and outlines basic RAG architecture and workflow for practical applications.

Hallucination MitigationLarge Language ModelsRAG

0 likes · 10 min read

Understanding RAG: How Retrieval‑Augmented Generation Reduces Large‑Model Hallucinations

Sohu Tech Products

Nov 27, 2024 · Artificial Intelligence

RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search

The article explains how Retrieval‑Augmented Generation (RAG) outperforms direct LLM inference by enabling real‑time knowledge updates and lower costs, and demonstrates a practical multi‑modal RAG pipeline that uses Chinese‑CLIP for vector encoding, various chunking strategies, and Redis Search for fast vector storage and retrieval.

Chinese-CLIPChunkingLLM

0 likes · 17 min read

RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search

System Architect Go

Nov 19, 2024 · Artificial Intelligence

Retrieval Augmented Generation (RAG) System Overview and Implementation with LangChain, Redis, and llama.cpp

This article explains the concept, architecture, and step‑by‑step implementation of Retrieval Augmented Generation (RAG), covering indexing, retrieval & generation processes, a practical LangChain‑Redis‑llama.cpp example on Kubernetes, code snippets, test results, challenges, and references.

AIEmbeddingLLM

0 likes · 6 min read

Retrieval Augmented Generation (RAG) System Overview and Implementation with LangChain, Redis, and llama.cpp

Alibaba Cloud Developer

Nov 18, 2024 · Artificial Intelligence

Solving Knowledge Challenges in Retrieval‑Augmented Generation: Practical Optimizations

This article shares a half‑year of hands‑on experience with Retrieval‑Augmented Generation, analyzing why simple RAG setups often feel unintelligent, identifying three core knowledge issues, and presenting concrete optimization strategies—including chunking, knowledge expansion, and tag‑based conflict resolution—to improve retrieval and generation performance in low‑resource environments.

AILarge Language ModelsRAG

0 likes · 25 min read

Solving Knowledge Challenges in Retrieval‑Augmented Generation: Practical Optimizations

dbaplus Community

Nov 16, 2024 · Artificial Intelligence

Are LLM Frameworks Overhyped? A Critical Look at RAG and Reusability

The article critiques LLM frameworks, comparing them to early ORM tools, explains how Retrieval Augmented Generation works, warns against premature optimization, and advises developers to favor simple, visible practices over complex, abstracted frameworks for better control and understanding.

AILLMModelEvaluation

0 likes · 7 min read

Are LLM Frameworks Overhyped? A Critical Look at RAG and Reusability

ITPUB

Nov 15, 2024 · Databases

Why Vector Databases Matter: Deploying PgVector on PostgreSQL for Scalable AI Retrieval

This article explains the need for vector databases in the AI era, reviews PostgreSQL's extensible ecosystem, compares vector‑database options, provides step‑by‑step PgVector installation and usage, shares operational best practices, performance tuning tips, and real‑world Qunar & Tujia case studies.

AIPerformance tuningPostgreSQL

0 likes · 27 min read

Why Vector Databases Matter: Deploying PgVector on PostgreSQL for Scalable AI Retrieval

Alibaba Cloud Developer

Nov 14, 2024 · Artificial Intelligence

Building a High‑Accuracy Automotive Maintenance Q&A System with Multi‑Agent LLMs

This article details how to design, implement, and evaluate a complex‑table intelligent Q&A solution for automotive maintenance using large language models, RAG pipelines, multi‑agent architectures, prompt engineering, and Alibaba Cloud services, achieving up to 93.8% accuracy.

AutomotiveLLMPrompt Engineering

0 likes · 31 min read

Building a High‑Accuracy Automotive Maintenance Q&A System with Multi‑Agent LLMs

Alibaba Cloud Developer

Nov 13, 2024 · Artificial Intelligence

Boost RAG Accuracy with GraphRAG: Combining Knowledge Graphs and Vectors on PolarDB

This article explains how to build a GraphRAG system that integrates knowledge graphs and vector embeddings using PolarDB, Alibaba Cloud's Tongyi Qianwen LLM, and LangChain, demonstrating improved retrieval‑augmented generation accuracy through combined graph‑and‑vector search.

GraphRAGKnowledge GraphLangChain

0 likes · 23 min read

Boost RAG Accuracy with GraphRAG: Combining Knowledge Graphs and Vectors on PolarDB

Architects' Tech Alliance

Nov 12, 2024 · Artificial Intelligence

How Retrieval‑Augmented Generation Boosts Enterprise AI with Intel Optimizations

This article explains the fundamentals of Retrieval‑Augmented Generation (RAG), its four‑step workflow, architecture, and how Intel’s hardware and software optimizations—including vector search, quantized embeddings, and advanced inference extensions—enhance performance, security, and scalability for enterprise LLM applications.

AI inferenceEmbedding QuantizationIntel Optimization

0 likes · 14 min read

How Retrieval‑Augmented Generation Boosts Enterprise AI with Intel Optimizations

Data Thinking Notes

Nov 12, 2024 · Artificial Intelligence

Unlock Data Power with DB‑GPT: An Open‑Source AI Framework for Data Development

DB‑GPT is an open‑source AI‑native data application framework that unifies multi‑model management, RAG, agents, and workflow orchestration to simplify building large‑model‑driven data solutions, offering features such as private Q&A, multi‑source analytics, automated fine‑tuning, and robust privacy security.

AIData FrameworkOpen Source

0 likes · 13 min read

Unlock Data Power with DB‑GPT: An Open‑Source AI Framework for Data Development

Aikesheng Open Source Community

Nov 12, 2024 · Artificial Intelligence

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Large Language Models

ChatDBA is a conversational AI system built by Shanghai Aikesheng that employs large language models and Retrieval‑Augmented Generation to help database administrators diagnose faults, learn domain knowledge, and generate or optimize SQL, with a redesigned architecture that addresses early‑stage shortcomings and outlines future enhancements.

ChatDBAFault DiagnosisKnowledge Base

0 likes · 10 min read

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Large Language Models

Aikesheng Open Source Community

Nov 11, 2024 · Databases

ChatDBA: An AI‑Powered Intelligent Assistant for Database Fault Diagnosis and Management

ChatDBA is an AI‑driven conversational system developed by Shanghai Aikesheng that assists DBAs with fault diagnosis, knowledge learning, SQL generation and optimization by leveraging large language models, RAG architecture, and advanced retrieval and document‑processing techniques.

ChatDBAFault DiagnosisLLM

0 likes · 10 min read

ChatDBA: An AI‑Powered Intelligent Assistant for Database Fault Diagnosis and Management

Baidu Tech Salon

Nov 11, 2024 · Cloud Native

Baidu Cloud Native Data Platform: Empowering Enterprise AI in the LLM Era

To empower enterprise AI in the LLM era, Baidu Cloud unveils a cloud‑native data platform featuring upgraded databases—PegaDB, GaiaDB 5.0, Vector DB 2.0, Palo 2.0—and integrated services like DBSC 2.0, EDAP 2.0, and DBStack, delivering high‑performance, cost‑effective handling of structured, unstructured, and vector data for fine‑tuning and Enterprise RAG.

DBStackData LakehouseEDAP

0 likes · 10 min read

Baidu Cloud Native Data Platform: Empowering Enterprise AI in the LLM Era

JavaEdge

Nov 9, 2024 · Artificial Intelligence

Build an AI‑Powered Airline Ticket Agent with Spring AI Alibaba

This tutorial walks through creating an intelligent airline‑ticket customer‑service agent using Spring AI Alibaba, covering requirements, architecture, RAG integration, function calling, chat memory, core capabilities, code implementation with ChatClient, and a sample running result.

AI AgentAlibabaChat Memory

0 likes · 9 min read

Build an AI‑Powered Airline Ticket Agent with Spring AI Alibaba

DataFunSummit

Nov 8, 2024 · Artificial Intelligence

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Retrieval‑Augmented Generation

ChatDBA, developed by Shanghai Aikesheng, is an AI-driven database operation assistant that leverages large language models and Retrieval‑Augmented Generation to provide fault diagnosis, knowledge learning, SQL generation and optimization, addressing challenges such as vague outputs, complex troubleshooting logic, and memory management through a structured architecture and multi‑modal retrieval strategies.

AIDatabaseFault Diagnosis

0 likes · 10 min read

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Retrieval‑Augmented Generation

AI Large Model Application Practice

Nov 8, 2024 · Artificial Intelligence

How to Build a Multimodal Embedding RAG with Cohere and LlamaIndex

This guide explains how to overcome the limitations of text‑only embeddings for enterprise AI search by using a multimodal embedding model to index and retrieve both text and images, detailing the full workflow, code examples, and performance benefits.

CohereLLMLlamaIndex

0 likes · 13 min read

How to Build a Multimodal Embedding RAG with Cohere and LlamaIndex

NewBeeNLP

Nov 7, 2024 · Artificial Intelligence

Tackling Large Model Hallucinations: Causes, Detection, and Mitigation Strategies

This article provides a comprehensive analysis of large language model hallucinations, detailing their definitions, classifications, root causes, detection techniques, and a wide range of mitigation approaches—including RAG pipelines, decoding strategies, and model‑enhancement methods—to improve reliability and safety in real‑world AI applications.

AI safetyLarge Language ModelsPrompt Engineering

0 likes · 22 min read

Tackling Large Model Hallucinations: Causes, Detection, and Mitigation Strategies

Sohu Tech Products

Nov 6, 2024 · Artificial Intelligence

RAG2.0 Engine Design Challenges and Implementation

The talk outlines RAG2.0’s design challenges—low vector recall, complex documents, semantic gaps—and presents a two‑stage architecture using deep multimodal understanding and knowledge‑graph‑enhanced retrieval, detailing advanced chunking, multi‑index and multi‑path retrieval, efficient sorting models like ColBERT, and future multi‑modal and memory‑augmented agent directions.

ColBERTDelayed InteractionEnterprise AI

0 likes · 23 min read

RAG2.0 Engine Design Challenges and Implementation

37 Interactive Technology Team

Nov 4, 2024 · Artificial Intelligence

Developing RAG and Agent Applications with LangChain: A Case Study of an AI Assistant for Activity Components

The article outlines a step‑by‑step methodology for creating Retrieval‑Augmented Generation and custom Agent applications with LangChain, illustrated by an AI assistant for activity components that evolves from a rapid Dify prototype to a LangChain‑based RAG system and finally a hand‑crafted ReAct‑style agent, detailing LCEL chain composition, vector‑search integration, model performance trade‑offs, and a unified routing layer.

AI AssistantAgentCloud-native

0 likes · 6 min read

Developing RAG and Agent Applications with LangChain: A Case Study of an AI Assistant for Activity Components

DataFunTalk

Oct 31, 2024 · Artificial Intelligence

Tencent OlaChat: An LLM‑Powered Intelligent Business Intelligence Platform – Architecture, Capabilities, and Practice

This article presents the evolution from traditional to intelligent BI, explores how large language models enable natural‑language data analysis, details the OlaChat platform’s architecture, metadata‑enhanced retrieval methods, Text2SQL pipeline, multi‑turn dialogue system, and shares practical deployment insights and Q&A.

Intelligent AnalyticsLLMMetadata Retrieval

0 likes · 20 min read

Tencent OlaChat: An LLM‑Powered Intelligent Business Intelligence Platform – Architecture, Capabilities, and Practice

JD Tech

Oct 31, 2024 · Artificial Intelligence

Design and Implementation of the Logistics Intelligent Robot “Yunli XiaoZhi” Powered by Large Language Models

The article details the development of Yunli XiaoZhi, an AI‑driven logistics chatbot that combines knowledge‑base Q&A, data‑analysis, proactive alerts and report‑pushing to streamline SOP access, reduce manual query effort, and improve operational efficiency for operators, carriers and drivers.

AI chatbotData AnalysisKnowledge Base

0 likes · 22 min read

Design and Implementation of the Logistics Intelligent Robot “Yunli XiaoZhi” Powered by Large Language Models

AI Large Model Application Practice

Oct 30, 2024 · Artificial Intelligence

How to Efficiently Incrementally Update Knowledge in RAG Applications

Incremental knowledge updates in Retrieval‑Augmented Generation (RAG) systems can be achieved by using document‑level or chunk‑level strategies, leveraging hash fingerprints, record managers, and framework‑specific APIs such as LangChain’s index() with cleanup modes or LlamaIndex’s ingestion pipeline, reducing redundant computation and cost.

LangChainLlamaIndexRAG

0 likes · 12 min read

How to Efficiently Incrementally Update Knowledge in RAG Applications

Baobao Algorithm Notes

Oct 29, 2024 · Industry Insights

Inside Perplexity AI: How RAG Powers the Next‑Gen Search Engine

In this interview, Perplexity AI CEO Aravind Srinivas explains the company’s retrieval‑augmented generation architecture, multi‑model strategy, vector‑database use, competitive positioning against Google, monetization plans, and future product road‑map, offering a deep industry perspective on AI‑driven search.

AI startupLLMPerplexity AI

0 likes · 38 min read

Inside Perplexity AI: How RAG Powers the Next‑Gen Search Engine

Baidu Geek Talk

Oct 28, 2024 · Artificial Intelligence

Baidu Intelligent Cloud Qianfan AppBuilder: Enterprise-Level Large Model Application Development Platform

Baidu Intelligent Cloud’s Qianfan AppBuilder 3.0 offers an enterprise‑grade platform that simplifies large‑model application development by providing high‑accuracy RAG, robust agent scheduling, extensive integration, secure private‑or‑hybrid deployment, and a guided methodology, enabling industries to transform processes, add AI copilots, and create novel capabilities.

AI integrationBaidu Intelligent CloudEnterprise AI

0 likes · 12 min read

Baidu Intelligent Cloud Qianfan AppBuilder: Enterprise-Level Large Model Application Development Platform

DevOps

Oct 27, 2024 · Artificial Intelligence

Best Practices for Building Efficient Retrieval‑Augmented Generation (RAG) Systems

This article reviews Wang et al.'s 2024 research on Retrieval‑Augmented Generation, outlining optimal practices such as query classification, chunk sizing, hybrid metadata search, embedding selection, vector databases, query transformation, reranking, document repacking, summarization, fine‑tuning, and multimodal retrieval to guide developers in constructing high‑performance RAG pipelines.

LLMQuery ClassificationRAG

0 likes · 11 min read

Best Practices for Building Efficient Retrieval‑Augmented Generation (RAG) Systems

DataFunSummit

Oct 27, 2024 · Artificial Intelligence

How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”

This article describes Siemens' journey in applying generative AI and Retrieval‑Augmented Generation to create an internal knowledge chatbot, detailing the business challenges, technical architecture, data integration, multi‑modal capabilities, deployment outcomes, and strategic lessons for enterprise AI adoption.

AI chatbotEnterprise Knowledge ManagementGenerative AI

0 likes · 21 min read

How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”

Alibaba Cloud Native

Oct 26, 2024 · Artificial Intelligence

Build a Real‑Time Semantic Search with EventBridge, DashVector, and FunctionCompute

This tutorial walks through constructing a zero‑to‑one RAG pipeline that ingests OSS text files via EventBridge, transforms them into embeddings with DashScope, stores vectors in DashVector, and performs semantic search using FunctionCompute and a Qwen‑Turbo LLM, complete with code samples and configuration steps.

DashVectorEmbeddingEventBridge

0 likes · 10 min read

Build a Real‑Time Semantic Search with EventBridge, DashVector, and FunctionCompute

DataFunSummit

Oct 25, 2024 · Artificial Intelligence

Progress and Standardization of Large Model + Data Intelligence Applications by the China Academy of Information and Communications Technology

This article reviews the China Academy of Information and Communications Technology's advancements in large‑model‑driven data intelligence, covering development trends, key deployment technologies such as prompt engineering, fine‑tuning and RAG, emerging application paradigms, challenges, and a series of newly drafted standards to guide industry adoption.

AIData IntelligenceKnowledge Graph

0 likes · 13 min read

Progress and Standardization of Large Model + Data Intelligence Applications by the China Academy of Information and Communications Technology

DataFunSummit

Oct 24, 2024 · Big Data

Bilibili’s Large Language Model‑Based Intelligent Assistant for the Big Data Platform: Architecture, Principles, and Deployment

This article details Bilibili’s implementation of a large‑language‑model‑driven intelligent assistant for its massive big‑data platform, covering background, problem analysis, architectural design, knowledge‑base construction, precision and recall challenges, deployment across offline and real‑time Spark/Flink diagnostics, and future outlooks.

AgentBig DataFlink

0 likes · 23 min read

Bilibili’s Large Language Model‑Based Intelligent Assistant for the Big Data Platform: Architecture, Principles, and Deployment

21CTO

Oct 23, 2024 · Artificial Intelligence

IBM Unveils Granite 3.0 LLMs: Open‑Source, Secure, and Cost‑Effective AI Models

IBM introduced the Granite 3.0 series, an open‑source family of large language models that combine cutting‑edge performance with enhanced security, multi‑language support, and cost‑efficiency, while offering a variety of base, instruct, and specialist variants for enterprise use.

AI modelsGraniteIBM

0 likes · 4 min read

IBM Unveils Granite 3.0 LLMs: Open‑Source, Secure, and Cost‑Effective AI Models

DaTaobao Tech

Oct 23, 2024 · Artificial Intelligence

Retrieval-Augmented Generation (RAG): Principles, Applications, Limitations and Challenges

Retrieval-Augmented Generation (RAG) combines a retriever that fetches relevant external documents and a generator that uses them, improving LLM accuracy, relevance, privacy, and up-to-date information, but faces challenges such as retrieval latency, computational cost, chunking strategies, embedding selection, and system integration complexity.

AIKnowledge retrievalLLM

0 likes · 13 min read

Retrieval-Augmented Generation (RAG): Principles, Applications, Limitations and Challenges

Alibaba Cloud Big Data AI Platform

Oct 22, 2024 · Artificial Intelligence

How Alibaba Cloud Optimizes Enterprise RAG: Key Techniques for AI Search

At the 2024 Alibaba Cloud Yúnxī Conference, senior AI Search expert Xing Shaomin detailed the enterprise‑grade Retrieval‑Augmented Generation (RAG) pipeline, covering critical link architecture, effectiveness, performance, and cost optimizations, as well as practical applications, vector store enhancements, LLM agents, and deployment strategies.

AI SearchEnterprise AILLM

0 likes · 16 min read

How Alibaba Cloud Optimizes Enterprise RAG: Key Techniques for AI Search

DataFunSummit

Oct 21, 2024 · Artificial Intelligence

Retrieval‑Augmented Generation (RAG) for Office Applications: Architecture, Challenges, and Practical Practices

This article introduces Retrieval‑Augmented Generation (RAG) as a solution to the hallucination, freshness, and data‑privacy issues of large language models, details its modular architecture, explains the layered system design and hybrid retrieval pipeline, and shares the practical challenges and engineering tricks encountered when deploying RAG in enterprise office scenarios.

AIHybrid RetrievalLarge Language Model

0 likes · 19 min read

Retrieval‑Augmented Generation (RAG) for Office Applications: Architecture, Challenges, and Practical Practices

Alibaba Cloud Native

Oct 18, 2024 · Artificial Intelligence

How Spring AI Alibaba Simplifies Java AI Application Development

This article introduces the open‑source Spring AI Alibaba framework, explains its background, core features such as chat model abstraction, prompt templates, structured output, function calling, RAG and chat memory, and walks through a complete smart‑ticket‑assistant example with code snippets and deployment guidance.

AI FrameworkChat MemoryFunction Calling

0 likes · 17 min read

How Spring AI Alibaba Simplifies Java AI Application Development

DataFunSummit

Oct 18, 2024 · Artificial Intelligence

Building Efficient RAG Applications with a Small Team: Insights from PingCAP AI Lab

This article details how PingCAP's three‑person AI Lab leveraged Retrieval‑Augmented Generation (RAG) techniques—including basic RAG, fine‑tuned embeddings, re‑ranking, graph RAG, and agent‑based RAG—to create scalable, multilingual document‑question answering services while addressing large‑scale documentation challenges, model limitations, and user feedback loops.

AgentEmbeddingLLM

0 likes · 14 min read

Building Efficient RAG Applications with a Small Team: Insights from PingCAP AI Lab

Architecture Digest

Oct 18, 2024 · Databases

Redis Introduces Multi‑Threaded Query Engine to Boost Vector Search Performance

Redis has launched an enhanced, multi‑threaded query engine that dramatically increases throughput and reduces latency for vector similarity searches, enabling vertical scaling and better support for real‑time RAG applications while maintaining sub‑10 ms response times.

Database PerformanceQuery EngineRAG

0 likes · 7 min read

Redis Introduces Multi‑Threaded Query Engine to Boost Vector Search Performance

Alibaba Cloud Big Data AI Platform

Oct 18, 2024 · Artificial Intelligence

Integrate Alibaba Cloud AI Search with Elasticsearch: A Step‑by‑Step Guide

This tutorial walks you through configuring Elasticsearch’s Open Inference API to connect with Alibaba Cloud AI Search, covering setup of text generation, rerank, sparse and dense vector services, and demonstrates end‑to‑end requests with code examples for building RAG and semantic search applications.

Alibaba Cloud AI SearchElasticsearchInference API

0 likes · 11 min read

Integrate Alibaba Cloud AI Search with Elasticsearch: A Step‑by‑Step Guide

Baobao Algorithm Notes

Oct 17, 2024 · Artificial Intelligence

How Contextual Retrieval Slashes RAG Failures by Up to 67% and Cuts Costs

Anthropic’s Contextual Retrieval augments traditional RAG with contextual embeddings and BM25, reducing retrieval failure rates by 49% (up to 67% with reranking), improving accuracy across domains, and lowering latency and cost through Claude’s prompt‑caching feature.

AIBM25Contextual Retrieval

0 likes · 11 min read

How Contextual Retrieval Slashes RAG Failures by Up to 67% and Cuts Costs

Alibaba Cloud Developer

Oct 17, 2024 · Artificial Intelligence

Build AI-Powered Java Apps Fast with Spring AI Alibaba: Features & Demo

Spring AI Alibaba is an open‑source Java framework that integrates Alibaba Cloud's large‑model services with Spring AI, offering high‑level abstractions for chat models, prompts, function calling, RAG, and conversation memory, and includes a complete ticket‑assistant example with code snippets.

AI FrameworkChatbotFunction Calling

0 likes · 17 min read

Build AI-Powered Java Apps Fast with Spring AI Alibaba: Features & Demo

AntData

Oct 16, 2024 · Artificial Intelligence

Building a Data Assistant Application with DB‑GPT V0.6.0

This tutorial walks through the end‑to‑end process of creating a data‑assistant application using DB‑GPT V0.6.0, covering prerequisite deployment, knowledge‑base construction, sub‑agent creation, RAG‑based QA, AWEL workflow installation, intent‑recognition knowledge base, and unified multi‑agent orchestration.

AIDB-GPTData Assistant

0 likes · 12 min read

Building a Data Assistant Application with DB‑GPT V0.6.0

Baobao Algorithm Notes

Oct 16, 2024 · Artificial Intelligence

How the DB3 Team Won the Meta CRAG RAG Challenge: Prompts, Retrieval, and LoRA Fine‑Tuning

This article analyzes the Meta Comprehensive RAG (CRAG) benchmark, detailing its three tasks, evaluation metrics, and the champion DB3 team's end‑to‑end solution that combines data preprocessing, dual‑stage retrieval, prompt engineering, LoRA‑based fine‑tuning, and public data augmentation to achieve top scores across all tasks.

BenchmarkKnowledge GraphLLM

0 likes · 17 min read

How the DB3 Team Won the Meta CRAG RAG Challenge: Prompts, Retrieval, and LoRA Fine‑Tuning

AI Large Model Application Practice

Oct 14, 2024 · Artificial Intelligence

Build a Multimodal RAG Pipeline with Kotaemon, Azure Document Intelligence, and VLM

This guide walks through setting up the open‑source Kotaemon framework, configuring Azure Document Intelligence and a visual large model, and implementing code to extract and caption images and tables from PDFs for end‑to‑end multimodal RAG applications.

AzurePythonRAG

0 likes · 12 min read

Build a Multimodal RAG Pipeline with Kotaemon, Azure Document Intelligence, and VLM

21CTO

Oct 10, 2024 · Artificial Intelligence

5 Practical AI Projects to Build Your Skills with Python

This article presents five hands‑on AI project ideas—from resume optimization to multimodal search—complete with step‑by‑step instructions, required Python libraries, and code snippets, helping beginners and intermediate developers quickly build valuable AI applications.

AIAutomationLLM

0 likes · 12 min read

5 Practical AI Projects to Build Your Skills with Python

DaTaobao Tech

Oct 9, 2024 · Artificial Intelligence

Building a Vertical Domain QA Bot with Vector Search, RAG, and SFT

This guide walks entry‑level developers through building a logistics‑focused QA bot by first embedding documents for vector similarity search, then adding retrieval‑augmented generation, fine‑tuning a small model, integrating hybrid checks, and optimizing deployment with feedback loops to achieve fast, accurate, out‑of‑scope‑aware answers.

AIChatbotRAG

0 likes · 15 min read

Building a Vertical Domain QA Bot with Vector Search, RAG, and SFT

DataFunTalk

Oct 9, 2024 · Artificial Intelligence

Interview on Data Fabric, Data Virtualization, and AI Integration with Denodo Leaders

In this interview, Denodo executives discuss the origins, challenges, and future of data fabric and data virtualization, explore how generative AI and retrieval‑augmented generation enhance data management, share customer success stories, and offer strategic insights for enterprises navigating digital transformation.

Data FabricDenodoEnterprise Data Management

0 likes · 19 min read

Interview on Data Fabric, Data Virtualization, and AI Integration with Denodo Leaders

DevOps

Oct 8, 2024 · Artificial Intelligence

Top 20+ Retrieval‑Augmented Generation (RAG) Interview Questions and Answers

This article presents over twenty essential Retrieval‑Augmented Generation (RAG) interview questions with detailed answers, covering fundamentals, applications, architecture, training, limitations, ethical considerations, and integration, offering AI enthusiasts and job candidates a comprehensive guide to mastering RAG concepts.

AI InterviewMachine LearningNLP

0 likes · 15 min read

Top 20+ Retrieval‑Augmented Generation (RAG) Interview Questions and Answers

Java Tech Enthusiast

Oct 8, 2024 · Artificial Intelligence

Spring AI Framework for Java Developers

Spring AI is a Java‑centric framework that unifies access to chat, text‑to‑image, embedding and retrieval‑augmented generation models—including OpenAI, Anthropic and Alibaba’s Tongyi Qianwen—through synchronous or asynchronous APIs, POJO mapping, function calling, vector‑store integration and fluent tooling for rapid AI agent development.

AI frameworksCloud ComputingFunction Calling

0 likes · 5 min read

JD Tech Talk

Oct 8, 2024 · Artificial Intelligence

Building a Retrieval‑Augmented Generation (RAG) System with Rust and Qdrant

This article explains how to construct a Retrieval‑Augmented Generation pipeline in Rust, covering knowledge‑base creation with Qdrant, model loading and embedding using the candle library, data ingestion, and integration of a Rust‑based inference service based on mistral.rs, while also discussing resource usage and common pitfalls.

AIEmbeddingLLM

0 likes · 16 min read

Building a Retrieval‑Augmented Generation (RAG) System with Rust and Qdrant

JD Cloud Developers

Oct 8, 2024 · Artificial Intelligence

How to Build a Rust-Powered Retrieval‑Augmented Generation (RAG) System from Scratch

This article explains how to construct a Retrieval‑Augmented Generation pipeline in Rust, covering knowledge‑base creation with Qdrant, model loading and embedding generation using candle, and integrating a Rust‑based inference service to answer queries with up‑to‑date external data.

EmbeddingLLMQdrant

0 likes · 17 min read

How to Build a Rust-Powered Retrieval‑Augmented Generation (RAG) System from Scratch

Architect

Oct 7, 2024 · Artificial Intelligence

Master Prompt Engineering: A Universal Framework for Building Effective LLM Prompts

This article presents a systematic, four‑part Prompt engineering framework—role definition, problem description, goal setting, and requirement specification—augmented with RAG, few‑shot examples, memory handling, and model‑parameter tuning, enabling developers to craft high‑quality prompts for large language models across diverse tasks.

Large Language ModelsModel ParametersPrompt Engineering

0 likes · 28 min read

Master Prompt Engineering: A Universal Framework for Building Effective LLM Prompts

JavaEdge

Oct 2, 2024 · Artificial Intelligence

Boost RAG Retrieval Accuracy with Contextual Embeddings and BM25

This article presents a contextual retrieval technique that combines contextual embeddings and contextual BM25 to reduce RAG miss rates by up to 67%, explains the underlying methods, implementation steps, cost considerations, experimental results, and practical deployment guidance.

AIBM25Contextual Retrieval

0 likes · 17 min read

Boost RAG Retrieval Accuracy with Contextual Embeddings and BM25

DataFunSummit

Oct 2, 2024 · Artificial Intelligence

NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

This article explains NVIDIA’s end‑to‑end stack for large language models, covering the NeMo Framework for data processing, training, and deployment, the open‑source TensorRT‑LLM inference accelerator, and the Retrieval‑Augmented Generation (RAG) technique that enriches model outputs with external knowledge.

Large Language ModelsNVIDIANeMo

0 likes · 17 min read

NVIDIA’s Solutions for Large Language Models: NeMo Framework, TensorRT‑LLM, and Retrieval‑Augmented Generation

JD Cloud Developers

Sep 30, 2024 · Artificial Intelligence

How a Large‑Model Powered Bot Boosts Logistics Ops with Smart Q&A and Data Insights

This article describes the design, implementation, and impact of a large‑model‑driven logistics chatbot that unifies knowledge Q&A, data analysis, proactive alerts, and report pushing to streamline operations for functional staff, frontline workers, and managers, dramatically reducing query time and improving decision efficiency.

AI chatbotData AnalysisEnterprise AI

0 likes · 20 min read

How a Large‑Model Powered Bot Boosts Logistics Ops with Smart Q&A and Data Insights

JD Tech Talk

Sep 30, 2024 · Artificial Intelligence

Yunli XiaoZhi: An AI‑Powered Intelligent Assistant for Knowledge Q&A and Data Analysis in Logistics Operations

The document describes the design, implementation, and operational results of Yunli XiaoZhi, an AI‑driven portable knowledge‑base and data‑analysis chatbot that consolidates SOPs, manuals, and real‑time information for logistics staff, using LangChain‑based RAG, vector databases, and large‑model prompting to improve query efficiency, proactive alerts, and reporting across multiple user groups.

AIChatbotData Analysis

0 likes · 19 min read

Yunli XiaoZhi: An AI‑Powered Intelligent Assistant for Knowledge Q&A and Data Analysis in Logistics Operations

JD Tech Talk

Sep 29, 2024 · Artificial Intelligence

Building a Simple Local AI Question‑Answer System with Java, LangChain, Ollama, and ChromaDB

This article explains how to set up a lightweight local AI Q&A system using Java, LangChain (and LangChain4J), Ollama for LLM inference, embedding techniques, and a vector database (ChromaDB), covering core concepts, environment preparation, Maven dependencies, and sample code.

AILLMLangChain

0 likes · 21 min read

Building a Simple Local AI Question‑Answer System with Java, LangChain, Ollama, and ChromaDB

JD Cloud Developers

Sep 29, 2024 · Artificial Intelligence

Build a Local AI Q&A System with Java, Ollama, and LangChain4J

This article walks through building a local AI question‑answer system using Java, Ollama, LangChain4J, embeddings, and a Chroma vector database, covering LLM fundamentals, embedding techniques, RAG architecture, setup steps, Maven dependencies, and sample code to retrieve and answer queries.

AIEmbeddingJava

0 likes · 19 min read

Build a Local AI Q&A System with Java, Ollama, and LangChain4J

21CTO

Sep 28, 2024 · Artificial Intelligence

How Digital Twins and Generative AI Are Transforming Real‑Time Monitoring

This article explores how digital twins evolve from design tools to real‑time monitoring platforms, how integrating generative AI and retrieval‑augmented generation (RAG) boosts AI accuracy and situational awareness, and why software teams must adopt these combined technologies to stay ahead in modern operations.

Digital TwinGenerative AIRAG

0 likes · 11 min read

How Digital Twins and Generative AI Are Transforming Real‑Time Monitoring

Tencent Cloud Developer

Sep 27, 2024 · Artificial Intelligence

A Comprehensive Prompt Engineering Framework: Universal Templates, RAG, Few‑Shot, Memory, and Automated Optimization

The article presents a universal four‑part prompt template—role, problem description, goal, and requirements—augmented with role definitions, RAG‑based knowledge retrieval, few‑shot examples, memory handling, temperature/top‑p tuning, and automated optimization techniques such as APE, APO, and OPRO, enabling developers to reliably craft high‑quality prompts for LLMs.

AI Prompt OptimizationLarge Language ModelsPrompt Engineering

0 likes · 26 min read

A Comprehensive Prompt Engineering Framework: Universal Templates, RAG, Few‑Shot, Memory, and Automated Optimization

iQIYI Technical Product Team

Sep 26, 2024 · Artificial Intelligence

AI-Powered Search in iQIYI: Techniques, Architecture, and Implementation

iQIYI’s AI‑powered search expands beyond title‑only queries by handling fuzzy role, plot, star, award, and semantic searches, using Chain‑of‑Thought‑generated TIPS, Retrieval‑Augmented Generation with sophisticated indexing, chunking, embedding, reranking, and prompt‑engineering to deliver personalized, accurate video recommendations that boost user engagement.

AI SearchEmbeddingQuery Guidance

0 likes · 15 min read

AI-Powered Search in iQIYI: Techniques, Architecture, and Implementation

AntData

Sep 26, 2024 · Artificial Intelligence

DB-GPT: Open-Source AI-Native Data Application Development Framework

DB‑GPT is an open‑source AI‑native data‑application framework that provides multi‑model management, Text‑to‑SQL optimization, RAG, multi‑agent collaboration, and intelligent workflow orchestration, enabling developers to build scalable large‑model database applications, with proven enterprise adoption, community growth, and academic publications.

AIData EngineeringLarge Language Models

0 likes · 6 min read

DB-GPT: Open-Source AI-Native Data Application Development Framework

JavaEdge

Sep 24, 2024 · Artificial Intelligence

Mastering RAG with LangChain4j: From Simple Setup to Advanced Retrieval‑Augmented Generation

This article explains how to extend large language models with domain‑specific knowledge using Retrieval‑Augmented Generation (RAG) in LangChain4j, covering the concepts of RAG, its indexing and retrieval stages, simple RAG setup, detailed API usage, and advanced customization options such as query transformers and content injectors.

EmbeddingJavaLLM

0 likes · 24 min read

Mastering RAG with LangChain4j: From Simple Setup to Advanced Retrieval‑Augmented Generation

Alibaba Cloud Developer

Sep 23, 2024 · Artificial Intelligence

Boosting Aviator Script Development with AI—No Model Training Required

This article details an engineering‑focused practice that uses large language models, RAG, prompt engineering, and reranking to automatically generate, review, and refine Aviator scripts for decision‑center policies without any model pre‑training, offering practical insights and code examples for developers.

AI Code GenerationAviator scriptLLM

0 likes · 29 min read

Boosting Aviator Script Development with AI—No Model Training Required

Fighter's World

Sep 22, 2024 · Artificial Intelligence

How Large-Model AI Transforms Smart Customer Service – Alibaba Cloud Insights

The talk outlines the evolution of intelligent customer service over three decades, explains how generative large-model AI like ChatGPT has raised service expectations, and presents Alibaba Cloud’s four-stage implementation—experience, efficiency, capability, and insight—through three concrete cases and a roadmap for SMEs to build their own smart service systems.

AI agentsAlibaba-CloudCustomer Service

0 likes · 12 min read

How Large-Model AI Transforms Smart Customer Service – Alibaba Cloud Insights

Senior Brother's Insights

Sep 19, 2024 · Artificial Intelligence

Rule Engines vs AI Models: Choosing the Right Approach for Product Logic

The article compares traditional rule‑engine architectures with AI‑driven models, explains their differing characteristics, outlines when deterministic rule matching is preferable over flexible AI inference, and recommends practical technologies such as Drools for rule‑based solutions and LLM‑based RAG/Agent frameworks for AI‑centric scenarios.

AIDroolsLLM

0 likes · 9 min read

Rule Engines vs AI Models: Choosing the Right Approach for Product Logic

JavaEdge

Sep 19, 2024 · Artificial Intelligence

Unlock Java LLM Power: A Deep Dive into LangChain4j Features and Architecture

LangChain4j streamlines the integration of large language models into Java applications by offering a standardized API, extensive support for over a dozen LLM providers and vector stores, a rich toolbox for RAG, chat memory, and tool calling, plus two abstraction layers that cater to both low‑level control and high‑level convenience.

AIJavaLLM

0 likes · 10 min read

Unlock Java LLM Power: A Deep Dive into LangChain4j Features and Architecture

Qunhe Technology Quality Tech

Sep 19, 2024 · Artificial Intelligence

Deploy FastGPT Locally: Step‑by‑Step Docker & Source Code Guide for RAG AI

This article explains how to set up FastGPT, a Retrieval‑Augmented Generation (RAG) knowledge‑base system powered by large language models, covering both Docker‑compose image deployment and source‑code installation, including environment configuration, database setup, and API usage examples.

AIDockerFastGPT

0 likes · 13 min read

Deploy FastGPT Locally: Step‑by‑Step Docker & Source Code Guide for RAG AI

21CTO

Sep 17, 2024 · Artificial Intelligence

How to Build AI-Powered Semantic Search with OpenAI Embeddings and Milvus

This guide explains vector embeddings, OpenAI's text‑embedding models, and how to use PyMilvus with Zilliz Cloud to create a Retrieval‑Augmented Generation (RAG) system for fast, accurate semantic search.

MilvusOpenAIPython

0 likes · 12 min read

How to Build AI-Powered Semantic Search with OpenAI Embeddings and Milvus

DevOps

Sep 13, 2024 · Artificial Intelligence

15 Advanced Retrieval‑Augmented Generation (RAG) Techniques for Production‑Ready AI Solutions

The article outlines fifteen advanced Retrieval‑Augmented Generation (RAG) techniques—from hierarchical indexing and context caching to multimodal alignment and microservice orchestration—explaining how they help transform AI prototypes into scalable, reliable production systems while highlighting common pitfalls and a concluding call to action.

AI productionLLMRAG

0 likes · 8 min read

15 Advanced Retrieval‑Augmented Generation (RAG) Techniques for Production‑Ready AI Solutions

Code Mala Tang

Sep 12, 2024 · Artificial Intelligence

Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js

This article explains the core concepts of Retrieval‑Augmented Generation (RAG), walks through its implementation steps with LangChain.js—including text chunking, embedding, storage, retrieval, and generation—and showcases practical use cases, challenges, and best practices for building reliable AI‑powered applications.

AI applicationsEmbeddingLLM

0 likes · 16 min read

Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js

Baidu Geek Talk

Sep 11, 2024 · Databases

Why Vector Databases Are the Next Big Thing in AI: A Deep Dive into RAG and Baidu’s VectorDB

This article examines the 70‑year evolution of databases, explains how large‑model AI drives the rise of vector databases and Retrieval‑Augmented Generation (RAG), outlines the four‑stage RAG workflow, compares Baidu’s self‑built VectorDB with open‑source alternatives, and showcases real‑world deployments that highlight performance, scalability, and enterprise benefits.

AIDatabase ArchitectureIndustry Insights

0 likes · 16 min read

Why Vector Databases Are the Next Big Thing in AI: A Deep Dive into RAG and Baidu’s VectorDB