Tagged articles

924 articles

Page 7 of 10

May 8, 2025 · Artificial Intelligence

Building and Optimizing a Store Smart Assistant with Aily: Architecture, Workflow, and Practical Lessons

The article details how Youzan’s Store Smart Assistant was built on the Feishu Aily platform, describing why Aily was chosen, the three‑stage development process, deep system integration, practical tips for knowledge‑base management and model stability, and the resulting efficiency gains such as handling 80% of routine queries.

AI AssistantAily platformKnowledge Base

0 likes · 24 min read

Building and Optimizing a Store Smart Assistant with Aily: Architecture, Workflow, and Practical Lessons

Alibaba Cloud Big Data AI Platform

May 6, 2025 · Artificial Intelligence

Build a Powerful RAG‑Enabled AI Q&A App with PAI‑LangStudio and Qwen3

This guide walks you through using Alibaba Cloud's PAI‑LangStudio together with the Qwen3 large language model to create an AI‑powered question‑answering system that combines Retrieval‑Augmented Generation, web search, secure deployment, and flexible customization for production use.

LangStudioMilvusQwen3

0 likes · 10 min read

Build a Powerful RAG‑Enabled AI Q&A App with PAI‑LangStudio and Qwen3

ITPUB

Apr 28, 2025 · Artificial Intelligence

How Large Language Models are Transforming Automotive Operations and Optimization

In this interview, an automotive industry expert explains how large language models and advanced operations‑optimization techniques are reshaping vehicle design, production planning, logistics, and customer services, while also discussing implementation challenges, team requirements, and future AI‑driven opportunities.

AI adoptionAutomotive AILarge Language Models

0 likes · 15 min read

How Large Language Models are Transforming Automotive Operations and Optimization

DevOps

Apr 27, 2025 · Artificial Intelligence

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

This article examines how Retrieval‑Augmented Generation (RAG), AI agents, and multimodal large‑model techniques are reshaping AI‑industry integration, discusses their technical challenges and practical implementations, and outlines future development directions across algorithms, products, and domain‑specific applications.

AI agentsArtificial IntelligenceLarge Models

0 likes · 14 min read

Large Model Technologies: RAG, AI Agents, Multimodal Applications, and Future Trends

Fun with Large Models

Apr 25, 2025 · Artificial Intelligence

Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%

This article analyzes common shortcomings of RAG pipelines—data preparation, retrieval, and LLM generation—and provides concrete optimization techniques such as advanced chunking, embedding model selection, retrieval parameter tuning, rerank models, and prompt engineering, promising up to a 20% performance gain.

ChunkingEmbeddingPrompt Engineering

0 likes · 17 min read

Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%

DataFunTalk

Apr 24, 2025 · Artificial Intelligence

Is Retrieval‑Augmented Generation (RAG) Dead Yet?

This article explains the original purpose of Retrieval‑Augmented Generation, why it remains essential despite advances in large‑context LLMs, and how combining RAG with fine‑tuning, longer context windows, and model‑context protocols yields more scalable, accurate, and privacy‑preserving AI systems.

AIKnowledge retrievalRAG

0 likes · 9 min read

Is Retrieval‑Augmented Generation (RAG) Dead Yet?

Tencent Cloud Developer

Apr 24, 2025 · Industry Insights

How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases

The article analyzes the rapid evolution of large‑model technologies—Retrieval‑Augmented Generation, autonomous agents, and multimodal AI—detailing their technical foundations, practical challenges, industry applications such as unified multimodal tasks, open‑world detection, and video moderation, and forecasting future development directions.

AI agentsLarge ModelsRAG

0 likes · 15 min read

How RAG, AI Agents, and Multimodal Models Are Reshaping Industry – Trends, Challenges, and Real‑World Cases

Big Data Technology & Architecture

Apr 22, 2025 · Artificial Intelligence

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

This article explains the fundamentals of Retrieval‑Augmented Generation, demonstrates how to create and query vector indexes using StarRocks, shows how DeepSeek provides embeddings and answer generation, and walks through a complete end‑to‑end RAG pipeline with code examples and a web UI.

AIDeepSeekEmbedding

0 likes · 20 min read

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

Alibaba Cloud Developer

Apr 21, 2025 · Artificial Intelligence

Building a Private Knowledge‑Base Q&A Agent with MCP: A Hands‑On Guide

This article explores the rapid rise of AI agents, explains the Model Context Protocol (MCP) standard, and provides a step‑by‑step implementation of a private knowledge‑base Q&A system using Python and Java, covering knowledge construction, retrieval, architecture, and deployment details.

AI AgentJavaKnowledge Base

0 likes · 9 min read

Building a Private Knowledge‑Base Q&A Agent with MCP: A Hands‑On Guide

DevOps

Apr 20, 2025 · Artificial Intelligence

Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example

This article demonstrates how to construct an AI‑powered medical knowledge base for diabetes treatment by preprocessing literature, performing semantic chunking, generating BioBERT embeddings, storing them in a FAISS vector database, and using a RAG framework together with a knowledge graph to retrieve and generate accurate answers.

BioBERTFAISSKnowledge Graph

0 likes · 12 min read

Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example

DaTaobao Tech

Apr 18, 2025 · Frontend Development

How AI Is Transforming Frontend Development: From Design‑to‑Code to Automated Testing

This article explores how AI-driven tools are reshaping frontend engineering by automating design‑to‑code conversion, interface‑to‑data‑model mapping, private component integration, code fitting, AI code review, and automated test regression, and it evaluates their impact on efficiency and future development workflows.

AIAutomationCodeGeneration

0 likes · 37 min read

How AI Is Transforming Frontend Development: From Design‑to‑Code to Automated Testing

Fun with Large Models

Apr 18, 2025 · Artificial Intelligence

How RAG Works: From Data Prep to LLM Generation Explained

This article breaks down Retrieval‑Augmented Generation (RAG) into its three core stages—data preparation, data retrieval, and LLM generation—showing how document chunking, embedding, vector databases, similarity search, and optional re‑ranking combine to let large language models produce more accurate, knowledge‑grounded answers.

EmbeddingLLMRAG

0 likes · 9 min read

How RAG Works: From Data Prep to LLM Generation Explained

Data Thinking Notes

Apr 17, 2025 · Artificial Intelligence

How Dify Accelerates Generative AI App Development with Low‑Code and Modular Design

Dify is an open‑source LLM application platform that blends BaaS and LLMOps, offering low‑code development, modular components, extensive model support, and advanced retrieval features, while also detailing its current limitations and recent enhancements such as MySQL integration and Elasticsearch‑based RAG capabilities.

AIElasticsearchLLM

0 likes · 7 min read

How Dify Accelerates Generative AI App Development with Low‑Code and Modular Design

Java Architecture Diary

Apr 14, 2025 · Artificial Intelligence

How to Empower LLMs with a Private SearXNG Search Engine for Real‑Time Knowledge

This guide explains why large language models need private search capabilities, outlines the benefits of a self‑hosted SearXNG engine, provides step‑by‑step Docker deployment, and demonstrates Java integration using LangChain4j for both basic queries and retrieval‑augmented generation (RAG).

DockerLLMLangChain4j

0 likes · 6 min read

How to Empower LLMs with a Private SearXNG Search Engine for Real‑Time Knowledge

Data Thinking Notes

Apr 13, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation Knowledge Base with DeepSeek and RAGFlow

This guide walks you through the fundamentals of Retrieval‑Augmented Generation, introduces the open‑source RAGFlow framework, details installation steps, shows how to integrate DeepSeek LLMs, and explores practical application scenarios such as intelligent customer service and enterprise document QA.

AIDeepSeekLLM

0 likes · 11 min read

How to Build a Retrieval‑Augmented Generation Knowledge Base with DeepSeek and RAGFlow

Spring Full-Stack Practical Cases

Apr 10, 2025 · Artificial Intelligence

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

This guide walks through creating a Retrieval‑Augmented Generation (RAG) system using Spring Boot 3.4.2, Milvus vector database, and the bge‑m3 embedding model via Ollama, covering environment setup, dependency configuration, vector store operations, and integration with a large language model to deliver refined, similarity‑based answers.

EmbeddingLLMMilvus

0 likes · 11 min read

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

Alibaba Cloud Big Data AI Platform

Apr 10, 2025 · Artificial Intelligence

Building a Pet Hospital AI Assistant with RAG and LLMs

This article walks through the motivation, core concepts of Retrieval‑Augmented Generation, and a step‑by‑step guide to constructing a pet‑hospital AI assistant on Alibaba Cloud using LLMs, vector databases, and automated pipelines, complete with code examples and practical tips.

AI AssistantAlibaba CloudLLM

0 likes · 18 min read

Building a Pet Hospital AI Assistant with RAG and LLMs

Architect's Alchemy Furnace

Apr 8, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), its three‑step workflow of retrieval, augmentation, and generation, its key advantages such as improved accuracy and explainability, and compares RAG with traditional pre‑trained models, fine‑tuned models, hybrid models, knowledge‑distillation methods, and RLHF, while also covering vector, full‑text, and hybrid retrieval modes and the role of rerank models.

AIKnowledge retrievalRAG

0 likes · 18 min read

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

Alibaba Cloud Developer

Apr 8, 2025 · Artificial Intelligence

Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration

This article introduces the fundamental paradigms of large language models, explaining how simple prompts, messages, and tools like RAG and ReAct enable powerful applications, while providing practical code examples, translation strategies, and insights on prompt engineering, tool usage, and model fine‑tuning.

AILLM applicationsLarge Language Models

0 likes · 23 min read

Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration

dbaplus Community

Apr 7, 2025 · Databases

How Do LLMs Tackle Oracle Bad Block Errors? A Hands‑On Evaluation

This article presents a hands‑on evaluation of several large language models—including Mistral‑Small, Deepseek‑r1, Llama 3.3 and ChatGPT‑4‑go—on Oracle database bad‑block errors, RAG‑based document retrieval, and log‑driven reasoning, revealing performance gaps, scoring results, and practical DBA implications.

AIDatabaseLLM evaluation

0 likes · 11 min read

How Do LLMs Tackle Oracle Bad Block Errors? A Hands‑On Evaluation

Beijing SF i-TECH City Technology Team

Apr 7, 2025 · Artificial Intelligence

LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing

This article explores the application of Large Language Models (LLM) in text information detection and extraction, focusing on blue-collar recruitment data processing. It details the implementation of LLM through prompt engineering, RAG enhancement, and model fine-tuning to improve data cleaning efficiency and accuracy.

AI applicationsLLMPrompt Engineering

0 likes · 31 min read

LLM Application in Text Information Detection and Extraction: A Case Study of Blue-Collar Recruitment Data Processing

DataFunSummit

Apr 7, 2025 · Artificial Intelligence

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

This article examines how Retrieval‑Augmented Generation (RAG) and multi‑agent technologies narrow the gap between large language models and practical deployment, highlighting their roles in operations automation, financial risk control, intelligent data governance, database localization, edge inference, and future AI‑driven solutions.

Data GovernanceLarge Language ModelsOperations Automation

0 likes · 8 min read

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

JD Tech Talk

Apr 7, 2025 · Artificial Intelligence

Best Practices for Building AI Agents: Prompt Design, Tool Management, and Context Optimization

This article explains how to develop robust AI agents by breaking down large prompts, selecting appropriate tools, managing context efficiently, and applying modular design principles to reduce token costs, avoid hallucinations, and improve overall performance and reliability.

AI AgentBest PracticesLLM

0 likes · 11 min read

Best Practices for Building AI Agents: Prompt Design, Tool Management, and Context Optimization

JD Cloud Developers

Apr 7, 2025 · Artificial Intelligence

Why Bigger Prompts Fail: Modular Strategies for Building Efficient AI Agents

This article explains why overloading prompts and tools harms AI‑Agent performance, and offers practical modular design, intent‑driven instruction splitting, and efficient context management strategies such as curated function‑call tools and dynamic RAG to reduce token costs, improve response speed, and avoid hallucinations.

AI AgentLLMModular Design

0 likes · 13 min read

Why Bigger Prompts Fail: Modular Strategies for Building Efficient AI Agents

Big Data Technology & Architecture

Apr 3, 2025 · Artificial Intelligence

Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration

This article explains the Model Context Protocol (MCP) as a standard for LLM‑data integration, describes Retrieval‑Augmented Generation (RAG) techniques to reduce hallucinations, and introduces vector databases like Milvus that store high‑dimensional embeddings for efficient AI retrieval tasks.

LLMMCPMilvus

0 likes · 7 min read

Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration

DevOps

Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval-Augmented Generation (RAG): Concepts, Evolution, and Types

This article explains Retrieval‑Augmented Generation (RAG), its role in mitigating large language model knowledge cutoff and hallucination, outlines the evolution from naive to advanced, modular, graph, and agentic RAG, and discusses future directions such as intelligent and multi‑modal RAG systems.

Artificial IntelligenceKnowledge retrievalLLM

0 likes · 10 min read

Understanding Retrieval-Augmented Generation (RAG): Concepts, Evolution, and Types

AntTech

Apr 2, 2025 · Artificial Intelligence

PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead

The PEAR framework introduces a position‑embedding‑agnostic attention re‑weighting method that detects and suppresses detrimental attention heads in large language models, dramatically improving retrieval‑augmented generation performance without adding any inference overhead, as demonstrated on multiple RAG benchmarks and LLM families.

Attention Re-weightingLLMPEAR

0 likes · 6 min read

PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead

Tencent Cloud Developer

Apr 2, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Retrieval‑Augmented Generation (RAG) enhances large language models by fetching up‑to‑date external knowledge before generation, mitigating knowledge‑cutoff limits and hallucinations through a retrieval step (using text, vector, or graph methods) and a generation step, evolving from naive single‑method approaches to advanced, modular, graph‑based, and agentic systems that enable adaptive, multi‑hop reasoning and future intelligent, multimodal pipelines.

AIHallucination MitigationKnowledge retrieval

0 likes · 9 min read

Understanding Retrieval‑Augmented Generation (RAG): Concepts, Types, and Development

Architect

Apr 1, 2025 · Artificial Intelligence

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

The article explains why most projects should start with prompt engineering or simple agent workflows, outlines the scenarios where model fine‑tuning adds real value, compares fine‑tuning with Retrieval‑Augmented Generation, and offers practical criteria for deciding which approach to adopt.

AI deploymentLarge Language ModelsLoRA

0 likes · 9 min read

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

Architect

Mar 30, 2025 · Artificial Intelligence

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques

This article provides a comprehensive survey of Retrieval‑Augmented Generation (RAG), covering its basic principles, key components, seven technical variants, challenges, evaluation methods, and future research directions across multimodal, graph‑based, and agentic extensions.

AI SurveyKnowledge retrievalLarge Language Models

0 likes · 9 min read

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques

Architect

Mar 29, 2025 · Artificial Intelligence

How Non‑AI Developers Can Build Powerful LLM Apps: Prompt Engineering, RAG, and AI Agents Explained

This article guides developers without an AI background through the fundamentals of building large‑language‑model applications, covering prompt engineering, multi‑turn interaction, function calling, retrieval‑augmented generation, vector databases, code assistants, and the MCP protocol for AI agents.

AI AgentEmbeddingFunction Calling

0 likes · 51 min read

How Non‑AI Developers Can Build Powerful LLM Apps: Prompt Engineering, RAG, and AI Agents Explained

Architect

Mar 26, 2025 · Artificial Intelligence

Agent Memory Mechanisms and Dify Knowledge Base Segmentation & Retrieval Details

This article explains the fundamentals of AI agent memory—including short‑term, long‑term, and working memory types and their storage designs—and then details Dify's knowledge‑base segmentation modes, indexing strategies, and retrieval configurations for effective RAG applications.

Agent MemoryDifyKnowledge Base

0 likes · 14 min read

Agent Memory Mechanisms and Dify Knowledge Base Segmentation & Retrieval Details

Architect's Alchemy Furnace

Mar 26, 2025 · Artificial Intelligence

Mastering AI Knowledge Bases with Dify: From Creation to Advanced Retrieval

This guide explains how Dify visualizes RAG pipelines, lets developers upload and structure documents, choose segmentation modes, configure indexing and retrieval settings, and leverage embeddings and vector search to build fast, accurate, and up‑to‑date AI knowledge bases.

AIDifyEmbedding

0 likes · 26 min read

Mastering AI Knowledge Bases with Dify: From Creation to Advanced Retrieval

DaTaobao Tech

Mar 26, 2025 · Artificial Intelligence

Overview of Retrieval-Augmented Generation (RAG) and Related AI Technologies

The article surveys Retrieval‑Augmented Generation (RAG) as a solution to large language model limits—such as outdated knowledge, hallucinations, and security risks—by integrating vector‑database retrieval with LLM generation, and discusses related tools, multi‑agent frameworks, prompt engineering, fine‑tuning methods, and emerging optimization trends.

AI applicationsLLMPrompt Engineering

0 likes · 29 min read

Overview of Retrieval-Augmented Generation (RAG) and Related AI Technologies

Architects' Tech Alliance

Mar 25, 2025 · Artificial Intelligence

Why Large Language Models Need RAG and Fine‑Tuning for Vertical Domains

The article analyzes major limitations of large language models—hallucination, outdated knowledge, and insufficient domain expertise—and explains how Retrieval‑Augmented Generation and various fine‑tuning strategies can address these issues while outlining practical cost considerations.

RAGdomain adaptationfine-tuning

0 likes · 4 min read

Why Large Language Models Need RAG and Fine‑Tuning for Vertical Domains

AI Large Model Application Practice

Mar 24, 2025 · Artificial Intelligence

How to Build a Multimodal RAG Pipeline for PPT Documents with Vision LLMs

This article explains a step‑by‑step implementation of a multimodal Retrieval‑Augmented Generation system that parses PPT/PDF files, extracts rich text and images with vision models, indexes them in a vector store, and generates answers that combine markdown and relevant slide screenshots.

LLMMultimodalPython

0 likes · 9 min read

How to Build a Multimodal RAG Pipeline for PPT Documents with Vision LLMs

Architect

Mar 22, 2025 · Artificial Intelligence

Understanding and Mitigating Failures in Retrieval‑Augmented Generation (RAG) Systems

Retrieval‑augmented generation (RAG) combines external knowledge retrieval with large language models to improve answer accuracy, but it often suffers from retrieval mismatches, algorithmic flaws, chunking issues, embedding biases, inefficiencies, generation errors, reasoning limits, formatting problems, system‑level failures, and high resource costs, which this article analyzes and offers solutions for.

AI reliabilityLLMRAG

0 likes · 32 min read

Understanding and Mitigating Failures in Retrieval‑Augmented Generation (RAG) Systems

Architect

Mar 19, 2025 · Artificial Intelligence

Choosing the Best Embedding Model for RAG: A Practical Guide Using MTEB Rankings

This guide explains how to leverage the Massive Text Embedding Benchmark (MTEB) to identify high‑performing embedding models for Retrieval‑Augmented Generation (RAG) and outlines key factors such as model size, dimension, language support, resource requirements, inference speed, domain suitability, long‑text handling, scalability, and cost.

AIEmbeddingMTEB

0 likes · 12 min read

Choosing the Best Embedding Model for RAG: A Practical Guide Using MTEB Rankings

Ops Development & AI Practice

Mar 19, 2025 · Artificial Intelligence

Can Cache‑Augmented Generation Outperform RAG? A Deep Dive into LLM Efficiency

Cache‑augmented generation (CAG) preloads documents into LLM context using KV caches to eliminate retrieval latency, offering faster inference for static knowledge bases, while RAG remains more flexible for dynamic or large corpora; this article compares their definitions, performance, implementation steps, and future prospects.

CAGCache AugmentationInference Optimization

0 likes · 11 min read

Can Cache‑Augmented Generation Outperform RAG? A Deep Dive into LLM Efficiency

Alibaba Cloud Native

Mar 19, 2025 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide

This article explains the Retrieval‑Augmented Generation (RAG) paradigm, walks through its four core steps, and provides a detailed Spring AI implementation—including configuration, vector storage, REST controller, multi‑query expansion, query rewriting, document joining, and error handling—plus best‑practice recommendations for production deployments.

AIJavaRAG

0 likes · 23 min read

Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide

DaTaobao Tech

Mar 19, 2025 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Principles, Challenges, and Implementation Techniques

Retrieval‑augmented generation (RAG) enhances large language models by integrating a preprocessing pipeline—cleaning, chunking, embedding, and vector storage—with a query‑driven retrieval and prompt‑injection workflow, leveraging vector databases, multi‑stage recall, advanced prompting, and comprehensive evaluation metrics to mitigate knowledge cut‑off, hallucinations, and security issues.

LLMRAGRetrieval-Augmented Generation

0 likes · 27 min read

Retrieval Augmented Generation (RAG): Principles, Challenges, and Implementation Techniques

Architect's Alchemy Furnace

Mar 18, 2025 · Artificial Intelligence

How to Build an AI Agent with Ollama: From Model Setup to Knowledge Base

This step‑by‑step guide shows how to create an AI Agent by configuring a local Ollama model, selecting an embedding model, building a knowledge base, uploading documents, and testing the agent's retrieval capabilities, providing a practical RAG workflow for developers.

AI AgentEmbeddingOllama

0 likes · 8 min read

How to Build an AI Agent with Ollama: From Model Setup to Knowledge Base

Architect

Mar 15, 2025 · Artificial Intelligence

Why Building Your Own RAG System Is a Costly Mistake

The article explains that developing a custom Retrieval‑Augmented Generation (RAG) solution incurs hidden infrastructure, personnel, and security costs, leads to operational overload and budget overruns, and is rarely justified compared to purchasing a proven vendor solution.

AILLMRAG

0 likes · 11 min read

Why Building Your Own RAG System Is a Costly Mistake

Alibaba Cloud Developer

Mar 14, 2025 · Artificial Intelligence

Solving Rate Limiting, Load Balancing, and Data Challenges in AI Inference with Tair

This article explains how AI inference services can tackle five core problems—rate limiting, load balancing, asynchronous processing, user data management, and index enhancement—by leveraging Tair's rich data structures, offering practical code examples, architectural diagrams, and a comparison with alternative solutions.

AI inferenceRAGRate Limiting

0 likes · 20 min read

Solving Rate Limiting, Load Balancing, and Data Challenges in AI Inference with Tair

DaTaobao Tech

Mar 14, 2025 · Artificial Intelligence

AI-Driven Engineering Efficiency: Practices and Insights from a Live-Streaming Team

The article recounts a live‑streaming team’s six‑month experiment using large‑language‑model AI to boost backend, frontend, testing, data‑science and data‑engineering productivity, detailing goals, LLM strengths and limits, and practical tactics such as task splitting, input refinement, human‑AI guidance, retrieval‑augmented generation and fine‑tuning, while emphasizing disciplined task design, prompt iteration, and future vertical integrations.

AIPrompt EngineeringRAG

0 likes · 17 min read

AI-Driven Engineering Efficiency: Practices and Insights from a Live-Streaming Team

Architect

Mar 13, 2025 · Industry Insights

How DeepSearch Redefines AI-Powered Search: Architecture, Iterative Reasoning, and Lessons Learned

This article analyzes the emergence of DeepSearch and DeepResearch as next‑generation AI search frameworks, detailing their iterative search‑read‑reason loop, design decisions, implementation challenges, code architecture, and practical insights for building robust LLM‑driven agents in 2025.

AI SearchDeepSearchIterative Reasoning

0 likes · 24 min read

How DeepSearch Redefines AI-Powered Search: Architecture, Iterative Reasoning, and Lessons Learned

Alibaba Cloud Developer

Mar 12, 2025 · Artificial Intelligence

How to Optimize DB‑GPT RAG Pipelines for Better Retrieval and Knowledge Processing

This article explains how to use the DB‑GPT application framework to improve Retrieval‑Augmented Generation (RAG) by detailing knowledge loading, chunking, embedding, graph extraction, storage options, retrieval strategies, workflow customization, evaluation metrics, and real‑world case studies.

AIDB-GPTRAG

0 likes · 27 min read

How to Optimize DB‑GPT RAG Pipelines for Better Retrieval and Knowledge Processing

Tencent Technical Engineering

Mar 10, 2025 · Artificial Intelligence

How Non‑AI Developers Can Build LLM Apps: Prompt Engineering, RAG, and Function Calling Explained

This guide shows non‑AI developers how to create large‑model applications by mastering prompt engineering, multi‑turn interactions, Retrieval‑Augmented Generation, function calling, and AI‑Agent integration, with practical code examples, tool design patterns, and deployment tips.

AI AgentEmbeddingFunction Calling

0 likes · 48 min read

How Non‑AI Developers Can Build LLM Apps: Prompt Engineering, RAG, and Function Calling Explained

DevOps

Mar 9, 2025 · Artificial Intelligence

A Beginner's Guide to Building Large Language Model Applications: Prompt Engineering, Retrieval‑Augmented Generation, Function Calling, and AI Agents

This article provides a comprehensive introduction to developing large language model (LLM) applications, covering prompt engineering, zero‑ and few‑shot techniques, function calling, retrieval‑augmented generation (RAG) with embedding and vector databases, code assistants, and the MCP protocol for building AI agents, all aimed at non‑AI specialists.

AI AgentEmbeddingFunction Calling

0 likes · 48 min read

A Beginner's Guide to Building Large Language Model Applications: Prompt Engineering, Retrieval‑Augmented Generation, Function Calling, and AI Agents

Mingyi World Elasticsearch

Mar 6, 2025 · Artificial Intelligence

Smart Q&A Knowledge Base Powered by Qwen2.5‑14B and Elasticsearch RAG

This article details a smart Q&A knowledge‑base system that integrates the Qwen2.5‑14B large language model with Elasticsearch vector search via RAG, covering data ingestion with FSCrawler, Chinese sentence embedding, Gradio UI, performance tests on a 483‑page book, architecture diagrams, code walkthroughs, and suggested enhancements.

Chinese EmbeddingElasticsearchFSCrawler

0 likes · 11 min read

Smart Q&A Knowledge Base Powered by Qwen2.5‑14B and Elasticsearch RAG

Alibaba Cloud Big Data AI Platform

Mar 5, 2025 · Artificial Intelligence

How to Unlock Milvus 2.5’s Full‑Text and Hybrid Search for RAG Applications

This guide walks you through Milvus 2.5’s new full‑text, keyword‑matching, and hybrid search capabilities, showing how to set up the schema, install dependencies, prepare data with LangChain, and combine dense and sparse vectors for RAG‑enabled retrieval using Python.

Full-Text SearchHybrid SearchLangChain

0 likes · 14 min read

How to Unlock Milvus 2.5’s Full‑Text and Hybrid Search for RAG Applications

Cognitive Technology Team

Mar 4, 2025 · Artificial Intelligence

Deep Searcher: An Open‑Source Agentic RAG Framework for Enterprise‑Level Search and Knowledge Retrieval

The article introduces Deep Searcher, an open‑source Agentic Retrieval‑Augmented Generation system that combines large language models, Milvus vector databases, and multi‑step reasoning to deliver enterprise‑grade search, reporting, and complex query capabilities, and compares its performance against traditional RAG and Graph RAG approaches.

LLMOpen SourceRAG

0 likes · 18 min read

Deep Searcher: An Open‑Source Agentic RAG Framework for Enterprise‑Level Search and Knowledge Retrieval

Alibaba Cloud Developer

Mar 4, 2025 · Artificial Intelligence

Build a Smart Knowledge Base with DeepSeek R1 and Alibaba Cloud Low‑Code

This tutorial guides you through creating an AI‑powered, customizable knowledge space by integrating DeepSeek R1 via Alibaba Cloud Bailei's Model‑as‑a‑Service with the low‑code Mobinext platform, covering setup, configuration, deployment, and future expansion for multi‑tenant use.

AIAlibaba CloudDeepSeek

0 likes · 12 min read

Build a Smart Knowledge Base with DeepSeek R1 and Alibaba Cloud Low‑Code

Huawei Cloud Developer Alliance

Mar 4, 2025 · Artificial Intelligence

Build a RAG Vector Database with DeepSeek on a Cloud Host – Step‑by‑Step Guide

This tutorial explains how to deploy the DeepSeek‑r1:1.5b model on a cloud server using Ollama, create a retrieval‑augmented generation (RAG) vector database with the mxbai‑embed‑large embedding model, and build an interactive AI application that answers questions from uploaded PDFs.

AIDeepSeekOllama

0 likes · 6 min read

Build a RAG Vector Database with DeepSeek on a Cloud Host – Step‑by‑Step Guide

Tencent Cloud Developer

Mar 4, 2025 · Artificial Intelligence

A Practical Guide to Building Large Language Model Applications: Prompt Engineering, Retrieval‑Augmented Generation, Function Calling and AI Agents

The guide teaches non‑AI developers how to build practical LLM‑powered applications by mastering prompt engineering, function calling, retrieval‑augmented generation, and AI agents, and introduces the Modal Context Protocol for seamless tool integration, offering a clear learning path to leverage large language models without deep theory.

AI AgentFunction CallingLLM

0 likes · 48 min read

A Practical Guide to Building Large Language Model Applications: Prompt Engineering, Retrieval‑Augmented Generation, Function Calling and AI Agents

Alibaba Cloud Big Data AI Platform

Mar 4, 2025 · Artificial Intelligence

Deploy a High‑Performance RAG Service with Hologres, DeepSeek, and PAI‑EAS

This guide walks you through building a Retrieval‑Augmented Generation (RAG) system by integrating Alibaba Cloud's Hologres vector store, the Proxima high‑performance vector engine, and DeepSeek large language models via PAI‑EAS, covering prerequisites, deployment steps, configuration, and inference verification.

AI deploymentDeepSeekHologres

0 likes · 12 min read

Deploy a High‑Performance RAG Service with Hologres, DeepSeek, and PAI‑EAS

AI Large Model Application Practice

Mar 3, 2025 · Artificial Intelligence

Can DeepSeek‑R1 Unlock True “Deep Thinking” for Enterprise RAG?

This article examines how swapping in DeepSeek‑R1 enhances Retrieval‑Augmented Generation with deeper reasoning, outlines its benefits and pitfalls—including slower inference, higher compute costs, and hallucinations—provides a simple hallucination test, and proposes an Agentic RAG research assistant to balance accuracy and creativity.

AI reasoningDeepSeekLLM

0 likes · 10 min read

Can DeepSeek‑R1 Unlock True “Deep Thinking” for Enterprise RAG?

Cognitive Technology Team

Feb 28, 2025 · Artificial Intelligence

Comparative Study of Traditional RAG, GraphRAG, and DeepSearcher for Knowledge Retrieval and Generation

This article examines why Retrieval‑Augmented Generation (RAG) is needed, compares traditional RAG, GraphRAG, and the DeepSearcher framework across architecture, data organization, retrieval mechanisms, result generation, efficiency and accuracy, and provides step‑by‑step implementation guides and experimental results using vector and graph databases.

Artificial IntelligenceDeepSearcherGraph Database

0 likes · 20 min read

Comparative Study of Traditional RAG, GraphRAG, and DeepSearcher for Knowledge Retrieval and Generation

JavaEdge

Feb 27, 2025 · Artificial Intelligence

How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud

This guide walks through deploying the full‑feature DeepSeek V3+R1 model on Tencent Cloud, configuring a smart knowledge‑base application, importing documentation, enabling internet search, tuning retrieval parameters, and publishing the app for public use, all without writing code.

AIDeepSeekKnowledge Base

0 likes · 6 min read

How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud

Alibaba Cloud Big Data AI Platform

Feb 25, 2025 · Artificial Intelligence

Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio

This step‑by‑step guide shows how to assemble a Retrieval‑Augmented Generation (RAG) system using Alibaba Cloud Milvus vector search, the DeepSeek large language model, and PAI LangStudio, covering instance creation, data upload, model deployment, connection setup, flow design, and service invocation.

AI TutorialDeepSeekLLM

0 likes · 9 min read

Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio

Ma Wei Says

Feb 23, 2025 · Artificial Intelligence

How Microsoft’s PIKE‑RAG Builds Knowledge‑Driven AI Across Four Stages

The article explains Microsoft’s open‑source PIKE‑RAG system, detailing its four progressive stages—from knowledge‑base construction to creative multi‑agent reasoning—while describing the underlying modules, chunking strategies, multi‑granularity retrieval, and code snippets that enable specialized domain understanding and inference.

AI RetrievalKnowledge GraphLLM

0 likes · 11 min read

How Microsoft’s PIKE‑RAG Builds Knowledge‑Driven AI Across Four Stages

dbaplus Community

Feb 23, 2025 · Databases

Why Vector Databases Are Really Just Search Engines in Disguise

The article traces the evolution of embedding technology from a secret weapon of tech giants to a mainstream developer tool, explains the rapid rise and subsequent integration of vector databases into traditional search engines, and argues that vector databases are essentially search engines with added vector capabilities.

AI infrastructureEmbeddingsRAG

0 likes · 9 min read

Why Vector Databases Are Really Just Search Engines in Disguise

ZhongAn Tech Team

Feb 22, 2025 · Artificial Intelligence

How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave

This issue reviews China's first open‑source short‑film model SkyReels‑V1, DeepSeek's Native Sparse Attention breakthrough, xAI's massive Grok‑3 deployment on 200k H100 GPUs, and a knowledge‑graph‑guided RAG framework, highlighting their performance gains, architectural innovations, and industry impact.

AIKnowledge GraphLarge Models

0 likes · 15 min read

How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave

Ma Wei Says

Feb 22, 2025 · Artificial Intelligence

Pick the Best Vector Similarity Strategy in LangChain: Euclidean, Inner Product, Cosine

This guide explains how to configure and import the appropriate DistanceStrategy in LangChain, compares Euclidean distance, maximum inner product, and cosine similarity, and outlines their formulas, advantages, and typical use‑cases for vector‑based retrieval.

AI embeddingsLangChainRAG

0 likes · 6 min read

Pick the Best Vector Similarity Strategy in LangChain: Euclidean, Inner Product, Cosine

Rare Earth Juejin Tech Community

Feb 22, 2025 · Artificial Intelligence

Deploying DeepSeek Locally with Ollama, Building Personal and Organizational Knowledge Bases, and Integrating with Spring AI

This guide explains how to locally deploy the DeepSeek large‑language model using Ollama on Windows, macOS, and Linux, configure model storage and CORS, build personal and enterprise RAG knowledge bases with AnythingLLM and Open WebUI, and integrate the model into a Spring AI application via Docker and Docker‑Compose.

ContainerizationDeepSeekDocker

0 likes · 16 min read

Deploying DeepSeek Locally with Ollama, Building Personal and Organizational Knowledge Bases, and Integrating with Spring AI

Architecture and Beyond

Feb 22, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG) and Its Role in Enhancing Large Language Models

The article explains how the inherent knowledge‑staleness, hallucination, lack of private data, non‑traceable output, limited long‑text handling, and data‑security concerns of large language models can be mitigated by Retrieval‑Augmented Generation, which combines external retrieval, augmentation, and generation to provide up‑to‑date, reliable, and secure AI responses.

AIKnowledge augmentationLLM

0 likes · 15 min read

Understanding Retrieval‑Augmented Generation (RAG) and Its Role in Enhancing Large Language Models

DataFunSummit

Feb 21, 2025 · Artificial Intelligence

Multimodal Retrieval‑Augmented Generation (RAG): Implementation Paths and Future Prospects

This article explores multimodal Retrieval‑Augmented Generation (RAG), detailing five core topics—including semantic extraction, visual‑language models, scaling strategies, technical roadmap choices, and a Q&A—while presenting three implementation pathways, performance evaluations, and future directions for AI‑driven document understanding.

RAGTensor Retrievaldocument understanding

0 likes · 11 min read

Multimodal Retrieval‑Augmented Generation (RAG): Implementation Paths and Future Prospects

Ma Wei Says

Feb 21, 2025 · Artificial Intelligence

How PIKE‑RAG Boosts Retrieval‑Augmented Generation for Industrial AI

PIKE‑RAG, a Retrieval‑Augmented Generation framework from Microsoft Research, tackles knowledge source diversity, one‑size‑fits‑all limitations, and LLMs' lack of domain expertise by building multi‑layer heterogeneous graphs, task‑driven modular pipelines, and a staged L0‑L4 system for more accurate industrial AI responses.

AIKnowledgeGraphLLM

0 likes · 5 min read

How PIKE‑RAG Boosts Retrieval‑Augmented Generation for Industrial AI

Baobao Algorithm Notes

Feb 20, 2025 · Industry Insights

How DeepSeek R1 Is Redefining Large‑Model Engineer Roles and the AI Job Market

The article analyzes DeepSeek R1’s release, showing how rising base‑model thresholds, a shift toward infrastructure‑centric skills, and the rise of retrieval‑augmented generation are rapidly diminishing traditional large‑model algorithm engineer positions while reshaping the broader AI industry landscape.

AGIAI IndustryDeepSeek

0 likes · 6 min read

How DeepSeek R1 Is Redefining Large‑Model Engineer Roles and the AI Job Market

Tencent Cloud Developer

Feb 20, 2025 · Artificial Intelligence

Build Your Own Private Knowledge Base with Cloud Studio DeepSeek R1 in Minutes

This guide explains what a knowledge base and Retrieval‑Augmented Generation (RAG) are, why personal knowledge bases are valuable, and provides step‑by‑step instructions for using Cloud Studio's DeepSeek‑R1 CPU template to set up and query a private knowledge base with Open‑WebUI or AnythingLLM.

AI TutorialCloud StudioDeepSeek

0 likes · 8 min read

Build Your Own Private Knowledge Base with Cloud Studio DeepSeek R1 in Minutes

Alibaba Cloud Developer

Feb 20, 2025 · Artificial Intelligence

How LLMs Power Real-Time Interactive 3D Worlds in Unreal Engine

This article explains how large language models are integrated with Unreal Engine to enable natural‑language‑driven 3D model search, manipulation, and scene understanding, detailing metadata extraction, vision‑language labeling, RAG‑based retrieval, and function‑call translation for interactive virtual environments.

3D interactionLLMRAG

0 likes · 21 min read

How LLMs Power Real-Time Interactive 3D Worlds in Unreal Engine

Alibaba Cloud Big Data AI Platform

Feb 19, 2025 · Artificial Intelligence

Build a DeepSeek AI Assistant with PAI‑RAG: Internet Search & Enterprise Knowledge Base

This guide walks you through using Alibaba Cloud's PAI‑RAG platform to deploy a DeepSeek large‑language‑model assistant that combines real‑time web search with an enterprise knowledge‑base, covering deployment, network‑search configuration, testing, and advanced enterprise features.

AI AssistantDeepSeekEnterprise Knowledge Base

0 likes · 10 min read

Build a DeepSeek AI Assistant with PAI‑RAG: Internet Search & Enterprise Knowledge Base

JD Tech Talk

Feb 18, 2025 · Artificial Intelligence

Agent Applications in Advertising at JD.com: Technical Implementation and Platform Architecture

This article explores how JD.com's advertising team leverages Agent technology to enhance advertising operations through AI-driven automation, covering technical implementations like RAG, Function Call capabilities, and platform architecture for scalable AI solutions.

AI PlatformAgentJD.com

0 likes · 23 min read

Agent Applications in Advertising at JD.com: Technical Implementation and Platform Architecture

JD Cloud Developers

Feb 18, 2025 · Artificial Intelligence

How JD Advertising Leverages AI Agents to Boost Ad Operations

This article details JD Advertising's practical use of large‑model agents—including RAG, function calls, and workflow automation—to improve service efficiency, monitoring, and developer productivity across its advertising platform.

AIAdvertisingAgent

0 likes · 26 min read

How JD Advertising Leverages AI Agents to Boost Ad Operations

JD Retail Technology

Feb 18, 2025 · Artificial Intelligence

Engineering Practices of JD Advertising Agent: JDZunTong Intelligent Assistant

JD’s advertising R&D team created the JDZunTong Intelligent Assistant by engineering a modular Agent platform that combines advanced Retrieval‑Augmented Generation (RAG 1.0 → 2.0) and Function‑Call capabilities, a visual designer, custom tool registration, and a native Python workflow engine to deliver intelligent customer service, data queries, and ad creation for merchants.

AIAgentJD Advertising

0 likes · 18 min read

Engineering Practices of JD Advertising Agent: JDZunTong Intelligent Assistant

macrozheng

Feb 17, 2025 · Artificial Intelligence

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

This guide explains why DeepSeek4j is needed, its core features, and provides step‑by‑step instructions—including dependency setup, configuration, code examples, and a complete RAG pipeline using Milvus—to help developers quickly create a private AI knowledge base with Spring Boot.

AIDeepSeek4jMilvus

0 likes · 12 min read

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

Liangxu Linux

Feb 16, 2025 · Artificial Intelligence

Build a Free Private AI with DeepSeek, Ollama, and Local Knowledge Base

This guide explains how to locally deploy the open‑source DeepSeek model using Ollama, enhance interaction with Chatbox and Page Assist, and connect a local knowledge base via AnythingLLM's RAG architecture, providing step‑by‑step instructions, hardware requirements, and API examples for a self‑hosted AI system.

AI deploymentAnythingLLMDeepSeek

0 likes · 22 min read

Build a Free Private AI with DeepSeek, Ollama, and Local Knowledge Base

AIWalker

Feb 14, 2025 · Artificial Intelligence

ImageRAG: Leveraging RAG and AIGC to Elevate Image Generation Quality

ImageRAG introduces a dynamic retrieval‑augmented generation framework that integrates visual language models and CLIP‑based similarity search to supply reference images, enabling diffusion models like OmniGen and SDXL to better render rare and fine‑grained concepts, as demonstrated through extensive quantitative and qualitative experiments.

AIGCImageRAGOmniGen

0 likes · 18 min read

ImageRAG: Leveraging RAG and AIGC to Elevate Image Generation Quality

DataFunSummit

Feb 14, 2025 · Artificial Intelligence

Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform

This presentation details how Alibaba Cloud's AI platform integrates big‑data pipelines, feature‑store services, and large language model capabilities to construct high‑performance search‑recommendation architectures, covering system design, training and inference optimizations, LLM‑driven use cases, and open‑source RAG tooling.

AI PlatformBig DataFeature Store

0 likes · 17 min read

Building Large‑Scale Recommendation Systems with Big Data and Large Language Models on Alibaba Cloud AI Platform

Alibaba Cloud Big Data AI Platform

Feb 14, 2025 · Artificial Intelligence

Deploy a DeepSeek AI App with Web Search & Private Knowledge Base in 30 Minutes

This guide walks you through deploying DeepSeek models on Alibaba Cloud PAI, integrating SerpAPI for live web search, building a private knowledge base, and assembling a RAG-enabled chatbot workflow, all within 30 minutes, enabling enterprises to create intelligent applications that combine large‑model capabilities with up‑to‑date information.

AI ApplicationAlibaba CloudDeepSeek

0 likes · 7 min read

Deploy a DeepSeek AI App with Web Search & Private Knowledge Base in 30 Minutes

AI Algorithm Path

Feb 13, 2025 · Artificial Intelligence

How to Build a Local RAG Knowledge Base with DeepSeek‑R1 and Ollama

This article walks through setting up a local Retrieval‑Augmented Generation (RAG) system using the open‑source DeepSeek‑R1 model run via Ollama, covering installation, model selection, PDF ingestion with LangChain, semantic chunking, FAISS vector store creation, RetrievalQA chain construction, and a Streamlit UI for querying.

DeepSeek-R1FAISSLangChain

0 likes · 8 min read

How to Build a Local RAG Knowledge Base with DeepSeek‑R1 and Ollama

Java Architecture Diary

Feb 13, 2025 · Artificial Intelligence

Create a Java RAG System Using DeepSeek R1, Milvus, and Spring

This guide walks through building a Java RAG system with DeepSeek R1, Milvus, and Spring, covering environment setup, vector model integration via OpenAI protocol, Maven dependencies, data embedding, and a chat endpoint that combines semantic retrieval with LLM generation.

AI integrationDeepSeekMilvus

0 likes · 11 min read

Create a Java RAG System Using DeepSeek R1, Milvus, and Spring

DevOps

Feb 12, 2025 · Artificial Intelligence

A Comprehensive Guide to Prompt Engineering, RAG, and Optimization Techniques for Large Language Models

This article presents a systematic framework for crafting effective prompts, detailing the universal prompt template, role definition, task decomposition, RAG integration, few‑shot examples, memory handling, and parameter tuning to enhance large language model performance across diverse applications.

Prompt EngineeringPrompt TemplatesRAG

0 likes · 24 min read

A Comprehensive Guide to Prompt Engineering, RAG, and Optimization Techniques for Large Language Models

Architect

Feb 12, 2025 · Artificial Intelligence

Master Prompt Engineering: A Universal Framework for LLMs

This article presents a comprehensive, step‑by‑step Prompt engineering framework—including role definition, problem description, goal setting, and requirement specification—augmented with techniques such as RAG, few‑shot examples, memory handling, and parameter tuning, enabling users to craft effective prompts for large language models across domains.

AI Prompt OptimizationFew-shotLarge Language Models

0 likes · 27 min read

Master Prompt Engineering: A Universal Framework for LLMs

Alibaba Cloud Native

Feb 12, 2025 · Artificial Intelligence

Boost AI Agents with Spring AI Alibaba: 20+ RAG Sources & Tool‑Calling Integrations

This article explains how Spring AI Alibaba enables AI agents to leverage Retrieval‑Augmented Generation and Tool Calling by providing over twenty ready‑made RAG data source connectors and more than twenty function‑calling interfaces, along with practical code examples for integrating document readers and weather services.

Document ReaderFunction CallingJava

0 likes · 12 min read

Boost AI Agents with Spring AI Alibaba: 20+ RAG Sources & Tool‑Calling Integrations

Infra Learning Club

Feb 11, 2025 · Artificial Intelligence

How to Run DeepSeek R1 Locally and Build a RAG System with Ollama and LangChain

This guide walks you through installing Ollama, pulling the open‑source DeepSeek R1 model, and using LangChain and Streamlit to create a locally hosted Retrieval‑Augmented Generation (RAG) system that can answer questions from uploaded PDFs without any cloud API.

DeepSeekLLMLangChain

0 likes · 6 min read

How to Run DeepSeek R1 Locally and Build a RAG System with Ollama and LangChain

DataFunTalk

Feb 11, 2025 · Artificial Intelligence

Roundtable on Enhancing Large Model Effectiveness: RAG, Tool Use, and Knowledge Engineering

Experts from Dipu, Ant Financial, iKang, and Zhihu discuss practical strategies for improving large model performance, covering RAG, tool‑using, offline knowledge engineering, multimodal training, evaluation metrics, and future trends, while sharing case studies from manufacturing, healthcare, retail, and C‑end applications.

Knowledge EngineeringLarge Language ModelsRAG

0 likes · 9 min read

Roundtable on Enhancing Large Model Effectiveness: RAG, Tool Use, and Knowledge Engineering

AI2ML AI to Machine Learning

Feb 10, 2025 · Artificial Intelligence

Eight Ways Enterprises Can Leverage DeepSeek

The article outlines eight distinct enterprise strategies for adopting DeepSeek, categorizing them by model maturity, available data types, and specific business challenges, and maps these approaches onto four capability tiers—from basic compliance requirements to advanced multimodal, low‑cost solutions.

AI agentsDeepSeekEnterprise AI

0 likes · 3 min read

Eight Ways Enterprises Can Leverage DeepSeek

iKang Technology Team

Feb 7, 2025 · Artificial Intelligence

Retrieval‑Augmented Generation (RAG) with LangChain: Concepts and Python Implementation

Retrieval‑Augmented Generation (RAG) using LangChain lets developers enhance large language models by embedding user queries, fetching relevant documents from a vector store, inserting the context into a prompt template, and generating concise, source‑grounded answers, offering low‑cost, up‑to‑date knowledge while reducing hallucinations and fine‑tuning expenses.

LLMLangChainRAG

0 likes · 10 min read

Retrieval‑Augmented Generation (RAG) with LangChain: Concepts and Python Implementation

Cognitive Technology Team

Feb 6, 2025 · Artificial Intelligence

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

This article presents ten detailed techniques for effectively using DeepSeek's large language models—including mode selection, model comparisons, knowledge updates, prompt engineering, RAG, file uploads, API access, and open‑source resources—while offering concrete examples and code snippets for each feature.

AI APIDeepSeekLarge Language Model

0 likes · 12 min read

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

DataFunSummit

Jan 30, 2025 · Databases

Mature Practices for Building Risk‑Control Knowledge Graphs on NebulaGraph and Leveraging Large Language Models

This article explains how NebulaGraph’s large‑scale graph database can be used to construct real‑time risk‑control knowledge graphs, describes practical applications such as community detection and path analysis, and explores how large language models enhance graph queries through Text‑to‑GQL, agents, exploration chains, and semi‑structured knowledge extraction.

AIGraph DatabaseKnowledge Graph

0 likes · 11 min read

Mature Practices for Building Risk‑Control Knowledge Graphs on NebulaGraph and Leveraging Large Language Models

Architect

Jan 27, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation QA Assistant for an Open Platform

This article details a step‑by‑step design of a RAG‑based intelligent Q&A assistant for the DeWu Open Platform, covering background, RAG fundamentals, system architecture, technology selection, prompt engineering with CO‑STAR, data preprocessing, vector store setup, LangChain.js implementation, similarity search, runnable chaining, debugging, and future prospects.

AILLMLangChain

0 likes · 28 min read

How to Build a Retrieval‑Augmented Generation QA Assistant for an Open Platform

Alibaba Cloud Native

Jan 27, 2025 · Frontend Development

How Large Language Models Can Supercharge Frontend Development: Practical Insights

This article explores how large language models can be leveraged to automate and accelerate frontend development tasks, covering prompt engineering, repo‑level code generation, quality factors, hallucination mitigation, knowledge‑base integration, and practical strategies for improving developer productivity.

AIFrontendKnowledge Base

0 likes · 22 min read

How Large Language Models Can Supercharge Frontend Development: Practical Insights

DataFunSummit

Jan 26, 2025 · Artificial Intelligence

ChatBI in Automotive Enterprises: Challenges, Architecture, and Implementation

This article examines the rise of ChatBI in automotive companies, outlining current BI challenges, the five “no” and five “difficulties” issues, the motivations for adopting ChatBI, its evolving architecture, and practical implementation steps to achieve data‑driven decision making.

AIAutomotiveChatBI

0 likes · 17 min read

ChatBI in Automotive Enterprises: Challenges, Architecture, and Implementation

DataFunSummit

Jan 22, 2025 · Artificial Intelligence

RAG2.0 Engine Design Challenges and Implementation

This article presents a comprehensive overview of the RAG2.0 engine design, covering RAG1.0 limitations, effective chunking methods, accurate retrieval techniques, advanced multimodal processing, hybrid search strategies, database indexing choices, and future directions such as agentic RAG and memory‑enhanced models.

ChunkingHybrid SearchMultimodal

0 likes · 23 min read

RAG2.0 Engine Design Challenges and Implementation

Baidu Geek Talk

Jan 20, 2025 · Industry Insights

How Baidu’s Qianfan AppBuilder Is Redefining AI‑Native App Development

The interview explores how Baidu Cloud's Qianfan AppBuilder platform evolves from traditional coding to AI‑native low‑code development, detailing the impact of large‑model agents, Retrieval‑Augmented Generation, security, multimodal support, and future roadmap on enterprise productivity and digital transformation.

AI agentsAI native appsEnterprise AI

0 likes · 18 min read

How Baidu’s Qianfan AppBuilder Is Redefining AI‑Native App Development

macrozheng

Jan 20, 2025 · Artificial Intelligence

How Redis’s New Multithreaded Query Engine Boosts Vector Search for Real‑Time AI Apps

Redis has introduced a multithreaded query engine that dramatically lowers latency and multiplies throughput for vector‑based retrieval, enabling real‑time RAG applications to approach the 100 ms response target while scaling vertically to billions of documents.

AI PerformanceBenchmarkRAG

0 likes · 6 min read

How Redis’s New Multithreaded Query Engine Boosts Vector Search for Real‑Time AI Apps

Zhihu Tech Column

Jan 17, 2025 · Artificial Intelligence

Zhihu Direct Answer: Product Overview and Technical Practices

This article summarizes the key technical insights from Zhihu Direct Answer, an AI-powered search product, covering its product overview, RAG framework, query understanding, retrieval strategies, chunking, reranking, generation techniques, evaluation methods, and engineering optimizations for cost and performance.

AI SearchChunkingEngineering Optimization

0 likes · 13 min read

Zhihu Direct Answer: Product Overview and Technical Practices

Architecture Digest

Jan 16, 2025 · Artificial Intelligence

Redis Introduces Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

Redis has unveiled a multi‑threaded query engine that dramatically increases query throughput and lowers latency for vector similarity searches, offering up to 16× performance gains and enabling real‑time Retrieval‑Augmented Generation (RAG) workloads in generative AI applications.

Database PerformanceGenerative AIRAG

0 likes · 7 min read

Redis Introduces Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

DaTaobao Tech

Jan 15, 2025 · Mobile Development

How AI Transformed Taobao’s Post‑Purchase Info‑Flow Across Android, iOS, and Weex

Facing the challenge of maintaining four codebases for Taobao’s post‑purchase information flow, the team leveraged AI‑driven code generation, prompt engineering, and RAG to automate template conversion from DX to Weex, dramatically cutting development cycles, reducing manual effort, and improving monitoring and stability across Android, iOS, and HarmonyOS.

AICross‑platform developmentMobile Engineering

0 likes · 20 min read

How AI Transformed Taobao’s Post‑Purchase Info‑Flow Across Android, iOS, and Weex