Tagged articles
924 articles
Page 9 of 10
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 10, 2024 · Artificial Intelligence

Unlocking AI Search with Alibaba Cloud Elasticsearch: Vectors, HNSW & RAG

This article details Alibaba Cloud Elasticsearch's AI search advancements, covering embedding vectors, HNSW-based approximate nearest neighbor search, hardware-accelerated vector engines, sparse vectors, hybrid retrieval, the Inference API, and RAG implementations that together boost performance, efficiency, and relevance for modern AI-driven search applications.

ElasticsearchHNSWRAG
0 likes · 11 min read
Unlocking AI Search with Alibaba Cloud Elasticsearch: Vectors, HNSW & RAG
DataFunSummit
DataFunSummit
Sep 6, 2024 · Artificial Intelligence

Knowledge Graph and RAG Applications in 360 Document Cloud: Challenges and Solutions

This article presents a comprehensive overview of 360's document cloud knowledge management and Q&A scenarios, discussing business pain points, large‑model challenges, the advantages of the intelligent document solution, and how knowledge graphs enhance retrieval‑augmented generation and document standardization for AI‑driven enterprise applications.

AIDocument ManagementEnterprise AI
0 likes · 15 min read
Knowledge Graph and RAG Applications in 360 Document Cloud: Challenges and Solutions
DataFunSummit
DataFunSummit
Sep 4, 2024 · Artificial Intelligence

How Elasticsearch Powers Retrieval‑Augmented Generation (RAG) Applications

This article explains how Elasticsearch’s advanced search capabilities—including vector and semantic search, hardware acceleration, hybrid retrieval, model re‑ranking, multi‑vector support, and integrated security—enable robust RAG implementations and outlines future directions such as a new compute engine, stronger vector engines, and cloud‑native serverless deployment.

AIElasticsearchHybrid Search
0 likes · 9 min read
How Elasticsearch Powers Retrieval‑Augmented Generation (RAG) Applications
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Sep 4, 2024 · Artificial Intelligence

Hot Open-Source RAG Tool for Document Chat: GraphRAG, Multimodal QA & Complex Reasoning

This article introduces Kotaemon, an open‑source Retrieval‑Augmented Generation platform that lets users chat with their documents, offering a self‑hosted web UI, support for local and API LLMs, hybrid retrieval, multimodal question answering, GraphRAG indexing, and advanced reasoning capabilities, along with step‑by‑step installation via App or Docker.

GraphRAGLLMMultimodal QA
0 likes · 6 min read
Hot Open-Source RAG Tool for Document Chat: GraphRAG, Multimodal QA & Complex Reasoning
AI Large Model Application Practice
AI Large Model Application Practice
Sep 4, 2024 · Artificial Intelligence

When to Use GraphRAG vs. Traditional RAG and How to Combine Them

This article compares GraphRAG with traditional RAG across seven dimensions—suitable scenarios, knowledge representation, retrieval, comprehensive queries, hidden‑relationship understanding, scalability, and performance‑cost trade‑offs—explains how they can be fused, and offers guidance on selecting the right approach for complex data‑driven applications.

Artificial IntelligenceGraphRAGLLM
0 likes · 13 min read
When to Use GraphRAG vs. Traditional RAG and How to Combine Them
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 2, 2024 · Artificial Intelligence

Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems

This article explains why generic large language models struggle with domain‑specific data, introduces Retrieval‑Augmented Generation (RAG) as a solution, compares Word and PDF formats, outlines document‑parsing pipelines, reviews open‑source PDF tools, and presents Alibaba Cloud's rule‑based parsing architecture with performance results.

AIDocument ParsingLLM
0 likes · 13 min read
Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems
Data Thinking Notes
Data Thinking Notes
Sep 1, 2024 · Artificial Intelligence

Master LLMs: Basics, Prompt Engineering, RAG, Agents & Multimodal AI

This article provides a comprehensive overview of large language models, covering their fundamental concepts, historical milestones, parameter scaling, prompt engineering techniques, retrieval‑augmented generation, autonomous agents, and multimodal model applications, illustrating how these technologies reshape AI capabilities across domains.

AI agentsLLMPrompt Engineering
0 likes · 22 min read
Master LLMs: Basics, Prompt Engineering, RAG, Agents & Multimodal AI
AI Large Model Application Practice
AI Large Model Application Practice
Aug 29, 2024 · Artificial Intelligence

8 Essential Indexing Strategies to Boost Enterprise RAG Performance

This article presents eight practical optimization recommendations for the indexing stage of enterprise‑level Retrieval‑Augmented Generation (RAG) applications, covering chunk creation, abbreviation handling, multimodal document processing, semantic enrichment, metadata usage, alternative index types, and embedding model selection.

ChunkingMultimodalRAG
0 likes · 15 min read
8 Essential Indexing Strategies to Boost Enterprise RAG Performance
DataFunSummit
DataFunSummit
Aug 29, 2024 · Artificial Intelligence

Intelligent NPC Practices in Tencent Games: Multi‑Modal LLM Solutions and System Optimizations

This article details Tencent Game's end‑to‑end approach to building intelligent NPCs, covering the opportunities brought by AI, the practical implementation of multimodal LLM‑driven dialogue, knowledge‑augmented retrieval, long‑context handling, safety measures, multimodal expression (voice and facial animation), and system‑level performance optimizations for real‑time deployment.

AILLMMultimodal
0 likes · 18 min read
Intelligent NPC Practices in Tencent Games: Multi‑Modal LLM Solutions and System Optimizations
Qunar Tech Salon
Qunar Tech Salon
Aug 28, 2024 · Databases

Why Vector Databases Are Needed, PgVector Installation, Usage, and Operational Practices in PostgreSQL

This article explains the necessity of vector databases for AI workloads, reviews the PostgreSQL ecosystem, compares vector database options, provides detailed PgVector installation and usage steps, shares operational best‑practices, performance tuning tips, and real‑world deployment cases at Qunar and Tujia.

AIPerformance tuningPostgreSQL
0 likes · 24 min read
Why Vector Databases Are Needed, PgVector Installation, Usage, and Operational Practices in PostgreSQL
DataFunSummit
DataFunSummit
Aug 25, 2024 · Artificial Intelligence

Applying Large AI Models to Financial Data Governance and Innovative Use Cases

This article presents a comprehensive technical overview of how large AI models are reshaping financial data production, governance, multimodal document understanding, lakehouse storage, private‑domain model deployment, data‑centric engineering methods, and multi‑agent intelligent advisory within the finance sector.

AILarge ModelsMultimodal
0 likes · 21 min read
Applying Large AI Models to Financial Data Governance and Innovative Use Cases
phodal
phodal
Aug 22, 2024 · Artificial Intelligence

What’s New in Shire 0.5? AI Coding Agent Gains SonarQube, Git, and Data‑Guarding Features

Shire 0.5 introduces SonarQube issue support, Git‑enabled ShireQL queries, a reranking function for RAG, and new data‑guarding capabilities like the redact function and customizable secret‑pattern YAML, enabling developers to securely build AI‑powered coding agents that leverage IDE assets while protecting sensitive information.

AI codingData PrivacyGit queries
0 likes · 8 min read
What’s New in Shire 0.5? AI Coding Agent Gains SonarQube, Git, and Data‑Guarding Features
Huolala Tech
Huolala Tech
Aug 22, 2024 · Artificial Intelligence

How Large Language Models Automate Order Cancellation Responsibility at HuoLala

This article explains how HuoLala leverages large language models, multimodal feature integration, and retrieval‑augmented generation to automatically determine responsibility for order cancellations, improving accuracy, explainability, and driver‑user experience.

AILarge Language ModelsMultimodal Retrieval
0 likes · 10 min read
How Large Language Models Automate Order Cancellation Responsibility at HuoLala
Volcano Engine Developer Services
Volcano Engine Developer Services
Aug 20, 2024 · Databases

How Vector Databases Power RAG: Scaling, Algorithms, and Real‑World Trade‑offs

RAG technology leverages vector databases to provide context‑aware answers without updating model parameters, and this article explores how cloud search teams integrate multiple vector algorithms, balance cost, stability and latency, and adopt open‑source solutions like OpenSearch to build scalable, enterprise‑grade retrieval systems.

AIDiskANNOpenSearch
0 likes · 21 min read
How Vector Databases Power RAG: Scaling, Algorithms, and Real‑World Trade‑offs
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 19, 2024 · Artificial Intelligence

Ensuring Stable AI Agents: Engineering Practices, RAG, and Monitoring

This article shares engineering insights from Hema’s AI smart customer service deployment, detailing key stability factors for AI agents—including hallucination mitigation, memory integration, RAG enhancement, exception handling, and comprehensive monitoring—to improve reliability and performance in real‑world e‑commerce chatbot scenarios.

AI AgentLLMMonitoring
0 likes · 13 min read
Ensuring Stable AI Agents: Engineering Practices, RAG, and Monitoring
Selected Java Interview Questions
Selected Java Interview Questions
Aug 18, 2024 · Backend Development

Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

Redis has launched a multi‑threaded query engine that vertically scales its in‑memory database, dramatically increasing query throughput and lowering latency for vector similarity searches, thereby addressing the performance demands of real‑time retrieval‑augmented generation in generative AI applications.

RAGRedisbackend
0 likes · 9 min read
Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI
AI Large Model Application Practice
AI Large Model Application Practice
Aug 16, 2024 · Artificial Intelligence

How to Query a Microsoft GraphRAG Knowledge Graph with Neo4j: Local and Global Modes

This guide explains how to query a Microsoft GraphRAG knowledge graph using the official CLI, API, and a custom Neo4j implementation, covering both local and global retrieval modes, vector index creation, Cypher query customization, and integration with LangChain for end‑to‑end RAG pipelines.

LangChainMicrosoft GraphRAGNeo4j
0 likes · 13 min read
How to Query a Microsoft GraphRAG Knowledge Graph with Neo4j: Local and Global Modes
DaTaobao Tech
DaTaobao Tech
Aug 12, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

Deploying large language models faces domain gaps, hallucinations, and high barriers, so Retrieval‑Augmented Generation (RAG) combines retrieval with generation, and advanced optimizations—such as RAPTOR’s hierarchical clustering, Self‑RAG’s self‑reflective retrieval, CRAG’s corrective evaluator, proposition‑level Dense X Retrieval, sophisticated chunking, query rewriting, and hybrid sparse‑dense methods—are essential for improving accuracy, reducing hallucinations, and achieving efficient, scalable performance.

AILarge Language ModelsOptimization
0 likes · 22 min read
Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)
37 Interactive Technology Team
37 Interactive Technology Team
Aug 12, 2024 · Backend Development

Intelligent Backend Menu Search with OpenAI Embeddings, LangChain, and DIFY

The article demonstrates how to improve backend menu navigation by building a knowledge base of menu metadata, generating concise Chinese descriptions with OpenAI embeddings, and implementing RAG retrieval using both LangChain code orchestration and DIFY’s visual workflow, highlighting each approach’s flexibility and ease of use.

Backend SearchKnowledge BaseLangChain
0 likes · 9 min read
Intelligent Backend Menu Search with OpenAI Embeddings, LangChain, and DIFY
AI Large Model Application Practice
AI Large Model Application Practice
Aug 9, 2024 · Artificial Intelligence

How to Build and Index Microsoft GraphRAG with Neo4j: A Step‑by‑Step Guide

This article explains the fundamentals of Microsoft GraphRAG, details its indexing pipeline—including text chunking, entity‑relationship extraction, community detection, and description generation—shows how to set up the graphrag library, create adaptive prompts, build the index, and import the resulting graph into Neo4j for visualization and analysis.

AIGraphRAGNeo4j
0 likes · 13 min read
How to Build and Index Microsoft GraphRAG with Neo4j: A Step‑by‑Step Guide
58 Tech
58 Tech
Aug 7, 2024 · Artificial Intelligence

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

In this article, 58.com AI Lab senior director Zhan Kunlin explains how the company built a multi‑layer AI platform, created a vertical large‑language model called LingXi, and developed an AI Agent system with RAG capabilities to accelerate practical AI applications across various business scenarios.

AI PlatformAI agentsLarge Language Model
0 likes · 10 min read
Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions
37 Interactive Technology Team
37 Interactive Technology Team
Aug 5, 2024 · Artificial Intelligence

Case Study: Applying AIGC to Component Activity Business with Dify

This case study shows how AIGC, implemented through Dify’s low‑code platform, enables a natural‑language AI assistant to recommend and insert the optimal components from a 200‑plus library, streamlining selection, building an embedding‑based knowledge base, exposing a RAG‑driven agent via API, and demonstrating rapid AI‑business validation compared with custom frameworks.

AI AgentAIGCBusiness Automation
0 likes · 8 min read
Case Study: Applying AIGC to Component Activity Business with Dify
NewBeeNLP
NewBeeNLP
Aug 5, 2024 · Industry Insights

How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs

This article details Alibaba Cloud's end‑to‑end architecture for search and advertising recommendation, covering the data platform, AI services, feature‑store design, training and inference optimizations, and the integration of large language models for new recommendation scenarios.

AI PlatformAlibaba CloudBig Data
0 likes · 17 min read
How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs
Architect
Architect
Aug 2, 2024 · Artificial Intelligence

Building AI‑Native Applications with Spring AI: A Complete Tutorial

This article explains how to quickly develop an AI‑native application using Spring AI, covering core features such as chat models, prompt templates, function calling, structured output, image generation, embedding, vector stores, and Retrieval‑Augmented Generation (RAG), and provides end‑to‑end Java code examples for building a simple AI‑driven service.

AI-nativeFunction CallingJava
0 likes · 40 min read
Building AI‑Native Applications with Spring AI: A Complete Tutorial
DataFunTalk
DataFunTalk
Aug 2, 2024 · Artificial Intelligence

From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation

This presentation details Alibaba Cloud's AI platform, covering the end‑to‑end pipeline from big‑data processing and feature engineering to large‑model training, inference optimization, recommendation system architecture, and RAG applications, highlighting practical engineering solutions and performance gains.

AI PlatformBig DataFeature Store
0 likes · 18 min read
From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation
Open Source Tech Hub
Open Source Tech Hub
Jul 31, 2024 · Artificial Intelligence

Understanding LLMs, AI Agents, and Retrieval-Augmented Generation: Key Concepts and Challenges

This article explains the fundamentals of large language models, artificial general intelligence, AI-generated content, AI agents, retrieval‑augmented generation, knowledge bases, multimodal processing, fine‑tuning, alignment, tokens, vectors, and related tools, highlighting their capabilities, limitations, and practical considerations.

AI AgentArtificial IntelligenceLLM
0 likes · 14 min read
Understanding LLMs, AI Agents, and Retrieval-Augmented Generation: Key Concepts and Challenges
Model Perspective
Model Perspective
Jul 30, 2024 · Artificial Intelligence

Your Complete AI Learning Roadmap: From Basics to Large Model Mastery

This guide presents a comprehensive AI learning roadmap, dividing study into five progressive stages—from foundational math and programming to core deep‑learning and reinforcement‑learning techniques, large‑model training, industry applications, and future trends—plus curated book lists, tool recommendations, and practical RAG tutorials.

AI learning roadmapAI resourcesLarge Models
0 likes · 9 min read
Your Complete AI Learning Roadmap: From Basics to Large Model Mastery
Tencent Cloud Developer
Tencent Cloud Developer
Jul 30, 2024 · Artificial Intelligence

A Systematic Guide to Prompt Engineering: From Zero to One

This guide walks readers from beginner to proficient Prompt Engineer by outlining the evolution of prompting, introducing a universal four‑component template, and detailing a five‑step workflow—including refinement, retrieval‑augmented generation, chain‑of‑thought reasoning, and advanced tuning techniques—plus evaluation metrics for LLM performance.

AI promptingLLM optimizationLarge Language Models
0 likes · 51 min read
A Systematic Guide to Prompt Engineering: From Zero to One
phodal
phodal
Jul 24, 2024 · Artificial Intelligence

How to Build Trustworthy Coding Agents with Shire’s Custom RAG Workflow

This article explains how to use the Shire language to create reliable coding agents by defining custom RAG workflows, leveraging IDE APIs, code verification functions, and vector‑based search, with detailed examples, configuration snippets, and a roadmap for future enhancements.

AICoding AgentIDE
0 likes · 10 min read
How to Build Trustworthy Coding Agents with Shire’s Custom RAG Workflow
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 22, 2024 · Artificial Intelligence

How Alibaba’s Logistics AI Overcame B2B Large Model Challenges

Alibaba’s logistics AI team shares their year‑long journey building a vertical‑domain large language model for logistics, detailing model alignment, Text2API, RAG, SFT techniques, challenges like accuracy and knowledge‑base maintenance, and showcasing real‑world applications such as chatbots, DingTalk assistants, and custom AI assistants.

RAGSFTText2API
0 likes · 16 min read
How Alibaba’s Logistics AI Overcame B2B Large Model Challenges
DevOps
DevOps
Jul 21, 2024 · Artificial Intelligence

LLM Fundamentals, Applications, Prompt Engineering, RAG, and Agentic Workflows

This article provides a comprehensive overview of large language models (LLMs), covering their basic concepts, relationship with NLP, development history, parameter scaling, offline deployment, practical applications, prompt‑engineering frameworks, retrieval‑augmented generation, LangChain integration, agents, workflow orchestration, and future directions toward multimodal AI and AGI.

AI applicationsAgentArtificial Intelligence
0 likes · 36 min read
LLM Fundamentals, Applications, Prompt Engineering, RAG, and Agentic Workflows
DaTaobao Tech
DaTaobao Tech
Jul 19, 2024 · Artificial Intelligence

Practices and Techniques for Vertical Domain Large Language Models

Vertical domain large language models, fine‑tuned on specialized data, deliver higher expertise and task performance, but require continual knowledge updates and careful alignment; techniques such as BPO‑guided instruction tuning (+1.8% accuracy), Reflexion‑based Text2API (+4% API correctness), advanced RAG preprocessing, and SFT combined with ORPO (+5.2% gain) demonstrate notable improvements while underscoring remaining challenges and collaborative opportunities.

AIRAGSFT
0 likes · 9 min read
Practices and Techniques for Vertical Domain Large Language Models
Tencent Cloud Developer
Tencent Cloud Developer
Jul 18, 2024 · Artificial Intelligence

Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions

Exploring Large Language Models, this article surveys their core concepts, evolution through Transformers, GPT and BERT, generation challenges, diverse applications such as QA, multimodal creation, summarization and retrieval‑augmented generation, prompt‑engineering frameworks and tools, LangChain‑based pipelines, AI‑driven agents, and future prospects toward domain‑specific use, multimodality, and AGI.

AIAgentLLM
0 likes · 35 min read
Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions
JD Tech Talk
JD Tech Talk
Jul 16, 2024 · Artificial Intelligence

Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models

TaD, a task‑aware decoding technique jointly developed by JD.com and Tsinghua University and presented at IJCAI 2024, leverages differences between pre‑ and post‑fine‑tuned LLM outputs to construct knowledge vectors, significantly reducing hallucinations across various models, tasks, and data‑scarce scenarios, especially when combined with RAG.

AILLMRAG
0 likes · 18 min read
Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models
Architect
Architect
Jul 13, 2024 · Artificial Intelligence

Practical Guide to Building LLM Products: Prompt Engineering, RAG, Evaluation, and Operations

This article provides a comprehensive, step‑by‑step guide for developing large‑language‑model (LLM) applications, covering prompt design techniques, n‑shot and chain‑of‑thought strategies, retrieval‑augmented generation, structured I/O, workflow optimization, evaluation pipelines, operational best practices, and team organization to create reliable, scalable AI products.

AI OperationsLLMProduct Development
0 likes · 54 min read
Practical Guide to Building LLM Products: Prompt Engineering, RAG, Evaluation, and Operations
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Jul 12, 2024 · Artificial Intelligence

How AI‑Native Transforms User Experience Management in Telecom Networks

This article examines how the AI‑Native approach reshapes the AISWare CEM platform by integrating large language models, Retrieval‑Augmented Generation, and atomic capability decomposition to improve user perception, streamline interactions, and enable intelligent diagnostic assistants for telecom operators.

AI-nativeAtomic CapabilitiesDiagnostic Assistant
0 likes · 12 min read
How AI‑Native Transforms User Experience Management in Telecom Networks
JD Tech
JD Tech
Jul 10, 2024 · Artificial Intelligence

Implementing Retrieval‑Augmented Generation (RAG) with LangChain4j in Java

This article provides a step‑by‑step guide for Java engineers on building a Retrieval‑Augmented Generation (RAG) application using the LangChain4j framework, covering RAG fundamentals, environment setup, Maven integration, document loading, splitting, embedding with OpenAI, vector store management with Chroma, and prompt‑based LLM interaction.

EmbeddingJavaLLM
0 likes · 35 min read
Implementing Retrieval‑Augmented Generation (RAG) with LangChain4j in Java
21CTO
21CTO
Jul 7, 2024 · Artificial Intelligence

How to Build a Secure Local LLM Chatbot with Ollama, Python, and ChromaDB

This tutorial walks you through creating a privacy‑preserving, locally hosted large language model chatbot using Ollama, Python 3, and ChromaDB, covering RAG fundamentals, GPU selection, environment setup, and full source code for a Flask‑based application.

ChromaDBLLMOllama
0 likes · 19 min read
How to Build a Secure Local LLM Chatbot with Ollama, Python, and ChromaDB
AI Large Model Application Practice
AI Large Model Application Practice
Jul 4, 2024 · Artificial Intelligence

Mastering Multimodal RAG: From PDF Parsing to Advanced Query Rewriting

This article explains how to handle complex multimodal PDFs in RAG systems, outlines extraction, indexing, and multimodal model integration, details four query‑rewriting strategies (HyDE, stepwise, sub‑question, backward), and presents key evaluation metrics and tools for assessing RAG performance.

Document ParsingMultimodalQuery Rewriting
0 likes · 12 min read
Mastering Multimodal RAG: From PDF Parsing to Advanced Query Rewriting
AntTech
AntTech
Jul 2, 2024 · Artificial Intelligence

Design and Implementation of a Generalized Retrieval‑Augmented Generation (RAG) Framework with Graph RAG Support

This article surveys Retrieval‑Augmented Generation (RAG), analyzes the limitations of traditional vector‑based RAG, introduces Graph RAG that leverages knowledge graphs for more reliable context, proposes a universal RAG architecture compatible with vector, graph and full‑text indexes, and details its open‑source implementation, code components, testing, and future research directions.

AIEngineeringGraphRAGKnowledgeGraph
0 likes · 26 min read
Design and Implementation of a Generalized Retrieval‑Augmented Generation (RAG) Framework with Graph RAG Support
JD Tech
JD Tech
Jun 28, 2024 · Artificial Intelligence

An Overview of Large Language Models: History, Fundamentals, Prompt Engineering, Retrieval‑Augmented Generation, Agents, and Multimodal AI

This article provides a comprehensive introduction to large language models, covering their historical development, core architecture, training process, prompt engineering techniques, Retrieval‑Augmented Generation, agent frameworks, multimodal capabilities, safety challenges, and future research directions.

AI agentsAI safetyLarge Language Models
0 likes · 22 min read
An Overview of Large Language Models: History, Fundamentals, Prompt Engineering, Retrieval‑Augmented Generation, Agents, and Multimodal AI
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 27, 2024 · Artificial Intelligence

Engineering Data for R&D Large Language Models: From Pre‑training to Prompt Design

This article presents a comprehensive guide to data engineering for research‑focused large language models, covering domain‑adaptive pre‑training, supervised fine‑tuning, retrieval‑augmented generation, dataset construction, data cleaning pipelines, token‑izer adaptation, and prompt engineering best practices to boost model performance in specialized tasks.

Data EngineeringFine‑TuningLLM
0 likes · 20 min read
Engineering Data for R&D Large Language Models: From Pre‑training to Prompt Design
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 27, 2024 · Artificial Intelligence

How to Supercharge Retrieval‑Augmented Generation: Papers, Techniques, and Real‑World Tips

This article surveys the main challenges of deploying large language models, introduces key RAG optimization papers such as RAPTOR, Self‑RAG, and CRAG, and compiles practical engineering tricks—including chunking, query rewriting, hybrid and progressive retrieval—to help practitioners build more accurate and efficient RAG systems.

AI researchLLM optimizationRAG
0 likes · 22 min read
How to Supercharge Retrieval‑Augmented Generation: Papers, Techniques, and Real‑World Tips
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jun 21, 2024 · Databases

How Vector Databases and Large Models are Transforming AI-Driven Database Operations

This article reviews the evolution of databases and large models, explains the role of vector databases and Retrieval‑Augmented Generation (RAG) in AI‑enhanced data management, and showcases Baidu Cloud's VectorDB and DBSC solutions for intelligent database operations and knowledge‑driven services.

AI4DBDatabase operationsLarge Language Models
0 likes · 15 min read
How Vector Databases and Large Models are Transforming AI-Driven Database Operations
DataFunTalk
DataFunTalk
Jun 21, 2024 · Artificial Intelligence

Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment

This article introduces the Alibaba Cloud PAI platform for large language model (LLM) fine‑tuning, covering model‑training pipelines, performance‑cost trade‑offs, retrieval‑augmented generation, fine‑tuning methods such as full‑parameter, LoRA and QLoRA, model selection, data preparation, evaluation, and real‑world deployment examples.

AI PlatformLLMModel Deployment
0 likes · 20 min read
Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment
JD Cloud Developers
JD Cloud Developers
Jun 20, 2024 · Artificial Intelligence

How Large Language Models Boost Courier Efficiency: From Voice Commands to Smart QA

This article explains how large language models like ChatGPT can transform courier operations by automating voice‑driven tasks, enabling intelligent question answering with retrieval‑augmented generation, extracting and splitting document content, embedding it for vector search, and delivering smart prompts and agents to improve productivity and accuracy.

AIEmbeddingLogistics
0 likes · 15 min read
How Large Language Models Boost Courier Efficiency: From Voice Commands to Smart QA
Architecture & Thinking
Architecture & Thinking
Jun 19, 2024 · Artificial Intelligence

Build AI‑Native Apps Quickly with Spring AI: From Chat Models to RAG

This guide explains what an AI‑native application is, compares AI‑native and AI‑based approaches, and walks through Spring AI’s core features—including chat models, prompt templates, function calling, structured output, image generation, embedding, and vector stores—showing step‑by‑step code examples and how to assemble a complete AI‑native app with RAG support.

AI native applicationFunction CallingJava
0 likes · 43 min read
Build AI‑Native Apps Quickly with Spring AI: From Chat Models to RAG
JD Tech
JD Tech
Jun 19, 2024 · Artificial Intelligence

Advances in Large AI Models: Prompt Engineering, RAG, Agents, Fine‑Tuning, Vector Databases and Knowledge Graphs

This article surveys the rapid expansion of large AI models, covering prompt engineering, structured prompts, retrieval‑augmented generation, AI agents, fine‑tuning strategies, vector database technology, knowledge graphs, function calling, and their collective role in moving toward artificial general intelligence.

AIAgentFine‑tuning
0 likes · 23 min read
Advances in Large AI Models: Prompt Engineering, RAG, Agents, Fine‑Tuning, Vector Databases and Knowledge Graphs
AI Large Model Application Practice
AI Large Model Application Practice
Jun 17, 2024 · Artificial Intelligence

Boost Your RAG Pipeline with Cohere and BGE Rerank Models

This guide explains why post‑retrieval reranking is essential for Retrieval‑Augmented Generation, compares the commercial Cohere Rerank service with the open‑source bge‑reranker‑large model, and provides step‑by‑step code for integrating both into LlamaIndex pipelines, including a custom TEI‑based processor.

BGECohereLlamaIndex
0 likes · 11 min read
Boost Your RAG Pipeline with Cohere and BGE Rerank Models
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jun 14, 2024 · Artificial Intelligence

How Alibaba Cloud OpenSearch Powers RAG: Insights from AICon 2024

In this talk, Alibaba Cloud's OpenSearch RAG team shares their year‑long journey of building retrieval‑augmented generation systems, covering data parsing, slicing, vectorization, hybrid retrieval, model fine‑tuning, performance optimizations, cost reduction, and future directions such as multimodal queries and agents.

AI SearchHybrid RetrievalLLM
0 likes · 25 min read
How Alibaba Cloud OpenSearch Powers RAG: Insights from AICon 2024
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 11, 2024 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation: Challenges, Paradigms, and Engineering Best Practices

This article explores Retrieval‑Augmented Generation (RAG) by outlining its background, inherent challenges such as knowledge limits and hallucinations, describing the Naïve, Advanced, and Modular RAG paradigms, and presenting practical engineering strategies for pre‑retrieval, retrieval, and post‑retrieval optimization.

Artificial IntelligenceKnowledge retrievalMachine Learning
0 likes · 25 min read
Mastering Retrieval‑Augmented Generation: Challenges, Paradigms, and Engineering Best Practices
AI Large Model Application Practice
AI Large Model Application Practice
Jun 7, 2024 · Artificial Intelligence

Mastering Advanced Retrieval: Fusion and Recursive Strategies for RAG

This article explores two advanced retrieval paradigms—Fusion Retrieval, which merges results from multiple retrievers using re‑ranking, and Recursive Retrieval, which builds hierarchical chunk‑to‑chunk or chunk‑to‑retriever links—to boost the quality and flexibility of Retrieval‑Augmented Generation pipelines.

Fusion RetrievalLLMLangChain
0 likes · 12 min read
Mastering Advanced Retrieval: Fusion and Recursive Strategies for RAG
Bilibili Tech
Bilibili Tech
Jun 7, 2024 · Artificial Intelligence

AI Development for Frontend Developers: From Basics to Agent Implementation

This article guides frontend developers through AI development, comparing model training, fine‑tuning, prompt engineering, and Retrieval‑Augmented Generation, then explains agent creation via ReAct and tool‑call methods, and showcases Langchain and Flowise as low‑code frameworks for building domain‑specific AI agents.

AI developmentAgentFlowise
0 likes · 13 min read
AI Development for Frontend Developers: From Basics to Agent Implementation
Sohu Tech Products
Sohu Tech Products
Jun 5, 2024 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation

The article outlines LLM issues such as hallucination, outdated knowledge, and data privacy, then explains Retrieval‑Augmented Generation—detailing its data‑preparation and query‑time retrieval workflow, demonstrates a full LangChain implementation, and contrasts RAG with fine‑tuning as complementary strategies for up‑to‑date, grounded responses.

LLMLangChainPrompt Engineering
0 likes · 15 min read
Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation
Tencent Cloud Developer
Tencent Cloud Developer
Jun 5, 2024 · Artificial Intelligence

Introduction to AI Development and Practical Applications

The article surveys AI development from early GPT experiments to real‑world deployments, explaining how tools like LangChain and Retrieval‑Augmented Generation enable sophisticated agents, multi‑prompt workflows, and function calls for chatbots, education, and creative content while addressing accuracy, resource, and ethical challenges.

AI DemosAI developmentAgent Frameworks
0 likes · 34 min read
Introduction to AI Development and Practical Applications
JD Retail Technology
JD Retail Technology
Jun 4, 2024 · Databases

How to Deploy and Query JD’s Open‑Source Vearch Vector Database for LLM Retrieval

This article walks through the practical use of JD’s self‑developed Vearch vector database—covering cluster creation, space setup, data insertion, and both text and vector search—illustrating how it integrates with LangChain and OpenAI embeddings to enable retrieval‑augmented generation for large language models.

EmbeddingLLM RetrievalLangChain
0 likes · 16 min read
How to Deploy and Query JD’s Open‑Source Vearch Vector Database for LLM Retrieval
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 3, 2024 · Artificial Intelligence

Can Adversarial Training Make Retrieval‑Augmented Generators More Robust?

Recent arXiv work introduces ATM, an adversarially‑tuned multi‑agent system that iteratively pits a fake‑knowledge attacker against a generator, dramatically improving retrieval‑augmented language models’ resistance to hallucinated content and boosting performance on knowledge‑intensive benchmarks, even with noisy or irrelevant documents.

Hallucination MitigationLarge Language ModelsRAG
0 likes · 12 min read
Can Adversarial Training Make Retrieval‑Augmented Generators More Robust?
JD Tech
JD Tech
May 31, 2024 · Artificial Intelligence

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

This article explains the fundamentals and engineering practices of large language models (LLM), retrieval‑augmented generation (RAG) and AI agents, compares small and large embedding models, provides Python code for vector‑database RAG with Chroma, and discusses integration, use cases, and future challenges in AI development.

AI EngineeringAI agentsLLM
0 likes · 41 min read
Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications
G7 EasyFlow Tech Circle
G7 EasyFlow Tech Circle
May 29, 2024 · Artificial Intelligence

Engineering Large Model Enterprise Applications: Best Practices

This article outlines the key characteristics of large‑model enterprise applications, compares them with consumer use cases, and presents a comprehensive engineering roadmap—including model selection, knowledge‑base integration, tool implementation, intent recognition, output control, high‑availability deployment, and ongoing optimization—to help practitioners effectively harness AI models in real‑world business environments.

AI EngineeringRAGlarge model
0 likes · 12 min read
Engineering Large Model Enterprise Applications: Best Practices
37 Interactive Technology Team
37 Interactive Technology Team
May 27, 2024 · Artificial Intelligence

Enhancing AI Code Review Quality with Contextual Embedding and Function Calling

The article explains how AI code reviews suffer from missing context, and improves them by embedding the codebase, using Retrieval‑Augmented Generation to fetch relevant snippets, and adding a function‑calling tool that lets the model autonomously request additional code, resulting in precise, bug‑detecting feedback.

AI code reviewEmbeddingFunction Calling
0 likes · 8 min read
Enhancing AI Code Review Quality with Contextual Embedding and Function Calling
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
May 27, 2024 · Databases

Baidu’s Enterprise Vector Database: Architecture, Performance, and RAG Secrets

An exclusive interview with Baidu’s senior database architects reveals the motivations behind building a dedicated enterprise vector database, details its novel column‑store engine, C++‑based retrieval stack, performance gains over open‑source solutions, multi‑modal support, RAG integration, and future research directions.

AILarge Language ModelsRAG
0 likes · 28 min read
Baidu’s Enterprise Vector Database: Architecture, Performance, and RAG Secrets
Eric Tech Circle
Eric Tech Circle
May 22, 2024 · Artificial Intelligence

Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide

This article introduces Dify, an open‑source LLM application platform, outlines its core features such as workflows, model support, RAG pipelines, agents, and observability, compares it with alternatives, and provides step‑by‑step deployment instructions using Docker Compose and Helm for local and Kubernetes environments.

AI PlatformDockerKubernetes
0 likes · 7 min read
Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide
Baidu Tech Salon
Baidu Tech Salon
May 10, 2024 · Artificial Intelligence

Baidu Comate: Core Capabilities of Intelligent Code Assistant

The article surveys Baidu Comate, an AI‑powered code assistant built on the Wenxin (ERNIE) large model, tracing software development from the 1950s crisis through the internet and open‑source era to today’s AI‑driven tools, and highlights its features and demonstration at a global development conference.

AI codingBaidu ComateIDE plugin
0 likes · 7 min read
Baidu Comate: Core Capabilities of Intelligent Code Assistant
DataFunSummit
DataFunSummit
May 10, 2024 · Artificial Intelligence

LLMOps: Definition, Fine‑tuning Techniques, Application Architecture, Challenges and Solutions

This article introduces LLMOps by defining large language model operations, explains the three stages of LLM development, details modern fine‑tuning methods such as PEFT, Adapter, Prefix, Prompt and LoRA, outlines the architecture for building LLM applications, discusses the main difficulties of agent‑based deployments, and presents practical solutions including Prompt IDE, low‑code deployment, monitoring and cost control.

AI OperationsLLMOpsModel Deployment
0 likes · 14 min read
LLMOps: Definition, Fine‑tuning Techniques, Application Architecture, Challenges and Solutions
Java Backend Technology
Java Backend Technology
May 8, 2024 · Artificial Intelligence

Explore the Latest Open‑Source AI Projects: Llama 3, MaxKB, Phidata & RAGFlow

This article highlights four cutting‑edge open‑source AI initiatives—Meta’s Llama 3 large language model, the MaxKB knowledge‑base Q&A system, the Phidata framework for building AI assistants, and the RAGFlow retrieval‑augmented generation engine—detailing their capabilities, licensing, and where to access the code.

AIKnowledge BaseLLM
0 likes · 7 min read
Explore the Latest Open‑Source AI Projects: Llama 3, MaxKB, Phidata & RAGFlow
21CTO
21CTO
May 6, 2024 · Databases

How Oracle’s New 23ai Database Brings AI-Powered Vector Search to Enterprises

Oracle’s latest release, Database 23ai, upgrades its 23c platform with AI-driven vector search, RAG capabilities, and enhanced JSON and graph querying, positioning the database as a unified, secure, and scalable solution for handling structured, semi‑structured, and unstructured data across cloud and on‑premises environments.

AIDatabaseOracle
0 likes · 7 min read
How Oracle’s New 23ai Database Brings AI-Powered Vector Search to Enterprises
AI Large Model Application Practice
AI Large Model Application Practice
May 3, 2024 · Artificial Intelligence

Can Giant Context LLMs Replace RAG? Exploring the Limits of Long‑Context Retrieval

This article examines whether the rapid growth of large‑language‑model context windows can eliminate the need for retrieval‑augmented generation, presenting experimental needle‑in‑a‑haystack tests, analysis of model performance across token lengths and needle positions, and practical guidance using an open‑source evaluation tool.

AILLMNeedle-in-a-Haystack
0 likes · 13 min read
Can Giant Context LLMs Replace RAG? Exploring the Limits of Long‑Context Retrieval
DataFunTalk
DataFunTalk
Apr 29, 2024 · Artificial Intelligence

Practical Experience and Q&A Exploration of Patent Large Models

This article presents a comprehensive overview of the development, training, data preparation, algorithmic strategies, evaluation methods, and RAG integration for a domain‑specific patent large language model, highlighting challenges, practical results, and future research directions.

Domain-specific ModelPatent AIRAG
0 likes · 19 min read
Practical Experience and Q&A Exploration of Patent Large Models
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Apr 29, 2024 · Artificial Intelligence

Building Enterprise‑Grade Retrieval‑Augmented Generation (RAG) Systems: Challenges, Fault Points, and Best Practices

This comprehensive guide explores the complexities of building enterprise‑level Retrieval‑Augmented Generation (RAG) systems, detailing common failure points, architectural components such as authentication, input guards, query rewriting, document ingestion, indexing, storage, retrieval, generation, observability, caching, and multi‑tenant considerations, and provides actionable best‑practice recommendations for developers and technical leaders.

CachingEnterprise AILLM
0 likes · 32 min read
Building Enterprise‑Grade Retrieval‑Augmented Generation (RAG) Systems: Challenges, Fault Points, and Best Practices
DevOps
DevOps
Apr 17, 2024 · Artificial Intelligence

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

The article explores how enterprises can build and improve large‑model applications by combining prompt engineering, retrieval‑augmented generation (RAG), and fine‑tuning, discusses their relationships, optimization dimensions, testing challenges, and provides practical guidance for SE4AI implementation.

AI EngineeringEnterprise AILarge Language Models
0 likes · 20 min read
Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning
21CTO
21CTO
Apr 12, 2024 · Artificial Intelligence

How I Built an AI‑Powered Resume Chatbot with LLMs and RAG

Senior developer Jon Olson shares how he created an AI resume assistant using GPT‑4/3.5, LangChain, LlamaIndex, and retrieval‑augmented generation, detailing prompt engineering, backend integration, and future routing features to help job seekers showcase their skills.

AI chatbotLLMLangChain
0 likes · 8 min read
How I Built an AI‑Powered Resume Chatbot with LLMs and RAG
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Apr 12, 2024 · Artificial Intelligence

Typical Business and Technical Architectures for Large Language Model Applications

This article reviews the common business and technical architectures used in large language model (LLM) applications, explains AI Embedded, AI Copilot, and AI Agent modes—including single‑ and multi‑agent systems—and offers guidance on selecting appropriate technology stacks such as prompt‑only, function‑calling agents, RAG, and fine‑tuning.

AI AgentLLMRAG
0 likes · 9 min read
Typical Business and Technical Architectures for Large Language Model Applications
Eric Tech Circle
Eric Tech Circle
Apr 11, 2024 · Artificial Intelligence

Build a Generative AI RAG App with Spring AI in Minutes

This guide walks you through setting up Spring AI, configuring model providers and vector stores, initializing a Spring Boot project, adding OpenAI credentials, and running a complete RAG (Retrieval‑Augmented Generation) demo with code snippets and sample API calls.

JavaOpenAIRAG
0 likes · 15 min read
Build a Generative AI RAG App with Spring AI in Minutes
HelloTech
HelloTech
Apr 10, 2024 · Artificial Intelligence

An Overview of LangChain: Architecture, Core Components, and Code Examples

LangChain is an open‑source framework that provides Python and JavaScript SDKs, templates, and services such as LangServe and LangSmith to compose models, embeddings, prompts, indexes, memory, chains, and agents via a concise expression language, enabling rapid prototyping, debugging, and deployment of LLM‑driven applications.

AI EngineeringJavaScriptLLM
0 likes · 19 min read
An Overview of LangChain: Architecture, Core Components, and Code Examples
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 10, 2024 · Artificial Intelligence

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

This guide walks AI engineers through a rapid 10‑minute boot‑strap of LangChain, explaining its purpose, core concepts, design questions, environment setup, and step‑by‑step code examples that cover APIs, chains, memory, retrieval‑augmented generation, tools, agents, and the overall architecture.

AI EngineeringLLMLangChain
0 likes · 28 min read
Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Apr 8, 2024 · Artificial Intelligence

PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers

The article introduces PreFLMR, an open‑source, general‑purpose pre‑trained multimodal retriever that leverages fine‑grained late‑interaction to boost retrieval‑augmented generation for knowledge‑intensive visual tasks, describes its M2KR benchmark, training stages, and strong experimental results across multiple tasks.

AIFLMRKnowledge retrieval
0 likes · 11 min read
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Mar 30, 2024 · Artificial Intelligence

Comprehensive Guide to Coze: AI Bot Development, Prompt Engineering, and Workflow Design

This article provides an in‑depth overview of the Coze low‑code AI bot platform, covering its core features, product comparisons, step‑by‑step bot creation, RAG implementation, plugin usage, memory mechanisms, cron jobs, agent design, advanced workflow techniques, quality management, and future prospects.

AI botCozeLLM
0 likes · 25 min read
Comprehensive Guide to Coze: AI Bot Development, Prompt Engineering, and Workflow Design
AI Large Model Application Practice
AI Large Model Application Practice
Mar 29, 2024 · Artificial Intelligence

How RAG Architecture Evolves: From Simple Chains to Flexible RAG Flows

This article examines the evolution of Retrieval‑Augmented Generation (RAG) architectures for large language models, outlines the challenges they face, introduces the modular RAG Flow concept with four workflow paradigms, and provides a step‑by‑step implementation using LangChain and LlamaIndex with code examples.

LLMLangChainRAG
0 likes · 15 min read
How RAG Architecture Evolves: From Simple Chains to Flexible RAG Flows
Sohu Tech Products
Sohu Tech Products
Mar 27, 2024 · Artificial Intelligence

Building a RAG Application with Baidu Vector Database and Qianfan Embedding

This tutorial walks through building a Retrieval‑Augmented Generation application by setting up Baidu’s Vector Database and Qianfan embedding service, configuring credentials, creating a document database and vector table, loading and chunking PDFs, generating embeddings, storing them, and performing scalar, vector and hybrid similarity searches, ready for integration with Wenxin LLM for answer generation.

AI applicationsBaidu QianfanEmbedding
0 likes · 11 min read
Building a RAG Application with Baidu Vector Database and Qianfan Embedding
Sohu Tech Products
Sohu Tech Products
Mar 27, 2024 · Artificial Intelligence

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

NVIDIA’s comprehensive LLM ecosystem combines the full‑stack NeMo Framework for data curation, distributed training, fine‑tuning, inference acceleration with TensorRT‑LLM and Triton, plus Retrieval‑Augmented Generation and Guardrails, enabling efficient, low‑latency, knowledge‑grounded model deployment across clusters.

AI accelerationLarge Language ModelsNVIDIA
0 likes · 16 min read
NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions
Eric Tech Circle
Eric Tech Circle
Mar 24, 2024 · Artificial Intelligence

Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison

This guide compares Ollama and Hugging Face for running large language models locally, detailing API and local execution methods, installation steps, model selection, resource requirements, integration with AnythingLLM, container deployment, embedding and vector store setup, and practical observations on performance and limitations.

AnythingLLMDockerEmbedding
0 likes · 15 min read
Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison