Tagged articles

924 articles

Page 9 of 10

Sep 11, 2024 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with OpenSearch LLM and Dify

Learn step‑by‑step how to integrate OpenSearch LLM’s intelligent Q&A edition with the Dify large‑model platform to create a robust Retrieval‑Augmented Generation (RAG) system, covering architecture, workflow setup, API authentication, result parsing, and practical code examples.

AIDifyLLM

0 likes · 7 min read

How to Build a Retrieval‑Augmented Generation (RAG) System with OpenSearch LLM and Dify

Alibaba Cloud Big Data AI Platform

Sep 10, 2024 · Artificial Intelligence

Unlocking AI Search with Alibaba Cloud Elasticsearch: Vectors, HNSW & RAG

This article details Alibaba Cloud Elasticsearch's AI search advancements, covering embedding vectors, HNSW-based approximate nearest neighbor search, hardware-accelerated vector engines, sparse vectors, hybrid retrieval, the Inference API, and RAG implementations that together boost performance, efficiency, and relevance for modern AI-driven search applications.

ElasticsearchHNSWRAG

0 likes · 11 min read

Unlocking AI Search with Alibaba Cloud Elasticsearch: Vectors, HNSW & RAG

Architect's Alchemy Furnace

Sep 6, 2024 · Artificial Intelligence

Exploring LLM Application Architectures: From AI Embedded to Multi‑Agent Systems

This article examines the typical business and technical architectures for large language model applications, covering AI Embedded, Copilot, and Agent modes, single‑ and multi‑agent systems, core frameworks, and guidance on selecting appropriate technical routes.

AI agentsLLMRAG

0 likes · 11 min read

Exploring LLM Application Architectures: From AI Embedded to Multi‑Agent Systems

DataFunSummit

Sep 6, 2024 · Artificial Intelligence

Knowledge Graph and RAG Applications in 360 Document Cloud: Challenges and Solutions

This article presents a comprehensive overview of 360's document cloud knowledge management and Q&A scenarios, discussing business pain points, large‑model challenges, the advantages of the intelligent document solution, and how knowledge graphs enhance retrieval‑augmented generation and document standardization for AI‑driven enterprise applications.

AIDocument ManagementEnterprise AI

0 likes · 15 min read

Knowledge Graph and RAG Applications in 360 Document Cloud: Challenges and Solutions

Baidu Intelligent Cloud Tech Hub

Sep 5, 2024 · Databases

How Vector Databases Power AI and RAG: Insights from Baidu’s DTCC 2024

This article reviews the 70‑year evolution of databases, explains how vector databases and Retrieval‑Augmented Generation (RAG) are reshaping AI applications, and details Baidu Intelligent Cloud's VectorDB architecture, performance advantages, real‑world use cases, and future trends in data engineering.

AIData EngineeringDatabase Architecture

0 likes · 16 min read

How Vector Databases Power AI and RAG: Insights from Baidu’s DTCC 2024

DataFunSummit

Sep 4, 2024 · Artificial Intelligence

How Elasticsearch Powers Retrieval‑Augmented Generation (RAG) Applications

This article explains how Elasticsearch’s advanced search capabilities—including vector and semantic search, hardware acceleration, hybrid retrieval, model re‑ranking, multi‑vector support, and integrated security—enable robust RAG implementations and outlines future directions such as a new compute engine, stronger vector engines, and cloud‑native serverless deployment.

AIElasticsearchHybrid Search

0 likes · 9 min read

How Elasticsearch Powers Retrieval‑Augmented Generation (RAG) Applications

Full-Stack Cultivation Path

Sep 4, 2024 · Artificial Intelligence

Hot Open-Source RAG Tool for Document Chat: GraphRAG, Multimodal QA & Complex Reasoning

This article introduces Kotaemon, an open‑source Retrieval‑Augmented Generation platform that lets users chat with their documents, offering a self‑hosted web UI, support for local and API LLMs, hybrid retrieval, multimodal question answering, GraphRAG indexing, and advanced reasoning capabilities, along with step‑by‑step installation via App or Docker.

GraphRAGLLMMultimodal QA

0 likes · 6 min read

Hot Open-Source RAG Tool for Document Chat: GraphRAG, Multimodal QA & Complex Reasoning

AI Large Model Application Practice

Sep 4, 2024 · Artificial Intelligence

When to Use GraphRAG vs. Traditional RAG and How to Combine Them

This article compares GraphRAG with traditional RAG across seven dimensions—suitable scenarios, knowledge representation, retrieval, comprehensive queries, hidden‑relationship understanding, scalability, and performance‑cost trade‑offs—explains how they can be fused, and offers guidance on selecting the right approach for complex data‑driven applications.

Artificial IntelligenceGraphRAGLLM

0 likes · 13 min read

When to Use GraphRAG vs. Traditional RAG and How to Combine Them

Alibaba Cloud Big Data AI Platform

Sep 2, 2024 · Artificial Intelligence

Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems

This article explains why generic large language models struggle with domain‑specific data, introduces Retrieval‑Augmented Generation (RAG) as a solution, compares Word and PDF formats, outlines document‑parsing pipelines, reviews open‑source PDF tools, and presents Alibaba Cloud's rule‑based parsing architecture with performance results.

AIDocument ParsingLLM

0 likes · 13 min read

Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems

Data Thinking Notes

Sep 1, 2024 · Artificial Intelligence

Master LLMs: Basics, Prompt Engineering, RAG, Agents & Multimodal AI

This article provides a comprehensive overview of large language models, covering their fundamental concepts, historical milestones, parameter scaling, prompt engineering techniques, retrieval‑augmented generation, autonomous agents, and multimodal model applications, illustrating how these technologies reshape AI capabilities across domains.

AI agentsLLMPrompt Engineering

0 likes · 22 min read

Master LLMs: Basics, Prompt Engineering, RAG, Agents & Multimodal AI

AI Large Model Application Practice

Aug 29, 2024 · Artificial Intelligence

8 Essential Indexing Strategies to Boost Enterprise RAG Performance

This article presents eight practical optimization recommendations for the indexing stage of enterprise‑level Retrieval‑Augmented Generation (RAG) applications, covering chunk creation, abbreviation handling, multimodal document processing, semantic enrichment, metadata usage, alternative index types, and embedding model selection.

ChunkingMultimodalRAG

0 likes · 15 min read

8 Essential Indexing Strategies to Boost Enterprise RAG Performance

DataFunSummit

Aug 29, 2024 · Artificial Intelligence

Intelligent NPC Practices in Tencent Games: Multi‑Modal LLM Solutions and System Optimizations

This article details Tencent Game's end‑to‑end approach to building intelligent NPCs, covering the opportunities brought by AI, the practical implementation of multimodal LLM‑driven dialogue, knowledge‑augmented retrieval, long‑context handling, safety measures, multimodal expression (voice and facial animation), and system‑level performance optimizations for real‑time deployment.

AILLMMultimodal

0 likes · 18 min read

Intelligent NPC Practices in Tencent Games: Multi‑Modal LLM Solutions and System Optimizations

Qunar Tech Salon

Aug 28, 2024 · Databases

Why Vector Databases Are Needed, PgVector Installation, Usage, and Operational Practices in PostgreSQL

This article explains the necessity of vector databases for AI workloads, reviews the PostgreSQL ecosystem, compares vector database options, provides detailed PgVector installation and usage steps, shares operational best‑practices, performance tuning tips, and real‑world deployment cases at Qunar and Tujia.

AIPerformance tuningPostgreSQL

0 likes · 24 min read

Why Vector Databases Are Needed, PgVector Installation, Usage, and Operational Practices in PostgreSQL

Qunhe Technology Quality Tech

Aug 27, 2024 · Artificial Intelligence

Boosting Test Code Quality: How Large Language Models Transform Code Review

This article explores how mature testing teams can leverage large language models for automated code review, outlining the advantages, challenges, and a practical implementation using FastGPT and GitLab CI to build a low‑cost, AI‑enhanced review system that improves efficiency and feedback quality.

AICode ReviewFastGPT

0 likes · 10 min read

Boosting Test Code Quality: How Large Language Models Transform Code Review

DataFunSummit

Aug 25, 2024 · Artificial Intelligence

Applying Large AI Models to Financial Data Governance and Innovative Use Cases

This article presents a comprehensive technical overview of how large AI models are reshaping financial data production, governance, multimodal document understanding, lakehouse storage, private‑domain model deployment, data‑centric engineering methods, and multi‑agent intelligent advisory within the finance sector.

AILarge ModelsMultimodal

0 likes · 21 min read

Applying Large AI Models to Financial Data Governance and Innovative Use Cases

phodal

Aug 22, 2024 · Artificial Intelligence

What’s New in Shire 0.5? AI Coding Agent Gains SonarQube, Git, and Data‑Guarding Features

Shire 0.5 introduces SonarQube issue support, Git‑enabled ShireQL queries, a reranking function for RAG, and new data‑guarding capabilities like the redact function and customizable secret‑pattern YAML, enabling developers to securely build AI‑powered coding agents that leverage IDE assets while protecting sensitive information.

AI codingData PrivacyGit queries

0 likes · 8 min read

What’s New in Shire 0.5? AI Coding Agent Gains SonarQube, Git, and Data‑Guarding Features

Huolala Tech

Aug 22, 2024 · Artificial Intelligence

How Large Language Models Automate Order Cancellation Responsibility at HuoLala

This article explains how HuoLala leverages large language models, multimodal feature integration, and retrieval‑augmented generation to automatically determine responsibility for order cancellations, improving accuracy, explainability, and driver‑user experience.

AILarge Language ModelsMultimodal Retrieval

0 likes · 10 min read

How Large Language Models Automate Order Cancellation Responsibility at HuoLala

Volcano Engine Developer Services

Aug 20, 2024 · Databases

How Vector Databases Power RAG: Scaling, Algorithms, and Real‑World Trade‑offs

RAG technology leverages vector databases to provide context‑aware answers without updating model parameters, and this article explores how cloud search teams integrate multiple vector algorithms, balance cost, stability and latency, and adopt open‑source solutions like OpenSearch to build scalable, enterprise‑grade retrieval systems.

AIDiskANNOpenSearch

0 likes · 21 min read

How Vector Databases Power RAG: Scaling, Algorithms, and Real‑World Trade‑offs

Alibaba Cloud Developer

Aug 19, 2024 · Artificial Intelligence

Ensuring Stable AI Agents: Engineering Practices, RAG, and Monitoring

This article shares engineering insights from Hema’s AI smart customer service deployment, detailing key stability factors for AI agents—including hallucination mitigation, memory integration, RAG enhancement, exception handling, and comprehensive monitoring—to improve reliability and performance in real‑world e‑commerce chatbot scenarios.

AI AgentLLMMonitoring

0 likes · 13 min read

Ensuring Stable AI Agents: Engineering Practices, RAG, and Monitoring

Selected Java Interview Questions

Aug 18, 2024 · Backend Development

Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

Redis has launched a multi‑threaded query engine that vertically scales its in‑memory database, dramatically increasing query throughput and lowering latency for vector similarity searches, thereby addressing the performance demands of real‑time retrieval‑augmented generation in generative AI applications.

RAGRedisbackend

0 likes · 9 min read

Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

Programmer DD

Aug 16, 2024 · Databases

How Redis’s New Multithreaded Query Engine Supercharges Vector Search for AI

Redis has introduced a multithreaded query engine that dramatically boosts throughput and lowers latency for vector searches, enabling scalable, real‑time retrieval‑augmented generation (RAG) workloads while preserving the low‑latency performance of its core in‑memory database.

AIRAGRedis

0 likes · 8 min read

How Redis’s New Multithreaded Query Engine Supercharges Vector Search for AI

AI Large Model Application Practice

Aug 16, 2024 · Artificial Intelligence

How to Query a Microsoft GraphRAG Knowledge Graph with Neo4j: Local and Global Modes

This guide explains how to query a Microsoft GraphRAG knowledge graph using the official CLI, API, and a custom Neo4j implementation, covering both local and global retrieval modes, vector index creation, Cypher query customization, and integration with LangChain for end‑to‑end RAG pipelines.

LangChainMicrosoft GraphRAGNeo4j

0 likes · 13 min read

How to Query a Microsoft GraphRAG Knowledge Graph with Neo4j: Local and Global Modes

Qunhe Technology Quality Tech

Aug 14, 2024 · Artificial Intelligence

Should Your Testing Team Build a Private LLM or Use RAG with a General Model?

This article compares the high costs and technical challenges of building a private large language model with the benefits, flexibility, and lower risk of using Retrieval‑Augmented Generation (RAG) on a general LLM, offering practical guidance for testing teams seeking AI assistance.

AILarge Language ModelsModel Deployment

0 likes · 11 min read

Should Your Testing Team Build a Private LLM or Use RAG with a General Model?

DaTaobao Tech

Aug 12, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

Deploying large language models faces domain gaps, hallucinations, and high barriers, so Retrieval‑Augmented Generation (RAG) combines retrieval with generation, and advanced optimizations—such as RAPTOR’s hierarchical clustering, Self‑RAG’s self‑reflective retrieval, CRAG’s corrective evaluator, proposition‑level Dense X Retrieval, sophisticated chunking, query rewriting, and hybrid sparse‑dense methods—are essential for improving accuracy, reducing hallucinations, and achieving efficient, scalable performance.

AILarge Language ModelsOptimization

0 likes · 22 min read

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

37 Interactive Technology Team

Aug 12, 2024 · Backend Development

Intelligent Backend Menu Search with OpenAI Embeddings, LangChain, and DIFY

The article demonstrates how to improve backend menu navigation by building a knowledge base of menu metadata, generating concise Chinese descriptions with OpenAI embeddings, and implementing RAG retrieval using both LangChain code orchestration and DIFY’s visual workflow, highlighting each approach’s flexibility and ease of use.

Backend SearchKnowledge BaseLangChain

0 likes · 9 min read

Intelligent Backend Menu Search with OpenAI Embeddings, LangChain, and DIFY

AI Large Model Application Practice

Aug 9, 2024 · Artificial Intelligence

How to Build and Index Microsoft GraphRAG with Neo4j: A Step‑by‑Step Guide

This article explains the fundamentals of Microsoft GraphRAG, details its indexing pipeline—including text chunking, entity‑relationship extraction, community detection, and description generation—shows how to set up the graphrag library, create adaptive prompts, build the index, and import the resulting graph into Neo4j for visualization and analysis.

AIGraphRAGNeo4j

0 likes · 13 min read

How to Build and Index Microsoft GraphRAG with Neo4j: A Step‑by‑Step Guide

58 Tech

Aug 7, 2024 · Artificial Intelligence

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

In this article, 58.com AI Lab senior director Zhan Kunlin explains how the company built a multi‑layer AI platform, created a vertical large‑language model called LingXi, and developed an AI Agent system with RAG capabilities to accelerate practical AI applications across various business scenarios.

AI PlatformAI agentsLarge Language Model

0 likes · 10 min read

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

37 Interactive Technology Team

Aug 5, 2024 · Artificial Intelligence

Case Study: Applying AIGC to Component Activity Business with Dify

This case study shows how AIGC, implemented through Dify’s low‑code platform, enables a natural‑language AI assistant to recommend and insert the optimal components from a 200‑plus library, streamlining selection, building an embedding‑based knowledge base, exposing a RAG‑driven agent via API, and demonstrating rapid AI‑business validation compared with custom frameworks.

AI AgentAIGCBusiness Automation

0 likes · 8 min read

Case Study: Applying AIGC to Component Activity Business with Dify

NewBeeNLP

Aug 5, 2024 · Industry Insights

How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs

This article details Alibaba Cloud's end‑to‑end architecture for search and advertising recommendation, covering the data platform, AI services, feature‑store design, training and inference optimizations, and the integration of large language models for new recommendation scenarios.

AI PlatformAlibaba CloudBig Data

0 likes · 17 min read

How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs

Architect

Aug 2, 2024 · Artificial Intelligence

Building AI‑Native Applications with Spring AI: A Complete Tutorial

This article explains how to quickly develop an AI‑native application using Spring AI, covering core features such as chat models, prompt templates, function calling, structured output, image generation, embedding, vector stores, and Retrieval‑Augmented Generation (RAG), and provides end‑to‑end Java code examples for building a simple AI‑driven service.

AI-nativeFunction CallingJava

0 likes · 40 min read

Building AI‑Native Applications with Spring AI: A Complete Tutorial

DataFunTalk

Aug 2, 2024 · Artificial Intelligence

From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation

This presentation details Alibaba Cloud's AI platform, covering the end‑to‑end pipeline from big‑data processing and feature engineering to large‑model training, inference optimization, recommendation system architecture, and RAG applications, highlighting practical engineering solutions and performance gains.

AI PlatformBig DataFeature Store

0 likes · 18 min read

From Big Data to Large Models: Alibaba Cloud AI Platform Architecture and Practices for Search Recommendation

Data Thinking Notes

Aug 1, 2024 · Artificial Intelligence

Unlocking Vertical Domain LLMs: Advantages, Challenges, and Alignment Strategies

Over the past year our team explored applying large language models to specialized domains, detailing their professional benefits, unique challenges such as accuracy and knowledge‑base maintenance, and presenting solutions like alignment enhancement via BPO, Text2API, RAG, and advanced SFT/DPO techniques.

Large Language ModelsRAGSFT

0 likes · 10 min read

Unlocking Vertical Domain LLMs: Advantages, Challenges, and Alignment Strategies

Open Source Tech Hub

Jul 31, 2024 · Artificial Intelligence

Understanding LLMs, AI Agents, and Retrieval-Augmented Generation: Key Concepts and Challenges

This article explains the fundamentals of large language models, artificial general intelligence, AI-generated content, AI agents, retrieval‑augmented generation, knowledge bases, multimodal processing, fine‑tuning, alignment, tokens, vectors, and related tools, highlighting their capabilities, limitations, and practical considerations.

AI AgentArtificial IntelligenceLLM

0 likes · 14 min read

Understanding LLMs, AI Agents, and Retrieval-Augmented Generation: Key Concepts and Challenges

21CTO

Jul 30, 2024 · Artificial Intelligence

What Does Galileo’s New Hallucination Index Reveal About Today’s Top Generative AI Models?

Galileo’s Hallucination Index evaluates 22 leading generative AI models using a contextual‑adherence metric, ranking Claude 3.5 Sonnet as the overall RAG leader, Gemini 1.5 Flash as the most cost‑effective, and highlighting open‑source and context‑length performance nuances for AI practitioners.

AIRAGgenerative AI

0 likes · 5 min read

What Does Galileo’s New Hallucination Index Reveal About Today’s Top Generative AI Models?

Model Perspective

Jul 30, 2024 · Artificial Intelligence

Your Complete AI Learning Roadmap: From Basics to Large Model Mastery

This guide presents a comprehensive AI learning roadmap, dividing study into five progressive stages—from foundational math and programming to core deep‑learning and reinforcement‑learning techniques, large‑model training, industry applications, and future trends—plus curated book lists, tool recommendations, and practical RAG tutorials.

AI learning roadmapAI resourcesLarge Models

0 likes · 9 min read

Your Complete AI Learning Roadmap: From Basics to Large Model Mastery

Tencent Cloud Developer

Jul 30, 2024 · Artificial Intelligence

A Systematic Guide to Prompt Engineering: From Zero to One

This guide walks readers from beginner to proficient Prompt Engineer by outlining the evolution of prompting, introducing a universal four‑component template, and detailing a five‑step workflow—including refinement, retrieval‑augmented generation, chain‑of‑thought reasoning, and advanced tuning techniques—plus evaluation metrics for LLM performance.

AI promptingLLM optimizationLarge Language Models

0 likes · 51 min read

A Systematic Guide to Prompt Engineering: From Zero to One

phodal

Jul 24, 2024 · Artificial Intelligence

How to Build Trustworthy Coding Agents with Shire’s Custom RAG Workflow

This article explains how to use the Shire language to create reliable coding agents by defining custom RAG workflows, leveraging IDE APIs, code verification functions, and vector‑based search, with detailed examples, configuration snippets, and a roadmap for future enhancements.

AICoding AgentIDE

0 likes · 10 min read

How to Build Trustworthy Coding Agents with Shire’s Custom RAG Workflow

Alibaba Cloud Developer

Jul 22, 2024 · Artificial Intelligence

How Alibaba’s Logistics AI Overcame B2B Large Model Challenges

Alibaba’s logistics AI team shares their year‑long journey building a vertical‑domain large language model for logistics, detailing model alignment, Text2API, RAG, SFT techniques, challenges like accuracy and knowledge‑base maintenance, and showcasing real‑world applications such as chatbots, DingTalk assistants, and custom AI assistants.

RAGSFTText2API

0 likes · 16 min read

How Alibaba’s Logistics AI Overcame B2B Large Model Challenges

DevOps

Jul 21, 2024 · Artificial Intelligence

LLM Fundamentals, Applications, Prompt Engineering, RAG, and Agentic Workflows

This article provides a comprehensive overview of large language models (LLMs), covering their basic concepts, relationship with NLP, development history, parameter scaling, offline deployment, practical applications, prompt‑engineering frameworks, retrieval‑augmented generation, LangChain integration, agents, workflow orchestration, and future directions toward multimodal AI and AGI.

AI applicationsAgentArtificial Intelligence

0 likes · 36 min read

LLM Fundamentals, Applications, Prompt Engineering, RAG, and Agentic Workflows

DaTaobao Tech

Jul 19, 2024 · Artificial Intelligence

Practices and Techniques for Vertical Domain Large Language Models

Vertical domain large language models, fine‑tuned on specialized data, deliver higher expertise and task performance, but require continual knowledge updates and careful alignment; techniques such as BPO‑guided instruction tuning (+1.8% accuracy), Reflexion‑based Text2API (+4% API correctness), advanced RAG preprocessing, and SFT combined with ORPO (+5.2% gain) demonstrate notable improvements while underscoring remaining challenges and collaborative opportunities.

AIRAGSFT

0 likes · 9 min read

Practices and Techniques for Vertical Domain Large Language Models

Tencent Cloud Developer

Jul 18, 2024 · Artificial Intelligence

Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions

Exploring Large Language Models, this article surveys their core concepts, evolution through Transformers, GPT and BERT, generation challenges, diverse applications such as QA, multimodal creation, summarization and retrieval‑augmented generation, prompt‑engineering frameworks and tools, LangChain‑based pipelines, AI‑driven agents, and future prospects toward domain‑specific use, multimodality, and AGI.

AIAgentLLM

0 likes · 35 min read

Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions

JD Tech Talk

Jul 16, 2024 · Artificial Intelligence

Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models

TaD, a task‑aware decoding technique jointly developed by JD.com and Tsinghua University and presented at IJCAI 2024, leverages differences between pre‑ and post‑fine‑tuned LLM outputs to construct knowledge vectors, significantly reducing hallucinations across various models, tasks, and data‑scarce scenarios, especially when combined with RAG.

AILLMRAG

0 likes · 18 min read

Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models

Architect

Jul 13, 2024 · Artificial Intelligence

Practical Guide to Building LLM Products: Prompt Engineering, RAG, Evaluation, and Operations

This article provides a comprehensive, step‑by‑step guide for developing large‑language‑model (LLM) applications, covering prompt design techniques, n‑shot and chain‑of‑thought strategies, retrieval‑augmented generation, structured I/O, workflow optimization, evaluation pipelines, operational best practices, and team organization to create reliable, scalable AI products.

AI OperationsLLMProduct Development

0 likes · 54 min read

Practical Guide to Building LLM Products: Prompt Engineering, RAG, Evaluation, and Operations

AsiaInfo Technology: New Tech Exploration

Jul 12, 2024 · Artificial Intelligence

How AI‑Native Transforms User Experience Management in Telecom Networks

This article examines how the AI‑Native approach reshapes the AISWare CEM platform by integrating large language models, Retrieval‑Augmented Generation, and atomic capability decomposition to improve user perception, streamline interactions, and enable intelligent diagnostic assistants for telecom operators.

AI-nativeAtomic CapabilitiesDiagnostic Assistant

0 likes · 12 min read

How AI‑Native Transforms User Experience Management in Telecom Networks

JD Tech

Jul 10, 2024 · Artificial Intelligence

Implementing Retrieval‑Augmented Generation (RAG) with LangChain4j in Java

This article provides a step‑by‑step guide for Java engineers on building a Retrieval‑Augmented Generation (RAG) application using the LangChain4j framework, covering RAG fundamentals, environment setup, Maven integration, document loading, splitting, embedding with OpenAI, vector store management with Chroma, and prompt‑based LLM interaction.

EmbeddingJavaLLM

0 likes · 35 min read

Implementing Retrieval‑Augmented Generation (RAG) with LangChain4j in Java

21CTO

Jul 7, 2024 · Artificial Intelligence

How to Build a Secure Local LLM Chatbot with Ollama, Python, and ChromaDB

This tutorial walks you through creating a privacy‑preserving, locally hosted large language model chatbot using Ollama, Python 3, and ChromaDB, covering RAG fundamentals, GPU selection, environment setup, and full source code for a Flask‑based application.

ChromaDBLLMOllama

0 likes · 19 min read

How to Build a Secure Local LLM Chatbot with Ollama, Python, and ChromaDB

AI Large Model Application Practice

Jul 4, 2024 · Artificial Intelligence

Mastering Multimodal RAG: From PDF Parsing to Advanced Query Rewriting

This article explains how to handle complex multimodal PDFs in RAG systems, outlines extraction, indexing, and multimodal model integration, details four query‑rewriting strategies (HyDE, stepwise, sub‑question, backward), and presents key evaluation metrics and tools for assessing RAG performance.

Document ParsingMultimodalQuery Rewriting

0 likes · 12 min read

Mastering Multimodal RAG: From PDF Parsing to Advanced Query Rewriting

Baidu Geek Talk

Jul 3, 2024 · Databases

How Vector Databases Power AI‑Driven Retrieval: Inside Baidu’s VectorDB

This article reviews the evolution of databases and large models, explains vector database fundamentals and RAG pipelines, and details Baidu's VectorDB architecture, performance advantages, and its role in AI‑enhanced database operations.

AI integrationDatabase operationsLarge Language Model

0 likes · 15 min read

How Vector Databases Power AI‑Driven Retrieval: Inside Baidu’s VectorDB

AntTech

Jul 2, 2024 · Artificial Intelligence

Design and Implementation of a Generalized Retrieval‑Augmented Generation (RAG) Framework with Graph RAG Support

This article surveys Retrieval‑Augmented Generation (RAG), analyzes the limitations of traditional vector‑based RAG, introduces Graph RAG that leverages knowledge graphs for more reliable context, proposes a universal RAG architecture compatible with vector, graph and full‑text indexes, and details its open‑source implementation, code components, testing, and future research directions.

AIEngineeringGraphRAGKnowledgeGraph

0 likes · 26 min read

Design and Implementation of a Generalized Retrieval‑Augmented Generation (RAG) Framework with Graph RAG Support

JD Tech

Jun 28, 2024 · Artificial Intelligence

An Overview of Large Language Models: History, Fundamentals, Prompt Engineering, Retrieval‑Augmented Generation, Agents, and Multimodal AI

This article provides a comprehensive introduction to large language models, covering their historical development, core architecture, training process, prompt engineering techniques, Retrieval‑Augmented Generation, agent frameworks, multimodal capabilities, safety challenges, and future research directions.

AI agentsAI safetyLarge Language Models

0 likes · 22 min read

An Overview of Large Language Models: History, Fundamentals, Prompt Engineering, Retrieval‑Augmented Generation, Agents, and Multimodal AI

Baobao Algorithm Notes

Jun 27, 2024 · Artificial Intelligence

Engineering Data for R&D Large Language Models: From Pre‑training to Prompt Design

This article presents a comprehensive guide to data engineering for research‑focused large language models, covering domain‑adaptive pre‑training, supervised fine‑tuning, retrieval‑augmented generation, dataset construction, data cleaning pipelines, token‑izer adaptation, and prompt engineering best practices to boost model performance in specialized tasks.

Data EngineeringFine‑TuningLLM

0 likes · 20 min read

Engineering Data for R&D Large Language Models: From Pre‑training to Prompt Design

Alibaba Cloud Developer

Jun 27, 2024 · Artificial Intelligence

How to Supercharge Retrieval‑Augmented Generation: Papers, Techniques, and Real‑World Tips

This article surveys the main challenges of deploying large language models, introduces key RAG optimization papers such as RAPTOR, Self‑RAG, and CRAG, and compiles practical engineering tricks—including chunking, query rewriting, hybrid and progressive retrieval—to help practitioners build more accurate and efficient RAG systems.

AI researchLLM optimizationRAG

0 likes · 22 min read

How to Supercharge Retrieval‑Augmented Generation: Papers, Techniques, and Real‑World Tips

Baidu Intelligent Cloud Tech Hub

Jun 21, 2024 · Databases

How Vector Databases and Large Models are Transforming AI-Driven Database Operations

This article reviews the evolution of databases and large models, explains the role of vector databases and Retrieval‑Augmented Generation (RAG) in AI‑enhanced data management, and showcases Baidu Cloud's VectorDB and DBSC solutions for intelligent database operations and knowledge‑driven services.

AI4DBDatabase operationsLarge Language Models

0 likes · 15 min read

How Vector Databases and Large Models are Transforming AI-Driven Database Operations

DataFunTalk

Jun 21, 2024 · Artificial Intelligence

Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment

This article introduces the Alibaba Cloud PAI platform for large language model (LLM) fine‑tuning, covering model‑training pipelines, performance‑cost trade‑offs, retrieval‑augmented generation, fine‑tuning methods such as full‑parameter, LoRA and QLoRA, model selection, data preparation, evaluation, and real‑world deployment examples.

AI PlatformLLMModel Deployment

0 likes · 20 min read

Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment

JD Cloud Developers

Jun 20, 2024 · Artificial Intelligence

How Large Language Models Boost Courier Efficiency: From Voice Commands to Smart QA

This article explains how large language models like ChatGPT can transform courier operations by automating voice‑driven tasks, enabling intelligent question answering with retrieval‑augmented generation, extracting and splitting document content, embedding it for vector search, and delivering smart prompts and agents to improve productivity and accuracy.

AIEmbeddingLogistics

0 likes · 15 min read

How Large Language Models Boost Courier Efficiency: From Voice Commands to Smart QA

Architecture & Thinking

Jun 19, 2024 · Artificial Intelligence

Build AI‑Native Apps Quickly with Spring AI: From Chat Models to RAG

This guide explains what an AI‑native application is, compares AI‑native and AI‑based approaches, and walks through Spring AI’s core features—including chat models, prompt templates, function calling, structured output, image generation, embedding, and vector stores—showing step‑by‑step code examples and how to assemble a complete AI‑native app with RAG support.

AI native applicationFunction CallingJava

0 likes · 43 min read

Build AI‑Native Apps Quickly with Spring AI: From Chat Models to RAG

JD Tech

Jun 19, 2024 · Artificial Intelligence

Advances in Large AI Models: Prompt Engineering, RAG, Agents, Fine‑Tuning, Vector Databases and Knowledge Graphs

This article surveys the rapid expansion of large AI models, covering prompt engineering, structured prompts, retrieval‑augmented generation, AI agents, fine‑tuning strategies, vector database technology, knowledge graphs, function calling, and their collective role in moving toward artificial general intelligence.

AIAgentFine‑tuning

0 likes · 23 min read

Advances in Large AI Models: Prompt Engineering, RAG, Agents, Fine‑Tuning, Vector Databases and Knowledge Graphs

AI Large Model Application Practice

Jun 17, 2024 · Artificial Intelligence

Boost Your RAG Pipeline with Cohere and BGE Rerank Models

This guide explains why post‑retrieval reranking is essential for Retrieval‑Augmented Generation, compares the commercial Cohere Rerank service with the open‑source bge‑reranker‑large model, and provides step‑by‑step code for integrating both into LlamaIndex pipelines, including a custom TEI‑based processor.

BGECohereLlamaIndex

0 likes · 11 min read

Boost Your RAG Pipeline with Cohere and BGE Rerank Models

JD Tech Talk

Jun 14, 2024 · Artificial Intelligence

Building a Retrieval‑Augmented Generation (RAG) System with JD Cloud Docs, ClickHouse, LangChain, and FastAPI

This guide explains how to build a Retrieval‑Augmented Generation (RAG) system using JD Cloud documentation as a knowledge base, storing document embeddings in ClickHouse, leveraging LangChain for vector retrieval, and exposing query and answer services via FastAPI and a Gradio UI.

AIClickHouseFastAPI

0 likes · 13 min read

Building a Retrieval‑Augmented Generation (RAG) System with JD Cloud Docs, ClickHouse, LangChain, and FastAPI

JD Cloud Developers

Jun 14, 2024 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) System Using JD Cloud Docs and ClickHouse

This guide walks through creating a Retrieval‑Augmented Generation pipeline that harvests JD Cloud documentation, stores vector embeddings in ClickHouse, and serves queries via FastAPI, LangChain, a Qwen LLM, and a Gradio front‑end.

ClickHouseFastAPILLM

0 likes · 14 min read

Build a Retrieval‑Augmented Generation (RAG) System Using JD Cloud Docs and ClickHouse

Alibaba Cloud Big Data AI Platform

Jun 14, 2024 · Artificial Intelligence

How Alibaba Cloud OpenSearch Powers RAG: Insights from AICon 2024

In this talk, Alibaba Cloud's OpenSearch RAG team shares their year‑long journey of building retrieval‑augmented generation systems, covering data parsing, slicing, vectorization, hybrid retrieval, model fine‑tuning, performance optimizations, cost reduction, and future directions such as multimodal queries and agents.

AI SearchHybrid RetrievalLLM

0 likes · 25 min read

How Alibaba Cloud OpenSearch Powers RAG: Insights from AICon 2024

Alibaba Cloud Developer

Jun 11, 2024 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation: Challenges, Paradigms, and Engineering Best Practices

This article explores Retrieval‑Augmented Generation (RAG) by outlining its background, inherent challenges such as knowledge limits and hallucinations, describing the Naïve, Advanced, and Modular RAG paradigms, and presenting practical engineering strategies for pre‑retrieval, retrieval, and post‑retrieval optimization.

Artificial IntelligenceKnowledge retrievalMachine Learning

0 likes · 25 min read

Mastering Retrieval‑Augmented Generation: Challenges, Paradigms, and Engineering Best Practices

AI Large Model Application Practice

Jun 7, 2024 · Artificial Intelligence

Mastering Advanced Retrieval: Fusion and Recursive Strategies for RAG

This article explores two advanced retrieval paradigms—Fusion Retrieval, which merges results from multiple retrievers using re‑ranking, and Recursive Retrieval, which builds hierarchical chunk‑to‑chunk or chunk‑to‑retriever links—to boost the quality and flexibility of Retrieval‑Augmented Generation pipelines.

Fusion RetrievalLLMLangChain

0 likes · 12 min read

Mastering Advanced Retrieval: Fusion and Recursive Strategies for RAG

Bilibili Tech

Jun 7, 2024 · Artificial Intelligence

AI Development for Frontend Developers: From Basics to Agent Implementation

This article guides frontend developers through AI development, comparing model training, fine‑tuning, prompt engineering, and Retrieval‑Augmented Generation, then explains agent creation via ReAct and tool‑call methods, and showcases Langchain and Flowise as low‑code frameworks for building domain‑specific AI agents.

AI developmentAgentFlowise

0 likes · 13 min read

AI Development for Frontend Developers: From Basics to Agent Implementation

Sohu Tech Products

Jun 5, 2024 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation

The article outlines LLM issues such as hallucination, outdated knowledge, and data privacy, then explains Retrieval‑Augmented Generation—detailing its data‑preparation and query‑time retrieval workflow, demonstrates a full LangChain implementation, and contrasts RAG with fine‑tuning as complementary strategies for up‑to‑date, grounded responses.

LLMLangChainPrompt Engineering

0 likes · 15 min read

Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation

Tencent Cloud Developer

Jun 5, 2024 · Artificial Intelligence

Introduction to AI Development and Practical Applications

The article surveys AI development from early GPT experiments to real‑world deployments, explaining how tools like LangChain and Retrieval‑Augmented Generation enable sophisticated agents, multi‑prompt workflows, and function calls for chatbots, education, and creative content while addressing accuracy, resource, and ethical challenges.

AI DemosAI developmentAgent Frameworks

0 likes · 34 min read

Introduction to AI Development and Practical Applications

JD Retail Technology

Jun 4, 2024 · Databases

How to Deploy and Query JD’s Open‑Source Vearch Vector Database for LLM Retrieval

This article walks through the practical use of JD’s self‑developed Vearch vector database—covering cluster creation, space setup, data insertion, and both text and vector search—illustrating how it integrates with LangChain and OpenAI embeddings to enable retrieval‑augmented generation for large language models.

EmbeddingLLM RetrievalLangChain

0 likes · 16 min read

How to Deploy and Query JD’s Open‑Source Vearch Vector Database for LLM Retrieval

Baobao Algorithm Notes

Jun 3, 2024 · Artificial Intelligence

Can Adversarial Training Make Retrieval‑Augmented Generators More Robust?

Recent arXiv work introduces ATM, an adversarially‑tuned multi‑agent system that iteratively pits a fake‑knowledge attacker against a generator, dramatically improving retrieval‑augmented language models’ resistance to hallucinated content and boosting performance on knowledge‑intensive benchmarks, even with noisy or irrelevant documents.

Hallucination MitigationLarge Language ModelsRAG

0 likes · 12 min read

Can Adversarial Training Make Retrieval‑Augmented Generators More Robust?

JD Tech

May 31, 2024 · Artificial Intelligence

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

This article explains the fundamentals and engineering practices of large language models (LLM), retrieval‑augmented generation (RAG) and AI agents, compares small and large embedding models, provides Python code for vector‑database RAG with Chroma, and discusses integration, use cases, and future challenges in AI development.

AI EngineeringAI agentsLLM

0 likes · 41 min read

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

G7 EasyFlow Tech Circle

May 29, 2024 · Artificial Intelligence

Engineering Large Model Enterprise Applications: Best Practices

This article outlines the key characteristics of large‑model enterprise applications, compares them with consumer use cases, and presents a comprehensive engineering roadmap—including model selection, knowledge‑base integration, tool implementation, intent recognition, output control, high‑availability deployment, and ongoing optimization—to help practitioners effectively harness AI models in real‑world business environments.

AI EngineeringRAGlarge model

0 likes · 12 min read

Engineering Large Model Enterprise Applications: Best Practices

37 Interactive Technology Team

May 27, 2024 · Artificial Intelligence

Enhancing AI Code Review Quality with Contextual Embedding and Function Calling

The article explains how AI code reviews suffer from missing context, and improves them by embedding the codebase, using Retrieval‑Augmented Generation to fetch relevant snippets, and adding a function‑calling tool that lets the model autonomously request additional code, resulting in precise, bug‑detecting feedback.

AI code reviewEmbeddingFunction Calling

0 likes · 8 min read

Enhancing AI Code Review Quality with Contextual Embedding and Function Calling

Huawei Cloud Developer Alliance

May 27, 2024 · Artificial Intelligence

Unlocking AI Large Model Potential: Key Takeaways from Shanghai HCDG AI Model Salon

The Shanghai HCDG AI Model Salon showcased cutting‑edge large‑model technologies, including ModelArts‑based LLM deployment, RAG advancements, and observability best practices, highlighting how Huawei Cloud and partners are accelerating AI adoption across industries.

AIHuawei CloudLangChain

0 likes · 5 min read

Unlocking AI Large Model Potential: Key Takeaways from Shanghai HCDG AI Model Salon

Baidu Intelligent Cloud Tech Hub

May 27, 2024 · Databases

Baidu’s Enterprise Vector Database: Architecture, Performance, and RAG Secrets

An exclusive interview with Baidu’s senior database architects reveals the motivations behind building a dedicated enterprise vector database, details its novel column‑store engine, C++‑based retrieval stack, performance gains over open‑source solutions, multi‑modal support, RAG integration, and future research directions.

AILarge Language ModelsRAG

0 likes · 28 min read

Baidu’s Enterprise Vector Database: Architecture, Performance, and RAG Secrets

AI Large Model Application Practice

May 27, 2024 · Artificial Intelligence

Building Agentic RAG with LlamaIndex: From Tool Agents to a Top Agent

This article walks through the design and implementation of an Agentic Retrieval‑Augmented Generation system using LlamaIndex, showing how to wrap multiple RAG engines as tools, orchestrate them with hierarchical AI agents, and scale the solution with tool retrieval for large document collections.

AI AgentLlamaIndexPython

0 likes · 14 min read

Building Agentic RAG with LlamaIndex: From Tool Agents to a Top Agent

Alibaba Cloud Native

May 25, 2024 · Cloud Native

Build a Retrieval‑Augmented Generation (RAG) App with Spring Cloud Alibaba AI and Redis

This guide explains how to implement a Retrieval‑Augmented Generation (RAG) workflow by loading a beer‑info JSON dataset into a Redis vector store, wiring it with Spring Cloud Alibaba AI Starter, and exposing a web API that returns LLM‑generated answers.

AIJavaRAG

0 likes · 5 min read

Build a Retrieval‑Augmented Generation (RAG) App with Spring Cloud Alibaba AI and Redis

Eric Tech Circle

May 22, 2024 · Artificial Intelligence

Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide

This article introduces Dify, an open‑source LLM application platform, outlines its core features such as workflows, model support, RAG pipelines, agents, and observability, compares it with alternatives, and provides step‑by‑step deployment instructions using Docker Compose and Helm for local and Kubernetes environments.

AI PlatformDockerKubernetes

0 likes · 7 min read

Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide

Alibaba Cloud Big Data AI Platform

May 11, 2024 · Artificial Intelligence

How to Build a High‑Performance RAG System with Milvus on Alibaba Cloud PAI

This guide explains how to integrate Milvus vector search with Alibaba Cloud PAI to create a Retrieval‑Augmented Generation (RAG) solution, covering background, prerequisites, deployment steps, configuration parameters, and practical usage through the Web UI.

AIAlibaba CloudLangChain

0 likes · 7 min read

How to Build a High‑Performance RAG System with Milvus on Alibaba Cloud PAI

Baidu Tech Salon

May 10, 2024 · Artificial Intelligence

Baidu Comate: Core Capabilities of Intelligent Code Assistant

The article surveys Baidu Comate, an AI‑powered code assistant built on the Wenxin (ERNIE) large model, tracing software development from the 1950s crisis through the internet and open‑source era to today’s AI‑driven tools, and highlights its features and demonstration at a global development conference.

AI codingBaidu ComateIDE plugin

0 likes · 7 min read

Baidu Comate: Core Capabilities of Intelligent Code Assistant

DataFunSummit

May 10, 2024 · Artificial Intelligence

LLMOps: Definition, Fine‑tuning Techniques, Application Architecture, Challenges and Solutions

This article introduces LLMOps by defining large language model operations, explains the three stages of LLM development, details modern fine‑tuning methods such as PEFT, Adapter, Prefix, Prompt and LoRA, outlines the architecture for building LLM applications, discusses the main difficulties of agent‑based deployments, and presents practical solutions including Prompt IDE, low‑code deployment, monitoring and cost control.

AI OperationsLLMOpsModel Deployment

0 likes · 14 min read

LLMOps: Definition, Fine‑tuning Techniques, Application Architecture, Challenges and Solutions

Java Backend Technology

May 8, 2024 · Artificial Intelligence

Explore the Latest Open‑Source AI Projects: Llama 3, MaxKB, Phidata & RAGFlow

This article highlights four cutting‑edge open‑source AI initiatives—Meta’s Llama 3 large language model, the MaxKB knowledge‑base Q&A system, the Phidata framework for building AI assistants, and the RAGFlow retrieval‑augmented generation engine—detailing their capabilities, licensing, and where to access the code.

AIKnowledge BaseLLM

0 likes · 7 min read

Explore the Latest Open‑Source AI Projects: Llama 3, MaxKB, Phidata & RAGFlow

21CTO

May 6, 2024 · Databases

How Oracle’s New 23ai Database Brings AI-Powered Vector Search to Enterprises

Oracle’s latest release, Database 23ai, upgrades its 23c platform with AI-driven vector search, RAG capabilities, and enhanced JSON and graph querying, positioning the database as a unified, secure, and scalable solution for handling structured, semi‑structured, and unstructured data across cloud and on‑premises environments.

AIDatabaseOracle

0 likes · 7 min read

How Oracle’s New 23ai Database Brings AI-Powered Vector Search to Enterprises

AI Large Model Application Practice

May 3, 2024 · Artificial Intelligence

Can Giant Context LLMs Replace RAG? Exploring the Limits of Long‑Context Retrieval

This article examines whether the rapid growth of large‑language‑model context windows can eliminate the need for retrieval‑augmented generation, presenting experimental needle‑in‑a‑haystack tests, analysis of model performance across token lengths and needle positions, and practical guidance using an open‑source evaluation tool.

AILLMNeedle-in-a-Haystack

0 likes · 13 min read

Can Giant Context LLMs Replace RAG? Exploring the Limits of Long‑Context Retrieval

DataFunTalk

Apr 29, 2024 · Artificial Intelligence

Practical Experience and Q&A Exploration of Patent Large Models

This article presents a comprehensive overview of the development, training, data preparation, algorithmic strategies, evaluation methods, and RAG integration for a domain‑specific patent large language model, highlighting challenges, practical results, and future research directions.

Domain-specific ModelPatent AIRAG

0 likes · 19 min read

Practical Experience and Q&A Exploration of Patent Large Models

Rare Earth Juejin Tech Community

Apr 29, 2024 · Artificial Intelligence

Building Enterprise‑Grade Retrieval‑Augmented Generation (RAG) Systems: Challenges, Fault Points, and Best Practices

This comprehensive guide explores the complexities of building enterprise‑level Retrieval‑Augmented Generation (RAG) systems, detailing common failure points, architectural components such as authentication, input guards, query rewriting, document ingestion, indexing, storage, retrieval, generation, observability, caching, and multi‑tenant considerations, and provides actionable best‑practice recommendations for developers and technical leaders.

CachingEnterprise AILLM

0 likes · 32 min read

Building Enterprise‑Grade Retrieval‑Augmented Generation (RAG) Systems: Challenges, Fault Points, and Best Practices

Huolala Tech

Apr 25, 2024 · Artificial Intelligence

How LLM‑Powered Multi‑Agent AI Boosts Vehicle Selection in HuoLala’s Customer Service

This article details the design and implementation of an LLM‑driven multi‑agent AI customer‑service assistant for vehicle selection at HuoLala, covering system architecture, algorithmic solutions, retrieval‑augmented generation, NLU/NLG agents, performance improvements, and future outlooks.

AI Customer ServiceLLMRAG

0 likes · 12 min read

How LLM‑Powered Multi‑Agent AI Boosts Vehicle Selection in HuoLala’s Customer Service

Volcano Engine Developer Services

Apr 18, 2024 · Databases

How VikingDB Powers AI Retrieval and Scalable Vector Search

VikingDB is a cloud‑native vector database that originated at ByteDance, offering high‑performance ANN search, hybrid dense‑sparse retrieval for Retrieval‑Augmented Generation, extensive scaling and filtering capabilities, and ready‑to‑use SDKs for real‑world AI applications.

AI RetrievalANN indexingRAG

0 likes · 17 min read

How VikingDB Powers AI Retrieval and Scalable Vector Search

DevOps

Apr 17, 2024 · Artificial Intelligence

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

The article explores how enterprises can build and improve large‑model applications by combining prompt engineering, retrieval‑augmented generation (RAG), and fine‑tuning, discusses their relationships, optimization dimensions, testing challenges, and provides practical guidance for SE4AI implementation.

AI EngineeringEnterprise AILarge Language Models

0 likes · 20 min read

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

dbaplus Community

Apr 17, 2024 · Operations

Boost IT Operations with Offline LLMs: A Step‑by‑Step RAG Guide Using LangChain

This article explains how to build an offline knowledge base for IT operations by combining large language models with Retrieval‑Augmented Generation (RAG) using LangChain, covering document loading, chunking, embedding, vector storage, and query‑time retrieval with concrete code examples.

EmbeddingLLMLangChain

0 likes · 6 min read

Boost IT Operations with Offline LLMs: A Step‑by‑Step RAG Guide Using LangChain

21CTO

Apr 12, 2024 · Artificial Intelligence

How I Built an AI‑Powered Resume Chatbot with LLMs and RAG

Senior developer Jon Olson shares how he created an AI resume assistant using GPT‑4/3.5, LangChain, LlamaIndex, and retrieval‑augmented generation, detailing prompt engineering, backend integration, and future routing features to help job seekers showcase their skills.

AI chatbotLLMLangChain

0 likes · 8 min read

How I Built an AI‑Powered Resume Chatbot with LLMs and RAG

Rare Earth Juejin Tech Community

Apr 12, 2024 · Artificial Intelligence

Typical Business and Technical Architectures for Large Language Model Applications

This article reviews the common business and technical architectures used in large language model (LLM) applications, explains AI Embedded, AI Copilot, and AI Agent modes—including single‑ and multi‑agent systems—and offers guidance on selecting appropriate technology stacks such as prompt‑only, function‑calling agents, RAG, and fine‑tuning.

AI AgentLLMRAG

0 likes · 9 min read

Typical Business and Technical Architectures for Large Language Model Applications

Eric Tech Circle

Apr 11, 2024 · Artificial Intelligence

Build a Generative AI RAG App with Spring AI in Minutes

This guide walks you through setting up Spring AI, configuring model providers and vector stores, initializing a Spring Boot project, adding OpenAI credentials, and running a complete RAG (Retrieval‑Augmented Generation) demo with code snippets and sample API calls.

JavaOpenAIRAG

0 likes · 15 min read

Build a Generative AI RAG App with Spring AI in Minutes

Rare Earth Juejin Tech Community

Apr 11, 2024 · Artificial Intelligence

AI Hackathon Journey: Building the "Novel Jump" Bot on Coze Platform

This article recounts the author's participation in a Shenzhen AI Hackathon, detailing the development of an interactive novel‑character chatbot using the Coze platform, describing the workflow design, technical challenges, model choices, knowledge‑base construction, and the final demo and award outcomes.

AIChatbotCoze

0 likes · 12 min read

AI Hackathon Journey: Building the "Novel Jump" Bot on Coze Platform

HelloTech

Apr 10, 2024 · Artificial Intelligence

An Overview of LangChain: Architecture, Core Components, and Code Examples

LangChain is an open‑source framework that provides Python and JavaScript SDKs, templates, and services such as LangServe and LangSmith to compose models, embeddings, prompts, indexes, memory, chains, and agents via a concise expression language, enabling rapid prototyping, debugging, and deployment of LLM‑driven applications.

AI EngineeringJavaScriptLLM

0 likes · 19 min read

An Overview of LangChain: Architecture, Core Components, and Code Examples

Alibaba Cloud Developer

Apr 10, 2024 · Artificial Intelligence

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

This guide walks AI engineers through a rapid 10‑minute boot‑strap of LangChain, explaining its purpose, core concepts, design questions, environment setup, and step‑by‑step code examples that cover APIs, chains, memory, retrieval‑augmented generation, tools, agents, and the overall architecture.

AI EngineeringLLMLangChain

0 likes · 28 min read

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

Rare Earth Juejin Tech Community

Apr 8, 2024 · Artificial Intelligence

PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers

The article introduces PreFLMR, an open‑source, general‑purpose pre‑trained multimodal retriever that leverages fine‑grained late‑interaction to boost retrieval‑augmented generation for knowledge‑intensive visual tasks, describes its M2KR benchmark, training stages, and strong experimental results across multiple tasks.

AIFLMRKnowledge retrieval

0 likes · 11 min read

PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers

Rare Earth Juejin Tech Community

Mar 30, 2024 · Artificial Intelligence

Comprehensive Guide to Coze: AI Bot Development, Prompt Engineering, and Workflow Design

This article provides an in‑depth overview of the Coze low‑code AI bot platform, covering its core features, product comparisons, step‑by‑step bot creation, RAG implementation, plugin usage, memory mechanisms, cron jobs, agent design, advanced workflow techniques, quality management, and future prospects.

AI botCozeLLM

0 likes · 25 min read

Comprehensive Guide to Coze: AI Bot Development, Prompt Engineering, and Workflow Design

AI Large Model Application Practice

Mar 29, 2024 · Artificial Intelligence

How RAG Architecture Evolves: From Simple Chains to Flexible RAG Flows

This article examines the evolution of Retrieval‑Augmented Generation (RAG) architectures for large language models, outlines the challenges they face, introduces the modular RAG Flow concept with four workflow paradigms, and provides a step‑by‑step implementation using LangChain and LlamaIndex with code examples.

LLMLangChainRAG

0 likes · 15 min read

How RAG Architecture Evolves: From Simple Chains to Flexible RAG Flows

Sohu Tech Products

Mar 27, 2024 · Artificial Intelligence

Building a RAG Application with Baidu Vector Database and Qianfan Embedding

This tutorial walks through building a Retrieval‑Augmented Generation application by setting up Baidu’s Vector Database and Qianfan embedding service, configuring credentials, creating a document database and vector table, loading and chunking PDFs, generating embeddings, storing them, and performing scalar, vector and hybrid similarity searches, ready for integration with Wenxin LLM for answer generation.

AI applicationsBaidu QianfanEmbedding

0 likes · 11 min read

Building a RAG Application with Baidu Vector Database and Qianfan Embedding

Sohu Tech Products

Mar 27, 2024 · Artificial Intelligence

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

NVIDIA’s comprehensive LLM ecosystem combines the full‑stack NeMo Framework for data curation, distributed training, fine‑tuning, inference acceleration with TensorRT‑LLM and Triton, plus Retrieval‑Augmented Generation and Guardrails, enabling efficient, low‑latency, knowledge‑grounded model deployment across clusters.

AI accelerationLarge Language ModelsNVIDIA

0 likes · 16 min read

NVIDIA NeMo Framework, TensorRT‑LLM, and RAG for Large Language Model Solutions

Eric Tech Circle

Mar 24, 2024 · Artificial Intelligence

Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison

This guide compares Ollama and Hugging Face for running large language models locally, detailing API and local execution methods, installation steps, model selection, resource requirements, integration with AnythingLLM, container deployment, embedding and vector store setup, and practical observations on performance and limitations.

AnythingLLMDockerEmbedding

0 likes · 15 min read

Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison