Tagged articles

2079 articles

Page 9 of 21

Dec 12, 2025 · Artificial Intelligence

How BookRAG Redefines Long-Document Retrieval with Hierarchical Indexing

BookRAG introduces a hierarchical, structure‑aware indexing method that combines tree‑based document representation with graph‑based entity linking and an agent‑driven retrieval pipeline, achieving up to 71.2% recall improvement on multimodal long‑document benchmarks while cutting token usage and latency dramatically.

Agent RetrievalHierarchical IndexingLLM

0 likes · 7 min read

How BookRAG Redefines Long-Document Retrieval with Hierarchical Indexing

PaperAgent

Dec 11, 2025 · Artificial Intelligence

Which Small Language Model Wins After Fine‑Tuning? A Data‑Driven Benchmark

A comprehensive benchmark fine‑tunes twelve small language models on eight diverse tasks, compares them against a 120B teacher model, and reveals which models excel overall, which are most "plastic" for improvement, and how small models can rival much larger ones.

AIBenchmarkLLM

0 likes · 11 min read

Which Small Language Model Wins After Fine‑Tuning? A Data‑Driven Benchmark

Xiaohongshu Tech REDtech

Dec 11, 2025 · Artificial Intelligence

Fine‑Grained Activation Offloading: Cutting Memory Use While Preserving LLM Throughput

The article introduces a fine‑grained activation offloading technique implemented in Megatron‑Core that offloads module‑level activations to CPU, overlaps transfer with computation, and remains compatible with pipeline and virtual pipeline parallelism, dramatically reducing peak GPU memory for large language models while incurring minimal throughput loss.

LLMMegatronMemory Optimization

0 likes · 18 min read

Fine‑Grained Activation Offloading: Cutting Memory Use While Preserving LLM Throughput

Wu Shixiong's Large Model Academy

Dec 11, 2025 · Artificial Intelligence

Why Reward Models Need Reasoning: From Scalar Scores to RM‑R1

Interviewers increasingly ask why modern reward models must go beyond scalar scores to incorporate reasoning, and this article explains the limitations of traditional scalar reward models, the benefits of the RM‑R1 framework, and how reasoning‑based rewards improve alignment, stability, and task performance in large language model training.

AI alignmentLLMRLHF

0 likes · 11 min read

Why Reward Models Need Reasoning: From Scalar Scores to RM‑R1

Sohu Tech Products

Dec 10, 2025 · Artificial Intelligence

Build a Next.js Chatbot Quickly with Vercel AI SDK

This guide explains how to integrate large language models into modern web applications using Vercel AI SDK, covering core modules, package responsibilities, when to choose each package, installation steps, example code for both backend API routes and React front‑end, and a complete quick‑start workflow.

AI IntegrationLLMNext.js

0 likes · 12 min read

Build a Next.js Chatbot Quickly with Vercel AI SDK

Baidu Intelligent Cloud Tech Hub

Dec 10, 2025 · Artificial Intelligence

Accelerate LLM Deployment on Baidu Kunlun XPU with the Open‑Source vLLM‑Kunlun Plugin

The vLLM‑Kunlun Plugin, built on the vLLM hardware‑plugin RFC, lets developers deploy any major large language model on Baidu's Kunlun XPU instantly without modifying vLLM core code, dramatically shortening migration time, providing high‑performance fusion operators, and offering open‑source tools for precision verification and profiling.

KunlunLLMOpen Source

0 likes · 8 min read

Accelerate LLM Deployment on Baidu Kunlun XPU with the Open‑Source vLLM‑Kunlun Plugin

BirdNest Tech Talk

Dec 9, 2025 · Artificial Intelligence

How BettaFish Uses Multi‑Agent AI to Break the Information Filter Bubble

BettaFish is a Go‑based, AI‑driven multi‑agent opinion analysis platform that tackles information silos, overload, and bias by aggregating data from diverse sources, iteratively refining results through reflection loops, and delivering visualized, actionable reports for scientific decision‑making.

AIData visualizationGo

0 likes · 24 min read

How BettaFish Uses Multi‑Agent AI to Break the Information Filter Bubble

Tech Freedom Circle

Dec 9, 2025 · Artificial Intelligence

LangGraph Source Code Deep Dive: From Zero to One – Multi‑Agent Architecture and Core Implementation

This article dissects LangGraph, the graph‑based workflow engine for LLM agents, explaining its stateful design, node‑edge architecture, compilation and execution process, and why it outperforms traditional linear LangChain chains for complex multi‑step AI applications.

AI orchestrationGraph WorkflowLLM

0 likes · 26 min read

LangGraph Source Code Deep Dive: From Zero to One – Multi‑Agent Architecture and Core Implementation

PaperAgent

Dec 9, 2025 · Artificial Intelligence

How Code Graph Model (CGM) Redefines Repository‑Level Code Understanding

The Code Graph Model (CGM) introduced by Ant's multimodal code team integrates repository‑level graph structures into open‑source LLMs, achieving a 44% solve rate on SWE‑bench Lite, eliminating agent dependence, and demonstrating a novel graph‑enhanced code model through multi‑granular graph construction, dual‑modal alignment, and a lightweight GraphRAG framework.

AICode GraphGraphRAG

0 likes · 9 min read

How Code Graph Model (CGM) Redefines Repository‑Level Code Understanding

AI Info Trend

Dec 9, 2025 · Industry Insights

Which LLM Leads the 2025 AI Race? Intelligence, Speed, and Cost Rankings Revealed

The article analyzes 2025 large‑language‑model benchmarks across intelligence, speed, and price, highlighting top performers like Gemini 3 Pro, Claude Opus 4.5, and Grok 4.1 Fast, and discusses trade‑offs, cost democratization, and future trends in AI development.

AI RankingCostIntelligence

0 likes · 9 min read

Which LLM Leads the 2025 AI Race? Intelligence, Speed, and Cost Rankings Revealed

DeWu Technology

Dec 8, 2025 · Artificial Intelligence

Unlocking Model Context Protocol (MCP): A Deep Dive into AI‑Database Integration

This article provides a comprehensive technical overview of the Model Context Protocol (MCP), an open‑standard JSON‑RPC 2.0 protocol that enables large language models to securely interact with external data sources, tools, and services, detailing its design, architecture, Python SDK implementation, transport mechanisms, and real‑world deployment examples such as the DW‑DBA‑MCP project.

LLMModel Context ProtocolPython SDK

0 likes · 45 min read

Unlocking Model Context Protocol (MCP): A Deep Dive into AI‑Database Integration

Tencent Technical Engineering

Dec 8, 2025 · Artificial Intelligence

Building Persistent Long‑Term Memory for LLM Agents with LangGraph – A Complete Guide

This article explains how to give large language model agents lasting memory by combining short‑term and long‑term storage in LangGraph, covering concepts, implementation details, database persistence, tool integration, semantic search, memory‑management strategies, checkpoint handling, and a multi‑agent supervisor example.

Agent MemoryLLMLangGraph

0 likes · 43 min read

Building Persistent Long‑Term Memory for LLM Agents with LangGraph – A Complete Guide

AI Large Model Application Practice

Dec 8, 2025 · Artificial Intelligence

Unlocking Anthropic’s New Agent Toolkit: MCP, PTC, Skills, and Subagents Explained

This article breaks down Anthropic’s latest Agent engineering concepts—MCP, Programmatic Tool Calling (PTC), Skills, and Subagents—showing how they work together to reduce latency, token cost, and context overload while enabling modular, scalable AI workflows.

AnthropicLLMPTC

0 likes · 13 min read

Unlocking Anthropic’s New Agent Toolkit: MCP, PTC, Skills, and Subagents Explained

Wuming AI

Dec 7, 2025 · Artificial Intelligence

What Is MCP and How It Revolutionizes AI Tool Integration

This article explains the MCP protocol for AI agents, detailing why a universal tool‑calling standard is needed, how it solves the M×N integration nightmare, the roles and execution stages involved, and demonstrates its use with Cherry Studio while highlighting current limitations.

AI AgentCherry StudioLLM

0 likes · 20 min read

What Is MCP and How It Revolutionizes AI Tool Integration

BirdNest Tech Talk

Dec 7, 2025 · Artificial Intelligence

Recreating DeerFlow’s Multi‑Agent Research Pipeline with LangGraphGo in 30 Minutes

This article walks through the open‑source DeerFlow framework—its multi‑agent architecture, core features, and a step‑by‑step implementation using the Go‑based LangGraphGo library, covering planner, researcher, reporter and podcast nodes, state‑graph design, CLI/web modes, and deployment instructions.

AI researchLLMLangGraphGo

0 likes · 14 min read

Recreating DeerFlow’s Multi‑Agent Research Pipeline with LangGraphGo in 30 Minutes

21CTO

Dec 7, 2025 · Backend Development

Top Laravel AI Packages to Power Intelligent Web Apps

This article reviews the most popular and actively maintained Laravel AI packages—including Prism, LarAgent, Laravel AI Toolkit, and Laravel MCP—detailing their features, typical use‑cases, and how to choose the right one for building chatbots, automation agents, content generators, and AI‑enhanced Laravel applications.

AIBackend DevelopmentLLM

0 likes · 6 min read

Top Laravel AI Packages to Power Intelligent Web Apps

PaperAgent

Dec 7, 2025 · Industry Insights

What 1,000 Trillion Tokens Reveal About the Rise of Open‑Source LLMs

A massive 1 000 trillion‑token study by a16z and OpenRouter shows open‑source models now hold a third of the LLM market, programming tasks have surged to over 50 % of usage, role‑play scenarios dominate open‑source traffic, and price elasticity is surprisingly low, reshaping the competitive landscape.

AI MarketLLMOpen-source models

0 likes · 6 min read

What 1,000 Trillion Tokens Reveal About the Rise of Open‑Source LLMs

Baobao Algorithm Notes

Dec 7, 2025 · Artificial Intelligence

Can RL Really Boost LLM Reasoning? A Critical Review of Recent Findings

This article critically examines recent RL‑for‑LLM studies, revealing that reinforcement learning improves search efficiency but does not extend the intrinsic reasoning capabilities of base models, and explores the underlying model‑conditioned optimization bias, comparisons with SFT distillation, and the trade‑off with catastrophic forgetting.

Catastrophic ForgettingLLMSFT

0 likes · 11 min read

Can RL Really Boost LLM Reasoning? A Critical Review of Recent Findings

Data Party THU

Dec 6, 2025 · Artificial Intelligence

Why Adding Toxic Data Can Make Language Models Safer and More Capable

A recent study shows that deliberately mixing a moderate amount of toxic content into large‑language‑model pre‑training actually sharpens the model’s internal representation of toxicity, enabling post‑training interventions to more effectively detoxify the model while preserving or even improving its general capabilities.

LLMToxic Datadetoxification

0 likes · 10 min read

Why Adding Toxic Data Can Make Language Models Safer and More Capable

Bighead's Algorithm Notes

Dec 5, 2025 · Artificial Intelligence

Quantitative Finance Paper Summaries (Nov 29–Dec 5 2025)

This article presents concise summaries of five recent AI‑driven finance papers, covering a stress‑testing framework for LLM trading agents, an orchestration framework for financial agents, an event‑reflection memory model for stock forecasting, a hybrid LLM‑Bayesian network architecture for options wheel strategies, and their experimental results.

BenchmarkingFinancial AILLM

0 likes · 12 min read

Quantitative Finance Paper Summaries (Nov 29–Dec 5 2025)

PaperAgent

Dec 5, 2025 · Artificial Intelligence

Can LLMs Be Trained to Confess? Inside the “Confession” Method for Honest AI

The article reviews OpenAI’s “Confession” training approach for large language models, explains why traditional RLHF fails to ensure honesty, details the confession methodology and PPO update, presents experimental results showing higher honesty rates, analyzes error cases, and discusses limitations and future risks.

AI honestyArtificial IntelligenceConfession Training

0 likes · 6 min read

Can LLMs Be Trained to Confess? Inside the “Confession” Method for Honest AI

Wu Shixiong's Large Model Academy

Dec 5, 2025 · Artificial Intelligence

Why Do LLM Function Calls Hallucinate Parameters and How to Prevent It?

This article explains the root causes of hallucinated parameters in LLM Function Calls, outlines five common failure patterns, and presents a systematic five‑step engineering framework—including schema design, prompt rules, dynamic routing, result validation, and clarification—to reliably eliminate such errors in real‑world AI agents.

AI AgentLLMfunction call

0 likes · 11 min read

Why Do LLM Function Calls Hallucinate Parameters and How to Prevent It?

Frontend AI Walk

Dec 5, 2025 · Artificial Intelligence

Master Prompt Engineering: From Random Chat to Precise Control with Zero-shot, Few-shot, and Chain‑of‑Thought

This article explains how to converse effectively with large language models by mastering three core prompting techniques—Zero‑shot, Few‑shot, and Chain‑of‑Thought—illustrated with front‑end analogies, code snippets, and a step‑by‑step DeepSeek JSON‑generation exercise that shows common pitfalls and best practices.

DeepSeekFew-shotJSON generation

0 likes · 12 min read

Master Prompt Engineering: From Random Chat to Precise Control with Zero-shot, Few-shot, and Chain‑of‑Thought

Fun with Large Models

Dec 5, 2025 · Artificial Intelligence

DeepSeek Math V2 & V3.2: A Plain‑Language Deep Dive into Core Innovations

This article provides a detailed, easy‑to‑understand analysis of DeepSeek‑Math‑V2’s self‑verification training method and DeepSeek‑V3.2’s GRPO framework, sparse‑attention DSA mechanism, massive agent data pipeline, and benchmark results that place both models among the world’s top open‑source large language models.

DeepSeekGRPOLLM

0 likes · 19 min read

DeepSeek Math V2 & V3.2: A Plain‑Language Deep Dive into Core Innovations

Bighead's Algorithm Notes

Dec 4, 2025 · Artificial Intelligence

Paper Review: RETuning Boosts Large‑Model Stock Trend Prediction Reasoning

This article analyzes the RETuning framework, which addresses LLMs' bias toward analyst opinions and lack of evidence weighting in stock movement prediction by introducing a two‑stage cold‑start fine‑tuning and reinforcement learning pipeline, evaluating it on the large Fin‑2024 dataset and demonstrating significant F1 gains, inference‑time scaling, and out‑of‑distribution robustness.

Fin-2024GRPOInference Scaling

0 likes · 12 min read

Paper Review: RETuning Boosts Large‑Model Stock Trend Prediction Reasoning

DataFunTalk

Dec 4, 2025 · Artificial Intelligence

Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking: Cutting‑Edge AI Search Techniques

This article reviews three advanced AI search solutions—Alibaba Cloud's Agentic RAG architecture for multi‑modal retrieval, Huawei's LLM‑enhanced recommendation system with factorized prompting, and Baidu's generative ranking model GRAB—detailing their technical challenges, design choices, performance gains, and deployment insights.

AI SearchBaiduGenerative Ranking

0 likes · 8 min read

Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking: Cutting‑Edge AI Search Techniques

ShiZhen AI

Dec 4, 2025 · Artificial Intelligence

What Is a Context Window? Explaining LLM Memory Capacity

The article explains that a context window defines an LLM's token‑level memory capacity, shows how longer windows cause quadratic computation growth, introduces KV Cache as a way to extend context without exploding resources, and covers advanced techniques like Ring Attention, NIAH benchmarking, and attention decay in long sequences.

KV CacheLLMNIAH benchmark

0 likes · 6 min read

What Is a Context Window? Explaining LLM Memory Capacity

Aikesheng Open Source Community

Dec 4, 2025 · Artificial Intelligence

Gemini 3 Pro vs DeepSeek‑V3.2‑Exp: Which LLM Dominates SQL Understanding, Optimization, and Dialect Conversion?

This report evaluates the professional‑grade LLMs Gemini 3 Pro and DeepSeek‑V3.2‑Exp on three SQL‑related dimensions—understanding, optimization, and dialect conversion—using the SCALE benchmark, presenting detailed scores, strengths, weaknesses, and practical recommendations for database engineers and decision makers.

DatabaseDeepSeekGemini

0 likes · 16 min read

Gemini 3 Pro vs DeepSeek‑V3.2‑Exp: Which LLM Dominates SQL Understanding, Optimization, and Dialect Conversion?

Past Memory Big Data

Dec 4, 2025 · Artificial Intelligence

Text2SQL Showdown: Which Technical Path Delivers Higher Accuracy and Lower Cost?

The article analyzes two contrasting Text2SQL architectures—LLM + RAG + DSL versus rule‑driven NLQ—examining their accuracy under controlled conditions, implementation costs, complex query support, and real‑world suitability for enterprise BI, and concludes which approach is more reliable and cost‑effective.

AI+RulesBusiness IntelligenceDSL

0 likes · 16 min read

Text2SQL Showdown: Which Technical Path Delivers Higher Accuracy and Lower Cost?

Wuming AI

Dec 3, 2025 · Artificial Intelligence

How to Reduce LLM Hallucinations: Model Selection, Web Search, and Verification Agents

This article explains a step‑by‑step workflow for mitigating large‑language‑model hallucinations by picking low‑hallucination models, leveraging internet‑enabled search tools, rephrasing queries, and creating a dedicated verification assistant with concrete prompts and a Claude implementation.

LLMPrompt Engineeringhallucination

0 likes · 6 min read

How to Reduce LLM Hallucinations: Model Selection, Web Search, and Verification Agents

Tencent Technical Engineering

Dec 3, 2025 · Artificial Intelligence

Why Transformers Power Modern LLMs: A Deep Dive into Architecture and Mechanics

This article provides a comprehensive, step‑by‑step explanation of the Transformer architecture that underpins large language models, covering tokenization, embeddings, positional encoding, attention mechanisms, feed‑forward networks, layer stacking, a detailed translation example, visualized attention weights, and a survey of recent open‑source LLM designs such as DeepSeek V3, OLMo 2, and Gemma 3.

EmbeddingLLMNeural Network

0 likes · 38 min read

Why Transformers Power Modern LLMs: A Deep Dive into Architecture and Mechanics

360 Smart Cloud

Dec 3, 2025 · Artificial Intelligence

How Model Distillation Enhances LLM Performance on the TLM Platform

This article explains the TLM large‑model development platform and details how knowledge distillation—using soft labels, temperature scaling, and combined loss functions—compresses teacher models into efficient student models, with practical steps and evaluation on the platform.

AIKnowledge DistillationLLM

0 likes · 5 min read

How Model Distillation Enhances LLM Performance on the TLM Platform

AntData

Dec 3, 2025 · Artificial Intelligence

How to Build and Refine Your Personal AI Agent Assistant

This article walks through turning a generic AI model into a personal assistant by explaining user‑centric workflows, crafting effective natural‑language prompts, adding clarification steps, validating AI‑generated results through multiple methods, and handling errors with product interactions to create a reliable, evolving assistant.

ChatBILLMresult validation

0 likes · 10 min read

How to Build and Refine Your Personal AI Agent Assistant

DataFunTalk

Dec 2, 2025 · Artificial Intelligence

How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search

This article reviews three cutting‑edge AI search and recommendation techniques—Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's GRAB generative ranking model—detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI AgentsAI SearchGenerative Ranking

0 likes · 8 min read

How Agentic RAG, LLM‑Powered Recommendation, and Generative Ranking Are Redefining AI Search

AsiaInfo Technology: New Tech Exploration

Dec 2, 2025 · Artificial Intelligence

How LLMs Can Revolutionize Test Case Generation: Methods, Benefits, and Challenges

This article examines the shortcomings of manual test case creation, explains how large language models (LLMs) can dramatically improve efficiency, coverage, consistency, and knowledge sharing in software testing, outlines the key capabilities required, and presents a detailed end‑to‑end solution with practical steps, evaluation metrics, and future outlook.

AI automationKnowledge BaseLLM

0 likes · 20 min read

How LLMs Can Revolutionize Test Case Generation: Methods, Benefits, and Challenges

Frontend AI Walk

Dec 2, 2025 · Artificial Intelligence

Understanding LLMs: A Frontend Developer’s Primer on Large Language Models

The article demystifies large language models for frontend developers by likening token prediction to autocomplete, explaining tokens, context windows, temperature, the two-stage training process, and the critical role of prompts, using concrete code examples and analogies to familiar frontend concepts.

Frontend AnalogyLLMLarge Language Model

0 likes · 10 min read

Understanding LLMs: A Frontend Developer’s Primer on Large Language Models

Tencent Technical Engineering

Dec 1, 2025 · Artificial Intelligence

Do Machines Really Think? Inside Deep Reasoning, Scaling Laws & RLHF for LLMs

This article examines whether large language models truly think, explores the origins of deep reasoning through transformer architectures and scaling laws, reviews chain‑of‑thought and its variants, and analyzes how reinforcement learning from human feedback—including PPO, DPO, and GRPO—helps internalise step‑by‑step reasoning while pointing to future directions such as atomic thought, hierarchical models, and training‑free in‑context knowledge bases.

AI alignmentLLMRLHF

0 likes · 35 min read

Do Machines Really Think? Inside Deep Reasoning, Scaling Laws & RLHF for LLMs

AI Large Model Application Practice

Dec 1, 2025 · Artificial Intelligence

Which Open‑Source Agent Memory Engine Wins? Deep Dive into Mem0, Graphiti & Cognee

This article examines the limitations of LLM short‑term context windows and compares three open‑source long‑term memory frameworks—Mem0, Graphiti, and Cognee—by detailing their architectures, storage modes, integration steps, code examples, strengths, drawbacks, and practical selection guidance for building smarter AI agents.

Agent MemoryGraphitiLLM

0 likes · 20 min read

Which Open‑Source Agent Memory Engine Wins? Deep Dive into Mem0, Graphiti & Cognee

AI Tech Publishing

Nov 30, 2025 · Artificial Intelligence

Agent Architecture Design Part 1: Context Compression Strategies and Their Use Cases

The article explains why large‑model agents need context compression, outlines five engineering‑level schemes (both lossless and lossy), demonstrates each with concrete XML snippets and step‑by‑step reasoning, and advises using lossless methods before resorting to lossy prompt‑driven compression.

LLMLosslessLossy

0 likes · 12 min read

Agent Architecture Design Part 1: Context Compression Strategies and Their Use Cases

Bighead's Algorithm Notes

Nov 30, 2025 · Artificial Intelligence

How TSci Uses LLMs to Automate End‑to‑End Time‑Series Forecasting

The article reviews the TSci framework, an LLM‑driven multi‑agent system that automates data diagnosis, model selection, ensemble forecasting, and report generation for time‑series prediction, achieving up to 38 % lower MAE than LLM baselines and improving report quality across five evaluation dimensions.

Agent FrameworkLLMTSci

0 likes · 10 min read

How TSci Uses LLMs to Automate End‑to‑End Time‑Series Forecasting

DataFunSummit

Nov 29, 2025 · Artificial Intelligence

How LLMs Are Transforming Long-Term Cross-Domain Interest Modeling for Recommendations

The Datafun Summit 2025 talk by JD’s algorithm engineer Tian Mingyang explains how generative AI is driving a paradigm shift in recommendation systems, detailing the limits of traditional models, the new dynamic cross‑domain inference chain technique, joint engineering‑algorithm optimizations, and the remaining challenges for future deployment.

AICross-Domain ModelingEngineering Optimization

0 likes · 32 min read

How LLMs Are Transforming Long-Term Cross-Domain Interest Modeling for Recommendations

Data Party THU

Nov 29, 2025 · Artificial Intelligence

Unlocking AI Agents: From Fundamentals to Building Your First LLM‑Powered Agent

This comprehensive guide explores the concept of AI agents, detailing their definitions, classifications, and core interaction loops, then walks you through building a functional LLM‑driven travel assistant with step‑by‑step code, tool integration, and practical insights on agent versus workflow paradigms.

AI AgentsAgent ArchitectureLLM

0 likes · 39 min read

Unlocking AI Agents: From Fundamentals to Building Your First LLM‑Powered Agent

PaperAgent

Nov 29, 2025 · Industry Insights

NeurIPS 2025 Insights: AI Agents, Reasoning, and the Shift to Real-World Systems

An analysis of the 5,984 papers accepted at NeurIPS 2025 shows a decisive move from ever‑larger models toward agents, reasoning‑focused LLMs, efficiency engineering, AI for Science, and trustworthy AI, signaling the transition from a research‑toy era to an engineering‑driven AI ecosystem.

AI for ScienceAI trendsLLM

0 likes · 7 min read

NeurIPS 2025 Insights: AI Agents, Reasoning, and the Shift to Real-World Systems

Huya Tech Engineering

Nov 28, 2025 · Operations

How LLMs Accelerate Root‑Cause Diagnosis in Large‑Scale Microservices

By abstracting a massive microservice system as a dynamic multi‑layer graph and integrating large language models, the article outlines three evolution stages—from manual expert debugging to rule‑based AIOps and finally LLM‑driven cognitive reasoning—detailing practical workflows, context engineering, and real‑world case studies that dramatically improve MTTR and accuracy.

Context EngineeringLLMObservability

0 likes · 20 min read

How LLMs Accelerate Root‑Cause Diagnosis in Large‑Scale Microservices

Bilibili Tech

Nov 28, 2025 · Artificial Intelligence

How We Built an LLM‑Powered AI Hub to Read and Analyze Community Chats

This article details the design and deployment of a multi‑layer LLM system that automatically reads massive creator group chats, extracts structured insights, mitigates hallucinations with dual‑model verification, uses few‑shot prompting for stable output, and delivers real‑time risk alerts and operational reports.

AI OperationsLLMPrompt Engineering

0 likes · 14 min read

How We Built an LLM‑Powered AI Hub to Read and Analyze Community Chats

ShiZhen AI

Nov 28, 2025 · Artificial Intelligence

DeepSeekMath‑V2 Scores 118/120 on Putnam and Achieves Gold‑Level IMO Performance

DeepSeekMath‑V2, released open‑source on 27 Nov 2025, attains gold‑level results on IMO 2025, scores 118 out of 120 on the Putnam 2024 competition, introduces a generator‑verifier self‑verification architecture, uses GRPO training, and outperforms leading closed‑source models on IMO‑ProofBench.

BenchmarkDeepSeekMath-V2GRPO

0 likes · 7 min read

DeepSeekMath‑V2 Scores 118/120 on Putnam and Achieves Gold‑Level IMO Performance

phodal

Nov 27, 2025 · Artificial Intelligence

How AutoDev’s Agentic RAG Turns Docs into a Programmable Knowledge Base

This article explains how AutoDev builds an Agentic Retrieval‑Augmented Generation system with a Document Query Language (DocQL) that lets LLM agents navigate hierarchical code and documentation structures using JSONPath‑like queries, detailing implementation, multi‑level keyword expansion, and experimental findings.

AIAgentic RAGDocQL

0 likes · 12 min read

How AutoDev’s Agentic RAG Turns Docs into a Programmable Knowledge Base

Data Party THU

Nov 27, 2025 · Artificial Intelligence

Choosing an Agent Framework: AutoGen, AgentScope, CAMEL, LangGraph Compared

This article examines the evolution of intelligent agent frameworks, presenting a comprehensive overview of AutoGen, AgentScope, CAMEL, and LangGraph, analyzing their architectures, strengths, limitations, and suitable use cases, and offering guidance on selecting the most appropriate framework for complex multi‑agent applications.

Agent FrameworksLLMcomparative analysis

0 likes · 31 min read

Choosing an Agent Framework: AutoGen, AgentScope, CAMEL, LangGraph Compared

Bilibili Tech

Nov 27, 2025 · Artificial Intelligence

Mastering Agentic Systems with Blades: Concepts, Code, and Workflow Patterns

This article explains what an AI Agent is, distinguishes it from traditional workflows, and demonstrates how to build and customize agents using the Go‑based Blades framework, covering core concepts, code examples, five workflow patterns, best‑practice guidelines, and reference resources.

AIBladesGo

0 likes · 11 min read

Mastering Agentic Systems with Blades: Concepts, Code, and Workflow Patterns

phodal

Nov 26, 2025 · Artificial Intelligence

How Multi‑Agent AI Transforms Code Review into Automated Fixes

AutoDev leverages a multi‑agent architecture and comprehensive information aggregation to turn traditional, fragmented code review into an intelligent, end‑to‑end process that not only detects issues but also generates and applies corrective patches automatically.

AICode ReviewDevOps

0 likes · 9 min read

How Multi‑Agent AI Transforms Code Review into Automated Fixes

Java Tech Enthusiast

Nov 26, 2025 · Artificial Intelligence

How LLM, RAG, and AI Agents Work Together

The article clarifies how large language models (LLM), retrieval‑augmented generation (RAG), and AI agents complement each other, describing the brain‑like reasoning of LLMs, the dynamic knowledge access provided by RAG, and the autonomous action capabilities of AI agents, plus practical usage scenarios.

AI AgentArtificial IntelligenceLLM

0 likes · 7 min read

How LLM, RAG, and AI Agents Work Together

Bighead's Algorithm Notes

Nov 25, 2025 · Artificial Intelligence

FinSentLLM: A Multi‑LLM Framework for Financial Sentiment Prediction

FinSentLLM integrates multiple LLM experts with structured financial semantic signals, achieving 3‑6% higher accuracy and F1 on the Financial PhraseBank compared to baselines, while DCC‑GARCH and Johansen cointegration analyses confirm a statistically significant long‑term co‑movement between the predicted sentiment signals and stock market dynamics.

DCC-GARCHFinSentLLMFinancial Sentiment Analysis

0 likes · 12 min read

FinSentLLM: A Multi‑LLM Framework for Financial Sentiment Prediction

AI Info Trend

Nov 25, 2025 · Artificial Intelligence

Why Claude Opus 4.5 Is the New Powerhouse for Enterprise AI Agents

Claude Opus 4.5, Anthropic’s latest flagship LLM, dramatically upgrades reasoning, tool use, and multi‑step automation, targeting high‑intensity enterprise scenarios, offering stronger coding, longer context handling, and better cost‑effectiveness, while still requiring careful prompt engineering and budgeting for token usage.

Claude Opus 4.5Coding AutomationEnterprise AI

0 likes · 7 min read

Why Claude Opus 4.5 Is the New Powerhouse for Enterprise AI Agents

Tencent Technical Engineering

Nov 24, 2025 · Artificial Intelligence

Inside Google gemini-cli: Turning the Terminal into an AI Agent with ReAct Architecture

This article systematically dissects Google’s open‑source gemini‑cli, revealing how it transforms a traditional command‑line terminal into an AI‑driven collaborative interface by detailing its ReAct loop, tool‑calling mechanisms, context management, and extensible architecture for building similar terminal agents.

AI AgentCLIGemini CLI

0 likes · 27 min read

Inside Google gemini-cli: Turning the Terminal into an AI Agent with ReAct Architecture

Wu Shixiong's Large Model Academy

Nov 24, 2025 · Artificial Intelligence

Why Dynamic Function Routing Is the Key to Stable LLM Agents

In real‑world LLM agents, giving the model too many tools at once leads to frequent function‑call errors, but applying dynamic function routing to narrow the candidate set dramatically reduces the error rate—from over 20% down to around 1%—and provides clear guidelines on when and how to implement it.

Dynamic RoutingFunction CallingLLM

0 likes · 9 min read

Why Dynamic Function Routing Is the Key to Stable LLM Agents

Architect's Guide

Nov 24, 2025 · Artificial Intelligence

Building Java LLM Applications with LangChain4j: A Hands‑On Guide

This tutorial walks through the fundamentals of large language models, prompt engineering, and word embeddings, then shows how to set up a LangChain‑based LLM stack in Java using LangChain4j, covering core modules, memory, retrieval, chains, agents, and complete code examples.

AI AgentsLLMLangChain

0 likes · 15 min read

Building Java LLM Applications with LangChain4j: A Hands‑On Guide

AI Large Model Application Practice

Nov 24, 2025 · Artificial Intelligence

How to Turn Text into an AI‑Powered PPT Video: A Step‑by‑Step Guide

This article breaks down the end‑to‑end engineering pipeline that converts a knowledge source such as a URL or PDF into a narrated PPT‑style video, detailing six core stages—from knowledge extraction and script generation to image creation, voice synthesis, and final video stitching—while highlighting practical model choices, prompt design, and stability tricks.

Artificial IntelligenceLLMPPT

0 likes · 16 min read

How to Turn Text into an AI‑Powered PPT Video: A Step‑by‑Step Guide

AI Tech Publishing

Nov 23, 2025 · Artificial Intelligence

How Agents Leverage File Systems for Context Engineering

The article examines why file system access is crucial for autonomous agents, outlining common context‑engineering failures such as missing, excessive, or irrelevant information, and demonstrates how using file‑system tools like ls, grep, and write‑file can reduce token waste, enable dynamic storage, improve targeted search, and support continual learning.

Context EngineeringLLMToken Management

0 likes · 11 min read

How Agents Leverage File Systems for Context Engineering

Wu Shixiong's Large Model Academy

Nov 21, 2025 · Artificial Intelligence

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

This article explains why dynamic Retrieval‑Augmented Generation (RAG) requires a layered caching strategy rather than simple result caching, details a four‑level cache architecture—including embedding, search, answer, and pipeline caches—provides practical key‑generation and TTL guidelines, and outlines dirty‑data defenses to keep caches consistent and performant.

AI EngineeringCachingLLM

0 likes · 10 min read

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

Youzan Coder

Nov 21, 2025 · Artificial Intelligence

How to Build, Evaluate, and Optimize AI Test Agents: A Practical Guide

This guide walks you through creating AI‑powered test agents, defining success metrics, building evaluation datasets, crafting and refining system prompts with techniques like chain‑of‑thought, XML, few‑shot and concise inputs, and scaling the workflow by splitting agents and managing prompt versions.

AI AgentsEvaluationLLM

0 likes · 21 min read

How to Build, Evaluate, and Optimize AI Test Agents: A Practical Guide

Qunhe Technology Quality Tech

Nov 20, 2025 · Artificial Intelligence

How to Build a Quantifiable Quality Assurance System for AI‑Native Products

This article explains the background of AI‑native products, uses VoxDeck as a case study to illustrate typical generation successes and failures, and proposes a systematic, metric‑driven quality‑assurance framework—including data sampling, multi‑dimensional anomaly detection, AI‑assisted checks, and continuous improvement—to boost efficiency, reliability, and business value of AI‑generated content.

AI-nativeLLMPrompt Engineering

0 likes · 14 min read

How to Build a Quantifiable Quality Assurance System for AI‑Native Products

Baobao Algorithm Notes

Nov 20, 2025 · Artificial Intelligence

Why Reinforcement Learning Preserves LLM Generality Better Than Supervised Fine‑Tuning

The article analyzes why reinforcement learning (RL) fine‑tuning retains a large language model's general abilities better than supervised fine‑tuning (SFT), explaining the off‑policy distribution shift of SFT and the on‑policy data consistency, KL penalty, and trust‑region mechanisms that give RL its anti‑forgetting properties.

Catastrophic ForgettingLLMOn-Policy Data

0 likes · 8 min read

Why Reinforcement Learning Preserves LLM Generality Better Than Supervised Fine‑Tuning

Alibaba Cloud Big Data AI Platform

Nov 19, 2025 · Big Data

How We Migrated 100k BigQuery SQL Scripts to MaxCompute Using AST and LLM Automation

This article details a real‑world migration of a Southeast Asian tech group’s data warehouse from Google BigQuery to Alibaba Cloud MaxCompute, describing the challenges of converting 100,000 SQL scripts, the AST‑driven and LLM‑assisted automation pipeline, rule‑engine iteration, quality control, and the measurable performance and cost benefits achieved.

ASTBigQueryData Warehouse

0 likes · 12 min read

How We Migrated 100k BigQuery SQL Scripts to MaxCompute Using AST and LLM Automation

Baidu Maps Tech Team

Nov 19, 2025 · Artificial Intelligence

Boosting Socio‑Economic Q&A: The ARAG Framework Merges Structured Data Analysis with RAG

ARAG introduces a novel Retrieval‑Augmented Generation framework that tightly integrates LLM‑driven structured data analysis with unstructured information retrieval, addressing the “structured + unstructured” reasoning gap in socio‑economic queries, and demonstrates superior accuracy, robustness, and hallucination resistance through extensive evaluations.

Data AnalysisHallucination MitigationLLM

0 likes · 12 min read

Boosting Socio‑Economic Q&A: The ARAG Framework Merges Structured Data Analysis with RAG

Data STUDIO

Nov 19, 2025 · Artificial Intelligence

Why TOON Beats JSON for LLM Data Exchange: Token Savings and Accuracy Gains

The article explains how the Token‑Oriented Object Notation (TOON) format reduces token usage by 30‑60% and improves accuracy compared to JSON when feeding structured data to large language models, offering concrete syntax, benchmark results, code examples, and guidance on when to adopt it.

BenchmarkJSON alternativeLLM

0 likes · 10 min read

Why TOON Beats JSON for LLM Data Exchange: Token Savings and Accuracy Gains

BirdNest Tech Talk

Nov 18, 2025 · Industry Insights

A Practical Guide to Major LLM Services: URLs, Docs, and API Tips

This article compiles the entry points, documentation links, pricing details, and hands‑on API examples for several leading large‑language‑model providers—including DeepSeek, Alibaba Cloud, Baidu Qianfan, ByteDance Volcengine, OpenRouter, and Google Gemini—while comparing their usability, free‑tier offers, and developer experience.

APICloud AILLM

0 likes · 13 min read

A Practical Guide to Major LLM Services: URLs, Docs, and API Tips

Wu Shixiong's Large Model Academy

Nov 18, 2025 · Artificial Intelligence

How to Make LLM Agents’ Function Calls Stable and Accurate: 5 Proven Strategies

This article breaks down why function‑call reliability is the biggest bottleneck for LLM agents and presents a systematic five‑step loop—schema quality, prompt context, sampling, training data, and runtime defenses—plus concrete optimization techniques such as dynamic tool routing, plan‑execute, validation layers, memory injection, and log‑driven tuning, illustrated with real‑world cases.

LLMTool Routingagent

0 likes · 12 min read

How to Make LLM Agents’ Function Calls Stable and Accurate: 5 Proven Strategies

JakartaEE China Community

Nov 18, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

This article explains why Retrieval‑Augmented Generation improves LLM accuracy, outlines the key Langchain4j and Ollama3 components, and provides a step‑by‑step Java example—including Maven setup, document ingestion, embedding, similarity search, prompt creation, and response generation—to demonstrate a functional RAG pipeline.

EmbeddingLLMLangChain4j

0 likes · 8 min read

How to Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

Architect

Nov 17, 2025 · Artificial Intelligence

Comparing Tasking AI and Dify: Architecture, Core Capabilities, and AI Workflow Engines

This article examines the design of LLM‑native AI application platforms Tasking AI and Dify, comparing their LLM integration, plugin management, multi‑tenant isolation, system architecture, and especially Dify’s GraphEngine for complex AI workflow orchestration.

AI platformDifyGraphEngine

0 likes · 22 min read

Comparing Tasking AI and Dify: Architecture, Core Capabilities, and AI Workflow Engines

BirdNest Tech Talk

Nov 17, 2025 · Artificial Intelligence

How to Parse and Use Claude Skills with Go: A Deep Dive into LLM Tool Integration

This article explains the concept of Claude Skills, walks through a Go library that parses skill packages, demonstrates a CLI inspector, shows how to run skills with Deepseek‑v3 via an OpenAI‑compatible API, and outlines future security enhancements.

ClaudeDeepSeekGo

0 likes · 13 min read

How to Parse and Use Claude Skills with Go: A Deep Dive into LLM Tool Integration

Wu Shixiong's Large Model Academy

Nov 14, 2025 · Artificial Intelligence

How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework

This article explains why function‑call accuracy is critical for LLM agents, identifies four common failure causes, and presents a systematic, five‑step engineering framework—including dynamic routing, chain‑of‑thought planning, result validation, memory injection, and log‑driven optimization—backed by concrete examples and quantitative improvements.

Function CallingInterview PreparationLLM

0 likes · 10 min read

How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework

Programmer DD

Nov 14, 2025 · Artificial Intelligence

Can TOON Format Cut LLM Token Costs by Up to 60%?

This article explains how the TOON data‑serialization format reduces token usage and improves accuracy for large language model calls compared with traditional JSON, provides benchmark results, outlines scenarios where TOON is advantageous or unsuitable, and shows Java integration examples.

LLMTOONdata serialization

0 likes · 6 min read

Can TOON Format Cut LLM Token Costs by Up to 60%?

AI Tech Publishing

Nov 13, 2025 · Artificial Intelligence

Claude’s Prompt Engineering Best Practices: A Step‑by‑Step Guide

This guide outlines Claude team’s best practices for prompt engineering, covering core techniques such as clear instructions, background context, specificity, examples, and advanced methods like pre‑filled responses, chain‑of‑thought, output formatting, and prompt chaining, with concrete examples and code snippets.

AI promptingClaudeContext Engineering

0 likes · 18 min read

Claude’s Prompt Engineering Best Practices: A Step‑by‑Step Guide

Wu Shixiong's Large Model Academy

Nov 12, 2025 · Artificial Intelligence

Agent Memory Modules Explained: Short‑Term vs Long‑Term Strategies for LLM Agents

This article breaks down the memory systems behind LLM‑based agents, explaining why persistent memory is needed, the differences between short‑term context buffers and long‑term vector stores, practical implementation choices, maintenance strategies, and how to articulate these concepts effectively in technical interviews.

LLMagentretrieval

0 likes · 14 min read

Agent Memory Modules Explained: Short‑Term vs Long‑Term Strategies for LLM Agents

Alibaba Cloud Developer

Nov 12, 2025 · Artificial Intelligence

How Self‑Programming AI Agents Are Built: From LLM Brain to Dynamic Code Execution

This article explains how a self‑programming AI Agent is constructed by extending large language models as the brain, designing a multi‑area architecture, implementing memory layers, prompt engineering with segment mechanisms, and enabling code generation and execution through a Python‑Java bridge, while sharing practical insights and future directions.

AI AgentCode ExecutionLLM

0 likes · 34 min read

How Self‑Programming AI Agents Are Built: From LLM Brain to Dynamic Code Execution

HyperAI Super Neural

Nov 11, 2025 · Artificial Intelligence

How Deepseek-OCR Achieves SOTA Using Ultra‑Low Visual Token Counts

Deepseek-OCR leverages a visual‑compression approach, combining DeepEncoder and the DeepSeek3B‑MoE‑A570M decoder, to represent document text with far fewer visual tokens, achieving up to 97% OCR accuracy and surpassing GOT‑OCR2.0 and MinerU2.0 on OmniDocBench, while the article offers a one‑click deployment tutorial.

DeepEncoderLLMOCR

0 likes · 6 min read

How Deepseek-OCR Achieves SOTA Using Ultra‑Low Visual Token Counts

Code Mala Tang

Nov 11, 2025 · Artificial Intelligence

Unlock Structured Data from Any Text with LangExtract – A Free Python LLM Tool

LangExtract is an open‑source Python library that uses LLMs to turn messy documents—such as medical records, contracts, novels, or news articles—into structured data with just a few lines of code and optional visualisation.

Data ParsingLLMLangExtract

0 likes · 11 min read

Unlock Structured Data from Any Text with LangExtract – A Free Python LLM Tool

Mingyi World Elasticsearch

Nov 10, 2025 · Backend Development

Is Elasticsearch API Really That Hard? Explore the New 9.x Docs and Java Client

The article explains how the Elasticsearch 9.x documentation now provides language‑specific, strongly‑typed examples—especially a fluent Java client—that eliminate the manual cURL/JSON workflow, improve developer productivity, and serve as reliable references for large language models.

APICode ExamplesElasticsearch

0 likes · 9 min read

Is Elasticsearch API Really That Hard? Explore the New 9.x Docs and Java Client

Old Meng AI Explorer

Nov 10, 2025 · Mobile Development

How Cactus Turns Any Smartphone into a Powerful Offline AI Assistant

Cactus is a lightweight, open‑source mobile AI framework that runs large language models locally on iOS and Android without internet, offering chat, image recognition, and text‑to‑speech while consuming low resources, supporting older phones, and providing simple demo apps and Flutter integration for developers.

AIFlutterLLM

0 likes · 10 min read

How Cactus Turns Any Smartphone into a Powerful Offline AI Assistant

Data Party THU

Nov 9, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Effective RAG: Fixed, Recursive, Semantic, Structured, and Delayed

This article walks through the core RAG pipeline, explains why chunking is the linchpin of retrieval quality, and provides detailed definitions, trade‑offs, and implementation examples for five chunking techniques—fixed, recursive, semantic, structure‑aware, and delayed—so you can choose the right approach for any document‑heavy AI application.

AIChunkingLLM

0 likes · 10 min read

Mastering Chunking Strategies for Effective RAG: Fixed, Recursive, Semantic, Structured, and Delayed

DataFunSummit

Nov 8, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World AI Solutions with RAG and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while deep‑diving into the RAG, GraphRAG, and Agent technologies that enable smarter, more reliable AI applications.

AILLMRAG

0 likes · 4 min read

How Tencent’s LLM Powers Real‑World AI Solutions with RAG and Agents

AI Product Manager Community

Nov 8, 2025 · Artificial Intelligence

Why Prompt Engineering Fails: Embracing Context Engineering for Smarter LLMs

The article explains that prompt engineering alone cannot guarantee reliable AI responses because models lack situational awareness, and introduces context engineering as a systematic approach that structures memory, manages context flow, and integrates RAG and evaluation to make large language models truly useful in real‑world applications.

AIContext EngineeringLLM

0 likes · 7 min read

Why Prompt Engineering Fails: Embracing Context Engineering for Smarter LLMs

21CTO

Nov 7, 2025 · Artificial Intelligence

any-llm 1.0: Seamlessly Switch Between Cloud and Local LLMs with One Python Library

Mozilla.ai's any-llm v1.0 is an open‑source Python library that unifies access to multiple large language model providers, enabling developers to move between cloud‑based and on‑premise LLMs without rewriting code, while offering async‑first APIs, reusable connections, and extensive compatibility features.

AI developmentLLMPython

0 likes · 4 min read

any-llm 1.0: Seamlessly Switch Between Cloud and Local LLMs with One Python Library

DataFunSummit

Nov 7, 2025 · Artificial Intelligence

How Close Are Agents to AGI? Insights from Experiments and Benchmarks

Through a series of experiments, benchmark analyses, and theoretical discussions, this article explores the limits of current AI agents, their underlying mechanisms, performance gaps to human-level intelligence, and the challenges that remain on the path from agents to true AGI.

AGIBenchmarkLLM

0 likes · 26 min read

How Close Are Agents to AGI? Insights from Experiments and Benchmarks

DataFunSummit

Nov 7, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs

This article examines Tencent’s large language model deployments across content generation, intelligent customer service, and game role‑playing, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems—highlighting how they enhance performance, explainability, and multi‑step reasoning in real‑world business scenarios.

AILLMRAG

0 likes · 4 min read

How Tencent’s LLM Powers Content Creation, Smart Service, and Game NPCs

Ele.me Technology

Nov 7, 2025 · Artificial Intelligence

LLM‑SM Hybrid Strategies: Boosting Decision Optimization and Store Design

Recent advances in large language models (LLMs) have sparked interest in their decision‑making capabilities, yet challenges remain; this article explores classic prediction‑optimization pipelines, introduces emerging LLM‑as‑Predictor/Ranker/Optimizer paradigms, and details practical case studies on delivery‑price optimization and intelligent store‑decoration recommendation using LLM‑SM hybrid systems.

Decision OptimizationHybrid ModelingLLM

0 likes · 30 min read

LLM‑SM Hybrid Strategies: Boosting Decision Optimization and Store Design

JD Tech

Nov 6, 2025 · Artificial Intelligence

LLMs Revolutionize Recommendation Systems: From Generative Models to Production

This article surveys the evolution of generative recommendation systems powered by large language models, detailing their technical foundations, engineering challenges, recent breakthroughs, and future research directions, while highlighting why the paradigm shift is occurring now.

AI EngineeringGenerative RecommendationLLM

0 likes · 30 min read

LLMs Revolutionize Recommendation Systems: From Generative Models to Production

Tencent Cloud Developer

Nov 6, 2025 · Artificial Intelligence

From Prompt to Multi‑Agent: How LLMs Evolve into Autonomous Agents

Since ChatGPT's debut, the LLM landscape has progressed through four stages—prompt engineering, chain orchestration, autonomous agents, and multi‑agent systems—each enhancing intelligence and automation, with this article detailing their evolution, advantages, drawbacks, and practical implementation examples in Go.

GoLLMagent

0 likes · 24 min read

From Prompt to Multi‑Agent: How LLMs Evolve into Autonomous Agents

AI Tech Publishing

Nov 5, 2025 · Artificial Intelligence

Why AI Agents Should Be Positioned as Assistants, Not Replacements

The article explains that marketing AI agents as human replacements leads to poor performance, professional resistance, and hallucination risks, and argues that repositioning them as assistants with human‑in‑the‑loop verification improves efficiency and acceptance.

AI AgentBI EngineerData Agent

0 likes · 3 min read

Why AI Agents Should Be Positioned as Assistants, Not Replacements

Kuaishou Tech

Nov 5, 2025 · Artificial Intelligence

How HiPO Gives LLMs a Smart Thinking Switch to Cut Costs and Boost Accuracy

This article explains the overthinking problem of large language models, introduces the HiPO framework with hybrid data cold‑start and reinforcement‑learning reward mechanisms that let models decide when to think deeply or answer directly, and shows experimental results demonstrating significant efficiency gains and accuracy improvements across multiple benchmarks.

EfficiencyHybrid Policy OptimizationLLM

0 likes · 13 min read

How HiPO Gives LLMs a Smart Thinking Switch to Cut Costs and Boost Accuracy

Data Party THU

Nov 5, 2025 · Artificial Intelligence

How to Give LLM Agents Memory, Reflection, and Goal Tracking

This article explains why current LLM agents lose context after each conversation and presents a practical architecture—using SQLite for structured storage, a vector database for semantic retrieval, and LLM‑driven reflection—to add persistent memory, self‑evaluation, and goal‑tracking capabilities that turn agents into learning partners.

Goal TrackingLLMMemory

0 likes · 10 min read

How to Give LLM Agents Memory, Reflection, and Goal Tracking

Code Mala Tang

Nov 5, 2025 · Backend Development

How to Build a Production-Ready Async LLM API with FastAPI

Learn how to design and deploy a high‑performance, production‑grade LLM API using FastAPI, covering async routing, type‑safe Pydantic models, streaming via SSE/WebSockets, middleware, caching, rate limiting, observability, retries, and cost‑control strategies for robust AI services.

AsyncFastAPILLM

0 likes · 12 min read

How to Build a Production-Ready Async LLM API with FastAPI

Wu Shixiong's Large Model Academy

Nov 5, 2025 · Artificial Intelligence

Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo

Building a Retrieval‑Augmented Generation (RAG) system may be straightforward in code, but making it reliable, accurate, and scalable in production involves challenges across data preparation, vector retrieval, query rewriting, generation control, and system integration, turning a demo into a truly useful AI service.

AILLMPrompt Engineering

0 likes · 8 min read

Why Production-Ready RAG Is Ten Times Harder Than a Simple Demo

JavaGuide

Nov 5, 2025 · Artificial Intelligence

Cursor Goes Beyond the IDE with Agent Mode and Its Own Composer LLM

Cursor, once hailed as the leading AI‑enhanced IDE, has shifted its focus by making Agent mode the default and launching its own large‑model Composer, which the vendor claims runs four times faster than comparable models, though real‑world performance remains to be validated.

AI IDEClaudeCodex

0 likes · 4 min read

Cursor Goes Beyond the IDE with Agent Mode and Its Own Composer LLM

AI2ML AI to Machine Learning

Nov 4, 2025 · Artificial Intelligence

Common Debugging Signals for Large Language Models

This article outlines the end‑to‑end workflow for large‑model training, highlights typical debugging challenges such as memory OOM, performance bottlenecks, and gradient issues, and provides concrete strategies, tools (DeepSpeed, Megatron, Torchtitan, veScale) and best‑practice checklists to help engineers diagnose and resolve problems efficiently.

DebuggingDeepSpeedLLM

0 likes · 12 min read

Common Debugging Signals for Large Language Models

DataFunTalk

Nov 4, 2025 · Artificial Intelligence

Can LLMs Trade Crypto Profitably? Inside the Alpha Arena Competition

Alpha Arena’s first season pitted six leading large language models against real crypto markets with $10,000 each, revealing stark differences in trading bias, risk management, and sensitivity to prompts, as Qwen3‑Max and DeepSeek outperformed GPT‑5, while detailed case studies expose model vulnerabilities and future research directions.

AI AgentsAlpha ArenaLLM

0 likes · 12 min read

Can LLMs Trade Crypto Profitably? Inside the Alpha Arena Competition

Data STUDIO

Nov 4, 2025 · Artificial Intelligence

How to Build a Memory-Enabled AI Agent with SQLite and Vector Search

This article explains how to give AI agents persistent memory, reflection, and goal‑tracking by storing interaction summaries in SQLite, embedding them for semantic retrieval with a vector database, and using LLM‑generated prompts to recall, reflect, and manage objectives across sessions.

AI AgentGoal TrackingLLM

0 likes · 10 min read

How to Build a Memory-Enabled AI Agent with SQLite and Vector Search

dbaplus Community

Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data PlatformLLMRAG

0 likes · 17 min read

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

DataFunSummit

Nov 3, 2025 · Artificial Intelligence

How Tencent’s LLM Powers Real‑World AI: From RAG to Agents

This article examines Tencent's large language model applications across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, and explains the three key technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agents—that enable these capabilities.

AI applicationsLLMRAG

0 likes · 4 min read

How Tencent’s LLM Powers Real‑World AI: From RAG to Agents

Meituan Technology Team

Nov 3, 2025 · Artificial Intelligence

Introducing VitaBench: A Real-World Agent Benchmark That Reveals a 30% Success Gap

VitaBench, a new open‑source benchmark from Meituan’s LongCat team, evaluates LLM‑driven agents across three realistic life‑service scenarios—food ordering, restaurant dining, and travel planning—using 66 tools and quantifying reasoning, tool, and interaction complexities, exposing a mere 30% success rate on complex cross‑scene tasks.

AIBenchmarkInteraction

0 likes · 14 min read

Introducing VitaBench: A Real-World Agent Benchmark That Reveals a 30% Success Gap