Tagged articles
2074 articles
Page 5 of 21
ArcThink
ArcThink
Apr 2, 2026 · Artificial Intelligence

Why LLMs Forget You: Uncovering the Limits and Solutions for Long‑Term Memory

The article explains why large language models lack persistent memory due to the stateless Transformer architecture, breaks down the four dimensions of memory loss, surveys seven technical approaches, three product implementations, and emerging research, and discusses security and privacy implications.

AILLMLong-term Memory
0 likes · 22 min read
Why LLMs Forget You: Uncovering the Limits and Solutions for Long‑Term Memory
AI Step-by-Step
AI Step-by-Step
Apr 1, 2026 · Artificial Intelligence

When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth

The article explains why routing every request to the most powerful LLM hurts cost, speed, and throughput, and presents a three‑layer task decomposition that assigns execution‑level tasks to cheap small models, intermediate tasks to mid‑size models, and high‑risk judgment tasks to large models, with concrete examples and a minimal routing strategy.

Agent DesignLLMModel routing
0 likes · 8 min read
When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth
DaTaobao Tech
DaTaobao Tech
Apr 1, 2026 · Industry Insights

How AI Turned Taobao’s Marketing Venue Testing from Manual to Intelligent Automation

This article details the AI-driven testing platform built for Taobao’s marketing venues, describing how large‑language models and multimodal agents enable visual rendering verification, price and content consistency checks, and automated multi‑device adaptation, resulting in a 40% overall efficiency boost and a 100% increase in tester productivity.

AILLMMultimodal Agent
0 likes · 12 min read
How AI Turned Taobao’s Marketing Venue Testing from Manual to Intelligent Automation
PaperAgent
PaperAgent
Apr 1, 2026 · Artificial Intelligence

How Meta‑Harness Revolutionizes LLM Harness Optimization with 10× Search Speed

Meta‑Harness introduces an external‑loop optimization framework that lets coding agents automatically search and improve large‑language‑model harnesses, achieving up to ten‑fold faster search, ten‑times token efficiency, and significant performance gains across text classification, math reasoning, and agentic coding tasks.

LLMMeta-HarnessRetrieval-Augmented Math
0 likes · 11 min read
How Meta‑Harness Revolutionizes LLM Harness Optimization with 10× Search Speed
Data STUDIO
Data STUDIO
Apr 1, 2026 · Artificial Intelligence

Blackboard System: Enabling Dynamic Collaboration Among Expert AI Agents

This article compares a rigid sequential multi‑agent pipeline with a flexible blackboard architecture, showing how shared memory and a dynamic controller let specialist AI agents cooperate opportunistically, obey conditional user instructions, and achieve higher efficiency and instruction‑following scores.

Blackboard SystemDynamic SchedulingLLM
0 likes · 21 min read
Blackboard System: Enabling Dynamic Collaboration Among Expert AI Agents
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 1, 2026 · Artificial Intelligence

How to Design an Effective Agent Memory System for Enterprise AI Assistants

This article explains why AI agents need a structured memory module, outlines three memory types from cognitive science, details short‑term and long‑term storage architectures using vector databases, and provides concrete code and management strategies—including conflict resolution, TTL expiration, and privacy compliance—to build a robust Agent Memory system.

Agent MemoryLLMMem0
0 likes · 23 min read
How to Design an Effective Agent Memory System for Enterprise AI Assistants
Radish, Keep Going!
Radish, Keep Going!
Mar 31, 2026 · Artificial Intelligence

Why Agent‑First Systems Fail and How Harness Engineering Fixes Them

The article analyzes OpenAI’s Harness Engineering approach, explains four systemic failure modes of LLM‑driven agents, and details five modular components—readable environment, task state machine, verification loop, architectural constraints, and loop detection—that together enable reliable, large‑scale agent development.

AIAutomationHarness
0 likes · 17 min read
Why Agent‑First Systems Fail and How Harness Engineering Fixes Them
Senior Tony
Senior Tony
Mar 31, 2026 · Artificial Intelligence

Build and Debug LangGraph Workflows with Alibaba Qwen in Minutes

This article walks through creating a LangGraph workflow in Python, first using OpenAI’s GPT‑5‑nano model, then swapping to Alibaba’s Qwen 3.5‑plus model, showing how to suppress warnings, filter out thinking responses, visualize the graph, and troubleshoot common errors, all without any prior AI coding experience.

AI workflowAlibaba QwenLLM
0 likes · 8 min read
Build and Debug LangGraph Workflows with Alibaba Qwen in Minutes
Qborfy AI
Qborfy AI
Mar 31, 2026 · Artificial Intelligence

Mastering AI Agents with the Plan-and-Solve Design Pattern

The article introduces the Plan-and-Solve design pattern for AI agents, explaining how separating planning and execution improves handling of complex tasks, compares it with ReAct, provides detailed workflow diagrams, concrete examples such as weekly report generation, and offers a full Python implementation with dynamic replanning and result aggregation.

AI agentsAgent DesignLLM
0 likes · 14 min read
Mastering AI Agents with the Plan-and-Solve Design Pattern
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 31, 2026 · Artificial Intelligence

How to Build a Production‑Ready AI Memory System with Mem0 and Elasticsearch

This guide explains how to overcome the stateless nature of large language models by using the Mem0 framework together with Elasticsearch to create a persistent, vector‑searchable memory layer, covering architecture, real‑world scenarios, step‑by‑step deployment, and integration with the OpenClaw agent framework.

AI memoryElasticsearchLLM
0 likes · 15 min read
How to Build a Production‑Ready AI Memory System with Mem0 and Elasticsearch
Woodpecker Software Testing
Woodpecker Software Testing
Mar 31, 2026 · Industry Insights

2026 AI Agent Testing Trends Every Test Expert Must Know

The article outlines how software testing is shifting from functional correctness to trustworthy behavior verification for AI agents in 2026, detailing a three‑dimensional trust matrix, agent‑native CI pipelines, human‑AI collaborative testing, and compliance‑driven auditable agents with concrete industry examples and metrics.

AI ComplianceAI testingLLM
0 likes · 9 min read
2026 AI Agent Testing Trends Every Test Expert Must Know
Data STUDIO
Data STUDIO
Mar 31, 2026 · Artificial Intelligence

Agent Architecture: Planner → Executor → Verifier – Adding a “Quality Inspector” to Your AI

This article introduces the PEV (Planner‑Executor‑Verifier) architecture, explains why AI agents need a verification step to avoid blindly trusting faulty tool outputs, demonstrates a full implementation with LangGraph, compares its robustness to a naïve baseline, and discusses its advantages, limitations, and suitable use cases.

AI agentsLLMLangGraph
0 likes · 23 min read
Agent Architecture: Planner → Executor → Verifier – Adding a “Quality Inspector” to Your AI
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 31, 2026 · Information Security

Securing LLM Code Interpreter: Sandbox Strategies and Real‑World Pitfalls

This article examines why RAG systems need a Code Interpreter, explains the dangers of executing LLM‑generated code with exec(), and presents three sandbox designs—restricted exec, Docker containers, and E2B cloud sandboxes—along with whitelist/blacklist rules, an eight‑step execution flow, and practical lessons learned from production deployment.

Code InterpreterLLMPython
0 likes · 26 min read
Securing LLM Code Interpreter: Sandbox Strategies and Real‑World Pitfalls
AI Tech Publishing
AI Tech Publishing
Mar 31, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Building Your First AI Agent from Scratch (Full Code Included)

This comprehensive guide walks you through the fundamentals of AI agents, explains the core agent loop, compares workflow patterns with autonomous agents, and provides a practical five‑step process—including tool design, memory handling, testing, and multi‑agent collaboration—complete with real code examples for Anthropic and OpenAI SDKs.

AI AgentLLMMemory
0 likes · 22 min read
Step‑by‑Step Guide to Building Your First AI Agent from Scratch (Full Code Included)
Black & White Path
Black & White Path
Mar 30, 2026 · Information Security

OWASP Top 10 Risks for LLMs Every AI Security Beginner Must Know

The article outlines the OWASP Top 10 threats for large language model applications—including prompt injection, data leakage, supply‑chain attacks, model poisoning, improper output handling, excessive agency, system prompt leakage, vector embedding weaknesses, misinformation, and unbounded consumption—plus three essential mitigation rules for newcomers.

AI securityLLMOWASP
0 likes · 6 min read
OWASP Top 10 Risks for LLMs Every AI Security Beginner Must Know
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 30, 2026 · Operations

Mastering RAG Post‑Launch: A Closed‑Loop Badcase Management Blueprint

This article explains how to establish a six‑step closed‑loop workflow for operating RAG‑based question‑answer systems in insurance, covering badcase collection via three channels, four‑type classification, automated scripts, regression testing, gray‑scale rollout, and real‑world metrics that boosted answer accuracy from 76 % to 89 %.

Badcase ManagementInsurance AILLM
0 likes · 20 min read
Mastering RAG Post‑Launch: A Closed‑Loop Badcase Management Blueprint
Su San Talks Tech
Su San Talks Tech
Mar 30, 2026 · Artificial Intelligence

Mastering LLM Function Calling: Theory, Workflow, and Hands‑On Code

This article explains the fundamentals of large‑model function calling, why it’s needed to bridge language models with real‑world tools, and provides a step‑by‑step implementation in Python—including tool definition, intent extraction, local execution, and result integration—complete with code samples and diagrams.

AI AgentAPIFunction Calling
0 likes · 11 min read
Mastering LLM Function Calling: Theory, Workflow, and Hands‑On Code
Qborfy AI
Qborfy AI
Mar 29, 2026 · Artificial Intelligence

Mastering AI Agent Reflection: The Generate‑Reflect‑Refine Loop

This article explains the Reflection design pattern for AI agents, detailing how a three‑step generate‑reflect‑refine cycle can iteratively improve outputs, provides both a simple two‑call implementation and a structured class‑based version, and shares practical tips, benchmarks, and references to the original research.

AI agentsLLMPrompt Engineering
0 likes · 9 min read
Mastering AI Agent Reflection: The Generate‑Reflect‑Refine Loop
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 29, 2026 · Artificial Intelligence

Mastering RAG Prompt Engineering: Prevent Hallucinations and Boost Accuracy

This article dissects the unique challenges of RAG prompting, presents a systematic System/User Prompt design with strong constraints and citation requirements, compares constraint strengths with quantitative hallucination rates, and offers long‑context compression strategies and rigorous testing methods to ensure reliable LLM answers.

Context CompressionLLMRAG
0 likes · 19 min read
Mastering RAG Prompt Engineering: Prevent Hallucinations and Boost Accuracy
Java One
Java One
Mar 28, 2026 · Artificial Intelligence

Building a Vector‑Free RAG System with Hierarchical Page Indexing

This guide explains how to create a retrieval‑augmented generation (RAG) system that avoids embeddings by converting documents into a hierarchical tree, using an LLM to navigate, summarize, and retrieve answers, complete with a full Python implementation and a GitHub repository.

Hierarchical IndexingLLMPython
0 likes · 15 min read
Building a Vector‑Free RAG System with Hierarchical Page Indexing
AI Algorithm Path
AI Algorithm Path
Mar 28, 2026 · Artificial Intelligence

A Practical Guide to Building Agent Skills for Large Language Models

This guide explains the concept of LLM "Skills", shows how to organize skill directories for Claude and Copilot, walks through creating a "prepare‑pr" skill with a SKILL.md file, integrates Bash scripts for git checks, and demonstrates testing and extending the skill with additional checks and templates.

Agent SkillsBash scriptClaude
0 likes · 12 min read
A Practical Guide to Building Agent Skills for Large Language Models
AI Large-Model Wave and Transformation Guide
AI Large-Model Wave and Transformation Guide
Mar 28, 2026 · Artificial Intelligence

How to Ace LLM Interview Questions: Deep Dive into Pre‑training, SFT, DPO & RLHF

This guide breaks down the four major large‑model training paradigms—pre‑training, supervised fine‑tuning, preference alignment, and RLHF—explaining which parameters are updated, how attention is reshaped, and what capabilities are gained, so you can deliver a structured, interview‑ready answer.

AI InterviewLLMLarge Language Models
0 likes · 8 min read
How to Ace LLM Interview Questions: Deep Dive into Pre‑training, SFT, DPO & RLHF
AI Large-Model Wave and Transformation Guide
AI Large-Model Wave and Transformation Guide
Mar 28, 2026 · Artificial Intelligence

From RNNs to Multimodal Agents: A Decade of Transformer Evolution

This article traces the evolution of sequence models from early RNN/LSTM designs through the breakthrough Transformer, its major branches, dense scaling, efficiency‑focused variants, next‑generation linear‑complexity SSMs, and finally multimodal agent architectures, highlighting each stage's strengths, weaknesses, and typical use cases.

AI ArchitectureEfficient AttentionLLM
0 likes · 12 min read
From RNNs to Multimodal Agents: A Decade of Transformer Evolution
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 28, 2026 · Artificial Intelligence

Mastering Multi‑Agent Systems: Design, Parallel Execution, and Interview Strategies

This article dissects the shortcomings of single‑agent LLM pipelines, introduces the Supervisor‑based Multi‑Agent architecture with LangGraph, demonstrates parallel task execution, robust error handling, and result merging, and provides concrete interview guidance backed by real performance data.

AI ArchitectureError HandlingLLM
0 likes · 19 min read
Mastering Multi‑Agent Systems: Design, Parallel Execution, and Interview Strategies
Code Mala Tang
Code Mala Tang
Mar 28, 2026 · Artificial Intelligence

Can Claude Translate the Linux Kernel to Rust? Insights, Experiments, and Costs

This article evaluates Claude's ability to translate isolated Linux kernel modules from C to Rust, presenting a detailed analysis of translation granularity, token costs, experimental results on drivers, networking, and file‑system modules, and discussing the technical and economic challenges of a full kernel rewrite.

AILLMRust
0 likes · 19 min read
Can Claude Translate the Linux Kernel to Rust? Insights, Experiments, and Costs
AI Tech Publishing
AI Tech Publishing
Mar 28, 2026 · Artificial Intelligence

Designing Agent Memory Systems: Four Types, Three Strategies, and Full Python Implementation

This article breaks down agentic memory into four distinct types—In‑context, External, Episodic, and Semantic/Parametric—explains three forgetting strategies (time decay, importance scoring, periodic consolidation), shows how memory flows through an agent loop, and provides complete Python code using OpenAI embeddings and ChromaDB for a production‑ready memory layer.

Agent MemoryChromaDBLLM
0 likes · 22 min read
Designing Agent Memory Systems: Four Types, Three Strategies, and Full Python Implementation
AI Insight Log
AI Insight Log
Mar 28, 2026 · Artificial Intelligence

Anthropic’s Leaked Mythos Model Claims to Outperform Opus 4.6 – Why Release Is Delayed

A leaked internal Anthropic blog reveals the upcoming Claude Mythos (codenamed Capybara) model, touted as a step‑change over Opus 4.6 in programming, academic reasoning, and cybersecurity, while highlighting unprecedented security risks, early access for security professionals, and high compute costs that postpone a full launch.

AI safetyAnthropicClaude Mythos
0 likes · 5 min read
Anthropic’s Leaked Mythos Model Claims to Outperform Opus 4.6 – Why Release Is Delayed
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Mar 27, 2026 · Artificial Intelligence

Weekly Quantitative Finance Paper Roundup (Mar 21‑27, 2026)

This article presents concise English summaries of four recent AI‑driven quantitative finance papers, covering an agentic AI screening platform for portfolio investment, a wavelet‑based physics‑informed neural network for option pricing, the FinRL‑X modular trading infrastructure, and the S³G stock state‑space graph for enhanced trend prediction, each with authors, links, and key experimental results.

AILLMModular Trading Infrastructure
0 likes · 12 min read
Weekly Quantitative Finance Paper Roundup (Mar 21‑27, 2026)
DataFunTalk
DataFunTalk
Mar 27, 2026 · Artificial Intelligence

Building a Production‑Ready RAG Engine: Architecture, Challenges & Solutions

This article examines the practical challenges of deploying Retrieval‑Augmented Generation in enterprise settings, outlines a layered RAG architecture with offline document processing and online query handling, and details the hybrid retrieval, multi‑stage ranking, knowledge filtering, and generation techniques that improve accuracy and reduce hallucinations.

AI EngineeringHybrid RetrievalKnowledge Filtering
0 likes · 22 min read
Building a Production‑Ready RAG Engine: Architecture, Challenges & Solutions
Qborfy AI
Qborfy AI
Mar 26, 2026 · Artificial Intelligence

Mastering ReAct: Turn LLMs into Thoughtful, Actionable AI Agents

This article explains the ReAct (Reasoning + Acting) design pattern for large language model agents, detailing its thought‑action‑observation loop, concrete examples, prompt engineering tips, full Python implementations, common pitfalls, and references to the original Google research.

AI agentsLLMOpenAI
0 likes · 11 min read
Mastering ReAct: Turn LLMs into Thoughtful, Actionable AI Agents
AI Explorer
AI Explorer
Mar 26, 2026 · Artificial Intelligence

Reinventing Financial Trading with a Multi‑Agent LLM Framework

TradingAgents introduces a multi‑agent architecture that lets specialized LLM experts—researchers, analysts, traders and risk managers—collaborate to analyse markets, manage risk and execute trades, offering a new AI‑driven collaboration paradigm for quantitative finance while providing explainable decisions and enterprise‑grade stability.

AI CollaborationFinancial AILLM
0 likes · 6 min read
Reinventing Financial Trading with a Multi‑Agent LLM Framework
AI Programming Lab
AI Programming Lab
Mar 26, 2026 · Artificial Intelligence

LLMs to the Left, Harness Engineering to the Right: Bridging the Gap

The article argues that the real bottleneck for LLM‑driven agents is not model capability but the surrounding control system—Harness Engineering—which can dramatically boost success rates, reduce failure cascades, and become the lasting moat for AI productivity.

AI OpsAgent HarnessContext Engineering
0 likes · 14 min read
LLMs to the Left, Harness Engineering to the Right: Bridging the Gap
Data STUDIO
Data STUDIO
Mar 26, 2026 · Artificial Intelligence

Metacognitive Agents: Teaching AI to Self‑Assess Before Answering

The article introduces metacognitive agents that equip AI with a self‑model to evaluate confidence, domain relevance, tool availability, and risk before acting, demonstrating a LangGraph‑based medical triage assistant with code, workflow, safety advantages, and practical test results.

AI safetyLLMLangGraph
0 likes · 22 min read
Metacognitive Agents: Teaching AI to Self‑Assess Before Answering
Architecture Musings
Architecture Musings
Mar 25, 2026 · Information Security

Seeing AI Agent Drift in Vector Space: An Unvalidated Thought Experiment

The article imagines an AI coding agent that silently exfiltrates credentials hidden in data, explains why rule‑based and text‑level defenses miss such attacks, proposes monitoring the agent's vector‑space decision trajectory with six geometric metrics, and critically evaluates the feasibility and limitations of this approach.

AI agentsAnomaly DetectionLLM
0 likes · 23 min read
Seeing AI Agent Drift in Vector Space: An Unvalidated Thought Experiment
SuanNi
SuanNi
Mar 25, 2026 · Artificial Intelligence

How to Evaluate, Optimize, and Secure Retrieval‑Augmented Generation (RAG) Pipelines

This article explains the evaluation pillar of context engineering, introduces the three core RAG metrics (context relevance, faithfulness, answer relevance), details the RAGAS automated assessment framework, shows how to build evaluation datasets, adopt evaluation‑driven development, and protect RAG systems from prompt injection and data leakage.

LLMRAGRAGAS
0 likes · 13 min read
How to Evaluate, Optimize, and Secure Retrieval‑Augmented Generation (RAG) Pipelines
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Mar 25, 2026 · Artificial Intelligence

Understanding Tool Use in LLMs: How Models Leverage Tool Calls

This article explains why large language models need tool use, defines the concepts of Tool Use, Tool Call, and Function Calling, compares them, walks through a complete tool‑use workflow, and discusses architectural, safety, and design considerations for building reliable LLM agents.

AgentLLMPrompt Engineering
0 likes · 17 min read
Understanding Tool Use in LLMs: How Models Leverage Tool Calls
AI Engineering
AI Engineering
Mar 25, 2026 · Artificial Intelligence

Is “Harness Engineering” Just Rebranded Engineering Common Sense?

The article examines the hype around “harness engineering” in LLM workflows, showing through SGLang’s multi‑agent experience that the approach merely repackages established software‑engineering principles such as separation of concerns, docs‑as‑code, and structured routing, and discusses its limits and future implications.

Harness EngineeringLLMSGLang
0 likes · 8 min read
Is “Harness Engineering” Just Rebranded Engineering Common Sense?
Architect's Journey
Architect's Journey
Mar 25, 2026 · Artificial Intelligence

Why SKILL Makes AI Development Surprisingly Simple

The article introduces the SKILL framework, explains its file‑based structure and LLM‑driven entry point, compares it with traditional API‑centric backends, outlines its suitable use cases and limitations, and argues that mastering SKILL will become a core productivity skill for developers.

AI EngineeringLLMSKILL framework
0 likes · 8 min read
Why SKILL Makes AI Development Surprisingly Simple
Java Architecture Diary
Java Architecture Diary
Mar 25, 2026 · Artificial Intelligence

Building Java AI Agents with Koog: A Hands‑On Guide to Native Java Agent APIs

JetBrains' newly released Koog for Java provides a native Java AI Agent framework that lets developers annotate methods as tools, assemble agents with a Builder‑style API, and let large language models orchestrate multi‑step tasks without writing explicit control flow, as demonstrated with banking and e‑commerce examples.

AI AgentBuilder APIJava
0 likes · 9 min read
Building Java AI Agents with Koog: A Hands‑On Guide to Native Java Agent APIs
AI Explorer
AI Explorer
Mar 24, 2026 · Artificial Intelligence

Revolutionizing Financial Trading with a Multi‑Agent AI Framework

TradingAgents is an open‑source Python framework that uses multiple specialized LLM agents—Analyst, Researcher, Trader, and Risk Manager—to mimic a real investment bank’s workflow, offering a more robust and explainable approach to quantitative trading and financial research.

Financial AILLMOpen Source
0 likes · 6 min read
Revolutionizing Financial Trading with a Multi‑Agent AI Framework
Tencent Tech
Tencent Tech
Mar 24, 2026 · Artificial Intelligence

Unlocking AI Power: How Skill Packages Transform Large Language Models

This article provides a comprehensive technical guide to Skill packages—standardized knowledge containers that give large language models expert-level execution capabilities—covering their definition, architecture, integration with the Model Context Protocol (MCP), creation workflow, best‑practice tips, collaborative patterns, debugging strategies, philosophical implications, and future directions.

AI toolingLLMMCP
0 likes · 18 min read
Unlocking AI Power: How Skill Packages Transform Large Language Models
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 24, 2026 · Artificial Intelligence

How Hologres + Mem0 Deliver Low‑Cost, High‑Performance Long‑Memory for LLMs

This article explains how the combination of Hologres, a unified real‑time data warehouse, and Mem0, an open‑source LLM memory framework, overcomes the limited context window of large language models by providing scalable, low‑latency, and cost‑effective long‑term memory for AI applications.

AI infrastructureHologresLLM
0 likes · 11 min read
How Hologres + Mem0 Deliver Low‑Cost, High‑Performance Long‑Memory for LLMs
SuanNi
SuanNi
Mar 24, 2026 · Artificial Intelligence

How Compression, Orchestration, and LangGraph Are Redefining LLM Context Engineering

This article analyzes the six pillars of context engineering for large language models, focusing on compression techniques, extractive vs. abstractive methods, the LLMLingua toolkit, dynamic orchestration with routing and agentic RAG, and how LangGraph enables sophisticated agent‑driven workflows.

Agentic RAGContext CompressionLLM
0 likes · 14 min read
How Compression, Orchestration, and LangGraph Are Redefining LLM Context Engineering
AgentGuide
AgentGuide
Mar 24, 2026 · Artificial Intelligence

What I Learned Moving from Backend Engineering to AI Agent Development

The author, a former backend engineer turned AI Agent developer, explains how LLM uncertainty, context engineering, shifting code responsibilities, workflow standards, new failure modes, and the ReAct paradigm shape modern Agent development, and outlines tasks best suited—or unsuited—for LLMs.

AI AgentContext EngineeringLLM
0 likes · 6 min read
What I Learned Moving from Backend Engineering to AI Agent Development
DataFunTalk
DataFunTalk
Mar 24, 2026 · Artificial Intelligence

Memory‑Based Self‑Evolution: Redefining LLM Agents Beyond Parameter Updates

This article examines the limitations of traditional supervised fine‑tuning and reinforcement learning for LLM agents, introduces a memory‑based self‑evolution paradigm with technologies such as Dynamic Cheatsheet, ReasoningBank, ACE and MemGen, and shows how building an experience bank can turn static models into continuously learning agents, especially in the insurance sector.

Insurance AILLMknowledge flywheel
0 likes · 13 min read
Memory‑Based Self‑Evolution: Redefining LLM Agents Beyond Parameter Updates
Data STUDIO
Data STUDIO
Mar 24, 2026 · Artificial Intelligence

Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes

This article explains why large language models alone can hallucinate, introduces the tool‑using agent architecture, and provides a step‑by‑step Python tutorial using LangChain, LangGraph, and Tavily to create, run, and evaluate a real‑time web‑search capable AI assistant.

AgentLLMLangChain
0 likes · 16 min read
Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes
SuanNi
SuanNi
Mar 24, 2026 · Artificial Intelligence

How Memento‑Skills Enables Self‑Evolving LLMs Without Fine‑Tuning

Introducing Memento‑Skills, a novel framework that freezes LLM parameters while an external skill library iteratively reads, writes, and refines capabilities, achieving up to 116% accuracy gains on GAIA and HLE benchmarks and demonstrating scalable self‑evolution without costly model fine‑tuning.

LLMreinforcement learningself-evolution
0 likes · 11 min read
How Memento‑Skills Enables Self‑Evolving LLMs Without Fine‑Tuning
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 24, 2026 · Artificial Intelligence

A Comprehensive Guide to Major Attention Mechanisms: From MHA and GQA to MLA, Sparse and Hybrid Architectures

This article reviews and compares the most important attention variants used in modern large language models—including multi‑head attention, grouped‑query attention, multi‑head latent attention, sparse and sliding‑window attention, gated attention, and hybrid designs—detailing their motivations, memory trade‑offs, example architectures, and experimental findings.

Hybrid ArchitectureLLMMHA
0 likes · 29 min read
A Comprehensive Guide to Major Attention Mechanisms: From MHA and GQA to MLA, Sparse and Hybrid Architectures
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 24, 2026 · Artificial Intelligence

Why LLMs Behave Unpredictably: From Uncertainty to Practical Agent Design

This article analyzes the sources of LLM output uncertainty, explores hardware and architectural constraints, demonstrates how to build robust AI agents with prompt engineering, tool orchestration, and memory management, and compares traditional micro‑service design with modern LLM‑centric workflows.

AI AgentHardwareLLM
0 likes · 64 min read
Why LLMs Behave Unpredictably: From Uncertainty to Practical Agent Design
Data Party THU
Data Party THU
Mar 23, 2026 · Artificial Intelligence

Boosting RAG Performance: Query Translation & Decomposition Techniques

The article explains two emerging RAG query‑optimization approaches—query translation and query decomposition—detailing fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and shows how combining them can improve relevance and latency in LLM‑augmented systems.

LLMRAGRetrieval-Augmented Generation
0 likes · 9 min read
Boosting RAG Performance: Query Translation & Decomposition Techniques
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Mar 23, 2026 · Artificial Intelligence

What Exactly Is a Token in LLMs? A First‑Principles Explanation

The article explains that a token is the smallest discrete text unit a large language model processes, detailing why tokenization is essential, how tokenizers work, how tokens flow through the transformer, and how token counts affect context windows, cost, latency, and overall model behavior.

Cost ManagementEmbeddingLLM
0 likes · 20 min read
What Exactly Is a Token in LLMs? A First‑Principles Explanation
SuanNi
SuanNi
Mar 23, 2026 · Artificial Intelligence

Can LLMs Predict Real‑World War Outcomes? A Deep Dive into the 2026 Middle East Conflict

A research team from MBZUAI and the University of Maryland constructed an 11‑point timeline of the 2026 Middle East escalation, fed contemporaneous news to leading large language models, and evaluated their strategic reasoning, economic impact forecasts, and political signal interpretation, revealing both strengths and limitations of AI under extreme uncertainty.

AI evaluationArtificial IntelligenceGeopolitics
0 likes · 12 min read
Can LLMs Predict Real‑World War Outcomes? A Deep Dive into the 2026 Middle East Conflict
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Mar 23, 2026 · Artificial Intelligence

How Agent Skills Solve LLM Development Pain Points and Gain Standard Status

The article analyses the emergence of Agent Skills as an open LLM standard, explains the technical shortcomings of current prompt‑centric workflows, describes the three‑layer skill architecture and its benefits for reuse, versioning and organization‑wide deployment, and discusses current limitations and future evolution paths.

AI StandardsAgent SkillsLLM
0 likes · 29 min read
How Agent Skills Solve LLM Development Pain Points and Gain Standard Status
Su San Talks Tech
Su San Talks Tech
Mar 23, 2026 · Artificial Intelligence

How OpenClaw Turns AI Agents into Real‑World Automation Tools

OpenClaw is an AI Agent framework that bridges chat platforms and large language models, enabling automated tasks through context‑engineered prompts, tool usage, memory management, sub‑agents, and security controls, while illustrating practical examples, workflow steps, and mitigation strategies for potential shell‑command exploits.

AI AgentLLMOpenClaw
0 likes · 18 min read
How OpenClaw Turns AI Agents into Real‑World Automation Tools
Architect
Architect
Mar 22, 2026 · Artificial Intelligence

Can Frozen LLMs Keep Learning? Inside Memento‑Skills' Deployment‑Time Learning

The article analyses the Memento‑Skills paper and its open‑source implementation, showing how a frozen large language model can continuously improve by treating skills as external memory, using a five‑step Observe‑Read‑Act‑Feedback‑Write loop, advanced routing, and modular architecture to achieve significant gains on GAIA and HLE benchmarks.

AI ArchitectureAgentDeployment-Time Learning
0 likes · 21 min read
Can Frozen LLMs Keep Learning? Inside Memento‑Skills' Deployment‑Time Learning
Woodpecker Software Testing
Woodpecker Software Testing
Mar 22, 2026 · Artificial Intelligence

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

This article explains why traditional API, assertion, and UI testing fail for Retrieval‑Augmented Generation (RAG) systems, and presents a four‑step, evidence‑driven testing framework—including golden test sets, dual‑track validation, chaos engineering, and continuous trust dashboards—to ensure factual reliability and operational robustness in real‑world deployments.

Fact CheckingLLMOpenTelemetry
0 likes · 8 min read
How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024
PaperAgent
PaperAgent
Mar 22, 2026 · Artificial Intelligence

Can LLM Agents Self‑Evolve Without Retraining? Inside Memento‑Skills

The article analyzes the Memento‑Skills framework, which treats external memory as executable skills to enable deployment‑time continual learning for frozen LLM agents, detailing its read‑write reflective loop, skill‑as‑memory design, behavior‑trained skill router, experimental validation on GAIA and HLE benchmarks, and theoretical guarantees without gradient updates.

AIAgentLLM
0 likes · 9 min read
Can LLM Agents Self‑Evolve Without Retraining? Inside Memento‑Skills
DataFunSummit
DataFunSummit
Mar 21, 2026 · Artificial Intelligence

How Slidebatching Revolutionizes LLM Inference Scheduling for Faster, More Efficient AI Services

The article examines the memory and latency challenges of 1750‑billion‑parameter LLM inference, introduces the xLLM framework’s Slidebatching and PD‑separation scheduling strategies, and details how these techniques achieve up to 35% system‑throughput gains and 52% SLO compliance improvements in real‑world multi‑priority workloads.

AI PerformanceLLMPD separation
0 likes · 15 min read
How Slidebatching Revolutionizes LLM Inference Scheduling for Faster, More Efficient AI Services
Data Party THU
Data Party THU
Mar 21, 2026 · Artificial Intelligence

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

The article explains that expanding LLM context windows leads to attention dilution and retrieval collapse, degrading answer quality, and argues that Retrieval‑Augmented Generation remains essential because it preserves signal density through focused retrieval and selective prompting.

AI ArchitectureAttention DilutionLLM
0 likes · 8 min read
Why Bigger Context Windows Hurt LLMs and How RAG Still Wins
Architect's Guide
Architect's Guide
Mar 21, 2026 · Artificial Intelligence

Turn PDFs, Word Docs, and Images into Instant Answers with WeKnora’s LLM‑Powered Search

WeKnora is a Tencent‑open‑source LLM‑based document understanding and semantic search framework that extracts structured content from PDFs, Word files and images, offers agent‑driven reasoning, multi‑modal retrieval, and a modular architecture, with step‑by‑step Docker deployment and a web UI for instant querying.

AILLMRAG
0 likes · 7 min read
Turn PDFs, Word Docs, and Images into Instant Answers with WeKnora’s LLM‑Powered Search
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 20, 2026 · Artificial Intelligence

Cursor’s Composer 2 Beats Claude Opus 4.6 with ‘Ankle‑Cut’ Pricing via New Reinforcement‑Learning Method

Cursor’s newly released Composer 2 model surpasses Claude Opus 4.6 on benchmarks such as Terminal‑Bench 2.0, offers dramatically lower token pricing, and achieves these gains by introducing a novel self‑summary reinforcement‑learning technique that compresses long‑context tasks while preserving critical information.

Composer 2CursorLLM
0 likes · 9 min read
Cursor’s Composer 2 Beats Claude Opus 4.6 with ‘Ankle‑Cut’ Pricing via New Reinforcement‑Learning Method
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Mar 20, 2026 · Artificial Intelligence

Why Vector‑Based RAG Falls Short and How PageIndex’s Reasoning‑Based Retrieval Solves It

This article analyzes the fundamental limitations of traditional vector‑based Retrieval‑Augmented Generation, introduces Vectify AI’s reasoning‑driven PageIndex framework, and explains how hierarchical, non‑vector indexing enables more accurate, context‑aware document retrieval for complex, domain‑specific texts.

AILLMPageIndex
0 likes · 15 min read
Why Vector‑Based RAG Falls Short and How PageIndex’s Reasoning‑Based Retrieval Solves It
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 20, 2026 · Artificial Intelligence

Mastering MinerU: Overcoming Its Top 9 Limitations for Reliable Document Parsing

This article examines MinerU's strengths and nine critical shortcomings—such as layout order errors, cross‑page table splits, merged‑cell failures, OCR misrecognition, and licensing issues—and provides concrete improvement strategies, interview‑ready resume bullets, and practical response frameworks for engineers.

LLMLayout AnalysisMinerU
0 likes · 13 min read
Mastering MinerU: Overcoming Its Top 9 Limitations for Reliable Document Parsing
Shuge Unlimited
Shuge Unlimited
Mar 20, 2026 · Artificial Intelligence

How a Single Gateway Manages 30+ Messaging Platforms in OpenClaw

This article dissects OpenClaw’s low‑level architecture, showing how a long‑lived Gateway process coordinates over 30 messaging platforms, how the pi‑mono embedded agent runtime drives the thinking cycle, and how the Context Engine and Session Management ensure consistent state, persistence, and extensibility.

AI AssistantContext EngineLLM
0 likes · 17 min read
How a Single Gateway Manages 30+ Messaging Platforms in OpenClaw
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 19, 2026 · Artificial Intelligence

Inside Xiaomi’s Hunter Alpha: 1‑Trillion‑Parameter LLM with 1M Context and Top Global Rankings

Xiaomi’s newly unveiled MiMo‑V2‑Pro, codenamed Hunter Alpha, is a trillion‑parameter LLM with a 1 million‑token context window that tops OpenRouter usage, achieves the second‑best domestic and eighth‑best global scores on Artificial Analysis, and delivers strong benchmark results across PinchBench, ClawEval, and SWE‑bench.

LLMMiMo-V2-ProMultimodal
0 likes · 9 min read
Inside Xiaomi’s Hunter Alpha: 1‑Trillion‑Parameter LLM with 1M Context and Top Global Rankings
PaperAgent
PaperAgent
Mar 19, 2026 · Artificial Intelligence

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

The article presents the MDER‑DR two‑stage framework that tackles semantic loss in knowledge‑graph triple indexing by generating context‑aware entity summaries and using an LLM‑driven decompose‑parse retrieval loop, achieving up to 66% performance gains on multi‑hop question answering benchmarks.

Entity SummarizationKG QAKnowledge Graph
0 likes · 5 min read
How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries
AgentGuide
AgentGuide
Mar 19, 2026 · Artificial Intelligence

What Exactly Is an AI Agent? Complete Interview Guide

This article breaks down the concept of AI agents for interview preparation, covering their definition, core components like planning, memory, and tool use, differences from plain LLM chats, real‑world challenges, typical use cases, detailed component analysis, and a runnable pseudo‑code example.

AI AgentLLMMemory
0 likes · 9 min read
What Exactly Is an AI Agent? Complete Interview Guide
Architect's Ambition
Architect's Ambition
Mar 18, 2026 · Artificial Intelligence

From Zero to a Real AI Agent: Master Its Core Essence, Not Just API Calls

The article explains why an AI Agent is more than a simple LLM API call, outlines its four essential modules—memory, planning, tool use, and feedback—shows how they differ from ordinary models, and offers practical steps and common pitfalls for building a production‑grade single‑agent system.

AI AgentLLMMemory
0 likes · 13 min read
From Zero to a Real AI Agent: Master Its Core Essence, Not Just API Calls
AI Explorer
AI Explorer
Mar 18, 2026 · Artificial Intelligence

Unlock Instant AI Agents with LangGraph‑Powered Deep Agents

Deep Agents, an open‑source framework built on LangGraph, bundles planning, file‑system tools, sub‑agent coordination and context management into a ready‑to‑run AI agent that can be launched with three lines of Python code and fully customized for diverse applications.

AI agentsAgent FrameworkDeep Agents
0 likes · 7 min read
Unlock Instant AI Agents with LangGraph‑Powered Deep Agents
DeepHub IMBA
DeepHub IMBA
Mar 18, 2026 · Artificial Intelligence

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

The article analyzes how most RAG pipelines blindly feed retrieved documents to LLMs, introduces CRAG's lightweight evaluator with confidence thresholds, describes its sentence‑level decomposition, filtering, and dual‑knowledge routing, and provides a full implementation walkthrough with a real insurance query example.

CRAGFAISSLLM
0 likes · 13 min read
CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator
o-ai.tech
o-ai.tech
Mar 18, 2026 · Artificial Intelligence

How Anthropic Builds Effective AI Agents: Practical Patterns and Principles

This guide distills Anthropic’s frontline experience into a concise framework for building high‑performing AI agents, covering the workflow‑vs‑agent distinction, five composable architecture patterns, core design principles, tool‑centric optimization, and pragmatic advice on using or bypassing agent frameworks.

AI agentsAnthropicLLM
0 likes · 9 min read
How Anthropic Builds Effective AI Agents: Practical Patterns and Principles
Huolala Tech
Huolala Tech
Mar 18, 2026 · Artificial Intelligence

Boosting LLM Accuracy: From RAG to GraphRAG for Enterprise Metadata Retrieval

This article explains the fundamentals of Retrieval‑Augmented Generation (RAG), introduces GraphRAG as an advanced architecture using knowledge graphs, details implementation pipelines, evaluates performance improvements, analyzes common pitfalls, and outlines future enhancements for enterprise metadata search.

AIGraphRAGKnowledge Graph
0 likes · 17 min read
Boosting LLM Accuracy: From RAG to GraphRAG for Enterprise Metadata Retrieval
Data STUDIO
Data STUDIO
Mar 18, 2026 · Artificial Intelligence

Building a Smart Web AI Agent with FastAPI, LangGraph, and MCP

This article walks through the design and implementation of a production‑ready Web AI agent that uses FastAPI as the HTTP layer, LangGraph to orchestrate multi‑step reasoning, and MCP to expose external tools, showing how to manage state, integrate multiple LLM providers, and extend the system with persistence, rate‑limiting, and monitoring.

AI AgentFastAPILLM
0 likes · 20 min read
Building a Smart Web AI Agent with FastAPI, LangGraph, and MCP
SuanNi
SuanNi
Mar 18, 2026 · Artificial Intelligence

Explore the LLM Architecture Gallery: Visualizing Seven Years of Model Evolution

The LLM Architecture Gallery, created by Sebastian Raschka, offers an interactive visual compendium of open‑weight large language models from 2019 to 2026, detailing their core parameters, architectural innovations, and the broader trends shaping modern AI research.

AIArtificial IntelligenceLLM
0 likes · 8 min read
Explore the LLM Architecture Gallery: Visualizing Seven Years of Model Evolution
AI Tech Publishing
AI Tech Publishing
Mar 18, 2026 · Artificial Intelligence

How Context Engineering Turns AI Agents from ‘Usable’ to ‘Highly Effective’

The article explains how organizing the prompt, tool schemas, dialogue history, and retrieved documents—collectively the context window—affects an AI agent’s decisions, introduces the concepts of Lost‑in‑the‑Middle, Thinking Tokens, tool‑response caching, compaction versus SubAgent strategies, and shows a step‑by‑step evolution that raised accuracy from 60 % to over 95 %.

AI agentsContext EngineeringLLM
0 likes · 9 min read
How Context Engineering Turns AI Agents from ‘Usable’ to ‘Highly Effective’
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 17, 2026 · Artificial Intelligence

From Lists to Decision Reports: The Deep Research Framework for Recommender Systems

The paper introduces Deep Research for Recommender Systems, a multi‑agent framework called RecPilot that replaces traditional list‑based recommendations with AI‑driven exploration, trajectory simulation, and structured decision‑support reports, and demonstrates its superiority on TMALL data through extensive trajectory and report‑generation evaluations.

LLMRecPilotdecision report
0 likes · 10 min read
From Lists to Decision Reports: The Deep Research Framework for Recommender Systems
DeepHub IMBA
DeepHub IMBA
Mar 17, 2026 · Artificial Intelligence

Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition

The article examines how retrieval‑augmented generation suffers from poor query formulation and presents two advanced strategies—query translation, which generates multiple semantically similar variants, and query decomposition, which breaks complex questions into finer sub‑queries—detailing methods such as fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and explains when to combine them.

Hybrid RetrievalLLMQuery Decomposition
0 likes · 9 min read
Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition
IT Services Circle
IT Services Circle
Mar 17, 2026 · Artificial Intelligence

How AI Workflows and Agents Transform Automation: From Rigid Rules to Intelligent Decision‑Making

This article explains the distinction and synergy between traditional AI workflows and modern agents, outlines their four‑step processes, showcases practical examples such as intelligent customer service and content generation, and recommends tools for beginners to quickly build AI‑driven applications.

AIAgentAutomation
0 likes · 11 min read
How AI Workflows and Agents Transform Automation: From Rigid Rules to Intelligent Decision‑Making
Data Party THU
Data Party THU
Mar 17, 2026 · Artificial Intelligence

How OpenMAIC Is Redefining AI-Powered Learning: From Multi‑Agent Labs to Classroom Revolution

OpenMAIC, the world’s first multi‑agent generative learning framework released by Tsinghua University, transforms technical documents into zero‑barrier interactive courses, supports AI‑driven lesson planning, multi‑agent discussions, and plug‑in extensions, and is rapidly evolving through 2024‑2026 to reshape education and beyond.

AI educationLLMMulti-Agent AI
0 likes · 10 min read
How OpenMAIC Is Redefining AI-Powered Learning: From Multi‑Agent Labs to Classroom Revolution
PaperAgent
PaperAgent
Mar 17, 2026 · Artificial Intelligence

Can Attention Replace Fixed Residuals? Inside the ‘Attention Residuals’ Breakthrough

This article analyzes the newly released Attention Residuals paper, explaining how learnable attention weighting replaces fixed residual addition to mitigate information dilution in deep LLMs, detailing the proposed Block AttnRes design, engineering trade‑offs, experimental results, and its significance for foundational model architecture.

Block AttentionLLMResidual Connections
0 likes · 9 min read
Can Attention Replace Fixed Residuals? Inside the ‘Attention Residuals’ Breakthrough
AI Step-by-Step
AI Step-by-Step
Mar 17, 2026 · Industry Insights

How OpenClaw Redefines Enterprise Software with AI‑Powered Business Integration

The article analyzes OpenClaw as an AI‑driven business‑connection layer that unifies chat entry, tool execution, and event‑driven automation, showing how enterprises can shorten system gaps, automate long‑tail workflows, and adopt a new agent‑based service model without replacing existing ERP or CRM solutions.

AI agentsHooksLLM
0 likes · 8 min read
How OpenClaw Redefines Enterprise Software with AI‑Powered Business Integration