Tagged articles

2074 articles

Page 5 of 21

Apr 2, 2026 · Artificial Intelligence

Why LLMs Forget You: Uncovering the Limits and Solutions for Long‑Term Memory

The article explains why large language models lack persistent memory due to the stateless Transformer architecture, breaks down the four dimensions of memory loss, surveys seven technical approaches, three product implementations, and emerging research, and discusses security and privacy implications.

AILLMLong-term Memory

0 likes · 22 min read

Why LLMs Forget You: Uncovering the Limits and Solutions for Long‑Term Memory

Shuge Unlimited

Apr 2, 2026 · Artificial Intelligence

Claude Code’s Hidden Pet System: How a Source‑Map Leak Uncovered an April‑Fools’ Easter Egg

A forgotten source‑map in Claude Code v2.1.88 exposed 510,000 lines of code, revealing a deliberately engineered Buddy pet system that combines deterministic random generation with LLM‑crafted personalities, complete with rarity tiers, ASCII art, and an April‑Fools’ activation window.

ASCII artClaude CodeLLM

0 likes · 15 min read

Claude Code’s Hidden Pet System: How a Source‑Map Leak Uncovered an April‑Fools’ Easter Egg

AI Step-by-Step

Apr 1, 2026 · Artificial Intelligence

When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth

The article explains why routing every request to the most powerful LLM hurts cost, speed, and throughput, and presents a three‑layer task decomposition that assigns execution‑level tasks to cheap small models, intermediate tasks to mid‑size models, and high‑risk judgment tasks to large models, with concrete examples and a minimal routing strategy.

Agent DesignLLMModel routing

0 likes · 8 min read

When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth

DaTaobao Tech

Apr 1, 2026 · Industry Insights

How AI Turned Taobao’s Marketing Venue Testing from Manual to Intelligent Automation

This article details the AI-driven testing platform built for Taobao’s marketing venues, describing how large‑language models and multimodal agents enable visual rendering verification, price and content consistency checks, and automated multi‑device adaptation, resulting in a 40% overall efficiency boost and a 100% increase in tester productivity.

AILLMMultimodal Agent

0 likes · 12 min read

How AI Turned Taobao’s Marketing Venue Testing from Manual to Intelligent Automation

PaperAgent

Apr 1, 2026 · Artificial Intelligence

How Meta‑Harness Revolutionizes LLM Harness Optimization with 10× Search Speed

Meta‑Harness introduces an external‑loop optimization framework that lets coding agents automatically search and improve large‑language‑model harnesses, achieving up to ten‑fold faster search, ten‑times token efficiency, and significant performance gains across text classification, math reasoning, and agentic coding tasks.

LLMMeta-HarnessRetrieval-Augmented Math

0 likes · 11 min read

How Meta‑Harness Revolutionizes LLM Harness Optimization with 10× Search Speed

Data STUDIO

Apr 1, 2026 · Artificial Intelligence

Blackboard System: Enabling Dynamic Collaboration Among Expert AI Agents

This article compares a rigid sequential multi‑agent pipeline with a flexible blackboard architecture, showing how shared memory and a dynamic controller let specialist AI agents cooperate opportunistically, obey conditional user instructions, and achieve higher efficiency and instruction‑following scores.

Blackboard SystemDynamic SchedulingLLM

0 likes · 21 min read

Blackboard System: Enabling Dynamic Collaboration Among Expert AI Agents

Wu Shixiong's Large Model Academy

Apr 1, 2026 · Artificial Intelligence

How to Design an Effective Agent Memory System for Enterprise AI Assistants

This article explains why AI agents need a structured memory module, outlines three memory types from cognitive science, details short‑term and long‑term storage architectures using vector databases, and provides concrete code and management strategies—including conflict resolution, TTL expiration, and privacy compliance—to build a robust Agent Memory system.

Agent MemoryLLMMem0

0 likes · 23 min read

How to Design an Effective Agent Memory System for Enterprise AI Assistants

Radish, Keep Going!

Mar 31, 2026 · Artificial Intelligence

Why Agent‑First Systems Fail and How Harness Engineering Fixes Them

The article analyzes OpenAI’s Harness Engineering approach, explains four systemic failure modes of LLM‑driven agents, and details five modular components—readable environment, task state machine, verification loop, architectural constraints, and loop detection—that together enable reliable, large‑scale agent development.

AIAutomationHarness

0 likes · 17 min read

Why Agent‑First Systems Fail and How Harness Engineering Fixes Them

Senior Tony

Mar 31, 2026 · Artificial Intelligence

Build and Debug LangGraph Workflows with Alibaba Qwen in Minutes

This article walks through creating a LangGraph workflow in Python, first using OpenAI’s GPT‑5‑nano model, then swapping to Alibaba’s Qwen 3.5‑plus model, showing how to suppress warnings, filter out thinking responses, visualize the graph, and troubleshoot common errors, all without any prior AI coding experience.

AI workflowAlibaba QwenLLM

0 likes · 8 min read

Build and Debug LangGraph Workflows with Alibaba Qwen in Minutes

Qborfy AI

Mar 31, 2026 · Artificial Intelligence

Mastering AI Agents with the Plan-and-Solve Design Pattern

The article introduces the Plan-and-Solve design pattern for AI agents, explaining how separating planning and execution improves handling of complex tasks, compares it with ReAct, provides detailed workflow diagrams, concrete examples such as weekly report generation, and offers a full Python implementation with dynamic replanning and result aggregation.

AI agentsAgent DesignLLM

0 likes · 14 min read

Mastering AI Agents with the Plan-and-Solve Design Pattern

Alibaba Cloud Big Data AI Platform

Mar 31, 2026 · Artificial Intelligence

How to Build a Production‑Ready AI Memory System with Mem0 and Elasticsearch

This guide explains how to overcome the stateless nature of large language models by using the Mem0 framework together with Elasticsearch to create a persistent, vector‑searchable memory layer, covering architecture, real‑world scenarios, step‑by‑step deployment, and integration with the OpenClaw agent framework.

AI memoryElasticsearchLLM

0 likes · 15 min read

How to Build a Production‑Ready AI Memory System with Mem0 and Elasticsearch

Woodpecker Software Testing

Mar 31, 2026 · Industry Insights

2026 AI Agent Testing Trends Every Test Expert Must Know

The article outlines how software testing is shifting from functional correctness to trustworthy behavior verification for AI agents in 2026, detailing a three‑dimensional trust matrix, agent‑native CI pipelines, human‑AI collaborative testing, and compliance‑driven auditable agents with concrete industry examples and metrics.

AI ComplianceAI testingLLM

0 likes · 9 min read

2026 AI Agent Testing Trends Every Test Expert Must Know

Data STUDIO

Mar 31, 2026 · Artificial Intelligence

Agent Architecture: Planner → Executor → Verifier – Adding a “Quality Inspector” to Your AI

This article introduces the PEV (Planner‑Executor‑Verifier) architecture, explains why AI agents need a verification step to avoid blindly trusting faulty tool outputs, demonstrates a full implementation with LangGraph, compares its robustness to a naïve baseline, and discusses its advantages, limitations, and suitable use cases.

AI agentsLLMLangGraph

0 likes · 23 min read

Agent Architecture: Planner → Executor → Verifier – Adding a “Quality Inspector” to Your AI

Wu Shixiong's Large Model Academy

Mar 31, 2026 · Information Security

Securing LLM Code Interpreter: Sandbox Strategies and Real‑World Pitfalls

This article examines why RAG systems need a Code Interpreter, explains the dangers of executing LLM‑generated code with exec(), and presents three sandbox designs—restricted exec, Docker containers, and E2B cloud sandboxes—along with whitelist/blacklist rules, an eight‑step execution flow, and practical lessons learned from production deployment.

Code InterpreterLLMPython

0 likes · 26 min read

Securing LLM Code Interpreter: Sandbox Strategies and Real‑World Pitfalls

AI Tech Publishing

Mar 31, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Building Your First AI Agent from Scratch (Full Code Included)

This comprehensive guide walks you through the fundamentals of AI agents, explains the core agent loop, compares workflow patterns with autonomous agents, and provides a practical five‑step process—including tool design, memory handling, testing, and multi‑agent collaboration—complete with real code examples for Anthropic and OpenAI SDKs.

AI AgentLLMMemory

0 likes · 22 min read

Step‑by‑Step Guide to Building Your First AI Agent from Scratch (Full Code Included)

Machine Heart

Mar 30, 2026 · Artificial Intelligence

Why AI’s Over‑Friendly Behavior Wins Users: Insights from a Science Paper on AI Sycophancy

A recent Science paper reveals that large language models habitually over‑affirm users—a phenomenon called AI sycophancy—leading to higher user trust and preference but also reducing prosocial intentions, responsibility, and conflict‑resolution willingness across diverse scenarios.

AI SycophancyLLMhuman-AI interaction

0 likes · 16 min read

Why AI’s Over‑Friendly Behavior Wins Users: Insights from a Science Paper on AI Sycophancy

AI Engineer Programming

Mar 30, 2026 · Artificial Intelligence

Agent, Multi‑Agent, Deep Agent: Start Simple, Add Complexity Only When Needed

The article clarifies the distinct meanings of Agent, Multi‑Agent, and Deep Agent, explains how control shifts from engineers to models, compares architectures across nine dimensions, and shows why a lightweight harness is essential for long‑running, parallel AI‑driven software development.

AgentDeep AgentHarness

0 likes · 22 min read

Agent, Multi‑Agent, Deep Agent: Start Simple, Add Complexity Only When Needed

Black & White Path

Mar 30, 2026 · Information Security

OWASP Top 10 Risks for LLMs Every AI Security Beginner Must Know

The article outlines the OWASP Top 10 threats for large language model applications—including prompt injection, data leakage, supply‑chain attacks, model poisoning, improper output handling, excessive agency, system prompt leakage, vector embedding weaknesses, misinformation, and unbounded consumption—plus three essential mitigation rules for newcomers.

AI securityLLMOWASP

0 likes · 6 min read

OWASP Top 10 Risks for LLMs Every AI Security Beginner Must Know

Wu Shixiong's Large Model Academy

Mar 30, 2026 · Operations

Mastering RAG Post‑Launch: A Closed‑Loop Badcase Management Blueprint

This article explains how to establish a six‑step closed‑loop workflow for operating RAG‑based question‑answer systems in insurance, covering badcase collection via three channels, four‑type classification, automated scripts, regression testing, gray‑scale rollout, and real‑world metrics that boosted answer accuracy from 76 % to 89 %.

Badcase ManagementInsurance AILLM

0 likes · 20 min read

Mastering RAG Post‑Launch: A Closed‑Loop Badcase Management Blueprint

Su San Talks Tech

Mar 30, 2026 · Artificial Intelligence

Mastering LLM Function Calling: Theory, Workflow, and Hands‑On Code

This article explains the fundamentals of large‑model function calling, why it’s needed to bridge language models with real‑world tools, and provides a step‑by‑step implementation in Python—including tool definition, intent extraction, local execution, and result integration—complete with code samples and diagrams.

AI AgentAPIFunction Calling

0 likes · 11 min read

Mastering LLM Function Calling: Theory, Workflow, and Hands‑On Code

Open Source Tech Hub

Mar 30, 2026 · Artificial Intelligence

How to Install Neuron AI and Build Your First Agent in a Webman PHP App

This guide walks you through installing the Webman framework, adding the Neuron AI package via Composer, generating a custom Agent class, configuring its provider with an API key, and interacting with the agent through a controller to receive responses from a large language model.

AgentComposerLLM

0 likes · 3 min read

How to Install Neuron AI and Build Your First Agent in a Webman PHP App

Machine Heart

Mar 29, 2026 · Artificial Intelligence

How Small Teams Can Build Deep Research Agents with the OpenResearcher Open‑Source Pipeline

OpenResearcher presents a fully open, reproducible offline pipeline that synthesizes 97,000 long‑horizon research trajectories, enabling a 30B LLM to achieve 54.8% accuracy on BrowseComp‑Plus and surpass leading closed‑source models while eliminating online API costs.

AILLMOpenResearcher

0 likes · 16 min read

How Small Teams Can Build Deep Research Agents with the OpenResearcher Open‑Source Pipeline

Qborfy AI

Mar 29, 2026 · Artificial Intelligence

Mastering AI Agent Reflection: The Generate‑Reflect‑Refine Loop

This article explains the Reflection design pattern for AI agents, detailing how a three‑step generate‑reflect‑refine cycle can iteratively improve outputs, provides both a simple two‑call implementation and a structured class‑based version, and shares practical tips, benchmarks, and references to the original research.

AI agentsLLMPrompt Engineering

0 likes · 9 min read

Mastering AI Agent Reflection: The Generate‑Reflect‑Refine Loop

Wu Shixiong's Large Model Academy

Mar 29, 2026 · Artificial Intelligence

Mastering RAG Prompt Engineering: Prevent Hallucinations and Boost Accuracy

This article dissects the unique challenges of RAG prompting, presents a systematic System/User Prompt design with strong constraints and citation requirements, compares constraint strengths with quantitative hallucination rates, and offers long‑context compression strategies and rigorous testing methods to ensure reliable LLM answers.

Context CompressionLLMRAG

0 likes · 19 min read

Mastering RAG Prompt Engineering: Prevent Hallucinations and Boost Accuracy

Java One

Mar 28, 2026 · Artificial Intelligence

Building a Vector‑Free RAG System with Hierarchical Page Indexing

This guide explains how to create a retrieval‑augmented generation (RAG) system that avoids embeddings by converting documents into a hierarchical tree, using an LLM to navigate, summarize, and retrieve answers, complete with a full Python implementation and a GitHub repository.

Hierarchical IndexingLLMPython

0 likes · 15 min read

Building a Vector‑Free RAG System with Hierarchical Page Indexing

Machine Learning Algorithms & Natural Language Processing

Mar 28, 2026 · Artificial Intelligence

A Comprehensive Guide to LLM Post‑Training: From RLHF and GRPO to Agentic RL

This article systematically explains the post‑training pipeline for large language models, covering supervised fine‑tuning, RLHF, PPO, GRPO, RLVR, DPO and emerging Agentic RL, while illustrating each method with analogies, detailed workflows, tables, and recent research findings.

DPOGRPOLLM

0 likes · 24 min read

A Comprehensive Guide to LLM Post‑Training: From RLHF and GRPO to Agentic RL

AI Algorithm Path

Mar 28, 2026 · Artificial Intelligence

A Practical Guide to Building Agent Skills for Large Language Models

This guide explains the concept of LLM "Skills", shows how to organize skill directories for Claude and Copilot, walks through creating a "prepare‑pr" skill with a SKILL.md file, integrates Bash scripts for git checks, and demonstrates testing and extending the skill with additional checks and templates.

Agent SkillsBash scriptClaude

0 likes · 12 min read

A Practical Guide to Building Agent Skills for Large Language Models

AI Large-Model Wave and Transformation Guide

Mar 28, 2026 · Artificial Intelligence

How to Ace LLM Interview Questions: Deep Dive into Pre‑training, SFT, DPO & RLHF

This guide breaks down the four major large‑model training paradigms—pre‑training, supervised fine‑tuning, preference alignment, and RLHF—explaining which parameters are updated, how attention is reshaped, and what capabilities are gained, so you can deliver a structured, interview‑ready answer.

AI InterviewLLMLarge Language Models

0 likes · 8 min read

How to Ace LLM Interview Questions: Deep Dive into Pre‑training, SFT, DPO & RLHF

AI Large-Model Wave and Transformation Guide

Mar 28, 2026 · Artificial Intelligence

From RNNs to Multimodal Agents: A Decade of Transformer Evolution

This article traces the evolution of sequence models from early RNN/LSTM designs through the breakthrough Transformer, its major branches, dense scaling, efficiency‑focused variants, next‑generation linear‑complexity SSMs, and finally multimodal agent architectures, highlighting each stage's strengths, weaknesses, and typical use cases.

AI ArchitectureEfficient AttentionLLM

0 likes · 12 min read

From RNNs to Multimodal Agents: A Decade of Transformer Evolution

Wu Shixiong's Large Model Academy

Mar 28, 2026 · Artificial Intelligence

Mastering Multi‑Agent Systems: Design, Parallel Execution, and Interview Strategies

This article dissects the shortcomings of single‑agent LLM pipelines, introduces the Supervisor‑based Multi‑Agent architecture with LangGraph, demonstrates parallel task execution, robust error handling, and result merging, and provides concrete interview guidance backed by real performance data.

AI ArchitectureError HandlingLLM

0 likes · 19 min read

Mastering Multi‑Agent Systems: Design, Parallel Execution, and Interview Strategies

Code Mala Tang

Mar 28, 2026 · Artificial Intelligence

Can Claude Translate the Linux Kernel to Rust? Insights, Experiments, and Costs

This article evaluates Claude's ability to translate isolated Linux kernel modules from C to Rust, presenting a detailed analysis of translation granularity, token costs, experimental results on drivers, networking, and file‑system modules, and discussing the technical and economic challenges of a full kernel rewrite.

AILLMRust

0 likes · 19 min read

Can Claude Translate the Linux Kernel to Rust? Insights, Experiments, and Costs

AI Tech Publishing

Mar 28, 2026 · Artificial Intelligence

Designing Agent Memory Systems: Four Types, Three Strategies, and Full Python Implementation

This article breaks down agentic memory into four distinct types—In‑context, External, Episodic, and Semantic/Parametric—explains three forgetting strategies (time decay, importance scoring, periodic consolidation), shows how memory flows through an agent loop, and provides complete Python code using OpenAI embeddings and ChromaDB for a production‑ready memory layer.

Agent MemoryChromaDBLLM

0 likes · 22 min read

Designing Agent Memory Systems: Four Types, Three Strategies, and Full Python Implementation

AI Insight Log

Mar 28, 2026 · Artificial Intelligence

Anthropic’s Leaked Mythos Model Claims to Outperform Opus 4.6 – Why Release Is Delayed

A leaked internal Anthropic blog reveals the upcoming Claude Mythos (codenamed Capybara) model, touted as a step‑change over Opus 4.6 in programming, academic reasoning, and cybersecurity, while highlighting unprecedented security risks, early access for security professionals, and high compute costs that postpone a full launch.

AI safetyAnthropicClaude Mythos

0 likes · 5 min read

Anthropic’s Leaked Mythos Model Claims to Outperform Opus 4.6 – Why Release Is Delayed

Bighead's Algorithm Notes

Mar 27, 2026 · Artificial Intelligence

Weekly Quantitative Finance Paper Roundup (Mar 21‑27, 2026)

This article presents concise English summaries of four recent AI‑driven quantitative finance papers, covering an agentic AI screening platform for portfolio investment, a wavelet‑based physics‑informed neural network for option pricing, the FinRL‑X modular trading infrastructure, and the S³G stock state‑space graph for enhanced trend prediction, each with authors, links, and key experimental results.

AILLMModular Trading Infrastructure

0 likes · 12 min read

Weekly Quantitative Finance Paper Roundup (Mar 21‑27, 2026)

DataFunTalk

Mar 27, 2026 · Artificial Intelligence

Building a Production‑Ready RAG Engine: Architecture, Challenges & Solutions

This article examines the practical challenges of deploying Retrieval‑Augmented Generation in enterprise settings, outlines a layered RAG architecture with offline document processing and online query handling, and details the hybrid retrieval, multi‑stage ranking, knowledge filtering, and generation techniques that improve accuracy and reduce hallucinations.

AI EngineeringHybrid RetrievalKnowledge Filtering

0 likes · 22 min read

Building a Production‑Ready RAG Engine: Architecture, Challenges & Solutions

Qborfy AI

Mar 26, 2026 · Artificial Intelligence

Mastering ReAct: Turn LLMs into Thoughtful, Actionable AI Agents

This article explains the ReAct (Reasoning + Acting) design pattern for large language model agents, detailing its thought‑action‑observation loop, concrete examples, prompt engineering tips, full Python implementations, common pitfalls, and references to the original Google research.

AI agentsLLMOpenAI

0 likes · 11 min read

Mastering ReAct: Turn LLMs into Thoughtful, Actionable AI Agents

AI Explorer

Mar 26, 2026 · Artificial Intelligence

Reinventing Financial Trading with a Multi‑Agent LLM Framework

TradingAgents introduces a multi‑agent architecture that lets specialized LLM experts—researchers, analysts, traders and risk managers—collaborate to analyse markets, manage risk and execute trades, offering a new AI‑driven collaboration paradigm for quantitative finance while providing explainable decisions and enterprise‑grade stability.

AI CollaborationFinancial AILLM

0 likes · 6 min read

Reinventing Financial Trading with a Multi‑Agent LLM Framework

AI Programming Lab

Mar 26, 2026 · Artificial Intelligence

LLMs to the Left, Harness Engineering to the Right: Bridging the Gap

The article argues that the real bottleneck for LLM‑driven agents is not model capability but the surrounding control system—Harness Engineering—which can dramatically boost success rates, reduce failure cascades, and become the lasting moat for AI productivity.

AI OpsAgent HarnessContext Engineering

0 likes · 14 min read

LLMs to the Left, Harness Engineering to the Right: Bridging the Gap

Data STUDIO

Mar 26, 2026 · Artificial Intelligence

Metacognitive Agents: Teaching AI to Self‑Assess Before Answering

The article introduces metacognitive agents that equip AI with a self‑model to evaluate confidence, domain relevance, tool availability, and risk before acting, demonstrating a LangGraph‑based medical triage assistant with code, workflow, safety advantages, and practical test results.

AI safetyLLMLangGraph

0 likes · 22 min read

Metacognitive Agents: Teaching AI to Self‑Assess Before Answering

DevOps Coach

Mar 26, 2026 · Artificial Intelligence

Turn Claude into a Structured Engineer: Mastering Claude.md for Reliable Code Generation

This guide explains how the Claude.md (or agent.md) file lets you embed a reusable engineering workflow into Claude, covering planning, sub‑agents, self‑improvement loops, verification, elegance, autonomous bug fixing, and core principles to dramatically improve code quality and reliability.

ClaudeLLMPromptEngineering

0 likes · 14 min read

Turn Claude into a Structured Engineer: Mastering Claude.md for Reliable Code Generation

Architecture Musings

Mar 25, 2026 · Information Security

Seeing AI Agent Drift in Vector Space: An Unvalidated Thought Experiment

The article imagines an AI coding agent that silently exfiltrates credentials hidden in data, explains why rule‑based and text‑level defenses miss such attacks, proposes monitoring the agent's vector‑space decision trajectory with six geometric metrics, and critically evaluates the feasibility and limitations of this approach.

AI agentsAnomaly DetectionLLM

0 likes · 23 min read

Seeing AI Agent Drift in Vector Space: An Unvalidated Thought Experiment

SuanNi

Mar 25, 2026 · Artificial Intelligence

How to Evaluate, Optimize, and Secure Retrieval‑Augmented Generation (RAG) Pipelines

This article explains the evaluation pillar of context engineering, introduces the three core RAG metrics (context relevance, faithfulness, answer relevance), details the RAGAS automated assessment framework, shows how to build evaluation datasets, adopt evaluation‑driven development, and protect RAG systems from prompt injection and data leakage.

LLMRAGRAGAS

0 likes · 13 min read

How to Evaluate, Optimize, and Secure Retrieval‑Augmented Generation (RAG) Pipelines

Full-Stack Cultivation Path

Mar 25, 2026 · Artificial Intelligence

Understanding Tool Use in LLMs: How Models Leverage Tool Calls

This article explains why large language models need tool use, defines the concepts of Tool Use, Tool Call, and Function Calling, compares them, walks through a complete tool‑use workflow, and discusses architectural, safety, and design considerations for building reliable LLM agents.

AgentLLMPrompt Engineering

0 likes · 17 min read

Understanding Tool Use in LLMs: How Models Leverage Tool Calls

AI Engineering

Mar 25, 2026 · Artificial Intelligence

Is “Harness Engineering” Just Rebranded Engineering Common Sense?

The article examines the hype around “harness engineering” in LLM workflows, showing through SGLang’s multi‑agent experience that the approach merely repackages established software‑engineering principles such as separation of concerns, docs‑as‑code, and structured routing, and discusses its limits and future implications.

Harness EngineeringLLMSGLang

0 likes · 8 min read

Is “Harness Engineering” Just Rebranded Engineering Common Sense?

Architect's Journey

Mar 25, 2026 · Artificial Intelligence

Why SKILL Makes AI Development Surprisingly Simple

The article introduces the SKILL framework, explains its file‑based structure and LLM‑driven entry point, compares it with traditional API‑centric backends, outlines its suitable use cases and limitations, and argues that mastering SKILL will become a core productivity skill for developers.

AI EngineeringLLMSKILL framework

0 likes · 8 min read

Why SKILL Makes AI Development Surprisingly Simple

Java Architecture Diary

Mar 25, 2026 · Artificial Intelligence

Building Java AI Agents with Koog: A Hands‑On Guide to Native Java Agent APIs

JetBrains' newly released Koog for Java provides a native Java AI Agent framework that lets developers annotate methods as tools, assemble agents with a Builder‑style API, and let large language models orchestrate multi‑step tasks without writing explicit control flow, as demonstrated with banking and e‑commerce examples.

AI AgentBuilder APIJava

0 likes · 9 min read

Building Java AI Agents with Koog: A Hands‑On Guide to Native Java Agent APIs

Code Wrench

Mar 25, 2026 · Artificial Intelligence

Unlocking LocalAI’s Multimodal Power: Voice, Vision, and Code Generation Explained

This article explores LocalAI’s multimodal capabilities—including speech‑to‑text, text‑to‑speech, and image generation—demonstrates zero‑code migration via Python SDK and LangChain, and reveals the Go‑based API adapter that enables seamless OpenAI‑compatible integration.

APIGoLLM

0 likes · 8 min read

Unlocking LocalAI’s Multimodal Power: Voice, Vision, and Code Generation Explained

AI Explorer

Mar 24, 2026 · Artificial Intelligence

Revolutionizing Financial Trading with a Multi‑Agent AI Framework

TradingAgents is an open‑source Python framework that uses multiple specialized LLM agents—Analyst, Researcher, Trader, and Risk Manager—to mimic a real investment bank’s workflow, offering a more robust and explainable approach to quantitative trading and financial research.

Financial AILLMOpen Source

0 likes · 6 min read

Revolutionizing Financial Trading with a Multi‑Agent AI Framework

Tencent Tech

Mar 24, 2026 · Artificial Intelligence

Unlocking AI Power: How Skill Packages Transform Large Language Models

This article provides a comprehensive technical guide to Skill packages—standardized knowledge containers that give large language models expert-level execution capabilities—covering their definition, architecture, integration with the Model Context Protocol (MCP), creation workflow, best‑practice tips, collaborative patterns, debugging strategies, philosophical implications, and future directions.

AI toolingLLMMCP

0 likes · 18 min read

Unlocking AI Power: How Skill Packages Transform Large Language Models

Alibaba Cloud Big Data AI Platform

Mar 24, 2026 · Artificial Intelligence

How Hologres + Mem0 Deliver Low‑Cost, High‑Performance Long‑Memory for LLMs

This article explains how the combination of Hologres, a unified real‑time data warehouse, and Mem0, an open‑source LLM memory framework, overcomes the limited context window of large language models by providing scalable, low‑latency, and cost‑effective long‑term memory for AI applications.

AI infrastructureHologresLLM

0 likes · 11 min read

How Hologres + Mem0 Deliver Low‑Cost, High‑Performance Long‑Memory for LLMs

SuanNi

Mar 24, 2026 · Artificial Intelligence

How Compression, Orchestration, and LangGraph Are Redefining LLM Context Engineering

This article analyzes the six pillars of context engineering for large language models, focusing on compression techniques, extractive vs. abstractive methods, the LLMLingua toolkit, dynamic orchestration with routing and agentic RAG, and how LangGraph enables sophisticated agent‑driven workflows.

Agentic RAGContext CompressionLLM

0 likes · 14 min read

How Compression, Orchestration, and LangGraph Are Redefining LLM Context Engineering

AgentGuide

Mar 24, 2026 · Artificial Intelligence

What I Learned Moving from Backend Engineering to AI Agent Development

The author, a former backend engineer turned AI Agent developer, explains how LLM uncertainty, context engineering, shifting code responsibilities, workflow standards, new failure modes, and the ReAct paradigm shape modern Agent development, and outlines tasks best suited—or unsuited—for LLMs.

AI AgentContext EngineeringLLM

0 likes · 6 min read

What I Learned Moving from Backend Engineering to AI Agent Development

DataFunTalk

Mar 24, 2026 · Artificial Intelligence

Memory‑Based Self‑Evolution: Redefining LLM Agents Beyond Parameter Updates

This article examines the limitations of traditional supervised fine‑tuning and reinforcement learning for LLM agents, introduces a memory‑based self‑evolution paradigm with technologies such as Dynamic Cheatsheet, ReasoningBank, ACE and MemGen, and shows how building an experience bank can turn static models into continuously learning agents, especially in the insurance sector.

Insurance AILLMknowledge flywheel

0 likes · 13 min read

Memory‑Based Self‑Evolution: Redefining LLM Agents Beyond Parameter Updates

Data STUDIO

Mar 24, 2026 · Artificial Intelligence

Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes

This article explains why large language models alone can hallucinate, introduces the tool‑using agent architecture, and provides a step‑by‑step Python tutorial using LangChain, LangGraph, and Tavily to create, run, and evaluate a real‑time web‑search capable AI assistant.

AgentLLMLangChain

0 likes · 16 min read

Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes

Code Wrench

Mar 24, 2026 · Artificial Intelligence

Building a Private AI Coding Assistant with LocalAI: Go‑Powered OpenAI API Replacement

This article introduces LocalAI, an open‑source Go‑based self‑hosted LLM server that serves as a drop‑in OpenAI API replacement, outlines its key features, privacy and cost benefits, provides a Docker quick‑start guide, and explains its modular architecture for developers seeking private AI solutions.

AI AssistantGoLLM

0 likes · 7 min read

Building a Private AI Coding Assistant with LocalAI: Go‑Powered OpenAI API Replacement

SuanNi

Mar 24, 2026 · Artificial Intelligence

How Memento‑Skills Enables Self‑Evolving LLMs Without Fine‑Tuning

Introducing Memento‑Skills, a novel framework that freezes LLM parameters while an external skill library iteratively reads, writes, and refines capabilities, achieving up to 116% accuracy gains on GAIA and HLE benchmarks and demonstrating scalable self‑evolution without costly model fine‑tuning.

LLMreinforcement learningself-evolution

0 likes · 11 min read

How Memento‑Skills Enables Self‑Evolving LLMs Without Fine‑Tuning

Machine Learning Algorithms & Natural Language Processing

Mar 24, 2026 · Artificial Intelligence

A Comprehensive Guide to Major Attention Mechanisms: From MHA and GQA to MLA, Sparse and Hybrid Architectures

This article reviews and compares the most important attention variants used in modern large language models—including multi‑head attention, grouped‑query attention, multi‑head latent attention, sparse and sliding‑window attention, gated attention, and hybrid designs—detailing their motivations, memory trade‑offs, example architectures, and experimental findings.

Hybrid ArchitectureLLMMHA

0 likes · 29 min read

A Comprehensive Guide to Major Attention Mechanisms: From MHA and GQA to MLA, Sparse and Hybrid Architectures

Alibaba Cloud Developer

Mar 24, 2026 · Artificial Intelligence

Why LLMs Behave Unpredictably: From Uncertainty to Practical Agent Design

This article analyzes the sources of LLM output uncertainty, explores hardware and architectural constraints, demonstrates how to build robust AI agents with prompt engineering, tool orchestration, and memory management, and compares traditional micro‑service design with modern LLM‑centric workflows.

AI AgentHardwareLLM

0 likes · 64 min read

Why LLMs Behave Unpredictably: From Uncertainty to Practical Agent Design

Wu Shixiong's Large Model Academy

Mar 23, 2026 · Artificial Intelligence

From RAG to Deep Research: Building Autonomous AI Agents for Industry Reports

This article explains how Deep Research extends traditional Retrieval‑Augmented Generation by adding autonomous planning, multi‑step search, self‑correction, and long‑context synthesis to enable AI agents that can generate comprehensive industry analysis reports.

AI AgentAutonomous RetrievalLLM

0 likes · 18 min read

From RAG to Deep Research: Building Autonomous AI Agents for Industry Reports

Data Party THU

Mar 23, 2026 · Artificial Intelligence

Boosting RAG Performance: Query Translation & Decomposition Techniques

The article explains two emerging RAG query‑optimization approaches—query translation and query decomposition—detailing fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and shows how combining them can improve relevance and latency in LLM‑augmented systems.

LLMRAGRetrieval-Augmented Generation

0 likes · 9 min read

Boosting RAG Performance: Query Translation & Decomposition Techniques

Full-Stack Cultivation Path

Mar 23, 2026 · Artificial Intelligence

What Exactly Is a Token in LLMs? A First‑Principles Explanation

The article explains that a token is the smallest discrete text unit a large language model processes, detailing why tokenization is essential, how tokenizers work, how tokens flow through the transformer, and how token counts affect context windows, cost, latency, and overall model behavior.

Cost ManagementEmbeddingLLM

0 likes · 20 min read

What Exactly Is a Token in LLMs? A First‑Principles Explanation

SuanNi

Mar 23, 2026 · Artificial Intelligence

Can LLMs Predict Real‑World War Outcomes? A Deep Dive into the 2026 Middle East Conflict

A research team from MBZUAI and the University of Maryland constructed an 11‑point timeline of the 2026 Middle East escalation, fed contemporaneous news to leading large language models, and evaluated their strategic reasoning, economic impact forecasts, and political signal interpretation, revealing both strengths and limitations of AI under extreme uncertainty.

AI evaluationArtificial IntelligenceGeopolitics

0 likes · 12 min read

Can LLMs Predict Real‑World War Outcomes? A Deep Dive into the 2026 Middle East Conflict

Huawei Cloud Developer Alliance

Mar 23, 2026 · Artificial Intelligence

How Agent Skills Solve LLM Development Pain Points and Gain Standard Status

The article analyses the emergence of Agent Skills as an open LLM standard, explains the technical shortcomings of current prompt‑centric workflows, describes the three‑layer skill architecture and its benefits for reuse, versioning and organization‑wide deployment, and discusses current limitations and future evolution paths.

AI StandardsAgent SkillsLLM

0 likes · 29 min read

How Agent Skills Solve LLM Development Pain Points and Gain Standard Status

Su San Talks Tech

Mar 23, 2026 · Artificial Intelligence

How OpenClaw Turns AI Agents into Real‑World Automation Tools

OpenClaw is an AI Agent framework that bridges chat platforms and large language models, enabling automated tasks through context‑engineered prompts, tool usage, memory management, sub‑agents, and security controls, while illustrating practical examples, workflow steps, and mitigation strategies for potential shell‑command exploits.

AI AgentLLMOpenClaw

0 likes · 18 min read

How OpenClaw Turns AI Agents into Real‑World Automation Tools

Architect

Mar 22, 2026 · Artificial Intelligence

Can Frozen LLMs Keep Learning? Inside Memento‑Skills' Deployment‑Time Learning

The article analyses the Memento‑Skills paper and its open‑source implementation, showing how a frozen large language model can continuously improve by treating skills as external memory, using a five‑step Observe‑Read‑Act‑Feedback‑Write loop, advanced routing, and modular architecture to achieve significant gains on GAIA and HLE benchmarks.

AI ArchitectureAgentDeployment-Time Learning

0 likes · 21 min read

Can Frozen LLMs Keep Learning? Inside Memento‑Skills' Deployment‑Time Learning

Woodpecker Software Testing

Mar 22, 2026 · Artificial Intelligence

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

This article explains why traditional API, assertion, and UI testing fail for Retrieval‑Augmented Generation (RAG) systems, and presents a four‑step, evidence‑driven testing framework—including golden test sets, dual‑track validation, chaos engineering, and continuous trust dashboards—to ensure factual reliability and operational robustness in real‑world deployments.

Fact CheckingLLMOpenTelemetry

0 likes · 8 min read

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

Radish, Keep Going!

Mar 22, 2026 · Artificial Intelligence

Why LLMs Pick 7200–7500: The Hidden Bias Behind Their ‘Random’ Numbers

A Reddit experiment showed that major LLMs consistently choose numbers between 7200 and 7500 when asked to pick a random value from 1 to 10000, revealing deterministic patterns rooted in training data bias, human heuristics, and serious security implications.

AIBiasLLM

0 likes · 11 min read

Why LLMs Pick 7200–7500: The Hidden Bias Behind Their ‘Random’ Numbers

AI Product Manager Community

Mar 22, 2026 · Artificial Intelligence

How to Build a Systematic Solution for LLM Hallucinations in Enterprise AI

This article outlines a comprehensive, multi‑layered approach—including data anchoring, architectural guardrails, prompt engineering, and LLMOps—to mitigate hallucinations in large language models for enterprise applications.

AI guardrailsHallucination MitigationKnowledge Graph

0 likes · 7 min read

How to Build a Systematic Solution for LLM Hallucinations in Enterprise AI

PaperAgent

Mar 22, 2026 · Artificial Intelligence

Can LLM Agents Self‑Evolve Without Retraining? Inside Memento‑Skills

The article analyzes the Memento‑Skills framework, which treats external memory as executable skills to enable deployment‑time continual learning for frozen LLM agents, detailing its read‑write reflective loop, skill‑as‑memory design, behavior‑trained skill router, experimental validation on GAIA and HLE benchmarks, and theoretical guarantees without gradient updates.

AIAgentLLM

0 likes · 9 min read

Can LLM Agents Self‑Evolve Without Retraining? Inside Memento‑Skills

Architect's Journey

Mar 22, 2026 · Artificial Intelligence

Cursor Builds Its Own Code Model, Cutting Costs 90% and Sparking a Price War

Cursor unveiled Composer 2, a self‑trained code‑only LLM that scores 61.3 on its CursorBench, outperforms Claude Opus 4.6 and rivals GPT‑5.4, while charging just $0.50 per million tokens—about one‑tenth the price of competitors—prompting a new AI pricing battle.

AI competitionComposer 2Cursor

0 likes · 5 min read

Cursor Builds Its Own Code Model, Cutting Costs 90% and Sparking a Price War

DataFunSummit

Mar 21, 2026 · Artificial Intelligence

How Slidebatching Revolutionizes LLM Inference Scheduling for Faster, More Efficient AI Services

The article examines the memory and latency challenges of 1750‑billion‑parameter LLM inference, introduces the xLLM framework’s Slidebatching and PD‑separation scheduling strategies, and details how these techniques achieve up to 35% system‑throughput gains and 52% SLO compliance improvements in real‑world multi‑priority workloads.

AI PerformanceLLMPD separation

0 likes · 15 min read

How Slidebatching Revolutionizes LLM Inference Scheduling for Faster, More Efficient AI Services

Data Party THU

Mar 21, 2026 · Artificial Intelligence

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

The article explains that expanding LLM context windows leads to attention dilution and retrieval collapse, degrading answer quality, and argues that Retrieval‑Augmented Generation remains essential because it preserves signal density through focused retrieval and selective prompting.

AI ArchitectureAttention DilutionLLM

0 likes · 8 min read

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

SuanNi

Mar 21, 2026 · Artificial Intelligence

Mastering Context Engineering: Six Pillars, Retrieval Strategies, and Structured Output

This article explains the six pillars of context engineering, focusing on structuring techniques, advanced retrieval methods, hybrid search, reranking, query transformation, and practical pipelines that turn raw data into reliable, LLM‑ready inputs for higher quality AI responses.

Hybrid SearchLLMRAG

0 likes · 14 min read

Mastering Context Engineering: Six Pillars, Retrieval Strategies, and Structured Output

Architect's Guide

Mar 21, 2026 · Artificial Intelligence

Turn PDFs, Word Docs, and Images into Instant Answers with WeKnora’s LLM‑Powered Search

WeKnora is a Tencent‑open‑source LLM‑based document understanding and semantic search framework that extracts structured content from PDFs, Word files and images, offers agent‑driven reasoning, multi‑modal retrieval, and a modular architecture, with step‑by‑step Docker deployment and a web UI for instant querying.

AILLMRAG

0 likes · 7 min read

Turn PDFs, Word Docs, and Images into Instant Answers with WeKnora’s LLM‑Powered Search

Machine Learning Algorithms & Natural Language Processing

Mar 20, 2026 · Artificial Intelligence

Cursor’s Composer 2 Beats Claude Opus 4.6 with ‘Ankle‑Cut’ Pricing via New Reinforcement‑Learning Method

Cursor’s newly released Composer 2 model surpasses Claude Opus 4.6 on benchmarks such as Terminal‑Bench 2.0, offers dramatically lower token pricing, and achieves these gains by introducing a novel self‑summary reinforcement‑learning technique that compresses long‑context tasks while preserving critical information.

Composer 2CursorLLM

0 likes · 9 min read

Cursor’s Composer 2 Beats Claude Opus 4.6 with ‘Ankle‑Cut’ Pricing via New Reinforcement‑Learning Method

Architect's Alchemy Furnace

Mar 20, 2026 · Artificial Intelligence

Why Vector‑Based RAG Falls Short and How PageIndex’s Reasoning‑Based Retrieval Solves It

This article analyzes the fundamental limitations of traditional vector‑based Retrieval‑Augmented Generation, introduces Vectify AI’s reasoning‑driven PageIndex framework, and explains how hierarchical, non‑vector indexing enables more accurate, context‑aware document retrieval for complex, domain‑specific texts.

AILLMPageIndex

0 likes · 15 min read

Why Vector‑Based RAG Falls Short and How PageIndex’s Reasoning‑Based Retrieval Solves It

AI Engineering

Mar 20, 2026 · Artificial Intelligence

Cursor Unveils Composer 2: A Code‑Focused Model Priced at a Fraction of GPT‑5

Cursor's Composer 2, a code‑only AI model, jumps from a 44.2 to 61.3 benchmark score, outperforms Claude Opus 4.6, nears GPT‑5.4, and costs just $0.50 per million tokens, reshaping its strategy after heavy reliance on external APIs.

AI modelComposer 2Cursor

0 likes · 4 min read

Cursor Unveils Composer 2: A Code‑Focused Model Priced at a Fraction of GPT‑5

Wu Shixiong's Large Model Academy

Mar 20, 2026 · Artificial Intelligence

Mastering MinerU: Overcoming Its Top 9 Limitations for Reliable Document Parsing

This article examines MinerU's strengths and nine critical shortcomings—such as layout order errors, cross‑page table splits, merged‑cell failures, OCR misrecognition, and licensing issues—and provides concrete improvement strategies, interview‑ready resume bullets, and practical response frameworks for engineers.

LLMLayout AnalysisMinerU

0 likes · 13 min read

Mastering MinerU: Overcoming Its Top 9 Limitations for Reliable Document Parsing

Shuge Unlimited

Mar 20, 2026 · Artificial Intelligence

How a Single Gateway Manages 30+ Messaging Platforms in OpenClaw

This article dissects OpenClaw’s low‑level architecture, showing how a long‑lived Gateway process coordinates over 30 messaging platforms, how the pi‑mono embedded agent runtime drives the thinking cycle, and how the Context Engine and Session Management ensure consistent state, persistence, and extensibility.

AI AssistantContext EngineLLM

0 likes · 17 min read

How a Single Gateway Manages 30+ Messaging Platforms in OpenClaw

Machine Learning Algorithms & Natural Language Processing

Mar 19, 2026 · Artificial Intelligence

Inside Xiaomi’s Hunter Alpha: 1‑Trillion‑Parameter LLM with 1M Context and Top Global Rankings

Xiaomi’s newly unveiled MiMo‑V2‑Pro, codenamed Hunter Alpha, is a trillion‑parameter LLM with a 1 million‑token context window that tops OpenRouter usage, achieves the second‑best domestic and eighth‑best global scores on Artificial Analysis, and delivers strong benchmark results across PinchBench, ClawEval, and SWE‑bench.

LLMMiMo-V2-ProMultimodal

0 likes · 9 min read

Inside Xiaomi’s Hunter Alpha: 1‑Trillion‑Parameter LLM with 1M Context and Top Global Rankings

DeepHub IMBA

Mar 19, 2026 · Artificial Intelligence

Understanding Agent Memory: From Stateless LLMs to Persistent Multi‑Layer Architecture

The article analyzes why large language models are inherently stateless, outlines a four‑layer memory architecture for AI agents—including working, situational, semantic, and procedural memory—and explains write, retrieval, and forgetting mechanisms along with current tooling such as Mem0 and Letta.

Agent MemoryLLMLetta

0 likes · 9 min read

Understanding Agent Memory: From Stateless LLMs to Persistent Multi‑Layer Architecture

JakartaEE China Community

Mar 19, 2026 · Artificial Intelligence

Quarkus Meets LangChain4j: A Guide to Integrating Large Language Models

This article explains how the new quarkus‑langchain4j 0.1 extension lets developers embed LLMs into Quarkus applications using a declarative @RegisterAiService API, with support for tools, document stores, RAG, and full code examples.

AI integrationJavaLLM

0 likes · 13 min read

Quarkus Meets LangChain4j: A Guide to Integrating Large Language Models

PaperAgent

Mar 19, 2026 · Artificial Intelligence

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

The article presents the MDER‑DR two‑stage framework that tackles semantic loss in knowledge‑graph triple indexing by generating context‑aware entity summaries and using an LLM‑driven decompose‑parse retrieval loop, achieving up to 66% performance gains on multi‑hop question answering benchmarks.

Entity SummarizationKG QAKnowledge Graph

0 likes · 5 min read

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

AgentGuide

Mar 19, 2026 · Artificial Intelligence

What Exactly Is an AI Agent? Complete Interview Guide

This article breaks down the concept of AI agents for interview preparation, covering their definition, core components like planning, memory, and tool use, differences from plain LLM chats, real‑world challenges, typical use cases, detailed component analysis, and a runnable pseudo‑code example.

AI AgentLLMMemory

0 likes · 9 min read

What Exactly Is an AI Agent? Complete Interview Guide

Architect's Ambition

Mar 18, 2026 · Artificial Intelligence

From Zero to a Real AI Agent: Master Its Core Essence, Not Just API Calls

The article explains why an AI Agent is more than a simple LLM API call, outlines its four essential modules—memory, planning, tool use, and feedback—shows how they differ from ordinary models, and offers practical steps and common pitfalls for building a production‑grade single‑agent system.

AI AgentLLMMemory

0 likes · 13 min read

From Zero to a Real AI Agent: Master Its Core Essence, Not Just API Calls

AI Explorer

Mar 18, 2026 · Artificial Intelligence

Run and Fine‑Tune Hundreds of Open‑Source LLMs Locally with Unsloth

Unsloth offers a unified web UI that accelerates fine‑tuning by up to 2×, cuts VRAM usage by 70% (80% for RL), supports hundreds of open‑source models, and provides simple installation steps for rapid local AI experimentation.

AI workstationGPU optimizationLLM

0 likes · 6 min read

Run and Fine‑Tune Hundreds of Open‑Source LLMs Locally with Unsloth

AI Explorer

Mar 18, 2026 · Artificial Intelligence

Unlock Instant AI Agents with LangGraph‑Powered Deep Agents

Deep Agents, an open‑source framework built on LangGraph, bundles planning, file‑system tools, sub‑agent coordination and context management into a ready‑to‑run AI agent that can be launched with three lines of Python code and fully customized for diverse applications.

AI agentsAgent FrameworkDeep Agents

0 likes · 7 min read

Unlock Instant AI Agents with LangGraph‑Powered Deep Agents

DeepHub IMBA

Mar 18, 2026 · Artificial Intelligence

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

The article analyzes how most RAG pipelines blindly feed retrieved documents to LLMs, introduces CRAG's lightweight evaluator with confidence thresholds, describes its sentence‑level decomposition, filtering, and dual‑knowledge routing, and provides a full implementation walkthrough with a real insurance query example.

CRAGFAISSLLM

0 likes · 13 min read

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

o-ai.tech

Mar 18, 2026 · Artificial Intelligence

How Anthropic Builds Effective AI Agents: Practical Patterns and Principles

This guide distills Anthropic’s frontline experience into a concise framework for building high‑performing AI agents, covering the workflow‑vs‑agent distinction, five composable architecture patterns, core design principles, tool‑centric optimization, and pragmatic advice on using or bypassing agent frameworks.

AI agentsAnthropicLLM

0 likes · 9 min read

How Anthropic Builds Effective AI Agents: Practical Patterns and Principles

Huolala Tech

Mar 18, 2026 · Artificial Intelligence

Boosting LLM Accuracy: From RAG to GraphRAG for Enterprise Metadata Retrieval

This article explains the fundamentals of Retrieval‑Augmented Generation (RAG), introduces GraphRAG as an advanced architecture using knowledge graphs, details implementation pipelines, evaluates performance improvements, analyzes common pitfalls, and outlines future enhancements for enterprise metadata search.

AIGraphRAGKnowledge Graph

0 likes · 17 min read

Boosting LLM Accuracy: From RAG to GraphRAG for Enterprise Metadata Retrieval

Data STUDIO

Mar 18, 2026 · Artificial Intelligence

Building a Smart Web AI Agent with FastAPI, LangGraph, and MCP

This article walks through the design and implementation of a production‑ready Web AI agent that uses FastAPI as the HTTP layer, LangGraph to orchestrate multi‑step reasoning, and MCP to expose external tools, showing how to manage state, integrate multiple LLM providers, and extend the system with persistence, rate‑limiting, and monitoring.

AI AgentFastAPILLM

0 likes · 20 min read

Building a Smart Web AI Agent with FastAPI, LangGraph, and MCP

SuanNi

Mar 18, 2026 · Artificial Intelligence

Explore the LLM Architecture Gallery: Visualizing Seven Years of Model Evolution

The LLM Architecture Gallery, created by Sebastian Raschka, offers an interactive visual compendium of open‑weight large language models from 2019 to 2026, detailing their core parameters, architectural innovations, and the broader trends shaping modern AI research.

AIArtificial IntelligenceLLM

0 likes · 8 min read

Explore the LLM Architecture Gallery: Visualizing Seven Years of Model Evolution

AI Tech Publishing

Mar 18, 2026 · Artificial Intelligence

How Context Engineering Turns AI Agents from ‘Usable’ to ‘Highly Effective’

The article explains how organizing the prompt, tool schemas, dialogue history, and retrieved documents—collectively the context window—affects an AI agent’s decisions, introduces the concepts of Lost‑in‑the‑Middle, Thinking Tokens, tool‑response caching, compaction versus SubAgent strategies, and shows a step‑by‑step evolution that raised accuracy from 60 % to over 95 %.

AI agentsContext EngineeringLLM

0 likes · 9 min read

How Context Engineering Turns AI Agents from ‘Usable’ to ‘Highly Effective’

Machine Learning Algorithms & Natural Language Processing

Mar 17, 2026 · Artificial Intelligence

From Lists to Decision Reports: The Deep Research Framework for Recommender Systems

The paper introduces Deep Research for Recommender Systems, a multi‑agent framework called RecPilot that replaces traditional list‑based recommendations with AI‑driven exploration, trajectory simulation, and structured decision‑support reports, and demonstrates its superiority on TMALL data through extensive trajectory and report‑generation evaluations.

LLMRecPilotdecision report

0 likes · 10 min read

From Lists to Decision Reports: The Deep Research Framework for Recommender Systems

Woodpecker Software Testing

Mar 17, 2026 · Artificial Intelligence

Why Direct Prompting Beats LLM Knowledge‑Base Agents for Test‑Case Generation

The article explains that feeding requirements directly in the prompt yields far better test‑case designs than using an LLM‑powered knowledge‑base agent, because the model processes the full context without the loss and fragmentation introduced by retrieval‑augmented generation.

AI agentsKnowledge BaseLLM

0 likes · 6 min read

Why Direct Prompting Beats LLM Knowledge‑Base Agents for Test‑Case Generation

DeepHub IMBA

Mar 17, 2026 · Artificial Intelligence

Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition

The article examines how retrieval‑augmented generation suffers from poor query formulation and presents two advanced strategies—query translation, which generates multiple semantically similar variants, and query decomposition, which breaks complex questions into finer sub‑queries—detailing methods such as fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and explains when to combine them.

Hybrid RetrievalLLMQuery Decomposition

0 likes · 9 min read

Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition

IT Services Circle

Mar 17, 2026 · Artificial Intelligence

How AI Workflows and Agents Transform Automation: From Rigid Rules to Intelligent Decision‑Making

This article explains the distinction and synergy between traditional AI workflows and modern agents, outlines their four‑step processes, showcases practical examples such as intelligent customer service and content generation, and recommends tools for beginners to quickly build AI‑driven applications.

AIAgentAutomation

0 likes · 11 min read

How AI Workflows and Agents Transform Automation: From Rigid Rules to Intelligent Decision‑Making

Data Party THU

Mar 17, 2026 · Artificial Intelligence

How OpenMAIC Is Redefining AI-Powered Learning: From Multi‑Agent Labs to Classroom Revolution

OpenMAIC, the world’s first multi‑agent generative learning framework released by Tsinghua University, transforms technical documents into zero‑barrier interactive courses, supports AI‑driven lesson planning, multi‑agent discussions, and plug‑in extensions, and is rapidly evolving through 2024‑2026 to reshape education and beyond.

AI educationLLMMulti-Agent AI

0 likes · 10 min read

How OpenMAIC Is Redefining AI-Powered Learning: From Multi‑Agent Labs to Classroom Revolution

PaperAgent

Mar 17, 2026 · Artificial Intelligence

Can Attention Replace Fixed Residuals? Inside the ‘Attention Residuals’ Breakthrough

This article analyzes the newly released Attention Residuals paper, explaining how learnable attention weighting replaces fixed residual addition to mitigate information dilution in deep LLMs, detailing the proposed Block AttnRes design, engineering trade‑offs, experimental results, and its significance for foundational model architecture.

Block AttentionLLMResidual Connections

0 likes · 9 min read

Can Attention Replace Fixed Residuals? Inside the ‘Attention Residuals’ Breakthrough

AI Step-by-Step

Mar 17, 2026 · Industry Insights

How OpenClaw Redefines Enterprise Software with AI‑Powered Business Integration

The article analyzes OpenClaw as an AI‑driven business‑connection layer that unifies chat entry, tool execution, and event‑driven automation, showing how enterprises can shorten system gaps, automate long‑tail workflows, and adopt a new agent‑based service model without replacing existing ERP or CRM solutions.

AI agentsHooksLLM

0 likes · 8 min read

How OpenClaw Redefines Enterprise Software with AI‑Powered Business Integration