Tagged articles

21 articles

Page 1 of 1

May 30, 2026 · Artificial Intelligence

Claude Code Self‑Repair Explained: Writing Error Feedback into the Harness

The article shows how to turn Claude Code’s occasional mistakes into a reliable feedback loop by using a CLAUDE.md entry file, Hooks, Permissions and Skills, so errors become visible, verifiable and can be written back into the harness for future runs.

AI AgentsCLAUDE.mdClaude Code

0 likes · 22 min read

Claude Code Self‑Repair Explained: Writing Error Feedback into the Harness

DataFunTalk

May 29, 2026 · Artificial Intelligence

From Prompt to Context to Harness: Unpacking the Three Paradigm Shifts in Agent Engineering

The survey "Agent Harness Engineering: A Survey" reveals how agent systems have evolved from prompt engineering to context engineering and now to harness engineering, introduces the seven‑layer ETCLOVG framework, shows benchmark gains from better harnesses, and argues that observability, governance, and trace‑native evaluation are essential for production‑grade AI agents.

AI AgentsContext EngineeringGovernance

0 likes · 14 min read

From Prompt to Context to Harness: Unpacking the Three Paradigm Shifts in Agent Engineering

Tech Minimalism

May 7, 2026 · Artificial Intelligence

12 Reusable MCP Design Patterns for Production‑Grade Anthropic Agents

The article distills Anthropic’s production‑agent guidance into five groups of twelve concrete MCP patterns—covering tool surface design, interaction semantics, authentication, context economy, and packaging—explaining why each pattern matters, its trade‑offs, and how it helps build safe, stable, low‑cost agent integrations.

AIAnthropicDesign Patterns

0 likes · 22 min read

12 Reusable MCP Design Patterns for Production‑Grade Anthropic Agents

Top Architecture Tech Stack

Apr 27, 2026 · Artificial Intelligence

DeepSeek V4 Pro vs GPT‑5.3 Codex High: Direct Code‑Generation Test Reveals the Gap

A two‑stage evaluation compares DeepSeek V4 Pro and GPT‑5.3 Codex High on a TypeScript LRU‑Cache task and a markdown‑inspection CLI project, showing DeepSeek leads on basic code correctness while GPT‑5.3 delivers a more complete engineering solution, with detailed scores and analysis.

DeepSeek V4 ProGPT-5.3 Codex HighLLM code evaluation

0 likes · 13 min read

DeepSeek V4 Pro vs GPT‑5.3 Codex High: Direct Code‑Generation Test Reveals the Gap

AI Architecture Hub

Apr 24, 2026 · Artificial Intelligence

How Claude Code Achieves a 92% Prompt Caching Hit Rate with Three Unbreakable Engineering Rules

Claude Code’s prompt‑caching delivers a 92% hit rate, slashing a 50‑round agent session cost from $6 to $1.15 by separating stable prefixes from dynamic tails, using a three‑layer cache architecture, exact token‑sequence matching, and three strict engineering rules that keep the cache hot and reliable.

Cache Hit RateClaude CodeCost Reduction

0 likes · 13 min read

How Claude Code Achieves a 92% Prompt Caching Hit Rate with Three Unbreakable Engineering Rules

Architecture Musings

Apr 19, 2026 · Artificial Intelligence

My AI Adoption Journey: Lessons from the Terraform and Ghostty Creator

The author, Mitchell Hashimoto—co‑founder of HashiCorp and creator of Terraform and Ghostty—shares a step‑by‑step, candid account of adopting AI agents, detailing six phases from abandoning chatbots to continuously running agents, the concept of “harness engineering,” and practical insights on when and how to integrate AI into a developer workflow.

AI adoptionGhosttyHarness Engineering

0 likes · 16 min read

My AI Adoption Journey: Lessons from the Terraform and Ghostty Creator

Wu Shixiong's Large Model Academy

Apr 13, 2026 · Artificial Intelligence

Turning ReAct from Demo to Production: Handling Failures, Loops, and Token Budgets

This article explains how to upgrade a ReAct agent from a proof‑of‑concept to a production‑ready system by classifying tool failures, detecting repeated search loops, managing token budgets, and adding structured logging, complete with Python implementations and practical interview guidance.

LLMLoop DetectionToken Budgeting

0 likes · 24 min read

Turning ReAct from Demo to Production: Handling Failures, Loops, and Token Budgets

MeowKitty Programming

Apr 11, 2026 · Industry Insights

Java’s New Frontier: Master AI Agents, Not Just Code, as Oracle, Spring, JetBrains Bet

The article explains how Oracle, Spring, and JetBrains are collectively pushing Java toward an agent‑centric ecosystem, shifting the developer’s role from writing code to orchestrating AI agents, and outlines the specific capabilities, engineering practices, and risks Java engineers must adopt to stay competitive in the coming years.

AI AgentsJavaJetBrains

0 likes · 9 min read

Java’s New Frontier: Master AI Agents, Not Just Code, as Oracle, Spring, JetBrains Bet

Architect

Apr 9, 2026 · Industry Insights

Why Claude Managed Agents Are Redefining AI Workflows: A Deep Dive

Anthropic's Claude Managed Agents shift the focus from building demo loops to providing a fully hosted runtime base that handles sandboxing, state persistence, error recovery, and tool execution, enabling developers to concentrate on business logic and long‑running tasks while navigating new cost and compliance considerations.

AI Agent infrastructureClaude Managed Agentsagent engineering

0 likes · 23 min read

Why Claude Managed Agents Are Redefining AI Workflows: A Deep Dive

AI Tech Publishing

Apr 4, 2026 · Artificial Intelligence

Become a World-Class Agent Engineer: Master Context, Rules, and Termination Conditions

This guide explains how to become a world‑class Agent engineer by managing context bloat, defining clear rules and skills, separating research from implementation, using neutral prompts, and writing explicit termination contracts, while emphasizing that the final results remain the developer’s responsibility.

ClaudeCodex CLIContext Bloat

0 likes · 17 min read

Become a World-Class Agent Engineer: Master Context, Rules, and Termination Conditions

Radish, Keep Going!

Mar 31, 2026 · Artificial Intelligence

Why Agent‑First Systems Fail and How Harness Engineering Fixes Them

The article analyzes OpenAI’s Harness Engineering approach, explains four systemic failure modes of LLM‑driven agents, and details five modular components—readable environment, task state machine, verification loop, architectural constraints, and loop detection—that together enable reliable, large‑scale agent development.

AIHarnessLLM

0 likes · 17 min read

Why Agent‑First Systems Fail and How Harness Engineering Fixes Them

Yunqi AI+

Mar 27, 2026 · Artificial Intelligence

From AI Assistants to Production Agents: How Harness Becomes Core Infrastructure

The article explains how AI‑driven software is shifting from simple functional tools to result‑oriented autonomous systems, and argues that building production‑grade agents requires a dedicated engineering layer—called Harness—that provides task orchestration, state management, tool integration, observability, security, and governance.

AI AgentsHarnessObservability

0 likes · 21 min read

From AI Assistants to Production Agents: How Harness Becomes Core Infrastructure

Alibaba Cloud Native

Mar 26, 2026 · Artificial Intelligence

Why Harness Engineering Is the Next Frontier for AI Agents

The article examines the emerging paradigm of Harness Engineering, tracing its roots from the industrial and information revolutions to AI, and presents four real‑world case studies that demonstrate how prompt, context, and feedback engineering can dramatically improve large‑language‑model agents while highlighting open‑source tools for building scalable, collaborative AI systems.

AIContext EngineeringHarness Engineering

0 likes · 17 min read

Why Harness Engineering Is the Next Frontier for AI Agents

Design Hub

Mar 26, 2026 · Artificial Intelligence

How Anthropic Advances Agent Development: From Code Writing to 4‑6 Hour Autonomy

Anthropic’s recent engineering paper shows that the next breakthrough in AI agents is not whether they can write code, but how to organize them into a planner‑generator‑evaluator harness that can work continuously for four to six hours, handle self‑evaluation, context anxiety, and deliver usable applications.

AI autonomyagent engineeringcontext anxiety

0 likes · 16 min read

How Anthropic Advances Agent Development: From Code Writing to 4‑6 Hour Autonomy

AI Waka

Mar 25, 2026 · Artificial Intelligence

Why Persistent Specs Matter: Building Reliable AI Agents with an Artifact Layer

The article explains how an artifact layer—comprising specs, guidance files, skills, tests, and logs—preserves intent across AI agent sessions, enabling reliable, secure, and maintainable agent‑driven software development through spec‑first practices, bounded loops, and robust verification stacks.

AI AgentsSpec-Driven Developmentagent engineering

0 likes · 16 min read

Why Persistent Specs Matter: Building Reliable AI Agents with an Artifact Layer

Machine Learning Algorithms & Natural Language Processing

Feb 13, 2026 · Industry Insights

Meta and OpenAI Court OpenClaw: Zuckerberg Tests It, Ultraman Offers Compute Power

OpenClaw, the open‑source AI agent framework created by Peter Steinberger, has attracted acquisition overtures from Meta and OpenAI, amassed 189 k GitHub stars in under a month, and sparked discussions about its rapid prototype development, agent‑driven engineering, and the future of app‑less AI services.

AI AgentsFuture of appsGitHub stars

0 likes · 10 min read

Meta and OpenAI Court OpenClaw: Zuckerberg Tests It, Ultraman Offers Compute Power

Architect

Feb 10, 2026 · Artificial Intelligence

Why Pi’s Minimalist Architecture Powers OpenClaw’s AI Coding Agent

The article explains how the ultra‑minimal Pi engine—built around just four tools, a tree‑shaped session model, and an extensible plug‑in system—provides a clean, auditable, and secure foundation for OpenClaw’s AI‑driven code‑writing capabilities, while highlighting practical extensions, engineering constraints, and trade‑offs.

AI coding agentExtensible architectureMinimalist design

0 likes · 16 min read

Why Pi’s Minimalist Architecture Powers OpenClaw’s AI Coding Agent

大转转FE

Jan 26, 2026 · Artificial Intelligence

Exploring AI Agent Development: Tools, Case Studies, and the Future of Engineering

This newsletter curates five in‑depth articles on AI agents, covering a week‑long Vibe Coding desktop assistant project, a deep dive into Claude Agent SDK tools, Huolala’s Agent Skills implementation, the shift to “Agent Engineer” roles, and the evolving opportunities for engineers in the AI era.

AI toolsClaude SDKSoftware Architecture

0 likes · 7 min read

Exploring AI Agent Development: Tools, Case Studies, and the Future of Engineering

Wu Shixiong's Large Model Academy

Nov 14, 2025 · Artificial Intelligence

How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework

This article explains why function‑call accuracy is critical for LLM agents, identifies four common failure causes, and presents a systematic, five‑step engineering framework—including dynamic routing, chain‑of‑thought planning, result validation, memory injection, and log‑driven optimization—backed by concrete examples and quantitative improvements.

Function CallingInterview preparationLLM

0 likes · 10 min read

How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework

Architecture and Beyond

Nov 2, 2025 · Artificial Intelligence

Why AI Agents Still Fall Short: Key Challenges and Real-World Solutions

The article examines why current AI agents fall short of expectations, highlighting weak business understanding, limited execution, controllability issues, high customization costs, and the gap between model capabilities and engineering, while proposing SaaS firms' advantages, vertical scenario focus, security concerns, and future development trends.

AI AgentsAI safetyVertical AI

0 likes · 11 min read

Why AI Agents Still Fall Short: Key Challenges and Real-World Solutions

Alibaba Cloud Native

Jun 12, 2025 · Artificial Intelligence

Why AI Agent Engineering Matters: From Product Design to Technical Architecture

This article breaks down AI agent engineering into product and technical engineering, explains how demand modeling, UI/UX design, prompt engineering, multi‑agent coordination, and observability combine to make AI agents usable, scalable, and trustworthy, and shows concrete examples and implementation patterns.

AIObservabilityagent engineering

0 likes · 23 min read

Why AI Agent Engineering Matters: From Product Design to Technical Architecture