Tagged articles
21 articles
Page 1 of 1
Architect
Architect
May 30, 2026 · Artificial Intelligence

Claude Code Self‑Repair Explained: Writing Error Feedback into the Harness

The article shows how to turn Claude Code’s occasional mistakes into a reliable feedback loop by using a CLAUDE.md entry file, Hooks, Permissions and Skills, so errors become visible, verifiable and can be written back into the harness for future runs.

AI AgentsCLAUDE.mdClaude Code
0 likes · 22 min read
Claude Code Self‑Repair Explained: Writing Error Feedback into the Harness
DataFunTalk
DataFunTalk
May 29, 2026 · Artificial Intelligence

From Prompt to Context to Harness: Unpacking the Three Paradigm Shifts in Agent Engineering

The survey "Agent Harness Engineering: A Survey" reveals how agent systems have evolved from prompt engineering to context engineering and now to harness engineering, introduces the seven‑layer ETCLOVG framework, shows benchmark gains from better harnesses, and argues that observability, governance, and trace‑native evaluation are essential for production‑grade AI agents.

AI AgentsContext EngineeringGovernance
0 likes · 14 min read
From Prompt to Context to Harness: Unpacking the Three Paradigm Shifts in Agent Engineering
Tech Minimalism
Tech Minimalism
May 7, 2026 · Artificial Intelligence

12 Reusable MCP Design Patterns for Production‑Grade Anthropic Agents

The article distills Anthropic’s production‑agent guidance into five groups of twelve concrete MCP patterns—covering tool surface design, interaction semantics, authentication, context economy, and packaging—explaining why each pattern matters, its trade‑offs, and how it helps build safe, stable, low‑cost agent integrations.

AIAnthropicDesign Patterns
0 likes · 22 min read
12 Reusable MCP Design Patterns for Production‑Grade Anthropic Agents
Top Architecture Tech Stack
Top Architecture Tech Stack
Apr 27, 2026 · Artificial Intelligence

DeepSeek V4 Pro vs GPT‑5.3 Codex High: Direct Code‑Generation Test Reveals the Gap

A two‑stage evaluation compares DeepSeek V4 Pro and GPT‑5.3 Codex High on a TypeScript LRU‑Cache task and a markdown‑inspection CLI project, showing DeepSeek leads on basic code correctness while GPT‑5.3 delivers a more complete engineering solution, with detailed scores and analysis.

DeepSeek V4 ProGPT-5.3 Codex HighLLM code evaluation
0 likes · 13 min read
DeepSeek V4 Pro vs GPT‑5.3 Codex High: Direct Code‑Generation Test Reveals the Gap
AI Architecture Hub
AI Architecture Hub
Apr 24, 2026 · Artificial Intelligence

How Claude Code Achieves a 92% Prompt Caching Hit Rate with Three Unbreakable Engineering Rules

Claude Code’s prompt‑caching delivers a 92% hit rate, slashing a 50‑round agent session cost from $6 to $1.15 by separating stable prefixes from dynamic tails, using a three‑layer cache architecture, exact token‑sequence matching, and three strict engineering rules that keep the cache hot and reliable.

Cache Hit RateClaude CodeCost Reduction
0 likes · 13 min read
How Claude Code Achieves a 92% Prompt Caching Hit Rate with Three Unbreakable Engineering Rules
Architecture Musings
Architecture Musings
Apr 19, 2026 · Artificial Intelligence

My AI Adoption Journey: Lessons from the Terraform and Ghostty Creator

The author, Mitchell Hashimoto—co‑founder of HashiCorp and creator of Terraform and Ghostty—shares a step‑by‑step, candid account of adopting AI agents, detailing six phases from abandoning chatbots to continuously running agents, the concept of “harness engineering,” and practical insights on when and how to integrate AI into a developer workflow.

AI adoptionGhosttyHarness Engineering
0 likes · 16 min read
My AI Adoption Journey: Lessons from the Terraform and Ghostty Creator
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 13, 2026 · Artificial Intelligence

Turning ReAct from Demo to Production: Handling Failures, Loops, and Token Budgets

This article explains how to upgrade a ReAct agent from a proof‑of‑concept to a production‑ready system by classifying tool failures, detecting repeated search loops, managing token budgets, and adding structured logging, complete with Python implementations and practical interview guidance.

LLMLoop DetectionToken Budgeting
0 likes · 24 min read
Turning ReAct from Demo to Production: Handling Failures, Loops, and Token Budgets
MeowKitty Programming
MeowKitty Programming
Apr 11, 2026 · Industry Insights

Java’s New Frontier: Master AI Agents, Not Just Code, as Oracle, Spring, JetBrains Bet

The article explains how Oracle, Spring, and JetBrains are collectively pushing Java toward an agent‑centric ecosystem, shifting the developer’s role from writing code to orchestrating AI agents, and outlines the specific capabilities, engineering practices, and risks Java engineers must adopt to stay competitive in the coming years.

AI AgentsJavaJetBrains
0 likes · 9 min read
Java’s New Frontier: Master AI Agents, Not Just Code, as Oracle, Spring, JetBrains Bet
Architect
Architect
Apr 9, 2026 · Industry Insights

Why Claude Managed Agents Are Redefining AI Workflows: A Deep Dive

Anthropic's Claude Managed Agents shift the focus from building demo loops to providing a fully hosted runtime base that handles sandboxing, state persistence, error recovery, and tool execution, enabling developers to concentrate on business logic and long‑running tasks while navigating new cost and compliance considerations.

AI Agent infrastructureClaude Managed Agentsagent engineering
0 likes · 23 min read
Why Claude Managed Agents Are Redefining AI Workflows: A Deep Dive
AI Tech Publishing
AI Tech Publishing
Apr 4, 2026 · Artificial Intelligence

Become a World-Class Agent Engineer: Master Context, Rules, and Termination Conditions

This guide explains how to become a world‑class Agent engineer by managing context bloat, defining clear rules and skills, separating research from implementation, using neutral prompts, and writing explicit termination contracts, while emphasizing that the final results remain the developer’s responsibility.

ClaudeCodex CLIContext Bloat
0 likes · 17 min read
Become a World-Class Agent Engineer: Master Context, Rules, and Termination Conditions
Radish, Keep Going!
Radish, Keep Going!
Mar 31, 2026 · Artificial Intelligence

Why Agent‑First Systems Fail and How Harness Engineering Fixes Them

The article analyzes OpenAI’s Harness Engineering approach, explains four systemic failure modes of LLM‑driven agents, and details five modular components—readable environment, task state machine, verification loop, architectural constraints, and loop detection—that together enable reliable, large‑scale agent development.

AIHarnessLLM
0 likes · 17 min read
Why Agent‑First Systems Fail and How Harness Engineering Fixes Them
Yunqi AI+
Yunqi AI+
Mar 27, 2026 · Artificial Intelligence

From AI Assistants to Production Agents: How Harness Becomes Core Infrastructure

The article explains how AI‑driven software is shifting from simple functional tools to result‑oriented autonomous systems, and argues that building production‑grade agents requires a dedicated engineering layer—called Harness—that provides task orchestration, state management, tool integration, observability, security, and governance.

AI AgentsHarnessObservability
0 likes · 21 min read
From AI Assistants to Production Agents: How Harness Becomes Core Infrastructure
Alibaba Cloud Native
Alibaba Cloud Native
Mar 26, 2026 · Artificial Intelligence

Why Harness Engineering Is the Next Frontier for AI Agents

The article examines the emerging paradigm of Harness Engineering, tracing its roots from the industrial and information revolutions to AI, and presents four real‑world case studies that demonstrate how prompt, context, and feedback engineering can dramatically improve large‑language‑model agents while highlighting open‑source tools for building scalable, collaborative AI systems.

AIContext EngineeringHarness Engineering
0 likes · 17 min read
Why Harness Engineering Is the Next Frontier for AI Agents
Design Hub
Design Hub
Mar 26, 2026 · Artificial Intelligence

How Anthropic Advances Agent Development: From Code Writing to 4‑6 Hour Autonomy

Anthropic’s recent engineering paper shows that the next breakthrough in AI agents is not whether they can write code, but how to organize them into a planner‑generator‑evaluator harness that can work continuously for four to six hours, handle self‑evaluation, context anxiety, and deliver usable applications.

AI autonomyagent engineeringcontext anxiety
0 likes · 16 min read
How Anthropic Advances Agent Development: From Code Writing to 4‑6 Hour Autonomy
AI Waka
AI Waka
Mar 25, 2026 · Artificial Intelligence

Why Persistent Specs Matter: Building Reliable AI Agents with an Artifact Layer

The article explains how an artifact layer—comprising specs, guidance files, skills, tests, and logs—preserves intent across AI agent sessions, enabling reliable, secure, and maintainable agent‑driven software development through spec‑first practices, bounded loops, and robust verification stacks.

AI AgentsSpec-Driven Developmentagent engineering
0 likes · 16 min read
Why Persistent Specs Matter: Building Reliable AI Agents with an Artifact Layer

Meta and OpenAI Court OpenClaw: Zuckerberg Tests It, Ultraman Offers Compute Power

OpenClaw, the open‑source AI agent framework created by Peter Steinberger, has attracted acquisition overtures from Meta and OpenAI, amassed 189 k GitHub stars in under a month, and sparked discussions about its rapid prototype development, agent‑driven engineering, and the future of app‑less AI services.

AI AgentsFuture of appsGitHub stars
0 likes · 10 min read
Meta and OpenAI Court OpenClaw: Zuckerberg Tests It, Ultraman Offers Compute Power
Architect
Architect
Feb 10, 2026 · Artificial Intelligence

Why Pi’s Minimalist Architecture Powers OpenClaw’s AI Coding Agent

The article explains how the ultra‑minimal Pi engine—built around just four tools, a tree‑shaped session model, and an extensible plug‑in system—provides a clean, auditable, and secure foundation for OpenClaw’s AI‑driven code‑writing capabilities, while highlighting practical extensions, engineering constraints, and trade‑offs.

AI coding agentExtensible architectureMinimalist design
0 likes · 16 min read
Why Pi’s Minimalist Architecture Powers OpenClaw’s AI Coding Agent
大转转FE
大转转FE
Jan 26, 2026 · Artificial Intelligence

Exploring AI Agent Development: Tools, Case Studies, and the Future of Engineering

This newsletter curates five in‑depth articles on AI agents, covering a week‑long Vibe Coding desktop assistant project, a deep dive into Claude Agent SDK tools, Huolala’s Agent Skills implementation, the shift to “Agent Engineer” roles, and the evolving opportunities for engineers in the AI era.

AI toolsClaude SDKSoftware Architecture
0 likes · 7 min read
Exploring AI Agent Development: Tools, Case Studies, and the Future of Engineering
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 14, 2025 · Artificial Intelligence

How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework

This article explains why function‑call accuracy is critical for LLM agents, identifies four common failure causes, and presents a systematic, five‑step engineering framework—including dynamic routing, chain‑of‑thought planning, result validation, memory injection, and log‑driven optimization—backed by concrete examples and quantitative improvements.

Function CallingInterview preparationLLM
0 likes · 10 min read
How to Engineer Reliable Function Calls for LLM Agents: An End‑to‑End Framework
Architecture and Beyond
Architecture and Beyond
Nov 2, 2025 · Artificial Intelligence

Why AI Agents Still Fall Short: Key Challenges and Real-World Solutions

The article examines why current AI agents fall short of expectations, highlighting weak business understanding, limited execution, controllability issues, high customization costs, and the gap between model capabilities and engineering, while proposing SaaS firms' advantages, vertical scenario focus, security concerns, and future development trends.

AI AgentsAI safetyVertical AI
0 likes · 11 min read
Why AI Agents Still Fall Short: Key Challenges and Real-World Solutions
Alibaba Cloud Native
Alibaba Cloud Native
Jun 12, 2025 · Artificial Intelligence

Why AI Agent Engineering Matters: From Product Design to Technical Architecture

This article breaks down AI agent engineering into product and technical engineering, explains how demand modeling, UI/UX design, prompt engineering, multi‑agent coordination, and observability combine to make AI agents usable, scalable, and trustworthy, and shows concrete examples and implementation patterns.

AIObservabilityagent engineering
0 likes · 23 min read
Why AI Agent Engineering Matters: From Product Design to Technical Architecture