Author

AI Tech Publishing

In the fast-evolving AI era, we thoroughly explain stable technical foundations.

Articles

Likes

Views

Comments

Latest from AI Tech Publishing

81 recent articles

AI Tech Publishing

Jan 10, 2026 · Artificial Intelligence

Anthropic Engineers Reveal a Pragmatic Framework for Evaluating AI Agents

Anthropic engineers outline why rigorous AI Agent evaluation is essential, describe a comprehensive evaluation harness with tasks, trials, graders, and transcripts, compare capability and regression tests, discuss code-, model-, and human-based graders, and present an eight-step roadmap for building reliable Agent assessment pipelines.

Capability EvaluationCode-based GraderEvaluation Framework

0 likes · 12 min read

Anthropic Engineers Reveal a Pragmatic Framework for Evaluating AI Agents

AI Tech Publishing

Dec 22, 2025 · Artificial Intelligence

How Agent Skills and MCP Servers Work Together

This article explains how Anthropic's Skills and Model Context Protocol (MCP) servers complement each other to let Claude agents follow specific workflows, access external tools, and produce consistent, reliable outputs, illustrated with real‑world use cases and a quick reference guide.

AI agentsAnthropicClaude

0 likes · 13 min read

How Agent Skills and MCP Servers Work Together

AI Tech Publishing

Nov 30, 2025 · Artificial Intelligence

Agent Architecture Design Part 1: Context Compression Strategies and Their Use Cases

The article explains why large‑model agents need context compression, outlines five engineering‑level schemes (both lossless and lossy), demonstrates each with concrete XML snippets and step‑by‑step reasoning, and advises using lossless methods before resorting to lossy prompt‑driven compression.

AgentContext CompressionLLM

0 likes · 12 min read

Agent Architecture Design Part 1: Context Compression Strategies and Their Use Cases

AI Tech Publishing

Nov 25, 2025 · Artificial Intelligence

Three New Ways to Tackle Agent Context Engineering with Claude’s Tools

Anthropic’s recent release introduces three advanced capabilities—Tool Search, Programmatic Tool Calling, and Tool Use Examples—that reduce token consumption, avoid context pollution, and improve tool‑calling accuracy for AI agents, with detailed benchmarks, code samples, and guidance on when each feature is most effective.

AI agentsClaudeTool Search

0 likes · 24 min read

Three New Ways to Tackle Agent Context Engineering with Claude’s Tools

AI Tech Publishing

Nov 23, 2025 · Artificial Intelligence

How Agents Leverage File Systems for Context Engineering

The article examines why file system access is crucial for autonomous agents, outlining common context‑engineering failures such as missing, excessive, or irrelevant information, and demonstrates how using file‑system tools like ls, grep, and write‑file can reduce token waste, enable dynamic storage, improve targeted search, and support continual learning.

LLMautonomous agentscontext engineering

0 likes · 11 min read

How Agents Leverage File Systems for Context Engineering

AI Tech Publishing

Nov 20, 2025 · Artificial Intelligence

Million‑Dollar AI Playbook: From Prompt Engineering to Agents – Anthropic’s Full PDF Unpacked

Anthropic’s enterprise guide shows how early adopters boost productivity—20‑35% faster customer service, 30‑50% higher content output, 15% less coding time—and outlines a four‑step framework, prompt‑engineering formula, and agent roadmap to turn AI into measurable business value.

AI implementationAnthropicLLMOps

0 likes · 10 min read

Million‑Dollar AI Playbook: From Prompt Engineering to Agents – Anthropic’s Full PDF Unpacked

AI Tech Publishing

Nov 17, 2025 · Artificial Intelligence

Frontier AI Models in RL Environments Reveal an Agent Capability Hierarchy

The article evaluates nine cutting‑edge AI models on 150 simulated workplace tasks, showing that even the strongest models complete fewer than 40% of tasks, and uses these results to propose a hierarchical framework of agentic capabilities ranging from tool use to common‑sense reasoning.

AI model evaluationagentic capabilitiescommon sense reasoning

0 likes · 19 min read

Frontier AI Models in RL Environments Reveal an Agent Capability Hierarchy

AI Tech Publishing

Nov 13, 2025 · Artificial Intelligence

Claude’s Prompt Engineering Best Practices: A Step‑by‑Step Guide

This guide outlines Claude team’s best practices for prompt engineering, covering core techniques such as clear instructions, background context, specificity, examples, and advanced methods like pre‑filled responses, chain‑of‑thought, output formatting, and prompt chaining, with concrete examples and code snippets.

AI promptingClaudeLLM

0 likes · 18 min read

Claude’s Prompt Engineering Best Practices: A Step‑by‑Step Guide

AI Tech Publishing

Nov 10, 2025 · Artificial Intelligence

The Real Barriers to Deploying AI Agents: Workflow, Trust, and Data Privacy

A survey of over 30 AI‑agent founders and 40 enterprise users reveals that the biggest obstacles to AI‑agent adoption are workflow integration and human interaction (60%), employee resistance (50%) and data‑privacy concerns (50%), while successful deployments rely on modest, well‑positioned use‑cases, hands‑on support, and clear pricing models.

AI agentsData PrivacyEmployee Trust

0 likes · 10 min read

The Real Barriers to Deploying AI Agents: Workflow, Trust, and Data Privacy

AI Tech Publishing

Nov 5, 2025 · Industry Insights

Why ToB AI Agents Fail: Model Limits and the Tech‑Business Gap

The article analyzes why ToB AI agents struggle to succeed, pinpointing two core issues: inadequate model capabilities that force temporary engineering patches, and a disconnect between technical staff who understand AI and business staff who understand domain needs.

AI agentsEnterprise AIToB

0 likes · 2 min read

Why ToB AI Agents Fail: Model Limits and the Tech‑Business Gap