Tagged articles

108 articles

Page 1 of 2

May 28, 2026 · Artificial Intelligence

How Claude Code Prompt Caching Cuts AI Costs by Up to 90% and Boosts Efficiency

Prompt Caching in Anthropic's Claude Code replaces repeated processing of identical prompt prefixes with a prefix‑hash cache, slashing input‑token costs by up to 90%, reducing first‑token latency by 79%, and improving throughput, while preserving model output exactly as if no cache were used.

AI EngineeringCache InvalidationCache Metrics

0 likes · 30 min read

How Claude Code Prompt Caching Cuts AI Costs by Up to 90% and Boosts Efficiency

AI Large-Model Wave and Transformation Guide

May 26, 2026 · Artificial Intelligence

Qian Xuesen’s 1954 Engineering Control Theory: The Unexpected Blueprint for Large‑Model Harnessing and Ontology

The article links Qian Xuesen’s 1954 work on engineering control theory to today’s challenges in large‑model training, arguing that a three‑step framework—ontology (defining what to control), control theory (designing how to control), and harness (accurate measurement)—is essential for reliable AI systems across domains such as medicine, law, and multimodal perception.

AI Engineeringcontrol theoryharness testing

0 likes · 9 min read

Qian Xuesen’s 1954 Engineering Control Theory: The Unexpected Blueprint for Large‑Model Harnessing and Ontology

IT Services Circle

May 19, 2026 · Artificial Intelligence

Peter Steinberger’s $1.3 M Monthly Token Bill: OpenAI’s Subsidy Powers a 100‑Agent OpenClaw

Peter Steinberger revealed that his OpenAI API usage cost $1.3 million in the past 30 days, consuming 6 030 billion tokens across 7.6 million requests, most of which power a cloud‑run fleet of about 100 Codex agents that automate OpenClaw development, prompting a debate on AI‑driven software costs.

AI EngineeringCodexOpenAI

0 likes · 7 min read

Peter Steinberger’s $1.3 M Monthly Token Bill: OpenAI’s Subsidy Powers a 100‑Agent OpenClaw

Architect

May 17, 2026 · Artificial Intelligence

Agent Skills Survey: How Process Knowledge Becomes Technical Debt

The recent arXiv survey on Agent Skills maps the full lifecycle of skills—representation, acquisition, retrieval, and evolution—and warns that unchecked growth can turn a valuable process asset into technical debt, urging teams to enforce admission quality, robust routing, versioning, testing, and retirement mechanisms.

AI EngineeringAgent SkillsProcess Assets

0 likes · 26 min read

Agent Skills Survey: How Process Knowledge Becomes Technical Debt

James' Growth Diary

May 17, 2026 · Artificial Intelligence

Deep Dive into Claude Code Hooks: Stop Hooks and the Self‑Validation Loop

The article dissects Claude Code's Hooks system, detailing its 27 lifecycle events, four hook types, the special behavior of Stop Hooks with exit code 2, the self‑validation loop, practical patterns like the Ralph Loop, and the design trade‑offs and mitigation strategies.

AI EngineeringClaude CodeSelf-Validation

0 likes · 15 min read

Deep Dive into Claude Code Hooks: Stop Hooks and the Self‑Validation Loop

AI Architecture Hub

May 10, 2026 · Artificial Intelligence

2026 AI Engineer Roadmap: Master Agent Engineering and Scheduling

This guide outlines a six‑stage, 17‑week roadmap for becoming a production‑ready AI agent engineer by 2026, detailing essential skills such as LangGraph orchestration, Claude Agent SDK scheduling, context‑engineering primitives, evaluation pipelines, and curated free resources while warning against over‑hyped frameworks.

AI EngineeringAgentic SystemsClaude Agent SDK

0 likes · 18 min read

2026 AI Engineer Roadmap: Master Agent Engineering and Scheduling

ITPUB

May 8, 2026 · Artificial Intelligence

How Oracle Skills Open Source Signals the Rise of the AI Skill Era

Oracle has open‑sourced its Skills repository on GitHub, providing over 100 curated, version‑compatible guides for Oracle Database, OCI, GraalVM, Fusion and APEX, and defining a new AI‑centric “Skill” abstraction that lets agents safely generate and execute database operations, heralding a Skill‑driven AI engineering era.

AI EngineeringAI agentsDatabase Skills

0 likes · 16 min read

How Oracle Skills Open Source Signals the Rise of the AI Skill Era

Architect

May 5, 2026 · Artificial Intelligence

From Anthropic to Google: Agent Skills Enter the Design‑Pattern Era

Google Cloud Tech’s recent article outlines five Agent Skill design patterns, building on Anthropic’s earlier work that standardized Skill format and loading, and shows how the community is shifting from merely defining Skill syntax to engineering robust, reusable workflow structures for AI agents.

AI EngineeringAgent SkillsDesign Patterns

0 likes · 25 min read

From Anthropic to Google: Agent Skills Enter the Design‑Pattern Era

AI Tech Publishing

May 1, 2026 · Artificial Intelligence

5 Counterintuitive Design Principles for Prompt Caching in Claude Code

The article details five counterintuitive design principles for Claude Code's prompt caching—optimizing prompt layout, using message‑based updates, never switching models or tools mid‑conversation, safely compressing context, and monitoring cache health—backed by concrete examples and up to 90% cost savings.

AI EngineeringCache OptimizationClaude Code

0 likes · 10 min read

5 Counterintuitive Design Principles for Prompt Caching in Claude Code

Architect

May 1, 2026 · Artificial Intelligence

From Vibe Coding to Agentic Engineering: How AI Is Redefining the Engineer‑Architect Boundary

Karpathy’s 2026 Sequoia AI Ascent interview shows that while Vibe Coding lowers the barrier for rapid prototyping, the emerging Agentic Engineering paradigm pushes AI agents into the full software‑development lifecycle, demanding new control planes, verification, context handling and blurring the line between senior engineers and architects.

AI EngineeringControl PlaneSoftware Architecture

0 likes · 34 min read

From Vibe Coding to Agentic Engineering: How AI Is Redefining the Engineer‑Architect Boundary

ZhiKe AI

May 1, 2026 · Artificial Intelligence

From Chatbot to Action: How Large‑Model Agents Turn Queries into Real‑World Tasks

The article explains that large‑model agents differ from traditional chatbots by perceiving goals, planning steps, invoking tools, and executing actions autonomously, covering their definition, core modules, ReAct reasoning‑acting loop, single‑ versus multi‑agent systems, current industry trends, and the reliability, safety, observability, and cost challenges they face.

AI EngineeringAI agentAgent Architecture

0 likes · 18 min read

From Chatbot to Action: How Large‑Model Agents Turn Queries into Real‑World Tasks

ZhiKe AI

Apr 30, 2026 · R&D Management

Why Martin Fowler Says Determinism Is Over in Software Engineering

Martin Fowler argues that software engineering has moved from a deterministic world to a nondeterministic one driven by LLMs, outlining how this paradigm shift reshapes development practices, introduces new risks, and demands a harness‑based engineering approach to manage uncertainty.

AI EngineeringHarness EngineeringMartin Fowler

0 likes · 15 min read

Why Martin Fowler Says Determinism Is Over in Software Engineering

Wu Shixiong's Large Model Academy

Apr 29, 2026 · Interview Experience

ByteDance Interviewer Asks: What Rank r Do You Use for LoRA? I Said 64—He Said I'm Wasting GPU Memory

The article examines a common interview scenario where candidates are asked about LoRA rank selection, outlines two typical mistakes—guessing or staying silent—and presents a three‑step strategy of honest boundary setting, logical derivation, and asking a focused question, illustrating the approach with concrete LoRA calculations and a vLLM case study.

AI EngineeringLoRAinterview strategy

0 likes · 13 min read

ByteDance Interviewer Asks: What Rank r Do You Use for LoRA? I Said 64—He Said I'm Wasting GPU Memory

Java Web Project

Apr 25, 2026 · Artificial Intelligence

Why GPT-5.5’s Silent Release Signals Real Engineering Power

OpenAI’s April 23, 2026 launch of GPT-5.5 delivers record‑high scores on SWE‑Bench Pro (58.6%) and Terminal‑Bench 2.0 (82.7%), adds persistent multi‑file context, dynamic reasoning time, and token efficiency, while real‑world case studies show substantial productivity gains across engineering teams.

AI EngineeringCodexGPT-5.5

0 likes · 13 min read

Why GPT-5.5’s Silent Release Signals Real Engineering Power

Architecture and Beyond

Apr 25, 2026 · Artificial Intelligence

Practical Insights on Recent AI Engineering Deployments

The article examines how large language models function as probabilistic components within deterministic software, discusses fault‑tolerance limits for viable AI use cases, and offers detailed engineering guidance on RAG pipelines, tool‑calling determinism, agent fragility, testing, monitoring, and privacy‑conscious deployment in finance.

AI EngineeringAgent ArchitectureLLM

0 likes · 14 min read

Practical Insights on Recent AI Engineering Deployments

Huawei Cloud Developer Alliance

Apr 23, 2026 · Artificial Intelligence

Why Agent Harness Is Central to AI Engineering: OfficeClaw Design & Implementation

The article explains how Agent Harness, defined by six core components (Execution Loop, Tool Registry, Context Manager, State Store, Lifecycle Hooks, Evaluation Interface), forms the operating system for AI agents, and details Huawei Cloud OfficeClaw’s layered architecture and real‑world deployment that boosts task reliability and efficiency.

AI EngineeringAgent HarnessContext Management

0 likes · 11 min read

Why Agent Harness Is Central to AI Engineering: OfficeClaw Design & Implementation

AI Architecture Hub

Apr 23, 2026 · Artificial Intelligence

Why Prompt Caching Is Critical: Lessons from Building Claude Code

Prompt caching, a prefix‑matching technique that reuses prior LLM interactions, proved essential for Claude Code’s low latency and cost, and the article details counter‑intuitive practices such as arranging static prompts first, updating info via messages, avoiding mid‑session model or tool changes, and ensuring cache‑safe context forks.

AI EngineeringCache OptimizationClaude Code

0 likes · 10 min read

Why Prompt Caching Is Critical: Lessons from Building Claude Code

ZhiKe AI

Apr 22, 2026 · Artificial Intelligence

Why Harness Engineering Is the Hottest AI Engineering Paradigm in 2026

The article explains how the emerging "Harness Engineering" paradigm—highlighted by OpenAI, Stripe and Anthropic—shifts AI development from prompt tweaking to building full control systems, promising ten‑fold efficiency gains, new architectural components, and both opportunities and risks for developers.

AI EngineeringHarness EngineeringRisk analysis

0 likes · 9 min read

Why Harness Engineering Is the Hottest AI Engineering Paradigm in 2026

Wu Shixiong's Large Model Academy

Apr 21, 2026 · Artificial Intelligence

When Should an LLM Agent Extract Memory? A Deep Dive into Trigger Strategies

The article analyzes why memory extraction in LLM‑driven agents incurs cost, compares four frameworks—Claude Code, Generative Agents, MemGPT, and Mem0—detailing their trigger mechanisms, concurrency handling, and trade‑offs, and offers practical guidance for choosing the right strategy in real‑time, social, or batch‑processing scenarios.

AI EngineeringAgent DesignLLM

0 likes · 18 min read

When Should an LLM Agent Extract Memory? A Deep Dive into Trigger Strategies

Wu Shixiong's Large Model Academy

Apr 20, 2026 · Artificial Intelligence

Why Java Skills Alone Won’t Cut It for LLM Application Engineering

The article debunks the myth that Java developers only need a bit of AI knowledge to succeed in LLM application roles, explaining the full engineering stack—from retrieval and prompt design to deployment and performance tuning—through real‑world examples, metrics, and interview‑ready advice.

AI EngineeringInterview preparationLLM

0 likes · 13 min read

Why Java Skills Alone Won’t Cut It for LLM Application Engineering

Baobao Algorithm Notes

Apr 20, 2026 · Industry Insights

From Prompt Writer to Harness Architect: Redefining the Algorithm Engineer in the LLM Era

The article analyzes how the rise of foundation models shifts algorithm engineers from hand‑crafting models to building robust Harness environments, detailing OpenAI’s agent‑first experiments, the new "Model + Harness" formula, and practical steps for staying valuable in a prompt‑centric world.

AI EngineeringHarness architectureLLM

0 likes · 9 min read

From Prompt Writer to Harness Architect: Redefining the Algorithm Engineer in the LLM Era

MeowKitty Programming

Apr 19, 2026 · Artificial Intelligence

Why Java Developers Can Now Treat AI as a Full Engineering Stack

The article explains how recent releases like Java 26 and Spring AI 2.0 have turned Java‑AI from a hobbyist demo into a mature, production‑ready engineering stack, outlining the practical steps Java teams should follow to integrate AI into existing systems.

AIAI EngineeringJava

0 likes · 8 min read

Why Java Developers Can Now Treat AI as a Full Engineering Stack

Su San Talks Tech

Apr 19, 2026 · Artificial Intelligence

Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank

To make Retrieval‑Augmented Generation reliable in production, the article outlines five key engineering tactics—semantic chunking with metadata, hybrid vector‑keyword search, two‑stage retrieval with reranking, query rewriting and expansion, and dynamic result evaluation—each illustrated with concrete examples and code snippets.

AI EngineeringHybrid SearchQuery Rewriting

0 likes · 10 min read

Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank

Qborfy AI

Apr 19, 2026 · Artificial Intelligence

Boosting Claude’s Front‑End Development with a GAN‑Inspired Multi‑Agent Harness

The article details how a GAN‑inspired multi‑agent harness—combining a generator, an evaluator, and a planner—overcomes context‑window anxiety and self‑evaluation bias, enabling Claude to produce higher‑quality front‑end designs and full‑stack applications through iterative scoring, sprint contracts, and extensive cost‑benefit experiments.

AI EngineeringFull-Stack DevelopmentGaN

0 likes · 19 min read

Boosting Claude’s Front‑End Development with a GAN‑Inspired Multi‑Agent Harness

DevOps in Software Development

Apr 17, 2026 · Artificial Intelligence

Designing a Control System for AI Code Generators: The Harness Engineering Framework

This article breaks down Birgitta Böckeler's Harness Engineering framework, explaining its 2×2 control matrix, the distinction between computational and inferential controls, three regulation dimensions, and new concepts like Harnessability and Harness Templates, while offering concrete actions for engineering leaders.

AI Code GenerationAI EngineeringHarness Engineering

0 likes · 10 min read

Designing a Control System for AI Code Generators: The Harness Engineering Framework

AI Waka

Apr 16, 2026 · Artificial Intelligence

Why Modern AI Systems Should Compile Knowledge Instead of Just Retrieving It

Traditional RAG pipelines forget everything after each query, but the LLM Wiki mode proposed by Andrej Karpathy compiles source material into a version‑controlled, cross‑referenced Markdown wiki, enabling knowledge to compound over time, reduce query costs, and provide a transparent, human‑readable knowledge base for AI engineers.

AI EngineeringLLMRAG

0 likes · 23 min read

Why Modern AI Systems Should Compile Knowledge Instead of Just Retrieving It

AI Tech Publishing

Apr 15, 2026 · Artificial Intelligence

8 Critical Harness Design Issues That Threaten Long‑Running Agent Accuracy

The article systematically breaks down why autonomous agents lose control during long‑running engineering tasks—missing context, short‑sighted planning, context anxiety, and plan drift—and shows how a well‑designed harness layer can preempt these problems without changing the underlying model.

AI EngineeringContext ManagementHarness

0 likes · 11 min read

8 Critical Harness Design Issues That Threaten Long‑Running Agent Accuracy

AI Code to Success

Apr 15, 2026 · Artificial Intelligence

Why Prompt Engineering Isn’t Enough: Harness Engineering for Reliable AI Agents

The article explains that while prompt engineering helps AI answer single questions, building a robust execution environment—called Harness Engineering—is essential for agents to work continuously, reliably, and autonomously across complex tasks.

AI EngineeringAgent SystemsContext Engineering

0 likes · 12 min read

Why Prompt Engineering Isn’t Enough: Harness Engineering for Reliable AI Agents

Design Hub

Apr 15, 2026 · Artificial Intelligence

Why Your Company’s “AI‑First” Strategy Might Not Be Real AI‑First

The article dissects CREAO’s AI‑first engineering system, contrasting true AI‑driven workflows with superficial AI assistance, and explains how a unified monorepo, automated CI/CD pipelines, self‑healing loops, and specialized roles enable a 25‑person team to outperform competitors by a factor of 100.

AI EngineeringAI-firstAgent Platform

0 likes · 15 min read

Why Your Company’s “AI‑First” Strategy Might Not Be Real AI‑First

Big Data and Microservices

Apr 13, 2026 · Artificial Intelligence

Why AI Agents Need Harness Engineering: Turning Labs into Production

The article explains how Harness Engineering provides the industrial‑grade infrastructure that lets large language model agents operate reliably in complex, long‑running tasks, bridging the gap between impressive demos and real‑world production systems.

AI EngineeringAgent ArchitectureContext Engineering

0 likes · 16 min read

Why AI Agents Need Harness Engineering: Turning Labs into Production

Tech Freedom Circle

Apr 12, 2026 · Artificial Intelligence

What Is Harness Agent? A Deep Dive into the New AI Engineering Framework

Harness Agent is an AI engineering framework that combines a large language model with a runtime control system—called the Harness—to provide task planning, sandboxed execution, tool integration, memory management, safety guardrails, and observability, turning raw model capabilities into reliable, production‑grade agents.

AI EngineeringAgent ArchitectureDeerFlow

0 likes · 26 min read

What Is Harness Agent? A Deep Dive into the New AI Engineering Framework

Qborfy AI

Apr 11, 2026 · Industry Insights

Why AI Agents Need Harness Engineering: Insights from OpenAI, LangChain, and Anthropic

This article explains how AI agents often stall, repeat mistakes, or diverge on complex tasks, argues that the missing piece is a well‑designed harness, and demonstrates with real‑world case studies from OpenAI, LangChain, and Anthropic how a six‑component harness can boost performance by over 13 percentage points and enable million‑line code generation.

AI EngineeringAgent HarnessAnthropic

0 likes · 12 min read

Why AI Agents Need Harness Engineering: Insights from OpenAI, LangChain, and Anthropic

Machine Learning Algorithms & Natural Language Processing

Apr 9, 2026 · Artificial Intelligence

Migrating from Claude Code to Codex Using Anthropic Harness Principles

The article analyzes Anthropic's harness design for long‑running agent coding, reproduces its three‑role architecture, adapts it to Codex and GPT to cut costs, and presents experimental results that confirm the migrated system remains reliable and self‑healing.

AI EngineeringClaude CodeCodex

0 likes · 12 min read

Migrating from Claude Code to Codex Using Anthropic Harness Principles

AI Large-Model Wave and Transformation Guide

Apr 7, 2026 · Artificial Intelligence

Why Harness Engineering Is the New AI Competitive Edge in 2026

The article argues that as large‑model capabilities converge, the decisive factor in 2026 AI competition shifts from raw model power to the ability to engineer a full‑stack Harness system that multiplies performance tenfold through standardized adapters, dynamic prompt registries, multi‑agent orchestration, context compression, and observability.

AI EngineeringHarnessObservability

0 likes · 14 min read

Why Harness Engineering Is the New AI Competitive Edge in 2026

ArcThink

Apr 6, 2026 · Artificial Intelligence

How Harness Engineering Let a 3‑Person Team Write 1 Million Lines of Code in 5 Months

Harness Engineering combines systematic prompts, context management, and robust validation loops to turn powerful LLMs into reliable agents, enabling a three‑engineer team to produce about one million lines of production code in five months and boosting LangChain’s benchmark ranking by 25 places, proving that well‑designed harnesses outweigh model improvements by an order of magnitude.

AI EngineeringAgent SystemsContext Engineering

0 likes · 25 min read

How Harness Engineering Let a 3‑Person Team Write 1 Million Lines of Code in 5 Months

Architecture Musings

Apr 4, 2026 · Industry Insights

Exploring the Harness Architecture Concept: A First Look

This article examines the emerging "Harness Architecture" idea, arguing that constraints should be applied before AI code generation by leveraging stepwise refinement, modular contracts, and living design documents to improve precision, reduce token usage, and prevent architectural drift in large software projects.

AI EngineeringHarness architectureSpec-Driven Development

0 likes · 8 min read

Exploring the Harness Architecture Concept: A First Look

PaperAgent

Apr 2, 2026 · Artificial Intelligence

Can an LLM Build a Full‑Stack Knowledge Graph System in Under 3 Hours?

Using the GLM‑5.1 large language model, the author automated the end‑to‑end development of an ontology‑based knowledge‑graph extraction and visualization platform—covering backend, frontend, and graph database—in just 2 hours 47 minutes, consuming 747 k tokens and self‑correcting multiple issues.

AI EngineeringFull-Stack DevelopmentGLM-5.1

0 likes · 12 min read

Can an LLM Build a Full‑Stack Knowledge Graph System in Under 3 Hours?

dbaplus Community

Apr 1, 2026 · Artificial Intelligence

What the Claude Code Leak Reveals About AI Engineering Practices

A massive accidental release of Claude Code's 512,000-line TypeScript source, exposed via a source‑map file, lets anyone reconstruct the entire codebase and offers a stark, real‑world case study of high‑performance AI tooling, architectural trade‑offs, and the hidden costs of rapid development.

AI EngineeringAnthropicClaude Code

0 likes · 10 min read

What the Claude Code Leak Reveals About AI Engineering Practices

AsiaInfo Technology: New Tech Exploration

Apr 1, 2026 · Industry Insights

How Harness Engineering Is Redefining Industrial AI Agents

This article analyzes the emergence of Harness Engineering as the third‑generation AI engineering paradigm, explains its three‑layer Industrial Harness architecture, identifies three failure modes of long‑running industrial agents, and validates the approach with quantitative case studies and a roadmap for Physical AI OS deployment.

AI EngineeringHarness EngineeringIndustrial Agents

0 likes · 28 min read

How Harness Engineering Is Redefining Industrial AI Agents

ArcThink

Apr 1, 2026 · Artificial Intelligence

Inside Claude Code: 1,900‑File Source Dive Reveals Six‑Layer Architecture

After a source‑map leak exposed Claude Code’s 1,900 TypeScript files, this analysis dissects its six‑layer architecture, dynamic prompt assembly, four‑level caching, 60+ tool governance pipeline, six built‑in agents, five context‑compression strategies, and the real engineering trade‑offs hidden beneath the product.

AI EngineeringAgent SystemsContext Compression

0 likes · 31 min read

Inside Claude Code: 1,900‑File Source Dive Reveals Six‑Layer Architecture

Lao Guo's Learning Space

Mar 31, 2026 · Operations

Harness Engineering Best Practices: Real‑World AI Ops Lessons from 4 Companies

This article explains Harness Engineering—a methodology that lets AI agents work reliably by steering humans and automating execution—through core principles, a performance boost demonstrated by OpenAI, and detailed case studies from OpenAI, Citi, Ancestry, and Ulta Beauty, followed by a step‑by‑step adoption roadmap.

AI EngineeringArchitecture ConstraintsContext Engineering

0 likes · 11 min read

Harness Engineering Best Practices: Real‑World AI Ops Lessons from 4 Companies

Java Architect Essentials

Mar 31, 2026 · Artificial Intelligence

What the Claude Code Leak Reveals: 510k Lines of AI Engine Exposed

A massive leak of Anthropic’s Claude Code exposed over 1,900 files and 510,000 lines of TypeScript, revealing its React‑Ink UI, Bun runtime, extensive toolset, hidden Kairos mode, electronic pet system, and covert Undercover features, sparking worldwide developer frenzy and security concerns.

AI EngineeringBun runtimeClaude

0 likes · 7 min read

What the Claude Code Leak Reveals: 510k Lines of AI Engine Exposed

Data Party THU

Mar 30, 2026 · Artificial Intelligence

Why AI Needs a ‘Harness’: Building Environments for Persistent Agents

The article analyzes the emerging concept of Harness Engineering—combining AI models with structured environments, standards, and feedback loops—to enable agents that can work continuously, illustrated by OpenAI and Anthropic case studies, practical design guidelines, and a three‑week adoption plan.

AI EngineeringAgent DesignHarness Engineering

0 likes · 10 min read

Why AI Needs a ‘Harness’: Building Environments for Persistent Agents

AI Large Model Application Practice

Mar 30, 2026 · Artificial Intelligence

Why Agent Harnesses Are the Key to Production‑Ready AI Agents

The article analyzes the emerging concept of Agent Harnesses, explaining how they transform unruly large‑model agents into controllable, production‑grade systems by addressing long‑running tasks, legacy code complexity, execution‑delivery gaps, and safety concerns through systematic engineering practices.

AI EngineeringAgent Harnessautomation

0 likes · 18 min read

Why Agent Harnesses Are the Key to Production‑Ready AI Agents

AI Step-by-Step

Mar 29, 2026 · Artificial Intelligence

Engineering Long-Term Memory for Agents: Practical Architecture and Best Practices

The article explains how to engineer persistent, cross‑session memory for AI agents by persisting key user facts, task states, and decisions in a multi‑layer storage architecture, detailing retrieval before each request and update after each interaction.

AI EngineeringAgent ArchitectureLong-term Memory

0 likes · 11 min read

Architect

Mar 28, 2026 · Artificial Intelligence

Why AI Agents Need a Harness: From Model Power to System Reliability

The article analyzes how the growing strength of large language models shifts engineering bottlenecks from model capabilities to system stability, introducing the concept of a "Harness" that integrates models into real‑world workflows through state management, constraints, feedback loops, and verification mechanisms.

AI EngineeringAI OpsAgent Harness

0 likes · 18 min read

Why AI Agents Need a Harness: From Model Power to System Reliability

Nightwalker Tech

Mar 27, 2026 · Artificial Intelligence

Why AI Needs a Harness Engineering Framework to Tackle Long‑Term Complex Tasks

The article explains that AI struggles with extended, complex tasks not because models lack intelligence but due to missing systematic engineering practices, and proposes a Harness Engineering framework that introduces external memory, task decomposition, fixed SOP loops, and test‑driven safeguards to turn AI agents into reliable, production‑grade collaborators.

AI EngineeringHarness frameworkSystematic AI

0 likes · 4 min read

Why AI Needs a Harness Engineering Framework to Tackle Long‑Term Complex Tasks

DataFunTalk

Mar 27, 2026 · Artificial Intelligence

Building a Production‑Ready RAG Engine: Architecture, Challenges & Solutions

This article examines the practical challenges of deploying Retrieval‑Augmented Generation in enterprise settings, outlines a layered RAG architecture with offline document processing and online query handling, and details the hybrid retrieval, multi‑stage ranking, knowledge filtering, and generation techniques that improve accuracy and reduce hallucinations.

AI EngineeringHybrid RetrievalKnowledge Filtering

0 likes · 22 min read

Building a Production‑Ready RAG Engine: Architecture, Challenges & Solutions

Tencent TDS Service

Mar 27, 2026 · Artificial Intelligence

How Kuikly’s AI Engineering Boosted Cross‑Platform Development Efficiency

The article details how the Kuikly cross‑platform framework team tackled AI coding challenges by redesigning their architecture, building precise AI context documents, standardizing requirement flows with Spec‑Kit, and integrating a suite of AI tools, resulting in significant productivity gains and higher code quality.

AI EngineeringCross‑platform developmentKuikly

0 likes · 15 min read

How Kuikly’s AI Engineering Boosted Cross‑Platform Development Efficiency

SuanNi

Mar 25, 2026 · Artificial Intelligence

Can Harness Engineering Enable AI Agents to Master Complex Long‑Running Tasks?

This article analyses the concept of Harness engineering introduced by OpenAI and Anthropic, explains how multi‑agent architectures decompose and manage long‑running AI tasks, examines practical experiments such as a retro game maker and a web‑audio workstation, and distills lessons for future AI system design.

AI EngineeringAnthropicClaude

0 likes · 16 min read

Can Harness Engineering Enable AI Agents to Master Complex Long‑Running Tasks?

Architect's Journey

Mar 25, 2026 · Artificial Intelligence

Why SKILL Makes AI Development Surprisingly Simple

The article introduces the SKILL framework, explains its file‑based structure and LLM‑driven entry point, compares it with traditional API‑centric backends, outlines its suitable use cases and limitations, and argues that mastering SKILL will become a core productivity skill for developers.

AI EngineeringLLMSKILL framework

0 likes · 8 min read

Why SKILL Makes AI Development Surprisingly Simple

Coder Circle

Mar 17, 2026 · Industry Insights

After a Decade of Java, Why the Programmer Era Is Shifting

The article analyzes how AI is now writing code, compressing development cycles from half a day to minutes, reshaping programmer roles through three historical value shifts, highlighting new AI‑centric responsibilities, and offering a concrete learning path for Java developers to thrive in the AI era.

AI EngineeringAI PlatformsArtificial Intelligence

0 likes · 8 min read

After a Decade of Java, Why the Programmer Era Is Shifting

AI Engineer Programming

Mar 16, 2026 · Artificial Intelligence

Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents

The article explains that the term “Agent development” hides a fundamental split between Agent Frameworks, which give developers building blocks to assemble their own agents, and Agent Harnesses, which provide ready‑to‑run agents, and shows how this distinction affects decisions, maintenance, and troubleshooting.

AI EngineeringAgentClaude Code

0 likes · 10 min read

Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents

AI Architecture Hub

Mar 10, 2026 · Industry Insights

How OpenAI Built a Million‑Line Codebase Without Human Typing – Lessons for AI‑Driven Software Engineering

OpenAI’s five‑month "Harness Engineering" experiment showed that a three‑person team could generate a million‑line software product entirely with Codex and GPT‑5, achieving ten‑fold productivity, redefining engineering roles, workflow loops, and offering five practical guidelines for AI‑augmented development while highlighting unresolved challenges.

AI EngineeringAI productivityHarness Engineering

0 likes · 18 min read

How OpenAI Built a Million‑Line Codebase Without Human Typing – Lessons for AI‑Driven Software Engineering

AI Architecture Hub

Mar 6, 2026 · Artificial Intelligence

How Agent Skills Solve AI Prompt Drift and Enable Scalable AI Workflows

This article analyzes the prompt‑drift problem in AI workflows, explains the open‑format Agent Skills standard, dissects a concrete Python code‑audit skill design, compares Anthropic and OpenAI ecosystems, and provides practical guidelines for building high‑availability, version‑controlled Agent Skills.

AI EngineeringAgent SkillsAnthropic

0 likes · 15 min read

How Agent Skills Solve AI Prompt Drift and Enable Scalable AI Workflows

Architect

Feb 28, 2026 · Artificial Intelligence

Designing Agent Tools: Key Lessons from Claude Code’s Action Space

This article distills the Claude Code team's hard‑won insights on building effective AI agents, highlighting why action‑space design outweighs model size, how structured questioning improves bandwidth, when to replace Todos with Tasks, and a repeatable seven‑step loop for evolving toolsets.

AI EngineeringAction Spaceprompt engineering

0 likes · 20 min read

Designing Agent Tools: Key Lessons from Claude Code’s Action Space

AI Large Model Application Practice

Feb 19, 2026 · Artificial Intelligence

When Should You Add a Knowledge Graph? 6 Practical Decision Criteria

This article outlines six concrete criteria—relationship‑centric data, reproducible reasoning, evolving schemas, multi‑hop queries, explainable decisions, and cross‑system data integration—to help engineers decide whether a knowledge graph is the right solution or if a relational database will suffice.

AI EngineeringReasoningdata integration

0 likes · 15 min read

When Should You Add a Knowledge Graph? 6 Practical Decision Criteria

Yunqi AI+

Feb 13, 2026 · Artificial Intelligence

AI Engineering: Methodology and Practice for Turning Generative AI into Production Systems

The article outlines a comprehensive AI engineering methodology—including the TPMR framework, an AI‑driven development lifecycle, talent transformation from co‑pilot to AI pilot, and a practical enterprise adoption roadmap—to move generative AI and large models from experimental prototypes to production‑grade systems.

AI EngineeringAI LifecycleLLMOps

0 likes · 5 min read

AI Engineering: Methodology and Practice for Turning Generative AI into Production Systems

Baobao Algorithm Notes

Feb 3, 2026 · Industry Insights

How One Developer Merges 600 Commits a Day with AI: Inside the Clawdbot Workflow

In this in‑depth interview, Peter Steinberger explains how AI agents let him submit and merge hundreds of commits daily, replace traditional code reviews with prompt‑driven requests, and redesign his development workflow around a closed‑loop validation system that reshapes modern software engineering.

AI EngineeringAgent workflowCode Review

0 likes · 85 min read

How One Developer Merges 600 Commits a Day with AI: Inside the Clawdbot Workflow

High Availability Architecture

Feb 1, 2026 · Artificial Intelligence

10 Proven Strategies to Become an AI Engineer with Claude Code

This guide shares ten practical techniques—from parallel git worktrees and plan‑mode prompting to custom Skills, sub‑agents, and terminal tweaks—that let developers treat Claude as a fully orchestrated AI engineering system, boosting productivity and reducing manual coding overhead.

AI EngineeringClaude CodeGit Worktree

0 likes · 12 min read

10 Proven Strategies to Become an AI Engineer with Claude Code

Architecture and Beyond

Jan 17, 2026 · Artificial Intelligence

Progressive Disclosure & Dynamic Context: Making LLM Agents Reliable Execution Systems

This article explains how progressive disclosure and dynamic context management address the three core bottlenecks of complex LLM agents—context explosion, tool overload, and uncontrolled execution—by structuring context, tools, and SOPs into layered, token‑efficient, and verifiable workflows.

AI EngineeringLLM agentsProgressive disclosure

0 likes · 15 min read

Progressive Disclosure & Dynamic Context: Making LLM Agents Reliable Execution Systems

Tencent Technical Engineering

Jan 9, 2026 · Artificial Intelligence

Why Traditional Spec‑Driven Tools Fail and How AI‑Powered Context Engineering Can Supercharge Development

This article analyzes the shortcomings of spec‑driven tools like speckit and openspec in complex enterprise environments, introduces context engineering and compound engineering concepts, and presents a practical AI‑engineered workflow that reduces marginal cost, captures knowledge automatically, and makes AI tools feel invisible to developers.

AI EngineeringContext Engineeringcompound-engineering

0 likes · 48 min read

Why Traditional Spec‑Driven Tools Fail and How AI‑Powered Context Engineering Can Supercharge Development

DaTaobao Tech

Jan 5, 2026 · Artificial Intelligence

Why AI Engineering Isn’t a Reinvention of Software Architecture – Insights from AI Search

The article examines how AI engineering builds on, rather than discards, traditional software engineering principles, using the evolution of AI‑driven search at Alibaba to illustrate architectural upgrades that manage uncertainty, integrate context engineering, and combine classic design patterns with new AI‑specific tools.

AI EngineeringContext EngineeringSearch

0 likes · 21 min read

Why AI Engineering Isn’t a Reinvention of Software Architecture – Insights from AI Search

Wu Shixiong's Large Model Academy

Nov 22, 2025 · Artificial Intelligence

Why Your RAG System Slows Down Over Time and How to Fix It

The article explains why a production Retrieval‑Augmented Generation (RAG) system becomes slower as it runs—due to growing embedding costs, expanding vector databases, heavier re‑ranking, and larger prompts—and provides concrete engineering optimizations such as batching, async concurrency, caching, partitioned retrieval, HNSW tuning, replica scaling, answer caching, and prompt sparsification to keep performance stable.

AI EngineeringPerformance OptimizationRAG

0 likes · 10 min read

Why Your RAG System Slows Down Over Time and How to Fix It

Wu Shixiong's Large Model Academy

Nov 21, 2025 · Artificial Intelligence

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

This article explains why dynamic Retrieval‑Augmented Generation (RAG) requires a layered caching strategy rather than simple result caching, details a four‑level cache architecture—including embedding, search, answer, and pipeline caches—provides practical key‑generation and TTL guidelines, and outlines dirty‑data defenses to keep caches consistent and performant.

AI EngineeringCachingLLM

0 likes · 10 min read

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

JD Tech

Nov 6, 2025 · Artificial Intelligence

LLMs Revolutionize Recommendation Systems: From Generative Models to Production

This article surveys the evolution of generative recommendation systems powered by large language models, detailing their technical foundations, engineering challenges, recent breakthroughs, and future research directions, while highlighting why the paradigm shift is occurring now.

AI EngineeringGenerative RecommendationLLM

0 likes · 30 min read

LLMs Revolutionize Recommendation Systems: From Generative Models to Production

JD Tech Talk

Oct 27, 2025 · Artificial Intelligence

How Large Language Models Are Revolutionizing Generative Recommendation Systems

Over the past year, generative recommendation has made substantial progress by leveraging large language models' powerful sequence modeling and reasoning abilities, introducing a new paradigm that replaces complex handcrafted features, addresses traditional recommendation bottlenecks, and outlines the evolution, core technologies, engineering challenges, and future directions of LLM‑based recommendation systems.

AI EngineeringEncoder-DecoderLLM

0 likes · 29 min read

How Large Language Models Are Revolutionizing Generative Recommendation Systems

21CTO

Oct 6, 2025 · Artificial Intelligence

How to Become an AI Engineer: Skills, Workflow, and Career Path

This guide explains what AI engineering entails, outlines the end‑to‑end workflow from problem definition and data preparation through model development, deployment, and monitoring, and highlights the essential programming, cloud, and MLOps skills, career tracks, emerging trends, and salary outlook for aspiring AI engineers.

AI EngineeringCloud ComputingMLOps

0 likes · 11 min read

How to Become an AI Engineer: Skills, Workflow, and Career Path

Instant Consumer Technology Team

Sep 30, 2025 · Artificial Intelligence

What Makes Youtu-GraphRAG’s Engineering Stand Out? Inside the AI Blueprint

This article dissects the engineering of Tencent's Youtu-GraphRAG, covering its architectural challenges, real‑time FastAPI/WebSocket design, security measures, iterative retrieval chains, parallel processing, intelligent caching, schema‑driven knowledge handling, and performance tweaks, offering practical insights for AI system builders.

AI EngineeringFastAPIGraphRAG

0 likes · 7 min read

What Makes Youtu-GraphRAG’s Engineering Stand Out? Inside the AI Blueprint

AI2ML AI to Machine Learning

Sep 28, 2025 · Artificial Intelligence

Core Metrics for Enterprise Large‑Model Engineering

The article outlines the five essential engineering domains—application, model, compute, knowledge, and data—in the era of large models, and details concrete scale, efficiency, service, value, quality, and security metrics that enterprises should track to drive intelligent outcomes.

AI Engineeringbusiness valuecompute metrics

0 likes · 7 min read

Core Metrics for Enterprise Large‑Model Engineering

21CTO

Sep 21, 2025 · Artificial Intelligence

Andrew Ng Explains the AI Skills Gap and How to Become an In‑Demand AI Engineer

Andrew Ng highlights a growing mismatch in the tech job market: while companies scramble for AI‑savvy developers, many new CS graduates struggle to find work, and he outlines the core AI‑driven capabilities—prompt engineering, RAG, model evaluation, and rapid prototyping—that separate successful AI engineers from the rest.

AIAI Engineeringjob market

0 likes · 6 min read

Andrew Ng Explains the AI Skills Gap and How to Become an In‑Demand AI Engineer

Alibaba Cloud Developer

Sep 15, 2025 · Artificial Intelligence

Why MCP Isn't a Magic AI Upgrade: Deep Dive into Its Architecture, Host Role, and Real Costs

This article debunks common misconceptions about the Model Context Protocol (MCP), explains its client‑host‑server (CHS) architecture, shows how the Host drives AI decisions while Server and Client remain model‑agnostic, compares MCP with Function Calling, analyzes SDK source code, evaluates practical trade‑offs, and outlines the true engineering value and costs of using MCP in AI applications.

AI EngineeringFunction CallingLLM

0 likes · 35 min read

Why MCP Isn't a Magic AI Upgrade: Deep Dive into Its Architecture, Host Role, and Real Costs

ShiZhen AI

Sep 5, 2025 · Artificial Intelligence

Andrew Ng Highlights Core AI Engineer Skills Amidst Major AI Industry Updates

The article reports that ChatGPT now supports branch conversations, Anthropic restricts service use in certain regions, Andrew Ng outlines essential AI engineer capabilities such as AI‑assisted software building, prompting and agentic workflows, and highlights the market demand, while also covering the Kimi K2 model upgrade, Hugging Face’s FineVision dataset release, and Google’s AI‑driven Deep Loop Shaping method published in *Science*.

AI EngineeringAI for astronomyAI safety

0 likes · 8 min read

Andrew Ng Highlights Core AI Engineer Skills Amidst Major AI Industry Updates

Alibaba Cloud Big Data AI Platform

Aug 8, 2025 · Artificial Intelligence

Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design

This article summarizes a Cloud Summit talk where Alibaba Cloud’s AI expert Zhang Yingying explains how large language models can enhance big‑data intelligent operations, covering opportunities, challenges, RAG‑based Q&A, multi‑agent diagnostics, and the engineering architecture needed for reliable, scalable deployment.

AI EngineeringBig Data OperationsRAG

0 likes · 20 min read

Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design

Alibaba Cloud Big Data AI Platform

Aug 8, 2025 · Artificial Intelligence

From Chain‑of‑Thought to Self‑Evolving Agents: Lessons from Alibaba’s Intelligent Ops

This article traces the evolution of Alibaba’s intelligent agents from the initial chain‑of‑thought design through instantiation, structuring, self‑evolution, and middleware integration, highlighting practical challenges, architectural refinements, and open‑source tools for large‑scale AI operations.

AI EngineeringAgentmiddleware

0 likes · 16 min read

From Chain‑of‑Thought to Self‑Evolving Agents: Lessons from Alibaba’s Intelligent Ops

Architect's Alchemy Furnace

Jun 4, 2025 · Artificial Intelligence

What Is an AI Engineer? Roles, Skills, and the Future of LLM‑Powered Systems

This article examines the evolving role of the AI engineer, contrasting it with AI researchers, ML engineers, and software engineers, outlines essential skills such as prompt engineering, MLOps, and data integration, and predicts how AI engineering will become a pivotal, high‑demand discipline in the coming years.

AI EngineeringAI SystemsAgentic RAG

0 likes · 17 min read

What Is an AI Engineer? Roles, Skills, and the Future of LLM‑Powered Systems

Java Architecture Diary

May 19, 2025 · Artificial Intelligence

How Ollama 0.7 Unlocks Local Multimodal AI with One Command

Ollama 0.7 introduces a fully re‑engineered core that brings seamless multimodal model support, lists top visual models, showcases OCR and image analysis capabilities, explains technical breakthroughs, and provides a quick three‑step guide to deploy powerful local AI vision.

AI EngineeringAI modelsOllama

0 likes · 7 min read

How Ollama 0.7 Unlocks Local Multimodal AI with One Command

DevOps

Apr 22, 2025 · Artificial Intelligence

How to Think About Agent Frameworks: A Critical Review of Design Patterns, Challenges, and LangGraph

This article critically examines popular agent frameworks, compares OpenAI and Anthropic definitions, highlights the core difficulty of maintaining proper context for reliable agents, and presents LangGraph’s declarative and imperative features along with practical guidance for building production‑grade agent systems.

AI EngineeringAgent FrameworksAgent Systems

0 likes · 24 min read

How to Think About Agent Frameworks: A Critical Review of Design Patterns, Challenges, and LangGraph

DaTaobao Tech

Feb 21, 2025 · Artificial Intelligence

AI-Powered Face Swapping for the Spring Festival Gala: System Design and Deployment

The paper details the design and deployment of an AI‑driven face‑swap platform for the 2025 CCTV Spring Festival Gala, featuring a dual‑model SDXL pipeline with ControlNet and LoRA fine‑tuning, optimized preprocessing and GPU‑specific acceleration to achieve sub‑3‑second latency at over 10 k QPS, supporting scaling, throttling, and multi‑region load balancing, and ultimately serving ten million users and generating hundreds of millions of personalized gala images.

AI EngineeringAIGCSpring Festival Gala

0 likes · 28 min read

AI-Powered Face Swapping for the Spring Festival Gala: System Design and Deployment

DataFunSummit

Jan 19, 2025 · Artificial Intelligence

Understanding MLOps and LMOps: Evolution, Engineering Practices, and Future Trends for Large Models

This article reviews the development of MLOps, introduces the emerging LMOps framework for large‑model engineering, outlines key architectural components, discusses current challenges and industry trends, and presents future directions and standardization efforts in AI operations.

AI EngineeringAI OpsLMOps

0 likes · 18 min read

Understanding MLOps and LMOps: Evolution, Engineering Practices, and Future Trends for Large Models

21CTO

Dec 18, 2024 · Artificial Intelligence

5 AI Engineering Trends Shaping 2024: Agents, Coding Tools, and the Rise of Small Models

The 2024 AI engineering landscape is defined by mature AI coding assistants, the surge of AI agents like LangChain and LlamaIndex, the emergence of small, locally‑hosted language models, the solidifying role of AI engineers, and heated debates over what truly counts as open‑source AI.

AI EngineeringAI coding toolssmall language models

0 likes · 9 min read

5 AI Engineering Trends Shaping 2024: Agents, Coding Tools, and the Rise of Small Models

Zhihu Tech Column

Dec 9, 2024 · Artificial Intelligence

Large Model Application Engineering: ZhiLight Inference Framework and Zhihu Direct Answer System

The article details Zhihu's technical salon on large‑model engineering, covering the RAG‑based Zhihu Direct Answer system, the open‑source ZhiLight inference framework, prompt engineering, agent research, and future plans for integrating AI into product and community workflows.

AI EngineeringAgentInference Framework

0 likes · 8 min read

Large Model Application Engineering: ZhiLight Inference Framework and Zhihu Direct Answer System

Baobao Algorithm Notes

Aug 29, 2024 · Industry Insights

Why Pretraining Boosts New Engineers More Than SFT: A Practical Guide

The answer argues that fresh graduates should join pre‑training teams because the required engineering tasks—large‑scale data crawling, Hadoop/Spark pipelines, torch and CUDA setup, megatron code debugging, and scaling‑law experiments—rapidly sharpen coding skills, while SFT work focuses mainly on data labeling and offers slower technical growth.

AI EngineeringLarge ModelsSFT

0 likes · 7 min read

Why Pretraining Boosts New Engineers More Than SFT: A Practical Guide

Baobao Algorithm Notes

Aug 26, 2024 · Artificial Intelligence

Master Essential LLM Engineering Skills: Transform, Model, and Infer with Custom Scripts

This guide presents a hands‑on curriculum of core large‑model engineering tasks—including model conversion scripts, custom modeling wrappers, multi‑model inference utilities, and channel‑aware loss tracking—to help practitioners build practical, reusable tools without deep theoretical overhead.

AI EngineeringInference OptimizationPython scripting

0 likes · 8 min read

Master Essential LLM Engineering Skills: Transform, Model, and Infer with Custom Scripts

JD Tech

May 31, 2024 · Artificial Intelligence

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

This article explains the fundamentals and engineering practices of large language models (LLM), retrieval‑augmented generation (RAG) and AI agents, compares small and large embedding models, provides Python code for vector‑database RAG with Chroma, and discusses integration, use cases, and future challenges in AI development.

AI EngineeringAI agentsLLM

0 likes · 41 min read

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

G7 EasyFlow Tech Circle

May 29, 2024 · Artificial Intelligence

Engineering Large Model Enterprise Applications: Best Practices

This article outlines the key characteristics of large‑model enterprise applications, compares them with consumer use cases, and presents a comprehensive engineering roadmap—including model selection, knowledge‑base integration, tool implementation, intent recognition, output control, high‑availability deployment, and ongoing optimization—to help practitioners effectively harness AI models in real‑world business environments.

AI EngineeringRAGlarge model

0 likes · 12 min read

Engineering Large Model Enterprise Applications: Best Practices

DevOps

Apr 17, 2024 · Artificial Intelligence

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

The article explores how enterprises can build and improve large‑model applications by combining prompt engineering, retrieval‑augmented generation (RAG), and fine‑tuning, discusses their relationships, optimization dimensions, testing challenges, and provides practical guidance for SE4AI implementation.

AI EngineeringRAGenterprise AI

0 likes · 20 min read

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

HelloTech

Apr 10, 2024 · Artificial Intelligence

An Overview of LangChain: Architecture, Core Components, and Code Examples

LangChain is an open‑source framework that provides Python and JavaScript SDKs, templates, and services such as LangServe and LangSmith to compose models, embeddings, prompts, indexes, memory, chains, and agents via a concise expression language, enabling rapid prototyping, debugging, and deployment of LLM‑driven applications.

AI EngineeringJavaScriptLLM

0 likes · 19 min read

An Overview of LangChain: Architecture, Core Components, and Code Examples

Alibaba Cloud Developer

Apr 10, 2024 · Artificial Intelligence

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

This guide walks AI engineers through a rapid 10‑minute boot‑strap of LangChain, explaining its purpose, core concepts, design questions, environment setup, and step‑by‑step code examples that cover APIs, chains, memory, retrieval‑augmented generation, tools, agents, and the overall architecture.

AI EngineeringLLMLangChain

0 likes · 28 min read

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

NewBeeNLP

Mar 21, 2024 · Artificial Intelligence

Mastering Large Language Model Training: Key Challenges and Optimization Strategies

This article examines the resource and efficiency challenges of scaling large language model training, explains data, model, pipeline, and tensor parallelism, and provides practical I/O, communication, and stability optimization techniques—including high‑availability storage, RDMA networking, NCCL tuning, and fault‑tolerant recovery—to improve throughput and reliability.

AI EngineeringI/O optimizationcommunication optimization

0 likes · 15 min read

Mastering Large Language Model Training: Key Challenges and Optimization Strategies

DataFunTalk

Mar 20, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Large Language Model Training

The article outlines the resource and efficiency challenges of scaling large language models, explains data and model parallelism strategies, and details practical I/O, communication, and stability optimizations—including high‑availability storage, RDMA networking, and fault‑tolerance measures—to improve training throughput and reliability.

AI EngineeringI/O optimizationcommunication optimization

0 likes · 13 min read

Challenges and Optimization Techniques for Large Language Model Training

Architecture & Thinking

Jan 14, 2024 · Artificial Intelligence

How Baidu Scales Content Understanding to Trillions of Pages with AI Engineering

This article explains how Baidu processes internet‑scale content by applying deep AI‑driven understanding, detailing cost‑optimization, efficiency improvements, model‑service frameworks, resource‑scheduling systems, and batch‑compute platforms that together enable trillion‑level indexing and feature extraction.

AI EngineeringBatch ComputingHTAP storage

0 likes · 16 min read

How Baidu Scales Content Understanding to Trillions of Pages with AI Engineering

21CTO

Dec 15, 2023 · Artificial Intelligence

Why 2024 Will Be the Year of AI Engineers and LLM‑Driven Apps

The article outlines five major AI engineering trends for 2024—including the rise of AI engineers, evolving LLM tech stacks, open‑source large models, vector databases, and AI agents—highlighting how these shifts will reshape application development and industry competition.

2024 trendsAI EngineeringAI agents

0 likes · 9 min read

Why 2024 Will Be the Year of AI Engineers and LLM‑Driven Apps

Alibaba Cloud Big Data AI Platform

Nov 8, 2023 · Big Data

How Big Data and AI Converge: Insights from Alibaba Cloud’s 2023 Conference

The talk outlines the evolution from model‑centric to data‑centric AI development, explains Alibaba Cloud’s integrated big data‑AI platform, showcases real‑world use cases like knowledge‑base QA and personalized recommendation, and details the underlying cloud‑native services that enable seamless data and AI collaboration.

AI Engineeringmodel training

0 likes · 16 min read

How Big Data and AI Converge: Insights from Alibaba Cloud’s 2023 Conference

DataFunSummit

Oct 7, 2023 · Artificial Intelligence

MLOps Implementation in Network Intelligence: Jiutian Platform Overview

This article presents the Jiutian Network Intelligence platform’s MLOps implementation at China Mobile, detailing its AI engineering workflow, platform functional and technical architecture, technology selections, model deployment, monitoring, and operational challenges, and shares insights on scaling AI services across 31 provinces.

AI EngineeringMLOpsNetwork Intelligence

0 likes · 20 min read

MLOps Implementation in Network Intelligence: Jiutian Platform Overview

Xiaohongshu Tech REDtech

Aug 25, 2023 · Artificial Intelligence

DataFunSummit 2023: Recommendation Systems Online Summit

The DataFunSummit 2023 online summit (August 26‑27) will explore eight recommendation‑system topics—including core and engineering architecture, model training/inference, large models, graphs, cold start, and multi‑task scenarios—featuring Xiaohongshu leaders who will present on graph‑based business architecture, integrated training‑inference pipelines, and user/content cold‑start strategies.

AI EngineeringLarge ModelsRecommendation Systems

0 likes · 6 min read

DataFunSummit

May 5, 2023 · Artificial Intelligence

Advances in Virtual Humans, Multimodal Technology, and General AI – Insights from OPPO

The article presents OPPO's latest research on virtual human audio‑lip and RGB driving, multimodal learning breakthroughs such as CETNETs and cross‑modal matching, and a reflective discussion on the challenges and future directions of general artificial intelligence, highlighting the interconnections among these three domains.

AI EngineeringGeneral AIMultimodal Learning

0 likes · 9 min read

Advances in Virtual Humans, Multimodal Technology, and General AI – Insights from OPPO

Efficient Ops

Jan 16, 2023 · Artificial Intelligence

How MLOps Is Transforming AI Production in China: Trends, Tools, and Standards

This report examines how MLOps is accelerating AI production in China, highlighting industry adoption across sectors, the booming tool ecosystem, the rise of feature platforms, enhanced observability, performance needs for large models, AI asset management, and the emerging national standards and evaluation results.

AI EngineeringAI StandardsFeatureOps

0 likes · 8 min read

How MLOps Is Transforming AI Production in China: Trends, Tools, and Standards

GuanYuan Data Tech Team

Dec 1, 2022 · Artificial Intelligence

Why MLOps Is the Key to Scalable AI Projects

This article explains the concept, significance, and practical case studies of MLOps—showing how integrating DevOps principles with data and machine learning creates reliable, automated pipelines for data quality, model monitoring, error analysis, and continuous integration, ultimately accelerating AI delivery.

AI EngineeringMLOpsMachine Learning Operations

0 likes · 15 min read

Why MLOps Is the Key to Scalable AI Projects

Alibaba Cloud Big Data AI Platform

Nov 4, 2022 · Artificial Intelligence

How AI Platforms Turn Dreams into Reality: Scaling, Efficiency, and Usability

In this talk from the 2022 Yunqi Conference, Jia Yangqing explains how Alibaba's AI platform addresses efficiency, scale, and usability challenges by moving the Damo Academy to the cloud, open‑sourcing ModelScope, and delivering large‑model training, deployment, and inference services at massive scale.

AI EngineeringEfficiencyModel Scaling

0 likes · 10 min read

How AI Platforms Turn Dreams into Reality: Scaling, Efficiency, and Usability