Author

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

1.9k

Articles

Likes

1.5k

Views

Comments

Latest from Alibaba Cloud Developer

100 recent articles max

Alibaba Cloud Developer

Jan 16, 2026 · Artificial Intelligence

How Quest Achieves Autonomous Programming with Agentic Architecture

Quest redesigns long‑running task execution by combining model capability, context management, dynamic reminders, and a minimal Bash‑centric toolset into a closed‑loop Agent architecture that lets AI generate, verify, and deliver complete software artifacts without constant human intervention.

AICoding Automationautonomous programming

0 likes · 12 min read

How Quest Achieves Autonomous Programming with Agentic Architecture

Alibaba Cloud Developer

Jan 16, 2026 · Artificial Intelligence

Mastering Anthropic Skills: Fast, High‑Quality Skill Development Guide

This article explains what Anthropic Skills are, compares them with MCP, and provides a step‑by‑step, best‑practice workflow—including repository setup, OpenSkills commands, naming conventions, and executable skill patterns—to help developers quickly create robust, efficient Skills for Claude and other AI agents.

AI agentsAnthropicOpenSkills

0 likes · 15 min read

Mastering Anthropic Skills: Fast, High‑Quality Skill Development Guide

Alibaba Cloud Developer

Jan 15, 2026 · Artificial Intelligence

How Hierarchical Sparse Attention Breaks KVCache Limits for Ultra‑Long Context LLMs

This article explains how a hierarchical sparse‑attention framework redesigns KVCache storage across GPU, CPU, and remote memory, eliminates bandwidth and capacity bottlenecks, and enables efficient inference for 128K‑token and larger contexts with dramatically reduced GPU memory usage and higher throughput.

Dynamic Sparse AttentionGPU memory optimizationHierarchical Storage

0 likes · 20 min read

How Hierarchical Sparse Attention Breaks KVCache Limits for Ultra‑Long Context LLMs

Alibaba Cloud Developer

Jan 14, 2026 · Artificial Intelligence

How DataAgent Turns AI into a Virtual Data Analyst for Enterprise Insights

DataAgent, built on Spring AI Alibaba, tackles the "last mile" of AI data analysis by combining deterministic workflow orchestration with large‑model reasoning, offering human‑in‑the‑loop feedback, dynamic prompt configuration, hybrid retrieval, containerized Python execution, streaming SSE, multi‑model scheduling, multi‑source connectivity, and secure API‑key management to deliver instant, insight‑rich reports for business users.

AIAnalyticsDataAgent

0 likes · 11 min read

How DataAgent Turns AI into a Virtual Data Analyst for Enterprise Insights

Alibaba Cloud Developer

Jan 12, 2026 · Operations

Why Traditional Monitoring Fails and How UModel Redefines Observability for AI‑Powered Ops

The article explains how legacy monitoring based on isolated metrics, traces, and logs cannot keep up with the massive, fragmented, and dynamic data of modern IT systems, and introduces UModel—a graph‑based observability model that bridges data, model, and engineering gaps to enable AI‑driven operations.

Graph ModelingObservabilityOperations

0 likes · 11 min read

Why Traditional Monitoring Fails and How UModel Redefines Observability for AI‑Powered Ops

Alibaba Cloud Developer

Jan 12, 2026 · Artificial Intelligence

How CodeGenius Re‑engineered Memory to Tame AI Agent Context Bloat

This article explains how the rapid evolution of AI agents caused context explosion, why the original fixed‑window memory failed, and how CodeGenius introduced a layered memory system that unloads stale data, deduplicates files, generates structural summaries, and dynamically compresses dialogue to keep prompts stable, reduce token cost, and improve task continuity.

AI agentContext ManagementLLM cost reduction

0 likes · 18 min read

How CodeGenius Re‑engineered Memory to Tame AI Agent Context Bloat

Alibaba Cloud Developer

Jan 9, 2026 · Backend Development

How to Build a Lightweight, Scalable AI Code Telemetry System with MCP Architecture

This article details the design and implementation of a lightweight, non‑intrusive data‑collection framework for multiple AI coding tools—covering background challenges, industry solutions, design principles, a three‑layer MCP‑based architecture, concrete hook scripts, telemetry handling, cross‑platform deployment, and future enhancements.

AICodeCollectionMCP

0 likes · 17 min read

How to Build a Lightweight, Scalable AI Code Telemetry System with MCP Architecture

Alibaba Cloud Developer

Jan 8, 2026 · Artificial Intelligence

How to Build Human‑In‑The‑Loop (HITL) Capabilities into ReactAgent

This article explains how to integrate a Human‑In‑The‑Loop (HITL) mechanism into ReactAgent, detailing the motivation, design of interaction, tool description, XML‑based UI rendering, Redis‑driven waiting loop, and the broader architectural parallels with design patterns and other agent frameworks.

AgentDesign PatternsHITL

0 likes · 14 min read

How to Build Human‑In‑The‑Loop (HITL) Capabilities into ReactAgent

Alibaba Cloud Developer

Jan 7, 2026 · Backend Development

Mastering AI‑Assisted Backend Development: Context Management, Quality Assurance, and Practical Workflows

This comprehensive guide shows backend developers how to collaborate effectively with AI coding tools by building personal context management systems, accurately judging AI output quality, following a structured PRD‑to‑code workflow, leveraging Python scripts and agent prompts, and applying best‑practice documentation techniques to boost productivity and code reliability.

AI codingContext ManagementSoftware quality

0 likes · 23 min read

Mastering AI‑Assisted Backend Development: Context Management, Quality Assurance, and Practical Workflows

Alibaba Cloud Developer

Jan 6, 2026 · Artificial Intelligence

How Tair‑KVCache‑HiSim Simulates LLM Inference 390 000× Faster with <5% Error

This article explains the design, challenges, and high‑fidelity architecture of Tair‑KVCache‑HiSim, a simulation tool that models multi‑level KV‑Cache behavior for large‑language‑model inference, predicts latency, throughput and cost under SLO constraints, and validates its predictions against real GPU deployments with sub‑5% error.

AI infrastructureKVCacheLLM inference

0 likes · 32 min read

How Tair‑KVCache‑HiSim Simulates LLM Inference 390 000× Faster with <5% Error