Alibaba Cloud Developer
Author

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

1.9k
Articles
0
Likes
1.5k
Views
0
Comments
Recent Articles

Latest from Alibaba Cloud Developer

100 recent articles max
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 5, 2026 · Artificial Intelligence

Mastering AI Coding: Token Mechanics, Tool Calls, and Best‑Practice Prompt Design

This comprehensive guide explains how AI coding assistants like Cursor and Claude Code compute tokens, interact with tools, index codebases with Merkle trees, craft effective prompts, and apply progressive development practices to boost productivity, code quality, and security across real‑world projects.

AI codingBest PracticesClaude Code
0 likes · 44 min read
Mastering AI Coding: Token Mechanics, Tool Calls, and Best‑Practice Prompt Design
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 4, 2026 · Databases

Accelerating AliSQL Vector Search with Nodes Cache and SIMD

AliSQL 8.0 introduces a shared Nodes Cache and per‑transaction cache to speed up vector queries, implements RC‑level transaction isolation for read‑only and read‑write operations, and leverages SIMD‑based pre‑computation to dramatically improve high‑dimensional vector distance calculations and concurrency performance.

AliSQLCache OptimizationSIMD
0 likes · 9 min read
Accelerating AliSQL Vector Search with Nodes Cache and SIMD
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 29, 2025 · Artificial Intelligence

How Alibaba’s Tair KVCache Manager Revolutionizes Enterprise‑Level LLM Cache Management

This article details the architecture and implementation of Tair KVCache Manager, an enterprise‑grade service that centralises KVCache metadata, decouples inference engines from storage, provides elastic scaling, multi‑tenant isolation, high availability, and performance‑optimised cache management for large‑scale LLM inference workloads.

Cache ManagementKVCacheLLM
0 likes · 28 min read
How Alibaba’s Tair KVCache Manager Revolutionizes Enterprise‑Level LLM Cache Management
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 26, 2025 · Artificial Intelligence

How AutoContextMemory Cuts LLM Costs by 70% in Long Conversations

This article explains the challenges of token explosion in long‑running AI agent dialogues and introduces AutoContextMemory, a Java component that automatically compresses, offloads, and summarizes conversation history to dramatically reduce token usage, speed up responses, and preserve critical information.

AgentScopeContext ManagementJava
0 likes · 12 min read
How AutoContextMemory Cuts LLM Costs by 70% in Long Conversations
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 26, 2025 · Artificial Intelligence

How to Build a Fully Automated Knowledge‑Extraction Pipeline for AI Agents with Python

This article presents a complete end‑to‑end pipeline that automatically extracts, generalizes, incrementally updates, and vector‑syncs knowledge from diverse sources such as tickets, documents, and SQL code, turning the traditionally labor‑intensive knowledge‑base construction for agents into a low‑effort, continuously maintainable Python‑driven solution.

LLMPythonRAG
0 likes · 15 min read
How to Build a Fully Automated Knowledge‑Extraction Pipeline for AI Agents with Python
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 24, 2025 · Artificial Intelligence

Boosting LLM Inference: RoleBasedGroup & Mooncake for Stable, High‑Performance Service

Large language model inference faces memory pressure, but by externalizing KVCache with Mooncake and orchestrating roles via the Kubernetes‑native RoleBasedGroup (RBG), developers can achieve stable, high‑throughput, cost‑effective serving with seamless in‑place upgrades and topology‑aware performance.

AI infrastructureKVCacheKubernetes
0 likes · 21 min read
Boosting LLM Inference: RoleBasedGroup & Mooncake for Stable, High‑Performance Service
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 24, 2025 · Artificial Intelligence

Why CLI AI Tools Matter: Aesthetics, Single vs Multi-Agent Design, Tips

This article explores the resurgence of command‑line AI coding tools, examining their product aesthetics rooted in Unix philosophy, the technical contrast between single‑agent and multi‑agent architectures, practical usage scenarios, context‑engineering techniques, and why a lightweight single‑agent core often outperforms more complex multi‑agent systems.

Agent ArchitectureContext Engineeringsoftware development
0 likes · 17 min read
Why CLI AI Tools Matter: Aesthetics, Single vs Multi-Agent Design, Tips