Backend Development 9 min read

The Evolution of Taobao’s Technology Architecture: From Monolith to Distributed Systems

This article examines Taobao’s pivotal architectural transformation from a monolithic codebase to a large‑scale distributed system, detailing the challenges of personnel, business complexity, Oracle bottlenecks, and hardware limits, and describing the comprehensive solutions involving code and interface vertical splitting, custom middleware, database sharding, and new infrastructure.

Top Architect
Top Architect
Top Architect
The Evolution of Taobao’s Technology Architecture: From Monolith to Distributed Systems

Today we focus on the most decisive architectural evolution in Taobao’s history – the transition from the third to the fourth stage – which fundamentally reshaped the platform and provides a blueprint for large‑scale internet companies.

The third stage faced multiple pressures: massive codebase (Denali) maintained by hundreds of engineers, diverse business requirements (users, shops, payments, etc.), Oracle’s connection‑pool and CPU limits, and the need for horizontal scaling of servers and data centers.

To address these issues, Taobao vertically split the Denali codebase by business domain, creating dozens of independent services such as UM (User Manager) and SM (Shop Manager). Interfaces were also split per business, and a custom high‑performance communication framework, HSF, was introduced to support various remote‑call patterns.

Simultaneously, a middleware matrix emerged, including distributed cache Tair, distributed file system TFS, asynchronous messaging Notify, the database routing layer TDDL, and a session framework, all designed to reduce pressure on the Oracle database.

On the database side, Taobao performed both vertical and horizontal sharding: core services kept Oracle, while non‑core services migrated to MySQL; horizontal partitioning and read/write separation were managed by TDDL, allowing transparent routing without application changes.

Additional measures included building the proprietary ISearch engine to offload product search from Oracle, establishing an independent CDN infrastructure, creating an operations platform (the “Hubble Telescope”) for monitoring and alerting, and implementing disaster‑recovery data centers to ensure continuous availability.

The overall evolution required substantial investment but resulted in a robust, scalable, and distributed architecture that underpins Taobao’s current performance and reliability.

distributed systemsArchitectureMicroservicesTaobaoscalabilitymiddlewaredatabase sharding
Top Architect
Written by

Top Architect

Top Architect focuses on sharing practical architecture knowledge, covering enterprise, system, website, large‑scale distributed, and high‑availability architectures, plus architecture adjustments using internet technologies. We welcome idea‑driven, sharing‑oriented architects to exchange and learn together.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.