Backend Development 11 min read

Designing a Scalable Architecture for Million‑Level DAU Systems

The article outlines a comprehensive backend architecture for handling million‑to‑tens‑of‑million daily active users, covering DNS routing, L4/L7 load balancing, monolithic versus microservice deployment, caching, database sharding, hybrid‑cloud strategies, elastic scaling, and multi‑level degradation mechanisms.

IT Architects Alliance

Feb 4, 2022

Designing a Scalable Architecture for Million‑Level DAU Systems

Recent incidents of service outages caused by insufficient scalability highlight the need for a robust architecture that can automatically expand and contract under sudden traffic spikes.

The request flow starts with DNS resolution, which directs users to the appropriate regional data center, followed by L4 load balancing (typically LVS) for traffic forwarding and L7 load balancing (commonly Nginx) for application‑level routing, authentication, logging, and monitoring.

Backend services can be deployed as a monolithic application for simple or small‑team projects, or as microservices when the number of interfaces and team size grows, allowing independent development, packaging, and deployment.

Caching layers (e.g., Redis, Memcached) reduce database latency, while the database layer must support high availability through master‑slave replication, sharding by time or user ID, and partitioning to handle terabyte‑scale data.

For traffic beyond the capacity of private data centers, a hybrid‑cloud architecture distributes load between private and public clouds, requiring network interconnectivity (often via dedicated lines) and careful placement of services across clouds.

Full‑link elastic scaling ensures that L4/L7, services, cache, and databases can dynamically scale; services should be stateless and use metrics‑driven auto‑scaling, while cache and database expansions must account for warm‑up times and data synchronization delays.

A three‑level degradation strategy (invisible, user‑perceptible, and severe) provides a safety net by gracefully shedding load when resources become constrained.

Additional supporting mechanisms such as decision‑support systems and alerting are essential for maintaining reliability at the ten‑million‑plus DAU scale.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Microservices load balancing scalable architecture database sharding elastic scaling hybrid cloud

Written by

IT Architects Alliance

Discussion and exchange on system, internet, large‑scale distributed, high‑availability, and high‑performance architectures, as well as big data, machine learning, AI, and architecture adjustments with internet technologies. Includes real‑world large‑scale architecture case studies. Open to architects who have ideas and enjoy sharing.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.