Tagged articles

52 articles

Jan 10, 2026 · Operations

How to Scale Your Web App from 10K to Millions: 10 Essential Practices

This guide outlines ten practical steps—adding load balancers, horizontal scaling, stateless services, connection pooling, aggressive caching, read replicas, task queues, auto‑scaling, WebSocket gateways, and comprehensive monitoring—to reliably handle sudden traffic spikes and keep your application responsive and cost‑effective.

Auto Scaling · Caching · Scalability

0 likes · 9 min read

Alibaba Cloud Big Data AI Platform

Nov 12, 2025 · Big Data

How MaxCompute’s Resource Advisor Cuts Costs by 60% for Large-Scale Data Workloads

This article details how GoTerra migrated from BigQuery to MaxCompute and used Resource Advisor, tiered quota strategies, and the TopN Fair scheduler to dynamically balance performance and cost across dozens of accounts and hundreds of quota groups, achieving up to 60% cost reduction.

Auto Scaling · Big Data · MaxCompute

0 likes · 9 min read

Full-Stack DevOps & Kubernetes

Oct 30, 2025 · Cloud Native

15 Real-World Kubernetes Use Cases You Need to Know

Explore the 15 most impactful Kubernetes scenarios—from microservices and auto‑scaling to multi‑cloud deployments, AI workloads, edge computing, and compliance—detailing how they boost reliability, efficiency, and cost‑effectiveness, while also highlighting situations where Kubernetes may not be the right choice.

AI Workloads · Auto Scaling · Kubernetes

0 likes · 11 min read

Code Wrench

Oct 26, 2025 · Backend Development

Build a Scalable Go Actor Framework with Auto‑Scaling and Graceful Shutdown

Explore the Go Actor model’s core concepts, compare popular Actor libraries, and follow a step‑by‑step implementation that introduces a mailbox, supervisor restart strategy, dynamic ActorPool with auto‑scaler, graceful shutdown via context, and Prometheus metrics, culminating in a complete, production‑ready concurrent framework.

Auto Scaling · Go · Prometheus

0 likes · 15 min read

dbaplus Community

Aug 20, 2025 · Operations

How Qunar Automates Hotel Capacity Planning with Predictive Scaling

This article details Qunar's end‑to‑end solution for forecasting traffic spikes, estimating required CPU resources, and automatically scaling hotel services using a combined flow‑calendar, algorithmic prediction, and Ops‑driven auto‑scaling pipeline, improving stability and operational efficiency.

Algorithmic Forecasting · Auto Scaling · Kubernetes

0 likes · 12 min read

Alibaba Cloud Infrastructure

Jul 9, 2025 · Cloud Native

How We Transformed an FPS Game to Cloud‑Native with OpenKruiseGame in 2 Months

Facing tight deadlines, Yahaha Studios rebuilt the server deployment of its FPS game STRIDEN, moving from a traditional Auto Scaling Group to a cloud‑native architecture built on OpenKruiseGame. The result was server startup within seconds, automated global scaling, lossless scaling, and significant cost reductions, along with a better player experience.

Auto Scaling · Deployment · Kubernetes

0 likes · 18 min read

Baidu Tech Salon

Jun 17, 2025 · Operations

How Baidu Scaled Its Vertical Search: Elastic Scheduling and Data Management Secrets

This article explains how Baidu's vertical search platform tackled massive data growth and scaling challenges by redesigning its data management system, introducing elastic scheduling, decoupling ETCD access, implementing auto‑scaling, and advancing shard expansion to improve performance, stability, and cost efficiency.

Auto Scaling · Data Management · Search Architecture

0 likes · 18 min read

Qunar Tech Salon

Mar 27, 2025 · Operations

Automated Capacity Planning and Auto‑Scaling for Hotel Services During Traffic Peaks

This document describes a comprehensive capacity‑planning solution that predicts traffic‑peak impacts for hotel services, automatically estimates required CPU resources, creates timed scaling tasks, and evaluates performance using detailed metrics, thereby improving operational efficiency and reducing manual effort during events such as exam‑ticket printing and holiday travel surges.

Auto Scaling · Cloud Computing · Operations

0 likes · 12 min read

Alibaba Cloud Infrastructure

Feb 10, 2025 · Artificial Intelligence

Hybrid Cloud Elastic LLM Inference Solution with ACK Edge and KServe

This article presents a hybrid‑cloud solution that uses ACK Edge and KServe to dynamically allocate on‑premise and cloud GPU resources for large‑language‑model inference, addressing tidal traffic patterns, reducing costs, and ensuring high availability through elastic scaling and custom scheduling policies.

ACK@Edge · Auto Scaling · KServe

0 likes · 13 min read

IT Architects Alliance

Jan 7, 2025 · Cloud Computing

Elastic Architecture: Auto Scaling and Failover for Resilient Systems

The article explains how elastic architecture, through auto‑scaling and failover mechanisms, dynamically adjusts resources and ensures continuous service during traffic spikes and component failures, improving cost efficiency, reliability, and operational stability for modern cloud‑based applications.

Auto Scaling · Cloud Computing · Elastic Architecture

0 likes · 16 min read

dbaplus Community

Mar 10, 2024 · Cloud Native

How We Built Elastic Scaling and Hybrid‑Cloud Auto‑Scaling on Kubernetes

After fully containerizing their platform, the team tackled front‑line development scaling challenges by designing a custom elastic‑scaling solution that combines dual‑threshold and timed scaling, integrates hybrid‑cloud ClusterAutoScale, consolidates middleware resources, and implements a comprehensive K8s observability stack, delivering over 30% additional compute capacity and near‑perfect scaling reliability.

Auto Scaling · Cloud Native · Kubernetes

0 likes · 12 min read

Full-Stack DevOps & Kubernetes

Mar 6, 2024 · Cloud Native

Master Kubernetes HPA: Automatic Pod Scaling with Real‑World Examples

This article explains how to configure Kubernetes Horizontal Pod Autoscaler (HPA) for automatic pod scaling, covering core concepts, metric selection, and two detailed YAML examples that demonstrate scaling based on CPU utilization and custom data‑processing rates.

Auto Scaling · Cloud Native · DevOps

0 likes · 6 min read

Architects Research Society

Sep 28, 2023 · Cloud Computing

Throttling Pattern for Cloud Applications: Managing Resource Consumption and SLA Compliance

The article explains how to use throttling together with auto‑scaling to control resource consumption of cloud applications, prevent tenant overload, handle traffic bursts, and ensure service‑level agreements while optimizing costs.

Auto Scaling · Multi‑tenant · Resource Management

0 likes · 13 min read

HelloTech

Aug 1, 2023 · Cloud Native

Elastic Scaling Practices in Cloud‑Native Kubernetes Environments

To overcome native HPA limits and business‑specific constraints in a fully containerized, cloud‑native Kubernetes environment, we implemented a dual‑threshold water‑level and scheduled scaling engine, hybrid‑cloud ClusterAutoScale, mixed‑deployment resource prioritization, and comprehensive Prometheus‑based observability, achieving higher utilization, lower costs, and a roadmap toward deeper optimization and AIOps.

Auto Scaling · Cloud Native · Kubernetes

0 likes · 10 min read

DaTaobao Tech

Jul 5, 2023 · Cloud Native

Cloud‑Native Multi‑Tenant Architecture and Network Isolation in Taobao Open Platform

The Taobao Open Platform adopts a cloud‑native, multi‑tenant architecture that abstracts infrastructure, isolates tenants via independent or shared switch‑plus‑security‑group schemes with dual ENI pod networking, and leverages Kubernetes auto‑scaling to simplify onboarding, cut operational costs, and enable future low‑code and FaaS extensions.

Auto Scaling · Kubernetes · Multi‑tenant

0 likes · 14 min read

ByteDance Cloud Native

Jun 1, 2023 · Cloud Native

How to Deploy and Scale ByConity’s Cloud‑Native Data Warehouse on Kubernetes

ByConity is a cloud‑native, storage‑compute separated data warehouse engine that supports multi‑tenant isolation, high performance, and elastic scaling; this guide explains its three‑layer architecture, hardware requirements, Helm‑based Kubernetes deployment, dynamic scaling, and practical SQL testing steps.

Auto Scaling · ByConity · Cloud Native

0 likes · 11 min read

Efficient Ops

Mar 21, 2023 · Operations

How Hupu Scaled to Millions: Inside the Flex Auto‑Scaling Platform

This article details Hupu's massive sports‑traffic environment, the design and implementation of the Flex auto‑scaling platform, its architecture, core functions such as resource statistics, node and pod scaling, scenario scheduling, and the performance optimizations that enable rapid, cost‑effective scaling across multi‑cloud Kubernetes clusters.

Auto Scaling · Kubernetes · Performance Optimization

0 likes · 15 min read

JD Cloud Developers

Feb 6, 2023 · Cloud Computing

How to Build a High‑Availability Web Architecture on JD Cloud with Docker and WordPress

This guide walks through designing and deploying a highly available, scalable WordPress web application on JD Cloud using elastic IaaS, PaaS services, load balancers, HA groups, Docker containers, MySQL, Redis, and automated scripts to demonstrate fault tolerance across availability zones.

Auto Scaling · Cloud Computing · Load Balancer

0 likes · 16 min read

Alimama Tech

Nov 2, 2022 · Artificial Intelligence

Optimizing GPU Utilization for Multimedia AI Services with high_service

The article presents high_service, a high‑performance inference framework that boosts GPU utilization in multimedia AI services by separating CPU‑heavy preprocessing from GPU inference, employing priority‑based auto‑scaling, multi‑tenant sharing, and TensorRT‑accelerated models to eliminate GIL bottlenecks, reduce waste, and adapt to fluctuating traffic, with future work targeting automated bottleneck detection and further CPU‑GPU offloading.

Auto Scaling · GPU utilization · High Performance Computing

0 likes · 19 min read

Tencent Cloud Developer

Sep 29, 2022 · Cloud Native

Improving Kubernetes Cluster Utilization: Practices and Optimization Strategies

The session detailed how Tencent’s container experts boost Kubernetes cluster utilization by correcting pod resource requests, employing two‑level auto‑scaling, dynamic over‑commit, adaptive scheduling and eviction, and using HPA/EHPA/VPA, achieving up to 38.7% node usage and roughly 60% cost savings in real‑world workloads.

Auto Scaling · Kubernetes · Pod Scheduling

0 likes · 11 min read

Tencent Cloud Developer

Jul 26, 2022 · Cloud Native

Understanding Knative: A Cloud-Native Serverless Framework

Knative is a CNCF‑incubated, cloud‑native serverless framework on Kubernetes that combines Build, Eventing, and Serving components—featuring a Knative Pod Autoscaler that can scale pods to zero—offering improved resource utilization, rapid traffic response, and developer productivity despite modest performance overhead.

Auto Scaling · CNCF · Cloud-Native

0 likes · 16 min read

360 Smart Cloud

Jul 14, 2022 · Cloud Computing

Auto Scaling (AS) in Cloud Services: Architecture, Use Cases, and Optimization Strategies

This article explains the concept of elastic auto scaling in cloud services, describes typical scenarios such as high‑elastic web apps and compute‑intensive workloads, details the four‑layer architecture and workflow, and outlines functional features, stability improvements, and future optimization directions.

Auto Scaling · cloud-computing · elasticity

0 likes · 12 min read

DataFunSummit

Jul 1, 2022 · Big Data

Exploring and Implementing Elastic Scheduling for Xiaomi Hadoop YARN

Shilong Fei from Xiaomi Data Platform presents an in‑depth exploration of elastic scheduling for Hadoop YARN, covering background, design of resource pools, auto‑scaling architecture, challenges such as job stability and user transparency, achieved cost reductions, and future plans for further optimization.

Auto Scaling · Big Data · Hadoop

0 likes · 20 min read

Zuoyebang Tech Team

Jun 17, 2022 · Big Data

How FlinkSQL Auto‑Tuning Saves Resources and Guarantees SLA

This article describes the design and implementation of an automated FlinkSQL tuning system that monitors metrics, evaluates task health with rule‑based logic, calculates optimal resource adjustments, and performs fast scaling to reduce cluster waste, lower operational costs, and maintain SLA compliance.

Akka · Auto Scaling · Flink

0 likes · 15 min read

Shopee Tech Team

May 26, 2022 · Cloud Computing

Shopee's Green Computing Practices: Optimizing Resource Utilization in Data Centers

Shopee reduces data‑center carbon emissions by over 40,000 tons annually through three 2021 green‑computing technologies—Overcommit resource oversubscription, mixed‑model Colocation of latency‑sensitive and batch workloads, and enhanced Auto Scaling that leverages global metrics to cut machine usage and improve resource efficiency.

Auto Scaling · Cloud Computing · Kubernetes

0 likes · 15 min read

HomeTech

Dec 7, 2021 · Big Data

Flink Task Auto-scaling Design and Implementation

This article presents the design and implementation of Flink task auto‑scaling, covering background, manual and automatic scaling mechanisms, architecture with RescaleCoordinator, persistence via Zookeeper and HDFS, scaling policies for parallelism, CPU and memory, and future plans for fine‑grained and time‑based resource adjustments.

Auto Scaling · Flink · HDFS

0 likes · 4 min read

Liulishuo Tech Team

Oct 29, 2021 · Cloud Computing

Automating Cloud Infrastructure at Liulishuo: Deployment, Management, and Governance Practices

The article describes Liulishuo's Cloud Infra team's end‑to‑end automation of cloud resource provisioning, scaling, and cost governance using Terraform, a custom Luban platform, GitLab CI/CD, and chat‑bot integrations, highlighting the architectural design, implementation steps, and measurable benefits for both operations and business teams.

Auto Scaling · cloud automation · cost optimization

0 likes · 10 min read

Tencent Cloud Developer

Jul 7, 2021 · Cloud Native

Design and Practice of Tencent Cloud Native Database TDSQL-C Serverless Architecture

TDSQL‑C Serverless separates compute and storage, delivers instant elastic scaling for MySQL and PostgreSQL, charges per‑second usage, pauses and stops billing when idle, and supports low‑frequency, archival, development, and micro‑service workloads with a ~2‑second cold‑start.

Auto Scaling · Cloud Native · Database

0 likes · 13 min read

dbaplus Community

Jun 17, 2021 · Cloud Native

How Dada Achieved Seamless Elastic Scaling for Massive Delivery Peaks

Facing surges during holidays and major shopping events, Dada’s DevOps team built a cloud‑native elastic scaling system that combines fine‑grained capacity management, multi‑cloud support, metric‑driven auto‑scaling, and extreme scale‑down strategies, delivering stable delivery performance while cutting costs.

Auto Scaling · Operations · capacity management

0 likes · 17 min read

IT Architects Alliance

Jun 10, 2021 · Cloud Native

Designing High‑Availability Stateless Services: Load Balancing, Scaling, and Deployment Strategies

This article explains how to achieve high availability for stateless services by employing redundancy, vertical and horizontal scaling, various load‑balancing algorithms (random, round‑robin, weighted, least‑connections, source‑hash), and automatic scaling techniques in cloud‑native environments, while also covering performance monitoring and CDN/OSS usage.

Auto Scaling · Cloud Native · Vertical Scaling

0 likes · 10 min read

ITFLY8 Architecture Home

Jun 10, 2021 · Operations

How to Build Truly High‑Availability Stateless Services: Strategies & Algorithms

This article explains how to design highly available stateless services by covering redundancy, vertical and horizontal scaling, load‑balancing algorithms, high‑concurrency identification, and the use of CDN/OSS, offering practical guidance for robust backend architecture.

Auto Scaling · Scaling · high availability

0 likes · 9 min read

Alibaba Cloud Developer

May 19, 2021 · Cloud Computing

How to Optimize Cloud Resource Scheduling After Migration

After migrating to the cloud, enterprises must evaluate resource scale, cost pressure, and staffing before deciding whether to build their own scheduling system, and can choose among ECS, Dedicated Host, or private pool solutions, each with specific advantages, drawbacks, and suitable scenarios.

Auto Scaling · capacity planning · dedicated host

0 likes · 15 min read

ITFLY8 Architecture Home

Apr 23, 2021 · Operations

How Consistent Hashing Minimizes Data Migration During Auto-Scaling

This article explains how consistent hashing, combined with Docker-based microservice architectures, reduces data movement during dynamic scaling by using a ring‑shaped hash space and intelligent node placement, making auto‑scaling more efficient and resilient.

Auto Scaling · Distributed Systems · Microservices

0 likes · 4 min read

Efficient Ops

Apr 20, 2021 · Operations

How Dada’s Intelligent Elastic Scaling Cuts Costs and Boosts Delivery Performance

This article details Dada Group’s implementation of an intelligent elastic scaling architecture that automatically adjusts capacity during peak promotions and low‑traffic periods, improving delivery reliability, reducing costs, and supporting multi‑cloud and multi‑runtime environments through sophisticated monitoring and auto‑scaler mechanisms.

Auto Scaling · Monitoring · Operations

0 likes · 17 min read

Dada Group Technology

Apr 19, 2021 · Operations

Exploring Elastic Capacity and Automated Scaling Architecture at Dada Group

This article presents Dada Group's comprehensive approach to elastic capacity management and automated scaling, detailing the challenges faced during traffic spikes, the design of a cloud‑native auto‑scaler, multi‑metric observability, decision‑making logic, execution mechanisms, extreme scaling practices, and future optimization directions.

Auto Scaling · Cloud Native · SRE

0 likes · 15 min read

Xianyu Technology

Dec 17, 2020 · Cloud Native

Elastic Scaling in Serverless Cloud‑Native Applications

Elastic scaling, a cornerstone of Xianyu’s shift to serverless cloud-native architecture, leverages Kubernetes autoscaling components—Cluster‑Autoscaler, HPA, VPA—to dynamically adjust resources via reactive thresholds or predictive models, yet faces challenges like cold‑starts, lack of scale‑to‑zero, and optimal pod‑pool buffering, prompting ongoing research for faster, smarter, safer scaling.

Auto Scaling · Cloud-native · Kubernetes

0 likes · 19 min read

Full-Stack DevOps & Kubernetes

Oct 23, 2020 · Cloud Native

Master Kubernetes Horizontal Pod Autoscaling with Metrics Server and Custom Metrics

This guide walks through setting up Kubernetes Horizontal Pod Autoscaling (HPA) using the Metrics Server for CPU and memory metrics, deploying a sample Go web app, performing load tests, and extending autoscaling with Prometheus‑based custom metrics for fine‑grained scaling control.

Auto Scaling · Horizontal Pod Autoscaler · Kubernetes

0 likes · 20 min read

Java High-Performance Architecture

Dec 26, 2019 · Cloud Native

Why Kubernetes Is the Ultimate Tool for Fast, Scalable Container Deployments

This article explains what containers are, introduces Kubernetes as a leading open‑source container orchestration platform, and details its key advantages such as universal deployment, seamless cloud migration, efficient resource use, built‑in auto‑scaling, simplified CI/CD, and high reliability.

Auto Scaling · Kubernetes · ci/cd

0 likes · 8 min read

Alibaba Cloud Developer

Sep 17, 2019 · Cloud Native

How NBF’s FaaS Architecture Powers Serverless at Alibaba’s Mega Sales

This article explains how Alibaba's New‑Retail Business Framework (NBF) implements a non‑typical FaaS architecture that delivers full Serverless capabilities—including containerized bundle management, service publishing, routing, fault tolerance, millisecond‑level auto‑scaling, and rapid rollback—proving its reliability during large‑scale promotional events.

Auto Scaling · Container · FaaS

0 likes · 16 min read

Efficient Ops

Sep 27, 2018 · Operations

Tencent Billing’s Secret to Managing Massive Promo Spikes

Tencent’s billing platform powers billions of daily transactions across 180+ countries, supporting both consumer and business payments, and employs sophisticated capacity testing, dynamic auto‑scaling, resource sharing, and change‑control mechanisms to ensure reliable large‑scale promotional events without service disruptions.

Auto Scaling · Operations · Tencent Billing

0 likes · 15 min read

High Availability Architecture

May 10, 2018 · Cloud Native

Kubernetes Automatic Scaling with Custom Metrics Using Prometheus and HPA v2

This article explains how to configure Kubernetes Horizontal Pod Autoscaler (HPA) to scale workloads based on custom business metrics collected by Prometheus, covering installation of Metrics Server, deployment of a demo app, setup of the Prometheus adapter, and practical load‑testing steps.

Auto Scaling · Horizontal Pod Autoscaler · Kubernetes

0 likes · 7 min read

Tencent Cloud Developer

Apr 8, 2018 · Game Development

WeChat Mini-Game Architecture Design and Development Guide

The guide outlines a scalable, globally unified architecture for WeChat mini-games—combining stateless compute layers with CLB, auto-scaling, a star-router model for peer communication, and horizontally-sharded storage using Tencent Cloud’s DCDB or TCaplus to handle rapid viral growth.

Auto Scaling · Game Architecture · Game Development

0 likes · 9 min read

ITFLY8 Architecture Home

Mar 29, 2018 · Backend Development

How 1号店 Scaled Its Search Engine for 11.11: Distributed Architecture, Sharding, and Auto‑Scaling

This article explains how 1号店 built a distributed search engine with horizontal scaling, custom sharding and routing strategies, automated deployment, rapid expansion, and real‑time monitoring to handle the massive traffic spikes of the annual 11.11 e‑commerce promotion.

Auto Scaling · Distributed Search · e-commerce backend

0 likes · 13 min read

Alibaba Cloud Developer

Mar 8, 2018 · Operations

How Cainiao Ark’s Elastic Scheduling Boosts Resource Efficiency and Cuts Costs

This article explains why Cainiao needed an elastic scheduling system, how its unique business and technical characteristics make it ideal for such a solution, and details the architecture, decision‑making layers, strategies, and real‑world results that together improve resource utilization, stability, and cost efficiency.

Auto Scaling · Cainiao Ark · Resource Management

0 likes · 27 min read

Alibaba Cloud Developer

Dec 19, 2017 · Operations

How Alibaba’s TPP Intelligent Scheduler Boosts Resource Utilization and Handles Double‑11 Traffic

The article details Alibaba's Taobao Personalization Platform (TPP) intelligent scheduling system, explaining its architecture, optimization algorithms, convergence logic, and performance results that dramatically improve CPU utilization and automate scaling during both regular operation and high‑traffic events like Double‑11.

Alibaba · Auto Scaling · cloud operations

0 likes · 21 min read

Hujiang Technology

Aug 16, 2017 · Cloud Native

Implementing DevOps with Container Orchestration: CI/CD Pipeline, Docker Image Optimization, and Automated Operations

This article explains how to build a DevOps workflow using container orchestration, covering agile development with Docker image slimming, a Jenkins‑based CI/CD pipeline, simplified Gitflow, container monitoring with cAdvisor‑InfluxDB‑Grafana, and auto‑scaling via Marathon, while separating automated deployment from production release.

Auto Scaling · Jenkins · ci/cd

0 likes · 16 min read

Efficient Ops

Apr 25, 2017 · Cloud Native

How CaaS Transforms Microservice Development and Operations with Docker

This article explains the differences between monolithic and microservice architectures, introduces Docker as a key tool for microservices, and details how a Container‑as‑a‑Service (CaaS) platform can provide service discovery, routing, logging, monitoring, auto‑scaling, SLA enforcement, and deployment strategies to simplify development and operations.

Auto Scaling · Cloud Native · Docker

0 likes · 16 min read

ITFLY8 Architecture Home

Oct 12, 2016 · Backend Development

How to Implement and Manage Feature Toggles in Java for Scalable Systems

This article explains how to design and operate feature toggles in Java applications, covering single‑instance implementation, cross‑instance synchronization via a meta‑server or Diamond, handling composite switches, avoiding security pitfalls, and automating degradation and upgrade based on runtime metrics.

Auto Scaling · Configuration Management · feature toggle

0 likes · 8 min read

dbaplus Community

Aug 9, 2016 · Cloud Native

Scaling Qiniu Cloud's Custom Data Processing with Docker Containerization

Qiniu Cloud transformed its high‑traffic data processing platform by containerizing services with Docker, addressing challenges such as massive request volume, CPU‑intensive workloads, IO bottlenecks, and burst traffic through architecture evolution, queueing, rate limiting, auto‑scaling, and secure, isolated custom processing pipelines.

Auto Scaling · Microservices · data-processing

0 likes · 20 min read

21CTO

Nov 17, 2015 · Backend Development

Scaling Search for 11.11: Distributed Engine, Smart Routing & Auto‑Scaling

This article explains how a major e‑commerce platform built a horizontally scalable distributed search engine, designed efficient sharding and routing strategies, and implemented automated deployment, rapid scaling, and real‑time monitoring to handle the massive traffic of the 11.11 shopping festival.

Auto Scaling · Distributed Search · Routing

0 likes · 13 min read

Architect

Oct 18, 2015 · Cloud Computing

Practices and Reflections on Enterprise Cloud Platforms

The article shares the author’s experience designing and operating enterprise‑grade cloud platforms, covering resource and application management, the Platform EGO architecture, comparisons with Mesos, Yarn and Kubernetes, and practical insights on scaling, scheduling, security, and architectural evolution.

Auto Scaling · Cloud Computing · Distributed Systems

0 likes · 18 min read

21CTO

Sep 16, 2015 · Databases

How TDSQL Achieves Scalable, High‑Availability Distributed SQL on MySQL

This article explains how TDSQL transforms MySQL into a distributed, high‑availability SQL system by addressing NoSQL limitations, introducing a Scheduler‑Agent‑Gateway architecture, automatic scaling, sharding, robust disaster‑recovery mechanisms, and future integration with container technologies.

Auto Scaling · Sharding · distributed database

0 likes · 19 min read