Operations 9 min read

Inside Tencent Blue Whale: The Enterprise Operating System Powering Modern Ops

This article explains how Tencent's Blue Whale platform serves as an enterprise operating system (EOS) that unifies basic operations and internal SaaS services across private and public clouds, enabling standardized, automated, and intelligent management for large‑scale IT environments.

Efficient Ops
Efficient Ops
Efficient Ops
Inside Tencent Blue Whale: The Enterprise Operating System Powering Modern Ops

Introduction

Blue Whale has supported Tencent Games through toolization, standardization, automation, and service‑orientation, and in 2016 it will further enable intelligent operations while also providing basic services for Tencent‑invested companies and tenants on Tencent Cloud.

What Is Blue Whale?

Blue Whale is a "technology operation architecture" and an "Enterprise Operating System (EOS)" designed for operations and operational development.

Enterprise Cloud Definition

It refers to a company’s own IaaS, which may consist of: Private data centers co‑built with carriers or rented rack units, possibly virtualized or containerized. Cloud hosts rented from public providers such as AWS, Tencent Cloud, Azure, etc. Dedicated lines connecting private clouds and rented public‑cloud hosts.

In the usual sense, a PC operating system simplifies management of hardware for applications. An Enterprise Operating System (EOS) similarly simplifies management of enterprise IT infrastructure (enterprise cloud) and control of online business, while allowing rapid development of various operational systems such as release management, fault handling, CRM, OA, ERP, and other SaaS services.

By adopting an SOA model, internal systems can be linked cost‑effectively, leveraging existing IT assets and enabling planned development of new operational systems.

In short, an EOS is built for technical operation teams (operations and operational development) and focuses on two directions: basic operations and operational system construction .

Capabilities for Basic Operations

Control OS or containers across public clouds and private IDC, execute massive concurrent commands or scripts, and transfer large files efficiently while handling cross‑cloud issues such as transmission efficiency, data security, and IP conflicts.

Manage and label OS or containers across clouds, supporting multi‑level topology and custom key‑value tags.

Perform real‑time cross‑cloud big‑data collection and computation, gathering performance metrics and business logs for monitoring and business analysis, with data loss rates below one in a million.

Standardize script operations for visualized, auditable, and reusable workflows, allowing assembly of isolated scripts and file transfer tasks into jobs with static string or dynamic file parameters.

Capabilities for Internal Operational Systems

Provide a private PaaS enabling PaaS‑style development for various internal management systems, e.g., drag‑and‑drop front‑end builders, sample libraries for complex front‑end work, and backend frameworks for scheduling, queuing, and Docker‑based deployment.

Implement enterprise SOA with a service bus, exposing existing IT system functions as micro‑services via an API gateway for new operational systems.

Expose job management, configuration management, and other basic operation functions as APIs integrated into the API gateway.

Based on EOS, deliver generic or vertical SaaS services such as monitoring, release management, business analytics, performance management, and meeting management.

The diagram shows that Blue Whale operates independently of business architectures, allowing diverse business lines with different frameworks, languages, and environments (Windows, Linux, Docker, etc.) to be managed uniformly.

In this model, technical operation teams can divide responsibilities as illustrated below.

Within Tencent Games, Blue Whale is the foundation for refined operations, service‑oriented operations, and intelligent operations.

As an independent technical operation infrastructure, its stable development clarifies collaboration between development and application operations teams, ensuring long‑term stability.

The SaaS services produced collectively ensure basic operation quality and provide value‑added services that improve user experience and support operational decision‑making.

Conclusion

The concept of "tool culture" emphasizes that enterprises need tools or operational systems to ensure efficiency and security throughout core business processes.

However, high construction and maintenance costs often cause enterprises to delay adoption, even though the survival of a business increasingly depends on effective operations.

automationplatform architectureSOACloud Managemententerprise operations
Efficient Ops
Written by

Efficient Ops

This public account is maintained by Xiaotianguo and friends, regularly publishing widely-read original technical articles. We focus on operations transformation and accompany you throughout your operations career, growing together happily.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.