Tag

data center

1 views collected around this technical thread.

Architecture Digest
Architecture Digest
Jun 10, 2025 · Operations

How Much Bandwidth Does Douyin (TikTok) Really Have? Inside Its Massive Data Centers

This article explains how Douyin, TikTok, Baidu, Alibaba Cloud and Tencent operate self‑built data centers with terabit‑level outbound bandwidth, details ByteDance's server count growth from tens of thousands to hundreds of thousands, and describes the CDN technologies that enable billions of users to stream smoothly.

BandwidthServer CountTikTok
0 likes · 8 min read
How Much Bandwidth Does Douyin (TikTok) Really Have? Inside Its Massive Data Centers
Architects' Tech Alliance
Architects' Tech Alliance
May 26, 2025 · Artificial Intelligence

NVLink Fusion: NVIDIA’s High‑Bandwidth Interconnect for Heterogeneous AI Computing

NVLink Fusion, unveiled at Computex 2025, extends NVIDIA’s NVLink technology to enable high‑bandwidth, low‑latency connections between CPUs and GPUs or third‑party accelerators, offering up to 900 GB/s bandwidth, flexible heterogeneous configurations, ecosystem expansion, performance gains for AI training and inference, and potential cost reductions.

AICPUGPU
0 likes · 12 min read
NVLink Fusion: NVIDIA’s High‑Bandwidth Interconnect for Heterogeneous AI Computing
Top Architecture Tech Stack
Top Architecture Tech Stack
May 22, 2025 · Operations

Understanding the Bandwidth and Server Scale of Douyin (TikTok) Data Centers

This article explains how Douyin (TikTok) and other major Chinese platforms achieve massive concurrent usage by operating data centers with hundreds of thousands of servers, employing terabit-level outbound bandwidth, dual‑link designs, CDN acceleration, and multi‑node load balancing, and provides estimates of server counts and bandwidth capacities.

BandwidthDouyinServer Scale
0 likes · 8 min read
Understanding the Bandwidth and Server Scale of Douyin (TikTok) Data Centers
Java Captain
Java Captain
May 12, 2025 · Operations

How ByteDance Powers Douyin/TikTok with Massive Bandwidth and Server Infrastructure

The article explains ByteDance's enormous data‑center bandwidth, server counts, and CDN architecture that enable hundreds of millions of concurrent users on Douyin and TikTok, detailing estimates of total outbound capacity, multi‑link designs, and the role of cloud and IDC resources.

BandwidthByteDanceDouyin
0 likes · 8 min read
How ByteDance Powers Douyin/TikTok with Massive Bandwidth and Server Infrastructure
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 25, 2025 · Fundamentals

Alibaba Network Proposal OSFP MSA Passes Unanimously, Introducing the First Liquid‑Cooled OSFP Cage Standard

Alibaba Cloud’s infrastructure network team’s split‑type OSFP Cage proposal was unanimously approved by the OSFP MSA committee, becoming the first standard supporting liquid‑cooled OSFP cold plates, offering low‑cost, easy‑assembly solutions that address the growing power‑consumption challenges of high‑density AI switches.

AI SwitchesHardware StandardOSFP
0 likes · 5 min read
Alibaba Network Proposal OSFP MSA Passes Unanimously, Introducing the First Liquid‑Cooled OSFP Cage Standard
Architects' Tech Alliance
Architects' Tech Alliance
Apr 21, 2025 · Artificial Intelligence

UALink 1.0: An Open High‑Speed Interconnect Challenging Nvidia’s AI Dominance

The UALink 1.0 specification, driven by AMD, Intel, Broadcom and other industry leaders, introduces an open, low‑latency, high‑bandwidth interconnect that can link up to 1,024 AI accelerators, offering a cost‑effective alternative to Nvidia’s NVLink and reshaping the AI‑HPC market.

AI interconnectHigh Performance ComputingNvidia competition
0 likes · 11 min read
UALink 1.0: An Open High‑Speed Interconnect Challenging Nvidia’s AI Dominance
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 18, 2025 · Artificial Intelligence

Alibaba Cloud Showcases Optical Interconnect Innovations at OFC 2025 50th Anniversary

At the OFC 2025 50th anniversary in San Francisco, Alibaba Cloud presented cutting‑edge optical interconnect research and solutions for AI computing and modern data‑center networks, highlighted by invited talks, breakthrough demos, and two data‑driven QoT estimation papers co‑authored with Hong Kong Polytechnic University.

AI computingCloud NetworkingPhotonic Integration
0 likes · 6 min read
Alibaba Cloud Showcases Optical Interconnect Innovations at OFC 2025 50th Anniversary
ByteDance SYS Tech
ByteDance SYS Tech
Apr 11, 2025 · Operations

How User‑Space MPTCP with DPDK Doubles Throughput in Data Centers

This article details the design, implementation, and performance evaluation of a user‑space MPTCP stack built on DPDK, showing how a layered, zero‑copy architecture and same‑core lock‑free forwarding can boost data‑center throughput by up to 100% while reducing latency by about 10%, all while remaining compatible with existing TCP applications.

DPDKMPTCPPerformance Optimization
0 likes · 12 min read
How User‑Space MPTCP with DPDK Doubles Throughput in Data Centers
Code Mala Tang
Code Mala Tang
Mar 21, 2025 · Artificial Intelligence

What Are the Four Waves of AI and How NVIDIA Is Shaping the Future?

NVIDIA’s GTC 2025 keynote outlines the four AI waves—from perception to physical AI—while highlighting the company’s latest Blackwell chips, DGX Spark/Station computers, Dynamo inference accelerator, robotics collaborations, GM autonomous‑driving partnership, and AI‑native 6G efforts, underscoring massive data‑center investment and future challenges.

AI hardwareArtificial IntelligenceNvidia
0 likes · 11 min read
What Are the Four Waves of AI and How NVIDIA Is Shaping the Future?
ByteDance SYS Tech
ByteDance SYS Tech
Feb 18, 2025 · Operations

How Can Data Center Planning Cut Costs and Boost Efficiency?

This article explains how a mixed‑integer programming tool developed by ByteDance's SYS‑DCD team integrates cost, reliability, delivery speed, and environmental metrics to optimize data‑center planning, reduce power waste, and accelerate deployment across multiple regional scenarios.

Linear Programmingdata centerenergy efficiency
0 likes · 15 min read
How Can Data Center Planning Cut Costs and Boost Efficiency?
Deepin Linux
Deepin Linux
Dec 25, 2024 · Fundamentals

An Introduction to RDMA: Principles, Programming, and Applications

This article explains RDMA technology, covering its core principles, programming model with Verbs API, various communication modes, and its impact on data‑center networking, high‑performance computing, and distributed storage, highlighting its low‑latency, zero‑copy advantages over traditional TCP/IP.

High Performance ComputingRDMAZero Copy
0 likes · 30 min read
An Introduction to RDMA: Principles, Programming, and Applications
Architects' Tech Alliance
Architects' Tech Alliance
Dec 15, 2024 · Fundamentals

Comprehensive Analysis of Ethernet Evolution, Switch Market, and High‑Speed Chip Developments

This article provides a detailed overview of Ethernet’s origins, its rapid speed evolution, the expanding role of Ethernet in data centers, automotive, industrial and cloud networks, and examines the current switch market, chip technologies, and future high‑speed developments driven by AI and AIGC.

AIEthernetHigh Speed Networking
0 likes · 10 min read
Comprehensive Analysis of Ethernet Evolution, Switch Market, and High‑Speed Chip Developments
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Oct 25, 2024 · Artificial Intelligence

Highlights of Chinese Enterprises at the 2024 OCP Global Summit: AI Network Architecture, High‑Performance Cooling, and WAN Innovations

The 2024 OCP Global Summit in San Jose showcased Chinese tech leaders like Alibaba Cloud and ByteDance presenting cutting‑edge AI network architectures, liquid‑cooling solutions, SRv6 deployments, high‑performance data‑center designs, and future WAN routing innovations, underscoring China's growing influence in AI infrastructure worldwide.

AI networkingHigh Performance ComputingOCP Summit
0 likes · 8 min read
Highlights of Chinese Enterprises at the 2024 OCP Global Summit: AI Network Architecture, High‑Performance Cooling, and WAN Innovations
Architects' Tech Alliance
Architects' Tech Alliance
Sep 8, 2024 · Artificial Intelligence

Design and Architecture of Multi‑Million GPU Clusters for Large‑Scale AI Model Training

The article surveys the network architectures and congestion‑control techniques used in massive GPU clusters—such as Byte’s megascale, Baidu HPN, Alibaba HPN7, and Tencent Xingmai 2.0—highlighting how high‑bandwidth, low‑latency designs and advanced RDMA technologies enable training of trillion‑parameter multimodal AI models.

AI infrastructureGPU clustersHPN
0 likes · 11 min read
Design and Architecture of Multi‑Million GPU Clusters for Large‑Scale AI Model Training
Architects' Tech Alliance
Architects' Tech Alliance
Sep 1, 2024 · Fundamentals

Full Liquid‑Cooled Cold Plate Server Design and Performance Testing (2024)

This article presents a comprehensive reference design and performance evaluation of a 2U four‑node high‑density server employing full liquid‑cooled cold plates for CPUs, memory, storage, NICs, and power supplies, detailing system architecture, flow design, CFD validation, and future optimization directions.

CFD simulationdata centerhigh density
0 likes · 11 min read
Full Liquid‑Cooled Cold Plate Server Design and Performance Testing (2024)
Architects' Tech Alliance
Architects' Tech Alliance
Aug 30, 2024 · Cloud Native

AmpereOne A192-32X: A 192‑Core ARM Server CPU and Its LGA5964 Socket

The article provides an in‑depth technical overview of Ampere’s custom‑core AmpereOne A192‑32X 192‑core ARM server processor, covering its architecture, cloud‑native features, performance comparisons with AMD EPYC and Intel Xeon, cooling design, LGA5964 socket details, and benchmark results from real‑world stress testing.

ARM server CPUAmpereOneLGA5964
0 likes · 10 min read
AmpereOne A192-32X: A 192‑Core ARM Server CPU and Its LGA5964 Socket
Architects' Tech Alliance
Architects' Tech Alliance
Aug 21, 2024 · Fundamentals

Comprehensive Liquid‑Cooling Reference Design for Server Components (2024)

This white‑paper presents a 2024 reference design and performance evaluation of full‑liquid‑cooling solutions for CPUs, memory, SSDs, PCIe/OCP cards, power supplies and IO boards, detailing architecture, advantages, implementation methods and deployment scenarios for data‑center and telecom applications.

Hardware Engineeringdata centerliquid cooling
0 likes · 12 min read
Comprehensive Liquid‑Cooling Reference Design for Server Components (2024)
ByteDance SYS Tech
ByteDance SYS Tech
Aug 9, 2024 · Cloud Computing

How ByteDance’s Open Compute Innovations Are Shaping Cloud Infrastructure

ByteDance won the 2024 Open Compute Best Practice Award for its groundbreaking work in cloud firmware, OpenBMC, VDUSE virtualization, and kernel memory optimizations, illustrating how open‑source collaboration drives more efficient, scalable data‑center infrastructure in the AI era.

Kernel OptimizationOpen ComputeOpenBMC
0 likes · 10 min read
How ByteDance’s Open Compute Innovations Are Shaping Cloud Infrastructure
Architects' Tech Alliance
Architects' Tech Alliance
Jul 22, 2024 · Fundamentals

Comprehensive Overview of Data Center Architecture and Its Core Components

This article provides a detailed overview of modern data center architecture, covering physical and IT infrastructure, network topologies such as three‑tier and spine‑leaf, storage solutions like DAS, NAS and SAN, server designs, cloud data‑center components, physical site considerations, and various data‑center deployment models.

Cloud Computingarchitecturedata center
0 likes · 20 min read
Comprehensive Overview of Data Center Architecture and Its Core Components
Practical DevOps Architecture
Practical DevOps Architecture
Jun 13, 2024 · Operations

Comprehensive Data Center Operations Training Course Overview

This extensive training program covers everything a data center operations engineer needs—from foundational infrastructure management and server hardware maintenance to advanced network configuration, security hardening, monitoring, fault handling, and practical hands‑on skills for real‑world challenges.

Server managementdata centerinfrastructure
0 likes · 6 min read
Comprehensive Data Center Operations Training Course Overview