Tag

PGO

0 views collected around this technical thread.

Tencent Cloud Developer
Tencent Cloud Developer
Oct 19, 2023 · Fundamentals

Profile-Guided Optimization (PGO) Principles and Practice in Go and C++

Profile‑Guided Optimization (PGO) collects runtime profiling data to recompile programs for higher performance, reducing branch mispredictions and improving code layout; Go gained built‑in PGO in 1.21 with typical 5 % gains, while C++ sees 15‑18 % QPS improvements and devirtualization benefits, and future work aims at deeper block ordering and register allocation.

C++GoPGO
0 likes · 16 min read
Profile-Guided Optimization (PGO) Principles and Practice in Go and C++
JD Retail Technology
JD Retail Technology
Oct 11, 2023 · Databases

Performance Tuning of JED Database on Huawei Kunpeng ARM vs Intel X86 Platforms

This technical report details the background, hardware configuration, database setup, tuning results, and step‑by‑step optimization procedures—including BIOS, OS, network, container NUMA binding, MySQL CRC32 patching, and Go PGO tuning—performed to improve JED performance on ARM compared with Intel.

ARMDatabaseKernel
0 likes · 12 min read
Performance Tuning of JED Database on Huawei Kunpeng ARM vs Intel X86 Platforms
ByteDance SYS Tech
ByteDance SYS Tech
Jan 6, 2023 · Fundamentals

How ByteDance Scaled Profile‑Guided Optimization to Boost CPU Efficiency

This article explains ByteDance's large‑scale adoption of profile‑guided optimization (PGO), covering its principles, instrumentation and sampling methods, the automated platform built for data collection and compilation, and the resulting performance gains across dozens of critical services.

ByteDanceCompiler OptimizationInstrumentation
0 likes · 12 min read
How ByteDance Scaled Profile‑Guided Optimization to Boost CPU Efficiency
Baidu Geek Talk
Baidu Geek Talk
May 25, 2022 · Backend Development

Large-Scale C/C++ Service Compilation Performance Optimization and Platformization (OMAX)

The article details OMAX’s end‑to‑end platform for large‑scale C/C++ service compilation, covering optimization flags, profile‑guided and link‑time techniques, Facebook BOLT post‑link tuning, and real‑world results that cut CPU use, latency and deployment time while shrinking binary size.

BOLTC++Cloud Services
0 likes · 24 min read
Large-Scale C/C++ Service Compilation Performance Optimization and Platformization (OMAX)
Tencent Cloud Developer
Tencent Cloud Developer
Sep 3, 2019 · Mobile Development

Improving iOS App Launch Speed with Binary Reordering and Page Fault Reduction

By generating an order file that places frequently executed code contiguously, the authors reduce iOS app cold‑launch page faults by about 15%, achieving roughly a 10% launch‑time improvement without code changes, using runtime hooking and static analysis to build the symbol ordering.

App LaunchBinary ReorderingHooking
0 likes · 13 min read
Improving iOS App Launch Speed with Binary Reordering and Page Fault Reduction