Tag

CPU/GPU efficiency

1 views collected around this technical thread.

Tencent Architect
Tencent Architect
Jul 29, 2021 · Artificial Intelligence

Performance Optimization of Advertising Coarse‑Ranking Training on the Light Framework

This article analyzes the bottlenecks of advertising coarse‑ranking training on the Light framework and presents a series of optimizations—including parallel data download, thread‑queue buffering, integer‑to‑string conversion with fmt, and zlib replacement with czlib—that together achieve up to 58% QPS improvement and notable CPU efficiency gains.

CPU/GPU efficiencyData ParallelismPerformance Optimization
0 likes · 11 min read
Performance Optimization of Advertising Coarse‑Ranking Training on the Light Framework