Tag

AngelPTM


Tencent Advertising Technology
Mar 10, 2023 · Artificial Intelligence

Optimizing Large-Scale Model Training with Tencent's AngelPTM and ZeRO-Cache

This article presents Tencent's latest advancements in large‑scale model training, detailing the AngelPTM framework and its ZeRO‑Cache optimization techniques that reduce memory and storage costs, improve hardware utilization, and achieve high‑performance training for trillion‑parameter AI models across various applications.

AI models · AngelPTM · Large-Scale Training
14 min read