Tag

Embedding Storage

1 views collected around this technical thread.

DataFunSummit
DataFunSummit
Feb 17, 2025 · Artificial Intelligence

NorthStar Large‑Model Training Framework: Architecture, APIs, Pipeline and Multi‑GPU Strategies

The article introduces the NorthStar large‑model training framework developed by DeWu, detailing its background challenges, pipeline architecture, rich API support, multi‑GPU training modes, multi‑level embedding storage, hardware selection considerations, and a brief Q&A on data versus model parallelism.

AI FrameworkEmbedding StorageLarge Model Training
0 likes · 9 min read
NorthStar Large‑Model Training Framework: Architecture, APIs, Pipeline and Multi‑GPU Strategies
Alimama Tech
Alimama Tech
Nov 17, 2021 · Artificial Intelligence

Binary Code Based Hash Embedding for Efficient Deep Recommendation Models

Binary Code based Hash Embedding (BH) dramatically compresses deep recommendation model storage by converting feature IDs to binary codes and partitioning them into flexible blocks, yielding deterministic, collision‑free indices that achieve up to 1,000× size reduction while retaining about 99% of original accuracy, making it ideal for storage‑constrained deployments.

CTR predictionEmbedding Storagebinary code
0 likes · 13 min read
Binary Code Based Hash Embedding for Efficient Deep Recommendation Models