Tag

Hopper

1 view collected around this technical thread.

IT Services Circle
Feb 27, 2025 · Artificial Intelligence

DeepSeek Announces FlashMLA: An Efficient Multi‑head Latent Attention Decoding Kernel for Hopper GPUs

During OpenSourceWeek, DeepSeek released FlashMLA, an MLA (Multi‑head Latent Attention) decoding kernel optimized for Hopper GPUs. It draws on FlashAttention and CUTLASS to improve large‑model inference performance, with early adopters reporting up to 30% higher compute utilization and, in some scenarios, doubled speed.

Artificial Intelligence · DeepSeek · FlashMLA
0 likes · 3 min read
IT Services Circle
Mar 24, 2022 · Artificial Intelligence

NVIDIA Unveils H100 GPU with Hopper Architecture: Massive Performance Gains for AI

At its recent GTC event, NVIDIA introduced the H100 GPU, built on the Hopper architecture and TSMC's 4 nm process. It features 80 billion transistors, 16,896 CUDA cores, up to 700 W of power draw, 3 TB/s of memory bandwidth, and a dedicated Transformer Engine that speeds up large‑model training by up to six times. NVIDIA also announced the Grace CPU Superchip and new AI supercomputing systems.

AI · GPU · Grace CPU
0 likes · 11 min read