Tag

GPU architecture

1 views collected around this technical thread.

Architects' Tech Alliance
Architects' Tech Alliance
Jun 9, 2025 · Artificial Intelligence

What Makes Nvidia’s Blackwell GPUs a Game-Changer for AI Performance?

In March 2024 Nvidia unveiled the Blackwell GPU family and the GB200 NVL72 architecture, featuring 3‑4 nm processes, redesigned CUDA cores, next‑gen ray‑tracing, upgraded DLSS, massive FP16/FP8 compute gains, 8 TB/s memory bandwidth, and NVLink Gen5, while also presenting complex power, cooling, and packaging challenges for large‑scale AI deployments.

AI accelerationBlackwellGPU
0 likes · 6 min read
What Makes Nvidia’s Blackwell GPUs a Game-Changer for AI Performance?
Architects' Tech Alliance
Architects' Tech Alliance
May 6, 2025 · Artificial Intelligence

Evolution of NVIDIA GPU Architectures for AI from Volta to Blackwell

The article reviews NVIDIA's GPU architecture progression—from Volta's pioneering Tensor Cores through Turing, Ampere, Hopper, and the latest Blackwell and Rubin designs—highlighting key innovations, performance gains for deep learning, and related resource updates for AI practitioners.

Artificial IntelligenceGPU architectureHigh Performance Computing
0 likes · 9 min read
Evolution of NVIDIA GPU Architectures for AI from Volta to Blackwell
Architects' Tech Alliance
Architects' Tech Alliance
Apr 2, 2024 · Artificial Intelligence

Evolution and Forecast of Nvidia NVLink, NVLink C2C, and B100/X100 GPU Architectures

The article analyses the historical evolution of Nvidia's NVLink and NVLink C2C interconnect technologies, compares them with PCIe, Ethernet and InfiniBand, and uses these trends to predict future AI‑chip architectures such as the B100 and X100 GPUs, highlighting design trade‑offs and packaging challenges.

AI chipB100GPU architecture
0 likes · 15 min read
Evolution and Forecast of Nvidia NVLink, NVLink C2C, and B100/X100 GPU Architectures
Architects' Tech Alliance
Architects' Tech Alliance
Mar 26, 2024 · Artificial Intelligence

Analysis and Forecast of Nvidia AI Chip Roadmap: From H100 to X100

The article analyzes Nvidia's AI chip evolution, assumes consistent storage‑compute‑interconnect ratios and predictable process scaling, and projects the architectures of H200, B100 and X100, highlighting the limits of chiplet packaging and the critical role of low‑latency, high‑reliability interconnect technologies for future AI compute scaling.

AI chipsChipletGPU architecture
0 likes · 12 min read
Analysis and Forecast of Nvidia AI Chip Roadmap: From H100 to X100
Architects' Tech Alliance
Architects' Tech Alliance
Aug 21, 2023 · Artificial Intelligence

AI Compute Landscape: GPU Architectures, Tensor Cores, NVLink, and Scaling Challenges

The article surveys the AI compute ecosystem, explaining why CPUs are unsuitable for AI workloads, how heterogeneous CPU‑plus‑accelerator designs dominate, and detailing the evolution of NVIDIA GPUs, Tensor Cores, memory technologies, and inter‑GPU networking that enable large‑scale model training.

AI computeAI hardwareGPU architecture
0 likes · 11 min read
AI Compute Landscape: GPU Architectures, Tensor Cores, NVLink, and Scaling Challenges
Architects' Tech Alliance
Architects' Tech Alliance
Mar 2, 2023 · Artificial Intelligence

In‑Depth Analysis of AI Servers for ChatGPT: Architecture, Costs, and Market Trends

This article provides a comprehensive technical overview of AI servers used for large‑scale models like ChatGPT, covering GPU‑centric architectures, classification by application and chip type, hardware cost breakdowns, market demand forecasts, domestic vendor strengths, and the impact of export restrictions on advanced accelerator chips.

AI ServersChatGPTCloud Computing
0 likes · 17 min read
In‑Depth Analysis of AI Servers for ChatGPT: Architecture, Costs, and Market Trends
Architects' Tech Alliance
Architects' Tech Alliance
Mar 20, 2021 · Fundamentals

Evolution of NVIDIA GPU Architectures from Fermi to Ampere

This article outlines the progression of NVIDIA GPU architectures—from the early Fermi and Kepler designs through Maxwell, Pascal, Volta, Turing, and the latest Ampere—detailing compute capabilities, SM structures, FP64/FP32 ratios, Tensor Core introductions, and their impact on AI and high‑performance computing.

AICUDAGPU architecture
0 likes · 19 min read
Evolution of NVIDIA GPU Architectures from Fermi to Ampere
TAL Education Technology
TAL Education Technology
May 14, 2020 · Artificial Intelligence

An Introduction to GPU Computing and CUDA Architecture

This article provides a concise overview of GPU computing fundamentals, covering GPU hardware components, memory hierarchy, parallel execution models, and the CUDA programming framework, illustrating how CPUs and GPUs cooperate in heterogeneous computing environments.

CUDACUDA ProgrammingGPU
0 likes · 16 min read
An Introduction to GPU Computing and CUDA Architecture
Architects' Tech Alliance
Architects' Tech Alliance
Jul 2, 2017 · Fundamentals

Differences Between NVIDIA Tesla and GeForce GPUs: Architecture, Performance, and Use Cases

This article compares NVIDIA's Tesla and GeForce GPU families, detailing their target markets, design differences, core architectures, double‑precision performance, ECC support, memory bandwidth, interface options, software and OS compatibility, power efficiency, and management features to help readers choose the right GPU for HPC or gaming workloads.

GPUGPU architectureGeForce
0 likes · 11 min read
Differences Between NVIDIA Tesla and GeForce GPUs: Architecture, Performance, and Use Cases