Tag

RoCEv2

2 views collected around this technical thread.

Architects' Tech Alliance
May 26, 2025 · Fundamentals

Understanding RDMA, InfiniBand, and RoCEv2 for High‑Performance Distributed Training

The article explains how distributed AI training performance depends on reducing inter‑card communication latency, introduces RDMA technology and its implementations (InfiniBand, RoCEv2, iWARP), compares their latency and scalability against traditional TCP/IP, and outlines the hardware components and trade‑offs of InfiniBand and RoCEv2 networks.

High Performance Computing · InfiniBand · RDMA
12 min read
Architects' Tech Alliance
Sep 12, 2024 · Artificial Intelligence

Comparison of InfiniBand and RoCEv2 Architectures for AI Compute Networks

This article examines the two dominant AI compute network architectures, InfiniBand and RoCEv2, detailing their designs, flow‑control mechanisms, performance, cost and scalability characteristics, and evaluates their respective advantages and limitations to guide network selection for AI data centers.

AI compute · InfiniBand · RDMA
9 min read
Architects' Tech Alliance
Jun 20, 2024 · Artificial Intelligence

Comparative Analysis of InfiniBand and RoCEv2 Architectures for AI Compute Networks

This article provides a detailed comparison of InfiniBand and RoCEv2 network architectures, examining their technical features, flow‑control mechanisms, performance, cost, and suitability for AI compute environments to guide designers in selecting the optimal solution.

AI compute · InfiniBand · RDMA
9 min read
Architects' Tech Alliance
Mar 31, 2021 · Operations

NVMe over RoCEv2 Network Architecture, Control Optimization Requirements, and Test Specification

This article details the NVMe‑over‑RoCEv2 network architecture, defines plug‑and‑play and fast fault‑detection mechanisms, outlines requirements for IP domain management, LLDP, and state notification, covers security considerations, and provides test scenarios and tools for validating high‑performance storage networking.

LLDP · NVMe · RoCEv2
14 min read