Tag

heterogeneous computing

1 views collected around this technical thread.

Architects' Tech Alliance
Architects' Tech Alliance
May 26, 2025 · Artificial Intelligence

NVLink Fusion: NVIDIA’s High‑Bandwidth Interconnect for Heterogeneous AI Computing

NVLink Fusion, unveiled at Computex 2025, extends NVIDIA’s NVLink technology to enable high‑bandwidth, low‑latency connections between CPUs and GPUs or third‑party accelerators, offering up to 900 GB/s bandwidth, flexible heterogeneous configurations, ecosystem expansion, performance gains for AI training and inference, and potential cost reductions.

AIGPUHigh‑Bandwidth Interconnect
0 likes · 12 min read
NVLink Fusion: NVIDIA’s High‑Bandwidth Interconnect for Heterogeneous AI Computing
Architects' Tech Alliance
Architects' Tech Alliance
Jul 25, 2024 · Artificial Intelligence

NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market

The article reviews NVIDIA's newly released H20 AI accelerator for China, compares its performance and pricing with domestic chips, outlines the expanding Chinese AI chip ecosystem—including Huawei, Cambricon, HaiGuang, Alibaba, ByteDance, and Baidu—while highlighting market size growth, multi‑chip heterogeneity strategies, and the strong demand forecast through 2024.

AI chipsAI computeGPU
0 likes · 8 min read
NVIDIA H20 AI Chip Launch and the Rapid Growth of China's AI Chip Market
Architects' Tech Alliance
Architects' Tech Alliance
May 9, 2024 · Artificial Intelligence

AI Servers: Market Opportunities, Architecture, and Future Demand Driven by Generative AI

The article examines how the surge of generative AI (AIGC) is fueling rapid growth in AI server demand, detailing the emerging AIGC ecosystem, server hardware composition, model scaling, heterogeneous computing, training vs. inference workloads, market size forecasts, and the competitive landscape of AI server manufacturers.

AI ServersAI infrastructureGPU
0 likes · 15 min read
AI Servers: Market Opportunities, Architecture, and Future Demand Driven by Generative AI
DataFunSummit
DataFunSummit
Apr 5, 2024 · Big Data

HuoLala Big Data Infrastructure: Challenges, Practices, and Future Outlook

Senior big data engineer Zhu Yaogai from HuoLala shares the team’s three‑year journey, detailing background challenges, the construction of a multi‑layer big‑data infrastructure, solutions for cost efficiency, operational automation, heterogeneous computing, and future plans, illustrating how high cost‑effectiveness, operational efficiency, and analytical performance drive their evolution.

Cloud NativeCost EfficiencyData Infrastructure
0 likes · 11 min read
HuoLala Big Data Infrastructure: Challenges, Practices, and Future Outlook
JD Retail Technology
JD Retail Technology
Feb 1, 2024 · Artificial Intelligence

Evolution and Optimization of JD Retail Advertising Online Model System: From Deep Learning to Distributed Graph Computing and Power Collaboration

The article details JD Retail Advertising's three‑stage evolution of its online model system—deep‑learning era, large‑model era, and power‑collaboration era—highlighting heterogeneous computing optimizations, platform and system capabilities, distributed graph computing, online learning, and dynamic power allocation to dramatically improve algorithm iteration speed and model performance.

AILarge Modelsadvertising
0 likes · 13 min read
Evolution and Optimization of JD Retail Advertising Online Model System: From Deep Learning to Distributed Graph Computing and Power Collaboration
Architects' Tech Alliance
Architects' Tech Alliance
Dec 22, 2023 · Artificial Intelligence

AI Server Architecture, Market Trends, and Competitive Landscape in 2023

An in‑depth overview of AI server components, market growth, AIGC‑driven demand, heterogeneous computing architectures, major vendors, and future trends, highlighting hardware composition, cost breakdown, competitive rankings, and the impact of GPU, CPU, and emerging AI accelerators on the industry.

AI ServersAI hardwareGPU
0 likes · 14 min read
AI Server Architecture, Market Trends, and Competitive Landscape in 2023
Architects' Tech Alliance
Architects' Tech Alliance
Nov 15, 2023 · Fundamentals

FPGA: A Versatile Chip Igniting New Momentum and the Future of Domestic Substitution (2023)

The article analyzes the rapid growth of FPGA technology, its flexible architecture and low‑cost development, the expanding role of FPGA in data‑center acceleration, the strategic moves of AMD, Intel and Nvidia in heterogeneous computing, and forecasts a strong market expansion worldwide through 2025.

AI accelerationFPGAdata center
0 likes · 10 min read
FPGA: A Versatile Chip Igniting New Momentum and the Future of Domestic Substitution (2023)
Architects' Tech Alliance
Architects' Tech Alliance
Sep 4, 2023 · Artificial Intelligence

Overview of AI Chip Types, Architectures, and Market Trends

The article explains the various AI‑capable chips such as CPUs, GPUs, FPGAs, NPUs, and TPUs, compares their performance and efficiency, describes heterogeneous CPU+xPU solutions, and provides market share data while highlighting the growing adoption of specialized AI accelerators.

AI accelerationAI chipsGPU
0 likes · 7 min read
Overview of AI Chip Types, Architectures, and Market Trends
Architects' Tech Alliance
Architects' Tech Alliance
Jul 29, 2023 · Artificial Intelligence

AI Server Market Overview and Technical Architecture

The article provides a comprehensive analysis of the AI server market, detailing server hardware components, cost distribution, logical architecture, firmware, rapid market growth, competitive landscape, AI-driven heterogeneous computing, and future industry trends, while highlighting key vendors and deployment configurations.

AI ServersGPUHardware Architecture
0 likes · 10 min read
AI Server Market Overview and Technical Architecture
Tencent Cloud Developer
Tencent Cloud Developer
Jul 6, 2023 · Cloud Computing

Hybrid vCPU: Tencent Cloud's Exploration of Virtualizing Heterogeneous CPU Architecture

Tencent Cloud’s Hybrid vCPU research, presented at KVM Forum 2023, outlines a three‑stage roadmap from homogeneous cores to mixed x86, ARM, and RISC‑V CPUs, detailing how virtualizing heterogeneous topologies, frequencies, caches, and PMU features can boost VM performance, security, live‑migration flexibility, and data‑center utilization.

Cloud ComputingHybrid CPUKVM
0 likes · 25 min read
Hybrid vCPU: Tencent Cloud's Exploration of Virtualizing Heterogeneous CPU Architecture
DataFunTalk
DataFunTalk
May 2, 2023 · Artificial Intelligence

Automatic Parallelism in PaddlePaddle: Architecture, Implementation, and Application Practice

This article presents a comprehensive overview of PaddlePaddle's automatic parallel design for heterogeneous scenarios, covering background motivation, architectural principles, key implementation details, practical usage interfaces, and future outlook, while illustrating concepts with detailed diagrams and examples.

AI frameworksPaddlePaddleautomatic parallelism
0 likes · 19 min read
Automatic Parallelism in PaddlePaddle: Architecture, Implementation, and Application Practice
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 22, 2023 · Artificial Intelligence

CUTLASS Extreme Performance Optimization and Its Application in Alibaba's Recommendation System

At the GTC conference, the talk presents Alibaba Cloud’s heterogeneous computing platform and introduces the Open Deep Learning API (ODLA), then details how CUTLASS‑based operator fusion dramatically accelerates attention and MLP layers in large‑scale recommendation models, achieving multi‑fold performance gains in production.

CUTLASSDeep LearningGPU computing
0 likes · 5 min read
CUTLASS Extreme Performance Optimization and Its Application in Alibaba's Recommendation System
Architects' Tech Alliance
Architects' Tech Alliance
Nov 27, 2022 · Fundamentals

Trends and Future Directions of Server CPUs in the Post‑Moore Era

The article analyzes post‑Moore challenges for server CPUs, discusses the shift from general‑purpose to specialized processors, highlights architectural innovations, chiplet integration, edge‑computing demands, and the evolving strategies of major vendors to improve performance, power efficiency, and scalability.

AIChipletPost-Moore
0 likes · 17 min read
Trends and Future Directions of Server CPUs in the Post‑Moore Era
Tencent Cloud Developer
Tencent Cloud Developer
Sep 30, 2022 · Cloud Computing

Understanding GPU Computing and Cloud-Based GPU Solutions

The article explains how massive parallel pixel calculations demand GPUs, whose high cost and inflexibility are solved by Tencent Cloud’s elastic, virtualized GPU services—including vGPU, qGPU, TACO abstraction, and spot instances—delivering up to 16 EFLOPS for AI, scientific, graphics, and video workloads.

Cloud GPUGPU computingParallel Computing
0 likes · 5 min read
Understanding GPU Computing and Cloud-Based GPU Solutions
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jul 30, 2022 · Cloud Computing

Highlights from the First China Computing Conference: Cloud Computing as the Foundation of the Digital Economy

The inaugural China Computing Conference in Jinan featured keynote speeches by Alibaba Cloud leaders emphasizing cloud computing as the backbone of the digital economy, showcased breakthrough immersion liquid‑cooling technology, the Zhenduan heterogeneous computing platform with record‑breaking AI benchmark results, and announced a series of innovative cloud‑native solutions and awards.

AI benchmarksAlibaba CloudCloud Computing
0 likes · 6 min read
Highlights from the First China Computing Conference: Cloud Computing as the Foundation of the Digital Economy
Architects' Tech Alliance
Architects' Tech Alliance
Jul 21, 2022 · Artificial Intelligence

The Evolution of CPU and Heterogeneous Computing Architecture in the AI Era

This article surveys the rapid growth of data‑center capacity, the rise of AI and big‑data workloads, and how emerging accelerators such as GPUs, DPUs, SmartNICs and heterogeneous CPU designs from Intel, AMD, Arm and Apple are reshaping server hardware and driving a new wave of performance and efficiency competition.

AIGPUcpu
0 likes · 12 min read
The Evolution of CPU and Heterogeneous Computing Architecture in the AI Era
Baidu Geek Talk
Baidu Geek Talk
Jul 18, 2022 · Artificial Intelligence

GPU Container Virtualization for AI Heterogeneous Computing: Architecture and Best Practices

The article surveys GPU container virtualization for AI heterogeneous computing, detailing utilization challenges, historical architectures, various virtualization methods, Baidu's dual-engine user- and kernel-space design with isolation and scheduling features, performance benefits, best‑practice scenarios, and deployment guidance, concluding with a technical Q&A.

AI computingCloud NativeContainerization
0 likes · 30 min read
GPU Container Virtualization for AI Heterogeneous Computing: Architecture and Best Practices
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jul 5, 2022 · Fundamentals

High‑Performance Chiplet and Interconnect Architectures: Insights from the HiPChips Workshop at ISCA 2022

The HiPChips workshop at ISCA 2022 gathered leading academia and industry experts to discuss the motivations, recent research breakthroughs, technical challenges, and ecosystem efforts surrounding high‑performance chiplet and interconnect architectures for future computing systems.

ChipletComputer ArchitectureHigh Performance Computing
0 likes · 10 min read
High‑Performance Chiplet and Interconnect Architectures: Insights from the HiPChips Workshop at ISCA 2022
Architects' Tech Alliance
Architects' Tech Alliance
Jul 5, 2022 · Fundamentals

Understanding High‑Performance Computing (HPC): Principles, Architecture, and Performance Metrics

This article explains the fundamentals of high‑performance computing, covering serial and parallel processing, heterogeneous CPU‑GPU architectures, FLOPS measurement levels, key terminology, and why HPC is essential for scientific and engineering simulations, while also noting market reports and resource links.

FLOPSHPCHigh Performance Computing
0 likes · 6 min read
Understanding High‑Performance Computing (HPC): Principles, Architecture, and Performance Metrics
Tencent Cloud Developer
Tencent Cloud Developer
Jun 29, 2022 · Fundamentals

C++ Asynchronous Programming: Understanding libunifex and Sender/Receiver Model

This article thoroughly explains libunifex’s sender/receiver model for C++ asynchronous programming, covering its design goals, module structure, pipeline composition, key functions like schedule, then, sync_wait, and the connect/start mechanisms, while demonstrating practical examples and integration with C++20 coroutines and cancellation support.

Asynchronous ProgrammingC++Cancellation
0 likes · 16 min read
C++ Asynchronous Programming: Understanding libunifex and Sender/Receiver Model