Understanding AI Chip Architecture: How ASIC Accelerators Differ from CPUs and GPUs
The article explains why dedicated AI chips (ASICs) are needed, compares their performance and power efficiency to traditional CPUs and GPUs, describes the architecture of Google's TPU and other AI accelerators, and provides historical context for the evolution of AI hardware.
Recently, a two‑year‑old Chinese AI‑FPGA startup was acquired by Xilinx, highlighting the rapid growth of AI chip companies. Most readers are unfamiliar with AI chip architectures, prompting this overview.
AI chips are typically ASICs designed specifically for AI algorithms such as CNNs and RNNs, which mainly perform massive matrix multiplications and additions. General‑purpose CPUs and GPUs can run these algorithms but are slower and less power‑efficient, making them unsuitable for real‑time applications like autonomous driving or mobile facial recognition.
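To make the workload concrete, here is a minimal sketch (dimensions and names are illustrative, not from the article) showing that a fully connected neural-network layer reduces to multiply-accumulate (MAC) operations, the primitive that AI ASICs are built to accelerate:

```python
# Illustrative sketch: one dense layer is just a grid of multiply-accumulates.
# Sizes here are tiny and hypothetical; real layers run millions of MACs.

def dense_layer(x, weights, bias):
    """Compute y[j] = bias[j] + sum_i x[i] * weights[i][j]; count the MACs."""
    n_in, n_out = len(weights), len(weights[0])
    y = [0.0] * n_out
    macs = 0
    for j in range(n_out):
        acc = bias[j]
        for i in range(n_in):
            acc += x[i] * weights[i][j]  # one multiply-accumulate operation
            macs += 1
        y[j] = acc
    return y, macs

x = [1.0, 2.0, 3.0]
w = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
b = [0.5, -0.5]
y, macs = dense_layer(x, w, b)
print(y, macs)  # [4.5, 4.5] 6  -- n_in * n_out = 6 MACs for this tiny layer
```

A CPU executes these MACs a few at a time through a general-purpose pipeline; an ASIC lays out thousands of dedicated MAC units so the whole inner loop runs in parallel.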
The article compares computational capabilities: a high‑end CPU (e.g., IBM POWER8) can achieve roughly 64 Gops, while Google's TPU1, with a 256×256 systolic array of 65,536 MAC units running at ~700 MHz, can reach about 90 Tops, roughly three orders of magnitude higher. Real‑world utilization is lower due to memory bandwidth limits and control overhead.
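The ~90 Tops peak figure follows directly from the array dimensions and clock rate quoted above, since each MAC unit performs one multiply and one add per cycle. A back-of-envelope check:

```python
# Back-of-envelope verification of TPU1's peak throughput from the
# figures quoted in the article (256x256 MAC array at ~700 MHz).
mac_units = 256 * 256    # systolic array: 65,536 MAC units
clock_hz = 700e6         # ~700 MHz clock
ops_per_mac = 2          # one multiply + one add per cycle
peak_ops = mac_units * clock_hz * ops_per_mac
print(f"{peak_ops / 1e12:.2f} Tops")  # 91.75 Tops, i.e. the ~90 Tops cited
```

Sustained throughput falls short of this peak because the array stalls whenever weights or activations cannot be fed fast enough from memory.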
Other AI accelerators such as Cambricon's DianNao and various FPGA‑based solutions are also mentioned, illustrating the trend toward specialized hardware for deep‑learning workloads.
Historical notes trace AI computation from early CPU‑based neural networks, through the GPU boom, to modern ASICs and NPUs, emphasizing that each platform serves different use cases.
Finally, the article includes a disclaimer and promotional material for an e‑book collection on architecture topics.