Cloud AI 100 - Microarchitectures - Qualcomm

WikiChip
WikiChip
- WikiChip
  
  Home
  
  Random Article
  
  Recent Changes
  
  Chip Feed
- The Fuse Coverage
  
  Recent News
  
  ISSCC
  
  IEDM
  
  VLSI
  
  Hot Chips
  
  SuperComputing
- Social Media
  
  Twitter
  
  Flipboard
Popular
- Companies
  
  Intel
  
  AMD
  
  ARM
  
  Qualcomm
- Microarchitectures
  
  Skylake (Client)
  
  Skylake (Server)
  
  Zen
  
  Coffee Lake
  
  Zen 2
- Technology Nodes
  
  14 nm
  
  10 nm
  
  7 nm
Architectures
Popular x86
- Intel
  
  Client
  
  Skylake
  
  Kaby Lake
  
  Coffee Lake
  
  Ice Lake
  
  Server
  
  Skylake
  
  Cascade Lake
  
  Cooper Lake
  
  Ice Lake
  
  Big Cores
  
  Sunny Cove
  
  Willow Cove
  
  Small Cores
  
  Goldmont
  
  Goldmont Plus
  
  Tremont
  
  Gracemont
- AMD
  
  Zen
  
  Zen+
  
  Zen 2
  
  Zen 3
Popular ARM
- ARM
  
  Server
  
  Neoverse N1
  
  Zeus
  
  Big
  
  Cortex-A75
  
  Cortex-A76
  
  Cortex-A77
  
  Little
  
  Cortex-A53
  
  Cortex-A55
- Cavium
  
  Vulcan
- Samsung
  
  Exynos M1
  
  Exynos M2
  
  Exynos M3
  
  Exynos M4
Chips
Popular Families
- Intel
  
  Core i3
  
  Core i5
  
  Core i7
  
  Core i9
  
  Xeon D
  
  Xeon E
  
  Xeon W
  
  Xeon Bronze
  
  Xeon Silver
  
  Xeon Gold
  
  Xeon Platinum
- AMD
  
  Ryzen 3
  
  Ryzen 5
  
  Ryzen 7
  
  Ryzen Threadripper
  
  EPYC
  
  EPYC Embedded
- Ampere
  
  eMAG
- Apple
  
  Ax
- Cavium
  
  ThunderX
  
  ThunderX2
- HiSilicon
  
  Kirin
- MediaTek
  
  Helio
- NXP
  
  i.MX
  
  QorIQ Layerscape
- Qualcomm
  
  Snapdragon 400
  
  Snapdragon 600
  
  Snapdragon 700
  
  Snapdragon 800
- Renesas
  
  R-Car
- Samsung
  
  Exynos

From WikiChip

< qualcomm

Revision as of 06:27, 15 September 2021 by David (talk | contribs) (→‎Memory Hierarchy)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Edit Values
Cloud AI 100 µarch
General Info
Arch Type	NPU
Designer	Qualcomm
Manufacturer	TSMC
Introduction	March, 2021
Process	7 nm
PE Configs	16
Pipeline
Type	VLIW
Decode	4-way
Cache
L2 Cache	1 MiB/core
Side Cache	8 MiB/core

Cloud AI 100 is an NPU microarchitecture designed by Qualcomm for the server and edge market. Those NPUs are sold under the Cloud AI brand.

1 Process Technology
2 Architecture
- 2.1 Key Features
3 Block Diagram
- 3.1 SoC
- 3.2 AI Core
4 Memory Hierarchy
5 Overview
6 AI Core
7 Performance claims
8 Bibliography

Process Technology

The Cloud AI 100 SoC is fabricated on TSMC's 7-nanometer process.

Architecture

Key Features

Block Diagram

SoC

AI Core

Memory Hierarchy

L1D$ / L1I$
- Private per AI Core
L2
- 1 MiB / AI Core
Vector Tightly-Coupled Memory (VTCM)
- 8 MiB / AI Core
DRAM
- 8-32 GiB
  - LPDDR4x-4266
    - 68.25 - 136.5 GB/s

Overview

AI Core

Performance claims

Performance-per-watt was published by Quall based on an Int8 3×3 convolution operation with uniformly distributed weights and input action comprising 50% zeros which Qualcomm says is typical for Deep CNN with Relu operators. To that end, Qualcomm says the AI 100 can achieve up to ~150 TOPs at ~12 W at over 12 TOPS/W in edge cases and ~363 TOPs at under 70 W at 5.24 TOPs/W in data center uses. Numbers are at the SoC level.

SoC Power	12.05 W	19.74 W	69.26 W
TOPS	149.01	196.94	363.02
TOPS/W	12.37	9.98	5.24

Bibliography

Linley Fall Processor Conference 2021
Qualcomm, IEEE Hot Chips 33 Symposium (HCS) 2021.

Retrieved from "https://en.wikichip.org/w/index.php?title=qualcomm/microarchitectures/cloud_ai_100&oldid=99318"

Categories:

Facts about "Cloud AI 100 - Microarchitectures - Qualcomm"

RDF feed

codename	Cloud AI 100 +
designer	Qualcomm +
first launched	March 2021 +
full page name	qualcomm/microarchitectures/cloud ai 100 +
instance of	microarchitecture +
manufacturer	TSMC +
name	Cloud AI 100 +
process	7 nm (0.007 μm, 7.0e-6 mm) +
processing element count	16 +

WikiChip

The Fuse Coverage

Social Media

Companies

Microarchitectures

Technology Nodes

Intel

AMD

ARM

Cavium

Samsung

Intel

AMD

Ampere

Apple

Cavium

HiSilicon

MediaTek

NXP

Qualcomm

Renesas

Samsung

Contents

Process Technology

Architecture

Key Features

Block Diagram

SoC

AI Core

Memory Hierarchy

Overview

AI Core

Performance claims

Bibliography