AVX512 VNNI - x86

From WikiChip

Revision as of 23:56, 6 November 2018 by David

AVX-512 Vector Neural Network Instructions (AVX512 VNNI) is an x86 extension, part of AVX-512, designed to accelerate convolutional neural network-based algorithms.

Overview

The AVX512 VNNI x86 extension extends AVX-512 Foundation by introducing four new integer multiply-accumulate instructions that accelerate the inner loops of convolutional neural networks.

VPDPBUSD - Multiplies the individual unsigned bytes (8-bit) of the first source operand by the corresponding signed bytes (8-bit) of the second source operand, producing intermediate signed word (16-bit) results. Each group of four word results is summed and accumulated into the corresponding doubleword (32-bit) of the destination operand.
VPDPBUSDS - Same as VPDPBUSD, except that if the intermediate sum overflows a signed 32-bit value, the result saturates to 0x7FFF_FFFF for positive overflow or 0x8000_0000 for negative overflow.
VPDPWSSD - Multiplies the individual signed words (16-bit) of the first source operand by the corresponding signed words (16-bit) of the second source operand, producing intermediate signed doubleword (32-bit) results. Each pair of adjacent doubleword results is summed and accumulated into the corresponding doubleword (32-bit) of the destination operand.
VPDPWSSDS - Same as VPDPWSSD, except that if the intermediate sum overflows a signed 32-bit value, the result saturates to 0x7FFF_FFFF for positive overflow or 0x8000_0000 for negative overflow.
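
The per-lane arithmetic of the four instructions can be sketched with a small scalar model. This is only an illustration, not how the hardware or any library implements them: the function names (vpdpbusd_lane, vpdpwssd_lane) and the saturate flag are hypothetical, and the model assumes the operand signedness given in Intel's instruction descriptions (unsigned bytes times signed bytes for the byte forms, signed words times signed words for the word forms). A real 512-bit operation would apply the same computation independently to each of its sixteen 32-bit lanes.

```python
INT32_MAX = 0x7FFF_FFFF
INT32_MIN = -0x8000_0000


def _wrap_int32(value):
    """Wrap an integer to 32-bit two's complement (non-saturating forms)."""
    value &= 0xFFFF_FFFF
    return value - (1 << 32) if value & 0x8000_0000 else value


def _saturate_int32(value):
    """Clamp an integer to the signed 32-bit range (saturating forms)."""
    return max(INT32_MIN, min(INT32_MAX, value))


def vpdpbusd_lane(acc, a_bytes, b_bytes, saturate=False):
    """Model one 32-bit lane of VPDPBUSD (saturate=True models VPDPBUSDS).

    a_bytes: four unsigned 8-bit values from the first source operand.
    b_bytes: four signed 8-bit values from the second source operand.
    """
    assert len(a_bytes) == 4 and len(b_bytes) == 4
    assert all(0 <= a <= 255 for a in a_bytes)
    assert all(-128 <= b <= 127 for b in b_bytes)
    # Each product fits in a signed 16-bit word; the four are summed with acc.
    total = acc + sum(a * b for a, b in zip(a_bytes, b_bytes))
    return _saturate_int32(total) if saturate else _wrap_int32(total)


def vpdpwssd_lane(acc, a_words, b_words, saturate=False):
    """Model one 32-bit lane of VPDPWSSD (saturate=True models VPDPWSSDS).

    a_words, b_words: two signed 16-bit values from each source operand.
    """
    assert len(a_words) == 2 and len(b_words) == 2
    assert all(-32768 <= w <= 32767 for w in list(a_words) + list(b_words))
    # Each product is a signed 32-bit doubleword; the pair is summed with acc.
    total = acc + sum(a * b for a, b in zip(a_words, b_words))
    return _saturate_int32(total) if saturate else _wrap_int32(total)


if __name__ == "__main__":
    # 10 + (1*5) + (2*-6) + (3*7) + (4*-8) = -8
    print(vpdpbusd_lane(10, [1, 2, 3, 4], [5, -6, 7, -8]))
    # Overflow: the plain form wraps around, the saturating form clamps.
    print(vpdpwssd_lane(0, [-32768, -32768], [-32768, -32768]))
    print(vpdpwssd_lane(0, [-32768, -32768], [-32768, -32768], saturate=True))
```

In this model the only difference between each instruction and its S-suffixed variant is the final step: the sum is computed exactly, then either wrapped to 32 bits or clamped to 0x7FFF_FFFF/0x8000_0000.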

Retrieved from "https://en.wikichip.org/w/index.php?title=x86/avx512_vnni&oldid=83915"
