Difference between revisions of "intel/microarchitectures/silvermont"

	Edit Values
	Silvermont µarch
	General Info

Revision as of 20:39, 9 April 2016

ERROR: "atype" is missing!

Silvermont is Intel's 22 nm microarchitecture for the Atom family of system on chips. Introduced in 2013, Silvermont was the successor to Saltwell, targeting smartphones, tablets, embedded devices, and consumer electronics.

Codenames

Platform	Core	Target
Merrifield	Tangier	Smartphones
Bay Trail	Valleyview	Tablets
Edisonville	Avoton	Microservers
Edisonville	Rangeley	Embedded Networking

Architecture

Silvermont introduced a number of significant changes from the previous Atom microarchitecture in addition to the increase performance and lower power consumption.

Key changes from Saltwell

Pipeline is now OoOE
14 stage (2 shorter)
10 stage panelty for miss (3 shorter)
Support up to Westmere
Multi-core modular system (up to 8 cores)

Block Diagram

Core Modules

Silvermont employees a modular core design. Each module consists of 2 cores and 2 threads with exclusive hardware - resources are not shared. Within each module is a 1 MB L2 cache shared between the two cores. The L1 is still identical to Saltwell's: 32K L1I$ and 24K L1D$. Each module as a dedicated point-to-point interface (IDI) to the system agent. Each module has a per-core frequency and power management support. This is a departure from previous microarchitectures as well as similar desktop (e.g. Core) where all cores are tied to the same frequency.

System Agent

The system agent (Silvermont System Agent') acts very much like a North Bridge however it does a much better job than previous Atom microarchitectures performance-wise because it's capable of reordering all requests from all consumers (e.g. Core, GPU).

IDI

While the previous Atom architecture did away with the memory controller by integrating and other support chips on-die, it still used a Front Side Bus implementation to talk to North Bridge. In Silvermont, this was replaced with a lightweight in-die interconnect (IDI) - same one used in the Core processors. The use of IDI should have noticeable performance impact per thread.

Memory Hierarchy

Cache
- Hardware prefetchers
- L1 Cache:
  - 32 KB 8-way set associative instruction
  - 24 KB 6-way set associative data
  - Per core
- L2 Cache:
  - 512 KB 8-way set associative
  - ECC
  - Per core
- L3 Cache:
  - No level 3 cache
- Non-Cache Shared State Memory
- RAM
  - Maximum of 1GB, 2 GB, and 4 GB
  - dual 32-bit channels, 1 or 2 ranks per channel

Pipeline

While Silvermont share some similarities with Saltwell, it introduces a number of significant changes that sets it apart from part Atom microarchitectures. Like Saltwell, the pipeline is still uses a dual-issue design; however it has a pipeline that is 2 stages shorter with a branch misprediction penalty of 3 cycles lower. Silvermont is the first microarchitecture to introduce out-of-order execution (OoOE)

Silvermont pipeline decodes and issues 2 instructions and dispatches 5 operations/cycle.

Instruction Fetch

Instruction Fetch, just like in previous microarchs make up the first three stages of the pipeline. However, with the introduction of out-of-order execution, silvermont's more aggressive fetching and branch prediction mean stalled instructions do not clog the entire pipeline as it did in Saltwell.

Instruction Decode

In previous generations of microarchitectures, common software code had roughly 5% of instructions split up into micro-ops. In Silvermont this is reduced down to just 1-2%. This reduction translates directly into performance because it eliminates the 3-4 additional cycles of overhead. Silvermont has a second branch predictor that can make more accurate predictions based on previously unknown information (e.g. target address from memory or register) and override the generic predictor. Nevertheless the expense of branch misprediction penalties was also reduced by 3 stages (down to 10 cycles from 13 in Saltwell).

Branch Prediction

Silvermont has two branch predictions: one that controls the instruction fetching and a second one that can override the first during the decode stage after gather additional information. The second predictor controls the speculative instruction issuing. For the first predictor, Silvermont uses a Branch Target Buffer to determine the next fetch address which also includes a 4-entry Return Stack Buffer for calls and returns handling.

Die

8-core Avoton Die:

Cores

Tangier - SoCs for Smartphones
Valleyview - SoCs for Tablets
Avoton - SoCs for Microservers
Rangeley - SoCs for Embedded Networking

@@ Line 1: / Line 1: @@
 {{intel title|Silvermont|arch}}
+{{microarchitecture
+| name          = Silvermont
+| manufacturer  = Intel
+| introduction  = 2013
+| phase-out     = 2015
+| process       = 22 nm
+| cores         = 1
+| cores 2       = 2
+| cores 3       = 4
+| cores 4       = 8
+| pipeline      = Yes
+| type          = Superscalar
+| OoOE          = Yes
+| speculative   = Yes
+| renaming      = Yes
+| isa           = IA-32
+| isa 2         = x86-64
+| stages        = 14
+| issues        = 2
+| inst          = Yes
+| feature       = MMX
+| extension     = SSE
+| extension 2   = SSE2
+| extension 3   = SSE3
+| extension 4   = SSSE3
+| extension 5   = SSE4
+| extension 6   = SSE4.1
+| extension 7   = SSE4.2
+| extension 8   = VT-x
+| extension 9   = AES-NI
+| extension 10  = CLMUL
+| cache         = Yes
+| l1i           = 32 KB
+| l1i per       = Core
+| l1i desc      = 8-way set associative
+| l1d           = 24 KB
+| l1d per       = Core
+| l1d desc      = 6-way set associative
+| l2            = 512
+| l2 per        = 2 Cores
+| l2 desc       = 8-way set associative
+| core names       = Yes
+| core name        = Tangier
+| core name 2      = Valleyview
+| core name 3      = Avoton
+| core name 4      = Rangeley
+| succession       = Yes
+| predecessor      = Saltwell
+| predecessor link = intel/microarchitectures/saltwell
+| successor        = Airmont
+| successor link   = intel/microarchitectures/airmont
+}}
 '''Silvermont''' is [[Intel]]'s [[22 nm]] [[microarchitecture]] for the {{intel|Atom|Atom family}} of [[system on chip]]s. Introduced in 2013, Silvermont was the successor to {{intel|Saltwell}}, targeting smartphones, tablets, embedded devices, and consumer electronics.
@@ Line 25: / Line 82: @@
 * Support up to {{intel|Westmere}}
 * Multi-core modular system (up to 8 cores)
+=== Block Diagram ===
+[[File:silvermont block.png]]
 === Core Modules ===
@@ Line 31: / Line 91: @@
 ==== System Agent ====
-The system agent acts very much like a [[North Bridge]] however it does a much better job than previous {{intel|Atom}} microarchitectures performance-wise because it's capable of reordering all requests from all consumers (e.g. Core, GPU).
+The system agent (''Silvermont System Agent''') acts very much like a [[North Bridge]] however it does a much better job than previous {{intel|Atom}} microarchitectures performance-wise because it's capable of reordering all requests from all consumers (e.g. Core, GPU).
 ==== IDI ====
@@ Line 41: / Line 101: @@
 ** L1 Cache:
 *** 32 KB 8-way [[set associative]] instruction
-**** 1 read and 1 write port
 *** 24 KB 6-way set associative data
-**** 1 read and 1 write port
-*** 8 transistors (instead of 6) to reduce voltage
 *** Per core
 ** L2 Cache:
 *** 512 KB 8-way set associative
 *** ECC
-*** separate voltage rail, fixed @ 1.05V
 *** Per core
 ** L3 Cache:
 *** No level 3 cache
 ** Non-Cache Shared State Memory
-*** 256KB low-power SRAM
-*** separate voltage plane
-*** always-on block that stores architectural states while in various power saving modes
 ** RAM
 *** Maximum of 1GB, 2 GB, and 4 GB
 *** dual 32-bit channels, 1 or 2 ranks per channel
+=== Pipeline ===
+While Silvermont share some similarities with {{\\|Saltwell}}, it introduces a number of significant changes that sets it apart from part {{intel|Atom}} microarchitectures. Like {{\\|Saltwell}}, the pipeline is still uses a dual-issue design; however it has a pipeline that is 2 stages shorter with a branch misprediction penalty of 3 cycles lower. Silvermont is the first microarchitecture to [[introduce out-of-order execution]] (OoOE)
+[[File:silvermont pipeline.svg]]
+Silvermont pipeline decodes and issues 2 instructions and dispatches 5 operations/cycle.
+==== Instruction Fetch ====
+Instruction Fetch, just like in previous microarchs make up the first three stages of the pipeline. However, with the introduction of out-of-order execution, silvermont's more aggressive fetching and branch prediction mean stalled instructions do not clog the entire pipeline as it did in {{intel|Saltwell}}.
+====Instruction Decode====
+In previous generations of microarchitectures, common software code had roughly 5% of instructions split up into micro-ops. In Silvermont this is reduced down to just 1-2%. This reduction translates directly into performance because it eliminates the 3-4 additional cycles of overhead. Silvermont has a second branch predictor that can make more accurate predictions based on previously unknown information (e.g. target address from memory or register) and override the generic predictor. Nevertheless the expense of branch misprediction penalties was also reduced by 3 stages (down to 10 cycles from 13 in {{intel|Saltwell}}).
+=== Branch Prediction ===
+Silvermont has two branch predictions: one that controls the instruction fetching and a second one that can override the first during the decode stage after gather additional information. The second predictor controls the speculative instruction issuing. For the first predictor, Silvermont uses a [[Branch Target Buffer]] to determine the next fetch address which also includes a 4-entry Return Stack Buffer for calls and returns handling.
+== Die ==
+-core {{intel|Avoton}} Die:
+[[File:silvermont die (quad-core).png]]
+== Cores ==
+* {{intel|Tangier}} - SoCs for Smartphones
+* {{intel|Valleyview}} - SoCs for Tablets
+* {{intel|Avoton}} - SoCs for Microservers
+* {{intel|Rangeley}} - SoCs for Embedded Networking

codename	Silvermont +
name	Silvermont +

WikiChip

The Fuse Coverage

Social Media

Companies

Microarchitectures

Technology Nodes

Intel

AMD

ARM

Cavium

Samsung

Intel

AMD

Ampere

Apple