From WikiChip
Editing intel/microarchitectures/broadwell (client)
Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.
The edit can be undone.
Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
This page supports semantic in-text annotations (e.g. "[[Is specified as::World Heritage Site]]") to build structured and queryable content provided by Semantic MediaWiki. For a comprehensive description on how to use annotations or the #ask parser function, please have a look at the getting started, in-text annotation, or inline queries help pages.
Latest revision | Your text | ||
Line 1: | Line 1: | ||
{{intel title|Broadwell|arch}} | {{intel title|Broadwell|arch}} | ||
{{microarchitecture | {{microarchitecture | ||
− | + | | name = Broadwell | |
− | |name=Broadwell | + | | designer = Intel |
− | |designer=Intel | + | | manufacturer = Intel |
− | |manufacturer=Intel | + | | introduction = October, 2014 |
− | |introduction=October, 2014 | + | | phase-out = |
− | |process=14 nm | + | | process = 14 nm |
− | |cores=2 | + | | cores = 2 |
− | |cores 2=4 | + | | cores 2 = 4 |
− | |cores 3=6 | + | | cores 3 = 6 |
− | |cores 4=8 | + | | cores 4 = 8 |
− | |cores 5= | + | | cores 5 = 16 |
− | |cores 6= | + | | cores 6 = 32 |
− | + | ||
− | | | + | | pipeline = Yes |
− | | | + | | type = Superscalar |
− | | | + | | OoOE = Yes |
− | | | + | | speculative = Yes |
− | | | + | | renaming = Yes |
− | | | + | | isa = IA-32 |
− | | | + | | isa 2 = x86-64 |
− | |stages min=14 | + | | stages min = 14 |
− | |stages max=19 | + | | stages max = 19 |
− | | | + | | issues = 4 |
− | |extension=MOVBE | + | |
− | |extension 2=MMX | + | | inst = Yes |
− | |extension 3=SSE | + | | feature = |
− | |extension 4=SSE2 | + | | extension = MOVBE |
− | |extension 5=SSE3 | + | | extension 2 = MMX |
− | |extension 6=SSSE3 | + | | extension 3 = SSE |
− | |extension 7=SSE4.1 | + | | extension 4 = SSE2 |
− | |extension 8=SSE4.2 | + | | extension 5 = SSE3 |
− | |extension 9=POPCNT | + | | extension 6 = SSSE3 |
− | |extension 10=AVX | + | | extension 7 = SSE4.1 |
− | |extension 11=AVX2 | + | | extension 8 = SSE4.2 |
− | |extension 12=AES | + | | extension 9 = POPCNT |
− | |extension 13=PCLMUL | + | | extension 10 = AVX |
− | |extension 14=FSGSBASE | + | | extension 11 = AVX2 |
− | |extension 15=RDRND | + | | extension 12 = AES |
− | |extension 16=FMA3 | + | | extension 13 = PCLMUL |
− | |extension 17=F16C | + | | extension 14 = FSGSBASE |
− | |extension 18=BMI | + | | extension 15 = RDRND |
− | |extension 19=BMI2 | + | | extension 16 = FMA3 |
− | |extension 20=VT-x | + | | extension 17 = F16C |
− | |extension 21=VT-d | + | | extension 18 = BMI |
− | |extension 22=TXT | + | | extension 19 = BMI2 |
− | |extension 23=TSX | + | | extension 20 = VT-x |
− | |extension 24=RDSEED | + | | extension 21 = VT-d |
− | |extension 25=ADCX | + | | extension 22 = TXT |
− | |extension 26=PREFETCHW | + | | extension 23 = TSX |
− | |l1i=32 | + | | extension 24 = RDSEED |
− | |l1i per=core | + | | extension 25 = ADCX |
− | |l1i desc=8-way set associative | + | | extension 26 = PREFETCHW |
− | |l1d=32 | + | |
− | |l1d per=core | + | | cache = Yes |
− | |l1d desc=8-way set associative | + | | l1i = 32 KB |
− | |l2=256 | + | | l1i per = core |
− | |l2 per=core | + | | l1i desc = 8-way set associative |
− | |l2 desc=8-way set associative | + | | l1d = 32 KB |
− | |l3=1.5 | + | | l1d per = core |
− | |l3 per=core | + | | l1d desc = 8-way set associative |
− | |l4=128 | + | | l2 = 256 KB |
− | |l4 per=package | + | | l2 per = core |
− | |l4 desc=on Iris Pro GPUs only | + | | l2 desc = 8-way set associative |
− | |core name=Broadwell Y | + | | l3 = 1.5 MB |
− | |core name 2=Broadwell U | + | | l3 per = core |
− | |core name 3=Broadwell H | + | | l3 desc = |
− | |core name 4=Broadwell DT | + | | l4 = 128 MB |
− | |core name 5=Broadwell EP | + | | l4 per = package |
− | |core name 6=Broadwell EX | + | | l4 desc = on Iris Pro GPUs only |
− | |core name 7=Broadwell E | + | |
− | |predecessor=Haswell | + | | core names = Yes |
− | |predecessor link=intel/microarchitectures/haswell | + | | core name = Broadwell Y |
− | |successor=Skylake | + | | core name 2 = Broadwell U |
− | + | | core name 3 = Broadwell H | |
− | + | | core name 4 = Broadwell DT | |
− | |successor | + | | core name 5 = Broadwell EP |
− | + | | core name 6 = Broadwell EX | |
− | + | | core name 7 = Broadwell E | |
− | + | ||
− | + | | succession = Yes | |
− | + | | predecessor = Haswell | |
− | + | | predecessor link = intel/microarchitectures/haswell | |
− | + | | successor = Skylake | |
+ | | successor link = intel/microarchitectures/skylake | ||
}} | }} | ||
'''Broadwell''' ('''BDW''') is [[Intel]]'s [[microarchitecture]] based on the [[14 nm process]] for mobile, desktops, and servers. Introduced in early 2015, Broadwell is a [[process shrink]] of {{\\|Haswell}} which introduced several enhancements. Broadwell is named after [[wikipedia:Broadwell, Illinois|Broadwell, Illinois]]. | '''Broadwell''' ('''BDW''') is [[Intel]]'s [[microarchitecture]] based on the [[14 nm process]] for mobile, desktops, and servers. Introduced in early 2015, Broadwell is a [[process shrink]] of {{\\|Haswell}} which introduced several enhancements. Broadwell is named after [[wikipedia:Broadwell, Illinois|Broadwell, Illinois]]. | ||
Line 103: | Line 104: | ||
| Broadwell EP || BDW-EP || {{intel|Xeon E5}}, Dual-Processor platform | | Broadwell EP || BDW-EP || {{intel|Xeon E5}}, Dual-Processor platform | ||
|- | |- | ||
− | | Broadwell EX || BDW-EX || {{intel|Xeon | + | | Broadwell EX || BDW-EX || {{intel|Xeon E5}}, Multi-Processor platform, QPI |
|- | |- | ||
| Broadwell E || BDW-E || High-End Desktops (HEDT) | | Broadwell E || BDW-E || High-End Desktops (HEDT) | ||
Line 109: | Line 110: | ||
== Process Technology == | == Process Technology == | ||
− | + | Broadwell is designed to be manufactured using [[14 nm]] Tri-gate [[FinFET]] transistors. This correlates to 8 nm Fin width and a 42 nm Fin pitch (shown below). SRAM cell is at .0588 um<sup>2</sup>. | |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | Broadwell is designed to be manufactured using [[14 nm]] Tri-gate [[FinFET]] transistors. This correlates to 8 nm Fin width and a 42 nm Fin pitch (shown below). SRAM cell is at | ||
+ | Scaling: | ||
− | [[ | + | [[File:intel 14nm gate.png|215px|left]] |
− | |||
{| class="wikitable" | {| class="wikitable" | ||
|- | |- | ||
− | ! !! Haswell !! Broadwell !! | + | ! !! Haswell !! Broadwell !! |
|- | |- | ||
| || [[22 nm]] || [[14 nm]] || | | || [[22 nm]] || [[14 nm]] || | ||
|- | |- | ||
− | | Fin Pitch || 60 nm || 42 || 0.70x | + | | Fin Pitch || 60 nm || 42 nm || 0.70x |
− | |||
− | |||
− | |||
− | |||
|- | |- | ||
| Gate Pitch || 90 nm || 70 nm || 0.78x | | Gate Pitch || 90 nm || 70 nm || 0.78x | ||
|- | |- | ||
| Interconnect Pitch || 80 nm || 52 nm || 0.65x | | Interconnect Pitch || 80 nm || 52 nm || 0.65x | ||
− | |||
− | |||
|} | |} | ||
+ | {{clear}} | ||
== Architecture== | == Architecture== | ||
Line 152: | Line 133: | ||
=== Key changes from {{\\|Haswell}} === | === Key changes from {{\\|Haswell}} === | ||
− | |||
* ~5% IPC improvement | * ~5% IPC improvement | ||
* FP multiplication instructions has reduced latency (3 cycles, down from 5) | * FP multiplication instructions has reduced latency (3 cycles, down from 5) | ||
** Affects AVX, SSE, and FP instructions | ** Affects AVX, SSE, and FP instructions | ||
− | * {{x86|CLMUL}} instructions are now a single [[ | + | * {{x86|CLMUL}} instructions are now a single [[μop]], improving latency and throughput |
* The second-level TLB (STLB) | * The second-level TLB (STLB) | ||
** Table was enlarged (1,536 entries, up from 1024) | ** Table was enlarged (1,536 entries, up from 1024) | ||
Line 177: | Line 157: | ||
==== New instructions ==== | ==== New instructions ==== | ||
+ | {{main|#Added instructions|l1=See #Added_instructions for the complete list}} | ||
Broadwell introduced a number of new instructions: | Broadwell introduced a number of new instructions: | ||
* {{x86|RDSEED|<code>RDSEED</code>}} - Generates 16, 32 or 64 bit random numbers seeds ([[NIST SP 800-90B]] & [[NIST SP 800-90C]]) | * {{x86|RDSEED|<code>RDSEED</code>}} - Generates 16, 32 or 64 bit random numbers seeds ([[NIST SP 800-90B]] & [[NIST SP 800-90C]]) | ||
* {{x86|ADCX|<code>ADCX</code>}} - Arbitrary precision integer operations | * {{x86|ADCX|<code>ADCX</code>}} - Arbitrary precision integer operations | ||
* {{x86|PREFETCHW|<code>PREFETCHW</code>}} - Prefetch data into caches, hinting a write is expected in the future | * {{x86|PREFETCHW|<code>PREFETCHW</code>}} - Prefetch data into caches, hinting a write is expected in the future | ||
− | |||
=== Block Diagram === | === Block Diagram === | ||
Line 187: | Line 167: | ||
=== Memory Hierarchy === | === Memory Hierarchy === | ||
− | |||
* Cache | * Cache | ||
** L1 Cache: | ** L1 Cache: | ||
− | *** 32 | + | *** 32 KB 8-way [[set associative]] instruction, 64 B line size |
− | *** 32 | + | *** 32 KB 8-way set associative data, 64 B line size |
*** Write-back policy | *** Write-back policy | ||
*** Per core | *** Per core | ||
** L2 Cache: | ** L2 Cache: | ||
− | *** 256 | + | *** 256 KB 8-way set associative, 64 B line size |
*** Write-back policy | *** Write-back policy | ||
*** Per core | *** Per core | ||
** L3 Cache: | ** L3 Cache: | ||
− | *** 1.5 | + | *** 1.5 - 3 MB per core, 64 B line size |
− | *** | + | *** 16-20 -way set associative |
*** Write-back policy | *** Write-back policy | ||
** L4 Cache: | ** L4 Cache: | ||
− | *** 128 | + | *** 128 MB |
*** [[eDRAM]] | *** [[eDRAM]] | ||
*** shared with GPU ({{intel|Crystal Well}}) | *** shared with GPU ({{intel|Crystal Well}}) | ||
Line 211: | Line 190: | ||
* TLBs: | * TLBs: | ||
** ITLB | ** ITLB | ||
− | *** | + | *** 4KB page translations: |
**** 128 entries; 4-way set associative | **** 128 entries; 4-way set associative | ||
**** dynamic partition; divided between the two threads | **** dynamic partition; divided between the two threads | ||
− | *** | + | *** 2MB/4MB page translations: |
**** 8 entries; fully associative | **** 8 entries; fully associative | ||
**** Duplicated for each thread | **** Duplicated for each thread | ||
** DTLB | ** DTLB | ||
− | *** | + | *** 4KB page translations: |
**** 64 entries; 4-way set associative | **** 64 entries; 4-way set associative | ||
**** fixed partition; divided between the two threads | **** fixed partition; divided between the two threads | ||
− | *** | + | *** 2MB/4MB page translations: |
**** 32 entries; 4-way set associative | **** 32 entries; 4-way set associative | ||
− | *** | + | *** 1G page translations: |
**** 4 entries; 4-way set associative | **** 4 entries; 4-way set associative | ||
** STLB | ** STLB | ||
− | *** | + | *** 4KB+2M page translations: |
**** 1536 entries; 6-way set associative | **** 1536 entries; 6-way set associative | ||
**** shared | **** shared | ||
− | *** | + | *** 1GB page translations: |
**** 16 entries; 4-way set associative | **** 16 entries; 4-way set associative | ||
Line 236: | Line 215: | ||
Broadwell's pipeline is identical to Haswell. | Broadwell's pipeline is identical to Haswell. | ||
− | == | + | == Die == |
− | + | ===Dual-core Broadwell die=== | |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | === | ||
− | Broadwell | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | + | : [[File:broadwell die (dual-core).jpg|850px]] | |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
* 1,300,000,000 transistors | * 1,300,000,000 transistors | ||
− | * 82 | + | * 82 mm<sup>2</sup> |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
===Dual-core Broadwell with {{intel|Iris Pro}} die=== | ===Dual-core Broadwell with {{intel|Iris Pro}} die=== | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
: [[File:broadwell with iris pro die (dual-core).png|850px]] | : [[File:broadwell with iris pro die (dual-core).png|850px]] | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
===Deca-core Broadwell === | ===Deca-core Broadwell === | ||
− | + | * {{intel|Core i7-6950X}} | |
− | + | * Deca-core microprocessor | |
− | * | ||
− | |||
* 3,400,000,000 transistors | * 3,400,000,000 transistors | ||
− | * 246 mm<sup>2</sup> | + | * 246 mm<sup>2</sup> |
− | |||
:[[File:broadwell (deca-core) die shot.png|650px]] | :[[File:broadwell (deca-core) die shot.png|650px]] | ||
Line 379: | Line 277: | ||
created and tagged accordingly. | created and tagged accordingly. | ||
− | Missing a chip? please dump its name here: | + | Missing a chip? please dump its name here: http://en.wikichip.org/wiki/WikiChip:wanted_chips |
--> | --> | ||
− | + | <table class="wikitable sortable"> | |
− | <table class=" | + | <tr><th colspan="12" style="background:#D6D6FF;">Broadwell Chips</th></tr> |
− | <tr | + | <tr><th colspan="9">Main processor</th><th colspan="3">IGP</th></tr> |
− | <tr | + | <tr><th>Model</th><th>µarch</th><th>Platform</th><th>Core</th><th>Launched</th><th>SDP</th><th>TDP</th><th>Freq</th><th>Max Mem</th><th>Name</th><th>Freq</th><th>Max Freq</th></tr> |
− | + | {{#ask: [[Category:microprocessor models by intel]][[instance of::microprocessor]][[microarchitecture::Broadwell]] | |
− | < | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | < | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | < | ||
− | {{#ask: [[Category:microprocessor models by intel]] [[instance of::microprocessor]] [[microarchitecture::Broadwell | ||
|?full page name | |?full page name | ||
|?model number | |?model number | ||
− | |? | + | |?microarchitecture |
− | |? | + | |?platform |
− | |||
|?core name | |?core name | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
|?first launched | |?first launched | ||
− | |? | + | |?sdp |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
|?tdp | |?tdp | ||
− | |?base frequency | + | |?base frequency |
− | + | |?max memory | |
− | |||
− | |||
− | |||
− | |?max memory | ||
|?integrated gpu | |?integrated gpu | ||
|?integrated gpu base frequency | |?integrated gpu base frequency | ||
|?integrated gpu max frequency | |?integrated gpu max frequency | ||
|format=template | |format=template | ||
− | |template=proc table | + | |template=proc table 2 |
− | + | |userparam=13 | |
− | |||
− | |||
− | |userparam= | ||
|mainlabel=- | |mainlabel=- | ||
− | |||
}} | }} | ||
− | {{ | + | <tr><th colspan="12">Count: {{#ask:[[Category:microprocessor models by intel]][[instance of::microprocessor]][[microarchitecture::Broadwell]]|format=count}}</th></tr> |
</table> | </table> | ||
− |
Facts about "Broadwell - Microarchitectures - Intel"
codename | Broadwell + |
core count | 2 +, 4 +, 6 +, 8 +, 10 +, 12 +, 14 +, 16 +, 18 +, 20 + and 22 + |
designer | Intel + |
first launched | October 2014 + |
full page name | intel/microarchitectures/broadwell (client) + |
instance of | microarchitecture + |
instruction set architecture | x86-64 + |
manufacturer | Intel + |
microarchitecture type | CPU + |
name | Broadwell + |
pipeline stages (max) | 19 + |
pipeline stages (min) | 14 + |
process | 14 nm (0.014 μm, 1.4e-5 mm) + |