From WikiChip
Difference between revisions of "intel/microarchitectures/broadwell (client)"
(Created page with "{{intel title|Broadwell|arch}} '''Broadwell''' is Intel's microarchitecture based on the 14 nm process for mobile, desktops, and servers. Introduced in early 2015...") |
|||
| Line 1: | Line 1: | ||
{{intel title|Broadwell|arch}} | {{intel title|Broadwell|arch}} | ||
| + | {{microarchitecture | ||
| + | | name = Broadwell | ||
| + | | manufacturer = Intel | ||
| + | | introduction = October, 2014 | ||
| + | | phase-out = | ||
| + | | process = 14 nm | ||
| + | | cores = 2 | ||
| + | | cores 2 = 4 | ||
| + | | cores 3 = 6 | ||
| + | | cores 4 = 8 | ||
| + | | cores 5 = 16 | ||
| + | | cores 6 = 32 | ||
| + | |||
| + | | pipeline = Yes | ||
| + | | type = Superscalar | ||
| + | | OoOE = Yes | ||
| + | | speculative = Yes | ||
| + | | renaming = Yes | ||
| + | | isa = IA-32 | ||
| + | | isa 2 = x86-64 | ||
| + | | stages min = 14 | ||
| + | | stages max = 19 | ||
| + | | issues = 4 | ||
| + | |||
| + | | inst = Yes | ||
| + | | feature = | ||
| + | | extension = MOVBE | ||
| + | | extension 2 = MMX | ||
| + | | extension 3 = SSE | ||
| + | | extension 4 = SSE2 | ||
| + | | extension 5 = SSE3 | ||
| + | | extension 6 = SSSE3 | ||
| + | | extension 7 = SSE4.1 | ||
| + | | extension 8 = SSE4.2 | ||
| + | | extension 9 = POPCNT | ||
| + | | extension 10 = AVX | ||
| + | | extension 11 = AVX2 | ||
| + | | extension 12 = AES | ||
| + | | extension 13 = PCLMUL | ||
| + | | extension 14 = FSGSBASE | ||
| + | | extension 15 = RDRND | ||
| + | | extension 16 = FMA | ||
| + | | extension 17 = BMI | ||
| + | | extension 18 = BMI2 | ||
| + | | extension 19 = F16C | ||
| + | | extension 20 = RDSEED | ||
| + | | extension 21 = ADCX | ||
| + | | extension 22 = PREFETCHW | ||
| + | |||
| + | | cache = Yes | ||
| + | | l1i = 32 KB | ||
| + | | l1i per = core | ||
| + | | l1i desc = 8-way set associative | ||
| + | | l1d = 32 KB | ||
| + | | l1d per = core | ||
| + | | l1d desc = 8-way set associative | ||
| + | | l2 = 256 KB | ||
| + | | l2 per = core | ||
| + | | l2 desc = 8-way set associative | ||
| + | | l3 = 1.5 MB | ||
| + | | l3 per = core | ||
| + | | l3 desc = | ||
| + | |||
| + | | core names = Yes | ||
| + | | core name = Broadwell Y | ||
| + | | core name 2 = Broadwell U | ||
| + | | core name 2 = Broadwell H | ||
| + | | core name 2 = Broadwell DT | ||
| + | | core name 2 = Broadwell EP | ||
| + | | core name 2 = Broadwell EX | ||
| + | | core name N = Broadwell E | ||
| + | |||
| + | | succession = Yes | ||
| + | | predecessor = Haswell | ||
| + | | predecessor link = intel/microarchitectures/haswell | ||
| + | | successor = Skylake | ||
| + | | successor link = intel/microarchitectures/skylake | ||
| + | }} | ||
'''Broadwell''' is [[Intel]]'s [[microarchitecture]] based on the [[14 nm process]] for mobile, desktops, and servers. Introduced in early 2015, Broadwell is a [[process shrink]] of {{\\|Haswell}} which introduced several enhancements. | '''Broadwell''' is [[Intel]]'s [[microarchitecture]] based on the [[14 nm process]] for mobile, desktops, and servers. Introduced in early 2015, Broadwell is a [[process shrink]] of {{\\|Haswell}} which introduced several enhancements. | ||
| Line 26: | Line 104: | ||
=== Key changes from {{\\|Haswell}} === | === Key changes from {{\\|Haswell}} === | ||
| + | * ~5% IPC improvement | ||
* FP multiplication instructions has reduced latency (3 cycles, down from 5) | * FP multiplication instructions has reduced latency (3 cycles, down from 5) | ||
** Affects AVX, SSE, and FP instructions | ** Affects AVX, SSE, and FP instructions | ||
| Line 36: | Line 115: | ||
* Address prediction for branches and returns was improved | * Address prediction for branches and returns was improved | ||
* Improved cryptography acceleration instructions | * Improved cryptography acceleration instructions | ||
| + | |||
| + | Core features maintained a 2:1 ratio of performance:power. | ||
| + | |||
| + | ==== Graphics ==== | ||
| + | * 50% higher sampler throughput | ||
| + | * Improvements for increased geometry, Z, Pixel Fill | ||
| + | * Direct X 11.2, OpenGL 4.3 | ||
| + | * OpenCL 1.2 and 2.0 (with Shared Virtual Memory) | ||
| + | * Up to 24 EUs (20% addition, up from 20 in {{\\|Haswell}}), 48 EUs on {{intel|Iris Pro Graphics}} | ||
Revision as of 20:45, 12 April 2016
| Edit Values | |
| Broadwell µarch | |
| General Info |
Broadwell is Intel's microarchitecture based on the 14 nm process for mobile, desktops, and servers. Introduced in early 2015, Broadwell is a process shrink of Haswell which introduced several enhancements.
Codenames
| Core | Target |
|---|---|
| Broadwell Y | Core M family, SoC for Smartphones, 2-in-1s Tablets, and notebooks |
| Broadwell U | Core ultrabooks |
| Broadwell H | IoT (QM87, HM86/HM87 Chipsets), All-in-ones |
| Broadwell DT | Unlocked desktop MPUs |
| Broadwell EP | Xeon E5, Dual-Processor platform |
| Broadwell EX | Xeon E5, Multi-Processor platform, QPI |
| Broadwell E | High-End Desktops (HEDT) |
Architecture
Broadwell is for the most part identical to Haswell with server enhancements.
Key changes from Haswell
- ~5% IPC improvement
- FP multiplication instructions has reduced latency (3 cycles, down from 5)
- Affects AVX, SSE, and FP instructions
- CLMUL instructions are now a single μop, improving latency and throughput
- The second-level TLB (STLB)
- Table was enlarged (1,536 entries, up from 1024)
- 1GB page mode (16 entries, 4-ways set associative)
- Larger out-of-order scheduler
- Faster store-to-load forwarding
- Address prediction for branches and returns was improved
- Improved cryptography acceleration instructions
Core features maintained a 2:1 ratio of performance:power.
Graphics
- 50% higher sampler throughput
- Improvements for increased geometry, Z, Pixel Fill
- Direct X 11.2, OpenGL 4.3
- OpenCL 1.2 and 2.0 (with Shared Virtual Memory)
- Up to 24 EUs (20% addition, up from 20 in Haswell), 48 EUs on Iris Pro Graphics