From WikiChip
Editing intel/microarchitectures/kaby lake

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.

This page supports semantic in-text annotations (e.g. "[[Is specified as::World Heritage Site]]") to build structured and queryable content provided by Semantic MediaWiki. For a comprehensive description on how to use annotations or the #ask parser function, please have a look at the getting started, in-text annotation, or inline queries help pages.

Latest revision Your text
Line 76: Line 76:
 
}}
 
}}
 
[[File:7th Gen Core-i7-badge.png|thumb|right|175px|Kaby Lake is Intel's 7th Generation {{intel|Core i7}} MPUs.]]
 
[[File:7th Gen Core-i7-badge.png|thumb|right|175px|Kaby Lake is Intel's 7th Generation {{intel|Core i7}} MPUs.]]
'''Kaby Lake''' ('''KBL''') is [[Intel]]'s successor to {{\\|Skylake}}, an enhanced [[14 nm process]] [[microarchitecture]] for mainstream desktops and mobile devices. Kaby Lake is the first "Optimization" released as part of Intel's {{intel|PAO}} model. The microarchitecture was developed by Intel's R&D center in [[wikipedia:Haifa, Israel|Haifa, Israel]]. {{\\|Cannon Lake}} was originally set to replace {{\\|Skylake}} as the next microarchitecture using a [[10 nm process]], however Intel later revised their roadmap to include Kaby Lake (with Cannon Lake being pushed back to [[2018]]).  
+
'''Kaby Lake''' ('''KBL''') is [[Intel]]'s successor to {{\\|Skylake}}, an enhanced [[14 nm process]] [[microarchitecture]] for mainstream desktops and mobile devices. Kaby Lake is the first "Optimization" released as part of Intel's {{intel|PAO}} model. The microarchitecture was developed by Intel's R&D center in [[wikipedia:Haifa, Israel|Haifa, Israel]]. {{\\|Cannon Lake}} was originally set to replace {{\\|Skylake}} as the next microarchitecture using a [[10 nm process]], however Intel later revised their roadmap to include Kaby Lake (with Cannon Lake being pushed back to [[2017]]).  
  
 
For desktop and mobile, Kaby Lake is branded as 7th Generation Intel {{intel|Core i3}}, {{intel|Core i5}}. and {{intel|Core i7}} processors. For workstation class processors, Intel branded it as {{intel|Xeon E3|Xeon E3 v6}}. There are no Kaby Lake-based server microprocessors.
 
For desktop and mobile, Kaby Lake is branded as 7th Generation Intel {{intel|Core i3}}, {{intel|Core i5}}. and {{intel|Core i7}} processors. For workstation class processors, Intel branded it as {{intel|Xeon E3|Xeon E3 v6}}. There are no Kaby Lake-based server microprocessors.
Line 83: Line 83:
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
! Core !! Abbrev !! Platform || Description !! Graphics !! Target
+
! Core !! Abbrev !! Description !! Graphics !! Target
 
|-
 
|-
| {{intel|Kaby Lake Y|l=core}} || KBL-Y || || Extremely low power || GT2 || 2-in-1s detachable, tablets, and computer sticks
+
| {{intel|Kaby Lake Y|l=core}} || KBL-Y || Extremely low power || GT2 || 2-in-1s detachable, tablets, and computer sticks
 
|-
 
|-
| {{intel|Kaby Lake U|l=core}} || KBL-U || || Ultra-low Power || GT2/GT3 || Light notebooks, portable All-in-Ones (AiOs), Minis, and conference room
+
| {{intel|Kaby Lake U|l=core}} || KBL-U || Ultra-low Power || GT2/GT3 || Light notebooks, portable All-in-Ones (AiOs), Minis, and conference room
 
|-
 
|-
| {{intel|Kaby Lake R|l=core}} || KBL-R || || Ultra-low Power || GT2 || Kaby Lake U Refresh
+
| {{intel|Kaby Lake R|l=core}} || KBL-R || Ultra-low Power || GT2 || Kaby Lake U Refresh
 
|-
 
|-
| {{intel|Kaby Lake H|l=core}} || KBL-H || || High-performance Graphics || GT2/GT3 || Ultimate mobile performance, mobile workstations
+
| {{intel|Kaby Lake H|l=core}} || KBL-H || High-performance Graphics || GT2/GT3 || Ultimate mobile performance, mobile workstations
 
|-
 
|-
| {{intel|Kaby Lake S|l=core}} || KBL-S || || Performance-optimized lifestyle || GT2/GT3 || Desktop performance to value, AiOs, and minis
+
| {{intel|Kaby Lake S|l=core}} || KBL-S || Performance-optimized lifestyle || GT2/GT3 || Desktop performance to value, AiOs, and minis
 
|-
 
|-
| {{intel|Kaby Lake G|l=core}} || KBL-G || {{intel|Pedlow|l=platform}} || Gaming Chip || GT2 + AMD {{amd|Vega|l=arch}} || Kaby Lake + Radeon Vega 20/24
+
| {{intel|Kaby Lake G|l=core}} || KBL-G || Gaming Chip || GT2 + AMD {{amd|Vega|l=arch}} || Kaby Lake + Radeon Vega 20/24
 
|-
 
|-
| {{intel|Kaby Lake X|l=core}} || KBL-X || {{intel|Basin Falls|l=platform}} || Extreme Performance || || High-end desktops & enthusiasts market
+
| {{intel|Kaby Lake X|l=core}} || KBL-X || Extreme Performance || || High-end desktops & enthusiasts market
 
|-
 
|-
| {{intel|Kaby Lake DT|l=core}} || KBL-DT || {{intel|Greenlow|l=platform}} || Workstation || GT2 || Workstations & entry-level servers
+
| {{intel|Kaby Lake DT|l=core}} || KBL-DT || Workstation || GT2 || Workstations & entry-level servers
 
|}
 
|}
  
Line 199: Line 199:
  
 
=== CPUID ===
 
=== CPUID ===
{| class="wikitable tc1 tc2 tc3 tc4 tc5"
+
{| class="wikitable tc1 tc2 tc3 tc4"
! Core !! Extended<br>Family !! Family !! Extended<br>Model !! Model !! Stepping
+
! Core !! Extended<br>Family !! Family !! Extended<br>Model !! Model
 
|-
 
|-
| rowspan="2" | {{intel|Kaby Lake Y|Y|l=core}}/{{intel|Kaby Lake U|U|l=core}}/{{intel|Kaby Lake R|R|l=core}} || 0 || 0x6 || 0x8 || 0xE || 0x9
+
| rowspan="2" | {{intel|Kaby Lake Y|Y|l=core}}/{{intel|Kaby Lake U|U|l=core}}/{{intel|Kaby Lake R|R|l=core}} || 0 || 0x6 || 0x8 || 0xE
 
|-
 
|-
| colspan="5" | Family 6 Model 142 Stepping 9
+
| colspan="4" | Family 6 Model 142
 
|-
 
|-
| rowspan="2" | {{intel|Kaby Lake DT|DT|l=core}}/{{intel|Kaby Lake H|H|l=core}}/{{intel|Kaby Lake S|S|l=core}}/{{intel|Kaby Lake X|X|l=core}} || 0 || 0x6 || 0x9 || 0xE || 0x9
+
| rowspan="2" | {{intel|Kaby Lake DT|DT|l=core}}/{{intel|Kaby Lake H|H|l=core}}/{{intel|Kaby Lake S|S|l=core}}/{{intel|Kaby Lake X|X|l=core}} || 0 || 0x6 || 0x9 || 0xE
 
|-
 
|-
| colspan="5" | Family 6 Model 158 Stepping 9
+
| colspan="4" | Family 6 Model 158
 
|}
 
|}
  
Line 245: Line 245:
  
 
* Families
 
* Families
** {{intel|Core i3}} processors dropped support for ECC memory in some models
+
** {{intel|Core i3}} processors dropped support for ECC memory (except for Embedded models)
 
** {{intel|Pentium (2009)|Pentium}} desktop processors now have {{intel|Hyper-Threading}} support (note that Mobile Pentium already had that feature.)
 
** {{intel|Pentium (2009)|Pentium}} desktop processors now have {{intel|Hyper-Threading}} support (note that Mobile Pentium already had that feature.)
 
** {{intel|Pentium (2009)|Pentium}} desktop & mobile processors now have Memory Protection ({{intel|MPX}}) and {{intel|OS Guard}} support
 
** {{intel|Pentium (2009)|Pentium}} desktop & mobile processors now have Memory Protection ({{intel|MPX}}) and {{intel|OS Guard}} support
Line 332: Line 332:
 
**** fixed partition
 
**** fixed partition
 
*** 1G page translations:
 
*** 1G page translations:
**** 4 entries; 4-way set associative
+
**** 4 entries; fully associative
 
**** fixed partition
 
**** fixed partition
 
** STLB
 
** STLB
Line 347: Line 347:
  
 
== Core ==
 
== Core ==
{{main|intel/microarchitectures/skylake#Core|l1=Skylake § Core}}
 
Kaby Lake's core is identical to {{\\|Skylake#Core|Skylake's}}.
 
 
 
=== Pipeline ===
 
=== Pipeline ===
 
{{main|intel/microarchitectures/skylake#Pipeline|l1=Skylake § Pipeline}}
 
{{main|intel/microarchitectures/skylake#Pipeline|l1=Skylake § Pipeline}}
 
Kaby Lake's pipeline is identical to {{\\|Skylake#Pipeline|Skylake's}}.
 
Kaby Lake's pipeline is identical to {{\\|Skylake#Pipeline|Skylake's}}.
 +
 +
==== Scheduler Ports & Execution Units ====
 +
<table class="wikitable">
 +
<tr><th colspan="2">Scheduler Ports Designation</th></tr>
 +
<tr><th rowspan="5">Port 0</th><td>Integer/Vector Arithmetic, Multiplication, Logic, Shift, and String ops</td></tr>
 +
<tr><td>[[FP]] Add, [[Multiply]], [[FMA]]</td></tr>
 +
<tr><td>Integer/FP Division and [[Square Root]]</td></tr>
 +
<tr><td>[[AES]] Encryption</td></tr>
 +
<tr><td>Branch2</td></tr>
 +
<tr><th rowspan="2">Port 1</th><td>Integer/Vector Arithmetic, Multiplication, Logic, Shift, and Bit Scanning</td></tr>
 +
<tr><td>[[FP]] Add, [[Multiply]], [[FMA]]</td></tr>
 +
<tr><th rowspan="3">Port 5</th><td>Integer/Vector Arithmetic, Logic</td></tr>
 +
<tr><td>Vector Permute</td></tr>
 +
<tr><td>[[x87]] FP Add, Composite Int, CLMUL</td></tr>
 +
<tr><th rowspan="2">Port 6</th><td>Integer Arithmetic, Logic, Shift</td></tr>
 +
<tr><td>Branch</td></tr>
 +
<tr><th>Port 2</th><td>Load, AGU</td></tr>
 +
<tr><th>Port 3</th><td>Load, AGU</td></tr>
 +
<tr><th>Port 4</th><td>Store, AGU</td></tr>
 +
<tr><th>Port 7</th><td>AGU</td></tr>
 +
</table>
 +
 +
{| class="wikitable collapsible collapsed"
 +
|-
 +
! colspan="3" | Execution Units
 +
|-
 +
! Execution Unit !! # of Units !! Instructions
 +
|-
 +
| ALU || 4 || add, and, cmp, or, test, xor, movzx, movsx, mov, (v)movdqu, (v)movdqa, (v)movap*, (v)movup*
 +
|-
 +
| DIV || 1 || divp*, divs*, vdiv*, sqrt*, vsqrt*, rcp*, vrcp*, rsqrt*, idiv
 +
|-
 +
| Shift || 2 || sal, shl, rol, adc, sarx, adcx, adox, etc...
 +
|-
 +
| Shuffle || 1 || (v)shufp*, vperm*, (v)pack*, (v)unpck*, (v)punpck*, (v)pshuf*, (v)pslldq, (v)alignr, (v)pmovzx*, vbroadcast*, (v)pslldq, (v)psrldq, (v)pblendw
 +
|-
 +
| Slow Int || 1 || mul, imul, bsr, rcl, shld, mulx, pdep, etc...
 +
|-
 +
| Bit Manipulation || 2 || andn, bextr, blsi, blsmsk, bzhi, etc
 +
|-
 +
| FP Mov || 1 || (v)movsd/ss, (v)movd gpr
 +
|-
 +
| SIMD Misc || 1 || STTNI, (v)pclmulqdq, (v)psadw, vector shift count in xmm
 +
|-
 +
| Vec ALU || 3 || (v)pand, (v)por, (v)pxor, (v)movq, (v)movq, (v)movap*, (v)movup*, (v)andp*, (v)orp*, (v)paddb/w/d/q, (v)blendv*, (v)blendp*, (v)pblendd
 +
|-
 +
| Vec Shift || 2 || (v)psllv*, (v)psrlv*, vector shift count in imm8
 +
|-
 +
| Vec Add || 2 || (v)addp*, (v)cmpp*, (v)max*, (v)min*, (v)padds*, (v)paddus*, (v)psign, (v)pabs, (v)pavgb, (v)pcmpeq*, (v)pmax, (v)cvtps2dq, (v)cvtdq2ps, (v)cvtsd2si, (v)cvtss2si
 +
|-
 +
| Vec Mul || 2 || (v)mul*, (v)pmul*, (v)pmadd*
 +
|-
 +
|colspan="3" | This table was taken verbatim from the Intel manual. Execution unit mapping to {{x86|MMX|MMX instructions}} are not included.
 +
|}
  
 
== Configurability ==
 
== Configurability ==
  
Kaby Lake builds upon the Skylake architecture, most dies are slight enhancements of their Skylake counterparts. The biggest change is the removal of the high performance quad core GT4 die, which has been replaced by the Kaby Lake G processors. And the introduction of the first low power quad core processor.
+
Kaby Lake builds upon the Skylake architecture, most dies are slight enchantments of their Skylake counterparts. The biggest change is the removal of the high performance quad core GT4 die, presumably because of low demand. And the introduction of the first low power quad core processor.
 
   
 
   
 
<gallery widths=300px heights=150px caption="Physical Layout Breakdown" style="float:right">
 
<gallery widths=300px heights=150px caption="Physical Layout Breakdown" style="float:right">
Line 458: Line 509:
  
 
All parts incorporate 4 GiB of [[HBM 2]] along with an [[AMD]] {{amd|Vega|l=arch}} GPU. The HBM2 and GPU are interconnected using Intel's [[EMIB]], however, the CPU and GPU are connected using standard in-package wires over standard PCIe 3.0. x8 lanes are permanently reserved for direct GPU-CPU communication. This leaves x8 additional lanes for all other peripherals that need direct connection to the CPU.
 
All parts incorporate 4 GiB of [[HBM 2]] along with an [[AMD]] {{amd|Vega|l=arch}} GPU. The HBM2 and GPU are interconnected using Intel's [[EMIB]], however, the CPU and GPU are connected using standard in-package wires over standard PCIe 3.0. x8 lanes are permanently reserved for direct GPU-CPU communication. This leaves x8 additional lanes for all other peripherals that need direct connection to the CPU.
 
[[File:intel-radeon emib solution.svg|650px]]
 
  
 
Intel claims that the use of HBM2 instead of [[GDDR5]] results in 80% less power. It's worth noting that since those are {{intel|Kaby Lake H|l=core}} parts with {{amd|Radeon}} Graphics, they effectively have two GPUs and both GPUs are usable. Fairly significant power saving can be achieved by defaulting to the integrated graphics when high performance is not required. In total there are 3 display outputs from the integrated graphics and an additional 6 outputs from the Radeon graphics for a total of 9.
 
Intel claims that the use of HBM2 instead of [[GDDR5]] results in 80% less power. It's worth noting that since those are {{intel|Kaby Lake H|l=core}} parts with {{amd|Radeon}} Graphics, they effectively have two GPUs and both GPUs are usable. Fairly significant power saving can be achieved by defaulting to the integrated graphics when high performance is not required. In total there are 3 display outputs from the integrated graphics and an additional 6 outputs from the Radeon graphics for a total of 9.
Line 498: Line 547:
 
| Capacity || 4 GiB || 4 GiB
 
| Capacity || 4 GiB || 4 GiB
 
|-
 
|-
| Clock || 800 MHz || 700 MHz
+
| Clock || 700 MHz || 800 MHz
 
|-
 
|-
 
| Bandwidth || 204.8 GB/s || 179.2 GB/s
 
| Bandwidth || 204.8 GB/s || 179.2 GB/s
Line 567: Line 616:
 
* 4 CPU cores + 24 GPU EUs
 
* 4 CPU cores + 24 GPU EUs
  
: [[File:kaby lake (quad core).png|class=wikichip_ogimage|650px]]
+
: [[File:kaby lake (quad core).png|650px]]
  
  
Line 739: Line 788:
 
* [[:File:8th-gen-radeon-rx-vega-m-product-overview.pdf|8th Gen Intel® Core processors With RadeonTM RX Vega M Graphics]]
 
* [[:File:8th-gen-radeon-rx-vega-m-product-overview.pdf|8th Gen Intel® Core processors With RadeonTM RX Vega M Graphics]]
  
== Bibliography ==
+
== References ==
 
* Intel Developer Forum 2015, San Francisco, August 18-20, 2015
 
* Intel Developer Forum 2015, San Francisco, August 18-20, 2015
 
* Intel Technology and Manufacturing Day, March 28, 2017
 
* Intel Technology and Manufacturing Day, March 28, 2017
 
* 8th Generation core announcement, August 21, 2017
 
* 8th Generation core announcement, August 21, 2017
* IEEE Hot Chips 30 Symposium (HCS) 2018.
 
* Schor, David. (September, 2018). "''[https://fuse.wikichip.org/news/1634/hot-chips-30-intel-kaby-lake-g/ Hot Chips 30: Intel Kaby Lake G]''"
 
  
 
== Artwork ==
 
== Artwork ==

Please note that all contributions to WikiChip may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see WikiChip:Copyrights for details). Do not submit copyrighted work without permission!

Cancel | Editing help (opens in new window)
codenameKaby Lake +
core count2 + and 4 +
designerIntel +
first launchedAugust 30, 2016 +
full page nameintel/microarchitectures/kaby lake +
instance ofmicroarchitecture +
instruction set architecturex86-64 +
manufacturerIntel +
microarchitecture typeCPU +
nameKaby Lake +
pipeline stages (max)19 +
pipeline stages (min)14 +
process14 nm (0.014 μm, 1.4e-5 mm) +