From WikiChip
Editing intel/microarchitectures/cascade lake

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.

This page supports semantic in-text annotations (e.g. "[[Is specified as::World Heritage Site]]") to build structured and queryable content provided by Semantic MediaWiki. For a comprehensive description on how to use annotations or the #ask parser function, please have a look at the getting started, in-text annotation, or inline queries help pages.

Latest revision Your text
Line 83: Line 83:
 
|contemporary link=intel/microarchitectures/coffee lake
 
|contemporary link=intel/microarchitectures/coffee lake
 
}}
 
}}
[[File:cascade lake chip.JPG|right|thumb|Cascade Lake]]
 
 
'''Cascade Lake''' ('''CSL'''/'''CLX''') is [[Intel]]'s successor to {{\\|Skylake (server)|Skylake}}, a [[14 nm]] [[microarchitecture]] for enthusiasts and servers. Cascade Lake is the "Optimization" phase as part of Intel's {{intel|PAO}} model.
 
'''Cascade Lake''' ('''CSL'''/'''CLX''') is [[Intel]]'s successor to {{\\|Skylake (server)|Skylake}}, a [[14 nm]] [[microarchitecture]] for enthusiasts and servers. Cascade Lake is the "Optimization" phase as part of Intel's {{intel|PAO}} model.
  
Line 103: Line 102:
  
 
== Brands ==
 
== Brands ==
Cascade Lake is sold under five different families.
+
Cascade Lake is sold under four different families.
  
 
{| class="wikitable tc4 tc5 tc6 tc7 tc8" style="text-align: center;"
 
{| class="wikitable tc4 tc5 tc6 tc7 tc8" style="text-align: center;"
Line 110: Line 109:
 
|-
 
|-
 
! Cores !! {{intel|Hyper-Threading|HT}} !! {{intel|Turbo Boost|TBT}} !! {{x86|AVX-512}} !! AVX-512 Units !! {{intel|Ultra Path Interconnect|UPI}} links !! Scalability
 
! Cores !! {{intel|Hyper-Threading|HT}} !! {{intel|Turbo Boost|TBT}} !! {{x86|AVX-512}} !! AVX-512 Units !! {{intel|Ultra Path Interconnect|UPI}} links !! Scalability
|-
 
| || {{intel|Xeon W}}  || style="text-align: left;" | High-performance Workstations || 8-28 || {{tchk|yes}} || {{tchk|yes}} || {{tchk|yes}} || 1 || - || -
 
|-
 
| colspan="11" |
 
 
|-
 
|-
 
| [[File:xeon bronze (2017).png|50px]] || {{intel|Xeon Bronze}} || style="text-align: left;" | Entry-level performance / <br>Cost-sensitive || 6 || {{tchk|no}} || {{tchk|no}} || {{tchk|yes}} || 1 || 2 || Up to 2
 
| [[File:xeon bronze (2017).png|50px]] || {{intel|Xeon Bronze}} || style="text-align: left;" | Entry-level performance / <br>Cost-sensitive || 6 || {{tchk|no}} || {{tchk|no}} || {{tchk|yes}} || 1 || 2 || Up to 2
Line 125: Line 120:
 
| [[File:xeon platinum (2017).png|50px]] || {{intel|Xeon Platinum}} || style="text-align: left;" | Highest performance / flexibility || 4-28 || {{tchk|yes}} || {{tchk|yes}} || {{tchk|yes}} || 2 || 3 || Up to 8
 
| [[File:xeon platinum (2017).png|50px]] || {{intel|Xeon Platinum}} || style="text-align: left;" | Highest performance / flexibility || 4-28 || {{tchk|yes}} || {{tchk|yes}} || {{tchk|yes}} || 2 || 3 || Up to 8
 
|}
 
|}
 
:[[File:xeon sp naming change.svg|800px]]
 
  
 
=== Identification ===
 
=== Identification ===
 
:[[File:cascade lake naming scheme.svg|750px]]
 
:[[File:cascade lake naming scheme.svg|750px]]
 
 
Where,
 
 
* "''F''" suffix integrates the {{intel|Omni-Path}} Host Fabric Interface (HFI) die on-package
 
* "''L''" suffix indicates the SKU is a large memory (4.5 TiB) tier SKU
 
* "''M''" suffix indicates the SKU is a medium memory (2 TiB) tier SKU
 
* "''N''" suffix indicates the SKU is a networking-specialized model
 
* "''S''" suffix indicates the SKU is a search application-specialized model
 
* "''T''" suffix indicates that SKU has an extended lifetime (10 year use) guarantees and [[NEBS]]-friendly packing specification
 
* "''U''" suffix indicates the SKU is a single-socket model (even if part of the [[Xeon Gold]] family that normally supports up two 4-way [[SMP]])
 
* "''V''" suffix indicates the SKU targets the VM density value market
 
* "''Y''" suffix indicates the SKU has {{intel|Speed Select Technology}} (SST)
 
  
 
Note that Speed Select (SST) SKUs were originally suffixed with the 'C' suffix but were later changed to 'Y'. Some of the early engineering samples that are circulating around still suffixed with a 'C'.
 
Note that Speed Select (SST) SKUs were originally suffixed with the 'C' suffix but were later changed to 'Y'. Some of the early engineering samples that are circulating around still suffixed with a 'C'.
  
 
== Release Dates ==
 
== Release Dates ==
Cascade Lake was released on April 2, 2019. {{intel|Cascade Lake W|l=core}} for workstations were released on June 3, 2019.
+
Cascade Lake was released on April 2, 2019.
  
 
== Process Technology ==
 
== Process Technology ==
Line 182: Line 162:
 
*** Hardware mitigations for {{cve|CVE-2018-3620}}/{{cve|CVE-2018-3646}} (L1 Terminal Fault, Foreshadow)
 
*** Hardware mitigations for {{cve|CVE-2018-3620}}/{{cve|CVE-2018-3646}} (L1 Terminal Fault, Foreshadow)
 
*** Hardware mitigations for {{cve|CVE-2018-12130}}/{{cve|CVE-2018-12126}}/{{cve|CVE-2018-12127}}/{{cve|CVE-2019-11091}} (MDS; MFBDS, RIDL, MSBDS, Fallout, MLPDS, MDSUM)
 
*** Hardware mitigations for {{cve|CVE-2018-12130}}/{{cve|CVE-2018-12126}}/{{cve|CVE-2018-12127}}/{{cve|CVE-2019-11091}} (MDS; MFBDS, RIDL, MSBDS, Fallout, MLPDS, MDSUM)
**** ''note that while steppings 6 & 7 are fully mitigated, earlier stepping 5 is not protected against MSBDS, MLPDS, nor MDSUM''
 
 
** New {{x86|CPUID}} Level Type field for "die"
 
** New {{x86|CPUID}} Level Type field for "die"
 
* Integrated Memory Controller
 
* Integrated Memory Controller
 
** Added support for [[persistent memory]]
 
** Added support for [[persistent memory]]
 
*** Support for DDR-T / Optane DIMMs
 
*** Support for DDR-T / Optane DIMMs
**** Apache Pass DIMMs
 
 
* Memory
 
* Memory
 
** Higher data rate (2933 MT/s, up from 2666 MT/s)
 
** Higher data rate (2933 MT/s, up from 2666 MT/s)
Line 193: Line 171:
 
** Extended memory support for up to 2 TiB per socket (up from 1.5 TiB)
 
** Extended memory support for up to 2 TiB per socket (up from 1.5 TiB)
 
** Large memory support for up to 4.5 TiB per socket
 
** Large memory support for up to 4.5 TiB per socket
* I/O
 
** x64 PCIe lanes exposed to the platform (up from x48) (''{{intel|Xeon W}} only'')
 
  
{{expand list}}
 
 
====New instructions ====
 
====New instructions ====
 
Cascade Lake introduced a number of {{x86|extensions|new instructions}}:
 
Cascade Lake introduced a number of {{x86|extensions|new instructions}}:
Line 275: Line 250:
 
**** fixed partition
 
**** fixed partition
 
*** 1G page translations:
 
*** 1G page translations:
**** 4 entries; 4-way associative
+
**** 4 entries; fully associative
 
**** fixed partition
 
**** fixed partition
 
** STLB
 
** STLB
Line 312: Line 287:
 
== Higher core-count multi-chip processors ==
 
== Higher core-count multi-chip processors ==
 
[[File:cascade lake ap overview.png|thumb|right|Overview slide]]
 
[[File:cascade lake ap overview.png|thumb|right|Overview slide]]
[[File:intel cascade lake ap chip with heatsink.JPG|right|thumb|Cascade Lake AP]]
 
[[File:cascade lake ap board.JPG|right|thumb|Cascade Lake AP board]]
 
 
{{main|intel/cores/cascade_lake_ap|l1=Cascade Lake Advanced Performance}}
 
{{main|intel/cores/cascade_lake_ap|l1=Cascade Lake Advanced Performance}}
Intel introduced a number of new products, code name {{intel|Cascade Lake Advanced Performance|l=core}}, which doubled the core count. Intel achieved this by packaging two extreme core count (XCC) dies together on the same substrate in a BGA package. The two dies are interconnected through 1 {{intel|UPI}} link which paved the way for models up to 56 cores. This was done to support a fully-connected system in a 2-way SMP system which is where those chips are designed to go. In those systems, every die is interconnected to every other die (four in total) over a {{intel|UPI}} link. With two dies per package, each CPU exposes 12 DDR4 memory channels and x40 PCIe Gen3 lanes.
+
Intel introduced a number of new products, code name {{intel|Cascade Lake Advanced Performance|l=core}}, which doubled the core count. Intel achieved this by packaging two extreme core count (XCC) dies together on the same substrate in a BGA package. The two dies are interconnected through 1 {{intel|UPI}} link which paved the way for models up to 56 cores. This was done to support a fully-connected system in a 2-way SMP system which is where those chips are designed to go. In those systems, every die is interconnected to every other die (four in total) over a {{intel|UPI}} link. With to dies per package, each CPU exposes 12 DDR4 memory channels and x40 PCIe Gen3 lanes.
  
  
Line 359: Line 332:
 
One of the other features that was added in Cascade Lake is Speed Select Technology (SST) which allows a chip to be configured in the field for various workloads such as throughput and single-core performance. This is done through configurations that allow the end user to determine how the power budget is spent. For example, single-thread performance can be further increased for fewer chips by further decreasing the base frequency of more inactive core. Alternatively, higher throughput can be achieved by increasing the base frequency of some cores while reducing the base frequency of other cores. All in all, the conigurations are ultimately bound by the power budget of the chip, however the custom tweaking of frquency (base, turbo, or otherwise) enable custom tuning of affinitized workloads.
 
One of the other features that was added in Cascade Lake is Speed Select Technology (SST) which allows a chip to be configured in the field for various workloads such as throughput and single-core performance. This is done through configurations that allow the end user to determine how the power budget is spent. For example, single-thread performance can be further increased for fewer chips by further decreasing the base frequency of more inactive core. Alternatively, higher throughput can be achieved by increasing the base frequency of some cores while reducing the base frequency of other cores. All in all, the conigurations are ultimately bound by the power budget of the chip, however the custom tweaking of frquency (base, turbo, or otherwise) enable custom tuning of affinitized workloads.
  
== Specialized SKUs ==
+
== Persistent Memory Support ==
Though Intel has been providing various customers with specialized SKUs for a long time, with Cascade Lake, Intel started doubling down on specialized SKUs. Many SKUs that were sought after by customers were introduced as mainstream SKUs in Intel's standard Xeon lineup. New SKUs can be grouped into the following categories:
 
 
 
* Speed Select Technology SKUs
 
* Specialized for network function virtualization
 
* Specialized for networking and IoT (NEBS Friendly)
 
* Specialized Search Application
 
* Extended Memory
 
 
 
=== Speed Select Technology (SST) ===
 
{{main|intel/speed_select_technology|l1=Speed Select Technology (SST)}}
 
Speed Select Technology is a new feature found on SST-enabled SKUs that allows for finer per-core power/performance configuration. SST-enabled SKUs come with additional controls that allow system administrators to change the turbo and base frequencies of certain cores. Those cores (called prioritized cores) can then have certain applications with higher priority affinitized to them. Since the power budget of the processor is fixed, with less prioritized cores, it’s possible to increase the base or turbo frequency. This can be furthered improved by reducing the frequencies of lower-priority cores (below their pre-defined base frequencies). In other words, SST allows for higher performance for priority workloads through the sacrifice of lower-priority workloads.
 
 
 
=== VM Density Value Specialized SKUs ===
 
VM density value optimized are SKUs that have been optimized to provide higher ROI for customers that benefit more from the higher thread count. There are two new VM density SKUs. Those models are suffixed with a ‘V’.
 
 
 
=== Network Function Virtualization (NFV) SKUs ===
 
NFV models are optimized for dynamic density VMs with additional headroom for higher subscribe capacity. NFV SKUs also feature speed select profiles with configurable priority based on the kind of workloads that are running. There are three new NFV SKUs. Those models are suffixed with a ‘N’.
 
  
=== Search Application Value specialized ===
+
Intel Optane DC Persistent Memory (DCPMM) (Codename: Apache Pass) will be supported starting with Cascade Lake. The next architecture/codename is Barlow Pass.
The search-optimized SKU is mainly designed for cloud search applications. Those models are designed such that they have predictable performance and latencies. There is one new search-optimized SKUs. Those models are suffixed with an ‘S’.
 
  
=== Specialized for networking and IoT (NEBS Friendly) ===
+
DCPMM is designed to improve overall Data Center (DC) system performance and to lower latencies by putting more data closer to the processor on NV Media.
Models that are suffixed with “T” have extended lifetime (10 year use) guarantees and NEBS-friendly packing specification. There are four new NEBS-friendly SKUs.
 
  
=== Extended Memory SKUs ===
+
DCPMM Modules are DDR4 socket compatible, and will be available in capacities up to 512GB/Module.  
Beyond the normal supported memory, Intel also offers fourteen additional SKUs with extended memory support. Xeon Scalable models with the ‘M’ suffix offer medium DDR memory tier support with up to 2 TiB of memory per socket. Above those SKUs are ‘L’-suffixed SKUs which offer large memory support of up to 4.5 TiB of memory per socket.
 
  
== Persistent memory support ==
+
=== DCPMM Modules Sizes ===
{{see also|x86/persistent_memory_extensions|snia/npm|l1=Persistent Memory Extensions|l2=NVM Programming Model}}
 
Cascade Lake introduces support for persistent memory. For Cascade Lake, this comes in the form of first-generation Optane DC Persistent Memory Modules (DCPMM), codename Apache Pass. PMMs are designed to improve the overall data center system performance by bringing a larger amount of data closer to the processor, in terms of latency, but behind DRAM.
 
  
For first-generation Optane DC DIMMs, Intel supports capacities of 128, 256, and 512 GiB. The DIMMs are DDR4 pin-compatible and although they are slightly slower than DRAM, they are considerably faster than a typical SSD - fast enough to double up as "slow main memory". Optane DC DIMMs are designed to work with direct byte-addressable load/store accesses and have built-in encryption. They allow cache line access (i.e., 64B granularity) and offer idle latency close to that of DDR4 DIMMs.
+
* 128GB
 +
* 256GB
 +
* 512GB
  
[[File:cascade-presistence.png|right|thumb|Persistence domain]]
+
=== DCPMM Specs ===
Although Optane DC DIMMs are DDR4 pin-compatible, meaning they use the same electrical and mechanical interface as DDR4, they are not a direct drop-in replacement. Those DIMMs have different characteristics and therefore they interface with the CPU over proprietary protocol extensions. For this reason, Cascade Lake features an overhauled memory controller capable of interfacing with both DDR4 DIMMs and Optane DC DIMMs. Memory channels can be shared between DDR4 and Optane DC modules. For example, a single channel can have one regular DDR4 DIMM while the other DIMM can be an Optane DC DIMM. All in all, Optane DC DIMMs allow for greater than 3 TiB of system memory per socket.
 
  
Although the instructions to support persistent memory were already introduced in {{\\|Skylake (Server)|Skylake}}, Cascade Lake can now actually make use of them. The bare minimum for persistent memory support can be realized by simply putting the data in the write pending queue (WPQ) of the integrated memory controller within the persistence domain. The persistence domain is a unique checkpoint. Once data makes it to that point, it’s persistence is guaranteed by the platform interface. This is shown in the diagram on the right in the bottom dotted box. Any data within that box is either saved on the DIMM, on the way to the DIMM, or in the WPQ in the IMC. Regardless of where it is, the platform is required to store enough energy (e.g., through on-board supercapacitors) to save everything within that box in the event of a power loss.
+
* Capacity Per CPU: 3TB
 
+
* Speed 2666MT/sec
=== Encryption ===
+
* Close to DRAM latency
[[File:cascade-lake-optane-dimm-encryption.png|right|thumb|PMM Encryption]]
+
* Cache line size access
One of the unique problems associated with persistent memory is the security of the persistent data itself. To that end, Optane DIMMs protects all data with 256b AES-XTP. There are two supported modes. The first one is memory mode where the DIMM is treated like DRAM memory (one big cache) and as such, the key is regenerated each boot and data is lost on a power cycle. The second mode is App Direct which keeps the key on a power cycle. In this mode, the passphrase must be securely stored to unlock the data. Interestingly, one of the features that are currently missing is support for any form of virtualization. Currently, in a virtualized environment, the DIMM is unlocked using a single passphrase meaning the host has access to all the data. It’s reasonable to expect that in future Xeons, support for virtualization will be added such that data can be remained encrypted and private for that VM.
+
* Operational Modes
 +
** APP Direct Mode - low latency persistent memroy
 +
** Memory Mode - Used as DDR4 DRAM
  
 
== Scalability ==
 
== Scalability ==
Line 552: Line 506:
 
</table>
 
</table>
 
{{comp table end}}
 
{{comp table end}}
 
=== SKU Comparison ===
 
Below are a number of SKU comparison graphs based on their specifications.
 
 
<div style="float: left; margin: 10px">
 
{{#ask: [[Category:microprocessor models by intel]] [[microarchitecture::Cascade Lake]]
 
|?core count
 
|?base frequency
 
|charttitle=Cores vs. Base Frequency
 
|numbersaxislabel=Frequency (MHz)
 
|labelaxislabel=Core Count
 
|height=400
 
|width=400
 
|theme=vector
 
|group=property
 
|grouplabel=subject
 
|charttype=scatter
 
|format=jqplotseries
 
|mainlabel=-
 
}}
 
</div>
 
 
<div style="float: left; margin: 10px">
 
{{#ask: [[Category:microprocessor models by intel]] [[microarchitecture::Cascade Lake]]
 
|?core count
 
|?turbo frequency (1 core)
 
|charttitle=Cores vs. Turbo Frequency
 
|numbersaxislabel=Frequency (MHz)
 
|labelaxislabel=Core Count
 
|height=400
 
|width=400
 
|theme=vector
 
|group=property
 
|grouplabel=subject
 
|charttype=scatter
 
|format=jqplotseries
 
|mainlabel=-
 
}}
 
</div>
 
 
<div style="float: left; margin: 10px">
 
{{#ask: [[Category:microprocessor models by intel]] [[microarchitecture::Cascade Lake]]
 
|?core count
 
|?tdp
 
|charttitle=Cores vs. TDP
 
|numbersaxislabel=TDP (W)
 
|labelaxislabel=Core Count
 
|height=400
 
|width=400
 
|theme=vector
 
|group=property
 
|grouplabel=subject
 
|charttype=scatter
 
|format=jqplotseries
 
|mainlabel=-
 
}}
 
</div>
 
 
<div style="float: left; margin: 10px;">
 
{{#ask: [[Category:microprocessor models by intel]] [[microarchitecture::Cascade Lake]]
 
|?turbo frequency (1 core)
 
|?tdp
 
|charttitle=Frequency vs. TDP
 
|numbersaxislabel=TDP (W)
 
|labelaxislabel=Frequency (MHz)
 
|height=400
 
|width=90%
 
|theme=vector
 
|group=property
 
|grouplabel=subject
 
|charttype=scatter
 
|format=jqplotseries
 
|mainlabel=-
 
}}
 
</div>
 
 
{{clear}}
 
 
== Bibliography ==
 
* Intel DC Tech Day, May 2019
 
* Intel. ''personal communication''.
 
  
 
== Documents ==
 
== Documents ==
 
* [[:File:cascade-lake-advanced-performance-press-deck.pdf|Cascade Lake Advanced Performance]]
 
* [[:File:cascade-lake-advanced-performance-press-deck.pdf|Cascade Lake Advanced Performance]]

Please note that all contributions to WikiChip may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see WikiChip:Copyrights for details). Do not submit copyrighted work without permission!

Cancel | Editing help (opens in new window)
codenameCascade Lake +
core count2 +, 4 +, 6 +, 8 +, 10 +, 12 +, 16 +, 18 +, 20 +, 22 +, 24 +, 26 +, 28 +, 32 +, 48 + and 56 +
designerIntel +
first launched2019 +
full page nameintel/microarchitectures/cascade lake +
instance ofmicroarchitecture +
instruction set architecturex86-64 +
manufacturerIntel +
microarchitecture typeCPU +
nameCascade Lake +
pipeline stages (max)19 +
pipeline stages (min)14 +
process14 nm (0.014 μm, 1.4e-5 mm) +