From WikiChip
Editing pezy/pezy-scx

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.

This page supports semantic in-text annotations (e.g. "[[Is specified as::World Heritage Site]]") to build structured and queryable content provided by Semantic MediaWiki. For a comprehensive description on how to use annotations or the #ask parser function, please have a look at the getting started, in-text annotation, or inline queries help pages.

Latest revision Your text
Line 30: Line 30:
 
| successor link  =  
 
| successor link  =  
 
}}
 
}}
'''PEZY-SCx''' ('''PEZY-S'''uper'''C'''omputer'''x''') is a family of [[many-core microprocessor]]s designed by [[PEZY]]. Those processors power many of [[Japan]]'s most efficient supercomputers.
+
'''PEZY-SCx''' is a family of [[many-core microprocessor]]s designed by [[PEZY]]. Those processors power many of [[Japan]]'s most efficient supercomputers.
  
 
== Overview ==
 
== Overview ==
PEZY-SCx is a family of high-performance, low-power many-core microprocessors designed by [[PEZY]] for a series of supercomputer developed in [[Japan]]. PEZY collaborates closely with ExaScaler, a company that provides immersion cooling systems. Together, they have developed a series of supercomputers called [[ZettaScaler]].
+
PEZY-SCx is a family of high-performance, low-power many-core microprocessors designed by [[PEZY]] for a series of supercomputer developed in Japan.
 
+
{{expand section}}
=== Architecture ===
 
The basic architecture of all the PEZY-SCx chips is fairly similar. At the heart is the Processing Element. Depending on the model, 1000s of those PEs are then integrated on a single [[die]].
 
 
 
The PEZY-SCx are designed as accelerators, that is, the a host processor (typically an [[Intel]] {{intel|Xeon E5}}) off-loads the PEZY-SC code to execute. Those chips support OpenCL-like programming called PZCL.
 
 
 
====  Processing Element (PE) ====
 
[[File:pezy-sc pe.svg|right|200px]]
 
The [[physical core|cores]] are called the '''processing elements''' ('''PE'''). The PEs are designed to be very simple [[RISC]] cores that are confgured as [[MIMD]] although in principle each PE can run different workloads. Each PE is a 16-stage [[in-order]] [[superscalar]] capable of issuing two instructions per cycle with [[out-of-order]] completion whenever possible supporting 8-way fine-grain [[simultaneous multithreading]]. A processing element supports 8-way SMT with dedicated register files for each thread. Threads are are interleaved each cycle with switching done to reduce [[forwarding]] and in order mitigate the lack of [[branch prediction]]. Explicit switching of active threads is also done in order to hide high latency operations.
 
 
 
The instruction set architecture implemented is a proprietary one designed by PEZY. The instruction set supports various operations such as data flashing, synchronization, acquisition of IDs, and thread switching. Each PE has an ID which is used by the code to track processes. The PEs do not maintain [[cache-coherency]] and there is no per-PE [[data cache]]. Complex instructions are processed by the Special Function Units (SFU) located in each city. A fair amount of sacrifices were made in order to ensure the cores remain small enough so that a large amount of them can be packed into a small area
 
 
 
==== Village & City ====
 
For every pair of PEs is 2 KiB of [[level 1 data cache]]. Each '''City''' is made of 64 KiB of [[L2 cache]], a number of special function units, and 4 smaller blocks called "Villages". A '''village''' consists of four processing elements. Each city also contains a Special Function Unit (SFU) which is used to execute complex instructions.
 
 
 
::[[File:pezy-sc city.svg|450px]]
 
  
 
== Models ==
 
== Models ==
The origin of the PEZY-SCx family is the {{pezy|PEZY-1}}, a 512-core chip.
 
 
=== 1st generation ===
 
{{main|pezy/pezy-scx/pezy-sc|l1=PEZY-SC}}
 
The first series of supercomputers, [[ZettaScaler#ZettaScaler-1.x|ZettaScaler-1.x]], were based on the {{\\|PEZY-SC}}. The {{\\|PEZY-SC}} featured four "Prefecture", each consisting of 16 cities for a total of 256 PEs per Prefecture along with 2 MiB of [[L3 cache]]. This chip had four such Prefecture units for a total of 1,024 [[physical core|cores]] and 8,192 [[logical cores|threads]]. Operating at 733 MHz, this chip was capable of 3 TFLOPS ([[single-precision]]) and 1.5 TFLOPS ([[double-precision]]). A number of signal-related issues particularly relating to PCIe signal failure were addressed by PEZY with the introduction of the {{\\|PEZY-SCnp}} which made use of a new package ("np"). The PEZY-SCnp, while identical to the earlier model does feature slightly higher clock resulted in slightly higher peak performance.
 
 
 
: [[File:pezy-sc main block.svg|600px]]
 
 
=== 2nd generation ===
 
{{main|pezy/pezy-scx/pezy-sc2|l1=PEZY-SC2}}
 
The second series of supercomputers, [[ZettaScaler#ZettaScaler-2.x|ZettaScaler-2.x]], were based on the {{\\|PEZY-SC2}}.
 
 
: [[File:pezy-sc2 main block.svg|600px]]
 
 
=== future generations ===
 
PEZY has laid out future generations based on [[TSMC]]'s [[7nm]] and [[5nm]] processes.
 
 
== Summary ==
 
 
<!-- NOTE:  
 
<!-- NOTE:  
 
           This table is generated automatically from the data in the actual articles.
 
           This table is generated automatically from the data in the actual articles.
Line 80: Line 46:
 
{{comp table start}}
 
{{comp table start}}
 
<table class="comptable sortable tc4">
 
<table class="comptable sortable tc4">
{{comp table header|main|8:List of PEZY-SCx Processors}}
+
{{comp table header|main|5:List of PEZY-SCx Processors}}
{{comp table header|main|5:Main Features|3:Performance}}
+
{{comp table header|main|5:Main processor}}
{{comp table header|cols|Process|Launched|Cores|Threads|Die|%Frequency|FLOPS (SP)|FLOPS (DP)}}
+
{{comp table header|cols|Process|Launched|Cores|%Frequency}}
 
{{#ask: [[Category:microprocessor models by pezy]] [[family::PEZY-SCx]]
 
{{#ask: [[Category:microprocessor models by pezy]] [[family::PEZY-SCx]]
 
  |?full page name
 
  |?full page name
Line 89: Line 55:
 
  |?first launched
 
  |?first launched
 
  |?core count
 
  |?core count
|?thread count
 
|?die area
 
 
  |?base frequency#MHz
 
  |?base frequency#MHz
|?peak flops (single-precision)#TFLOPS
 
|?peak flops (double-precision)#TFLOPS
 
 
  |format=template
 
  |format=template
 
  |template=proc table 3
 
  |template=proc table 3
  |userparam=10
+
  |userparam=6
 
  |mainlabel=-
 
  |mainlabel=-
 
  |sort=process,model number
 
  |sort=process,model number
Line 106: Line 68:
 
== See also ==
 
== See also ==
 
* Intel {{intel|Xeon Phi}}
 
* Intel {{intel|Xeon Phi}}
 
== Bibliography ==
 
* IEEE Cool Chips XVIII Symposium 2015.
 
* JSICR HPC (2015-HPC-152) "Suiren(睡蓮)による計算科学アプリケーションの性能評価" (Performance evaluation of scientific applications on Suiren System)
 
* The Fifth International Symposium on Computing and Networking 2017 (CANDAR'17). Keynote address.
 
 
[[category:supercomputing in japan]]
 

Please note that all contributions to WikiChip may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see WikiChip:Copyrights for details). Do not submit copyrighted work without permission!

Cancel | Editing help (opens in new window)
Facts about "PEZY-SCx - PEZY"
designerPEZY Computing +
first announced2014 +
first launched2014 +
full page namepezy/pezy-scx +
instance ofmicroprocessor family +
main designerPEZY Computing +
manufacturerTSMC +
namePEZY-SCx +
process28 nm (0.028 μm, 2.8e-5 mm) +, 16 nm (0.016 μm, 1.6e-5 mm) +, 7 nm (0.007 μm, 7.0e-6 mm) + and 5 nm (0.005 μm, 5.0e-6 mm) +
technologyCMOS +