From WikiChip
Difference between revisions of "uc davis/kilocore"
< uc davis

(Created page with "{{ucdavis title|KiloCore}} {{mpu | name = KiloCore | no image = | image = ucd kilocore 2.png | image size = | caption...")
 
Line 6: Line 6:
 
| image size          =  
 
| image size          =  
 
| caption            = KiloCore on a daughterboard
 
| caption            = KiloCore on a daughterboard
| designer            = VLSI Computation Laboratory at UC Davis
+
| designer            = UC Davis
 
| manufacturer        = IBM
 
| manufacturer        = IBM
 
| model number        =  
 
| model number        =  
 
| part number        =  
 
| part number        =  
 
| market              =  
 
| market              =  
| first announced    = September 21, 2015
+
| first announced    = June 17, 2016
 
| first launched      =  
 
| first launched      =  
 
| last order          =  
 
| last order          =  
Line 36: Line 36:
 
| technology          = CMOS
 
| technology          = CMOS
 
| die size            = 64 mm²
 
| die size            = 64 mm²
| word size          =  
+
| word size          = 16 bit
 
| core count          = 1,000
 
| core count          = 1,000
 
| thread count        =  
 
| thread count        =  
Line 73: Line 73:
 
| socket 0 type      = BGA
 
| socket 0 type      = BGA
 
}}
 
}}
'''KiloCore''' is a prototype [[microprocessor]] containing 1,000 cores developed by the VLSI Computation Laboratory (VCL) at UC Davis. The chip, which was manufactured on [[IBM]]'s [[32 nm process]] PD-SOI technology, is said to have a maximum computation rate of 1.78 trillion instructions per second. This chip was presented at the 2016 Symposia on VLSI Technology and Circuits on June 17, 2016.
+
'''KiloCore''' is a prototype {{arch|16}} [[microprocessor]] containing 1,000 cores developed by the VLSI Computation Laboratory (VCL) at UC Davis. The chip, which was manufactured on [[IBM]]'s [[32 nm process]] PD-SOI technology, is said to have a maximum computation rate of 1.78 trillion instructions per second. This chip was presented at the 2016 Symposia on VLSI Technology and Circuits on June 17, 2016.
 +
 
 +
== Architecture ==
 +
The chip is arranged as an array of 992 cores arranged as a grid 32 by 31. 8 Additional cores are found along with 12 memory modules of 64 KB SRAM ea (for a total of 768 KB). Communication between cores is done via a [[circuit-switched network]] and a very-small-area packet router.
 +
 
 +
=== Cores ===
 +
Each core is an independent processing unit capable of issuing one instruction [[in-order]] per cycle. Instructions may come from the local instruction memory or they may be fetched from one of the independent memory module. Likewise data may come from the data memroy or from the independent memory module.
 +
 
 +
Each core contains 128x40-bit local instruction memory. Data memory is also stored in each as 2 banks of 128x16-bit each (for a total of 256x16-bit). The core also has three data address generators, two 32x16 input FIFO buffers, a 16-bit fixed-poit data path.
 +
 
 +
=== Memory Module ===
 +
Each memory module contains 64 KB of SRAM and has an area of 0.164 mm². The module also contains two 32x16-bit FIFO buffers.
 +
 
 +
== ISA ==
 +
Each core supports 72 general instructions supporting [[signed]] and [[unsigned]] operations. The processor operates on {{arch|16}} data [[word size]] with the exception of the multiply-accumulator which has a 40-bit output. Larger word size operations such as {{arch|32}} may be emulated via software.
 +
 
 +
== Cache ==
 +
* Per core
 +
** 640 bytes (128x40-bit) local instruction memory
 +
** 512 bytes (256x16-bit) local data memory
 +
* 768 KB SRAM on-die
 +
** 12 shared SRAM memory modules, 64 KB each

Revision as of 03:18, 20 June 2016

Template:mpu KiloCore is a prototype 16-bit microprocessor containing 1,000 cores developed by the VLSI Computation Laboratory (VCL) at UC Davis. The chip, which was manufactured on IBM's 32 nm process PD-SOI technology, is said to have a maximum computation rate of 1.78 trillion instructions per second. This chip was presented at the 2016 Symposia on VLSI Technology and Circuits on June 17, 2016.

Architecture

The chip is arranged as an array of 992 cores arranged as a grid 32 by 31. 8 Additional cores are found along with 12 memory modules of 64 KB SRAM ea (for a total of 768 KB). Communication between cores is done via a circuit-switched network and a very-small-area packet router.

Cores

Each core is an independent processing unit capable of issuing one instruction in-order per cycle. Instructions may come from the local instruction memory or they may be fetched from one of the independent memory module. Likewise data may come from the data memroy or from the independent memory module.

Each core contains 128x40-bit local instruction memory. Data memory is also stored in each as 2 banks of 128x16-bit each (for a total of 256x16-bit). The core also has three data address generators, two 32x16 input FIFO buffers, a 16-bit fixed-poit data path.

Memory Module

Each memory module contains 64 KB of SRAM and has an area of 0.164 mm². The module also contains two 32x16-bit FIFO buffers.

ISA

Each core supports 72 general instructions supporting signed and unsigned operations. The processor operates on 16-bit data word size with the exception of the multiply-accumulator which has a 40-bit output. Larger word size operations such as 32-bit may be emulated via software.

Cache

  • Per core
    • 640 bytes (128x40-bit) local instruction memory
    • 512 bytes (256x16-bit) local data memory
  • 768 KB SRAM on-die
    • 12 shared SRAM memory modules, 64 KB each