Editing habana/microarchitectures/goya

{{habana title|Goya|arch}}
{{microarchitecture
|atype=NPU
|name=Goya
|designer=Habana
|manufacturer=TSMC
|introduction=2018
|process=16 nm
|processing elements=8
|contemporary=Gaudi
|contemporary link=habana/microarchitectures/gaudi
}}
[[File:Habana GOYA.png|right|thumb|Goya Logo]]
'''Goya''' is a [[16-nanometer]] microarchitecture for inference [[neural processors]] designed by [[Habana Labs]].

== Process Technology ==
Goya-based processors are fabricated on [[TSMC]] [[16-nanometer]] process.

==Architecture ==
=== Block Diagram ===
:[[File:habana goya block diagram.svg|400px]]

== Overview ==
Goya is designed as a microarchitecture for the [[acceleration]] of inference. Since the target market is the data center, the [[thermal design point]] for those chips was relatively high - at around 200 W. The design uses a heterogenous approach comprising of a large General Matrix Multiply (GMM) engine, Tensor
Processor Cores (TPCs), and a large shared memory pool.

There are eight TPCs. Each TPC also incorporates its own local memory but omits caches.  Each core is a [[VLIW]] DSP design that has been optimized for AI applications. This includes [[AI]]-specific [[instructions]] and operations. The TPCs are designed for flexibility and can be programmed in plain [[C]]. The TPC supports mixed-prevision operations including 8-bit, 16-bit, and 32-bit SIMD vector operations for both [[integer]] and [[floating-point]]. This was done in order to allow accuracy loss tolerance to be controlled on a per-model design by the programmer. Goya offers both coarse-grained precision control and fine-grained down to the tensor level.

== Scalability ==
{{empty section}}
== See also ==
* {{\\|Gaudi}}
* {{habana|HL}} series
codename	Goya +
designer	Habana +
first launched	2018 +
full page name	habana/microarchitectures/goya +
instance of	microarchitecture +
manufacturer	TSMC +
name	Goya +
process	16 nm (0.016 μm, 1.6e-5 mm) +
processing element count	8 +