From WikiChip
ARM3 - Microarchitectures - Acorn
< acorn
Revision as of 03:23, 10 July 2017 by At32Hz (talk | contribs) (Clock Generator)

Edit Values
ARM3 µarch
General Info
Arch TypeCPU
DesignerAcorn Computers
ManufacturerVLSI Technology, Sanyo
Introduction1989
Process1.5 µm
Core Configs1
Pipeline
TypeScalar, Pipelined
Stages3
Decode1-way
Instructions
ISAARMv2a
Cache
L1 Cache4 KiB/core
64-way set associative
Cores
Core NamesARM3,
ARM250
Succession

ARM3 is the second-generation commercial ARM implementation designed by Acorn Computers as a successor to the ARM2.

History

See also: ARM's History

The ARM3 builds on the success of the ARM2 with higher performance through the introduction of on-die cache but without any major changes to the core itself. The ARM3 was designed by a team of four engineers in nine months and was introduced in 1989. The ARM3 can operate at up to 25 MHz with a peak performance of 25 MIPS and a sustainable performance of 12 MIPS.

Process Technology

See also: 1.5 µm process

The ARM3 was implemented on a 1.5 µm double-level metal CMOS process.

Architecture

The major goal of the ARM3 was to improve performance. A target of three times the performance of the ARM2 was set. In order to support a faster microprocessor, the system would have to use faster DRAM which would negatively impact the overall cost. Instead, the design team opted to integrating cache.

Key changes from ARM2

  • Goal 3x the performance
    • Integrated cache
    • Integrated clock generator
    • Integrated control logic
    • Integrated co-processor interface

New instructions

New ARM3 instructions:

Memory:

  • SWP - Swap word memory-register, Atomic (uninterruptible)

Block Diagram

Entire Chip

arm3 block diagram.svg

Core

arm2 block diagram.svg

Memory Hierarchy

  • Cache
    • L1 Cache (unified)
      • 4 KiB, 64-way set associative
      • 16 B line size
      • Write-through policy
      • Per core
  • System DRAM
    • Up to 64 MiB

Overview

Control

The ARM3's control logic is a state machine implemented as three PLAs. Layout was generated automatically using EDA tools using Psuedo nMOS logic in order to save on space, albeit at the slight expense of static power dissipation.

Core

Pipeline

Main article: ARM2 Pipeline

ARM3's pipeline is identical to the ARM2.

Clock Generator

To increase performance, Acorn needed to increase the clock speed. This was not possible without leaving commodity DRAM for higher speed and more expensive DRAM. The alternative approach Acorn went with is an intermediate cache on-die. The ARM3 has two clock inputs, one fast one for the internal cache and one slower one for the external memory accesses. The two inputs are entirely independent allowing the system to optimize for both (fast cache and slower memory) at the same time.

Coprocessor Interface

Acorn introduced cache with the ARM3. This meant that most instructions are now coming from the on-chip cache instead of system memory. The way the coprocessor was originally designed in the ARM2 meant that the coprocessor could simply access its instructions from memory. This was no longer compatible with the new design. Acorn solved this problem by making the coprocessor lag behind exactly one cycle. Instructions that are identified as being coprocessor instructions are broadcasted off-chip on the following cycle. This was done using the processor's 32-bit data port along with timing and handshaking signals.

Cache

New text document.svg This section is empty; you can help add the missing info by editing this page.

Die

  • 12 MHz, 1 W
  • 1.5 µm DLM CMOS
  • 8.72 mm x 9.95 mm
  • 86.764 mm² die size
  • 309,656 transistors
    • 206,454 SRAM
    • 62,973 CAM
    • 40,229 logic
  • QFP-160
    • 119 signal pins
    • 41 power/ground pins


arm3 die shot.png


arm3 die shot (annotated).png

All ARM3 Chips

 List of ARM3-based Processors
ModelProcessLaunchedFrequencyPower DissipationMax Memory
VL86C020ARM3198920 MHz
0.02 GHz
20,000 kHz
, 25 MHz
0.025 GHz
25,000 kHz
, 33 MHz
0.033 GHz
33,000 kHz
, 36 MHz
0.036 GHz
36,000 kHz
2 W
2,000 mW
0.00268 hp
0.002 kW
64 MiB
65,536 KiB
67,108,864 B
0.0625 GiB
6.103516e-5 TiB
Count: 1

References

  • Thomas, A. R. P., et al. "A 2nd Generation 32b RISC Processor with 4KByte Cache." Solid-State Circuits Conference, 1989. ESSCIRC'89. Proceedings of the 15th European. IEEE, 1989.
codenameARM3 +
core count1 +
designerAcorn Computers +
first launched1989 +
full page nameacorn/microarchitectures/arm3 +
instance ofmicroarchitecture +
instruction set architectureARMv2a +
manufacturerVLSI Technology + and Sanyo +
microarchitecture typeCPU +
nameARM3 +
pipeline stages3 +
process1,500 nm (1.5 μm, 0.0015 mm) +