

Chapter 2: Machines, Machine Languages, and Digital Logic

Instruction sets, SRC, RTN, and the mapping of register transfers to digital logic circuits



### **Chapter 2 Topics**

- 2.1 Classification of Computers and Instructions
- 2.2 Kinds and Classes of Instruction Sets
- 2.3 Informal Description of the Simple RISC Computer, SRC
  - Students may wish to consult Appendix C, Assembly and Assemblers for information about assemblers and assembly.
- 2.4 Formal Description of SRC using Register Transfer Notation (RTN)
- 2.5 RTN Description of Addressing Modes
- 2.6 Register Transfers and Logic Circuits: from Behavior to Hardware
  - Students may wish to consult Appendix A, Digital Logic for additional information about Digital Logic circuits.



## What are the components of an ISA?

- Sometimes known as *The Programmers Model* of the machine
- Storage cells
  - General and special purpose registers in the CPU
  - Many general purpose cells of same size in memory
  - Storage associated with I/O devices
- The Machine Instruction Set
  - The instruction set is the entire repertoire of machine operations
  - Makes use of storage cells, formats, and results of the fetch/execute cycle
  - i. e. Register Transfers
- The Instruction Format
  - Size and meaning of fields within the instruction
- The nature of the Fetch/Execute cycle
  - Things that are done before the operation code is known

# Fig. 2.1 Programmer's Models of Various Machines

2/eWe saw in Chap. 1 a variation in number and type of storage cells



С

D

Α

S



## What Must an Instruction Specify?

- Which operation to perform:
  - Ans: Op code: add, load, branch, etc.
- Where to find the operand or operands add r0, r1, r3
  - In CPU registers, memory cells, I/O locations, or part of instruction
- Place to store result
  - Again CPU register or memory cell
- Location of next instruction

The default is usually memory cell pointed to by program counter—PC: the next instruction in sequence

Sometimes there is no operand, or no result, or no next instruction. Can you think of examples?

add r0, r1, r3

add r0, r1, r3

add r0, r1, r3

# S Fig. 2.2 Accessing Memory—Reading from Memory



For a Memory Read: Copyright © 2004 Pearson Prentice Hall, Inc. CPU applies desired address to Address lines  $A_0$ - $A_{n-1}$ CPU issues Read command, R Memory returns the value at that address on Data lines  $D_0$ - $D_{b-1}$  and asserts the COMPLETE signal

Computer Systems Design and Architecture Second Edition

С

D

Α

<mark>2</mark>/e

# Figure 2.2 Accessing Memory—Writing to Memory



For a Memory Write: Copyright © 2004 Pearson Prentice Hall, Inc.

CPU applies desired address to Address lines  $A_0-A_{n-1}$  and and data to be written on Data lines  $D_0-D_{b-1}$ CPU issues Write command, W Memory asserts the COMPLETE signal when the data has been written to memory.

Computer Systems Design and Architecture Second Edition

С

D

Α

<mark>2</mark>/e

S

© 2004 Prentice Hall



### Instructions Can Be Divided into 3 Classes

### Data movement instructions

- Move data from a memory location or register to another memory location or register without changing its form
- Load—source is memory and destination is register
- Store—source is register and destination is memory
- Arithmetic and logic (ALU) instructions
  - Changes the form of one or more operands to produce a result stored in another location
  - Add, Sub, Shift, etc.
- Branch instructions (control flow instructions)
  - Any instruction that alters the normal flow of control from executing the next instruction in sequence
  - Br Loc, Brz Loc2,—unconditional or conditional branches



# Tbl. 2.1 Examples of Data Movement Instructions

| Instruct.                | Meaning                                  | Machine       |
|--------------------------|------------------------------------------|---------------|
| MOV A, B                 | Move 16 bits from mem. loc. A to loc. B  | VAX11         |
| <−<br>lwz R3, A          | Move 32 bits from mem. loc. A to reg. R3 | PPC601        |
| <<br>li \$3, 455         | Load the 32 bit integer 455 into reg. 3  | MIPS R3000    |
| mov R4, dout             | Move 16 bits from R4 to out port dout    | DEC PDP11     |
| <pre> IN, AL, KBD </pre> | Load a byte from in port KBD to accum.   | Intel Pentium |
| LEA.L (A0), A2           | Load address pointed to by A0 into A2    | M68000        |

- Lots of variation, even with one instruction type
- Notice differences in direction of data flow left-to-right or right-to-left



| Instruction       | Meaning                                                                         | <b>Machine</b>    |
|-------------------|---------------------------------------------------------------------------------|-------------------|
| MULF A, B, C      | multiply the 32-bit floating point values at<br>mem loc'ns. A and B, store at C | VAX11             |
| nabs r3, r1       | Store abs value of r1 in r3                                                     | PPC601            |
| ori \$2, \$1, 255 | Store logical OR of reg \$ 1 with 255 into reg \$2                              | <b>MIPS R3000</b> |
| DEC R2            | Decrement the 16-bit value stored in reg R2                                     | DEC PDP11         |
| SHL AX, 4         | Shift the 16-bit value in reg AX left by 4 bits                                 | Intel 8086        |

#### •Notice again the complete dissimilarity of both syntax and semantics

С

D

Α

2/e

S



### Tbl 2.3 Examples of Branch Instructions

| Instruction      | Meaning                                                                              | <u>Machine</u>    |
|------------------|--------------------------------------------------------------------------------------|-------------------|
| BLSS A, Tgt      | Branch to address Tgt if the least significant bit of mem loc'n. A is set (i.e. = 1) | VAX11             |
| bun r2           | Branch to location in R2 if result of previous                                       | PPC601            |
|                  | floating point computation was Not a Number (NAN                                     | 1)                |
| beq \$2, \$1, 32 | Branch to location (PC + 4 + 32) if contents                                         | <b>MIPS R3000</b> |
|                  | of \$1 and \$2 are equal                                                             |                   |
| SOB R4, Loop     | Decrement R4 and branch to Loop if R4 $\neq$ 0                                       | DEC PDP11         |
| JCXZ Addr        | Jump to Addr if contents of register $CX = 0$ .                                      | Intel 8086        |



# CPU Registers Associated with Flow of Control—Branch Insts.

- Program counter usually contains the address of, or "points to" the next instruction
- Condition codes may control branch
- Branch targets may be contained in separate registers

**Processor State** 

Program Counter





**Condition Codes** 



i

# HLL Conditionals Implemented by Control Flow Change

- Conditions are computed by arithmetic instructions
- Program counter is changed to execute only instructions associated with true conditions

| C language          | As | sembly                | lang             | uage       |                                                                               |
|---------------------|----|-----------------------|------------------|------------|-------------------------------------------------------------------------------|
| f NUM==5 then SET=7 | L1 | CMP.W<br>BNE<br>MOV.W | #5,<br>L1<br>#7, | NUM<br>SET | ;the comparison<br>;conditional branch<br>;action if true<br>;action if false |



# CPU Registers may have a "personality"

- Architecture classes are often based on how where the operands and result are located and how they are specified by the instruction.
- They can be in CPU registers or main memory





# 3, 2, 1, & 0 Address Instructions

- The classification is based on arithmetic instructions that have two operands and one result
- The key issue is "how many of these are specified by memory addresses, as opposed to being specified implicitly"
- A 3 address instruction specifies memory addresses for both operands and the result: R ← Op1 op Op2
- A 1 address instruction has a register, called the accumulator register to hold one operand & the result (no address needed): Acc ← Acc op Op1
- A 0 address + uses a CPU register stack to hold both operands and the result: TOS ← TOS op SOS where TOS is Top Of Stack, SOS is Second On Stack)
- The 4-address instruction, hardly ever seen, also allows the address of the next instruction to specified explicitly.



- Explicit addresses for operands, result & next instruction
- Example assumes 24-bit addresses
  - Discuss: size of instruction in bytes



## Fig 2.4 The 3 Address Instruction





- Address of next instruction kept in a processor state register—the PC (Except for explicit Branches/Jumps)
- Rest of addresses in instruction
  - Discuss: savings in instruction word size

# Fig. 2.5 The 2 Address Instruction





- Be aware of the difference between address, Op1Addr, and data stored at that address, Op1.
- Result overwrites Operand 2, Op2, with result, Res
- This format needs only 2 addresses in the instruction but there is less choice in placing data

С

D

Α

<mark>2</mark>/e

S





- Special CPU register, the accumulator, supplies 1 operand and stores result
- One memory address used for other operand





- Uses a push down stack in CPU
- Arithmetic uses stack for both operands. The result replaces them on the TOS
- Computer must have a 1 address instruction to push and pop operands to and from the stack

Computer Systems Design and Architecture Second Edition



# Example 2.1 Expression evaluation for 3-0 address instructions.

Evaluate  $a = (b+c)^*d$ -e for 3- 2- 1- and 0-address machines.

| 3-Address | 2-Address | Accumulator | Stack  |
|-----------|-----------|-------------|--------|
| add a,b,c | load a,b  | lda b       | push b |
| mpy a,a,d | add a,c   | add c       | push c |
| sub a,a,e | mpy a,d   | mpy d       | add    |
|           | sub a,e   | sub e       | push d |
|           |           | sta a       | mpy    |
|           |           |             | push e |
|           |           |             | sub    |
|           |           |             | pop a  |

- # of instructions & # of addresses both vary
- Discuss as examples: size of code in each case





- It is the most common choice in today's general purpose computers
- *Which* register is specified by small "address" (3 to 6 bits for 8 to 64 registers)
- Load and store have one long & one short address: 1 1/2 addresses
- 2-Operand arithmetic instruction has 3 "half" addresses

Computer Systems Design and Architecture Second Edition



### Real Machines are Not So Simple

- Most real machines have a mixture of 3, 2, 1, 0, 1 1/2 address instructions
- A distinction can be made on whether arithmetic instructions use data from memory
- If ALU instructions only use registers for operands and result, machine type is load-store
  - Only load and store instructions reference memory
- Other machines have a mix of register-memory and memorymemory instructions



### **Addressing Modes**

- An addressing mode is hardware support for a useful way of determining a memory address
- Different addressing modes solve different HLL problems
  - Some addresses may be known at compile time, e.g. global vars.
  - Others may not be known until run time, e.g. pointers
  - Addresses may have to be *computed*: Examples include:
    - Record (struct) components:
      - variable base(full address) + const.(small)
    - Array components:
      - const. base(full address) + index var.(small)
  - Possible to store constant values w/o using another memory cell by storing them with or adjacent to the instruction itself.



### **HLL Examples of Structured Addresses**

- Clanguage: rec -> count
  - rec is a pointer to a record: full address variable
  - count is a field name: fixed byte offset, say 24
- C language: v[i]
  - v is fixed base address of array: full address constant
  - i is name of variable index: no larger than array size
- Variables must be contained in registers or memory cells
- Small constants can be contained in the instruction
- Result: need for "address arithmetic."
  - E.g. Address of Rec -> Count is address of Rec + offset of count.

| <b></b> | Count |
|---------|-------|
|         |       |
| Hec →   |       |



# Fig 2.9 Common Addressing Modes a-d



Computer Systems Design and Architecture Second Edition

С

D

S



## Fig 2.9 Common Addressing Modes e-g





# Fig. 2.10a Example Computer, SRC Simple RISC Computer

- 32 general purpose registers of 32 bits
- 32 bit program counter, PC and instruction reg., IR
- 2<sup>32</sup> bytes of memory address space





### **SRC Characteristics**

- (=) Load-store design: only way to access memory is through load and store instructions
- (-) Operation on 32-bit words only, no byte or half-word operations.
- (=) Only a few addressing modes are supported
- (=) ALU Instructions are 3-register type
- (-) Branch instructions can branch unconditionally or conditionally on whether the value in a specified register is = 0, <> 0, >= 0, or < 0.</li>
- (-) Branch-and-link instructions are similar, but leave the value of current PC in any register, useful for subroutine return.
- (–) Can only branch to an address in a register, not to a direct address.
- (=) All instructions are 32-bits (1-word) long.
  - (=) Similar to commercial RISC machines
  - (-) Less powerful than commercial RISC machines.



# **SRC Basic Instruction Formats**

- There are three basic instruction format types
- The number of register specifier fields and length of the constant field vary
- Other formats result from unused fields or parts



Details of formats:

Computer Systems Design and Architecture Second Edition



Computer Systems Design and Architecture Se



# Tbl 2.4 Example Load & Store Instructions: Memory Addressing

- Address can be constant, constant+register, or constant+PC
- Memory contents or address itself can be loaded

| Instruction    | ор | ra | rb | <b>c</b> 1 | Meaning                       | Addressing Mode   |
|----------------|----|----|----|------------|-------------------------------|-------------------|
| ld r1, 32      | 1  | 1  | 0  | 32         | R[1] ← M[32]                  | Direct            |
| ld r22, 24(r4) | 1  | 22 | 4  | 24         | $R[22] \leftarrow M[24+R[4]]$ | Displacement      |
| st r4, 0(r9)   | 3  | 4  | 9  | 0          | $M[R[9]] \leftarrow R[4]$     | Register indirect |
| la r7, 32      | 5  | 7  | 0  | 32         | R[7] ← 32                     | Immediate         |
| ldr r12, -48   | 2  | 12 | —  | -48        | R[12] ← M[PC -48]             | Relative          |
| lar r3, 0      | 6  | 3  | _  | 0          | $R[3] \leftarrow PC$          | Register (!)      |

#### (note use of la to load a constant)



# Assembly Language Forms of Arithmetic and Logic Instructions

| <u>Example</u> | Meaning                                                                 |
|----------------|-------------------------------------------------------------------------|
| neg r1, r2     | ;Negate (r1 = -r2)                                                      |
| not r2, r3     | ;Not (r2 = r3´)                                                         |
| add r2, r3, r4 | ;2's complement addition                                                |
|                | ;2's complement subtraction                                             |
|                | ;Logical and                                                            |
|                | ;Logical or                                                             |
| addi r1, r3, 1 | ;Immediate 2's complement add                                           |
|                | ;Immediate logical and                                                  |
|                | ;Immediate logical or                                                   |
|                | Example<br>neg r1, r2<br>not r2, r3<br>add r2, r3, r4<br>addi r1, r3, 1 |

 Immediate subtract not needed since constant in addi may be negative



2/Ehere are actually only two branch op codes: br rb, rc, c3<2..0> ; branch to R[rb] if R[rc] meets ; the condition defined by c3<2..0> brl ra, rb, rc, c3<2..0> ; R[ra] ← PC; branch as above

• It is c3<2..0>, the 3 lsbs of c3, that governs what the branch condition is:

| <u>lsbs</u> | <u>condition</u> | Assy language form | <b>Example</b> |
|-------------|------------------|--------------------|----------------|
| 000         | never            | brinv              | brlnv r6       |
| 001         | always           | br, brl            | br r5, brl r5  |
| 010         | if $rc = 0$      | brzr, brlzr        | brzr r2, r4    |
| 011         | if rc ≠ 0        | brnz, brlnz        |                |
| 100         | if rc ≥ 0        | brpl, bripi        |                |
| 101         | if rc < 0        | brmi, brlmi        |                |

Note that branch target address is always in register R[rb].
It must be placed there explicitly by a previous instruction.

С

D

Α

S

# Tbl. 2.6 Branch Instruction Examples

| D<br>A            |                | Tbl. 2.6                      | Branch Instruc                                                                                          | ctio   | n E    | xan    | npl    | es         |                   |
|-------------------|----------------|-------------------------------|---------------------------------------------------------------------------------------------------------|--------|--------|--------|--------|------------|-------------------|
| <mark>2</mark> /e | Ass'y<br>lang. | Example instr.                | Meaning                                                                                                 | ор     | ra     | rb     | rc     | c3<br>(20) | Branch<br>Cond'n. |
|                   | brlnv          | brlnv r6                      | $R[6] \leftarrow PC$                                                                                    | 9      | 6      |        |        | 000        | never             |
|                   | br             | br r4                         | $PC \leftarrow R[4]$                                                                                    | 8      |        | 4      |        | 001        | always            |
|                   | brl            | brl r6,r4                     | $R[6] \leftarrow PC;$                                                                                   | 9      | 6      | 4      |        | 001        | always            |
|                   | brzr           | brzr r5,r1                    | $PC \leftarrow R[4]$<br>if (R[1]=0)<br>$PC \leftarrow R[5]$                                             | 8      | _      | 5      | 1      | 010        | zero              |
|                   | brlzr          | brlzr r7,r5,r1                | $R[7] \leftarrow PC;$                                                                                   | 9      | 7      | 5      | 1      | 010        | zero              |
|                   | brnz           | brnz r1, r0                   | if (R[0]≠0) PC← R[1]                                                                                    | 8      |        | 1      | 0      | 011        | nonzero           |
|                   | brlnz          | brlnz r2,r1,r0                | $R[2] \leftarrow PC;$                                                                                   | 9      | 2      | 1      | 0      | 011        | nonzero           |
|                   | brpl<br>brlpl  | brpl r3, r2<br>brlpl r4.r3.r2 | if $(R[0] \neq 0) PC \leftarrow R[1]$<br>if $(R[2] \ge 0) PC \leftarrow R[3]$<br>$R[4] \leftarrow PC$ : | 8<br>9 | —<br>4 | 3<br>3 | 2<br>2 | 100        | plus<br>plus      |
|                   | brmi           | brmi r0, r1                   | if $(R[2] \ge 0) PC \leftarrow R[3]$<br>if $(R[1] < 0) PC \leftarrow R[0]$                              | 8      | _      | 0      | 1      | 101        | minus             |
|                   | brlmi          | brlmi r3,r0,r1                | $R[3] \leftarrow PC;$<br>if (r1<0) PC \leftarrow R[0]                                                   | 9      | 3      | 0      | 1      |            | minus             |

Computer Systems Design and Architecture Second Edition

С



### **Branch Instructions—Example**

C: goto Label3

SRC:

| lar r0, Label3 | ; put branch target address into tgt reg. |
|----------------|-------------------------------------------|
| br r0          | ; and branch                              |
| • • •          |                                           |

Label3 ···
#### Example of conditional branch

| in C:   | #defi  | ne Cost 125     |                                                         |  |  |  |
|---------|--------|-----------------|---------------------------------------------------------|--|--|--|
|         | if (X< | <0) then X = -X | •<br>•                                                  |  |  |  |
| in SRC: |        |                 |                                                         |  |  |  |
| Cost    | .equ   | 125             | ;define symbolic constant                               |  |  |  |
|         | .org   | 1000            | ;next word will be loaded at address 1000 <sub>10</sub> |  |  |  |
| X:      | .dw    | 1               | ;reserve 1 word for variable X                          |  |  |  |
|         | .org   | 5000            | program will be loaded at location 5000 <sub>10</sub>   |  |  |  |
|         | lar    | r31, Over       | ;load address of "false" jump location                  |  |  |  |
|         | ld     | r1, X           | ;load value of X into r1                                |  |  |  |
|         | brpl   | r31, r1         | ;branch to Else if r1≥0                                 |  |  |  |
|         | neg    | r1, r1          | ;negate value                                           |  |  |  |
| Over:   | • • •  |                 | ;continue                                               |  |  |  |
|         |        |                 |                                                         |  |  |  |



#### **RTN** (Register Transfer Notation)

- Provides a formal means of describing machine structure and function
- Is at the "just right" level for machine descriptions
- Does not replace hardware description languages.
- Can be used to describe *what* a machine does (an Abstract RTN) without describing *how* the machine does it.
- Can also be used to describe a particular hardware implementation (A Concrete RTN)



#### RTN Notation (Cont'd.)

- At first you may find this "meta description" confusing, because it is a language that is used to describe a language.
- You will find that developing a familiarity with RTN will aid greatly in your understanding of new machine design concepts.
- We will describe RTN by using it to describe SRC.



#### Some RTN Features— Using RTN to describe a machine's static properties

#### Static Properties

- Specifying registers
  - IR(31..0) specifies a register named "IR" having 32 bits numbered 31 to 0
- "Naming" using the := naming operator:
  - op(4..0) := IR(31..27) specifies that the 5 msbs of IR be called op, with bits 4..0.
  - Notice that this does not create a new register, it just generates another name, or "alias" for an already existing register or part of a register.



This fragment of RTN describes the SRC add instruction. It says, "when the op field of IR = 12, then store in the register specified by the ra field, the result of adding the register specified by the rb field to the register specified by the rc field."



#### Using RTN to describe the SRC (static) Processor State

| program counter              |
|------------------------------|
| (memory addr. of next inst.) |
| instruction register         |
| one bit run/halt indicator   |
| start signal                 |
| general purpose registers    |
|                              |



#### **RTN Register Declarations**

- General register specifications shows some features of the notation
- Describes a set of 32 32-bit registers with names R[0] to R[31]





### Memory Declaration: RTN Naming Operator

- Defining names with formal parameters is a powerful formatting tool
- Used here to define word memory (big endian)





# RTN Instruction Formatting Uses Renaming of IR Bits

Instruction formats  $op\langle 4..0 \rangle := IR\langle 31..27 \rangle$ : operation code field  $ra\langle 4..0 \rangle := IR\langle 26..22 \rangle$ : target register field  $rb\langle 4..0 \rangle := IR\langle 21..17 \rangle$ : operand, address i

$$rc\langle 4..0 \rangle := IR\langle 16..12 \rangle$$
:

$$c1\langle 21..0 \rangle := IR\langle 21..0 \rangle$$
:  
 $c2\langle 16..0 \rangle := IR\langle 16..0 \rangle$ :

 $c3\langle 11..0 \rangle := IR\langle 11..0 \rangle$ :

operation code field target register field operand, address index, or branch target register second operand, conditional test, or shift count register long displacement field short displacement or immediate field count or modifier field



#### Specifying dynamic properties of SRC: RTN Gives Specifics of Address Calculation

Effective address calculations (occur at runtime):

 $\begin{array}{ll} disp\langle 31..0\rangle \coloneqq ((rb=0) \rightarrow c2\langle 16..0\rangle \{ sign \ extend \} \colon & displacement \\ (rb\neq 0) \rightarrow R[rb] + c2\langle 16..0\rangle \{ sign \ extend, \ 2's \ comp. \} ) \colon & address \\ rel\langle 31..0\rangle \coloneqq PC\langle 31..0\rangle + c1\langle 21..0\rangle \{ sign \ extend, \ 2's \ comp. \} \colon & relative \\ address \end{array}$ 

- Renaming defines displacement and relative addrs.
- New RTN notation is used
  - condition  $\rightarrow$  expression means <u>if</u> condition <u>then</u> expression
  - modifiers in { } describe type of arithmetic or how short numbers are extended to longer ones
  - arithmetic operators (+ \* / etc.) can be used in expressions
- Register R[0] cannot be added to a displacement



### Detailed Questions Answered by the RTN for Addresses

- What set of memory cells can be addressed by direct addressing (displacement with rb=0)
  - If c2(16)=0 (positive displacement) absolute addresses range from 0000000H to 0000FFFFH
  - If c2(16)=1 (negative displacement) absolute addresses range from FFFF0000H to FFFFFFH
- What range of memory addresses can be specified by a relative address
  - The largest positive value of C1(21..0) is 2<sup>21</sup>-1 and its most negative value is -2<sup>21</sup>, so addresses up to 2<sup>21</sup>-1 forward and 2<sup>21</sup> backward from the current PC value can be specified
- Note the difference between rb and R[rb]



### Instruction Interpretation: RTN Description of Fetch/Execute

- Need to describe actions (not just declarations)
- Some new notation





#### **RTN Sequence and Clocking**

- In general, RTN statements separated by : take place during the same clock pulse
- Statements separated by ; take place on successive clock pulses
- This is not entirely accurate since some things written with one RTN statement can take several clocks to perform
- More precise difference between : and ;
  - The order of execution of statements separated by : does not matter
  - If statements are separated by ; the one on the left must be complete before the one on the right starts



#### More about Instruction Interpretation RTN

- In the expression IR ← M[PC]: PC ← PC + 4; which value of PC applies to M[PC] ?
- The rule in RTN is that all right hand sides of ":" separated RTs are evaluated before any LHS is changed
  - In logic design, this corresponds to "master-slave" operation of flip-flops
- We see what happens when Run is true and when Run is false but Strt is true. What about the case of Run and Strt both false?
  - Since no action is specified for this case, the RTN implicitly says that no action occurs in this case



#### **Individual Instructions**

- instruction\_interpretation contained a forward reference to instruction\_execution
- instruction\_execution is a long list of conditional operations
  - The condition is that the op code specifies a given inst.
  - The operation describes what that instruction does
- Note that the operations of the instruction are done after (;) the instruction is put into IR and the PC has been advanced to the next inst.



#### RTN Instruction Execution for Load and Store Instructions

 $\begin{array}{ll} \text{instruction\_execution} \coloneqq (\\ \text{Id} (\coloneqq \text{op}=1) \rightarrow \text{R}[\text{ra}] \leftarrow \text{M}[\text{disp}] \colon & \text{load register} \\ \text{Idr} (\coloneqq \text{op}=2) \rightarrow \text{R}[\text{ra}] \leftarrow \text{M}[\text{rel}] \colon & \text{load register relative} \\ \text{st} (\coloneqq \text{op}=3) \rightarrow \text{M}[\text{disp}] \leftarrow \text{R}[\text{ra}] \colon & \text{store register} \\ \text{str} (\coloneqq \text{op}=4) \rightarrow \text{M}[\text{rel}] \leftarrow \text{R}[\text{ra}] \colon & \text{store register relative} \\ \text{Ia} (\coloneqq \text{op}=5) \rightarrow \text{R}[\text{ra}] \leftarrow \text{disp:load displacement address} \\ \text{Iar} (\coloneqq \text{op}=6) \rightarrow \text{R}[\text{ra}] \leftarrow \text{rel} \colon & \text{Ioad relative address} \end{array}$ 

- The in-line definition (:= op=1) saves writing a separate definition
   Id := op=1 for the Id mnemonic
- The previous definitions of disp and rel are needed to understand all the details



#### SRC RTN—The Main Loop

```
ii := instruction_interpretation:

ie := instruction_execution :

ii := ( ¬Run∧Strt → Run ← 1:

Run → (IR ← M[PC]: PC ← PC + 4;

ie));

ie := (

Id (:= op= 1) → R[ra] ← M[disp]:

Idr (:= op= 2) → R[ra] ← M[rel]:

....

stop (:= op= 31) → Run ← 0:

); ii
```

statement on the opcode

Big switch

Thus ii and ie invoke each other, as coroutines.



#### Use of RTN Definitions: Text Substitution Semantics

$$\begin{split} & \text{Id } (:= \text{op}= 1) \to \mathsf{R}[\text{ra}] \leftarrow \mathsf{M}[\text{disp}]: \\ & \text{disp}\langle 31..0 \rangle := ((\text{rb}=0) \to c2\langle 16..0 \rangle \{\text{sign extend}\}: \\ & (\text{rb}\neq 0) \to \mathsf{R}[\text{rb}] + c2\langle 16..0 \rangle \{\text{sign extend}, 2\text{'s comp.}\} ): \end{split}$$

An example:

- If IR = 00001 00101 00011 0000000000001011
- then  $Id \rightarrow R[5] \leftarrow M[R[3] + 11]$ :



#### **RTN Descriptions of SRC Branch Instructions**

- Branch condition determined by 3 lsbs of inst.
- Link register (R[ra]) set to point to next inst.

$$\begin{array}{ll} \mbox{cond} := (\ c3\langle 2..0\rangle = 0 \rightarrow 0: \\ \ c3\langle 2..0\rangle = 1 \rightarrow 1: & alw \\ \ c3\langle 2..0\rangle = 2 \rightarrow R[rc] = 0: & \text{if } r \\ \ c3\langle 2..0\rangle = 3 \rightarrow R[rc] \neq 0: & \text{if } r \\ \ c3\langle 2..0\rangle = 4 \rightarrow R[rc]\langle 31\rangle = 0: & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 2..0\rangle = 5 \rightarrow R[rc]\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1): & \text{if } r \\ \ c3\langle 31\rangle = 1$$

never ways register is zero register is nonzero

- if positive or zero
- if negative
- conditional branch

branch and link



#### **RTN for Arithmetic and Logic**

add (:= op=12) 
$$\rightarrow$$
 R[ra]  $\leftarrow$  R[rb] + R[rc]:  
addi (:= op=13)  $\rightarrow$  R[ra]  $\leftarrow$  R[rb] + c2 $\langle 16..0 \rangle$  {2's comp. sign ext.}:  
sub (:= op=14)  $\rightarrow$  R[ra]  $\leftarrow$  R[rb] - R[rc]:  
neg (:= op=15)  $\rightarrow$  R[ra]  $\leftarrow$  R[rb] - R[rc]:  
and (:= op=20)  $\rightarrow$  R[ra]  $\leftarrow$  R[rb]  $\land$  R[rc]:  
andi (:= op=21)  $\rightarrow$  R[ra]  $\leftarrow$  R[rb]  $\land$  c2 $\langle 16..0 \rangle$  {sign extend}:  
or (:= op=22)  $\rightarrow$  R[ra]  $\leftarrow$  R[rb]  $\lor$  R[rc]:  
ori (:= op=23)  $\rightarrow$  R[ra]  $\leftarrow$  R[rb]  $\lor$  c2 $\langle 16..0 \rangle$  {sign extend}:  
not (:= op=24)  $\rightarrow$  R[ra]  $\leftarrow$   $\neg$ R[rc]:

■ Logical operators: <u>and ∧ or v</u> and <u>not</u> ¬



#### **RTN for Shift Instructions**

- Count may be 5 lsbs of a register or the instruction
- Notation: @ replication, # concatenation

$$\begin{split} n &:= ( (c3\langle 4..0\rangle=0) \rightarrow R[rc]\langle 4..0\rangle: \\ (c3\langle 4..0\rangle\neq 0) \rightarrow c3\langle 4..0\rangle ): \\ \text{shr} (:= op=26) \rightarrow R[ra]\langle 31..0\rangle \leftarrow (n @ 0) \ \# R[rb]\langle 31..n\rangle: \\ \text{shra} (:= op=27) \rightarrow R[ra]\langle 31..0\rangle \leftarrow (n @ R[rb]\langle 31\rangle) \ \# R[rb]\langle 31..n\rangle: \\ \text{shl} (:= op=28) \rightarrow R[ra]\langle 31..0\rangle \leftarrow R[rb]\langle 31-n..0\rangle \ \# (n @ 0): \\ \text{shc} (:= op=29) \rightarrow R[ra]\langle 31..0\rangle \leftarrow R[rb]\langle 31-n..0\rangle \ \# R[rb]\langle 31..32-n\rangle: \end{split}$$



 Arithmetic shift right by 13 concatenates 13 copies of the sign bit with the upper 19 bits of the operand

shra r1, r2, 13

R[2]= 1001 0111 1110 1010 1110 1100 0001 0110



#### Assembly Language for Shift

• Form of assembly language instruction tells whether to set c3=0

shr ra, rb, rc ;Shift rb right into ra by 5 lsbs of rc shr ra, rb, count ;Shift rb right into ra by 5 lsbs of inst shra ra, rb, rc ;AShift rb right into ra by 5 lsbs of rc shra ra, rb, count ;Shift rb right into ra by 5 lsbs of inst shl ra, rb, rc ;Shift rb left into ra by 5 lsbs of rc shl ra, rb, count ;Shift rb left into ra by 5 lsbs of inst shc ra, rb, rc ;Shift rb left into ra by 5 lsbs of rc shc ra, rb, count;Shift rb circ. into ra by 5 lsbs of rc



#### End of RTN Definition of instruction\_execution

```
nop (:= op= 0) \rightarrow :
stop (:= op= 31) \rightarrow Run \leftarrow 0:
);
instruction_interpretation.
```

No operation Stop instruction End of instruction\_execution

- We will find special use for nop in pipelining
- The machine waits for Strt after executing stop
- The long conditional statement defining instruction\_execution ends with a direction to go repeat instruction\_interpretation, which will fetch and execute the next instruction (if Run still =1)



#### Confused about RTN and SRC?

- SRC is a Machine Language
  - It can be interpreted by either hardware or software simulator.
- RTN is a Specification Language
  - Specification languages are languages that are used to specify other languages or systems—a *metalanguage*.
  - Other examples: LEX, YACC, VHDL, Verilog

Figure 2.10 may help clear this up...



Copyright © 2004 Pearson Prentice Hall, Inc.



### A Note about Specification Languages

- They allow the description of *what* without having to specify *how*.
- They allow precise and unambiguous specifications, unlike natural language.
- They reduce errors:
  - errors due to misinterpretation of imprecise specifications written in natural language
  - errors due to confusion in design and implementation "human error."
- Now the designer must debug the specification!
- Specifications can be automatically checked and processed by tools.
  - An RTN specification could be input to a simulator generator that would produce a simulator for the specified machine.
  - An RTN specification could be input to a compiler generator that would generate a compiler for the language, whose output could be run on the simulator.



#### Addressing Modes Described in RTN (Not SRC)

|                   |                  |                           | rarget register    |
|-------------------|------------------|---------------------------|--------------------|
| <u>Mode name</u>  | <u>Assembler</u> | RTN meaning               | <u>Use</u>         |
|                   | <u>Syntax</u>    | $\checkmark$              |                    |
| Register          | Ra               | R[t] ← R[a]               | Tmp. Var.          |
| Register indirect | (Ra)             | R[t] ← M[R[a]]            | Pointer            |
| Immediate         | #X               | R[t] ← X                  | Constant           |
| Direct, absolute  | Х                | R[t] ← M[X]               | Global Var.        |
| Indirect          | (X)              | R[t] ← M[ M[X] ]          | Pointer Var.       |
| Indexed, based,   | X(Ra)            | R[t] ← M[X + R[a]]        | Arrays, structs    |
| or displacement   |                  |                           |                    |
| Relative          | X(PC)            | R[t] ← M[X + PC]          | Vals stored w pgm  |
| Autoincrement     | (Ra)+            | R[t] ← M[R[a]]; R[a] ← R  | [a] + 1 Sequential |
| Autodecrement     | - (Ra)           | R[a] ← R[a] - 1; R[t] ← M | [R[a]] access.     |
|                   |                  |                           |                    |

Torgot register



## Fig. 2.12 Register transfers can be mapped to Digital Logic Circuits.

Implementing the RTN statement A ← B



Copyright © 2004 Pearson Prentice Hall, Inc.



#### Fig. 2.13 Multiple Bit Register Transfer

• Implementing  $A(m..1) \leftarrow B(m..1)$ 





## Fig. 2.14 Data Transmission View of Logic Gates

Logic gates can be used to control the transmission of data:



Copyright © 2004 Pearson Prentice Hall, Inc.





Copyright © 2004 Pearson Prentice Hall, Inc.



#### Fig. 2.16 m-bit Multiplexer and Symbol



![](_page_68_Figure_3.jpeg)

Multiplexer gate signals G<sub>i</sub> may be produced by a binary to one-out-of-n decoder

![](_page_69_Figure_0.jpeg)

![](_page_69_Figure_1.jpeg)

- Merged data can be separated by gating at the right time
- It can also be strobed into a flip-flop when valid

![](_page_70_Picture_0.jpeg)

![](_page_70_Figure_1.jpeg)

![](_page_70_Figure_2.jpeg)

- Selected gate and strobe determine which Register is Transferred to where.
- $A \leftarrow C$ , and  $B \leftarrow C$  can occur together, but not  $A \leftarrow C$ , and  $B \leftarrow D$

<mark>2</mark>/e

![](_page_71_Picture_0.jpeg)

#### Fig. 2.19 Open-Collector NAND Gate Output Circuit

![](_page_71_Figure_2.jpeg)

Copyright © 2004 Pearson Prentice Hall, Inc.


#### (c) Truth table

Copyright © 2004 Pearson Prentice Hall, Inc.



### Fig. 2.21 Open Collector Wired OR Bus

- DeMorgan's OR by not of AND of nots
- Pull-up resistor removed from each gate open collector
- One pull-up resistor for whole bus
- Forms an OR distributed over the connection







# Fig. 2.23 Registers Connected by a Tri-state Bus



- Can make any register transfer R[i]←R[j]
- Can't have  $G_i = G_i = 1$  for  $i \neq j$
- Violating this constraint gives low resistance path from power supply to ground—with predictable results!



Notice that what could be described in one step in the abstract RTN took three steps on this particular hardware Computer Systems Design and Architecture Second Edition

#### Figure 2.25 Timing of the Register Transfers



С

D

S



#### RT's Possible with the One Bus Structure

- R[i] or Y can get the contents of anything but Y
- Since result different from operand, it cannot go on the bus that is carrying the operand
- Arithmetic units thus have result registers
- Only one of two operands can be on the bus at a time, so adder has register for one operand
- $R[i] \leftarrow R[j] + R[k]$  is performed in 3 steps:  $Y \leftarrow R[k]$ ;  $Z \leftarrow R[j] + Y$ ;  $R[i] \leftarrow Z$ ;
- R[i] ← R[j] + R[k] is high level RTN description
- Y←R[k]; Z←R[j] + Y; R[i]←Z; is concrete RTN
- Map to control sequence is: R[2]<sub>out</sub>, Y<sub>in</sub>; R[1]<sub>out</sub>, Z<sub>in</sub>; Z<sub>out</sub>, R[3]<sub>in</sub>;



## From Abstract RTN to Concrete RTN to Control Sequences

- The ability to begin with an abstract description, then describe a hardware design and resulting concrete RTN and control sequence is powerful.
- We shall use this method in Chapter 4 to develop various hardware designs for SRC



#### **Chapter 2 Summary**

- Classes of computer ISAs
- Memory addressing modes
- SRC: a complete example ISA
- RTN as a description method for ISAs
- RTN description of addressing modes
- Implementation of RTN operations with digital logic circuits
- Gates, strobes, and multiplexers