## Chapter 2

## Software Architecture of the 80386 Microprocessor

## Outline

2.2 Internal Architecture of the 80386 Microprocessor
2.3 Software Model of the 80386 Microprocessor
2.4 Memory Address Space and Data Organization
2.5 Data Types
2.6 Segment Registers and Memory Segmentation
2.7 Instruction Pointer
2.8 Data Registers
2.9 Pointer and Index Registers
2.10 Flags Register
2.11 Generating a Memory Address
2.12 The Stack
2.13 Input/Output Address Space

## Internal Architecture of the 80386DX Microprocessor

- parallel processing -> high
 performance
- Six processing units:
- Bus units
- Execution unit
- Segment unit
- Page unit
- Prefetch unit
- Decode unit
- Each unit has a dedicated function and they all operate at the same time


## Bus Interface Unit

- Interface to the outside world
- Responsible for
- Fetching instruction
- Reading and writing of data for memory
- Inputting and outputting of data for input/output peripherals
- Information transfers over the microprocessor bus
- De-multiplexed bus
- 386DX
- 32-bit data bus
- Real-mode: 20-bit address, 1M-byte physical address space
- Protected-mode: 32-bit address bus, 4G-byte physical address space


## Prefetch Unit

- Instruction Stream queue
- Whenever the queue is not full, prefetch the next sequential instructions
- Queue-16-byte; 4-byte/memory cycle
- Prioritizes bus accesses—data operands highest priority
- FIFO instruction queue
- Holds bytes of instruction code until the decode unit is ready to accept them.
- Time to fetch many of the instructions in a microcomputer program is "hidden".
- Bus unit "Idle state" - If queue is full and the execution unit is not requesting access to data in memory, BIU does not perform bus cycles.


## Decode Unit

- Offloads the responsibility of instruction decoding from the execution unit.
- Reads machine code instructions from the output side of the instruction queue
- Decodes the instructions into the microcode instruction format used by the execution unit
- Contains an instruction queue that holds 3 fully decoded instruction
- Decoded instructions are held until requested by the execution unit


## Execution Unit

- Responsible for executing instructions
- Element of the EU
- Arithmetic/logic unit (ALU)
- Performs the operation identified by the instruction: ADD, SUB, AND, etc.
- Flags register
- Holds status and control information
- General-purpose registers
- Holds address or data information
- Control ROM
- Contains microcode sequences that define operations performed by machine instructions
- Special multiply, shift, and barrel shift hardware
- Accelerate multiply, divide, and rotate operations


## Operations of the Execution Unit

- Reads instructions from the instruction queue
- Accesses general purpose registers if necessary
- Generates memory address of data storage locations in memory if necessary
- Passes memory addresses to the segmentation and paging units and requests the bus unit to perform read or write bus cycles to access data operands in memory
- Performs the operation defined by the instruction on the selected data
- Tests the state of flags if necessary
- Updates the state of the flags based on the result produced by executing the instruction.


## Segmentation and Paging Unit

- Off-load memory-management and protection services from the bus unit
- Segmentation unit
- Implements real-mode and protected-mode segmentation model
- Contains general registers, segment registers, and instruction pointer
- Holds address and data operand information
- Segmentation unit address generation logic
- Real-mode address generation
- CS:IP $\rightarrow$ code
- DS:SI $\rightarrow$ data
- Protected-mode address translation
- Translates logical address to linear address
- Protection checking


## Segmentation and Paging Unit

- Paging unit
- Implements protected-mode paging model
- Contains translation look-aside buffer
- Acts as a cache for recently used page directory entries and page table entries
- Translates linear address output of segmentation unit to a physical page address
- Not used in real mode


## The Software Model

- programmer's understanding the operation of the microcomputer from a software point of view
- Elements of the software model
- Register set
- Memory address space
- Input/output address space
- What the programmer must know about the microprocessor
- Registers available within the device
- Purpose of each registers
- Function of each registers
- Operating capabilities of each registers
- Limitations of each register
- Size of memory and input/output address spaces
- Organization of memory and input/output address spaces
- Types of data


## Register Set



- 8-32-bit registers
- (4) Data registers- EAX, EBX, ECX, EDX, can be used as 32, 16 or 8bit
- (2) Pointer registers- EBP, ESP
- (2) Index registers- ESI, EDI
- 7-16-bit registers
- (1) Instruction pointer- IP
- (6) Segment registers- CS, DS, SS, ES, FS, GS
- Flags (status) register-EFLAGS
- Control register- CR0


## Memory and Input/Output



- Architecture implements independent memory and input/output address spaces
- Memory address space- 1,048,576 bytes long (1M-byte)
- Input/output address space-65,536 bytes long (64K-bytes)


## Address Space



- Memory organized as individual bytes
- Memory address space corresponds to the 1M addresses in the range 00000 H to FFFFFH $00000 \mathrm{H}=00000000000000000000_{2}$ FFFFFH= $11111111111111111111_{2}$ $2^{20}=1,048,576=1 \mathrm{M}$ unique addresses
- Data organization:
- Byte: content of any individual byte address
- Word: contents of two contiguous byte addresses
- Double-word: contents of 4 contiguous byte addresses


## Dedicated and General Use of Memory

- Memory address space is partitioned into
 general use and dedicated use areas
- Dedicated ( OH - 3FFH):
- Interrupt vector table
- 1st 1024 bytes
- Addresses $0 \mathrm{H} \rightarrow 3$ 3FH
- 256 4-byte pointers
- 16-bit segment base address
- 16-bit offset
- General use:
- 400H $\rightarrow$ FFFFFH
- Used for stack, code, and data


## Aligned Words, Double words

- Words and double words of data can be stored in memory at an even or odd


Aligned double words address boundary

- Examples of even address boundaries: $00000_{16}, 00002_{16}, 00004_{16}$
- Examples of odd address boundaries: $00001_{16}, 00003_{16}, 00005_{16}$
- Aligned double-words are stored at even addresses that are a multiple of 4
- Examples are double words 0 and 4


## Misaligned Words

- $80 \times 86$ architecture supports access or aligned or misaligned data

- Words stored across a double-word boundary are said to be "misaligned or unaligned words"
Examples are words 3 and 7
- Misaligned double-words are stored at addresses that are not a multiple of 4
Examples: double words 1, 2 and 3
- There is a performance impact for accessing unaligned data in memory (32-bit data bus)


## Examples of Words of Data



Example [Fig. 2.5 (a)]
$\left(02001_{16}\right)=0101$ 1010 $_{2}=5 \mathrm{AH}=$ MS-byte $\left(02001_{16}\right)=11110000_{2}=$ FOH= LS-byte
as a word they give
$0101101011110000_{2}=5 \mathrm{AFOH}$

## Example of Double Word

| Address | Memory (binary) | Memory (hexadecimal) |
| :---: | :---: | :---: |
|  |  |  |
| 02105 ${ }_{16}$ | 00000001 | 01 |
| 02104 ${ }_{16}$ | 00100011 | 23 |
| 0210316 | 10101011 | AB |
| $02102{ }_{16}$ | 11001101 | CD |
| $02101{ }_{16}$ | $x \times x \times 2 x x x$ | XX |
| 02100 ${ }_{16}$ | XXXX $\times$ XXX | XX |
|  |  |  |

(a)

- LSB:

Address $02102 \mathrm{H}=\mathrm{CDH}$

- MSB:

Address 02105H = 01H

- Arranging as 32-bit data gives

Address 02102H =0123ABCDH
$=000000010010001110101011$
$11001101_{2}$

- Aligned or misaligned double word?


## Unsigned Integers

MSB

|  | LSB |  |  |  |  |  |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- |
| $D_{7}$ |  |  |  |  |  |  |

(a)

(b)

(c)

- All numbers are binary in memory
- All bits represent data
- Types:

Sizes Range<br>8-bit $\quad \mathrm{OH} \rightarrow 255_{10}$<br>16-bit $\mathrm{OH} \rightarrow 65,535_{10}$<br>32-bit $\mathrm{OH} \rightarrow 4,294,967,295_{10}$

## Signed Integers


(a)

(b)


- MSB is sign bit ( $0 / 1$-> +is + )
- Remaining bits represent value
- Negative numbers expressed in 2's complement notation
- Types:

$$
\begin{array}{ll}
\text { Sizes } & \text { Range } \\
\text { 8-bit } & -128 \rightarrow+127 \\
\text { 16-bit } & -32,768 \rightarrow+32,767 \\
\text { 32-bit } & -2,147,483,648 \rightarrow+2,147,483,647
\end{array}
$$

## Integer Examples

## Example 2.3

Unsigned double word integer $=00010000 \mathrm{H}$
Expressing in binary form
$=00000000000000010000000000000000_{2}=2^{16}=65,536$

## Example 2.4

Signed double word integer = FFFEFFFFH
Expressing in binary form
= $11111111111111101111111111111111_{2}$
Sign bit $=1=$ minus
Subtracting 1 from LSB and complementing all bits gives
$=-0000000000000010000000000000^{0001_{2}}$
$=-2^{16}+2^{0}$
$=-65,537$

## BCD Numbers

| Decimal | BCD |
| :---: | :---: |
| 0 | 0000 |
| 1 | 0001 |
| 2 | 0010 |
| 3 | 0011 |
| 4 | 0100 |
| 5 | 0101 |
| 6 | 0110 |
| 7 | 0111 |
| 8 | 1000 |
| 9 | 1001 |

(a)

(b)

(c)

- Direct coding of numbers as binary coded decimal (BCD) numbers supported
- Unpacked BCD [Fig.2.10(b)]
- Lower four bits contain a digit of a BCD
number
- Upper four bits filled with zeros (zero filled)
- Packed BCD [Fig. 2.10(c)]
- Lower significant BCD digit held in lower 4 bits of byte
- More significant BCD digit held in upper 4 bits of byte
Example: Packed BCD byte at address 01000 H is $10010001_{2}$, what is the decimal number?
Organizing as BCD digits gives,

$$
1001_{\mathrm{BCD}} 0001_{\mathrm{BCD}}=91_{10}
$$

## ASCII Data

|  |  |  | $b_{7}$ | 0 | 0 | 0 | 0 | 1 | 1 | 1 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |

(a)

- American Code for Information Interchange (ASCII) code
- ASCII information storage in memory
- Coded one character per byte
- 7 LS-bits $=b_{7} b_{6} b_{5} b_{4} b_{3} b_{2} b_{1}$
- MS-bit filled with 0

Example: Addresses 01100H-01104H contain ASCII coded data 01000001, 01010011, 01000011, 01001001, and 01001001, respectively. What does the data stand for?

$$
\begin{aligned}
& 01000001_{\text {ASCII }}=\mathrm{A} \\
& 0 \\
& 0
\end{aligned} 1010011_{\mathrm{ASCI}}=\mathrm{S}
$$

## Active Segments of Memory



- Memory Segmentation
- Not all of the 80386 real-mode address space is active at one time
- Address value in a segment register points to the lowest addressed byte in an active segment
- Size of each segment is 64 K contiguous byte
- Total active memory is 384k bytes
- 64K-bytes for code
- 64K-bytes for stack
- 256K-bytes for data
- Six Segment Registers
- Code segment (CS) register- Code storage
- Stack segment (SS) register- Stack storage
- Data segment (DS, ES, FS, GS) register-

Data storage

## User access, Restrictions, and Orientation

- Segment registers are user accessible

- Programmer can change values under software control
- Permits access to other parts of memory
- Example: a new data space can be activated by replace values in DS, ES, FS, and GS
- Restriction on the starting address of a segment of memory
- Reside on a 16 byte address boundary
- Examples: $00000 \mathrm{H}, 00010 \mathrm{H}, 0002 \mathrm{H}$
- Orientation of segments:
- Contiguous-A\&B or D,E\&G
- Adjacent-none shown
- Disjointed-C\&F
- Overlapping-B\&C


## Accessing Code Memory Space

Instruction pointer (IP): location of the next double word of instruction code to be fetched from the current code segment

- 16-bit offset—address pointer
- Logical address CS:IP forms 20-bit physical address for next instruction
- Instruction fetch sequence
- 80386DX prefetches a double word of instruction code from code
segment in memory into instruction stream queue
- $\mathrm{IP}=\mathrm{IP}+4$
- Decoded by the instruction decoder
- Placed in the instruction queue to await execution
- 80386DX prefetches up to 16 byte of code
- Decoded instruction is read from output of instruction queue - Operands read from data memory, internal registers, or the instruction queue
- Operation specified by the instruction performed on operands
- Results written to data memory or and internal register
- Flags updated


## General Purpose Data Registers

- Four general purpose data registers

(a)

| Register | Operations |
| :---: | :---: |
| EAX, AX, AH, AL | ASCII adjust for addition/subtraction Convert byte to word/word to double word/ double word to quad word <br> Decimal adjust for addition/subtraction <br> Unsigned multiply/divide <br> Signed divide <br> Input/output operations <br> Load/store flags <br> Load/compare/store string operations <br> Table-lookup translations |
| EBX, BX, BH, BL | Table-lookup translations |
| ECX, CX, CH, CL | Loop operations <br> Repeat string operations Variable shift/rotate operations |
| EDX, DX, DH, DL | Indirect input/output operations Input/output string operations Unsigned word/double word multiply Signed word/double word divide Unsigned word/double word divide |

- Accumulator (A) register
- Base (B) register
- Count (C) register
- Data (D) register
- Can hold 8-bit, 16-bit, or 32-bit data
- AH/AL = high and low byte value
- $A X$ = word value
- EAX = double word value
- Uses:
- Hold data such as source or destination operands for most operations-ADD, AND, SHL
- Hold address pointers for accessing memory
- Some also have dedicated special uses
- C-count for loop,
- B-table look-up translations, base address
- D-indirect I/O and string I/O


## Pointer Registers

- Pointers are offset addresses used to access information in a segment of memory -Two pointer registers
-Stack pointer register
- ESP = 32-bit extended stack pointer
- SP = 16-bit stack pointer
- Base pointer register
- EBP = 32-bit extended base pointer
- BP = 16-bit base pointer
- Use to access information in stack segment of memory
- SP and BP are offsets from the current value of the stack segment base address
- Select a specific storage location in the current 64K-byte stack segment
- SS:SP—points to top of stack (TOS)
- SS:BP—points to data in stack


## Index Registers

- Value in an index register is also an address pointer
-Two index registers
- Source index register
- ESI = 32-bit source index register

- SI = 16-bit source index register

Destination index registers

- EDI = 32-bit destination index register
- DI = 16-bit destination index register
- Used to access source and destination DI---- Dostination inderyperands in data segment of memory
- DS:SI—points to source operand in data segment
- DS:DI-points to destination operand in data segment
- Also used to access information in the extra segment (ES)


## Flags Register

- FLAGS register: 32-bit register used to hold single bit status and control information called flags
- 9 active flags in real mode

- Two categories
- Status Flags-indicate conditions that are the result of executing an instruction
- Execution of most instructions updates status
- Used by control flow instructions as test conditions
- Control Flags-control operating functions of the processor
- Used by software to turn on/off operating capabilities


## Status Flags

- Examples of Status Flags-CF, PF, ZF, SF, OF, AF
- Carry flag (CF)
-1 = carry-out or borrow-in from MSB of the result during the execution of an arithmetic instruction
- 0 = no carry or borrow has occurred
- Parity flag (PF)
- 1 = result produced has even parity
- $0=$ result produced has odd parity
- Zero flag (ZF)
- 1 = result produced is zero
- 0 = result produced is not zero
- Sign bit (SF)
- 1 = result is negative
- 0 = result is positive
- Others
- Overflow flag (OF)
- Auxiliary carry flag (AF)


## Control Flags

Trap flag (TF)

- 1/0 = turn on/off single-step mode
- Mode useful for debugging

- Employed by monitor to execute one instruction at a time (single step execution)
Interrupt flag (IF)
- Used to enable/disable external maskable interrupt requests
- 1/0 = enable/disable external interrupts

为

- Used to determine the direction in which string operations occur
- 1/0 = automatically decrement/increment string address-proceed from high address to low address


## Logical and Physical Addresses



## Generating a Real-Mode Memory Address

Segment base address $=1234 \mathrm{H}$
Offset $=0022 \mathrm{H}$


Adding binary segment address and offset $00010010001101000000_{2}+0000000000100010_{2}$
$=00010010001101100010_{2}$
$=12362 \mathrm{H}$
In hex:
$12340 \mathrm{H}+0022 \mathrm{H}=12362 \mathrm{H}$

## Boundaries of a Segment

- Six active segments: CS, DS, ES. GS, FS, SS
- Each 64K-bytes in size $\rightarrow$ maximum of



## Relationship between Logical and Physical Addresses



- Many different logical address can map to the same physical address
- Examples:
$2 \mathrm{BH}: 13 \mathrm{H}=002 \mathrm{~B} 0 \mathrm{H}+0013 \mathrm{H}=002 \mathrm{C} 3 \mathrm{H}$
$2 \mathrm{CH}: 3 \mathrm{H}=002 \mathrm{COH}+0003 \mathrm{H}=002 \mathrm{C} 3 \mathrm{H}$
- Said to be "aliases"


## The Stack

- Stack-temporary storage area for information such as data and addresses
- Located in stack segment of memory
- Real mode-64K bytes long
- Organized as 32K words
- Information saved as words or double words, not bytes
Organization of stack
- SS:0002H $\rightarrow$ end of stack (lowest addressed word)
- SS:FFFEH $\rightarrow$ bottom of stack (highest addressed word)
- SS:SP $\rightarrow$ top of stack (last stack location where data was pushed)
- Stack grows down from higher to lower address
- Used by call, push, pop, and return operations

PUSH ESI $\rightarrow$ causes the current content of the ESI
register to be pushed onto the stack
POP ESI $\rightarrow$ causes the value at the top of the stack
to be popped back into the ESI register

## The Stack - Push Stack Operation

- Status of the stack prior to execution of the instruction PUSHAX


$$
\begin{aligned}
& \text { AX }=1234 \mathrm{H} \\
& \text { SS }=0105 \mathrm{H} \\
& A_{\text {EOS }}=S S: 02 \rightarrow 01052 \mathrm{H}=\text { end of stack } \\
& S P=0008 \mathrm{H} \\
& A_{\text {TOS }}=S S: S P \rightarrow 01058 \mathrm{H}=\text { current top of } \\
& \text { stack } \\
& A_{\text {BOS }}=\text { SS:FFFE } \rightarrow 1104 \mathrm{EH} \\
& \text { BBAAH }=\text { Last value pushed to stack } \\
& \text { Addresses }<01058 \mathrm{H}=\text { invalid stack data } \\
& \text { Addresses }>=01058 \mathrm{H}=\text { valid stack data }
\end{aligned}
$$

- In response to the execution of PUSH AX instruction

1. SP $\rightarrow 0006$ decremented by 2 $\mathrm{A}_{\text {TOP }} \rightarrow 01056 \mathrm{H}$
2. Memory write to stack segment

$$
\begin{aligned}
& \mathrm{AL}=34 \mathrm{H} \rightarrow 01056 \mathrm{H} \\
& \mathrm{AH}=12 \mathrm{H} \rightarrow 01057 \mathrm{H}
\end{aligned}
$$

## The Stack- Pop Stack Operation

- Status of the stack prior to execution of the instruction POP AX

$$
\begin{aligned}
& \mathrm{AX}=\mathrm{XXXXH} \\
& \mathrm{SS}=0105 \mathrm{H} \\
& \mathrm{SP}=0006 \mathrm{H} \\
& \mathrm{~A}_{\text {Tos }}=\mathrm{SS}: \mathrm{SP} \rightarrow 01056 \mathrm{H}=\text { current top of } \\
& \text { stack } \\
& 1234 \mathrm{H}=\text { Last value pushed to stack } \\
& \text { Addresses }<01056 \mathrm{H}=\text { invalid stack data } \\
& \text { Addresses }>=01056 \mathrm{H}=\text { valid stack data }
\end{aligned}
$$



execution of POP AX instruction

1. Memory read to $A X$

$$
\begin{aligned}
& 01056 \mathrm{H}=34 \mathrm{H} \rightarrow \mathrm{AL} \\
& 01057 \mathrm{H}=12 \mathrm{H} \rightarrow \mathrm{AH}
\end{aligned}
$$

2. $\mathrm{SP} \rightarrow 0008 \mathrm{H}$ incremented by 2 $\mathrm{A}_{\text {TOP }} \rightarrow 01058 \mathrm{H}$
execution of ${ }^{(19)} \mathrm{POP} \mathrm{BX}$ instruction
3. Memory read to BX

$$
\begin{aligned}
& 01058 \mathrm{H}=\mathrm{AAH} \rightarrow \mathrm{BL} \\
& 01059 \mathrm{H}=\mathrm{BBH} \rightarrow \mathrm{BH}
\end{aligned}
$$

2. $\mathrm{SP} \rightarrow 000 \mathrm{AH}$ incremented by 2 $\mathrm{A}_{\text {TOP }} \rightarrow 0105 \mathrm{AH}$

## I/O Address Space

PAGE 0


- Input/output address space
- Place where I/O devices are normally implemented
- I/O addresses are only 16-bits in length
- Independent 64K-byte address space
- Address range 0000 H through FFFFH
-Page 0
- First 256 byte addresses $\rightarrow 0000 \mathrm{H}$
- 00FFH
- Can be accessed with direct or variable I/O instructions
- Ports F8H through FF reserved


## Organization of the I/O Data

- Input/output data organization
- Supports byte, word, and double-word I/O ports
- 64K independent byte-wide l/O ports
- 32 K independent word-wide I/O ports
- 16K independent double-word-wide I/O ports
- Examples (aligned I/O ports):

Byte ports $0,1,2 \rightarrow$ addresses $0000 \mathrm{H}, 0001 \mathrm{H}$, and 0002 H
Word ports $0,1,2 \rightarrow$ addresses $0000 \mathrm{H}, 0002 \mathrm{H}, 0004 \mathrm{H}$
Double-word ports $0,1,2 \rightarrow$ addresses $0000 \mathrm{H}, 0004 \mathrm{H}$, 0008H
Advantages of Isolated I/O
. Complete memory address space available for use by


- I/O instructions tailored to maximize performance
- Disadvantage of Isolated I/O
- All inputs/outputs must take place between I/O port and accumulator register

