Computer Architecture (Final Exam Review)

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/66

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

67 Terms

1
New cards

ALU

Arithmetic logic unit. Performs arithmetic operations and logic operations.

2
New cards

Amdahl's Law

The law of diminishing returns for speedup in processors.

3
New cards

CISC

Complex Instruction Set Computer. Design where instructions can execute low-level operations

Example: Intel

4
New cards

CPI

Cycles per Instruction. Average number of clock cycles per instruction, and measures performance

5
New cards

CPU

Central Processing Unit. Brain of the computer. Adds and tests numbers. Includes ALU, memory, and registers.

6
New cards

DMA

Direct Memory Access. Allows hardware to access RAM independent of the CPU

7
New cards

FP

Floating Point. Arithmetic approximation of real numbers

8
New cards

GFLOPS

Billion Floating Point Operations Per Second. Measurement of computer performance in 10^9

9
New cards

GPU

Graphics Processor Unit. Renders images to display on a device. Ex: GeForece graphics card

10
New cards

INT

Integer. A whole number that can be positive, negative, or 0.

11
New cards

LRU

Least Recently Used. Algorithm for selecting less recently used pages.

12
New cards

LSB

Least Significant Bit. Bit in a binary number with the lowest numerical value

13
New cards

Microcode

A low-level instruction set

14
New cards

MFLOPS

Million Floating Point Operations Per Second. Measurement of computer performance in 10^6

15
New cards

MIPS

Million Instruction Per Second. An instruction set architecture (ISA) that is a RISC

16
New cards

MMU

Memory Management Unit. Translates virtual addresses to physical addresses

17
New cards

Moore's Law

Processing power for computers will double every two years

18
New cards

MSB

Most Significant Bit. Bit in a binary number with the highest numerical value

19
New cards

Pipeline

Overlapping multiple instructions/stages during execution, like an assembly line does

20
New cards

PC

Program Counter. Keeps track of the next instruction to be executed.

21
New cards

PLA

Programmable Logic Array. Device used to implement logic circuits

22
New cards

DRAM

Dynamic RAM. RAM that must be constantly refreshed.

23
New cards

RAM

Random Access Memory. Stores currently used data for quick access.

24
New cards

SRAM

Static RAM. RAM that does not need to be constantly refreshed.

25
New cards

RISC

Reduced Instruction Set Computer. An instruction set architecture (ISA) that can have a lower CPI than a CISC.

Example: MIPS

26
New cards

TLB

Translation Look-aside Buffer. Memory cache for tags that are translated to a physical page number

27
New cards

TFLOPS

Trillion Floating Point Operations per Second. Measurement of computer performance in 10^12

28
New cards

VHDL

VHISC (Very High Speed Integrated Circuit) Hardware Description Language. Language that describes digital systems.

29
New cards

VHSIC

Very High Speed Integrated Circuit. U.S. program that researched integrated circuits

30
New cards

VLIW

Very Long Instruction Word. Architecture that exploits instruction level parallelism

31
New cards

VM

Virtual Memory. Simulated memory that is written on a hard drive.

32
New cards

IF

Instruction Fetch. Retrieves data from the instruction set

33
New cards

ID

Instruction Decode. Figures out what the instruction does and what registers to use

34
New cards

EX

Execute. Performs an instruction with the ALU

35
New cards

MEM

Memory. Accesses an operand in data memory

36
New cards

WB

Write Back. Writes the result back into the register.

37
New cards

What is the 3 hazards in pipelining and describe them and how they are overcome?

Structural. A required resource is busy. Solved by pipeline stalling or hardware redesign.

Data: Data from previous operation is not ready. Solved by pipeline stalling or forwarding.

Control: Wrong instruction is in the pipeline. Solved by pipeline stalling or branch prediction

38
New cards

Mac Pro

Desktop - General Purpose, variety of software

39
New cards

HP Itanium

Server - High performance, capacity, reliability

40
New cards

Toaster

Embedded - Stringent power/performance

41
New cards

DSP

Custom - Unique specialty circuitry

42
New cards

Watson

Supercomputer - Massive parallel processing

43
New cards

ARM Structure

Number of Registers: 32 Register Bit Length: 64 Instruction Bit Length: 32 Data Path Bit Length: 64

44
New cards

LEGv8 Structure

Number of Registers: 32 Register Bit Length: 32 Instruction Bit Length: 32 Data Path Bit Length: 32

45
New cards

To increment to the next instruction, what do we do to the existing PC and why?

Add 4 to the PC, because each instruction is 4 bytes.

46
New cards

The Mill

- Operations execute in program order

- The compiler controls when ops issue and retire

- Short pipeline

- Is not yet silicon

- No rename registers to eliminate hazards

- Has no general registers since transient data lives on the Belt which is a FIFO

47
New cards

Neural Nets

- Inductive Reasoning. Given input and output data (training examples), we construct the rules

- Supervised and Unsupervised Training; Supervised: both inputs and outputs are provided. Unsupervised: Only input is provided.

- Good types of processors for NNs

-Software

48
New cards

Quantum Computers

Uses quantum bits, or qubits, but instead of 1's and 0's can be 1, 0, or both; called superposition.

49
New cards

Who has the Most Powerful Computer?

China

50
New cards

Who Has the Largest Server Farm?

Google

51
New cards

What is the difference between a stall and an interrupt?

A stall pauses the pipeline. An interrupt flushes the pipeline and jumps to a memory location to execute code.

52
New cards

What are three types of cache misses and their definitions?

Compulsory: First to a block.

Capacity: Finite cache size and the replaced block is accessed later

Conflict: Competition for entries in a set. Eliminated in a fully associative cache

53
New cards

Design a 256KB direct‐mapped data cache that uses a 32‐bit address and 8 words per block. Calculate the following: (a) How many bits are used for the byte offset and why?

(b) How many bits are used for the set (index) field?

(c) How many bits are used for the tag?

(a) How many bits are used for the byte offset and why? 5 bits because 265/8 = 32 and 2^5 = 32 bytes

(b) How many bits are used for the set (index) field? 13 because there are 32 bytes per block and 8192 = 2^13; 13 bits are used

(c) How many bits are used for the tag? 14 bits because 5 + 13 + 14 = 32; 14 bits remain for the tag

54
New cards

What are the following bits/flags used for? Valid, Dirty, Reference

Valid: Cache loaded with valid data

Dirty: Cache changed since it was read from main memory.

Reference: Estimates LRU bits

55
New cards

Cache Placement

Direct Mapped - each memory block has a specific cache block

Fully Associative - Any memory block can go in any cache block

Set associative - Each memory block has any set of cache blocks

56
New cards

Calculating Cache Sizes

[20 points] Design a 256KB direct-mapped data cache that uses a 32-bit address and 8 words per block. Calculate the

following:

(a) How many bits are used for the byte offset?

(b) How many bits are used for the set (index) field?

(c) How many bits are used for the tag?

(a) How many bits are used for the byte offset?

256/8 = 32. 2^5 = 32. Therefore, 5 bits

(b) How many bits are used for the set (index) field?

256*8 = 8192. 2^13 = 8192. Therefore, 13 bits are used.

(c) How many bits are used for the tag?

32-13-5 = 14 bits.

57
New cards

North Bridge vs South Bridge

Both are core logic chips on a motherboard. North bridge is connected to CPU, so it can perform tasks of higher performance.

58
New cards

**Multithreading

Course Grained: Simplest, one thread runs until an event with big latency

Fine Grained: One thread followed by another and so forth.

Simultaneous: Most efficient. Combination of coarse grained and fine grained

59
New cards

Strong Scaling vs. Weak Scaling

Strong: Time does not vary with # of processors for fixed total problem size

Weak: Time varies with # of processors for fixed problem size/processor

60
New cards

SISD/SIMD/MISD/MIMD

SISD: Single Instruction, Single Data (Has one control unit and more than one procession unit). Example: Intel

SIMD: Single Instruction, Multiple Data. Processes multiple data with the same instruction*. Example: Xplor

MISD: Multiple Instruction, Single Data Example: iWarp

MIMD: Multiple Instruction, Multiple Data. Example: Intel/

61
New cards

Shared Memory Multiprocessor

UMA: Uniform Memory Access. All processors have the same physical memory.

NUMA: Non-Uniform Memory Access. All processors have their own physical memory

Synchronization and Locks: All processor have their own physical memory

62
New cards

**Difference b/w shared memory multiprocessor and message-passing multiprocessor

Shared memory multiprocessor is faster. Does not need to wait for a response to send.

63
New cards

Reliability: MTTF, MTBF, MTTR, AFR, Availability

MTTF: Mean Time To Failure -Improved by Fault Avoidance, Tolerance, & Forecasting

MTBF: Mean Time Between Failures

MTTR: Mean Time to Repair

AFR: Annual Failure Rate,

Availability = MTTF / (MTTF + MTTR)

64
New cards

Problem Sets to Know:

ARM STRUCTURE FILL IN SHEET

Binary Logic Operation Gates

(INV, OR, AND, NOR, NAND, XOR, XNOR, MUX)

Cache problems

Cache Placement (Direct Mapped, Fully Associative, Set Associative)

Integer

- Hexadecimal, 2's Complement, Invert, Negate, 2's Complement, AND, XNOR

-Single and Double Precision, Carry Look Ahead concepts, Fast Multiplication

Floating Point***: Single & Double Precision

65
New cards

Abstraction??

* Layers of Code

- MatLab, Office Products, Web Browsers

- C, C++, C#, Java, Fortran

- Assembly

- Binary

- Microcode

- Nanocode

* Assembler

*Linker

*Loader

66
New cards

RAID Level??

Redundant Array of Inexpensive Disks

0, 1, 2, 3, 4, 5, 6

67
New cards

What the various blocks are and what they do and why they do it??