What are the prefixes of kilo, mega, and giga? What are their sizes (base10)?
Kilo: k, 10³
Mega: M, 10⁶
Giga: G, 10⁹
What are the prefixes of milli, micro, nano? What are their sizes (base10)?
Milli: m, 10⁻³
Micro: μ, 10⁻⁶
Nano: n, 10⁻⁹
What is throughput?
How many tasks can be done per second
What is latency?
How long one task takes
What is energy efficiency?
How much useful work is done per unit of energy
Compare intrinsic and technology-dependent issues
Intrinsic: algorithmic, design-wise issues (e.g. branch prediction)
Technology-dependent: limited by current technology/hardware (e.g. battery life)
What is capacity?
Amount of a resource that is available
What is utilization?
Percentage of capacity being used
What is overhead?
Resource that is wasted
What is useful work?
Resource spent on actual work for a given workload
Pipelining vs Concurrency in terms of latency and throughput
Pipelining
Latency: Sum of all modules’ latencies
Throughput: Limited by the slowest module, i.e. 1 / (max module latency)
Concurrency
Latency: Weighted average of the individual modules’ latencies
Throughput: N × a single module’s throughput (using N identical modules)
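The pipelining vs. concurrency rules above can be sketched in Python (function names are mine, not from the cards):

```python
def pipeline_perf(stage_latencies):
    """Pipeline: stages in series. A task passes through every stage,
    but a new task can enter as soon as the slowest stage is free."""
    latency = sum(stage_latencies)        # sum of all modules' latencies
    throughput = 1 / max(stage_latencies) # limited by the slowest stage
    return latency, throughput

def concurrency_perf(module_latency, n):
    """Concurrency: N identical modules side by side. Per-task latency
    is unchanged; aggregate throughput scales with N."""
    return module_latency, n * (1 / module_latency)
```

For example, stages with latencies 2, 3, and 5 give a pipeline latency of 10 but a throughput of 1/5, since the 5-unit stage is the bottleneck.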
When can input throughput and output throughput differ?
When using compression or decompression
Can physical data transfer be faster than over the network?
Yes, for large datasets
In networking, what is one example of improving performance?
HTTP pipelining
Pipelining raises throughput from p/RTT (one payload per round trip) to cwnd/RTT (a full congestion window per round trip)
In memory, what is one consideration for performance?
Latency-throughput curve: increasing throughput improves performance up to a saturation threshold; beyond it, latency increases significantly and performance degrades
In a hard disk example, what are three ways to improve performance?
Batching: handle requests as a group to amortize fixed overhead
Dallying: delay a request (e.g. delay disk write because user may save again soon)
Speculation: read next likely data early
What is burst?
A brief increase in the request rate above average
Why does overload happen?
When incoming request rate > average throughput for one stage. Buffers will fill up, and requests wait in the buffer → increased latency
What is Amdahl’s Law (Law of Diminishing Returns)?
Total speedup = 1 / ( (1-P) + P/S ) where S = speedup for the portion, P = % of the portion
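The formula above translates directly into a one-line helper (a sketch; the function name is mine):

```python
def amdahl_speedup(p, s):
    """Amdahl's Law: p = fraction of execution that benefits,
    s = speedup of that fraction. Returns the total speedup."""
    return 1 / ((1 - p) + p / s)
```

Note the diminishing returns: even with s → ∞, speeding up only half the program (p = 0.5) can never do better than 2×.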
What are 4 steps of designing for performance?
Measure the system to determine whether it needs to be faster
Measure again to find the bottleneck
Predict the impact of removing the bottleneck
a. Can it be removed? How effective?
b. If not, can we redesign it?
Implement new solution, repeat
What is workload?
Tasks processed by a system
We can measure performance on… (3)
Real workload
Simulated system
Benchmarks + real traces
Caching vs memoization
Caching has a bounded size, whereas memoization does not
What is locality of reference?
Clustering of memory references in both time and address space
What are the two types of locality of reference?
Temporal: recent items likely to be referenced again soon
Spatial: near items likely to be referenced
What is a working set?
Set of items used during an interval
What happens when working set >> primary storage?
Thrashing— frequent overwriting of primary storage
Write-through vs Write-back
Write-through: writing to cache and then to secondary storage immediately
Write-back: writing to cache and to secondary storage later when evicted
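The two write policies can be sketched as minimal Python classes (class and attribute names are illustrative, not from the cards):

```python
class WriteThroughCache:
    """Write-through: every write goes to the cache AND the backing store immediately."""
    def __init__(self):
        self.cache, self.backing = {}, {}

    def write(self, key, value):
        self.cache[key] = value
        self.backing[key] = value     # backing store is always up to date

class WriteBackCache:
    """Write-back: writes stay in the cache; the backing store is updated on eviction."""
    def __init__(self):
        self.cache, self.backing = {}, {}
        self.dirty = set()            # keys modified since they were cached

    def write(self, key, value):
        self.cache[key] = value
        self.dirty.add(key)           # defer the slow backing-store write

    def evict(self, key):
        if key in self.dirty:         # flush only if modified
            self.backing[key] = self.cache[key]
            self.dirty.discard(key)
        del self.cache[key]
```

Write-back trades fewer slow writes for the risk that the backing store is temporarily stale, which is exactly why cache coherency (next card) matters.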
What is cache coherency?
Consistency of data across multiple caches in a system
What is associativity? What are the three types?
Which data can be stored in each cache slot
Fully associative: data can be anywhere in the cache
Direct-mapped: each data is mapped to exactly one cache line
N-way associative: cache divided into sets, each set contains N cache lines
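The set-mapping arithmetic behind direct-mapped and N-way caches can be sketched as follows (the constants are illustrative, not from the cards):

```python
BLOCK_SIZE = 64   # bytes per cache line (illustrative)
NUM_SETS = 128    # number of sets in the cache (illustrative)
WAYS = 4          # N-way: each set holds N lines; WAYS == 1 is direct-mapped

def cache_set(addr):
    """Map a byte address to the one set index where its block may live."""
    block = addr // BLOCK_SIZE    # which memory block the address falls in
    return block % NUM_SETS       # the block can only live in this set
```

With a fully associative cache there is no such restriction: any block may occupy any line, at the cost of searching every line on a lookup.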
What is fetch policy?
When/how to fetch data (on-demand or by prediction)
What are the five types of removal policies?
Least recently used (LRU): every time data is used, place/move it to the head and push the rest down
FIFO: place it at head if not in queue, remove the tail
Clock: second chance page replacement with a circular pointer (only evict if reference bit is 0, put at head if 1)
Not recently used (NRU): remove elements in the following order
Not referenced, not dirty
Not referenced, dirty
Referenced, not dirty
Referenced, dirty
OPT— optimal algorithm (evict the item whose next use is farthest in the future); not implementable in practice, used as a baseline for comparison
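The LRU policy from the list above can be sketched with Python's `collections.OrderedDict` (a sketch; the class and method names are my own):

```python
from collections import OrderedDict

class LRUCache:
    """Bounded cache that evicts the least recently used entry."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.data = OrderedDict()   # insertion order tracks recency

    def get(self, key):
        if key not in self.data:
            return None             # cache miss
        self.data.move_to_end(key)  # used: move to the "head"
        return self.data[key]

    def put(self, key, value):
        if key in self.data:
            self.data.move_to_end(key)
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)  # evict the least recently used
```

For example, with capacity 2, inserting 1 and 2, touching 1, then inserting 3 evicts 2 (the least recently used), not 1.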
What is Belady’s anomaly?
For some removal policies, more cache ≠ higher hit ratio!
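Belady's anomaly can be demonstrated with FIFO replacement on the classic reference string 1 2 3 4 1 2 5 1 2 3 4 5, where 4 frames produce more misses than 3 (the simulation below is a sketch; the function name is mine):

```python
from collections import deque

def fifo_misses(refs, frames):
    """Count misses for FIFO replacement with the given number of frames."""
    queue, resident, misses = deque(), set(), 0
    for r in refs:
        if r not in resident:
            misses += 1
            if len(resident) == frames:           # cache full: evict oldest
                resident.discard(queue.popleft())
            queue.append(r)
            resident.add(r)
    return misses

refs = [1, 2, 3, 4, 1, 2, 5, 1, 2, 3, 4, 5]
# 3 frames -> 9 misses, but 4 frames -> 10 misses: more cache, lower hit ratio
```

Stack-based policies such as LRU do not suffer from the anomaly; FIFO does because the set of resident pages with k frames is not always a subset of the set with k+1 frames.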
What are the three types of a cache miss?
Compulsory miss— first reference (cannot be prevented)
Capacity miss— can be prevented by infinite cache size
Conflict miss— can be prevented by a fully-associative cache