Scaling Reliability Observability

Last updated 9:11 AM on 5/30/26

22 Terms

1

single point of failure

component whose failure takes down the whole system

2

replication

keeping copies of data or services on multiple nodes

3

failover

switching to a backup when the primary fails

4

health check

probe that tests whether a service is alive and able to serve requests

5

autoscaling

automatically adjusting server count based on load
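A minimal sketch of one common proportional scaling rule: scale the current server count by the ratio of observed load to target load, round up, and clamp to a range. The function name, parameters, and the min/max defaults are illustrative assumptions, not part of the card.

```python
import math


def desired_servers(current, load_per_server, target_per_server,
                    min_servers=1, max_servers=20):
    """Return the server count that would bring per-server load to the target.

    All names and default bounds here are hypothetical, chosen for illustration.
    """
    # Proportional rule: more load than target means more servers, and vice versa.
    wanted = math.ceil(current * load_per_server / target_per_server)
    # Clamp so the fleet never drops to zero or grows without bound.
    return max(min_servers, min(max_servers, wanted))


print(desired_servers(4, load_per_server=90, target_per_server=60))  # 6
```

Rounding up errs toward spare capacity; real autoscalers usually also add a cooldown so the count does not oscillate.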

6

rate limiter

limits request volume per user or key

7

token bucket

rate limiting where each request spends a token; tokens refill at a fixed rate up to a maximum
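A small sketch of a token bucket, assuming a single process and no locking. The class name, the `rate` and `capacity` parameters, and the injectable `clock` are illustrative choices rather than any particular library's API.

```python
import time


class TokenBucket:
    """Holds up to `capacity` tokens, refilled at `rate` tokens per second."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock          # injectable so the bucket can be tested
        self.tokens = capacity      # start with a full bucket
        self.last = clock()

    def allow(self, cost=1):
        now = self.clock()
        # Refill for the time elapsed since the last call, never above capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost     # spend tokens for this request
            return True
        return False                # bucket empty: reject the request
```

The capacity sets how large a burst is tolerated, while the refill rate sets the sustained request rate.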

8

fixed window

rate limit counted over fixed, non-overlapping time intervals
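One way a fixed window limiter could look, as a sketch: each timestamp maps to a window index by integer division, and requests are counted per key per window. The class and parameter names are assumptions made for this example.

```python
import time
from collections import defaultdict


class FixedWindowLimiter:
    """Allows at most `limit` requests per key in each fixed window."""

    def __init__(self, limit, window_seconds, clock=time.time):
        self.limit = limit
        self.window = window_seconds
        self.clock = clock
        self.counts = defaultdict(int)   # (key, window index) -> request count

    def allow(self, key):
        # Integer division maps every timestamp to its fixed window.
        index = int(self.clock() // self.window)
        # Old windows are never evicted in this sketch; a real limiter would expire them.
        if self.counts[(key, index)] >= self.limit:
            return False
        self.counts[(key, index)] += 1
        return True
```

A known weakness of this scheme is the boundary burst: a client can send `limit` requests at the end of one window and `limit` more at the start of the next.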

9

sliding window

rate limit counted over an interval that moves with the current time
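A sketch of the sliding window idea using a per-key log of request timestamps (the "sliding log" variant, chosen here for clarity; counter-based approximations also exist). Names and parameters are illustrative assumptions.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allows at most `limit` requests per key in any trailing window."""

    def __init__(self, limit, window_seconds, clock=time.monotonic):
        self.limit = limit
        self.window = window_seconds
        self.clock = clock
        self.hits = defaultdict(deque)   # key -> timestamps of accepted requests

    def allow(self, key):
        now = self.clock()
        hits = self.hits[key]
        # Drop timestamps that have left the window ending at `now`.
        while hits and hits[0] <= now - self.window:
            hits.popleft()
        if len(hits) >= self.limit:
            return False
        hits.append(now)
        return True
```

The log costs memory proportional to the limit per key, but it avoids the burst that a fixed interval permits at its boundary.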

10

backpressure

signaling upstream to slow intake when the system is overloaded
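A minimal illustration of backpressure with a bounded queue from Python's standard library: when the queue is full, the producer is told so immediately instead of piling up more work. The `submit` helper is a hypothetical name for this sketch.

```python
import queue


def submit(q, task):
    """Try to enqueue `task`; return False when the queue is full.

    A False result is the backpressure signal: the caller should slow down,
    retry later, or shed the request.
    """
    try:
        q.put_nowait(task)
        return True
    except queue.Full:
        return False


work = queue.Queue(maxsize=2)        # the bound is what creates backpressure
print(submit(work, "a"), submit(work, "b"), submit(work, "c"))  # True True False
```

An unbounded queue would accept every task and hide the overload until memory or latency gave out.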

11

queue depth

number of waiting tasks

12

retry storm

many clients retrying at once, adding load that worsens an outage

13

exponential backoff

waiting longer between retries, multiplying the delay after each failure
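A sketch of a retry helper with exponential backoff. The doubling factor, the `base` and `cap` values, and the random jitter are assumptions added for illustration; jitter is included because it spreads clients out so they do not all retry at the same moment.

```python
import random
import time


def retry(operation, attempts=5, base=0.5, cap=30.0, sleep=time.sleep):
    """Call `operation` until it succeeds or `attempts` calls have failed."""
    for attempt in range(attempts):
        try:
            return operation()
        except Exception:
            if attempt == attempts - 1:
                raise                       # out of retries: surface the error
            # Delay doubles each time (base, 2*base, 4*base, ...) up to `cap`.
            delay = min(cap, base * 2 ** attempt)
            # Jitter (an assumption of this sketch): sleep a random part of the delay.
            sleep(random.uniform(0, delay))
```

Catching every `Exception` keeps the sketch short; real code would retry only errors known to be transient.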

14

idempotent operation

operation that is safe to retry because repeating it has no additional effect
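One common way to make a non-idempotent action safe to retry is an idempotency key: the server remembers the result for each key and replays it on a repeat. The `PaymentService` class below is a hypothetical in-memory example, not a real payment API.

```python
class PaymentService:
    """Hypothetical service that deduplicates charges by idempotency key."""

    def __init__(self):
        self.results = {}     # idempotency key -> receipt already issued
        self.charged = 0      # total amount actually charged

    def charge(self, idempotency_key, amount):
        # A repeated key means a retry: return the stored receipt, charge nothing.
        if idempotency_key in self.results:
            return self.results[idempotency_key]
        self.charged += amount
        receipt = {"key": idempotency_key, "amount": amount}
        self.results[idempotency_key] = receipt
        return receipt


svc = PaymentService()
svc.charge("order-1", 10)
svc.charge("order-1", 10)     # retry of the same request
print(svc.charged)            # 10
```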

15

SLA

service level agreement: contract with customers promising a level of service, with consequences if it is missed

16

SLO

service level objective: internal target for a measured level of service, such as 99.9% availability

17

error budget

allowed amount of unreliability, equal to 100% minus the SLO target
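A short worked example of the arithmetic, assuming an availability target measured over a 30-day period; the function name and the period default are illustrative.

```python
def error_budget_minutes(slo, period_days=30):
    """Minutes of allowed downtime for an availability target such as 0.999."""
    total_minutes = period_days * 24 * 60
    # The budget is the fraction of the period the target lets you miss.
    return (1 - slo) * total_minutes


# A 99.9% target over 30 days leaves about 43.2 minutes of downtime.
print(round(error_budget_minutes(0.999), 1))
```

Each extra nine shrinks the budget tenfold: 99.99% over the same period allows only about 4.3 minutes.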

18

logs

timestamped records of discrete events

19

metrics

numeric measurements of system behavior over time

20

traces

record of a request's path through services, with timing per step

21

alert

notification when a system crosses a threshold

22

dashboard

visual display of system health