Requirement questions (Step 1)
Is it a server side rate limiter or a client side one?
Does the rate limiter throttle requests based on IP? User ID?
How many requests are we talking about?
Should it be a separate service or in application code?
Do we tell users they are throttled?
Common requirements
Low latency
High throughput
Low memory usage
High fault tolerance. If there are any problems with the rate limiter (for example, a cache server goes offline), it does not affect the entire system.
Status code for too many requests
429
Benefits of building your own rate limiter
You can choose the algorithm
Algorithms for rate limiting
Token bucket
Leaky bucket
Fixed window counter
Sliding window counter
Sliding window log
Considerations for token bucket algo
Bucket size (burst request allowance)
Refill rate
Bucket grouping (per endpoint, per IP, per server)
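A minimal in-memory sketch of these knobs, assuming a background refiller thread; the class name and defaults are illustrative, not from the source:

```python
import threading
import time

class TokenBucket:
    """Bucket size caps the allowed burst; the refiller adds tokens at a
    fixed rate. Keep one instance per grouping key (e.g. endpoint, IP)."""
    def __init__(self, capacity: int = 10, refill_per_sec: int = 5):
        self.capacity = capacity
        self.tokens = capacity
        self.lock = threading.Lock()
        refiller = threading.Thread(
            target=self._refill, args=(refill_per_sec,), daemon=True)
        refiller.start()

    def _refill(self, rate: int) -> None:
        # Periodically top the bucket up, never above capacity.
        while True:
            time.sleep(1)
            with self.lock:
                self.tokens = min(self.capacity, self.tokens + rate)

    def allow(self) -> bool:
        # Each request consumes one token; an empty bucket means throttle.
        with self.lock:
            if self.tokens > 0:
                self.tokens -= 1
                return True
            return False
```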
Issue with fixed window counter
If a burst of traffic falls at the boundary between 2 windows, up to 2x the limit can be allowed. For example, with a limit of 5/min, 5 requests in the last seconds of one window plus 5 in the first seconds of the next means 10 requests within a single rolling minute.
How does sliding window log work
Keep a log of every request's timestamp
On every request, count the requests logged within the last window (e.g. the last 1s) and reject if the count exceeds the limit
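A sketch of this using a Redis sorted set as the log, assuming the redis-py client; the key scheme and limits are illustrative:

```python
import time
import redis  # assumes the redis-py client is installed

r = redis.Redis()

def allow_request(user_id: str, limit: int = 10, window: float = 1.0) -> bool:
    key = f"rate:log:{user_id}"  # hypothetical key scheme
    now = time.time()
    pipe = r.pipeline()
    pipe.zremrangebyscore(key, 0, now - window)  # evict timestamps outside the window
    pipe.zadd(key, {str(now): now})              # log this request's timestamp
    pipe.zcard(key)                              # count requests still in the window
    pipe.expire(key, int(window) + 1)            # let idle keys expire
    _, _, count, _ = pipe.execute()
    return count <= limit
```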
How does sliding window counter work
Uses fixed window counters, but adds a weighted percentage of the previous window's count to the current window's count
Not 100% accurate, since it assumes requests in the previous window were evenly distributed
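The approximation can be shown with a small worked example (all numbers are illustrative):

```python
def sliding_window_count(prev_count: int, curr_count: int,
                         overlap: float) -> float:
    # Weight the previous window by how much of it still overlaps the
    # sliding window; this assumes its requests were evenly spread.
    return curr_count + prev_count * overlap

# 30% into the current minute, with 84 requests last minute and 36 so far:
# the previous window still covers 70% of the sliding window, so the
# estimate is 36 + 84 * 0.7 = 94.8 requests in the last rolling minute.
print(sliding_window_count(84, 36, 0.7))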
Issues with refilling token bucket
A refiller job constantly incrementing the token counter for every bucket creates a lot of load
Alternative to refilling token bucket
Get the token count
If positive, decrement the count, update the token timestamp, and allow the request
If not, recalculate the token count from the time elapsed since the last token timestamp (lazy refill) before deciding
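A sketch of this lazy refill, assuming the bucket state is stored as a (tokens, last_updated) pair, e.g. in a Redis hash; names and defaults are illustrative:

```python
import time

def try_consume(state: dict, capacity: int = 10, refill_rate: float = 10.0) -> bool:
    now = time.time()
    if state["tokens"] >= 1:
        # Fast path: a token is available, so decrement and stamp the time.
        state["tokens"] -= 1
        state["last_updated"] = now
        return True
    # Slow path: derive the refilled count from elapsed time instead of
    # having a job increment every bucket continuously.
    earned = (now - state["last_updated"]) * refill_rate
    state["tokens"] = min(capacity, state["tokens"] + earned)
    state["last_updated"] = now
    if state["tokens"] >= 1:
        state["tokens"] -= 1
        return True
    return False

bucket = {"tokens": 1, "last_updated": time.time()}
print(try_consume(bucket))  # True; later calls refill lazily from the timestamp
```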
How to make rules configurable
Use a YAML config file; each rule has a request filter, a rate limit unit, and a requests-per-unit count
Cache the rules
Add a worker to update the cache when rules change
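A sketch of such a rules file and loading it, assuming PyYAML; the field names are illustrative, not a fixed schema:

```python
import yaml  # assumes PyYAML is installed

RULES_YAML = """
rules:
  - filter: "POST /api/messages"   # request filter: which requests the rule matches
    unit: minute                   # rate limit unit
    requests_per_unit: 5
  - filter: "GET /api/search"
    unit: second
    requests_per_unit: 20
"""

# In production, the worker would reload rules from the rule store into
# the cache whenever they change, rather than parse a literal string.
rules = yaml.safe_load(RULES_YAML)["rules"]
for rule in rules:
    print(f'{rule["filter"]}: {rule["requests_per_unit"]} per {rule["unit"]}')
```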
HTTP Headers for rate limiting
X-Ratelimit-Limit: the total number of requests allowed per window
X-Ratelimit-Remaining: the remaining number of allowed requests within the window
X-Ratelimit-Retry-After: the number of seconds to wait before making the next request
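A throttled response might look like this (header values are illustrative):

```
HTTP/1.1 429 Too Many Requests
X-Ratelimit-Limit: 100
X-Ratelimit-Remaining: 0
X-Ratelimit-Retry-After: 30
```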
Options when request hit rate limit
Drop it
Put it in a message queue and retry it later
What’s the race condition challenge
Reading the counter in the bucket and updating it must happen in 1 atomic action
Otherwise 2 concurrent requests may each read the same count and fail to count each other
Solution to race condition challenge
Use a Lua script so Redis performs the read-and-update as 1 atomic action (see the sketch below)
Or use a sliding window algo backed by a Redis sorted set
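A sketch of the Lua option for a fixed window counter, assuming the redis-py client; Redis executes the whole script atomically, so the read and the increment cannot interleave:

```python
import redis  # assumes the redis-py client is installed

r = redis.Redis()

FIXED_WINDOW_LUA = """
local count = redis.call('INCR', KEYS[1])
if count == 1 then
  redis.call('EXPIRE', KEYS[1], ARGV[1])
end
if count > tonumber(ARGV[2]) then
  return 0
end
return 1
"""

check = r.register_script(FIXED_WINDOW_LUA)

def allow(user_id: str, window_seconds: int = 60, limit: int = 100) -> bool:
    key = f"rate:{user_id}"  # hypothetical key scheme
    return check(keys=[key], args=[window_seconds, limit]) == 1
```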
Components needed for rate limiter
Rate limiter service
Redis
Rule store (cache)
How to improve performance
Place rate limiters close to users (multiple data centers or edge servers)
Synchronize data between them with an eventual consistency model
How to reduce hitting rate limit
Cache responses on the client side to avoid making frequent calls
Gracefully handle errors when rate limited
Add sufficient back-off time to the retry logic
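A sketch of client-side retry with exponential back-off and jitter; send_request is a hypothetical callable returning the HTTP status code:

```python
import random
import time

def request_with_backoff(send_request, max_retries: int = 5) -> int:
    for attempt in range(max_retries):
        status = send_request()
        if status != 429:
            return status
        # Back off exponentially (1s, 2s, 4s, ...) with random jitter so
        # throttled clients don't all retry at the same instant.
        time.sleep(2 ** attempt + random.random())
    return 429
```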
High level diagram