Chapter 1. Proximity Service

0.0(0)

Studied by 0 people

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/22

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

23 Terms

1

New cards

Functional requirements

Return all nearby business
Business owner can add, update, delete info
User can read business info

2

New cards

Non functional requirements

Low latency
Data privacy for user location
High availability and scalability for peak hours

3

New cards

What is the read write pattern for this system and what’s the solution?

It is read heavy, use leader follower pattern.

4

New cards

What are 2 real life geospacial database?

Postgres with GIS extension
Redis geohash

5

New cards

How do we use geospacial index?

Convert location to index
Get index of nearby area
Search for business belong to the area index

6

New cards

What are 3 types of geospatial index?

Geohash
Quadtree
Google S2

7

New cards

How does geohash work?

Uses base32 representation
6 chars is around 1 square mile and 10 chars is 1 square meter
You can query business using LIKE ‘9q8zn%’

8

New cards

What are some issues and solutions with geohash?

Business close enough but not match prefix
Not enough business with current prefix

Solution: expand search range to get 8 neighbor cells

9

New cards

How to build quadtree?

Given list of business with coordinates
Create a parent node representing the coordinates it covers
If has more than 100 business, subdivide coordinates into 4 areas, making them the child nodes

<ol><li><p>Given list of business with coordinates</p></li><li><p>Create a parent node representing the coordinates it covers</p></li><li><p>If has more than 100 business, subdivide coordinates into 4 areas, making them the child nodes</p></li></ol>

10

New cards

Quadtree vs Geohash

Quadtree is a tree structure, geohash is stored as table (though it’s index can be trie)
For quadtree, the leaf node siblings have 100+ business, but geohash sublings might not. (Dynamic grid size)
Easier to update geohash index than a tree

11

New cards

How do we store quadtree?

We can store it in memory, because given a billion business, there would be only 1M leaf nodes if each node has 1000 business

12

New cards

Time complexity of building quadtree

For each business, it takes logn time to send it to leaf node, so it’s O(nlogn)
For 200M business it would take a few minutes

13

New cards

How to find nearby business with quadtree

Given coordinate, find the node for it
Traverse up until gathered enough business

14

New cards

How does quadtree handle updates?

Real time: insert business to the tree
Batch job: re-build tree weekly

15

New cards

Operational considerations of quadtree

Keep multiple replica for availability and scalability
Blue green or canary deployment

16

New cards

What is google S2

It maps a location to 1D index based on hilbert curve
1 points that have similar value are close in 2D space

17

New cards

Advantages of google S2

Great with geofencing because it can cover arbitrary areas
More flexible cell size and precision

18

New cards

What do geospacial index have in common?

Represent 2D space with 1D index
A range in 1D index covers an area in 2D space
Nearby locations have similar 1D index value

19

New cards

Which one should I pick during interview?

Geohash

20

New cards

What to cache?

List of business IDs in the grid
Business metadata

21

New cards

How to cache business ID list in grids?

Select 3 geohash precision
8 bytes x 200 million x 3 precisions = ~5 GB

22

New cards

How to have international support?

Deploy servers in different regions
DNS geo based routing
Add country specific cache that can scale independnetly

23

New cards

Component diagram

Location service: get nearby
Business service: get business info
Cache: business info + geohash
DB: leader follower setup

<ol><li><p>Location service: get nearby</p></li><li><p>Business service: get business info</p></li><li><p>Cache: business info + geohash</p></li><li><p>DB: leader follower setup</p></li></ol>

Explore top notes

Period 1: The Renaissance to the Wars of Religion (1450–1648)

Updated 766d ago

Note

Scansion Basics

Updated 603d ago

Note

AP Econ Vocab Macro Unit 1

Updated 821d ago

Note

Early Childhood Health: Nutrition

Updated 846d ago

Note

Updated 30d ago

Note

Chapter 14 - Stocks, Bond, and Insurance

Updated 810d ago

Note

Regulation of Transcription In Prokaryotes

Updated 1007d ago

Note

🦅 APUSH Unit 2 Notes

Updated 363d ago

Note

Explore top flashcards

Updated 557d ago

Flashcards (55)

AP Stats Summer Study

Updated 637d ago

Flashcards (20)

A&P Anatomical Regions/ Surfaces

Updated 607d ago

Flashcards (48)

Intro to Entrepreneurship Final Exam Vocab Terms

Updated 393d ago

Flashcards (195)

Exam #1 True or False Questions

Updated 802d ago

Flashcards (50)

jekyll and hyde quotes

Updated 767d ago

Flashcards (32)

Anatomy and Physiology: Endocrine hormones

Updated 57d ago

Flashcards (44)

Chapter 10 - Vitamins

Updated 207d ago

Flashcards (23)