Metrics Monitoring and Alerting System

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/9

flashcard set

Earn XP

Description and Tags

These flashcards cover key concepts and parameters from the metrics monitoring and alerting system lecture notes.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

10 Terms

1
New cards

What is the primary goal of a metrics monitoring and alerting system?

To ensure high availability and reliability.

2
New cards

What types of metrics should be collected in the designed system?

Operational metrics such as CPU load, Memory, and Data disk space.

3
New cards

What is the scale of the infrastructure discussed in the notes?

100 million daily active users, 1000 server pools, and 100 machines per pool.

4
New cards

How long should data be retained in the system?

Data retention should be for 1 year.

5
New cards

What is the data resolution strategy for long-term storage?

Keep newly received metrics for 7 days, roll them up to 1m resolution for the next 30 days, and further roll them up to 1h resolution after 30 days.

6
New cards

What are the supported alert channels mentioned in the chapter?

Email, phone, PagerDuty, or webhooks.

7
New cards

What are the five core components involved in a metrics monitoring and alerting system?

Data collection, Data transmission, Data storage, Alerting, Visualization.

8
New cards

What type of data model is used for metrics data recording?

Time-series data model, which includes a set of values with timestamps.

9
New cards

What is the recommendation regarding the type of database for storing metrics data?

It is not recommended to use a general-purpose database; dedicated time-series databases are preferable.

10
New cards

Name two popular time-series databases mentioned in the notes.

InfluxDB and Prometheus.