Ch.1 Data Processing Fundamentals

10 Terms

1
What are the 4 stages of a Data Pipeline?
ingestion, storage, processing, visualization
2
What is Big Data Volume?
the scale of data being processed
3
What is Big Data Velocity?
the speed at which data is processed: ingested, analyzed, and visualized
4
What is Big Data Variety?
the range of data source types, formats, and quality levels
5
What is a Data Warehouse?
a store of structured, processed data that is ready to use and has a strict structure
6
What is a Data Lake?
a store of unstructured, unprocessed data that is ready to analyze and has a flexible structure
7
What is the difference between On-Line Transactional Processing (OLTP) & On-Line Analytical Processing (OLAP)?
OLTP is usually used to modify a database (DB), while OLAP is usually used to query one
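A minimal sketch of the contrast, assuming a single in-memory SQLite database stands in for both roles (real OLTP and OLAP workloads usually run on separate systems). The `orders` table and its columns are hypothetical, made up for illustration.

```python
import sqlite3

# Hypothetical in-memory database with a made-up "orders" table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, region TEXT, amount REAL)")

# OLTP-style work: a small transaction that inserts and updates individual rows.
with conn:
    conn.execute("INSERT INTO orders (region, amount) VALUES (?, ?)", ("east", 20.0))
    conn.execute("INSERT INTO orders (region, amount) VALUES (?, ?)", ("west", 35.0))
    conn.execute("INSERT INTO orders (region, amount) VALUES (?, ?)", ("east", 10.0))
    conn.execute("UPDATE orders SET amount = ? WHERE id = ?", (15.0, 3))

# OLAP-style work: a read-only query that aggregates over many rows.
totals = conn.execute(
    "SELECT region, SUM(amount) FROM orders GROUP BY region ORDER BY region"
).fetchall()
print(totals)
```

The writes touch one row at a time, while the analytical query scans the whole table and summarizes it.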
8
What does ETL stand for? How can it be used for OLTP & OLAP systems?
extract, transform, load. An ETL process can extract data from an OLTP system, transform it, and load it into an OLAP system
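One way to picture the three ETL steps, as a sketch under stated assumptions: two in-memory SQLite databases play the OLTP source and the OLAP target, and the table names (`sales`, `sales_by_region`) and the functions `extract`, `transform`, and `load` are hypothetical names chosen for this example.

```python
import sqlite3

# Hypothetical OLTP source and OLAP target, both in-memory for illustration.
oltp = sqlite3.connect(":memory:")
olap = sqlite3.connect(":memory:")

oltp.execute("CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL)")
oltp.executemany(
    "INSERT INTO sales (region, amount) VALUES (?, ?)",
    [(" East", 20.0), ("west ", 35.0), ("EAST", 15.0)],
)
olap.execute("CREATE TABLE sales_by_region (region TEXT PRIMARY KEY, total REAL)")


def extract(source):
    """Extract: read raw rows from the transactional source."""
    return source.execute("SELECT region, amount FROM sales").fetchall()


def transform(rows):
    """Transform: normalize region names and total the amounts per region."""
    totals = {}
    for region, amount in rows:
        key = region.strip().lower()
        totals[key] = totals.get(key, 0.0) + amount
    return sorted(totals.items())


def load(target, rows):
    """Load: write the transformed rows into the analytical target."""
    with target:
        target.executemany("INSERT INTO sales_by_region VALUES (?, ?)", rows)


load(olap, transform(extract(oltp)))
result = olap.execute(
    "SELECT region, total FROM sales_by_region ORDER BY region"
).fetchall()
print(result)
```

Keeping each step in its own function makes it easy to swap the source, the cleaning rules, or the target independently.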
9
What is the difference between Batch Data Ingestion & Streaming Data Ingestion?
batch ingestion loads data at intervals, while streaming ingestion loads it continuously
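A rough sketch of the two ingestion styles. As a simplifying assumption, fixed-size groups stand in for time intervals in the batch case; the function names `ingest_batch` and `ingest_stream` are hypothetical.

```python
from typing import Iterable, Iterator, List


def ingest_batch(records: List[dict], batch_size: int) -> Iterator[List[dict]]:
    """Batch ingestion: hand records over in groups (standing in for intervals)."""
    for start in range(0, len(records), batch_size):
        yield records[start:start + batch_size]


def ingest_stream(source: Iterable[dict]) -> Iterator[dict]:
    """Streaming ingestion: hand over each record as soon as it arrives."""
    for record in source:
        yield record


events = [{"id": i} for i in range(5)]

# Batch: downstream code sees a few larger chunks.
batches = list(ingest_batch(events, batch_size=2))

# Streaming: downstream code sees every record individually, in arrival order.
streamed = list(ingest_stream(iter(events)))
```

With five events and a batch size of two, the batch path delivers three groups, while the streaming path delivers five separate records.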
10
What are the 4 common Data Transformations?
formatting, labeling, filtering, validating
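The four transformations can be illustrated with a small sketch. The records, field names, age range, and the "adult"/"minor" labels are all assumptions invented for this example, and the order of the steps is one reasonable choice, not a fixed rule.

```python
from datetime import datetime

# Hypothetical raw records; field names are made up for illustration.
raw = [
    {"name": " alice ", "signup": "2024/01/05", "age": "34"},
    {"name": "BOB", "signup": "2024/02/10", "age": "-3"},
    {"name": "carol", "signup": "2024/03/15", "age": "17"},
]


def format_record(rec):
    """Formatting: normalize text, dates, and numeric types."""
    return {
        "name": rec["name"].strip().title(),
        "signup": datetime.strptime(rec["signup"], "%Y/%m/%d").date().isoformat(),
        "age": int(rec["age"]),
    }


def validate(rec):
    """Validating: check that values fall in a plausible range."""
    return 0 <= rec["age"] <= 120


def label(rec):
    """Labeling: attach a category derived from the data."""
    return {**rec, "group": "adult" if rec["age"] >= 18 else "minor"}


formatted = [format_record(r) for r in raw]
valid = [r for r in formatted if validate(r)]
labeled = [label(r) for r in valid]
# Filtering: keep only the records relevant to the analysis (adults here).
adults = [r for r in labeled if r["group"] == "adult"]
```

Here validation drops the record with a negative age, and filtering then keeps only the records labeled as adults.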