Overview of Data Engineering

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/9

flashcard set

Earn XP

Description and Tags

Flashcards covering key vocabulary and concepts related to data engineering.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

10 Terms

1
New cards

ETL (Extract, Transform, Load)

A process in data engineering that moves data from various sources, processes it, and loads it into a target system.

2
New cards

Data Warehouse

A central hub designed to store cleaned, structured, and processed data optimized for reporting and analysis.

3
New cards

Data Lake

A large storage system that stores raw, unstructured, and structured data in its original form.

4
New cards

Data Maturity

Measures how well an organization utilizes, integrates, and manages data for competitive advantage.

5
New cards

DataOps

An approach for data management that improves data quality and speeds data development and analysis through automation and collaboration.

6
New cards

Apache Spark

An in-memory, high-speed data processing engine that supports both batch and streaming data processing.

7
New cards

NoSQL Databases

Flexible databases like MongoDB and Cassandra, designed to handle unstructured and semi-structured data.

8
New cards

ETL Process

The three-step process of extracting, transforming, and loading data into target systems for analysis.

9
New cards

Data Pipeline

Automated workflows that manage the flow of data through various processes from source to destination.

10
New cards

Big Data

Extensive datasets that require special technology for processing, storage, and analysis, often using tools like Hadoop and Spark.