Azure Data Engineering Overview

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/22

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

23 Terms

1
New cards

Data engineering

Working with multiple types of data to perform data operations using various tools and scripting languages

2
New cards

Streaming data

Continuous flow of data records

3
New cards

Data pipeline

Orchestrated activities for data transfer and transformation, often used for ETL or ELT operations

4
New cards

Operational data

Transactional data used by applications

5
New cards

Analytical data

Data optimized for analysis and reporting

6
New cards

Apache Spark

Open-source engine for distributed data processing

7
New cards

Data Warehouse

Analytical data stored in a relational database, typically modeled as a star schema

8
New cards

Data Lake

Analytical data stored in files, distributed storage for massive scalability

9
New cards

Azure Stream Analytics

Azure service for running data pipelines and managing analytical data in a data lake or data warehouse

10
New cards

Azure Synapse Analytics

Cloud platform for data analytics, large-scale data warehousing, and advanced analytics

11
New cards

Azure Data Lake Storage Gen2

Distributed cloud storage for data lakes with HDFS-compatibility and flexible security

12
New cards

Hierarchical Namespace

Feature to enable in a blob container to use Azure Data Lake Storage Gen2

13
New cards

Azure Blob Storage

Storage service in Azure for unstructured data

14
New cards

Structured data

Data in a relational database table

15
New cards

Unstructured data

Data stored in a data lake

16
New cards

Global replication

Option to enable for using Azure Data Lake Storage Gen2

17
New cards

Azure Data Factory

Native pipeline functionality in Azure Synapse Analytics for ingesting and transforming data

18
New cards

SQL Server based pools

In Azure Synapse Analytics, used for scalable relational data processing

19
New cards

Data Explorer

Tool for high-performance real-time data analytics with Kusto query language

20
New cards

Serverless SQL pool

Feature in Azure Synapse Analytics for data exploration and analysis of files in the data lake

21
New cards

Dedicated SQL pool

Pool in Azure Synapse Analytics for hosting large-scale relational data warehouses

22
New cards

Apache Spark pool

Pool in Azure Synapse Analytics for processing and analyzing data with Apache Spark

23
New cards

Pipelines

Feature in Azure Synapse Analytics for transferring data between stores and applying transformations