Data_Engineering_Part4

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/16

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 8:04 PM on 12/19/24
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

17 Terms

1
New cards

Spark Job Definition Inline Monitoring

A feature that allows real-time monitoring of Spark job submissions and execution status.

2
New cards

Pipeline Spark Activity Inline Monitoring

Enables viewing of Spark application execution details within pipelines, including snapshots and logs.

3
New cards

Apache Spark Advisor

Provides real-time advice and recommendations for optimizing Spark notebook runs.

4
New cards

Run Series Analysis

Automatically categorizes Spark applications based on recurring activities and detects anomalies.

5
New cards

Autotune Analysis

Compares outcomes from autotune to enhance Spark application performance.

6
New cards

Outlier Detection

Identifies and analyzes outlier runs in the Spark run series to troubleshoot performance.

7
New cards

Extended History Server

A tool used for debugging and diagnosing completed Spark applications.

8
New cards

Spark Capacity Consumption

Reports on the resource usage of Spark jobs, providing insights for administrators.

9
New cards

Livy API

An API helpful for submitting Spark code within Microsoft Fabric without creating Notebooks.

10
New cards

Intelligent Cache

A feature that caches data to reduce latency and improve the execution speed of Spark jobs.

11
New cards

Data Skew

An issue where data distribution is uneven across tasks, leading to performance bottlenecks.

12
New cards

Kusto Queries

Used to retrieve logs and metrics from Azure Log Analytics and inspect Spark application data.

13
New cards

Session Jobs

Livy jobs that maintain an active Spark session for state-sharing between jobs.

14
New cards

Batch Jobs

Livy jobs that do not maintain an ongoing session, useful for single job execution.

15
New cards

Output of Autotune

Performance metrics provided after autotuning Spark SQL queries for optimization.

16
New cards

Focused Mode

A feature that allows zooming in on specific visual details for better monitoring.

17
New cards

Spark SQL Analytics Endpoint

A way to access SQL-based queries from Spark applications for data management.