Data Tools and Skills Flashcards

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/17

flashcard set

Earn XP

Description and Tags

This set of vocabulary flashcards covers the various tools used in a data journey—ranging from Excel and PowerBI to SQL, Python, and AI—detailing their specific strengths, weaknesses, and functional roles in professional data analytics.

Last updated 1:35 AM on 6/25/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

18 Terms

1
New cards

Excel

The fastest way to work with small data and answer quick questions; however, it involves manual repetition and has performance limitations with large files.

2
New cards

Data Exploration

The step of looking at data for the first time, understanding the content, and noting any numbers or bad data quality issues.

3
New cards

Data Cleanup (Excel)

The process of removing bad rows and unnecessary columns, replacing nulls, and reshaping data to prepare it for analytics.

4
New cards

PowerBI

A tool used for automation and as a home for interactive data visualizations, reporting, and dashboarding.

5
New cards

Power Query

A component within PowerBI used to build data transformation and cleanup steps.

6
New cards

Star Schema

An optimized data model built in PowerBI that is fast for reporting and analytics.

7
New cards

DAX (Data Analysis Expressions)

The language used in PowerBI to create calculations and formulas, similar to Excel formulas but for professional data modeling.

8
New cards

PowerBI Service

A cloud-based server where PowerBI projects are published, allowing users to automate data refreshes and share reports via links.

9
New cards

SQL

Described as the 'king of working directly with data,' it is the most efficient tool for data transformations, preparations, and reshaping tables into professional models.

10
New cards

Data Warehouse

A centralized data platform organized into multiple layers, such as Bronze, Silver, and Gold, to manage complex data for a company.

11
New cards

Stored Procedures

Logic written using databases and SQL that defines how to load data through warehouse layers in sequence.

12
New cards

Data Pipeline

The automated process of moving data from source systems to the database and through various processing layers.

13
New cards

Single Point of Truth

A centralized data product within a warehouse that ensures all projects deliver the same numbers, reducing the risk of confusion and loss of trust.

14
New cards

Python

A programming language used in data projects for advanced automation, connecting to modern endpoints like APIs and streams, and wrapping around SQL for better logging and quality checks.

15
New cards

Pandas

A Python library used for table work, data cleaning, and exploration that is often faster than SQL for deep dives into specific files.

16
New cards

Great Expectations

A Python library used for implementing data quality checks within data pipelines.

17
New cards

Scikit-learn

A Python library used for advanced analytics, machine learning, and building systems that can predict the future based on data.

18
New cards

Silent Errors

Errors in data systems, such as a wrong inner join in a SQL query, where the job runs successfully without a crash but results in incorrect numbers in reports.