Data Analysis and Modeling Concepts

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/18

flashcard set

Earn XP

Description and Tags

These flashcards cover key terms and definitions related to data analysis and modeling concepts.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

19 Terms

1
New cards

Data Cleaning

The process of transforming raw, messy data into a structured, usable format.

2
New cards

Exploratory Data Analysis (EDA)

The process of summarizing and visualizing a dataset to understand its main characteristics before formal modeling.

3
New cards

Imputation

A technique used to fill missing data using mean, median, or mode for numeric data.

4
New cards

Univariate Analysis

Examines one variable at a time, such as computing mean or median.

5
New cards

Bivariate Analysis

Explores the relationship between two variables, such as correlation or scatter plots.

6
New cards

Multivariate Analysis

Involves three or more variables to understand complex interactions.

7
New cards

Descriptive Analytics

Explains what has happened, such as summarizing historical sales.

8
New cards

Predictive Analytics

Forecasts what could happen using historical data and models, like sales forecasting.

9
New cards

Prescriptive Analytics

Recommends actions based on predictions, like pricing strategies to maximize profit.

10
New cards

SQL JOIN

Combines rows from two or more tables based on a related column.

11
New cards

INNER JOIN

Returns rows with matching values in both tables.

12
New cards

LEFT JOIN

Returns all rows from the left table and matched rows from the right table.

13
New cards

VLOOKUP

A function that searches for a value in the first column of a table and returns a value in the same row from a specified column.

14
New cards

Pivot Table

A tool in Excel that summarizes large datasets by grouping and aggregating data.

15
New cards

Subquery

A query nested within another SQL query, used in SELECT, FROM, or WHERE clauses.

16
New cards

Stored Procedures

Precompiled SQL code stored in the database, used to encapsulate complex logic.

17
New cards

List Comprehension

A concise way to create lists in Python using a single line of code.

18
New cards

DataFrame

A 2-dimensional labeled data structure in Pandas, akin to a spreadsheet or SQL table.

19
New cards

Key Performance Indicators (KPIs)

Quantifiable metrics used to evaluate the success of an organization or project.