Exploring and Pre-Processing Data

0.0(0)
studied byStudied by 0 people
0.0(0)
linked notesView linked note
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/9

flashcard set

Earn XP

Description and Tags

These flashcards cover key definitions and concepts from the Business Data Mining course, focusing on variable types, statistical measures, and data pre-processing techniques.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

10 Terms

1
New cards

CRISP-DM

A data mining process model that includes phases like business understanding, data understanding, data preparation, modeling, evaluation, and deployment.

2
New cards

Variable Types

Categories of variables including categorical, ordered, intervals, and true numeric values, which determine how data can be analyzed.

3
New cards

Categorical Variable

A variable that represents types or groups and is not ordered, such as colors or names.

4
New cards

Continuous Variable

A numerical variable that supports all mathematical operations, such as weight, height, or length.

5
New cards

Boxplots

Visual representations used to summarize the distribution of a dataset and identify outliers.

6
New cards

Normalization

The process of adjusting values measured on different scales to a notionally common scale, often by subtracting the minimum value and dividing by the range.

7
New cards

Mean

The average of a set of values, calculated by summing the values and dividing by the count of values.

8
New cards

Median

The middle value in a dataset when the values are sorted in order.

9
New cards

Standard Deviation

A measure of the amount of variation or dispersion of a set of values.

10
New cards

Outlier

An observation that significantly differs from other observations in a dataset, which can skew results if not handled properly.