Machine Learning: Data Pre-processing

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/15

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 6:46 AM on 4/28/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

16 Terms

1
New cards

Data Quality

A measure of how well data meets a specific need, affecting the accuracy and reliability of predictions.

2
New cards

Data Pre-processing

The process of transforming raw data that contains inconsistencies into a format that improves model performance.

3
New cards

Accuracy

The degree to which data correctly represents an object in a real-world context.

4
New cards

Completeness

A measure of how many missing values a dataset contains.

5
New cards

Consistency

The absence of difference between instances stored in multiple locations.

6
New cards

Timeliness

The degree to which data represents reality at a given point or period in time.

7
New cards

Uniqueness

The absence of duplicate instances within a dataset.

8
New cards

Validity

How well data conforms to a specified format.

9
New cards

Data Cleaning

The process of removing missing values, duplicates, and incorrectly formatted data.

10
New cards

Data Integration

The process of combining data from different sources into a unified view.

11
New cards

Data Reduction

The process of reducing the dimensionality of a dataset, simplifying the data.

12
New cards

Data Transformation

The process of converting features into a format suitable for specific models or algorithms.

13
New cards

Missing Values

Occur when a feature has no recorded value for an instance in a dataset.

14
New cards

Missing Completely at Random (MCAR)

Values that have the same probability of being missing for all cases.

15
New cards

Missing at Random (MAR)

Values that have the same probability of being missing for specific observable cases.

16
New cards

Missing Not at Random (MNAR)