Machine Learning Concepts and Data Preparation

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/10

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

11 Terms

1
New cards

Structured Data

Data organized in a defined format, typically in tables with rows and columns.

2
New cards

Unstructured Data

Data that does not have a predefined format and is often text-heavy or includes multimedia.

3
New cards

Feature Matrix

A representation of independent variables in a dataset, usually denoted as X.

4
New cards

Label Vector

A representation of dependent variables or outcomes in a dataset, typically denoted as y.

5
New cards

One Hot Encoding

A technique to convert categorical variables into a binary format, where each category is represented as a single binary column.

6
New cards

Training Set

A subset of data used to train a model, allowing it to learn patterns and make predictions.

7
New cards

Testing Set

A subset of data used to evaluate the performance and accuracy of a trained model.

8
New cards

Data Preparation Steps

Procedures such as loading datasets, handling categorical variables, and scaling features.

9
New cards

Independent Variables

Predictor variables that influence the outcome in a dataset.

10
New cards

Dependent Variables

Outcome variables that are predicted based on the independent variables.

11
New cards

Missing Values

Data entries that