Unit 1: Machine Learning in a Nutshell

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/13

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

14 Terms

1
New cards

Generalization

the model’s ability to adapt to previously unseen data

2
New cards

label

-target variable; attribute you are trying to represent
-represented by y

CODE: {y = df[‘colName’] }

3
New cards

feature

-input for that prediction
-often grouped together as a vector; represented by X


CODE: { X = df[feature_list] } { X = df.drop(columns=’y_colName’ }

4
New cards

supervised learning

-learnt by using prior knowledge
-attempts to discover the relationship between features and an associated label for the purpose of future prediction.

5
New cards

unsupervised learning

discovering patterns in data without the use of training data containing labeled examples; aka on its own!

  • first step in ML pipeline

6
New cards

regression

the label is any real valued number

7
New cards

classification

the label is a categorical variable

8
New cards

clustering

  • unsupervised learning technique

  • group subsets of data that are collectively similar to each other based on the similarity of their feature value

9
New cards

The ML Process

knowt flashcard image
10
New cards

Business Understanding

defining the business objectives

  • what does the business need?

example: ____

11
New cards

data understanding and preparation

transforming raw data into a form of suitable for modeling

  • what data do we have? is the data clean?

  • how do we prepare the data for our model?

example: ______

12
New cards

modeling

  • what techniques should we apply to the model?

13
New cards

evaluation

training a model and confirmed its performance on unseen data, perform additional analysis

  • is our model best suited for our problem?

example:

14
New cards

deployment

preparing to move a model into production after evaluating it

  • how do we make our model available to stakeholders and other users?

example: