Machine Learning

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/15

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

16 Terms

1
New cards

is machine learning AI?

Machine learning is a subset of AI

2
New cards

what is hypermarameter

a parameter whose value must be set by the researcher before learning begins

3
New cards

what is labeled data

also called training data; used in supervised learning

4
New cards

categories of supervised learning

1)regression model (output is predicted using training data); 2)classification problem (output is sorted into distinct group, like spam emails).

<p>1)regression model (output is predicted using training data); 2)classification problem (output is sorted into distinct group, like spam emails). </p>
5
New cards

unsupervised learning

1) does not use labeled data 2) a set up inputs (x) is used for analysis with no corresponding target (y) 3) the algorithm discovers underlying structure in data

6
New cards

dimension reduction

reducing the number of input, complexity of data. Kinda like if I eat and work I carpool and eat I have less to do

7
New cards

clustering

reducing data by categorizing them. Example, putting kids going to soccer and going to piano together and then drive them. * observation with a cluster are similar and different accross clusters.

<p>reducing data by categorizing them. Example, putting kids going to soccer and going to piano together and then drive them. * observation with a cluster are similar and different accross clusters.</p><p> </p>
8
New cards

deep learning

self teaching system which a computer learns from interacting with itself. can be supervised, unsupervised or reinforcement learning (train and error)

9
New cards

overfitting

the ML fits the training data too well, unable to generalize new data

<p>the ML fits the training data too well, unable to generalize new data </p>
10
New cards

descriobe in sample error

1) regarding training sample 2)bias arises from underfitted models

11
New cards

out of sample errors

1) prediction erros in validation and text sample 2) variance error from over fitted models

12
New cards

residual errors

results from randomness in the data

13
New cards

complexity reduction and cross validation solves

overfitting

14
New cards

a type of cross validation — K fold

1) split the data set into K number of sections/folds 2) first section is used to test the model, and the test are used to train model 3) this reduces the problem of holdout sample (data not used to train the sample). * usually K = 5 or 10 (sections)

15
New cards

Lasso regression

1) is an example of penalty regression (penilize for having too many variable) 2) contracts the regression coefficient to 0

16
New cards

support vector machine

an algorithm for classification, regression and outlier detection. It draws a line (3D) between data to separate them