4-1 algorithms in ML, linear regression, and logistic regression

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/24

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

25 Terms

1
New cards

model

a mathematical representation of a real world process; a predictive model forecast a future outcome based on past behaviors

2
New cards

algorithm

a set of rules used to make a calculation or solve a problem

3
New cards

regression

a prediction method whose output is a real number, a value that represents a quantity along a line

4
New cards

classification

a prediction method that assigns each data point to a predefined category, a type f operating system

5
New cards

target

dependent variable, is the output of the model or the variable you wish to predict

6
New cards
  1. supervised learning

  2. classification

  3. regression

knowt flashcard image
7
New cards

trade off between accuracy and interpretability

knowt flashcard image
8
New cards

model interpretability

  • similar models like linear regression and decision trees ar easily interpretable but might underperform

  • crucial for trust, regulatory compliance, and clinical adoption

9
New cards

model performance

  • complex models like deep neural networks often deliver higher performance but were less interpretable

  • choosing between interpretability and performance involves balancing accuracy with the need for understandable predictions

10
New cards

computational efficiency

  • training time

  • interference time

  • memory usage

  • scalability

11
New cards

bias variance tradeoff

complex models

  • capture intricate patterns

  • risk overfitting and may lack generalization

simple models

  • less detailed

  • more robust and generalize better to new data

12
New cards

balancing act

complex models

  • low bias

  • high variance

simple models

  • high bias

  • low variance

*find balance

13
New cards

linear regression

  • estimates the linear relationship between a scalar response and one or more variables

  • predicts continuous outcome based on one or more predictors/variables

  • relationship sare modeled using linear predictor functions whose unknown model parameters are estimated from the data

<ul><li><p>estimates the linear relationship between a scalar response and one or more variables</p></li><li><p>predicts continuous outcome based on one or more predictors/variables</p></li><li><p>relationship sare modeled using linear predictor functions whose unknown model parameters are estimated from the data</p></li></ul><p></p>
14
New cards

ordinary least squares

  • using gradient descent to minimize the sum of the squares of the residuals

  • the better the linear regression fits the data in comparison to the simple average, the closer the value of R2 is to 1

15
New cards

4 common regression metrics for linear regression

knowt flashcard image
16
New cards

linearity

the change in the outcome variable should be proportional to the change in the predictor variables

17
New cards

independence

the observations should be independent from each other

18
New cards

homoscedasticity

the spread of residuals should be roughly the same across all values of the predicted outcome

19
New cards

normality

the errors should follow a normal distribution

20
New cards

advantages of linear regression

  • simple model

  • computationally efficient

  • interpretability of the output

21
New cards

disadvantages of linear regression

  • overly simplistic — cannot capture real world complexity

  • linearity assumption

  • severely affected by outliers

  • independence of variables

22
New cards

advantages of logistic regression

  • easier to implement, interpret, and very efficient to train

  • easily extend to multiple classes

  • good accuracy for many simple data sets and it performs well when the dataset is linearly separable

23
New cards

disadvantages of logistic regression

  • if the number of observations is less than the number of features, it may lead to overfitting

  • the major limitation is the assumption of linearity between the dependent variable and the independent variables

24
New cards

when to use linear regression

  • outcome is continuous and unbounded

  • assume linear relationship between predictors and the outcome

  • want to quantify the effect of each predictor on the outcome

25
New cards

when to use logistic regression

  • the outcome is binary (yes/no, 1/0)

  • want to estimate probabilities of an outcome

  • interested in the odds ratios for various predictors