CIS 328 Final Exam

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/23

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

24 Terms

1
New cards

Input Variable

The input of a model/function

2
New cards

Output Variable

The output of a model/function

3
New cards

Reducible Error

The model that we construct is not a perfect estimate of the actual model, which will cause error in the prediction

4
New cards

Irreducible Error

We cannot reduce the irreducible error no matter how well we estimate the model

5
New cards

Goals of Prediction

We want to use the inputs in order to predict the values of the output variable

6
New cards

Goals of Inference

We want to know the relationship between the independent and dependent variable so that we can influence the output variable through manipulating one or more of then input variables

7
New cards

Parametric Approach

  • making assumptions about the form or shape of the model (e.g. Linear, exponential)

  • use training data to estimate the models parameters

8
New cards

Non-parametric Approach

  • Does not make assumptions about the functional form of f

  • seeks an estimated f that is:

    • as close to data points as possible

    • not too wiggly or rough

9
New cards

Regression

Usually has a quantitative response

10
New cards

Classification

Usually has a qualitative response

11
New cards

When do we use simple linear regression?

When you want to model, understand, or predict the linear relationship between one continuous independent variable (X) and one continuous dependent variable (Y)

12
New cards

Estimating coefficients

  • Estimate the coefficients using the training data

  • Find the intercept and the slope so that the line is as close as possible to all data points

  • measure closeness: least squares method

13
New cards

Assessing coefficients

  • We use standard errors to estimate the confidence intervals

  • Standard errors are also used to perform hypothesis testing

  • Details of the processes and computations are beyond the scope of this course

14
New cards

Assessing the models accuracy

  • The quality of a linear regression fit is typically assessed using two related queries

    • RSE (Residual Standard Error): an absolute measure of the lack of fit

    • R-square: an alternative measure of fit

  • R-squared is an alternative measure of fit

    • takes a form of proportion

    • always between 0 and 1

15
New cards

When to use multiple linear regression?

When you want to estimate a model with more than one predictor

16
New cards

Estimating and assessing coefficients (MLR)

  • coefficients are estimated using the same least squares approach as simple linear regression

  • the model dimension increases as the number of predictors increases

17
New cards

Important Predictors

  • if f-value is statistically significant, atleast one of the predictors is related to y

  • finding out which predictors are significant is determined by the p-value

  • Determining the r-squared of each predictor will show the biggest impact on Y

18
New cards

Multiple R-Squared

  • As more predictors are added to the regression model, the multiple R-squared keeps increasing, just due to chance.

  • a model with many predictors tends to capture random noise in data

19
New cards

Adjusted R-Square

  • a modified version of R-square which has been adjusted for number of predictors in the model

  • compares the explanatory power of the model

20
New cards

Non-linear Relationships

  • LR assumes a linear relationship between response and predictors

  • In many cases, the true response is linear

21
New cards

Polynomial Regression

  • a quadratic slope

  • model a non-linear relations in data by fitting a curve to the relationship between an independent variable and a dependent variable

22
New cards

Qualitative Predictors

  • can include qualitative variables as predictors

  • if a qualitative value has only 2 values then (0, 1) are the dummy variables

  • qualitative predictors with 3 or more values, we need to create (p-1) dummy variables.

23
New cards

Probability Sampling

  • random selection

  • allow statistical inference about entire population

24
New cards

Nonprobabilty Sampling

  • Non-random selection based on convenience or other criteria

  • easy to collect initial data

  • not support statistical inference because of bias

Explore top flashcards