Ch 7 linear regression



1

what is regression analysis?

a statistical procedure that uses observed data to develop an equation showing how the variables are related.

2

what is a dependent variable?

the variable being predicted. in statistical notation: y = dependent variable

3

what is an independent variable or predictor variable?

variables being used to predict the value of the dependent variable. in statistical notation: x = independent variable

4

what is linear regression?

a regression analysis in which any one-unit change in the independent variable, x, is assumed to result in the same change in the dependent variable, y.

5

what is simple linear regression?

a regression analysis involving one independent variable and one dependent variable.

6

what is multiple linear regression?

regression analysis involving two or more independent variables.

7

what is the simple linear regression model?

  • the equation that describes how y is related to x and an error term

  • simple linear regression model: y = B0 + B1x + E

  • parameters: the characteristics of the population, B0 and B1

  • random variable: error term, E

  • the error term accounts for the variability in y that cannot be explained by the linear relationship between x and y

8

what is the estimated regression equation?

  • the parameter values are usually not known and must be estimated using sample data

  • sample statistics (denoted b0 and b1) are computed as estimates of the population parameters B0 and B1

  • the estimated regression equation is obtained by substituting the values of the sample statistics b0 and b1 for B0 and B1 in the regression equation

9

what is the estimated simple linear regression equation?

  • y^ = b0 + b1x

  • y^ = point estimator of E(y|x)

  • b0 = estimated y-intercept

  • b1 = estimated slope

  • the graph of the estimated simple linear regression equation is called the estimated regression line

10

what is the estimated regression line?

the graph of the estimated simple linear regression equation

11

possible lines in simple linear regression:

[image: possible regression lines]
12

what is the least squares method?

  • a procedure for using sample data to find the estimated regression equation

  • determines the values of b0 and b1 that minimize the sum of squared residuals
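
A minimal sketch of the least squares calculation for one independent variable. It assumes the standard closed-form solution, b1 = Σ(xi - x̄)(yi - ȳ) / Σ(xi - x̄)² and b0 = ȳ - b1x̄, which the card does not spell out. The data and the helper name `least_squares` are hypothetical.

```python
# Hypothetical sample data, chosen only for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]


def least_squares(x, y):
    """Return (b0, b1) for the estimated equation y^ = b0 + b1*x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Numerator and denominator of the slope formula.
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = s_xy / s_xx
    b0 = y_bar - b1 * x_bar
    return b0, b1


b0, b1 = least_squares(x, y)
print(f"y^ = {b0:.2f} + {b1:.2f}x")
```

For this data the estimated slope is about 0.6 and the estimated intercept about 2.2.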

13

what is the interpretation of b0 and b1 in the least squares method?

  • the slope b1 is the estimated change in the mean of the dependent variable y that is associated with a one unit increase in the independent variable x

  • the y-intercept b0 is the estimated value of the dependent variable y when the independent variable x is equal to zero.

14

what is the ith residual?

  • the error made using the regression model to estimate the mean value of the dependent variable for the ith observation

    • denoted as ei = yi - y^i

    • least squares finds the regression equation that minimizes the sum of squared errors
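
A short sketch of the residual calculation, assuming hypothetical data and the least squares estimates b0 = 2.2 and b1 = 0.6 that this data produces.

```python
# Hypothetical data and its least squares estimates (assumed, for illustration).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
b0, b1 = 2.2, 0.6

# Fitted value y^i for each observation.
y_hat = [b0 + b1 * xi for xi in x]

# ith residual: ei = yi - y^i.
residuals = [yi - yh for yi, yh in zip(y, y_hat)]

# The quantity least squares minimizes.
sum_sq_errors = sum(e ** 2 for e in residuals)

print([round(e, 2) for e in residuals])
print(round(sum_sq_errors, 2))
```

The residuals here are roughly -0.8, 0.6, 1.0, -0.6 and -0.2, and they sum to approximately zero, as residuals from a least squares line with an intercept always do.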

15

what is an experimental region?

the range of values of the independent variables in the data used to estimate the model

  • the regression model is valid only over this region
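
One way to respect the experimental region in practice is to refuse predictions outside the observed range of x. This is only a sketch: the data, the coefficients, and the `predict` helper are hypothetical, and raising an error (rather than, say, warning) is a design choice made here for illustration.

```python
# Hypothetical sample values of x and assumed estimates b0, b1.
x_sample = [1, 2, 3, 4, 5]
b0, b1 = 2.2, 0.6


def predict(x_new):
    """Return y^ for x_new, but only inside the experimental region."""
    lo, hi = min(x_sample), max(x_sample)
    if not lo <= x_new <= hi:
        raise ValueError(
            f"x = {x_new} is outside the experimental region [{lo}, {hi}]"
        )
    return b0 + b1 * x_new


print(predict(3))
```

Calling `predict(10)` would raise `ValueError`, because 10 lies outside the range of x used to estimate the model.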

16

what is the sum of squares due to error (SSE)?

the value of SSE is a measure of the error in using the estimated regression equation to predict the values of the dependent variable in the sample.

SSE = Σ ei² = Σ (yi - y^i)²

17

what is the total sum of squares (SST)?

the difference yi - ȳ provides a measure of the error involved in using the sample mean ȳ to predict the dependent variable (e.g. travel time) for the ith observation. SST = Σ (yi - ȳ)²

18

what is sum of squares due to regression (SSR)?

measures how much the y^ values on the estimated regression line deviate from the sample mean ȳ. SSR = Σ (y^i - ȳ)². relation between SST, SSR, and SSE: SST = SSR + SSE

19

what is the coefficient of determination?

  • the ratio SSR/SST used to evaluate the goodness of fit for the estimated regression equation.

  • r² = SSR/SST

  • takes values between zero and one

  • interpreted as the percentage of the total sum of squares that can be explained by using the estimated regression equation

  • square of the correlation between yi and y^i

  • referred to as the simple coefficient of determination in simple regression
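
A small sketch tying the three sums of squares to r², assuming hypothetical data and the least squares estimates b0 = 2.2 and b1 = 0.6 for that data.

```python
# Hypothetical data and its least squares estimates (assumed, for illustration).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
b0, b1 = 2.2, 0.6

y_bar = sum(y) / len(y)
y_hat = [b0 + b1 * xi for xi in x]

sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))  # error
sst = sum((yi - y_bar) ** 2 for yi in y)               # total
ssr = sum((yh - y_bar) ** 2 for yh in y_hat)           # regression

r_squared = ssr / sst

print(f"SSE={sse:.2f} SST={sst:.2f} SSR={ssr:.2f} r^2={r_squared:.2f}")
```

Here SSE is about 2.4, SSR about 3.6 and SST is 6, so SST = SSR + SSE holds and r² is about 0.6: roughly 60% of the total sum of squares is explained by the estimated regression equation.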

20

what is the slope coefficient Bj?

represents the change in the mean value of the dependent variable y that corresponds to a one unit increase in the independent variable xj, holding the values of all other independent variables in the model constant.

21

what is the multiple regression equation that describes how the mean value of y is related to x1, x2, … , xq?

E( y | x1, x2, … , xq ) = B0 + B1x1 + B2x2 + … + Bqxq
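
A minimal sketch of evaluating this equation for given coefficients. The `mean_response` helper and the coefficient values are hypothetical, chosen only to show the arithmetic and the "holding all else constant" reading of a slope coefficient.

```python
def mean_response(coeffs, xs):
    """E(y | x1..xq) = B0 + B1*x1 + ... + Bq*xq, with coeffs = [B0, B1, ..., Bq]."""
    if len(coeffs) != len(xs) + 1:
        raise ValueError("need one intercept plus one coefficient per variable")
    return coeffs[0] + sum(b * x for b, x in zip(coeffs[1:], xs))


# Hypothetical parameters: B0 = 1.0, B1 = 0.5, B2 = 2.0.
betas = [1.0, 0.5, 2.0]

base = mean_response(betas, [4, 3])      # x1 = 4, x2 = 3
bumped = mean_response(betas, [5, 3])    # x1 increased by one, x2 held constant

print(base)           # 1.0 + 0.5*4 + 2.0*3 = 9.0
print(bumped - base)  # equals B1 = 0.5
```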

22

what is statistical inference?

process of making estimates and drawing conclusions about one or more characteristics of a population (the value of one or more parameters) through the analysis of sample data drawn from the population

23

in regression, inference is commonly used to estimate and draw conclusions about:

  • the regression parameters B0, B1, B2, … , Bq

  • the mean value and/or the predicted value of the dependent variable y for specific values of the independent variables x1*, x2* , …, xq*

  • consider both hypothesis testing and interval estimation

24

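conditions necessary for valid inference in the least squares regression model:

  1. for any combination of values of the independent variables x1, x2, …, xq, the population of potential error terms E is normally distributed with a mean of 0 and a constant variance

  2. the values of E are statistically independent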

25

testing individual regression parameters:

  • to determine whether statistically significant relationships exist between the dependent variable y and each of the independent variables x1, x2, … , xq individually

    • if Bj = 0, there is no linear relationship between the dependent variable y and the independent variable xj

    • if Bj ≠ 0, there is a linear relationship between y and xj
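
A sketch of the usual t test of H0: B1 = 0 in simple linear regression, assuming the standard formulas t = b1 / s_b1 with s_b1 = s / sqrt(Σ(xi - x̄)²) and s = sqrt(SSE / (n - 2)), none of which appear on the card. The data is hypothetical, and the critical value 3.182 is the two-tailed t table value for 0.05 significance with n - 2 = 3 degrees of freedom.

```python
import math

# Hypothetical sample data.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)

# Least squares estimates.
x_bar = sum(x) / n
y_bar = sum(y) / n
s_xx = sum((xi - x_bar) ** 2 for xi in x)
b1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / s_xx
b0 = y_bar - b1 * x_bar

# Standard error of the estimate, then standard error of b1.
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
s = math.sqrt(sse / (n - 2))
se_b1 = s / math.sqrt(s_xx)

t_stat = b1 / se_b1

T_CRIT = 3.182  # two-tailed critical value, alpha = 0.05, df = 3 (from a t table)
reject_h0 = abs(t_stat) > T_CRIT

print(f"t = {t_stat:.3f}, reject H0: {reject_h0}")
```

With this small sample the t statistic is about 2.12, below the critical value, so H0: B1 = 0 is not rejected at the 0.05 level.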

26

what is a confidence interval?

an estimate of a population parameter that provides an interval believed to contain the value of the parameter at some level of confidence
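
As one regression-specific illustration, a confidence interval for the slope B1 is commonly computed as b1 ± t × s_b1. The numbers below are hypothetical: b1 = 0.6, a mean squared error of 0.8, Σ(xi - x̄)² = 10, and the t table value 3.182 for 95% confidence with 3 degrees of freedom.

```python
import math

# Hypothetical inputs (assumed, for illustration only).
b1 = 0.6                      # estimated slope
se_b1 = math.sqrt(0.8 / 10)   # sqrt(MSE / sum of squared deviations of x)
T_CRIT = 3.182                # t table value, 95% confidence, df = 3

margin = T_CRIT * se_b1
ci = (b1 - margin, b1 + margin)

print(f"95% CI for B1: ({ci[0]:.3f}, {ci[1]:.3f})")
```

The resulting interval runs from about -0.3 to about 1.5. Because it contains zero, it agrees with a t test that fails to reject B1 = 0 at the 0.05 level.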

27

what is a confidence level?

indicates how frequently interval estimates based on samples of the same size taken from the same population using identical sampling techniques will contain the true value of the parameter we are estimating

28

addressing nonsignificant independent variables:

  • if practical experience dictates that the nonsignificant independent variable has a relationship with the dependent variable, the independent variable should be left in the model

  • if the model sufficiently explains the dependent variable without the nonsignificant independent variable, then consider rerunning the regression without the nonsignificant independent variable

  • the appropriate treatment of the inclusion or exclusion of the y-intercept when b0 is not statistically significant may require special consideration

29

testing for an overall regression relationship:

  • use an F-test based on the F probability distribution

  • if the F-test leads us to reject the hypothesis that the values of B1, B2, …, Bq are all zero:

    • conclude that there is an overall regression relationship

    • otherwise, conclude that there is no overall regression relationship
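
A sketch of the F statistic, assuming the standard form F = (SSR / q) / (SSE / (n - q - 1)), which the card does not state. The sums of squares, sample size, and the `f_statistic` helper are hypothetical; 10.13 is the F table value for 0.05 significance with 1 and 3 degrees of freedom.

```python
def f_statistic(ssr, sse, n, q):
    """F = MSR / MSE for a regression with q independent variables and n observations."""
    msr = ssr / q              # mean square due to regression
    mse = sse / (n - q - 1)    # mean square error
    return msr / mse


# Hypothetical values: SSR = 3.6, SSE = 2.4, n = 5 observations, q = 1 variable.
f_stat = f_statistic(ssr=3.6, sse=2.4, n=5, q=1)

F_CRIT = 10.13  # F table value, alpha = 0.05, df = (1, 3)
overall_relationship = f_stat > F_CRIT

print(f"F = {f_stat:.2f}, overall relationship: {overall_relationship}")
```

Here F is about 4.5, below the critical value, so the hypothesis that all slope parameters are zero is not rejected. With a single independent variable, F equals the square of the t statistic for b1.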
