biostats lec lesson 8 linear regression

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/22

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

23 Terms

1
New cards

Linear regression

strives to show the relationship between two variables by applying a linear equation to observed data.

2
New cards

Linear regression

One variable is supposed to be an independent variable, and the other is to be a dependent variable.

3
New cards

regression analysis

the relationship between X and Y is described by means of an equation of the curve which best fits the data.

4
New cards

regression equation

After the _________________ has been derived, values of independent variables can be substituted in order to determine the predicted values of the dependent variable.

5
New cards

EFFECT OF EACH INDEPENDENT VARIABLE

measured by the magnitude and sign of the corresponding regression coefficient.

6
New cards

LINEAR REGRESSION ANALYSIS

Applied when the relationship between X and Y can be described by a straight line

7
New cards

MULTIPLE REGRESSION ANALYSIS.

Applied when the effects of two or more independent variables are simultaneously considered

8
New cards

normality, homoscedasticity and independence.

The valid application of simple linear regression analysis entails three basic assumptions regarding the dependent variable, Y, namely ________________.

9
New cards

method of least-squares

The most common method for fitting a regression line is the method of least-squares.

10
New cards

method of least-squares

This method calculates the best-fitting line for the observed data by minimizing the sum of the squares of the vertical deviations from each data point to the line (if a point lies on the fitted line exactly, then its vertical deviation is 0).

11
New cards

first step

finding the equation of the line which best fits the data.

12
New cards

proximity

However, if one were also to consider the broken lines besides C, choosing between the two lines will now be a difficult decision, unless a more specific definition of "____________" will be given.

13
New cards

proximity

the concept of "________________" to the given data is also used in determining the best fitting line, but it is expressed in measureable terms by the method of least squares.

14
New cards

X

is the independent variable and plotted along the x-axis

15
New cards

Y

is the dependent variable and plotted along the y-axis

16
New cards

b

The slope of the line is __________

17
New cards

a

is the intercept (the value of y when x = 0).

18
New cards

method of least squares

Using the _______________ as criterion for selecting the best-fitting line, the values of the slope and the intercept should be computed using the formulas.

19
New cards

corollary

A _________________ question that may be asked is the extent to which the independent variable, X, can be used to predict values of the dependent variable, Y.

20
New cards

R2

The first approach is to determine the value of the coefficient of determination.

21
New cards

β

The second is to test the null hypothesis that the regression coefficient for the population, __________ , is equal to zero.

22
New cards

THE COEFFICIENT OF DETERMINATION, R2

measures the proportion of the total variability in the dependent variable, Y, that can be explained by or attributed to the independent variable, X. It can be computed by getting the square of the correlation coefficient, r.

23
New cards

1. For any fixed value of X, Y has a normal distribution. 2. The variance of Y is the same for any value of X.

3. The value of Y at one value of X does not depend on, and is not affected by the value of Y at another value of X.

These are the assumptions: