6. Simple Linear Regression

0.0(0)
studied byStudied by 0 people
0.0(0)
linked notesView linked note
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
full-widthPodcast
1
Card Sorting

1/22

flashcard set

Earn XP

Description and Tags

These flashcards cover key terms and definitions related to simple linear regression, helping to consolidate knowledge for exam preparation.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

23 Terms

1
New cards

Scatterplot

A graphical representation that shows the relationship between two numerical variables.

2
New cards

Predictor variable

The independent variable, plotted on the horizontal axis of a scatterplot.

3
New cards

Response variable

The dependent variable, plotted on the vertical axis of a scatterplot.

4
New cards

Strong relationship

A clear pattern of dependence between predictor and response variables, even in non-linear data.

5
New cards

Weak relationship

A lack of discernible pattern between data points in a scatterplot, making it difficult to identify relationships.

6
New cards

Correlation coefficient (r)

A measure that indicates the strength and direction of a linear relationship between two numerical variables, ranging from -1 to 1.

7
New cards

Line of best fit

A straight line that best represents the data on a scatterplot, used to make predictions.

8
New cards

Least squares method

A technique used to determine the best-fitting line by minimizing the sum of the squares of the residuals.

9
New cards

Linear model

A mathematical representation of the relationship between variables in a linear regression that can be used for prediction and inference.

10
New cards

Slope

The rate of change in the response variable for every one-unit increase in the predictor variable in a linear regression model.

11
New cards

Y-intercept

The predicted value of the response variable when the predictor variable is zero in a linear regression model.

12
New cards

Residual

The difference between the observed value and the predicted value in a regression analysis.

13
New cards

R-squared (R)

A statistical measure that represents the proportion of the variance for a dependent variable that's explained by an independent variable or variables in a regression model.

14
New cards

Assumptions of linear regression

Includes linearity, constant variability (homoscedasticity), independent observations, and normally distributed residuals.

15
New cards

Outliers

Data points that differ significantly from other observations and may affect the results of regression analysis.

16
New cards

Influential point

A data point that significantly affects the slope and intercept of the regression line.

17
New cards

Statistical inference

The process of using data from a sample to make conclusions about a population, often involving hypothesis testing.

18
New cards

Confidence interval

A range of values derived from sample data that is likely to contain the population parameter with a certain level of confidence.

19
New cards

Prediction interval

A range of values that is likely to contain the value of a new observation based on the estimated regression line.

20
New cards

Extrapolation

The act of predicting values outside the range of the observed data, which can lead to unreliable predictions.

21
New cards

Categorical predictor variable

A predictor variable that contains categories rather than numerical values.

22
New cards

Multicollinearity

A situation in regression analysis where two or more predictor variables are highly correlated.

23
New cards

Multiple Linear Regression

A type of linear regression that models the relationship between two or more predictor variables and a single response variable.