Vocab Unit 6

studied byStudied by 50 people
0.0(0)
Get a hint
Hint

Subsets

1 / 11

12 Terms

1

Subsets

Groups within a larger pool of data that are different in a specific way and thus often should be considered separately when creating linear models

New cards
2

Extrapolation

Using a linear model to predict values beyond those found within the domain of the data… can be highly unreliable, as it assumes that all other conditions still hold

New cards
3

Influential Point

A point that results in a very different slope for a regression model if it is removed

New cards
4

Outlier

A y-value that is far from its predicted value, resulting in a large initial residual… may or may not be an influential point

New cards
5

Leverage

A data value whose x-value is far from the mean of x has high leverage, like an outlier in the x direction rather than y… may or may not be an influential point

New cards
6

Lurking Variable

A hidden variable that simultaneously affects both variables in an association, accounting for the correlation that may appear between the two

New cards
7

Residuals Plot

A scatterplot of the residuals versus the x-values of an association, with the x-axis denoting a residual of 0… a residuals plot with no apparent pattern (blob), bouncing above and below the x-axis, means that the determined LSR model is appropriate

New cards
8

Re-Expression

Transforming a data set by taking the logarithm, square root, reciprocal, or some other math operation of ALL values in the data set to make it more conducive for linear regression

New cards
9

Nearly Normal Residuals Condition

To perform inference for regression, the residuals must be ~Normally distributed [linear normal probability plot]

New cards
10

Straight Enough Condition

To perform inference for regression, the association (scatterplot) studied must be ~linear [check residual plot]

New cards
11

Equal Variance Condition

To perform inference for regression, the variability of y must be ~constant for all values of x; check the spread of the residuals around the predicted value of the residuals plot

New cards
12

Standard Error of the Slope

The variation of the slope due to sampling variability, which is influenced by three factors: spread about the line (se), spread of the x-values (sx), and sample size (n)

New cards

Explore top notes

note Note
studied byStudied by 56 people
... ago
4.5(2)
note Note
studied byStudied by 18 people
... ago
5.0(1)
note Note
studied byStudied by 26 people
... ago
5.0(1)
note Note
studied byStudied by 24 people
... ago
5.0(1)
note Note
studied byStudied by 7 people
... ago
5.0(1)
note Note
studied byStudied by 22 people
... ago
5.0(1)
note Note
studied byStudied by 5 people
... ago
5.0(1)
note Note
studied byStudied by 2066 people
... ago
4.6(5)

Explore top flashcards

flashcards Flashcard (38)
studied byStudied by 52 people
... ago
5.0(1)
flashcards Flashcard (38)
studied byStudied by 4 people
... ago
5.0(1)
flashcards Flashcard (65)
studied byStudied by 1 person
... ago
5.0(1)
flashcards Flashcard (799)
studied byStudied by 10 people
... ago
5.0(2)
flashcards Flashcard (78)
studied byStudied by 5 people
... ago
5.0(1)
flashcards Flashcard (35)
studied byStudied by 21 people
... ago
5.0(1)
flashcards Flashcard (53)
studied byStudied by 2 people
... ago
4.0(1)
flashcards Flashcard (43)
studied byStudied by 5 people
... ago
5.0(1)
robot