Lecture 9: Statistical Inference — Vocabulary

0.0(0)
studied byStudied by 0 people
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/22

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering key concepts from Lecture 9 on statistical inference.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

23 Terms

1
New cards

Intercept

The predicted value of the response when all predictors are zero; for categorical predictors, corresponds to the reference (baseline) category.

2
New cards

Slope

The change in the response for a one-unit change in a numeric predictor; in additive models, slopes are the same across groups.

3
New cards

Null model

An intercept-only model with no explanatory variables; estimates the overall mean and is analogous to a one-sample t-test.

4
New cards

Categorical explanatory variable

A qualitative predictor with distinct categories (e.g., penguin species) used in regression.

5
New cards

Dummy variables

0/1 indicators used to encode categorical predictors so each non-reference category has its own intercept.

6
New cards

Reference category

The baseline level of a categorical predictor against which other levels are compared in the model.

7
New cards

Additive model

A regression model where the effects of predictors add up with a common slope across groups; lines are parallel.

8
New cards

Interaction model

A model that includes interaction terms, allowing slopes to differ by category; lines are not parallel.

9
New cards

Dummy coding

Encoding scheme using 0/1 variables to represent categories and enable group-specific intercepts.

10
New cards

Linear model (LM)

A regression framework modeling a continuous response as a linear combination of predictors; t-tests and ANOVA are special cases.

11
New cards

ANOVA

A method to test for differences among means and to compare nested models in regression.

12
New cards

AIC (Akaike Information Criterion)

A model quality metric that balances goodness-of-fit with model complexity; lower is better; differences >~4 suggest meaningful improvement.

13
New cards

Parsimony

Preference for simpler models that sufficiently explain the data, balancing fit and complexity.

14
New cards

Confidence interval

A range around a parameter estimate that would contain the true parameter in repeated samples; a single study’s interval either contains it or not.

15
New cards

Shapiro-Wilk test

A normality test for residuals, used as an alternative or complement to QQ plots.

16
New cards

QQ plot

A diagnostic plot comparing observed quantiles to theoretical quantiles to assess normality of residuals.

17
New cards

Residuals

Differences between observed values and model-predicted values, used to assess fit and detect patterns or outliers.

18
New cards

Log transformation

Applying a logarithm to data to stabilize variance or meet model assumptions; not always effective.

19
New cards

Back-transformation

Transforming predictions from a transformed scale (e.g., log) back to the original scale for interpretation.

20
New cards

Non-independence

A violation of the assumption that observations are independent; may require alternative tests or models.

21
New cards

Population parameter

The true value of a quantity in the population (e.g., the true mean); estimated from sample data.

22
New cards

Point prediction

A single predicted value from the regression equation for a given set of predictor values.

23
New cards

Hypothesis testing

A framework for deciding if data provide evidence against a null hypothesis, often using p-values.