General Statistics & Hypothesis Testing Flashcards

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/29

flashcard set

Earn XP

Description and Tags

Flashcards containing key terms and definitions related to general statistics and hypothesis testing.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

30 Terms

1
New cards

Population

The complete set of subjects or observations about which we want to draw conclusions.

2
New cards

Sample

A subset of the population used to make inferences about the population.

3
New cards

Null Hypothesis (H₀)

A default statement to be tested, often representing 'no effect' or 'no difference.'

4
New cards

Alternative Hypothesis (Hₐ or H₁)

A statement contradicting H₀, representing the effect or difference we suspect exists.

5
New cards

Test Statistic

A calculated value (e.g., z-score, t-score) used to determine whether to reject H₀.

6
New cards

Critical Value

A threshold value from a distribution (e.g., t-distribution) that defines the rejection region for H₀.

7
New cards

p-value

The probability of observing a test statistic as extreme as, or more extreme than, the one calculated, assuming H₀ is true; Reject H₀ if p-value < α.

8
New cards

Significance Level (α)

The probability of rejecting H₀ when it is true (Type I error). Common values: 0.01, 0.05, 0.10.

9
New cards

Type I Error

Incorrectly rejecting a true H₀ (false positive).

10
New cards

Type II Error

Failing to reject a false H₀ (false negative).

11
New cards

One-Sided Test

Tests for an effect in one direction (e.g., Hₐ: μ > μ₀).

12
New cards

Two-Sided Test

Tests for any difference (Hₐ: μ ≠ μ₀).

13
New cards

Regression

A statistical method to model the relationship between a dependent variable (Y) and one or more independent variables (X).

14
New cards

Line of Best Fit (OLS Regression Line)

The line that minimizes the sum of squared residuals in a scatterplot. ( \hat{Y} = \hat{\beta}0 + \hat{\beta}1 X ).

15
New cards

Slope (β̂₁)

The estimated change in Y for a one-unit increase in X.

16
New cards

Intercept (β̂₀)

The predicted value of Y when X = 0. May not always have a meaningful interpretation.

17
New cards

Residual (êᵢ)

The difference between the observed (Yi) and the predicted (\hat{Y}i).

18
New cards

Sum of Squared Residuals (SSR)

A measure of how well the regression line fits the data: ( \sum \hat{e}_i^2 ).

19
New cards

R-squared (R²)

The proportion of variance in Y explained by X. Range: 0 (no fit) to 1 (perfect fit).

20
New cards

Homoskedasticity

Constant variance of residuals across all X values. Violation: Heteroskedasticity (uneven spread).

21
New cards

Simple Linear Regression Model

Assumes: ( Y = \beta0 + \beta1 X + \varepsilon ), where ( \mathbb{E}(\varepsilon|X) = 0 ).

22
New cards

Causal Inference

Requires the SLR assumptions to interpret (\beta_1) as the causal effect of X on Y. Challenge: Omitted variable bias.

23
New cards

ANOVA (Analysis of Variance)

Tests whether group means differ by comparing between-group and within-group variability.

24
New cards

MSTR (Mean Square Treatment)

Measures variation between sample means.

25
New cards

MSE (Mean Square Error)

Measures variation within samples.

26
New cards

Pooled-Variance t-test

Compares two means assuming equal variances.

27
New cards

Unequal-Variance t-test (Welch’s Test)

Compares two means without assuming equal variances.

28
New cards

Excel’s Analysis ToolPak

Used to perform regression, ANOVA, and hypothesis tests. Output Includes: Coefficients, standard errors, t-stats, p-values.

29
New cards

Confidence Interval

A range of values likely to contain the population parameter (e.g., slope).

30
New cards

Sampling Variation

Random fluctuations in sample statistics due to drawing different samples. Even if H₀ is true, sample results vary.