CRAM TIME BABBYYY (statistics)

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/71

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 12:11 AM on 2/2/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

72 Terms

1
New cards

What are the three Kolmogorov Axioms of probability?

1) $P(A) \geq 0$

2
New cards

2) $P(\Omega) = 1$

3
New cards

3) For disjoint events

$P(\cup Ai) = \sum P(Ai).

4
New cards

Define Conditional Probability $P(A|B)$

$P(A|B) = \frac{P(A \cap B)}{P(B)}$

5
New cards

State the Law of Total Probability

$P(A) = \sum{i} P(A|Bi)P(B_i)$

6
New cards

What is the formula and meaning of Bayes' Theorem?

$P(Bj|A) = \frac{P(A|Bj)P(Bj)}{\sum P(A|Bi)P(B_i)}$.

7
New cards

When are two events Independent?

When $P(A \cap B) = P(A)P(B)$

8
New cards

Difference between Mutually Exclusive and Independent events

Exclusive means they cannot happen together ($P(A \cap B) = 0$). Independent means one doesn't affect the other.

9
New cards

Define the Cumulative Distribution Function (CDF) $F_X(x)$

$F_X(x) = P(X \leq x)$. It is non-decreasing and ranges from 0 to 1.

10
New cards

What is the Probability Density Function (PDF) $f_X(x)$?

For continuous variables

11
New cards

Define the Expected Value (Mean) $E[X]$

$\int x f_X(x) dx$ (continuous) or $\sum x P(X=x)$ (discrete).

12
New cards

Define Variance $Var(X)$ and its shortcut formula

$Var(X) = E[(X - E[X])^2] = E[X^2] - (E[X])^2.

13
New cards

What is Standard Deviation $\sigma$

$\sigma = \sqrt{Var(X)}.

14
New cards

What is the Coefficient of Variation ($CV$)

$CV = \frac{\sigma}{|E[X]|}.

15
New cards

Bernoulli Distribution $Ber(p)$

Single trial: Success (1) with prob $p$

16
New cards

Binomial Distribution $Bin(n

p)$

17
New cards

Poisson Distribution $Poi(\lambda)$

Number of events in an interval with rate $\lambda$. $E[X] = Var(X) = \lambda$.

18
New cards

Geometric Distribution $Geo(p)$

Number of trials until the first success.

19
New cards

Uniform Distribution $U(a

b)$

20
New cards

Normal (Gaussian) Distribution $N(\mu

\sigma^2)$

21
New cards

ShutterstockStandard Normal Distribution $Z$

$Z \sim N(0

22
New cards

Exponential Distribution $Exp(\lambda)$

Time between events in a Poisson process. $E[X] = 1/\lambda$.

23
New cards

What is the Memoryless Property?

$P(X > s+t | X > s) = P(X > t)$. It means the remaining time until an event does not depend on how much time has passed.

24
New cards

What is a Marginal Distribution?

The distribution of one variable found by integrating/summing out others from the joint PDF/PMF.

25
New cards

Define Covariance $Cov(X

Y)$

26
New cards

Define Correlation $\rho_{XY}$

$\rho_{XY} = \frac{Cov(X

27
New cards

Independence of two variables $X

Y$

28
New cards

What is $E[aX + bY]$?

$aE[X] + bE[Y]$. (Always true).

29
New cards

What is $Var(aX + bY)$ for Independent variables?

$a^2Var(X) + b^2Var(Y).

30
New cards

State the Central Limit Theorem (CLT)

The sum of $n$ i.i.d. variables tends to a Normal distribution $N(n\mu

31
New cards

What is the Standard Error of the mean?

$SE = \frac{\sigma}{\sqrt{n}}.

32
New cards

What is a Point Estimator $\hat{\theta}$

A statistic used to estimate an unknown population parameter $\theta$.

33
New cards

Define Unbiasedness (Estimador não enviesado)

$E[\hat{\theta}] = \theta$.

34
New cards

Define Bias (Viés)

$E[\hat{\theta}] - \theta$.

35
New cards

Define Mean Squared Error (MSE)

$MSE(\hat{\theta}) = Var(\hat{\theta}) + [Bias(\hat{\theta})]^2.

36
New cards

What is the Likelihood Function $L(\theta)$?

The joint probability of the data viewed as a function of the parameter $\theta$.

37
New cards

What is the Maximum Likelihood Estimator (MLE)?

The value of $\theta$ that maximizes $L(\theta)$.

38
New cards

What is a Confidence Interval (CI)?

A range of values that has a probability $1-\alpha$ of containing the true population parameter.

39
New cards

What is a Pivotal Quantity?

A random variable that depends on the data and the parameter

40
New cards

When do we use the Student's t-distribution?

When testing the mean of a Normal population but the variance $\sigma^2$ is unknown and must be estimated from the sample.

41
New cards

What is the effect of Sample Size ($n$) on CI width?

Larger $n$ results in a narrower (more precise) interval

42
New cards

What is the CI for a Proportion $p$

$\hat{p} \pm z_{1-\alpha/2} \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}.

43
New cards

What is the Null Hypothesis ($H_0$)

The status quo or the assumption of "no effect." We only reject it if there is strong evidence.

44
New cards

Define Type I Error ($\alpha$)

Rejecting $H_0$ when it is actually true (False Positive).

45
New cards

Define Type II Error ($\beta$)

Failing to reject $H_0$ when it is actually false (False Negative).

46
New cards

What is the Significance Level ($\alpha$)

The maximum probability of committing a Type I Error that we are willing to accept.

47
New cards

What is the p-value?

The probability of seeing data as extreme as ours

48
New cards

What is the Power of a test?

$1 - \beta$. The probability of correctly rejecting a false $H_0$.

49
New cards

What is a Rejection Region ($W$)

The set of values for the test statistic that lead to rejecting $H_0$.

50
New cards

What is Linear Regression?

A model that describes the relationship between a dependent variable ($Y$) and independent variables ($x$) using a straight line.

51
New cards

What are Residuals?

The difference between observed values and the values predicted by the model: $ei = yi - \hat{y}_i$.

52
New cards

Define the Least Squares Method

Picking estimates that minimize the sum of squared residuals: $\sum (yi - \hat{y}i)^2$.

53
New cards

What does the Coefficient of Determination ($R^2$) represent?

The proportion of the variance in $Y$ that is predictable from $x$.

54
New cards

What does a Slope ($\beta_1$) of zero mean?

It means there is no linear relationship between $x$ and $Y$

55
New cards

$x$ does not help predict $Y$.

56
New cards

What is Homoscedasticity?

The assumption that the variance of the residuals is constant across all values of $x$.

57
New cards

What is the distribution of the Sample Mean $\bar{X}$?

If the population is $N(\mu

58
New cards

State the Law of Iterated Expectations

$E[X] = E[E[X|Y]].

59
New cards

What is the distribution of the Sample Variance $S^2$?

For a Normal population

60
New cards

State Chebyshev's Inequality

$P(|X - \mu| \geq k\sigma) \leq 1/k^2$.

61
New cards

What is a Quantile of order $p$

The value $xp$ such that $P(X \leq xp) = p$.

62
New cards

Explain the Correction for Continuity

Adjusting boundaries by $\pm 0.5$ when using a continuous distribution (Normal) to approximate a discrete one (Binomial/Poisson).

63
New cards

What is Fisher Information?

A measure of how much information an observable random variable carries about an unknown parameter.

64
New cards

What are Degrees of Freedom ($df$)

The number of independent pieces of information in a statistic.

65
New cards

Significance vs. Power

Significance ($\alpha$) limits false alarms (Type I)

66
New cards

Power ($1-\beta$) is the ability to find a real effect.

67
New cards

Independence vs. Uncorrelated

Independence implies uncorrelated ($\rho=0$). Uncorrelated does not necessarily imply independence (could be non-linear).

68
New cards

What is the Standard Normal Density formula?

$\phi(z) = \frac{1}{\sqrt{2\pi}} e^{-z^2/2}.

69
New cards

Expected Value of a function $E[g(X)]$

$\int g(x) f_X(x) dx$ or $\sum g(x) P(X=x).

70
New cards

Chi-square Goodness-of-Fit Test

A test to see if observed sample frequencies follow a specific theoretical distribution.

71
New cards

Total Variance Decomposition in Regression

$SST = SSR + SSE$ (Total variation = Explained variation + Error variation).

72
New cards

What is Adjusted $R^2$?

A modification of $R^2$ that accounts for the number of predictors