Comprehensive AP Statistics Study Guide: Significance Tests, Confidence Intervals, and Calculator Operations

Significance Tests for Proportions and Means

In AP Statistics, significance tests are structured procedures for evaluating claims about population parameters based on sample data. For proportions, there are two primary tests. The One-sample z-test for a proportion uses the null hypothesis H0:p=p0H_0: p = p_0. The alternative hypothesis HaH_a can state that the true proportion is greater than, less than, or different from the null value. Conditions for inference include a random sample, the sample size being less than or equal to 10%10\% of the population (n10%Nn \le 10\% N), and the Large Counts condition (np010n p_0 \ge 10 and n(1p0)10n(1 - p_0) \ge 10). The test statistic is calculated as z=p^p0p0(1p0)nz = \frac{\hat{p} - p_0}{\sqrt{\frac{p_0(1 - p_0)}{n}}}. This test is performed on the calculator using the 1-PropZTest function.

The Two-sample z-test for a difference in proportions evaluates the null hypothesis H0:p1p2=0H_0: p_1 - p_2 = 0. It requires independent random samples or a randomized experiment. Conditions specify that samples must be less than 10%10\% of their respective populations (n110%N1n_1 \le 10\% N_1 and n210%N2n_2 \le 10\% N_2). The Large Counts condition for this test uses a pooled (combined) proportion, p^c=X1+X2n1+n2\hat{p}_c = \frac{X_1 + X_2}{n_1 + n_2}. The requirements are n1p^c10n_1 \hat{p}_c \ge 10, n1(1p^c)10n_1(1 - \hat{p}_c) \ge 10, n2p^c10n_2 \hat{p}_c \ge 10, and n2(1p^c)10n_2(1 - \hat{p}_c) \ge 10. The test statistic is z=(p^1p^2)0p^c(1p^c)n1+p^c(1p^c)n2z = \frac{(\hat{p}_1 - \hat{p}_2) - 0}{\sqrt{\frac{\hat{p}_c(1 - \hat{p}_c)}{n_1} + \frac{\hat{p}_c(1 - \hat{p}_c)}{n_2}}} and the calculator function is 2-PropZTest.

For means, the One-sample t-test for a mean (or paired t-test) tests H0:μ=μ0H_0: \mu = \mu_0. It assumes a random sample or randomized experiment and that n10%Nn \le 10\% N. The Normality condition is satisfied if the population is normal, the sample data show no strong skew or outliers, or the sample size is large enough via the Central Limit Theorem (n30n \ge 30). The test statistic is t=xˉμ0snt = \frac{\bar{x} - \mu_0}{\frac{s}{\sqrt{n}}} with degrees of freedom df=n1df = n - 1. The calculator function is T-Test. The Two-sample t-test for a difference in means tests H0:μ1μ2=0H_0: \mu_1 - \mu_2 = 0. Conditions involve independent random samples, the 10%10\% rule for both populations, and the Normality condition for both groups (n1,n230n_1, n_2 \ge 30 or normal distributions). The statistic is t=(xˉ1xˉ2)(μ1μ2)s12n1+s22n2t = \frac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{\sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}}. Degrees of freedom are taken as the smaller of n11n_1 - 1 and n21n_2 - 1 or determined via technology. The calculator function is 2-SampTTest.

Inference for Slope

The t-test for a slope is used to determine if there is a linear relationship between two quantitative variables. The null hypothesis is typically H0:β=β0H_0: \beta = \beta_0 (often β=0\beta = 0). Five conditions must be met: 1) the relationship between xx and yy is fairly linear; 2) the sample size is less than 10%10\% of the population (n10%Nn \le 10\% N); 3) for each xx, the distribution of yy is normal; 4) for each xx, yy has approximately the same standard deviation; and 5) the data comes from a random sample or randomized experiment. The test statistic is t=bβ0SEbt = \frac{b - \beta_0}{SE_b} where bb is the sample slope and SEbSE_b is the standard error of the slope. The degrees of freedom are df=n2df = n - 2. The calculator command is LinRegTTest.

Confidence Intervals for Proportions, Means, and Slope

Confidence intervals estimate the true value of a population parameter. For a single proportion, the One-sample z-interval uses the formula p^±zp^(1p^)n\hat{p} \pm z^* \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}, where the Large Counts condition relies on observed counts (np^10n\hat{p} \ge 10 and n(1p^)10n(1 - \hat{p}) \ge 10). For the difference in proportions, the formula is (p^1p^2)±zp^1(1p^1)n1+p^2(1p^2)n2(\hat{p}_1 - \hat{p}_2) \pm z^* \sqrt{\frac{\hat{p}_1(1 - \hat{p}_1)}{n_1} + \frac{\hat{p}_2(1 - \hat{p}_2)}{n_2}}.

For means, a One-sample t-interval is calculated as xˉ±t(sn)\bar{x} \pm t^* \left(\frac{s}{\sqrt{n}}\right) with df=n1df = n - 1. A Two-sample t-interval for the difference in means is (xˉ1xˉ2)±ts12n1+s22n2(\bar{x}_1 - \bar{x}_2) \pm t^* \sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}. The t-interval for a slope is given by b±tSEbb \pm t^* SE_b with df=n2df = n - 2. Each interval requires specific conditions such as randomness, independence (10%10\% rule), and normality/large sample size as previously detailed for significance tests. Calculator functions include 1-PropZInt, 2-PropZInt, TInterval, 2-SampTInt, and LinRegTInt.

Chi-Square Tests

Chi-Square tests are applied to categorical data. All three Chi-Square tests share the same basic formula for the test statistic: χ2=(OE)2E\chi^2 = \sum \frac{(O - E)^2}{E}, where OO is the observed count and EE is the expected count. In all cases, all expected counts must be greater than 55.

The Chi-Square test for Goodness-of-Fit (GOF) evaluates if a categorical variable's claimed distribution is correct. H0H_0 states the claimed distribution is correct, while HaH_a states it is incorrect. The degrees of freedom are df = \text{# of categories} - 1. The calculator command is \chi^2GOF-Test.

The Chi-Square test for Homogeneity determines if the distribution of a categorical variable is the same across multiple populations or treatments. H0H_0 states there is no difference in the distribution across populations. The degrees of freedom are calculated as df=(r1)(c1)df = (r - 1)(c - 1) where rr is rows and cc is columns. The calculator command is \chi^2-Test.

The Chi-Square test for Independence checks for an association between two categorical variables within a single population. H0H_0 states there is no association or the variables are independent. The degrees of freedom are also df=(r1)(c1)df = (r - 1)(c - 1), and the calculator command is \chi^2-Test. Observed counts for these tests (Homogeneity and Independence) must be entered into a matrix, typically Matrix A, on the calculator.

Identifying the Correct Significance Test

To choose the appropriate significance test, first look for indicators like the phrase "Do the data provide convincing statistical evidence." If the data involve Minitab output or a linear relationship, a Linear Regression t-test is required. If the data are presented in a table of frequencies, a Chi-Square test is indicated: use Goodness-of-Fit for one variable across one population, Homogeneity for one variable across multiple populations, and Independence for two variables in one population.

If the data are not in a frequency table, determine if they involve Proportions (%, proportions) or Means (averages). For proportions, check the number of samples: one sample requires a 1-prop z-test, while two samples require a 2-prop z-test. For means, check the number of samples: one sample or a paired experiment requires a 1-mean t-test (paired t-test), while two separate groups require a 2-mean t-test.

Calculator Functions and Usage

For One Variable Data, use 1-Var Stats to find the mean, standard deviation, and five-number summary. For Two Variable Data, LinReg (a + bx) provides the least squares regression line, correlation (rr), and coefficient of determination (r2r^2). Use DiagnosticOn to ensure rr and r2r^2 are displayed.

Probability calculations include:

  • binompdf(n, p, X): Probability of exactly XX successes (nn trials, probability pp).
  • binomcdf(n, p, X): Probability of at most XX successes (P(Xx)P(X \le x)).
  • normalcdf(lower, upper, mean, SD): Find the area/probability for an interval in a normal distribution.
  • invNorm(area left, mean, SD): Find a boundary value for a given area in a normal distribution.
  • tcdf(lower, upper, df): Find the area for an interval in a t-distribution.
  • invT(area left, df): Find a boundary value for a t-distribution.
  • \chi^2cdf(lower, upper, df): Find the area for an interval in a chi-square distribution.

Confidence Intervals and Significance Tests commands require specific inputs such as successes (xx), sample size (nn), means (xˉ\bar{x}), standard deviations (sxs_x), and confidence levels (C-Level). Note that for two-sample t-procedures, the "Pooled" option should generally be set to "No."

Statistical Practice Problems

  1. Proportion Significance Test: A random sample of 100100 students found 5454 support a schedule change. The principal tests if more than 50%50\% support it (H0:p=0.5H_0: p = 0.5, Ha:p>0.5H_a: p > 0.5).    Function: 1-PropZTest. Inputs: p0=0.5p_0 = 0.5, x=54x = 54, n=100n = 100, prop >p0> p_0.    P-value: 0.21190.2119.

  2. Difference in Means Confidence Interval: Samples of burgers in U.S. (xˉ1=4.53\bar{x}_1 = 4.53, s1=0.24s_1 = 0.24, n1=15n_1 = 15) and Japan (xˉ2=4.01\bar{x}_2 = 4.01, s2=0.38s_2 = 0.38, n2=10n_2 = 10).    Function: 2-SampTInt. Inputs: xˉ1=4.53\bar{x}_1 = 4.53, s1=0.24s_1 = 0.24, n1=15n_1 = 15, xˉ2=4.01\bar{x}_2 = 4.01, s2=0.38s_2 = 0.38, n2=10n_2 = 10, C-Level: 0.990.99, Pooled: No.    Result: (0.1478,0.8922)(0.1478, 0.8922).

  3. Normal Boundary Calculation: Croatian males height XN(180,72)X \sim N(180, 7^2). Tallest 5%5\% corresponds to area left = 0.950.95.    Function: invNorm. Inputs: area =0.95= 0.95, μ=180\mu = 180, σ=7\sigma = 7.    Answer: 191.51cm191.51\,cm.

  4. Binomial Probability: Natural blackjack probability=4.5%\text{Natural blackjack probability} = 4.5\%. P(X3)P(X \ge 3) in 2020 rounds.    Function: 1binomcdf(n,p,X)1 - binomcdf(n, p, X). Inputs: 1binomcdf(20,0.045,2)1 - binomcdf(20, 0.045, 2).    Answer: 0.06010.0601.

  5. Mean Confidence Interval: Sample of 4040 professors, xˉ=5.4\bar{x} = 5.4, s=1.6s = 1.6.    Function: TInterval. Inputs: xˉ=5.4\bar{x} = 5.4, s=1.6s = 1.6, n=40n = 40, C-Level: 0.950.95.    Result: (4.8872,5.9128)(4.8872, 5.9128).

  6. Binomial Exact Successes: 3%3\% Siberian Huskies have heterochromia. P(X=3)P(X = 3) in sample of 100100.    Function: binompdf. Inputs: n=100n = 100, p=0.03p = 0.03, X=3X = 3.    Answer: 0.22750.2275.

  7. Difference in Proportions Confidence Interval: 2012 sample: 10/10,00010/10,000. 2016 sample: 100/20,000100/20,000.    Function: 2-PropZInt. Inputs: x1=10x_1 = 10, n1=10,000n_1 = 10,000, x2=100x_2 = 100, n2=20,000n_2 = 20,000, C-Level: 0.950.95.    Result: (0.0054,0.0026)(-0.0054, -0.0026). (Note: Change is p2016p2012p_{2016} - p_{2012}).

  8. P-value from t-statistic: Test for mean salary >$45,327> \$45,327 with n=10n = 10 and t=2.51t = 2.51.    Function: tcdf(lower, upper, df). Inputs: tcdf(2.51, 999, 9).    Answer: 0.01670.0167.

  9. Chi-Square Test Statistic: Vehicles (Car, SUV, Truck) vs. (Owned, Leased).    Function: \chi^2-Test. Matrix A: [[29,20,11],[21,10,4]][[29, 20, 11], [21, 10, 4]].    Test statistic (calculated χ2\chi^2): 1.531.53.

  10. Normal Distribution Probability: GRE Verbal XN(150,8.52)X \sim N(150, 8.5^2). P(145X160)P(145 \le X \le 160).     Function: normalcdf. Inputs: lower =145= 145, upper =160= 160, μ=150\mu = 150, σ=8.5\sigma = 8.5.     Answer: 0.60530.6053.

  11. Significance Test for Difference in Means: Machine A (xˉ1=1.35\bar{x}_1 = 1.35, s1=0.10s_1 = 0.10, n1=10n_1 = 10), Machine B (xˉ2=1.42\bar{x}_2 = 1.42, s2=0.08s_2 = 0.08, n2=10n_2 = 10). Testing for a difference (Ha:μ1μ2H_a: \mu_1 \ne \mu_2).     Function: 2-SampTTest. Inputs: Stats, xˉ1=1.35\bar{x}_1 = 1.35, s1=0.10s_1 = 0.10, n1=10n_1 = 10, xˉ2=1.42\bar{x}_2 = 1.42, s2=0.08s_2 = 0.08, n2=10n_2 = 10, μ1μ2\mu_1 \ne \mu_2, Pooled: No.     P-value: 0.09690.0969.