Statistic Tests

Importance of statistics: Understanding and interpreting data.
Focus on the application of statistics in various fields, especially through the use of software like Microsoft Excel.

Definition: A statistical test used to determine if there is a significant difference between a sample mean and a known population mean.
Characteristics:
- There is only one group being tested (the sample).
- When to use: Comparing a sample mean to a population mean when the population variance is known.

Numerator: Difference between the sample mean and the population mean, which reflects the extent of deviation from the null hypothesis.
Denominator: Represents the standard error of the mean (SEM), which measures the dispersion of the sample mean around the population mean.

Significance: Reflects how much the sample mean varies from the population mean.
Formula for SEM: $SEM = \frac{\sigma}{\sqrt{n}}$
- Where:
- $\sigma$ = population standard deviation
- $n$ = sample size

SEM decreases as the sample size increases; larger samples provide more accurate estimates of the population mean.

Famous 7 Steps to Conduct a Z Test:
1. State the null hypothesis (H0): Assumes no effect or difference.
2. Set the level of risk: Typical alpha levels are 0.05, 0.01, etc.
3. Select the appropriate test statistic (Z test in this case).
4. Compute the test statistic value (Z value) using the formula provided.
5. Determine the critical value: This is the threshold that the computed Z value must exceed to reject the null hypothesis.
6. Compare the obtained Z value to the critical value: If Z exceeds the critical value, reject H0.
7. Decision: Conclude whether to reject or fail to reject the null hypothesis based on the comparison.

State the null and research hypothesis (H0: no difference, H1: difference exists).
Set the level of risk associated with the null hypothesis (e.g., alpha = 0.05).
Select the Z test statistic for calculation.
Compute the test statistic value using the Z test formula.
Determine the value needed for rejecting the null hypothesis (e.g., critical Z value for alpha = 0.05).
Compare the obtained Z value and the critical value.
Make a decision: If obtained Z is more extreme than critical Z, reject H0; otherwise accept H0.

Example Z value obtained: 2.38
p-value significance: p < 0.05
- Z value: Represents the test statistic used.
- 2.38: Indicates the obtained test statistic value.
- p < 0.05: Suggests that the result is statistically significant, leading to rejection of H0.

Scenario for One-Sample T-Test Use:
- Comparing a sample mean against a known mean when the population standard deviation is unknown.
- Estimating population parameters when population standard deviation cannot be obtained.

Using the Data Analysis Toolpak: Provides straightforward methods for performing t-tests.
Reference to tutorial: https://youtu.be/v-ZcqrdTcIQ for practical guidance on conducting t-tests using Microsoft Excel.

Definition: The process of making inferences about population parameters based on sample statistics.
Purpose: To determine how well our sample represents the population.

Evaluation of sample statistics (like sample mean) to ensure they act as good estimates for population parameters.
Example: Average bid for a hypothetical scenario (e.g., a game show) and assessing how confident we are in our estimates.

Point Estimate: A single value estimate of a population parameter. Example: Sampling 3 people’s estimates in a class as the point estimate of all viewers’ bids.
Confidence Intervals: Provides a range where the true population parameter is expected to fall.
- Formula: $95\% \ CI = Sample Mean \pm 1.96 \times SE$
- Significance of confidence levels (e.g., 95% vs. 99.7%).

General formula: Point estimate ± margin of error (which incorporates standard error).
Adjustment per confidence level (1.96 for 95%, and 3 for 99.7%).

Sample Mean: $\bar{X} = 34.46$
Known Population Standard Deviation: $\sigma = 5.83$
Objective: Calculate the point estimate and the 95% Confidence Interval based on sample data (216 serum albumin levels) with repetition of experiments for reliability.

Distinction between sample distribution and sampling distribution.
Differences between z-scores and Z-tests.
Appropriate conditions under which to apply a one-sample Z-test versus estimating confidence intervals with known population variance.
Conclusion on methodology and usage of statistical tests and estimates.