U5

Studied by 0 people

0.0(0)

View linked note

get a hint

hint

A sampling distribution is

1 / 42

There's no tags or description

Looks like no one added any tags here yet for you.

43 Terms

A sampling distribution is

a distribution of all possible samples of a given size drawn from a population. It represents the distribution of statistics (such as the mean or proportion) calculated from these samples

New cards

_____ (mu,p) describes Population, _____ (lined x, hatted p) describes Sample
- _____ (lined x, hatted p) estimates corresponding ____ (mu , p)
- unbiased estimator: statistic whose mean is ____ to value of ____ being estimated (does NOT consistently either over/underestimate value of parameter)
take every possible sample of size n & graph some ____ → sampling distribution: frequency distribution of ALL possible values of a statistic (we can only approximate)

parameter, statistic, statistic, parameter, equal, parameter, statistics

New cards

10% Condition (review 6.3)
- often can assume 10% Condition: “Assuming the population size it at least …”
  - Regardless of population size, take same ____ size for same ____ assuming equal probabilities between distributions

sample, variability

New cards

If claim of some probability for some interval of parameter, but sampling distribution has significantly small probability of that interval, it is (un/likely) for that probability’s smallness to be due to chance, so the claim is likely ____ (precursor to significance tests)

unlikely, false

New cards

confidence interval:

estimate population parameter from sample statistic

New cards

C% confidence interval: statistic (unbiased point estimator) ± margin of error, where margin of error = critical value * standard deviation of population (if don’t know standard deviation of population, must use ______ as estimate)
- critical value depends on ____ level
- confidence level (does/n’t) tell the chance that a particular interval captures the parameter; the parameter is ___ already, so one you make a sample, the interval either contains or does not contain the parameter
  - Instead (interpretation): “The interval was constructed using a method that produces intervals that capture the true (parameter) in C% of all possible samples of size (n).”

standard error, confidence, doesn’t, fixed

New cards

How is a mean sample distribution found?

you take the average of all means (quantitative) or proportions (categorical) of each possible sample size (n) and use these averages as your data points

New cards

How do you find the mean of a discrete random variable?

New cards

How do you find the standdev of a DRV?

New cards

A population parameter is a measure of

a characteristic of a population, such as the mean or proportion of a certain attribute

New cards

A sample statistic is a measure of a

characteristic of a sample, such as the mean or proportion of a certain attribute

New cards

Sample statistics estimate

population parameter

New cards

What are these variable letters for population parameter: standard deviation, mean, proportion

lowercase sigma, mu, rho

New cards

What are these variable letters for sample statistic: mean, standard deviation, proportion

barred x, s, hatted p

New cards

The mean, median, and mode of a normal distribution are all (different/equal), and the distribution is_____l, meaning that it has only one peak or mound.

equal, unimodal

New cards

The center of a normal model is represented by the ___, which is the arithmetic average of the data. The shape of a normal model is characterized by the ___-shaped curve, which is symmetrical around the ____. The spread of a normal model is represented by the _________ which measures the dispersion or ____ of the data around the mean.

mean, bell, mean, standard deviation, spread

New cards

A model displaying categorical or proportion-based data is nearly normal if

The number of successes and failures is at least 10 (Large Counts)

New cards

A model displaying quantitative or mean-based data is nearly normal if

The sample size is at least 30 (Central Limit Theorem which you’ll learn about in the next post)

Population is normally distributed

New cards

A z-score indicates how many

standard deviations above or below the mean a piece of data is

New cards

A z score (does/n’t) contain units

doesn’t

New cards

When asked to interpret a z-score, it is imperative that you indicate the

direction (pos or neg)

New cards

The Central Limit Theorem states that

if a sample size (n) is large enough, the sampling distribution of the sample mean will be approximately normal, regardless of the shape of the population distribution.

New cards

a sample size of n > _ is considered to be large enough for the Central Limit Theorem to hold.

New cards

for the Central Limit Theorem to apply, the samples must be ______ of each other and the sample must be a______. Additionally, it is generally recommended to use the Central Limit Theorem only for____ variables, as it may not hold for_____ variables with small sample sizes.

independent, SRS, continuous, discrete

New cards

An unbiased estimator is one that

produces estimates that are on average as close as possible to the true population parameter

New cards

A sample is unbiased if the estimator value (_____) is equal to the ______ _____

sample statistic, population parameter

New cards

A sampling distribution has a minimum amount of variability (spread) if all samples have statistics that are approximately ____ to one another

equal

New cards

It is (im/possible) to have no variability, due to the nature of random sampling

impossible

New cards

he sample you are using to make the inference is only a small subset of the entire population, and so it is subject to_______

sampling error

New cards

a larger sample size will minimize ____ in a sampling distribution

variability

New cards

f a sample is equally spread out around the mean, it is not necessarily unbiased; it is _____ to be biased than a sample that is heavily skewed in one direction or the other.

less likely

New cards

A consistent estimator is

a statistical method that produces an estimate that becomes more accurate as the sample size increases. In other words, it consistently approaches the true value of the population parameter.

New cards

Bias measures how precise the archer is (how ___ to the bullseye), while variability measures how _____ he/she is. In this analogy, the bullseye represents the true ____, and the archer's shots represent the estimates produced by the ___.

close, consistent, population parameter, estimator

New cards

You can usually tell if you will solve a problem using sample proportions if the problem gives you a ________

probability or percentage

New cards

The large counts condition can be expressed as np ≥ 10 and n(1-p) ≥ 10, where n is the sample size and p is the sample proportion. This means that both the number of ____(np) and the number of ____ (n(1-p)) in the sample should be at least ___. If these conditions are met, then you can assume that the sampling distribution for the sample proportion is approximately ___, and you can use statistical techniques that rely on normality, such as confidence intervals or hypothesis tests (we'll cover this in future sections!). ✔

successes, failures, 10, normal

New cards

For the shape (normal) of distributions of means, you can check the _______, but for proportions you must always check the_____

CLT, LCC

New cards

normal distribution

a continuous random probability distribution

New cards

sampling distribution for a difference in sample proportions

compares the mean and standard deviation of a binomial random variable; parameter shown by p; statistic shown by dotted p; conditions SRS, normal or np and n(1-p)≥10 (large counts), for standard deviation population is at least 10. mean for distribution is mu dotted p equals p. standard deviation for distribution is standard deviation of dotted p is equal to the square root of the probability of non p divided by n

New cards

sampling distribution for sample mean

parameter shown by p1-p2, statistic shown by hatted p1 - hatted p2, conditions: SRS, large counts, 10% rule, mean for distribution is mu sub(p1-p2) = p1-p2, standard devitaion for distribution shown by sampling distribution for sample population formula added with data from both populations and all square rooted

New cards

sampling distribution for the difference in sample means

shown by mu parameter, shown by lined x statistic, conditions: SRS, normal or sample size at least 30, 10% rule, mean for distribution is mu

New cards

standard deviation

parameter shown by sigma sub p, statistic shown by s

New cards

When you are working with numerical data and you want to estimate the mean of a population, you can use the______ of the ____ (x̄) to make inferences about the ____ However, before you can solve the problem, you must first assure the sampling distribution is normally distributed using the_____

sampling distribution, sample mean, population mean, CLT

New cards

To find the standard deviation of differences in sample means, divide the variances by each sample size before square rooting to find the overall standard deviation. Just like with proportions, the “________” applies to sampling distributions for the difference in two means as well

pythag theorem of stats

New cards