$go to Math$

AP Statistics Unit 3: Collecting Data

Unit 3: Collecting Data

Studied by 1 person

0.0(0)

LearnA personalized and smart learning plan

Practice TestTake a test on your terms and definitions

Spaced RepetitionScientifically backed study method

Matching GameHow quick can you match all your cards?

FlashcardsStudy terms and definitions

1 / 100

Earn XP

Description and Tags

AP Statistics | 2024-2025

Statistics

Collecting Data

AP Statistics

Unit 3: Collecting Data

10th

101 Terms

What is a population?

the entire group of individuals we want information about

New cards

What is a sample?

a subset of individuals n the population from which we actually collect data

New cards

What is a census?

data from every individual in the population

New cards

What is a sample survey?

a study that collects data from a sample that is chosen to represent a specific population

New cards

What are the steps for planning a sample survey?

Decide what population you want to describe
Decide what you want to measure
Decide how to choose a sample from the population

New cards

What does poor sampling lead to in your results?

bias

New cards

What is bias?

using a value that will consistently overestimate or underestimate the value you want to know

New cards

What is convenience sampling?

choosing individuals who are easy to reach

New cards

What is voluntary response sampling?

allowing people to choose to be in the sample by responding to a general invitation

New cards

Why might voluntary response sampling show bias?

because people will strong feelings (often in the same direction) are most likely to respond

New cards

How do you ensure that the conclusion of your study doesn’t become rendered invalid?

by doing everything in your power to ensure that the sample was collected truly, utterly, and completely randomly

New cards

What is random sampling?

a chance process to determine which members of a population are included in the sample

New cards

What is a simple random sample (SRS)?

a sample chosen in such a way that every group of n individuals in the population has an equal chance to be selected as the sample

New cards

Why might you choose a sample by chance?

to avoid bias affecting the results

New cards

How can you choose an SRS?

using technology or Table D

New cards

What are the 3 steps to choosing an SRS?

Label
Randomize
Select

New cards

What is N in regard to SRS?

the number of individuals in the population

New cards

What is n in regard to SRS?

sample size

New cards

What is the Label step of choosing an SRS with technology?

Give each individual in the population a distinct numerical label from 1 to N

New cards

What is the Randomize step of choosing an SRS with technology?

Use a random number generator to obtain n different integers from 1 to N

New cards

What is the Select step of choosing an SRS with technology?

Choose the individuals that correspond to the randomly selected integers

New cards

How do you find SRS using a calculator?

Math → PRB → 5: randomInt(1, N)

New cards

What is the Label step of choosing an SRS with Table D?

Give each member of the population a numerical label with the same number of digits. Use as few digits as possible

New cards

What is the Randomize step of choosing an SRS with Table D?

Read consecutive groups of digits of the appropriate length from left to right across a line in Table D. Ignore any groups of digits that wasn’t used as a label or that duplicates a label already in the sample. Stop when you have chosen n different labels

New cards

What is the Select step of choosing an SRS with Table D?

Choose the individuals that correspond to the randomly selected integers

New cards

What is a table of random digits?

a long string of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 with these two properties:

each entry in the table is equally likely to be any of the 10 digits (0-9)
the entries are independent of each other, and knowledge of one part of the table gives no information about any other part

New cards

What are strata?

groups of similar groups

New cards

What is a stratified random sample?

a sample that takes an SRS within each group and combines the SRS’s into one overall sample

New cards

Why is it beneficial to use a stratified random sample?

it provides a more precise estimate with less variability

New cards

How do you choose a variable to stratify by?

pick the variable that is the best predictor of what you’re measuring

New cards

When is it preferred to use cluster sampling instead of SRS or stratified random sampling?

when the populations are large and spread over a wide area

New cards

What is a cluster?

a group of individuals that are located near each other

New cards

What is a cluster sampling?

randomly choosing clusters and including each member of the selected clusters in the sample

New cards

Why are cluster samples used?

for practical reasons like saving time and money

New cards

When do cluster samples work best?

when the cluster looks like the population, just on a smaller scale

New cards

How do you describe stratified random sampling?

Define the strata

obtain an SRS of [ n/number of strata] from each [strata]
result – stratified random sample of n students

New cards

How do you describe cluster sampling?

Use […] as clusters, assuming x individuals per [cluster]
Randomly selected [n/number of individuals per cluster]
Result – the n individuals will be our sample

New cards

What is the drawback of SRS?

there is a large amount of variability, and it is time-consuming

New cards

What is the drawback of stratified random sampling?

there might not be many individuals for some strata, which can influence the result

New cards

What is the drawback of cluster sampling?

the clusters used may not be good representations of the entire population

New cards

What is systematic random sampling?

selecting a sample from an ordered arrangement of the population by randomly selecting one of the first k individuals and every kth individual thereafter

New cards

What can affect sample surveys in addition to sampling variability?

errors

New cards

What do good sampling techniques include?

the art of reducing all sources of error

New cards

When does undercoverage occur?

when some members of the population are less likely to be chosen or cannot be chosen in a sample

New cards

When does nonresponse occur?

when an individual chosen for the sample can’t be contacted or refuses to participate

New cards

When does response bias occur?

when there is a systematic pattern of inaccurate answers to a survey question

New cards

What is the most important influence on the answers given to a sample survey?

the wording of questions

New cards

Why should you rely on random sampling?

to avoid bias in selecting samples from the lists of available individuals
the laws of probability allow trustworthy inference about the population

New cards

What is a margin of error?

how far we expect the sample proportion to be from the actual

New cards

What is the benefit of increasing the sample size?

increased precision (but not accuracy)

New cards

What are errors in design methods (designer flaw)?

convenience sampling
voluntary response sampling

New cards

What are errors causing response bias (response flaw)?

undercoverage
nonresponse
wording of questions

New cards

What is an observational study?

a study that observes individuals and measures variables of interest but does not attempt to influence the response

New cards

What is a retrospective observational study?

one that examines existing data

New cards

What is a prospective observational study?

one that tracks individuals into the future

New cards

When does confounding occur?

when two variables are associated in such a way that their effects on a response variable cannot be distinguished from each other

New cards

What does an experiment do?

deliberately imposes some treatment on individuals to measure their responses

New cards

What is a placebo?

a treatment that has no active ingredient, but is otherwise like other treatments

New cards

What is the only source of full convincing data when our goal is to understand cause and effect?

experiments

New cards

What is a treatment?

a specific condition applied to the individuals in an experiment

New cards

What is an experimental unit?

the object to which a treatment is randomly assigned

New cards

What are experimental units called when they are human beings?

subjects

New cards

How do experiments differ from observational studies?

observational studies observe individuals and ask them questions, while experiments impose some treatment in order to measure the response

New cards

Why do observational studies of the effect on an explanatory variable on a response variable often fail?

because of confounding between the explanatory variable and one or more other variables

New cards

What do well-designed experiments take steps to do?

prevent confounding

New cards

What is a factor in an experiment?

an explanatory variable that is manipulated and may cause a change in the response variable

New cards

What are levels in an experiment?

the different values of a factor

New cards

Why is a control group used?

to provide a baseline for comparing effects of other treatments

New cards

What is the placebo effect?

the effect that some subjects in an experiment will respond favorably to any treatment, even an inactive treatment

New cards

What is a double-blind experiment?

an experiment in which neither the subjects nor those who interact with them and measure the response variable know which treatment a subject received

New cards

What is a single-blind experiment?

an experiment in which either the subjects don’t know which treatment they are receiving or the people who interact with them and measure the response variable don’t know which subjects are receiving which treatment

New cards

What is random assignment in an experiment?

using chance to assign experimental units to treatments

New cards

What is the purpose of random assignment?

to help create roughly equivalent groups of experimental units by balancing the effects of other variables among the treatments

New cards

What does control mean in an experiment?

keeping other variables constant for all experimental units

New cards

What does random assignment ensure?

that the effects of uncontrolled variables are balanced among treatment groups

New cards

What is replication in an experiment?

using enough experimental units to distinguish a difference in the effects of the treatments from chance variation due to the random assignment

New cards

What can replication also refer to?

repeating the experiment with different subjects

New cards

How does an experiment benefit from replication?

confounding is prevented and variability is reduced

New cards

What are the 4 principles of experimental design?

comparison
random assignment
control
replication

New cards

What is comparison?

using a design that compares two or more treatments

New cards

What is a completely randomized design?

a design in which the experimental units are assigned to the treatments completely by chance

New cards

What is a block?

a group of experimental units that are known before the experiment to be similar in some way that is expected to affect the response to the treatments

New cards

What is a randomized block design?

a design in which the random assignment of experimental units to treatments is carried out separately within each black

New cards

What does blocking account for?

a source of variability

New cards

What are the best variables to use for blocking?

ones that best predict the response variable

New cards

What is a matched pairs design?

a design for comparing two treatments that uses blocks of size 2

New cards

How are pairs used in matched pairs designs?

either two very similar experimental units are paired and the two treatments are randomly assigned within each pair, or each experimental unit receives both treatments in a random order

New cards

What do researchers usually hope to see in an experiment?

a difference in the responses that is so large that it is unlikely to have happened just by chance variation

New cards

How can we learn whether the treatments effects are larger than we would expect to see if only chance was operation?

by using the laws of probability

New cards

When is an observed effect statistically significant?

when it is so large that it would rarely occur

New cards

True or false: A statistically significant association in data from a well-designed experiment does not imply causation.

false

New cards

What do we need to do when we do an experiment and find a difference between two groups?

we need to determine if this difference can be attributed to the chance of variation in random assignment or because there really is a difference in effects of the treatments

New cards

How can we determine if the results of our experiment are statistically significant?

by conducting a simulation that models truly random outcomes and using the results to conclude statistically significance

New cards

What does the scope of inferences refer to?

the types of inferences (conclusions) that can be drawn from a study

New cards

When can we make inferences about the population?

when the individuals are randomly selected from a population

New cards

When can we make inferences about cause and effect?

when the individuals are randomly assigned to groups