Unit 3: Collecting Data

studied byStudied by 1 person
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 100

flashcard set

Earn XP

Description and Tags

AP Statistics | 2024-2025

101 Terms

1

What is a population?

the entire group of individuals we want information about

New cards
2

What is a sample?

a subset of individuals n the population from which we actually collect data

New cards
3

What is a census?

data from every individual in the population

New cards
4

What is a sample survey?

a study that collects data from a sample that is chosen to represent a specific population

New cards
5

What are the steps for planning a sample survey?

  1. Decide what population you want to describe

  2. Decide what you want to measure

  3. Decide how to choose a sample from the population

New cards
6

What does poor sampling lead to in your results?

bias

New cards
7

What is bias?

using a value that will consistently overestimate or underestimate the value you want to know

New cards
8

What is convenience sampling?

choosing individuals who are easy to reach

New cards
9

What is voluntary response sampling?

allowing people to choose to be in the sample by responding to a general invitation

New cards
10

Why might voluntary response sampling show bias?

because people will strong feelings (often in the same direction) are most likely to respond

New cards
11

How do you ensure that the conclusion of your study doesn’t become rendered invalid?

by doing everything in your power to ensure that the sample was collected truly, utterly, and completely randomly

New cards
12

What is random sampling?

a chance process to determine which members of a population are included in the sample

New cards
13

What is a simple random sample (SRS)?

a sample chosen in such a way that every group of n individuals in the population has an equal chance to be selected as the sample

New cards
14

Why might you choose a sample by chance?

to avoid bias affecting the results

New cards
15

How can you choose an SRS?

using technology or Table D

New cards
16

What are the 3 steps to choosing an SRS?

  1. Label

  2. Randomize

  3. Select

New cards
17

What is N in regard to SRS?

the number of individuals in the population

New cards
18

What is n in regard to SRS?

sample size

New cards
19

What is the Label step of choosing an SRS with technology?

Give each individual in the population a distinct numerical label from 1 to N

New cards
20

What is the Randomize step of choosing an SRS with technology?

Use a random number generator to obtain n different integers from 1 to N

New cards
21

What is the Select step of choosing an SRS with technology?

Choose the individuals that correspond to the randomly selected integers

New cards
22

How do you find SRS using a calculator?

Math → PRB → 5: randomInt(1, N)

New cards
23

What is the Label step of choosing an SRS with Table D?

Give each member of the population a numerical label with the same number of digits. Use as few digits as possible

New cards
24

What is the Randomize step of choosing an SRS with Table D?

Read consecutive groups of digits of the appropriate length from left to right across a line in Table D. Ignore any groups of digits that wasn’t used as a label or that duplicates a label already in the sample. Stop when you have chosen n different labels

New cards
25

What is the Select step of choosing an SRS with Table D?

Choose the individuals that correspond to the randomly selected integers

New cards
26

What is a table of random digits?

a long string of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 with these two properties:

  • each entry in the table is equally likely to be any of the 10 digits (0-9)

  • the entries are independent of each other, and knowledge of one part of the table gives no information about any other part

New cards
27

What are strata?

groups of similar groups

New cards
28

What is a stratified random sample?

a sample that takes an SRS within each group and combines the SRS’s into one overall sample

New cards
29

Why is it beneficial to use a stratified random sample?

it provides a more precise estimate with less variability

New cards
30

How do you choose a variable to stratify by?

pick the variable that is the best predictor of what you’re measuring

New cards
31

When is it preferred to use cluster sampling instead of SRS or stratified random sampling?

when the populations are large and spread over a wide area

New cards
32

What is a cluster?

a group of individuals that are located near each other

New cards
33

What is a cluster sampling?

randomly choosing clusters and including each member of the selected clusters in the sample

New cards
34

Why are cluster samples used?

for practical reasons like saving time and money

New cards
35

When do cluster samples work best?

when the cluster looks like the population, just on a smaller scale

New cards
36

How do you describe stratified random sampling?

Define the strata

  • obtain an SRS of [ n/number of strata] from each [strata]

  • result – stratified random sample of n students

New cards
37

How do you describe cluster sampling?

  • Use […] as clusters, assuming x individuals per [cluster]

  • Randomly selected [n/number of individuals per cluster]

  • Result – the n individuals will be our sample

New cards
38

What is the drawback of SRS?

there is a large amount of variability, and it is time-consuming

New cards
39

What is the drawback of stratified random sampling?

there might not be many individuals for some strata, which can influence the result

New cards
40

What is the drawback of cluster sampling?

the clusters used may not be good representations of the entire population

New cards
41

What is systematic random sampling?

selecting a sample from an ordered arrangement of the population by randomly selecting one of the first k individuals and every kth individual thereafter

New cards
42

What can affect sample surveys in addition to sampling variability?

errors

New cards
43

What do good sampling techniques include?

the art of reducing all sources of error

New cards
44

When does undercoverage occur?

when some members of the population are less likely to be chosen or cannot be chosen in a sample

New cards
45

When does nonresponse occur?

when an individual chosen for the sample can’t be contacted or refuses to participate

New cards
46

When does response bias occur?

when there is a systematic pattern of inaccurate answers to a survey question

New cards
47

What is the most important influence on the answers given to a sample survey?

the wording of questions

New cards
48

Why should you rely on random sampling?

  • to avoid bias in selecting samples from the lists of available individuals

  • the laws of probability allow trustworthy inference about the population

New cards
49

What is a margin of error?

how far we expect the sample proportion to be from the actual

New cards
50

What is the benefit of increasing the sample size?

increased precision (but not accuracy)

New cards
51

What are errors in design methods (designer flaw)?

  • convenience sampling

  • voluntary response sampling

New cards
52

What are errors causing response bias (response flaw)?

  • undercoverage

  • nonresponse

  • wording of questions

New cards
53

What is an observational study?

a study that observes individuals and measures variables of interest but does not attempt to influence the response

New cards
54

What is a retrospective observational study?

one that examines existing data

New cards
55

What is a prospective observational study?

one that tracks individuals into the future

New cards
56

When does confounding occur?

when two variables are associated in such a way that their effects on a response variable cannot be distinguished from each other

New cards
57

What does an experiment do?

deliberately imposes some treatment on individuals to measure their responses

New cards
58

What is a placebo?

a treatment that has no active ingredient, but is otherwise like other treatments

New cards
59

What is the only source of full convincing data when our goal is to understand cause and effect?

experiments

New cards
60

What is a treatment?

a specific condition applied to the individuals in an experiment

New cards
61

What is an experimental unit?

the object to which a treatment is randomly assigned

New cards
62

What are experimental units called when they are human beings?

subjects

New cards
63

How do experiments differ from observational studies?

observational studies observe individuals and ask them questions, while experiments impose some treatment in order to measure the response

New cards
64

Why do observational studies of the effect on an explanatory variable on a response variable often fail?

because of confounding between the explanatory variable and one or more other variables

New cards
65

What do well-designed experiments take steps to do?

prevent confounding

New cards
66

What is a factor in an experiment?

an explanatory variable that is manipulated and may cause a change in the response variable

New cards
67

What are levels in an experiment?

the different values of a factor

New cards
68

Why is a control group used?

to provide a baseline for comparing effects of other treatments

New cards
69

What is the placebo effect?

the effect that some subjects in an experiment will respond favorably to any treatment, even an inactive treatment

New cards
70

What is a double-blind experiment?

an experiment in which neither the subjects nor those who interact with them and measure the response variable know which treatment a subject received

New cards
71

What is a single-blind experiment?

an experiment in which either the subjects don’t know which treatment they are receiving or the people who interact with them and measure the response variable don’t know which subjects are receiving which treatment

New cards
72

What is random assignment in an experiment?

using chance to assign experimental units to treatments

New cards
73

What is the purpose of random assignment?

to help create roughly equivalent groups of experimental units by balancing the effects of other variables among the treatments

New cards
74

What does control mean in an experiment?

keeping other variables constant for all experimental units

New cards
75

What does random assignment ensure?

that the effects of uncontrolled variables are balanced among treatment groups

New cards
76

What is replication in an experiment?

using enough experimental units to distinguish a difference in the effects of the treatments from chance variation due to the random assignment

New cards
77

What can replication also refer to?

repeating the experiment with different subjects

New cards
78

How does an experiment benefit from replication?

confounding is prevented and variability is reduced

New cards
79

What are the 4 principles of experimental design?

  • comparison

  • random assignment

  • control

  • replication

New cards
80

What is comparison?

using a design that compares two or more treatments

New cards
81

What is a completely randomized design?

a design in which the experimental units are assigned to the treatments completely by chance

New cards
82

What is a block?

a group of experimental units that are known before the experiment to be similar in some way that is expected to affect the response to the treatments

New cards
83

What is a randomized block design?

a design in which the random assignment of experimental units to treatments is carried out separately within each black

New cards
84

What does blocking account for?

a source of variability

New cards
85

What are the best variables to use for blocking?

ones that best predict the response variable

New cards
86

What is a matched pairs design?

a design for comparing two treatments that uses blocks of size 2

New cards
87

How are pairs used in matched pairs designs?

either two very similar experimental units are paired and the two treatments are randomly assigned within each pair, or each experimental unit receives both treatments in a random order

New cards
88

What do researchers usually hope to see in an experiment?

a difference in the responses that is so large that it is unlikely to have happened just by chance variation

New cards
89

How can we learn whether the treatments effects are larger than we would expect to see if only chance was operation?

by using the laws of probability

New cards
90

When is an observed effect statistically significant?

when it is so large that it would rarely occur

New cards
91

True or false: A statistically significant association in data from a well-designed experiment does not imply causation.

false

New cards
92

What do we need to do when we do an experiment and find a difference between two groups?

we need to determine if this difference can be attributed to the chance of variation in random assignment or because there really is a difference in effects of the treatments

New cards
93

How can we determine if the results of our experiment are statistically significant?

by conducting a simulation that models truly random outcomes and using the results to conclude statistically significance

New cards
94

What does the scope of inferences refer to?

the types of inferences (conclusions) that can be drawn from a study

New cards
95

When can we make inferences about the population?

when the individuals are randomly selected from a population

New cards
96

When can we make inferences about cause and effect?

when the individuals are randomly assigned to groups

New cards
97

What do well-designed experiments do?

randomly assign individuals to treatment groups

New cards
98

Why can’t inferences about cause and effect be made in regard to observational studies?

they don’t randomly assign individuals to groups

New cards
99

What type of observational studies can make inferences about population?

ones that use random sampling

New cards
100

What does a well-designed experiment tell us?

that changes in the explanatory variable cause changes in the response variable

New cards
robot