STAT 100: Exam 2

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/96

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:13 AM on 4/3/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

97 Terms

1
New cards

causal inference

inference abt which factor(s) may be responsible for causing a particular effect on something

2
New cards

Causal Inference: Observed Outcome

what ACTUALLY happened in the real world

3
New cards

Causal inference: Controlling

act of taking steps to ensure the baseline characteristics for some factors are the SAME between the CONTROL and TREATMENT observations

4
New cards

Causal inference: POOR Internal Validity --> ???

there is a CAUSAL BIAS -> prevent us from making an causal inferences

5
New cards

Case-Control Study

study in which individual control observations are each matched to individual "treatment" observations

- MANY PAIRS OF MATCHED OBSERVATION

6
New cards

Confounding bias prevents us from _____

prevents us from MAKING VALID INFERENCES based on the study's results

7
New cards

Confounding Factors

factors that themselves have a causal effect on both the MAIN CAUSAL factor we are trying to study AND the OUTCOME factor we are trying to study

8
New cards

Uncontrolled confounders can ________

can make it LOOK like there is a cause and effect relationship between two factors EVEN WHEN THERE ISN'T

9
New cards

2 steps to the process of identifying a possible cofounder

1. Establish a difference between the treatment and control observations based on the possible confounding factor

2. Establish that the confounding factor is related to the outcome

10
New cards

Confounding Bias: What do we need to remember with case-control studies?

CASE-CONTROL STUDIES: we DO know that the treatment observations won't be EXACTLY like the control observations + that we're counting on having lots of matches so that all of the little differences will average out

11
New cards

Ch. 14 - Moderators

factors that change the effect a cause has on the outcome

12
New cards

Ch. 14 - Moderators/Interaction Effects: Another way to say that there IS an interaction effect

The ATE is different DEPENDING on who we're talking abt

13
New cards

Ch. 14 - 2 steps to figure out whether you have a moderating factor creating an interaction effect by (VISUALLY):

1. Create an interaction plot

2. Visually determine if the slopes are OBVIOUSLY different

14
New cards

Ch. 14 - Moderators/Interaction Effects: Interaction Plots --- MORE overlap of the error bars suggests WHAT?

a LOWER likelihood of there being an interaction, EVEN IF the slopes appear to be different

15
New cards

Ch. 15 - Random Assignment: Is it an implicit OR explicit control? AND WHY?

Implicit Control - bc random assignment CANNOT guarantee similarity between groups

16
New cards

Ch. 15 - Block Randomization Study Design (2 steps)

1. entire group of SIMILAR people are MATCHED and BLOCKED together

2. within EACH BLOCK, some are RANDOMLY assigned to the treatment group, while the others are RANDOMLY assigned to the control group

17
New cards

Ch. 15 - Which study design should be used when...

Group of observations + NO random assignment

Cohort Control

18
New cards

Representatives: Sampling (Ch. 16)

act of selecting observations from which to collect data

19
New cards

Sample Statistic (ch. 16 - pg. 163)

computing averages for a sample

20
New cards

Generalization Inference (ch. 16 - pg. 163)

the act of using a sample statistic to determine the value of a parameter

21
New cards

Generalization: Estimate (ch. 16 - pg. 164)

value we use as a stand-in for the parameter

22
New cards

External Validity: What does having a DIFFERENCE in the distribution of important underlying baseline characteristics between the sample and the population of interest mean? (ch. 16 - pg. 164)

IT MEANS THAT THE SAMPLE IS NOT REPRESENTATIVE of THE WHOLE POPULATION.

23
New cards

Sampling: In what cases do we say that there is a sample bias preventing us from making a generalization inference? (ch. 16 - pg. 166)

when a sample IS NOT REPRESENTATIVE of a population of interest --> not appropriate to use sample data to make generalizations about the population of interest

24
New cards

Sampling: Inclusion Criteria (ch. 16 - pg. 166)

The way in which statisticians and data scientists define the population of interest

WHO'S NOT HERE???????

25
New cards

Volunteer Sampling (Ch. 17 - pg. 173)

sampling strategy in which you make a survey or study available to some or all respondents who meet the inclusion criteria

26
New cards

Probability Sampling (ch. 17 - pg. 173)

sampling strategy in which all eligible observations are assigned a non-zero probability to be included in the sample + then some observations are selected using a random process

- random process = distinguising feature of probability samples

27
New cards

Quota Sampling

sampling strategy in which you have specific quotas (limits) for specific strata of respondents within the population of interest

28
New cards

Quota Sampling: Quota is USUALLY based on ???? (ch. 17 - 174)

characteristics of the population

29
New cards

Population-based Quota Sampling: How does it improve the external validity? (ch. 17 - pg. 175)

it forces the sample characteristics to match the population's aggregate characteristics

30
New cards

importance of survey incentive (2) (ch. 17 - pg. 178)

1. constitute an ethical return for the time the respondents take to complete the survey

2. help reduce non-response

31
New cards

Quota Sampling Strategies:

1. Did everyone in the population of interest have a chance of being in the sample?

2. Was some component of random sampling utilized?

3. Were quotas or blocks utilized?

1. Sometimes

2. No random sampling

3. Yes Quotas/Blocks

32
New cards

Probability Sampling Strategies

1. Did everyone in the population of interest have a chance of being in the sample?

2. Was some component of random sampling utilized?

3. Were quotas or blocks utilized?

1. Yes

2. YES random sampling

3. No quotas/blocks

33
New cards

Stratified Sampling Strategies:

1. Did everyone in the population of interest have a chance of being in the sample?

2. Was some component of random sampling utilized?

3. Were quotas or blocks utilized?

1. Yes

2. Yes random sampling

3. Yes quotas/blocks

34
New cards

KEY GIVEWAYS: "Population-based" = ???

quotas

35
New cards

KEY GIVEWAYS: "Adjusted" = ???

quota

36
New cards

KEY GIVEWAYS: Randomly Selected = ????

random sampling

37
New cards

KEY GIVEWAYS: Representative = ????

Random Sampling

38
New cards

Multi-Stage probability Samples help with what + examples

provide external validity for samples that are trying to generalize toe very broad populations

EX: All Americans or all human adults

39
New cards

What is stratification?

dividing population into groups FIRST, then random sampling inside each group

40
New cards

causal inference: Causal Factor + Outcome Factor

Causal: something that we will do that may have an effect

Outcome: the thing that will be affected

41
New cards

Causal inference: causal graph

a graphic depiction of the relationship between a causal factor and an outcome factor

42
New cards

Causal inference: Counterfactual question

what WOULD HAVE happened in the PARALLEL universe

- WHAT IF????

43
New cards

Causal inference: Treatment Observation

observations that were EXPOSED to some causal factor

44
New cards

Causal inference: Baseline Characteristics

attributes that we think might be RELATED to the OUTCOME we are studying

45
New cards

Causal inference: Matching

process of finding suitable CONTROL observations to COMPARE to TREATMENT observation and VICE VERSA

46
New cards

Causal inference: Internal Validity

depends on the extent to which the CONTROL observation are LIKE the TREATMENT observation

- depends on the SIMILARITY in the BASELINE characteristics between the CONTROL observation and TREATMENT observation

47
New cards

Cohort-Control study: ____ ____ is NOT a baseline characteristics that we have to control for

Sample Size

48
New cards

Causal Inference: When comparing PERCENTAGES, what should you compute?

AD and RD

49
New cards

Causal Inference: When comparing MEANS, what should you compute?

Effect size

50
New cards

Template for Counterfactual Questions

What if (TREATMENT observations) didn't use (TREATMENT)

51
New cards

PG. 83 - What should you use to compare company size and profit? AND WHY?

Correlation

52
New cards

When comparing baseline characteristics, what effect size would give us CONCERN?

0.20ish

- We want the effect size to be less than 0.1 if possible, to make sure the baseline characteristics are AS SIMILAR AS POSSIBLE

53
New cards

Ch. 14 - Moderators/Interaction Effects: True or False - The impact that a treatment has on an outcome (ATE) DOES NOT have to be the same for everyone.

True - The impact DOES NOT have to be the same for everyone.

54
New cards

Ch. 15 - How would you know if a study used random assignment?

it would DIRECTLY say it in the problem

55
New cards

Ch. 15 - Random Assignment

A strategy that utilizes a random process to determine which observations receive a TREATMENT, and which serve as a CONTROL

56
New cards

Ch. 15 - Matching: Is it an implicit OR explicit control? AND WHY?

Explicit Control - BC Matching DOES guarantee that the control observations and treatment observations are VERY ALIKE of the baseline characteristics used to conduct the matching process (pg. 144)

57
New cards

Ch. 15 - Does block randomization tend to have HIGH or LOW internal validity?

HIGH internal validity -- esp when the size of each block is relatively large

- combines the benefits of explicit controls via matching on important baseline characteristics AND incorporates the benefits of implicit controls via including random assignment

58
New cards

Column Percents: Relative Difference - Calculation and Benchmark

DIVING one column percent from the other

- used usually when percentages are below 50%

DANGER ZONE: LESS than 0.8 or GREATER than 1.25

if in danger zone --> interpret that difference to indicate a real-world difference between the two groups

59
New cards

Ratio of SD: Calculation + Benchmark

LARGER SD DIVIDED by SMALLER SD

- DANGER ZONE: if the ratio is approximately 3 OR HIGHER, we say that one distribution is more spread out than the other

60
New cards

When do you compute EFFECT SIZE?

only when the RATIO OF SD has SIMILAR SPREAD (LESS THAN 3)

61
New cards

Representatives: Sample (Ch. 16)

observations selected

62
New cards

Representatives: Population of Interest (Ch. 16)

All observations that we are interested in studying

63
New cards

Representatives: INSTEAD collecting data from ALL of the observations in the population of interest.... we ____ (ch. 16 - pg. 163)

select a SAMPLE of observations from the population of interest + collect data from the sample of observations

64
New cards

External Validity (ch. 16 - pg. 164)

degree to which a sampling strategy supports making a generalization inference

65
New cards

External Validity: Sample's representatives (ch. 16 - pg. 164)

extent to which the observations in the sample are SIMILAR to the observations in the population of interest

66
New cards

Sampling: A Sample is representative when __________ (ch. 16 - pg. 165)

when everyone in the population of interest is represented by someone in the sample

67
New cards

What's another term for Volunteer Sampling? (ch. 17 - pg. 173)

Convenience Sampling

68
New cards

Do probability samples require that EVERY observation has the SAME probability of being selected for the sample?

NO - it DOESN'T require that every observation has the same probability of being selected for the sample

69
New cards

Adding strata to a sampling strategy is an _________ control. (ch. 17 - pg. 175)

EXPLICIT control --> GUARANTEES that the sample's characteristics will be similar to the population's aggregate characteristics

70
New cards

Random Sampling is an ______ control. (ch. 17 - pg. 175)

IMPLICIT control --> DOES NOT GUARANTEE anything + adds some measure of representatives for all the other non-important characteristics that might impede our ability to generalize the sample statistic to the entire population of interest

71
New cards

survey incentive (ch. 17 - pg. 178)

the act of offering something in return for a respondent providing their data

72
New cards

Volunteer Sampling Strategies:

1. Did everyone in the population of interest have a chance of being in the sample?

2. Was some component of random sampling utilized?

3. Were quotas or blocks utilized?

1. Sometimes

2. No random sampling

3. No quotas/blocks

73
New cards

Quota Sampling: TRUE OR FALSE - Sample Size IS NOT the same thing as a quota.

TRUE

74
New cards

Causal inference: Control Observation

observations that are VERY SIMILAR to treatment observations, BUT were NOT EXPOSED to the causal factor

75
New cards

Causal Inference: Treatment EFFECT (+ how is it represented on the causal graph)

best guess as to HOW MUCH of an effect a causal factor has on an outcome factor

- represented as the arrow between the causal factor and outcome factor

76
New cards

Causal inference: 2 things we need to pay attention to when examining a study's results?

1. Study Design

2. INTERNAL VALIDITY (what was compared?)

77
New cards

Case-Control Study: Average Treatment Effect (ATE) represents WHAT?

The typical amount by which we expect the outcome factor to change if an observation is given a specific treatment

78
New cards

Cohort-Control study

study in which an ENTIRE group of control observations is matched to the group of "treatment" observations based on their aggregate characteristics

79
New cards

Case-Control Study: How should you compute the average treatment effect?

(ADD all the differences within the pair) AND then divide it by the number of observations

ALWAYS TREATMENT - OBSERVATION

80
New cards

PG. 74 - What should you use to compare sales in Spring and Fall? AND WHY?

Ratio of SD and effect size

81
New cards

8 STEPS when working with Baseline Differences and Confounding Effects

1. Identify the causal factor and outcome factor

2. Draw a causal graph

3. Write the counterfactual question

4. Identify the treatment observations

5. Identify the study design

6. Compare baseline characteristics

7. Determine if any of the baseline characteristics with differences are related to the outcome factor

8. Identify confounding factors and biases

82
New cards

Ch. 14 - What causes interactions to occur?

Moderating Factors

83
New cards

Ch. 14 - Moderators/Interaction Effects: What is an indication of a moderating factor?

If the treatment impact on the outcome works BETTER OR WORSE for SOME people --> indication of a moderating factor

(S26 Ch14 Slides)

84
New cards

Ch. 14 - 3 Steps to figure out whether you have a moderating factor creating an interaction effect

1. Split both the treatment and control groups into groups based on the moderating factor

2. Compute SEPARATE ATE for the groups

3. Seeing if the ATEs are different

85
New cards

Ch. 15 - Matching Pairs Design (2 steps)

1. two observations are MATCHED based on important baseline characteristics

2. one observation in the pair is RANDOMLY assigned to the treatment group, while the other is RANDOMLY assigned to the control group

86
New cards

Ch. 15 - Difference between Matched Pairs Studies and Case Controls

Matched Pairs: Researchers DECIDE which observations get exposed to the treatment and which does not

Case Controls: participants based on their OWN LIFE CHOICES have either already been exposed to the treatment or not

87
New cards

Ch. 15 - Which study design should be used when...

individual observations + random assignment

Matched Pairs

88
New cards

Ch. 15 - Which study design should be used when...

group of observations + random assignment

Block randomization

89
New cards

Ch. 15 - Which study design should be used when...

individual observations + NO random assignment

Case-Control

90
New cards

Column Percents: Absolute Difference - Calculation and Benchmark

SUBTRACTING one column percent from the other and taking the absolute value

DANGER ZONE: the absolute difference is GREATER THAN 10%

-- if greater than 10%, there IS a difference to indicate a real-world difference between two groups

91
New cards

Effect Size: Calculation + Benchmark

(Group 1 Mean - Group 2 Mean) DIVIDED by LARGER SD

General Benchmark:

0.10 or LESS - NO difference in the averages between the groups

0.25ish - SMALL difference in the averages between the groups

0.50 - MODERATE difference in the averages between the groups

0.75ish - LARGE difference in the averages between the groups

92
New cards

Parameters (ch. 16 - pg. 163)

Characteristics of an entire population of interest

93
New cards

Sampling: A sample MUST have a VERY _______ (high or low) external validity to support generalization (ch. 16 - pg. 166)

VERY HIGH EXTERNAL VALIDITY

94
New cards

Probability Sampling - What do you need to determine first before randomly selecting participants? (ch. 17 - 173)

MUST determine the desired sample size FIRST, and then randomly selected

95
New cards

Stratified Random Sample (ch. 17 - pg. 175)

Adding strata to a probability sample

96
New cards

Internal Validity: Key questions to ask (2)

  1. What was compared?

  2. Was it apples-to-apples?

97
New cards

External Validity: Key questions to ask (2)

  1. Who’s not here?

  2. Who should be here?

Explore top notes

note
4.5: fair trade
Updated 1213d ago
0.0(0)
note
APUSH Period 2 Review
Updated 527d ago
0.0(0)
note
kpl 2 alleviivaukset ruotsi-suomi
Updated 1283d ago
0.0(0)
note
Conservation of Mass
Updated 1223d ago
0.0(0)
note
Ch 14 - Aggregate Demand
Updated 1088d ago
0.0(0)
note
4.5: fair trade
Updated 1213d ago
0.0(0)
note
APUSH Period 2 Review
Updated 527d ago
0.0(0)
note
kpl 2 alleviivaukset ruotsi-suomi
Updated 1283d ago
0.0(0)
note
Conservation of Mass
Updated 1223d ago
0.0(0)
note
Ch 14 - Aggregate Demand
Updated 1088d ago
0.0(0)

Explore top flashcards

flashcards
Quiz #3
35
Updated 1210d ago
0.0(0)
flashcards
SPANISH QUIZ #4 Semester 2
56
Updated 36d ago
0.0(0)
flashcards
Kapitel 4 woordenschat
88
Updated 1072d ago
0.0(0)
flashcards
Quiz #3
35
Updated 1210d ago
0.0(0)
flashcards
SPANISH QUIZ #4 Semester 2
56
Updated 36d ago
0.0(0)
flashcards
Kapitel 4 woordenschat
88
Updated 1072d ago
0.0(0)