Epidemiology Methods I Exam #2

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/89

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

90 Terms

1
New cards

When do we conclude that an exposure and an outcome are associated?

If mean/risk/prevalence difference not equal to 0.

If risk/prevalence/odds ratio not equal to 1.

Above holds only in the absence of error.

2
New cards

Error

The extent to which an effect estimate deviates from the truth.

Two sources of error:

systematic error

random error

3
New cards

Systemic error

caused by flaws in the research methodology

Sources:

unmeasured confounders

Measurement error

non-random sampling.

Consequences: bias in the effect estimates.

4
New cards

Random error

caused by chance variations

most important source is sampling variation.

Consequence: lower precision of effect estimates (but no bias)

5
New cards

Sampling variation

The variation in a measure of occurrence or association across random samples from the same source population.

Caused by the at random selection of individuals from the source population.

6
New cards

Random selection and sampling variation

Every individual in the source population has the same probability of being selected for the study participation.

However, in a single study, individuals with certain characteristics may be oversampled by chance.

7
New cards

Random sampling impacts

Causes variation in estimates of ocurence and association.

The estimates vary around the true occurrence or association.

The average estimate across many samples will approximate the true occurence or association.

8
New cards

Standard error

Quantifies the variability in the sampling distribution.

They are used to calculate p-values and confidence intervals for effect estimates.

is a statistical measure that captures the variability in an effect estimate across many hypothetical samples.

Standard error helps us determine the precision of an effect estimate.

9
New cards

Sampling distribution of ratios

Typically right skewed.

10
New cards

Normal sampling distribution

standard error only have a relevant interpretation.

11
New cards

The natural logarithm (Ln) scale for ratios

Range and distribution before transformation: null effect=1

Range and distribution after transformation: null effect=0

12
New cards

Hypothesis testing

The goal of research is determining if there is an exposure-outcome association

Random sampling variation makes it impossible to determine if the hypothesis holds based on the effect estimate only.

13
New cards

Test hypothesis

the value which we compare the observed effect estimate.

Can be any value. but typically we use the null hypothesis.

14
New cards

Null value=0

Risk/prevalence difference

Mean difference

Ln (risk/prevalence ratio)

Ln (odds ratio)

15
New cards

Null value=1

Risk/prevalence ratio

Odds Ratio

16
New cards

P-value

The probability of observing the effect estimate if the test (null) hypothesis is true.

is determined using the test statistic which helps us determine the location of the effect estimate in a normal sampling distribution.

An estimate is statistically different from the test hypothesis if the P-value is <0.05.

17
New cards

Test Statistics

Tells us how many standard deviations the test hypothesis the observed effect estimate is.

Computed using:

-the effect estimate

-the test hypothesis

-the standard error

18
New cards

Statistical significance

Tells us whether there is mathematically meaningful difference in the outcome between the two compared groups.

is not equal to clinical relevance.

Typically a probability of 5% is used as the cut off.

Effect estimates are considered statistically significant when the P-value<0.05.

Alternative hypothesis: there is an association.

19
New cards

Hypothesis testing limitations

It relies on the strong assumption: we can only reject the null hypothesis if the effect estimate is completely unbiased.

P<0.05 does not say anything about the clinical relevance of an effect estimate.

20
New cards

Confidence intervals formula

knowt flashcard image
21
New cards

Confidence interval width

The higher the confidence level, the wider the confidence interval.

22
New cards

Type I error

rejecting the H0 when H0 is true.

falsley concluding there is an association.

If the effect estimate is unbiased then the probability of making a chance finding equals alpha.

23
New cards

Type II error

failing to reject the H0 when Ha is true.

We falsely conclude there is no association.

Happens when the statistical power is too low to detect a statistically significant association (lack of power)

Statistical power depends on:

the sample size

alpha

the size of the true effect

24
New cards

Power

is the probability that we detect an association when in reality there is an association.

1-Type II error

25
New cards

Power Analysis

  1. The sample size needed to detect a specific effect size given a specific alpha level and the power level.

  2. The minimum effect size that can be detected given a specific level of alpha, power, and sample size.

26
New cards
27
New cards

Clinical relevance

pertains to whether there is clinically/practically meaningful difference in the outcome between the two groups.

-Depends on the size of the effect estimate.

-is context dependent

continuous outcomes: is the mean difference representing a relevant increase/decrease in the outcome.

Binary outcomes: evaluation of the effect size in combination with the background risk; relevant increase/decrease in outcome occurrence.

28
New cards

Statistical significance depends on

The power to detect an effect estimate that is different from the null variable.

Power can be somewhat increased by researchers increasing sample size, increasing alpha.

As power increases more likely to find statistically significant effect estimates.

29
New cards
<p>Implications of statistical significance and clinical relevance</p>

Implications of statistical significance and clinical relevance

30
New cards

Fundamental problem of casual inference

It is impossible to observe both potential outcomes for the same individual for the same individual over the same time period.

Solution: compare average potential outcomes.

31
New cards

Casual effect

Difference between the ratio of the two average potential outcomes.

32
New cards

Casual assumptions

Exchangeability

Consistency

Positivity

33
New cards

Exchangeability assumption

On average the exposed group and unexposed group are exchangeable in all aspects other than the exposure.

The only characteristic that differs across the two groups is the exposure.

If it holds then we would get the same estimates for E[Y(x)] and E[Y(x*)]

if (hypothetically) we were able to exchange the exposure across the two groups.

34
New cards

Violation of exchangeability

The two differ in aspects other than the exposure and the aspects in which the groups differ also affect the outcome.

35
New cards

Exchangeability and Confounders

Means that there cannot be any unmeasured confounders of the exposure outcome association.

is violated when the distribution of confounders differs across the two exposure groups.

When violated the effect estimate will be biased.

36
New cards

Solutions to the violation of Exchangeability

Randomization

Restriction

Matching

Stratification by confounders

37
New cards

Consistency assumption

The observed outcome for an individual is equal to the potential outcome when the exposure is set to the observed exposure value.

Typically holds in experimental studies where E[Y(x)] and E[Y(x*)] are obtained by randomizing individuals to well defined intervention and control conditions.

38
New cards

Consistency in RCTs

A well defined intervention condition: every person in the intervention condition receives the exact same intervention.

A well defined control condition: every person in the control condition recieves the exact same control.

There is one way in which individuals qualify as exposed/not exposed to vitamin D.

39
New cards

Violations of consistency assumption

The observed outcome for an individual is not equal to the potential outcome when the exposure is set to the observed exposure value.

Violated when two conditions hold.

  1. There is various ways in which an individual qualifies as (not) exposed.

  2. The exposure association outcome differs across the types of exposure.

40
New cards

Solution to the violations of consistency

Use a more precise definition of the exposure values of interest.

But don’t take too far because definitions come at the cost of generalizability.

41
New cards

Positivity assumption

When defining casual effects based on potential outcomes, we essentially assume that one individual (with all their background characteristics) has a positive probability of taking on both exposure values.

Holds when all probabilities are nonzero.

42
New cards

Violation of Positivity

The probability to observe an exposure level of interest based on a specific combination of confounder values equal to zero.

43
New cards

Solutions to violations in positivity

When one or more probability is zero we cannot make comparisons based on that combination of confounder values.

Solution is to restrict the comparisons to combinations of confounder values for which the probabilities of observing x and x* are both nonzero.

For observing drinking and pregnant women it would be beneficial to restrict the analysis to only women 21 years or older.

44
New cards

Directed Acyclic graphs

DAGs are graphs used to visualize the associations between variables. DAGs are informed by subject matter knowledge regarding the associations among the variables of interest.

45
New cards

DAGs and the Research Process

DAGs can help guide data collection and statistical analyses.

Ideally DAGs are developed at the start of the study

  1. Identify confounders that we need to collect data on.

  2. Identify mediators that we may want to collect data on.

  3. identify potential sources of selection bias.

46
New cards

Basic Features of DAGs

Directed paths- lines with one arrowhead shows that the exposure precedes the outcome and the exposure affects the outcome.

The absence of an arrow conveys that the exposure does not affect the outcome.

Acyclic- none of the paths form a closed loop

47
New cards

Reciprocal effects in DAGs

A reciprocal effect means that the exposure affects the outcome and that the outcome in turn affects the exposure.

48
New cards

Third variables

the two focal variables are exposure and outcome but third variables such as colliders, mediators, and confounders can be present.

49
New cards

Confounder

Affect the exposure and the outcome.

A change in the confounder leads to a change in the exposure and the outcome.

Create an open backdoor path between exposure and outcome.

If we leave backdoor path open then there will be confounding bias.

<p>Affect the exposure and the outcome. </p><p>A change in the confounder leads to a change in the exposure and the outcome. </p><p>Create an open backdoor path between exposure and outcome. </p><p>If we leave backdoor path open then there will be confounding bias. </p>
50
New cards

Confounding bias

-The exposure-outcome effect estimates will be larger or smaller than the true size of the exposure-outcome effect.

-The exposure-outcome effect estimate will be nonzero while in truth the exposure-outcome effect is zero.

51
New cards

Mediator

-Affected by the exposure and effects the outcome.

-is on the casual pathway from the exposure to the outcome.

This structure created by a mediator in a DAG is called a “chain”

Changes in the exposure lead to changes in the mediator, and changes in the mediator lead to changes in the outcome. Mechanisms through which the exposure affects the outcome.

<p>-Affected by the exposure and effects the outcome. </p><p>-is on the casual pathway from the exposure to the outcome. </p><p>This structure created by a mediator in a DAG is called a “chain”</p><p>Changes in the exposure lead to changes in the mediator, and changes in the mediator lead to changes in the outcome. Mechanisms through which the exposure affects the outcome. </p>
52
New cards

Mediators can be used to decompose

the total exposure outcome effect into

  1. An indirect effect through the mediator

  2. A direct effect not through the mediator.

53
New cards

Indirect and Direct pathway Mediator

Indirect effect: the effect of the exposure on the outcome through the mediator

Direct effect: the effect of the exposure on the outcome not through the mediator.

54
New cards

Overadjustment bias

In and of themselves mediators do not bias the exposure outcome effect estimate.

When adjusted in statistical analyses mediators cause “overadjustment bias”

After adjustment for a mediator, the exposure-outcome effect estimate represents the direct effect rather than the total effect.

55
New cards

Mediation analysis

A method for estimating direct and indirect effects with corresponding confidence intervals.

It is increasing used in epidemiology to better understand why exposures and outcomes are associated.

56
New cards

Collider

Affected by the exposure and affected by the outcome

is a common cause of the exposure and outcome.

Change in both the exposure and outcome lead to changes in the collider

a consequence of both the exposure and outcome.

<p>Affected by the exposure and affected by the outcome</p><p>is a common cause of the exposure and outcome.</p><p>Change in both the exposure and outcome lead to changes in the collider</p><p>a consequence of both the exposure and outcome. </p>
57
New cards

Collider Bias

Colliders do not bias nor explain the exposure-outcome effect.

However colliders can bias the exposure-outcome effect estimates when

  1. Selection into the study is restricted to certain collider variables.

  2. Adjustment is made for colliders in statistical analyses.

  3. Analyses are stratified by the collider.

58
New cards

Reverse causation

The hypothesized exposure is in truth consequence of the hypothesized outcome.

can occur in most study designs but cross-sectional study designs are especially prone to reverse causation.

<p>The hypothesized exposure is in truth consequence of the hypothesized outcome. </p><p>can occur in most study designs but cross-sectional study designs are especially prone to reverse causation. </p>
59
New cards

Sufficient cause framework

conceptualizes causation as a collection of multiple causes of outcomes.

A collection of component causes that when all present in one individual causes the outcome in the individual.

60
New cards

Component causes

single causes of outcome.

61
New cards

Rothman’s sufficient component cause model

sufficient cause represents mechanisms that lead to occurence of the outcome.

62
New cards

Causal Pies

Sufficient causes are visualized using pie charts.

Each pie chart represents a sufficient cause and each slice within the pie chart represents a component cause.

63
New cards

Confounding bias is present

The effect estimate will be larger, smaller, of opposite direction, or falsely nonull compared to the true effect.

64
New cards

True exposure-outcome not equal to null value

The exposure outcome estimate will be larger, smaller, or of opposite sign compared to the true exposure outcome effect.

65
New cards

Overestimation due to confounding

When the exposure outcome effect estimate is larger than the true exposure-outcome effect.

Also called bias away from the null.

Occurs when the combined influence of the confounder exposure and confounder effect is the same in direction as the true exposure outcome effect.

66
New cards

Underestimation due to confounding

when exposure outcome effect is smaller than the true exposure outcome effect.

also called bias towards the null

67
New cards

Qualitative confounding

When the exposure outcome effect estimate is in the opposite direction of the true exposure outcome effect.

68
New cards

Nonnull

If the true exposure outcome effect= the null value

69
New cards

Marginal effect estimate

The effect estimate applies to the whole study population.

70
New cards

Conditional effect estimate

The effect estimate applies only to a subset of individuals with specific confounder values.

71
New cards

Design stages to minimize confounding

Randomization

Restriction

Matching

72
New cards

Randomization

Random assignment to the intervention group or control group

The intervention and control group will be exchangeable in all apsects other than the intervention/control condition.

73
New cards

Randomization strengths

Targets both known and unknown confounders because in expectation any factors other than exposure will be equally distributed across the two groups.

Works for confounders on any measurement scale.

74
New cards

Randomization limitations

works in expectation due to sampling variability confounders may still be unequally distributed across groups in any one study.

Protocol violations and drop outs might result in unequal distributions of confounders across the intervention and control group.

It is not ethical to randomize individuals to harmful exposures.

75
New cards

Restriction

Restriction of the study population to one specific level of the confounder.

It eliminates the variation in the confounder, so within the restricted sample it no longer effects the exposure and outcome.

Results in conditional effect estimates.

76
New cards

Restriction strengths

Most effective way to prevent confounding by a known confounder.

77
New cards

Restriction limiations

Does not prevent confounding by unknown confounders.

Reduces generalizability of the findings.

Only works if there is truly no variation in the confounder in the restricted sample.

May not be feasible when there are many confounders and or continuous confounders.

78
New cards

Matching

Compared groups are selected in a way that ensures the same distribution of confounders across groups.

Cohort studies: comparison group is selected in such that the distribution of the confounders matches the exposed group.

Case control studies: controls are selected in such that the distribution of the confounders matches that of the groups of cases.

79
New cards

Types of matching

Pair matching

Frequency matching

80
New cards

Pair matching

One to one individual matching based on potential confounders

For every exposed individual with certain combination of confounder values we include an unexposed individual (or controls) with the same combination of confounder values.

This ensures that distribution of confounder values is the same across exposure groups.

81
New cards

Frequency matching

Selection of groups in such a way that the distribution of confounders is the same across the two groups.

In each exposure group (or cases/controls) a similar proportion of individuals is included with certain confounder values.

82
New cards

Matching strengths

can increase efficiency and power in certain situations.

83
New cards

Matching limitations

Does not prevent confounding by unknown confounder.

May not be feasible and reduce generalizability when there are many confounders and or continuous variable confounders.

one on one matching we may not always be able to find a match.

84
New cards

Limitations of design stage methods to minimize confounding

It is not always feasible to apply design stage methods eg:

The exposure is harmful, so randomization is not possible

Restriction or matching is not feasible because there are too many confounders or continuous confounders.

Randomization or matching failed

Data was not collected with your specific study in mind.

85
New cards

Marginal effect estimate

The effect estimate applies to the whole study population.

86
New cards

Conditional effect estimate

The effect estimate applies only to a subset of individuals with specific confounder values.

87
New cards

Stratification

Perform the analyses separately for different strata of the confounder variable(s).

Results in conditional effect estimates the effect estimate applies only to individuals in that confounder stratum.

88
New cards

Mantel Haenszel methods

Recombine the effect estimates after stratification.

89
New cards

Inverse probability weighting

IPW provides weighted averages of stratum specific effect estimates.

Gives more weight to strata that have a larger number of participants in it.

Individuals are weighted by their inverse of the probability of their exposure value.

Creates a psuedo population in which there are no confounder exposure association.

90
New cards