Lecture 8: Confidence Intervals 2: CIs for Proportions and Means (2.17.25)

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 13

flashcard set

Earn XP

Description and Tags

Feb 17, 2025 Yuan Fang, PhD

14 Terms

1

CI for the population mean μ

Cl = point estimate ± critical value (at a confidence level) x SE (point estimate)

Higher confidence levels have larger z values, which translate to larger margins of error and wider CIs

If you use 99% confidence rather than 95% confidence, you will have a wider interval

i.e. If you want more certainty in your estimates, you will need a wider

This formula works for a large sample size. For smaller sample size, we use t-distribution to obtain the critical value instead of normal distribution

  • The t distribution is another probability model for a continuous variable

  • The t distribution takes a slightly different shape depending on the exact sample size

    • t distributions are indexed by degrees of freedom (df) which is defined as n - 1

    • If df = ∞, the t distribution is the same as the normal distribution

  • It is important to note that appropriate use of the t distribution assumes that the variable of interest is approximately normally distributed

  • Specifically, the t values for the CIs are larger for smaller samples, resulting in larger margins of error

    • i.e., there is more imprecision with small samples

<p>Cl = <span style="color: purple">point estimate</span> ± <span style="color: red">critical value</span> (at a <span style="color: blue">confidence level</span>) x <span style="color: green">SE (point estimate)</span></p><p>Higher confidence levels have larger z values, which translate to larger margins of error and wider CIs</p><p>If you use 99% confidence rather than 95% confidence, you will have a wider interval</p><p>i.e. If you want more certainty in your estimates, you will need a wider</p><p>This formula works for a <span style="color: blue"><strong>large sample size</strong></span>. For <span style="color: red">smaller sample size</span>, we use <span style="color: red"><em>t</em>-distribution</span> to obtain the critical value instead of normal distribution</p><ul><li><p>The <em>t</em> distribution is another probability model for a continuous variable</p></li><li><p>The <em>t</em> distribution takes a slightly different shape <span style="color: red">depending on the exact sample size</span></p><ul><li><p><em>t</em> distributions are indexed by <span style="color: red">degrees of freedom</span> (df) which is defined as <span style="color: red"><em>n</em> - 1</span></p></li><li><p>If df = <span>∞, the t distribution is the same as the normal distribution</span></p></li></ul></li><li><p>It is important to note that appropriate use of the <em>t</em> distribution assumes that the <span style="color: blue">variable of interest is approximately normally distributed</span></p></li><li><p>Specifically, the t values for the CIs are larger for smaller samples, resulting in larger margins of error</p><ul><li><p>i.e., there is more imprecision with small samples</p></li></ul></li></ul><p></p>
New cards
2

Critical Values of the t Distribution

  • Table entries represent values from t distribution with upper tail area equal to α

New cards
3

Dichotomous outcome

  • Outcome variable is dichotomous (p = population proportion)

  • Risks = Number of subjects with disease / Total number of subjects

  • One study sample

  • Data

    • On each participant, measure outcome (yes/no)

    • n, x = number of positive responses

<ul><li><p>Outcome variable is dichotomous (p = population proportion)</p></li><li><p><span style="color: red">Risks = Number of subjects with disease / Total number of subjects</span></p></li><li><p>One study sample</p></li><li><p>Data</p><ul><li><p>On each participant, measure outcome (yes/no)</p></li><li><p>n, x = number of positive responses</p></li></ul></li></ul><p></p>
New cards
4

Confidence Intervals for p

  • Dichotomous outcome

  • One sample

  • Do NOT use the t-interval distribution here

<ul><li><p>Dichotomous outcome</p></li><li><p>One sample</p></li><li><p><span style="color: red">Do NOT use the t-interval distribution here</span></p></li></ul><p></p>
New cards
5

Connecting scientific question to statistical comparison

Cohort study

  • Are people on a drug to reduce blood pressure experiencing any benefit?

  • Continuous data (blood pressure). Blood pressure should be reduced over time (end of study - measurement - baseline measurement)

  • μd (average of differences in measurements, or mean differences) should be lower than 0. If no effect, μd = 0

Randomized controlled trial

  • Are people on a new drug to reduce blood pressure doing better than people on placebo?

  • Continuous data (blood pressure). Blood pressure should be compared at end of study between the placebo group and treatment group.

  • μt - μp = 0 or μt / μp = 1 means on average, no effect

  • On average, no difference between the treatment group and placebo group, so difference in average measurements, or difference on means

Randomized controlled trial

  • Do people on a new drug to reduce blood pressure have fewer adverse outcomes than people on the standard care?

  • Dichotomous data (adverse event). Number of adverse events should be compared between the new drug group vs. the standard of care drug group. Use proportions.

  • pt - pp = 0 or pt / pp = 1 means on average, no difference. Overall, no difference between the treatment group and placebo group

New cards
6

Comparison between two groups

  • Look at the difference in means between 2 groups

  • Outcome is continuous

    • SBP, weight, cholesterol

  • Two independent study samples

    • Difference in means μ1 - μ2

  • Data

    • On each participant, identify group and measure outcome

    • What we should have: n1, X1,s12 (or s1), n2, X2, s22 (or s2)

New cards
7

Two Independent Samples

Randomized Controlled Trial: Set of Subjects Who Meet Study Eligibility Criteria

Cohort Study: Set of Subjects Who Meet Study Inclusion Criteria

New cards
8

Confidence Interval for (μ1 - μ2)

  • If 0 is included in a confidence interval for a difference, then 0 is a possible difference

    • Equivalent is a possibility

    • Statistically not different

  • Because 95% CIs do not include 0, we conclude that there are statistically meaningful differences between means

    • This is primarily due to the large sample sizes

    • Are these clinically relevant or meaningful differences

  • Statistical significance: when null value is not in the confidence interval

    • Statistically significantly different from 0

    • This means observed difference is different form null beyond that expected by chance

    • 0 is the null value for a difference

  • Clinical significance: What difference is clinically meaningful? A difference you would amount for when counseling a patient or forming a recommendation

New cards
9

Samples are not independent; Matched or paired samples

  • Look at mean differences

  • Outcome is continuous

    • SBP, weight, cholesterol

  • Two matched or paired study samples

  • Data

    • On each participant, measure outcome under experimental condition

    • Interested in μd

    • Compute differences: x1 - x2

    • Used in crossover trials

    • Used to compare data before and after an intervention

New cards
10

Two Dependent/Matched Samples

Measures one subject serially in time or repreated under different experimental conditions

Differences can be negative too

New cards
11

Confidence Intervals μd: Confidence interval for mean difference

  • Continuous outcome

  • Two matched/paired

  • Because the samples are dependent, statistical techniques that account for the dependency must be used

  • Even though the samples are dependent, the differences themselves are independent

  • Therefore, the differences on their own can be viewed as an independent sample

    • Here, the sample size n is the number of distinct participants or distinct pairs, not the total number of observations (2n)

    • We can use SKIN and SUIT again

New cards
12

Comparison of proportions from two independent samples

  • Outcome is dichotomous

    • Result of surgery (success, failure)

    • Cancer remission (yes/no)

  • Two independent study samples (RCT, cohort study, etc.)

  • Compare risks from the two independent samples

  • Data

    • One each participant, identify group and measure outcome (yes/no)

n1, 1, n2, p̂2

New cards
13

Risk difference

  • Risk difference: difference in proportions (risks) between comparison groups

    • proportion from group 1 - proportion from group 2

    • Absolute difference (p1 - p2)

    • Absolute Risk Reduction (ARR): reference group (e.g., unexposed persons, persons without a risk factor, or persons assigned to the control group in a clinical trial setting) subtract the other group, i.e. risk in control group - risk in treatment group

New cards
14

Relative Risk

  • Relative risk (aka risk ratio): ratio of proportions (risks)

    • Generally, the reference group (e.g., unexposed persons, persons without a risk factor, or persons assigned to the control group in a clinical trial setting) is considered the denominator of the ratio

proportion from group 1 / proportion from group 2

New cards
robot