Statistics 2 Lecture 1

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 10

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

11 Terms

1

Criteria for establishing causality: Jon Stuart Mill

  • We can only argue that B is caused by A if:

    • There is a relationship between A and B (association)

    • B bust take place after A (appropriate time order)

    • The association between A and B is not explained by other factors (elimination of alternative explanations)

New cards
2

Eliminate alternative explanations

  • Can (in part) be achieved by controlling for other variables → eliminate their effect

  • Two ways:

    • Experimental control

    • Statistical control

New cards
3

Eliminate alternative explanations: Experimental control

  • In the research design

  • RCT often considered the gold standard

    • Time order manipulated (Crit. 2)

    • Alternative explanations (partially) excluded through randomization (Crit. 3)

  • Both observable and non-observable characteristics must be equal

  • → Feasible and realistic?

New cards
4

Eliminate alternative explanations: Statistical control

  • In the data-analysis strategy

  • Option 1: Examine x-y relationship within subgroups (based on other variables) → often unrealistic

  • Option 2: include alternative explanations in your statistical model

New cards
5

Multivariate associations

  • Important to recognize relevant alternative explanations → Know your theory

  • And adjust your statistical analyses and interpretation accordingly → Know your statistics

  • So that you can avoid biased results due to lurking variables: Variables that are not included in a study, but do explain or influence the association under study

New cards
6

Multivariate associations: Spurious associations

  • When both variables are also related to a third variable and the association between x and y disappears (mostly) when controlling for this third variable

  • → Estimated association between variables can thus change dramatically depending on the data analysis strategy

  • Association between x and y disappears

<ul><li><p>When both variables are also related to a third variable and the association between x and y disappears (mostly) when controlling for this third variable</p></li><li><p>→ Estimated association between variables can thus change dramatically depending on the data analysis strategy</p></li><li><p>Association between x and y disappears </p></li></ul><p></p>
New cards
7

Multivariate associations: Suppression

Sometimes we find (almost) no association between x and y until we control for a third variable

<p>Sometimes we find (almost) no association between x and y until we control for a third variable </p>
New cards
8

Multivariate association: Simpson’s paradox

  • Sometimes the relationship between x and y is even reserved within levels of a third variable

  • Example: Typing speed and typo’s

    • → On average: negative association - Experienced typists type faster and make few typos

    • → At the individual level: Positive association - the faster you type the more typo’s you make

<ul><li><p>Sometimes the relationship between x and y is even reserved within levels of a third variable </p></li><li><p>Example: Typing speed and typo’s</p><ul><li><p>→ On average: negative association - Experienced typists type faster and make few typos</p></li><li><p>→ At the individual level: Positive association - the faster you type the more typo’s you make </p></li></ul></li></ul><p></p>
New cards
9

Multivariate associations: Chain relations

  • Another reason for a disappearing association between x1 and y when controlling for x2 is a chain relation (mediation):

  • x1 has an indirect effect on y, via mediator x2

  • In other words:

    • x1 causes x2

    • x2 causes y

  • Often a research topic in intervention studies:

    → Important to identify the ‘working mechanisms’ of an intervention

  • Example: The association between years of education and lifespan (x1) disappears or weakens when we account for income

<ul><li><p>Another reason for a disappearing association between x1 and y when controlling for x2 is a <strong>chain relation </strong>(<strong><em>mediation</em></strong>)<strong>:</strong></p></li><li><p>x1 has an indirect effect on y, via <strong>mediator </strong>x2</p></li><li><p><em>In other words:</em></p><ul><li><p><em>x1 causes x2</em></p></li><li><p><em>x2 causes y</em></p></li></ul></li><li><p>Often a research topic in intervention studies:</p><p>→ Important to identify the <em>‘working mechanisms</em>’ of an intervention</p></li><li><p>Example: The association between years of education and lifespan (x1) disappears or weakens when we account for income</p></li></ul><p></p>
New cards
10

Multivariate associations: Statistical Interaction

  • In addition, we often see interaction effects between predictors: The association between x1 and y differs across levels of x2

  • Various options:

    • No association between x and y

      • But it does exist at subpopulations based on x2.x1

      • For example: Positive and negative effects in subpopulations cancel each other out.

    • Positive association between x and y

      • But different strengths or even negative/non-existing within subpopulation based on x2

    • Negative association between x and y

      • But different strengths or even positive or non-existing within subpopulation based on x2

  • Again: The average x – y association does not necessarily reflect the association in all subpopulations!

<ul><li><p>In addition, we often see <strong>interaction effects </strong>between predictors: The association between x1 and y differs across levels of x2</p></li><li><p><strong>Various options:</strong></p><ul><li><p>No association between x and y</p><ul><li><p>But it does exist at subpopulations based on x2.<span style="color: rgb(255, 255, 255)">x1</span></p></li><li><p>For example: Positive and negative effects in subpopulations cancel each other out.</p></li></ul></li><li><p>Positive association between x and y</p><ul><li><p><em>But different strengths or even negative/non-existing within subpopulation based on x2</em></p></li></ul></li><li><p>Negative association between x and y</p><ul><li><p><em>But different strengths or even positive or non-existing within subpopulation based on x2</em></p></li></ul></li></ul></li><li><p>Again: The average x – y association <strong>does not necessarily reflect the association in all subpopulations</strong>!</p></li></ul><p></p>
New cards
11

Multivariate associations: Multiple causes

  • However, differences in a criterion variable usually have multiple causes:

    • Several variables together explain differences in y

  • These predictors are mostly correlated:

    • This is called confounding.

    • The estimated association between x and y changes when an extra x is added to the regression model.

    • For example, for spurious associations, suppression and Simpson’s paradox.

      And sometimes not correlated:

    • The estimated association between x and y does not change when the extra x is added to the regression model.

    • Nevertheless, there may be statistical interaction

<ul><li><p>However, differences in a criterion variable usually have <strong>multiple causes</strong>:</p><ul><li><p>Several variables <em>together </em>explain differences in y</p></li></ul></li><li><p>These predictors are <em>mostly </em><strong>correlated:</strong></p><ul><li><p>This is called <em>confounding.</em></p></li><li><p>The estimated association between x and y changes when an extra x is added to the regression model.</p></li><li><p>For example, for <em>spurious associations, suppression </em>and <em>Simpson’s paradox.</em></p><p>And <em>sometimes </em><strong>not correlated:</strong></p></li></ul><ul><li><p>The estimated association between x and y does not change when the extra <em>x </em>is added to the regression model.</p></li><li><p>Nevertheless, there may be statistical interaction</p></li></ul></li></ul><p></p>
New cards
robot