Measures of Association

Chapter 13: Measures of Association

13.1 Getting Ready

  • In this chapter, the focus is on measuring the strength of relationships between two variables in a contingency table.

  • This foundational material supports subsequent chapters, all dedicated to different methods for assessing relationships between numeric variables.

  • Recommended Materials: Load the anes20 data set in R, and the DescTools and descr libraries. It’s also advised to use a calculator or R for calculations.

13.2 Going Beyond Chi-squared

  • Chi-squared test: A tool for measuring statistical significance, but inadequate for assessing relationship strength or effect size.

  • Strength of the relationship: Refers to how much the dependent variable's values depend on the independent variable's values.

    • If chi-squared is insignificant, the dependent variable’s outcomes vary randomly around expected outcomes (null hypothesis).

    • If significant, it confirms a relationship exists but not the extent of dependency between the variables.

  • Column percentages can provide insight but lack precision and uniform standards for interpretation.

    • Example: Difference in religious importance by education.

    • Education Data:

      • Rows indicate education levels and columns indicate the religious importance assigned by these educational categories.

      • 25.5% of respondents without a high school diploma assign low importance to religion, vs. 41% among graduates, resulting in a 15.5 percentage point increase.

      • Moderate importance has a 7 percentage point decrease from low to high education.

13.3 Measures of Association for Crosstabs

13.3.1 Cramer’s V
  • Measures of association summarize strength between two variables.

  • Cramer’s V is especially useful for nominal-level variables and is derived from the chi-squared statistic.

    • Formula:
      Cramer’s V = rac{ ext{sqrt}( ext{Chi-squared})}{N imes ext{min}( ext{rows} - 1, ext{columns} - 1)}

    • Interpretation: Ranges from 0 (no relationship) to 1 (perfect relationship).

    • Example Calculation: For religious importance by education with ase chi-squared value 108.2, N=8129,
      Cramer’s V = ext{sqrt}(108.2) / (8129 imes 2) = 0.082

13.3.2 Lambda
  • Lambda ( ) is a proportional reduction in error statistic, representing the error reduction from prediction using the independent variable.

    • Formula:
      ext{Lambda} = rac{E1 - E2}{E1}

    • Where:

    • E1 = Error from guessing without an independent variable.

    • E2 = Error from guessing with an independent variable.

  • Example: For region and religious importance, you can guess correctly more often with lambda calculations indicating approximately a 3.1% reduction in guessing error.

13.4 Ordinal Measures of Association

  • Cramer’s V and Lambda fail to incorporate directional context.

  • Example: Discussion of educational levels affecting religious importance has a negative trend, while age increases suggest a positive relationship.

13.4.1 Gamma
  • Gamma (γ): Measures positive versus negative pair rankings in a crosstab.

    • Formula:
      ext{Gamma} = rac{N_ ext{similar} - N_ ext{different}}{N_ ext{similar} + N_ ext{different}}

  • A positive gamma indicates that similar rankings dominate, while negative indicates the opposite.

  • Applying this to prior examples allows for quantifying relationships concretely with gamma showing an inherent bias towards overstating relationships due to disregarding tied observations.

13.4.2 Tau-b and Tau-c
  • Both tau-b and tau-c correct for tied pairs in the calculation of ordinal association, ensuring more accurate relationship assessments.

    • Suitable for square (tau-b) vs. rectangular (tau-c) tables, both scale from -1 (perfect negative) to +1 (perfect positive).

13.5 Revisiting the Gender Gap in Abortion Attitudes

  • Previous chapter examined gender gaps in attitudes towards abortion using chi-square tests, finding no significant differences in illegal views but minor differences in beliefs about choice.

  • By using crosstabs, a continuum from illegal to choice can reveal nuanced understandings of sexual preferences and implications of less significant statistical associations.

13.6 Next Steps

  • Subsequent chapters shift towards numeric variable relationships, utilizing scatter plots and correlation coefficients.

13.7 Exercises

13.7.1 Concepts and Calculations
  • Analyze relationships, including percentages of political identification separated by sex.

  • Examine and calculate Cramer’s V, Lambda, and Gamma for further insights.

13.7.2 R Problems
  • Experiment with independent variables influencing attitudes towards the Supreme Court's abortion stance, leveraging crosstab functions clearly outlined.