book 3

Reliability and Statistical Power

  • Definition of Statistical Power: The likelihood of detecting a difference between experimental groups given a certain sample size.

  • Impact of Reliability on Statistical Power:

    • More reliable scales increase statistical power for a given sample size.

    • Improved reliability allows a smaller sample size to yield equivalent power compared to less reliable measures.

  • Sample Size Considerations:

    • A specified degree of confidence is needed to detect differences in experimental groups.

    • Increased sample size generally increases power.

    • Reliability-enhancing measures and larger samples both reduce error, enhancing power.

  • Factors Affecting Power Gains from Reliability:

    • Initial sample size.

    • Probability level for detecting a Type I error.

    • Effect size considered significant (e.g., mean difference).

    • Proportion of error variance attributed to unreliability versus other sources.

  • Example:

    • Setting a Type I error probability to .01, requiring a 10-point effect size, and an error variance of 100:

    • Sample size increases from 128 to 172 for power to increase from .80 to .90.

    • Reducing error variance from 100 to 75 yields the same power increase without adjusting sample size.

  • Correlation and Scale Reliability:

    • For a sample size of 50, two scales with reliabilities of .38 at a correlation of r = .24 yield significance at p < .10. Increasing reliability to .90 achieves significance at p < .01, while keeping it at .38 would require doubling the sample size.

  • Enhancing Reliability:

    • Increases in reliability can be achieved by increasing the number of items or the correlation between them.

    • More items or better item quality can enhance power similarly to larger sample sizes.

Administering Items to a Development Sample

  • Sample Size Guidance:

    • The consensus on what constitutes a "large" sample is vague, but Nunnally suggests 300 subjects.

    • Development of reliable scales can be achieved with fewer subjects depending on the number of items and scales being extracted.

  • Pitfalls of Small Sample Size:

    • Patterns of covariation among items may lack stability, leading to misleading assessments of internal consistency (alpha).

    • Small samples increase the likelihood of chance influencing item correlations, potentially excluding good items based purely on noise.

  • Risks of Nonrepresentativeness:

    • The development sample could fail to represent the target population, potentially skewing results.

    • Nonrepresentativeness can occur through:

    • Quantitative Nonrepresentativeness: Narrow range of attribute presence in the sample versus the larger population.

    • Qualitative Nonrepresentativeness: Different item meanings for the sample compared to the intended population (e.g., cultural differences impacting interpretation).

  • Consequences of Nonrepresentativeness:

    • A narrow range does not disqualify a sample but can yield imprecise scale means.

    • Qualitative differences in responses can misrepresent the underlying structure necessary for reliable scales.

    • Language and terminology must match the population's understanding for valid responses.

  • Focus Groups and Cognitive Interviews: Tools to discern how concepts are understood by participants, ensuring items are comprehensible and valid.

Conducting the Survey

  • Survey Method Options:

    • In-person interviews, telephone surveys, mail surveys, and Internet surveys.

    • In-person interviews yield highest response rates and better personal contact.

    • Telephone surveys are less expensive than in-person but still expensive with declining landline samples.

    • Mail surveys are low-cost but prone to nonresponse bias; Internet surveys are rising in popularity due to low cost.

  • Internet Survey Methods:

    • Can utilize links sent via e-mail, posted on known websites, or initial contact through mail.

    • Challenges with demographic representation persist as Internet utilization varies.

  • Myths about Internet Findings:

    • Internet samples can be diverse, and findings often align with traditional methods, dispelling myths about lower quality data.

Key Takeaways regarding Survey Research

  • Probability Sampling: Ensures each member has a known chance of selection; includes simple random, stratified random, and cluster sampling.

  • Sampling Bias: Occurs when selected samples misrepresent the population, most notably through nonresponse bias.

  • Minimizing Bias: Achieved by maximizing response rates through pre-notification, reminders, clear questionnaires, and incentives.

  • Survey Types: Can be conducted in various ways, each with respective advantages and limitations.

Measurement in the Broader Research Context

  • Scale Development Consideration:

    • Check for existing measurement tools before developing a new scale.

    • Use resources such as Mental Measurements Yearbook and Tests in Print for locating instruments.

  • Benefits of the Internet: Expands access to existing measurement instruments and information.

    • Examples of initiatives: PROMIS for health outcome assessments.

  • Evaluating Measurement Sources: Scrutiny of Internet-based findings and instruments is essential for reliability and validity.

Pre-Scale Development Steps

  • Focus Groups: Gather insights on how constructs are perceived; understand everyday vernacular for better tool construction.

  • Cognitive Interviewing: Evaluate how items are comprehended, clearing up confusion surrounding terms or item structure.

  • Mode of Administration: Choice affects data collection integrity; scales should ideally match their development mode for consistency.

  • Response Styles and Nonconstructive Variances: Must be considered in surveys to prevent skewed findings or misinterpretations.

Correlational Research

  • Definition: Nonexperimental research where two variables are measured but neither is manipulated.

  • When to Choose Correlational Research:

    • When no causal relationship is believed or manipulation is not feasible or ethical (e.g., measuring daily hassles without manipulating them).

  • Characteristics of Correlational Research:

    • Can include quantitative and categorical variables; distinction lies in whether manipulation occurs.

    • Examples of correlational research abound in examining relationships across various contexts.