Study Notes on Sampling Methods & Surveys

Sampling Methods & Surveys

Overview

  • The lecture covers essential topics related to sampling methods and surveys, including:
    • Populations
    • Samples
    • Sampling Methods
    • Data Collection
    • Surveys
    • Bias

Classic Illustration

  • Questions often arise regarding large populations, but gathering information from the entire population can be impractical or impossible.
    • Instead, information is gathered from a smaller subset (sample) of the population.
  • Understanding the relationship between the population, sample, and inference is crucial.

Ideas Behind Sampling

  • Goal of Sampling: Select a sample that resembles the population as closely as possible, but in a smaller size.

Types of Samples

Stratified Random Sample vs. Cluster Sample
  • Stratified Random Sample:
    • Strata are groups that are similar within each group but not sampled entirely from all strata.
    • Example: When tasting a Boston Cream pie, randomly select bites from the cake, custard, and frosting, ensuring representation of each layer.
  • Cluster Sample:
    • Clusters differ within each group but are similar across groups; entire groups are sampled instead of individual members.
    • Example: Randomly select a vertical slice of the pie to taste all layers at once.

Generating Random Samples

  • Scenario: Estimating the average annual income of U.S. families from all 120 million households.
    • Simple Random Sample: Randomly select households from the full list.
    • Systematic Sample: Choose households at regular intervals (e.g., every 60,000th household).
    • Stratified Random Sample: Divide households by census regions and take a random sample from each.
    • Cluster Sample: Select entire census regions and include all households within the selected regions.

Census Regions and Divisions of the United States

  • A visual representation highlighting different census regions and their subdivisions.

Bad Sampling Methods

  • Convenience Sample: Obtained from individuals who are easiest to access, leading to possible misrepresentation of the broader population.
    • Example: Asking friends about their favorite hockey teams, which may not reflect the views of the total fanbase.
  • Voluntary Sample: Participants respond to a general invitation; these samples can lead to underrepresentation in survey results, introducing bias.

Forms of Bias

  • Sampling Bias: A systematic failure of sampling to represent the population accurately.
  • Non-sampling Bias:
    • Nonresponse Bias: Occurs when a significant number of selected participants do not respond to the survey.
    • Response Bias: Any aspect of the survey that influences participants' responses.

Examples of Survey Pitfalls

Vague Concepts
  • Scenario: A market research firm asked, "What is your favorite soap?" which led to ambiguous results due to an unclear term.
    • To improve, the question should clarify context (e.g., specify dish soap).
Central Tendency Bias
  • Scenario: Respondents are asked their opinion on sensitive topics like the death penalty using a 1-5 scale.
    • Respondents may avoid extremes, often choosing middle options. To eliminate this bias, force respondents to take a side by asking for a binary response (support/oppose).
Error-Prone Response Options
  • Scenario: Age categories in a poll (e.g., 30-40, 40-50) overlap, causing potential confusion.
    • Revision needed to create distinct ranges, such as 30-39 and 40-49.

Ethical Implications

  • The methodologies and designs of surveys carry ethical responsibilities, particularly regarding bias and transparency of intentions in data collection.
  • Ensuring accurate representation and minimizing bias is fundamental to ethical standards in research, which contributes to reliable data interpretation and conclusions.

Conclusion

  • Understanding the various sampling methods, their biases, and the implications of survey design is essential for accurate statistical analysis and interpretation. Proper methodologies lead to trustworthy data that inform policy, research, and decision-making processes.