ES

Sampling Concepts and Methods Notes

Context of Sampling in Everyday Communication with Numbers

  • Research workflow: address a research question or hypothesis

  • Identify variables: independent vs. dependent

  • Determine how to measure variables

  • Plan data collection

  • Purpose: Statistical inference to answer the research question

  • Scope: On whom? And how?

Key Concepts: Population, Sample, and Inference

  • Population: The entire pool from which a statistical sample is drawn

  • Sample: A subset of the population used for analysis

  • Sample Frame: A list of all individuals in the population who can be sampled; sometimes not fully attainable

  • Sampling: A process where a predetermined number of observations are taken from a larger population

  • Population parameter: A specific value that describes a characteristic of the population (unknown until measured)

  • Sample statistic: A numerical value calculated from the sample to estimate the population parameter

  • Statistical inference: Using sample data to make conclusions about the population

  • Confidence interval: A range of values used to estimate the population parameter

  • 95% confidence interval: A commonly used interval level; interpretation depends on repeated sampling

  • Interpretation is important: CI conveys uncertainty and reliability of the estimate

Complete Enumeration vs. Sampling

  • Complete enumeration (studying the population directly)

    • Strength: Provides a precise answer to the research question

    • Weakness: Time, energy, and money required

  • Sampling (studying a subset of the population)

    • Strength: Saves time, energy, and money

    • Weakness: Provides an estimate rather than an exact population value

  • Important generalization: With properly drawn random and representative samples, estimation can be very accurate; otherwise, estimates can be far from the truth

  • Sampling can be done using probabilistic or non-probabilistic techniques

Probability vs. Non-Probability Sampling

  • Probability sampling (random sampling)

    • Every member of the defined population has an equal chance of being selected

    • Also called unbiased or representative sampling

    • Requires a sampling frame (at least partially)

    • Goals: Generalize findings to a broader population; maximize representativeness

    • Common methods: Simple random, systematic, stratified

  • Non-probability sampling (non-random sampling)

    • Does not give all individuals equal chances of selection

    • Some members are more likely to be included than others

    • Also called biased or unrepresentative sampling

    • Often chosen to save resources or when generalization is not the primary goal

    • Common methods: Volunteer, Convenience, Purposive, Snowball, Quota

Probability Sampling Methods

  • A probability sampling method uses some form of random selection; every defined population member has an equal chance of being chosen

  • Requires a sampling frame (or partial frame)

  • Three main methods:

    • Simple Random Sampling

    • Systematic Random Sampling

    • Stratified Random Sampling

Simple Random Sampling

  • Basic technique: select a group of subjects from a larger group where every unit has an equal chance of being selected

  • 3 steps:

    1. Get a list of everyone in the population (with identifiers)

    2. Generate appropriate random numbers (e.g., using random number generators)

    3. Select individuals whose identifiers match the random numbers

Systematic Random Sampling

  • Sample members are selected according to a random starting point and a fixed, periodic interval

  • Four steps:

    1. Get a list of everyone in the population

      • Calculate skip interval = Population size/sample size

      • k = N / n (formula)

        k = The skip interval (or sampling interval).

      • N = The total population size.

      • n = The desired sample size. 

      • To select a sample, randomly choose a starting point between 1 and k, then select every k-th element from the population until the sample size n is achieved.

    2. Pick a random starting point between 1 and the skip interval

    3. A constant interval is selected to facilitate participant
      selection
      8,18,28,38,48,...

      This method is known as systematic sampling and is effective for producing a representative sample when the population is ordered.

Stratified Random Sampling

  • Population can be partitioned into subpopulations (strata) of similar units

  • Within each stratum, apply the same random selection process as simple random sampling

  • Rationale: ensures representation from each subpopulation

Non-Probability Sampling Methods

  • Non-probability sampling: samples are gathered in a process that does not give all individuals equal chances of being selected

  • Five common methods:

    • Volunteer Sampling

    • Convenience Sampling

    • Purposive Sampling

    • Snowball Sampling

    • Quota Sampling

Volunteer Sampling

  • Participants self-select into the study

  • Often those with a strong interest in the topic

  • Examples: Research on healing power of prayer; firearms regulation surveys conducted by phone

Convenience Sampling

  • Also called grab, accidental, opportunity, or haphazard sampling

  • Sample drawn from the part of the population closest at hand

  • Example: Interviewing people outside a coffee shop

Purposive (Judgmental) Sampling

  • Selecting participants based on specific characteristics or study objectives

  • Nonrandom

  • Examples: Attending political rallies for interviews; Native Hawaiian family narratives

Snowball (Network) Sampling

  • Existing subjects recruit future subjects from among their acquaintances

  • Useful when potential participants are hard to locate

  • Example: Researching experiences in Alcoholics Anonymous (AA)

Quota Sampling

  • Assemble a sample that has the same proportions as the population for known characteristics (demographics, etc.)

  • Similar to stratified sampling but non-random

  • Example: Studying experiences of different ethnic groups with discrimination

  • Important note: Non-random selection and no use of a sampling frame

Strengths and Weaknesses of Non-Probability Sampling

Strengths

  • Save time, energy, and money

  • Convenient and often feasible

Weaknesses

  • Not all individuals have equal chances of being selected

  • Results are not generally generalizable to a broader population

  • Example: A study based on a sample of UH undergraduates may not generalize to U.S. adults

  • The sample can be systematically different from the population (bias)

  • May over- or under-represent certain outcomes

  • Limited by resources (time, energy, money, etc.) and lack of a sampling frame

  • Not inherently bad; suitability depends on research objective

  • Example in practice: Hawaiian Identity through family narratives (topic-sensitive, not necessarily generalizable)

Concrete Example: UH Manoa Sample Scenario

  • Population (N): 17,490

  • Sample size (n): 100

  • Sample gender breakdown: Male 60%, Female 40%

  • Population gender breakdown: Male 60%, Female 40%

  • Example numbers: 10,494 males (60%), 6,996 females (40%) in population; 60 males, 40 females in the sample

  • Purpose of the example: show proportional stratified-like allocation within a simple random framework

Practical Implications and Takeaways

  • Probability sampling yields representative samples when frames exist and sampling is executed properly

  • Randomized methods (simple, systematic, stratified) enable generalization and accurate statistical inference if framed correctly

  • Non-probability methods are valuable for feasibility, exploratory work, or when generalization is not the primary goal

  • Always consider the objective of the study when choosing a sampling method

Recap and Core Principles

  • Complete enumeration vs. sampling: trade-off between precision and resources

  • Probability methods (Simple, Systematic, Stratified): aim for generalizability and representativeness; require frames

  • Non-probability methods (Volunteer, Convenience, Purposive, Snowball, Quota): resource-efficient; limit generalizability

  • Key terms to remember: Population, Sample, Sampling Frame, Population Parameter, Sample Statistic, Confidence Interval (e.g., 95%), and the importance of interpretation

  • Fundamental formulas:

    • Skip interval in systematic sampling:
      Population size/sample size

    • Skip interval in systematic sampling: Population size / Sample size; used to determine how many elements to skip in the sampling process. These concepts form the basis for understanding sampling techniques and their application in research.

  • Practical note: Always align sampling design with research goals, resource constraints, and the level of generalizability required for the study