Basic Concepts in Statistics

Population vs. Sample

  • Population: Refers to all the individuals or items within a defined group being studied.
    • Example: For an investigation of first-year students at UFS using a specific product A, the population consists of all students enrolled at UFS.
  • Sample: A subset of the population selected for observation and analysis.
    • Example: The subset would be first-year students at UFS who are using product A.

Parameter vs. Statistic

  • Parameter: A statistical measure that describes a characteristic of a population.
    • Example: The average height of all first-year UFS students is a parameter.
  • Statistic: A statistical measure that describes a characteristic of a sample.
    • Example: The average height of a sample of 30 first-year UFS students serves as a statistic.
  • Key Relationship: Statistics are derived from samples; parameters are derived from populations.

Types of Data

  • Quantitative Data: Represents measurable quantities and can be expressed as numbers.

    • Discrete Data: Specific, countable numerical values that cannot be divided.
    • Example: Number of students in a class (e.g., 25 students), pages in a book (e.g., 200 pages), cash withdrawal amounts (e.g., $40).
    • Continuous Data: Numerical values that can take on any value within a range, including decimals.
    • Example: Height (e.g., 1.75 m), weight (e.g., 70.2 kg), or time (e.g., 5.3 seconds).
  • Qualitative Data: Descriptive data that can be categorized but not measured numerically.

    • Nominal Scale: Data that cannot be ranked.
    • Example: Gender (options: male or female), where male might be coded as 1 and female as 2.
    • Ordinal Scale: Data that can be ranked.
    • Example: Customer satisfaction rated as bad, average, or good, where bad = 1, average = 2, good = 3.

Sampling Methods

  • Sampling is a technique used to select a portion of the population for analysis and conclusions.
Probability Sampling
  • Definition: Probability sampling involves random selection, giving each member of the population an equal chance of being included.
  • Advantages: Reduces bias, leading to more valid conclusions.
Simple Random Sampling
  • Definition: A basic sampling technique where every individual in the population has an equal chance of being chosen.
    • Method: Use a random number table (e.g., Table on page 555 of the textbook).
    • Process:
    • Determine the population size (e.g., 600).
    • Identify the range for valid random numbers (e.g., three-digit numbers).
    • Extract numbers from the random number table, discarding those outside the range of the population size.
    • Example Output: If the random selections were 546, discard 624, accept 305, etc. until the desired sample size is reached.
Stratified Random Sampling
  • Definition: Divides the population into strata (subgroups) and random samples are taken from each stratum.
    • Process:
    • Identify and categorize population into different strata (e.g., universities A, B, C).
    • Determine total population size and calculate the proportion for each stratum based on its size relative to the total.
    • Example Calculation: For sample size of 100 from a total of 600 population (200, 300, 100):
      • Sample for university A = (200/600) × 100 = 33
      • Repeat for others.
Systematic Sampling
  • Definition: Involves selecting every k-th individual from a list or group.
    • K Value Calculation: K = rac{Population ext{ size}}{Sample ext{ size}}
    • Example:
    • If population size is 500 and sample size is 100, then K = rac{500}{100} = 5.
    • Select every 5th individual.
    • Method:
    • Using random number for the starting point, add K to identify subsequent elements to include in the sample.
    • Example: Start at 50, yielding elements 50, 55, 60, etc. until sufficient sample size is reached.

Summary of Key Points

  • Understand the differentiation between population (entire group) and sample (subset).
  • Grasp the concepts of parameter (population measure) vs. statistic (sample measure).
  • Familiarity with types of data, distinguishing between quantitative (numerical) and qualitative (categorical) data, is crucial.
  • Mastery of sampling methods—simple random, stratified, and systematic sampling—ensures valid research conclusions.