Chapter 1 (2024 -1)

Chapter 1: Introduction to Biostatistics

What is Statistics?

  • Definition: A field of study focused on:

    • Collecting data

    • Organizing data

    • Summarizing data

    • Analyzing data

    • Drawing inferences from data.

Understanding Data

  • Data: Numbers obtained from measurements or counting; serve as the raw material for statistics.

Types of Statistics

  • Descriptive Statistics: Methods for organizing, presenting, and summarizing data.

  • Inferential Statistics: Methods for making decisions about a population based on analysis of a sample.

Course Hierarchy

  • Descriptive Statistics:

    • Chapter 1: Introduction to Biostatistics

    • Chapter 2: Descriptive Statistics

    • Chapter 15: Vital Statistics

    • Chapter 12: Chi-square Distribution

  • Probability Theory:

    • Chapter 3: Basic Probability Concepts

    • Chapter 4: Probability Distributions

    • Chapter 5: Important Sampling Distributions

  • Inferential Statistics:

    • Chapter 7: Hypothesis Testing

    • Chapter 9: Linear Regression and Correlation

    • Chapter 13: Nonparametric Statistics

Sources of Data

  • Types of Sources:

    • Day-to-day logs of organizational transactions.

    • Surveys (questionnaires).

    • Experiments (medical strategies).

    • External sources (published reports, data banks).

Statistics vs. Biostatistics

  • Statistics: Tools are applied across various fields.

  • Biostatistics: Application of statistical tools in biological sciences and medicine.

Data vs. Variables

  • Variable: Observable characteristic that varies among entities.

    • Quantitative: Measurable characteristics (e.g., heights).

    • Qualitative: Categorical characteristics (e.g., gender).

Random Variables

  • Definition: A quantitative variable whose values arise by chance.

    • Discrete Random Variable: Has distinct gaps in possible values (e.g., daily admissions to a hospital).

    • Continuous Random Variable: No gaps in values (e.g., height).

Population vs. Sample

  • Population: The largest collection of entities of interest at a given time.

  • Sample: A subset of the population.

Why Use Samples?

  • Cost and time efficiency compared to studying the whole population.

  • Some variables may involve destructive measuring methods.

  • Populations may be infinite.

Measurement and Measurement Scales

  • Measurement: Assigning numbers to objects/events based on rules.

  • Types of Measurement:

    • Qualitative: Nominal and Ordinal Scales.

    • Quantitative: Interval and Ratio Scales.

Nominal vs. Ordinal Scales

  • Nominal Scale: Classification without rank (e.g., gender).

  • Ordinal Scale: Classification with rank, e.g., pain levels.

Interval vs. Ratio Scales

  • Interval Scale: Identifies order but has no true zero (e.g., temperature).

  • Ratio Scale: Identifies order with a true zero (e.g., weight).

Statistical Inference

  • Definition: Drawing conclusions about populations from sample information.

  • Research Study: Scientific study involving design and data analysis.

  • Experiments: Observations following specific manipulations.

Sampling Methods

  • Sampling with Replacement vs. without Replacement: Methodologies affecting sample selection.

  • Simple Random Sampling: Every member of the population has an equal chance of selection.

  • Systematic Random Sampling: Selection based on a fixed interval from a randomly chosen start.

  • Stratified Random Sampling: Population partitioned into strata; samples drawn from each stratum.

Importance of Accurate Measurement

  • Accuracy and Validity: Correctness of measurement.

  • Precision and Reliability: Consistency of measurement.

  • Treatment Group: Exposed to treatment.

  • Control Group: Not exposed to treatment.

Computers in Biostatistical Analysis

  • Benefits:

    • Fast and accurate calculations.

    • Random number generation capabilities.

    • Software like MS Excel/MegaStat for data analysis.

robot