Notes on Biostatistics
INTRODUCTION TO BIOSTATISTICS
Biostatistics is a crucial branch of statistics focused on the application of statistical methods to health science information.
Facilitates data collection, representation, analysis, and decision-making in the context of health.
DEFINITION OF STATISTICS
Statistics: The collection, organization, summarization, presentation, and interpretation of data.
Examples of statistics include:
Number of road traffic accidents
Size of enrollment in educational institutions
Number of patients in a clinic.
It also involves calculated characteristics of a data set such as:
Average household size
Prevalence of smoking
Infant mortality rate
Crude death rate
Immunization coverage in children under 5 years.
DEFINITION OF BIOSTATISTICS
Biostatistics: A branch of statistics that deals with health sciences applications.
Focuses on:
Development of methods for data collection
Effective data presentation
Analysis and interpretation of results
Decision-making based on data analysis.
AIMS/OBJECTIVES OF STUDYING BIOSTATISTICS
Conducting investigations in health-related fields.
Managing research effectively.
Making correct inferences about populations based on sample data.
Understanding statistical claims made by others.
Planning, implementing, and evaluating health programs.
Evaluating health research proposals.
BRANCHES OF STATISTICS
1. Descriptive Statistics
Producing quantitative summaries of information.
Methods include:
Enumeration, organization, graphical presentation of data (like tables and charts).
2. Inferential Statistics
Making generalizations about a population based on sample information.
Involves:
Estimation
Hypothesis testing.
COMMON STATISTICAL TERMS
Population: Complete collection of all elements under study.
Census: Data collection from every member of the population.
Sample: A subset of the population with similar characteristics.
PARAMETERS AND STATISTICS
Parameter: Describes a characteristic of a population (e.g., population mean).
Statistic: Describes a characteristic of a sample (used to estimate parameters).
Examples of symbols:
Mean: ar{x} (sample mean),
u (population mean)Standard Deviation: s (sample), heta (population).
DATA AND VARIABLES
Data: Facts and figures collected for analysis.
Includes observations from measuring or counting.
Variables: Characteristics that can take on different values (e.g., age, sex, weight).
TYPES OF DATA
1. Qualitative Data
Descriptive/categorical without a numerical measure.
Dichotomous: Two categories (e.g., male/female).
Multichotomous: More than two categories (e.g., blood groups A, B, AB, O).
2. Quantitative Data
Numeric data that can be measured.
Discrete: Whole numbers (e.g., number of patients).
Continuous: Can take any value within a range (e.g., height).
CONTINUOUS DATA CATEGORIES
Interval Data
No true zero point (e.g., temperature).
Ratio Data
True zero point exists (e.g., weight, blood pressure).
TYPES OF VARIABLES
Independent variables: Cause or influence the dependent variable (e.g., teaching method).
Dependent variables: Measure outcomes or effects (e.g., student performance).
Intervening variables: Link independent and dependent variables (e.g., student motivation).
Moderator variables: Affect relationships between variables.
Control variables: Minimize extraneous variable effects.
Extraneous variables: Unrelated to the study but could influence outcomes.
MEASUREMENT OF VARIABLES
Levels of measurement include:
Nominal: Categorization without order (e.g., gender).
Ordinal: Order present (e.g., satisfaction ratings).
Interval: Equal intervals with no absolute zero (e.g., temperature).
Ratio: Equal intervals with an absolute zero (e.g., weight).
CONCLUSION
Understanding biostatistics is essential for valid research and effective public health decision-making. It combines statistical principles with medical and health data analysis, thereby contributing to improved health outcomes.