Introduction of Biostatistics

Introduction to Biostatistics and Epidemiology

A. Biostatistics

  • Statistics Overview: Branch of mathematics focusing on data collection, organization, presentation, analysis, and interpretation.

    • Mathematical Statistics: Development of statistical inference methods; requires advanced mathematical knowledge.

    • Applied Statistics: Application of mathematical methods across various fields (e.g., economics, psychology, public health).

Branches of Statistics

  • Descriptive Statistics: Summarizes and presents data to simplify analysis.

  • Inferential Statistics: Generalizes findings from samples to populations, including estimations, hypothesis testing, and predictions.

Examples of Statistics Classification

  • Descriptive: "In 2011, there were 34 deaths from avian flu."

  • Inferential: "In 2025, the world population is predicted to be 8 billion."

  • More examples listed in the text demonstrate understanding of both statistics types.

Classification of Statistics

  • Parametric Statistics: Assumes underlying normal data distribution; includes hypothesis testing based on population means.

    • Assumptions: Normality, homoscedasticity, interval/ratio level measurements, absence of outliers.

  • Nonparametric Statistics: Does not assume any distribution; requires larger sample sizes to achieve equivalent power to parametric tests.

B. Epidemiology

  • Definition: Study of how diseases and health conditions affect populations, aiming to understand and mitigate health issues.

    • Historical Context: Originally focused on infectious diseases; now includes chronic diseases and health-related conditions.

  • Key Concepts:

    • Distribution: Analysis of when, where, and in whom health issues occur.

    • Determinants: Factors influencing health problems, including biological, social, and behavioral factors.

Terminologies in Biostatistics

  • Variable: Characteristic that can vary (e.g., height, weight).

  • Data: Values represented by a variable; sets of measurements are called datasets.

  • Population: Complete set of subjects under study; Sample: Subset taken from the population.

  • Census: Data collection from a complete population; Parameter & Statistic: Measures from populations and samples respectively.

C. Measurements

  • Levels of Measurement:

    • Nominal: Categories without order (e.g., gender).

    • Ordinal: Categories with rank but no precise differences (e.g., letter grades).

    • Interval: Ordered categories with defined differences but no true zero (e.g., temperature).

    • Ratio: Defined differences with true zero (e.g., weight).

  • Classification of variables by type, with examples.

D. Variables

  • Experimental Variables:

    • Response Variable (dependent): Affected by other variables.

    • Explanatory Variable (independent): Affects the response variable.

  • Correlation Studies: Predictor and criterion variables defined but correlation research does not classify them as independent or dependent.

Qualitative and Quantitative Variables

  • Qualitative: Distinct categories (e.g., gender).

  • Quantitative:

    • Discrete: Countable (e.g., number of children).

    • Continuous: Measurable with infinite values (e.g., height).

E. Summation Notation Rules

  • Summation rules described related to constants and variables.

F. Classifications of Data

  • Internal Data: Related to the organization’s activities.

  • External Data: Collected from outside the organization.

  • Statistical Data: Published figures by institutions; Nonstatistical Data: Information without quantitative representation.

G. Sources of Data

  • Data Collection Methods: Observational, experimental; importance of both.

  • Types of Studies: Cross-sectional, retrospective, prospective outlined with definitions and examples.

  • Clinical Trials: Defined as experiments to evaluate interventions versus controls.

H. Methods of Data Collection

  • Various methods include questionnaires, interviews, registration, observation, and experiments, with an exploration of each.

  • Questionnaires: Closed-ended and open-ended types with characteristics outlined.

  • Interview Method: Various approaches discussed.

  • Observation and Registration Methods: Explained with examples.

I. Sampling Methods

  • Definitions and differences between target population and study population, with examples.

  • Criteria for Sampling Design: Representation, reliability, practicality, efficiency.

  • Probability Sampling Methods: Simple random, systematic, stratified, cluster, and multistage sampling detailed with procedures and examples.

J. Non-Probability Sampling Methods

  • Overview of diverse sampling techniques like convenience, purposive, and snowball sampling.

  • Quota Sampling: Types detailed, including proportional and non-proportional methods.

  • Additional examples provided for clarity.

K. Sample Size Determination

  • Defined importance of estimating the proper sample size; includes methodologies such as census, replicating, tables, and formula applications.

  • Power Analysis: Importance in determining the sample size while considering effect size and significance levels.

  • G*Power Calculator: Explained with steps for conducting power analysis and its uses for various research designs.