Biostatistics: The application of statistical methods to the field of biological sciences and medicine.
Statistics: Art and science of data that involves:
Planning research
Collecting data
Describing data
Summarizing and presenting data
Analyzing data
Interpreting results
Making decisions or discovering new knowledge
Biostatistics employs statistical tools across various fields including:
Business
Education
Psychology
Agriculture
Economics
It specifically refers to the application of statistical methods to biological and medical data.
Enhance the intellectual content of data.
Organize data into comprehensible formats.
Base validity on the test of experience.
Identify real problems and foster curiosity.
Pose a specific question.
Design methods for data collection.
Collect data and interpret the results:
Analysis and summary of data to address the original question.
Data: Raw material for statistics; defined as figures obtained by:
Counting (e.g., patient numbers)
Measurement (e.g., temperature, weight, blood pressure).
1. Routinely Kept Records:
Hospital medical records are rich sources of patient data.
2. External Sources:
Published reports, data banks, and research literature may contain pre-existing data relevant to research questions.
3. Surveys:
Can be used to gather specific information (e.g., transportation modes for clinic visits).
4. Experiments:
Data is collected as a result of experiments (e.g., testing strategies for patient compliance).
Variable: A characteristic that can take on different values across subjects, such as:
Heart rate
Heights of adult males
Weights of preschool children
Patient ages.
Constant: An observation that does not vary (e.g., number of fingers, number of eyes).
Different classifications for variables:
Categorical Measurements: Unordered categories.
Ordinal Measurements: Ranked categories.
Quantitative Measurements: Ordered intervals with equal spacing.
Quantitative Variables: Numeric scales (e.g., height, weight).
Qualitative Variables: Characteristics that may not be measured but can be ranked (e.g., sex, blood group).
Discrete Variable: Has gaps in possible values (e.g., daily hospital admissions).
Continuous Variable: Can take any value within a given interval (e.g., height).
Qualitative Variables (Categorical or Nominal): Often explored in categories.
Binary Variables (Two Categories): Attributes with a yes/no or presence/absence classification (e.g., male/female, disease/no disease).
1. Nominal: Variables without a rank (e.g., blood group).
2. Ordinal: Variables that can be ordered (e.g., stages of cancer).
Quantitative variables can be converted into categorical variables for analysis.
Systolic Blood Pressure Categorization:
140 mm Hg: Hypertensive
90-140 mm Hg: Normal
<90 mm Hg: Hypotensive
Blood Glucose Categorization:
120: Hyperglycemia
80-120: Normal
<80: Hypoglycemia
Address two main types of inaccuracies:
Imprecision: Random variability in results.
Bias: Systematic deviation from the truth.
Population: The total collection of values of a variable that is of interest (e.g., weights of all children in a school).
Populations can be finite or infinite.
Sample: A subset taken from a population (e.g., weights of a fraction of children).