Biostatistics Summary
Chapter 17: Biostatistics
- Biostatistics Definition
- The application of data analysis and interpretation in health care research.
- Typically relies on computers for statistical computations.
- Researchers may hire statisticians to ensure robust analysis.
- Data analysis aims to interpret results, answering research questions or testing hypotheses.
Data Analysis
- Purpose
- Applies statistical tests to organize, describe, and summarize data.
- Essential for verifying research questions or hypotheses.
Interpretation of Study Results
Elements of a Valid Study
- Adequate time duration
- Sufficient number of participants
- Appropriate measurements
- Correct statistical tests utilized.
Statistical Symbols
- Refer to textbook (Table 17-1, p. 211) for symbols and meanings.
Types of Data/Data Categorization
- Item of Data
- A singular piece of data.
- Data Set
- A collection of data items.
- QUANTITATIVE Data
- Numerical values (e.g., pocket depths, number of sealed teeth).
- Can be displayed as counts, percentages, and means.
- QUALITATIVE Data
- Non-numerical; cannot be quantified.
- Example: Patient survey responses on dental visit satisfaction.
Variables
- CONTINUOUS (QUANTITATIVE)
- Numeric, can be fractional.
- Infinite measurements on a continuum (e.g., height, weight).
- DISCRETE (QUANTITATIVE)
- Numeric whole numbers with finite values (e.g., number of children).
- CATEGORICAL (QUALITATIVE)
- Non-numeric, organized into groups (e.g., ethnicity, gender).
- DICHOTOMOUS (QUALITATIVE)
- Only two categories (e.g., male/female).
Scales of Measurement
- Importance
- Determines which statistics to apply for data analysis.
4 Scales:
- Nominal
- Categorizes data without rank (e.g., toothpaste types).
- Ordinal
- Data with order but unequal intervals (e.g., plague classification).
- Interval
- Equal distances without an absolute zero (e.g., temperature).
- Ratio
- Equal distances with an absolute zero (e.g., number of teeth).
Categories of Statistics
- Descriptive Statistics
- Summarizes quantitative data (central tendency, frequency tables).
- Not used for conclusions.
- Inferential Statistics
- Generalizes findings from samples to populations.
Measures of Central Tendency
- Mean
- Average, sensitive to outliers.
- Formula: M = rac{ ext{Sum of all scores} }{ ext{Total number of scores} } .
- Median
- Middle value when numbers are ordered.
- Mode
- Most frequently occurring score.
- Bimodal: Two modes; Multimodal: More than two modes.
Measures of Dispersion
- Definition
- Indicates variation within data.
- Range
- Distance between highest and lowest scores.
- Standard Deviation (SD)
- Measurement of deviation from the mean SD = ext{sqrt( Variance )} .
- Variance
- Average deviation of scores from the mean.
Distribution
- Describes data dispersion; shown through graphs.
- Normal/Gaussian Distribution
- Bell curve where most scores occupy the center.
Empirical Rule
- Suggests that almost all data within a normal distribution falls within three standard deviations of the mean.
Skewness
- Negative Skew: Left skewed, with lower scores predominating.
- Positive Skew: Right skewed, with higher scores predominating.
Types of Graphs
- Chart Types:
- Polygon, Pie chart, Bar graph, Scattergram, Histogram, Line graph.
Correlation
Studies relationship between variables, quantifiable as a correlation coefficient r , ranging from -1 to +1.
Positive Correlation: Both variables increase together.
Negative Correlation: One variable increases while the other decreases.
Correlation Coefficient Interpretation:
- 0.00 - 0.25: Little association
- 0.26 - 0.49: Weak
- 0.50 - 0.69: Moderate
- 0.70 - 0.89: High
- 0.90 - 1.00: Very High
Statistical Decision Making
- Null Hypothesis: No association between variables (accepted unless evidence suggests rejection).
- Positive/Research Hypothesis: Suggests a relationship or effect.
Accepting or Rejecting the Hypothesis
- Based on alpha (p-value):
- p <= 0.05: Statistically significant.
- p > 0.05: Not significant.
Inferential Statistics
- Generalizations about a population from sample data.
- Parametric Data: T-tests, ANOVA for means comparison.
- Nonparametric Data: Chi-square test for frequency counts.
Data Research and Results
- Validity: Internal vs. external validity.
- Reliability: Consistency of study results.
- Significance: Statistical vs. practical significance.