Choosing the Suitable Statistical Test

This lecture covers choosing the appropriate statistical test for data analysis.
Non-experimental (Observational) studies are contrasted with experimental studies.

After descriptive statistics, the process moves to analytical statistics.
Key analytical skills involve selecting the correct statistical test.
The goal is to answer the research question and decide whether to reject or fail to reject the null hypothesis.

Research starts with an idea, leading to a research question.
A null hypothesis and an alternative hypothesis are formulated.
Sample data is analyzed to obtain a p-value.
If the p-value is less than a predefined alpha (α), the null hypothesis is rejected; otherwise, we fail to reject the null hypothesis.
The proper statistical test is crucial in this process.

Question 1: Is it a bivariate or multivariable analysis?
- Bivariate analysis: studies the relationship between two variables.
- Examples:
  - Age and height.
  - Type of treatment and complication.
  - Sex and smoking.
  - Smoking and coffee consumption.
- Multivariable analysis (regression modeling/analysis):
- Studies the effect of multiple variables on an outcome variable.
- Examples:
  - Effect of smoking, sex, coffee consumption on blood pressure.
  - Effect of smoking, sex, coffee consumption on having a heart attack.
- Note: Regression can be used for bivariate analysis if examining the effect of only one variable on the outcome.

Question 2: Are we studying a difference or a correlation (if bivariate)?
- Difference: studying the difference between two or more groups or conditions.
- Example:
  - The difference between males and females regarding coffee consumption.
  - The difference in body weight before and after being on a specific diet.
- Correlation: studying the association between two variables.
- Examples:
  - The association between age and weight.
  - The association between coffee consumption and the number of sleeping hours.

Question 3: Are we working with independent or paired data (if bivariate)?
- Independent (unpaired) data: observations in each sample are unrelated.
- No relationship between subjects in each sample.
- Subjects in one group cannot be in the other group.
- No subject/group can influence the other.
- Dependent (Paired) data: paired samples include:
- Pre-test/post-test samples (a variable measured before and after an intervention).
- Cross-over trials.
- Matched samples.
- When a variable is measured twice or more on the same individual.

Question 4: Identify the types of data variables being studied.
- The type of data variable is crucial for choosing the suitable test.
- Types of Data:
- Categorical: No unit.
  - Nominal: No order (e.g., colors, types of treatment).
  - Ordinal: Ordered (e.g., pain scale, satisfaction levels).
- Numerical: Unit.
  - Discrete: Counted/integer (e.g., number of children).
  - Continuous: Measured/decimals (e.g., height, weight).
    - Time to event data (survival)
- Normality of Distribution:
- Determine if a numeric variable is normally distributed before certain statistical tests.
- A histogram can visually represent the distribution.

Question 5: Are we comparing two groups/conditions or more than two?
- Examples:
- Comparing two groups: diseased vs. not diseased.
- Comparing three groups: normal, osteopenia, osteoporosis.
- Comparing two conditions: pre-test vs. post-test.
- Comparing three conditions: before, during, after the operation.