research

M.Sc. Research Methods Statistics

Overview of Research Methods

1. Introduction to Research Methods

Focus on the differences between qualitative and quantitative research.
Importance of qualitative methods in health informatics.

2. Key Qualitative Methods

a. Content Analysis

Role: Systematic examination of communication artifacts.
Technique: Involves coding the data and identifying themes within the content.

b. Grounded Theory

Role: Aims to generate a theory grounded in empirical data.
Technique:
- Open Sampling: Selection of participants randomly.
- Rich Qualitative Data: Data is comprehensive and informative.
- Open Coding: Identifying themes and concepts in data; initial classification without preconceptions.
- Stop and Memo: Continuous reflection and note-taking throughout the data analysis.
- Constant Comparison: Analyzing data from interviews, observations, literature reviews, ensuring the theory evolves from data.
- Axial and Selective Coding: Subsequently refine codes to develop and deepen understanding.
- Theoretical Saturation: Continues data collection until no new insights are gained.
- Glasser and Strauss—Parting of Ways: Diverging views on grounded theory practice.

Research Designs in Health Informatics

3. Empirical Design Types

Cross-sectional Design: Observations at a single point in time.
Longitudinal Design: Observations over an extended period to track changes.
Case Study: In-depth analysis of a single entity or event.
Comparative Design: Examining differences across groups or conditions.
Experiments: Controlled tests to assess causal relationships.

4. The Positivist Paradigm

Typically quantitative and hypothesis-driven.

a. Key Concepts

Hypothesis: A states that A causes B.
Dependent Variable: The outcome impacted by the independent variable.
Independent Variable: The variable manipulated to observe its effect.

b. Control Mechanisms

Objective: Ensure only the independent variable influences the dependent variable.
Techniques include:
- Eliminating confounding factors.
- Holding factors constant.
- Random selection of subjects.
- Using control groups to illustrate design.
- Blind research to minimize bias.

Importance of Statistics in Research

5. Statistical Foundations

a. Variability and Probability

Variability: Analyzes differences in data representation like means or histograms.
Statistical Probability: Provides measures of how likely results are due to chance, with lesser probability yielding higher confidence in results.
- Levels of Significance:
  - P = 0.01 (1% chance).
  - P = 0.05 (5% chance).
  - Acceptable levels vary by context.

6. Practical Application of Statistics

a. Data Analysis Example

Condition 1 Scores: [10, 5, 6, 3, 4, 4, 4, 2, 5, 5].
Condition 2 Scores: [6, 1, 3, 4, 8, 7, 5, 6, 7, 5].
Totals:
- Condition 1 Total: 64
- Condition 2 Total: 40
- Mean for Condition 1: 6.4
- Mean for Condition 2: 3.6

Experimental Statistics

7. Overview of Statistics Types

a. Distinction Between Statistic Types

Inferential Statistics: Used to infer properties about a population from a sample.
Descriptive Statistics: A tool for summarizing and describing data.

Parametric vs Non-parametric Techniques

8. Definitions

a. Parametric Techniques

Applicable under specific conditions:
- Data on interval scales.
- Normal distribution of data.
- Homogeneity of variance (similar variability among groups).

b. Non-parametric Techniques

Used when one or more of the parametric assumptions do not hold.

Selecting Statistical Tests

9. Test Selection Criteria

a. Type of Test

Two-tailed vs One-tailed tests.

b. Considerations in Experimental Design

Types of subjects (between/within).
Aim of the experiment (A causes B).
Number of conditions being tested.

10. Decision Trees for Statistical Tests

a. Non-parametric Decision Tree

Choose based on:
- Type of analysis required (i.e., Difference between conditions).
- Specific techniques for dependent variables.

Non-parametric Tests

11. Specific Techniques

a. Mann-Whitney Test

When: Used for two independent conditions.
Analysis Method: Ranks scores and examines total for calculations.
Formula:
- $U = n1n2 + (n1(n1 + 1)/2) - T1$, where $T1$ is the sum of ranks for one group.

12. Wilcoxon Test

When: Within subject analysis for two dependent conditions.
Mechanism includes:
- Compare ranks to analyze differences (Negative ranks, Positive ranks).
- Summation of negative/positive ranks for test statistic.

13. Friedman Test

When: Within subjects for more than two conditions.
Mechanism: Rank within rows; compare sums to check for non-random differences.

14. Kruskal-Wallis Test

When: Between subjects for more than two conditions.
Mechanism: Rank totalizes and checks for variability in ranks across conditions.

Correlation Analysis

15. Spearman Correlation Coefficient

Examines relationships between variables without asserting causation:
- Formula:
  $R = 1 - \frac{6 \sum{d^2}}{n(n^2 - 1)}$
- Important to rank the data for comparison.

16. Chi-square Test

Application: Used to assess categorical data.
Mechanism: Compares observed with expected frequencies.
- Formula:
  $\chi^2 = \sum \frac{(O - E)^2}{E}$
  where $O$ is observed frequency and $E$ is expected frequency.
Requires a minimum of 5 expected observations per category for reliability.

17. Outro: Fundamentals of Statistics

Goals include providing an introduction to statistics, applying decision trees for test selection, and focusing on non-parametric, inferential statistics.