Concurrent Validity: Equivalence Testing and Model Two Regression
Equivalence Testing
A statistical test used to assess agreement, becoming increasingly popular.
Commonly used in:
- Pharmacology: FDA approval of medical treatments and drugs.
- Nutrition: Assessing if a supplementation or dietary protocol is equivalent to a placebo effect.
- Exercise Science: Evaluating if wearables accurately estimate energy expenditure.
Example: Comparing maximal running velocity from a GPS unit to a radar gun.
Built around two one-sided t-tests.
If the 90% confidence interval of the mean difference falls within an a priori decided acceptable difference, the measures are considered equivalent.
Null Hypothesis:
- Traditional null hypothesis testing: No difference in population means.
- Equivalence testing: The means of measures are not equivalent (they are different).
Equivalence testing examines only the mean difference, similar to traditional null hypothesis testing.
Random error is important and can be assessed using limits of agreement.
Recommendation: Use limits of agreement and equivalence testing together.
- Formal statistical test for equivalence.
- Clinical judgment for agreement.
Model Two Regression
A different form of regression than ordinary least squares.
Used for assessing agreement.
Various forms exist, including:
- Ordinary least products (most popular).
- Weighted least products (a variant of ordinary least products).