RHMI Exam

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/90

Earn XP

Description and Tags

These flashcards cover the basics of research methodology and statistical theory, including descriptive and inferential statistics, NHST, ANOVA, and linear regression.

Last updated 3:02 AM on 6/23/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

91 Terms

New cards

Mean

The average calculated by adding all observed values and dividing by the number of values, denoted by the formula: $\bar{X} = \frac{1}{N} \sum_{i=1}^{N} X_{i}$

New cards

Median

The 'middle observation' where half of the observations are larger and half are smaller than this value.

New cards

Mode

The value or values observed most frequently in the data.

New cards

Range

The distance between the maximum and minimum values in a dataset; it is very sensitive to outliers.

New cards

Standard Deviation

A measure of spread describing how far each point is from the centre of mass, denoted by the formula: $S = \sqrt{\frac{1}{N} \sum_{i=1}^{N} (X_{i} - \bar{X})^2}$

New cards

Operationalisation

The process of relating unobservable theoretical constructs to concrete measures (e.g., using the Beck Depression Inventory to measure depression).

New cards

Nominal Scale

A measurement scale where values have no particular relationship or meaningful numbering scheme (e.g., eye colour).

New cards

Ordinal Scale

A measurement scale with a natural ordering, but where the differences between values are not meaningful (e.g. ranking data).

New cards

Interval Scale

A measurement scale with a natural ordering where differences between numbers are meaningful, but ratios are not (e.g., year).

New cards

Ratio Scale

A measurement scale with a natural ordering where 'zero means zero' and both differences and ratios between numbers are meaningful (e.g., age).

New cards

Discrete variable

Values that come in specific categories, with no values existing in between (e.g. year, party voted for).

New cards

Continuous variable

A value that varies smoothly; there’s always something ‘in between’ (e.g. response time).

New cards

Predictor

A variable used to explain other variables, also known as an independent variable or treatment.

New cards

Outcome

A variable to be explained in terms of other variables, also known as a dependent variable or response.

New cards

Test-retest Reliability

A measure of consistency obtained by conducting the same measurement at two different times to see if results match.

New cards

Inter-rater reliability

A measure of consistency obtained by different people conducting the same measurements to see if results match.

New cards

Internal consistency reliability

A measure of consistency obtained by conducting the same measurements with theoretically equivalent versions of a measure to see if results match (e.g. what’s your favourite colour v. what colour would you choose).

New cards

Frequentist Probability

The degree of belief that probability is objective and represents the long-run frequency of repeatable events.

New cards

Bayesian Probability

A subjective view of probability represented as a 'degree of belief' held by an idealised, rational agent.

New cards

Binomial Distribution

A distribution used to describe count data of one of two possible events happening.

New cards

dbimon()/dnorm

Probability density of a specific outcome (doesn’t work for normal distributions).

New cards

pbinom()/pnorm()

Chance that the outcome doesn’t exceed a threshold

New cards

qbinom()/qnorm()

Compute some quantile of the distribution.

New cards

rbinom()/rnorm()

Sample a random number from a distribution.

New cards

Normal Distribution

A continuous distribution described by the mean ( $\mu$ ) and standard deviation ( $\sigma$ ), where mean, median, and mode are identical.

New cards

Central Limit Theorem

The theory stating that as sample size increases, the sampling distribution of the mean becomes normal and converges on the true population mean ( $\mu$ ).

New cards

Standard Error of the Mean (SEM)

A measure reflecting the uncertainty about the mean, calculated as: $SEM = \frac{\sigma}{\sqrt{N}}$ (as sample size increases, the variance goes down)

New cards

Confidence Interval (CI)

The range bounded by $\pm 1.96$ SEMs that is 95% likely to cover the true population mean: $CI_{95} = \bar{X} \pm 1.96 \frac{\hat{\sigma}}{\sqrt{N}}$

New cards

Null Hypothesis ( $H_0$ )

The hypothesis being tested which states there is no effect; all NHST statistical claims are specifically about this hypothesis.

New cards

Fisher (NHST)

States that hypothesis testing is about trying to falsify a single hypothesis (H) and that Type I error reflects the probability of observing a test statistic at least as extreme as the one that was actually found.

New cards

Neyman (NHST)

States that hypothesis testing is about choosing between two rival hypotheses (H_Aor H_B) and that Type I error describes a rate you must be willing to tolerate if you want to reject the null.

New cards

Type I Error

A false positive; rejecting the null hypothesis when it is actually true, typically controlled at $\alpha = 0.05$ .

New cards

Type II Error

A false negative; accepting the null hypothesis when it is actually false, dependent on sample size, effect size and ( $\alpha$ ).

New cards

Type I and Type II Error Trade-off

Lower ( $\alpha$ ) means higher ( $\beta$ )
Increasing sample size, all else equal, increases power (1- $\beta$ ) and decreases the Type II error rate

New cards

Type II Error ( $\beta$ )

A false negative; accepting the null hypothesis when it is actually false.

New cards

Power ( $1-\beta$ )

The probability of correctly rejecting a false null hypothesis, which increases with larger sample sizes.

New cards

Chi-squared Statistic

$\chi^2$ : calculated by summing the difference between observed and expected values of categorical data (the larger the value, the worser the fit to the data).

New cards

Goodness of Fit Test

A chi-squared test that compares the observed frequencies of one variable against a hypothesis about the true probabilities of that variable, calculated as: $\chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}$ , where $O_i$ are the observed frequencies and $E_i$ are the expected frequencies.

New cards

Test of Independence

A chi-squared test that tests whether two nominal-scale variables are related to each other, calculated as: $\chi^2=\sum\sum\frac{(O_{ij}-E_{ij})^2}{E_{ij}}$ , where $O_{ij}$ are the observed frequencies and $E_{ij}$ are the expected frequencies.

New cards

Critical Region for Chi-squared

Calculated by finding the 95% quantile of the distribution w/ the respective degrees of freedom

( qchisq(.95, df = …) )

New cards

Chi-squared Standard Residuals

Indicate how many 'standard deviations' away each cell is from the expected frequency, with values beyond ±1.96 suggesting significance.

New cards

Cramer’s V

A measure of effect size for chi-squared tests calculated as: $V = \sqrt{\frac{\chi^2}{N(k-1)}}$

New cards

Cramer’s V (0 to 0.1)

Negligible association

New cards

Cramer’s V (0.1 to 0.3)

Weak association

New cards

Cramer’s V (0.3 to 0.5)

Moderate association

New cards

Cramer’s V (0.5 to 1)

High association

New cards

Chi-squared Assumptions

Large Expected Frequencies: The sampling distribution is valid only if the expected frequencies in each category are sufficiently large (typically at least 5), as it breaks down for too few observations.
Independence of Data: The observations must be independent; there should be no special relationship among them, ensuring that the sampling methods do not introduce bias.

New cards

Large Expected Frequencies Violated

Use Fisher Exact Test: works by calculating the exact probability of obtaining a particular contingency table, but assumes rows and columns are fixed

New cards

Independence of Data Violated

Use McNemar Test: when have multiple observations for each person, e.g. pre-test and post-test

New cards

Z-score

A standardized score with a mean of 0 and a standard deviation of 1: $Z = \frac{X - \mu}{\sigma}$ (conceptually equivalent to chi-squared adjusted residuals)

New cards

T-test statistic

Calculated under the premise that the population distribution is normally distributed. It is determined by averaging several potential values for the population standard deviation, represented as: $t=\frac{\bar{X}-\mu}{\frac{\sigma}{\sqrt{N}}}$ , which approaches a normal distribution as the sample size increases.

New cards

T-statistic

$t$ : symmetric about zero, in which deviations demonstrate support against the null hypothesis

New cards

Cohen’s d

A simple measure of effect size for t-tests: $d = \frac{\text{mean 1} - \text{mean 2}}{\text{std dev}}$

New cards

Cohen’s d (0.2)

Small effect size

New cards

Cohen’s d (0.5)

Medium effect size

New cards

Cohen’s d (0.8)

Large effect size

New cards

One Sample T-test

A statistical test used to determine if the mean of a single sample differs significantly from a known population mean.

New cards

Independent Sample T-test

A statistical test used to compare the means of two independent samples to determine if they differ significantly from each other.

New cards

Paired T-test

A statistical test used to compare the means of two related groups to determine if they differ significantly from each other.

New cards

T-test Assumptions

Population distributions are normal
Observations are independently sampled
Homogeneity of Variance (groups have the same standard deviation)

New cards

T-test Normality Violated

Use QQ-plots to observe the quantiles of data, as compared against the theoretical quantiles of the normal distribution. If not identical (a nice straight line), either use the Shapiro-Wilk Test or Wilcoxon.

New cards

Shapiro-Wilk Test

A statistical test used to determine whether a sample comes from a normally distributed population. It assesses the normality of data by comparing the observed distribution to an expected normal distribution. Values less than 1 and a significant p-value imply deviations from normality.

New cards

Wilcoxon

A non-parametric statistical test used to evaluate whether there is a significant difference between the distributions of two related samples or matched observations. It is used when the assumptions of the t-test are violated. However, can lead to higher Type II error.

New cards

Wilcoxon (0.1 to 0.3)

Small effect size

New cards

Wilcoxon (0.3 to 0.5)

Medium effect size

New cards

Wilcoxon (>0.5)

Large effect size

New cards

QQ-plot

A scatterplot of actual quantiles of the observed data against theoretical quantiles of the normal distribution to assess the normality of data. If the points deviate significantly from the diagonal line, the normality assumption is considered violated.

New cards

Student T-test

A statistical test that assumes that both groups have equal variance.

New cards

Welch T-test

An adaptation of the t-test used when the assumption of equal variance between groups is violated.

New cards

One-way ANOVA

A statistical test used to determine if the population means for multiple groups are identical by comparing variability between groups ( $SS_b$ ) and within groups ( $SS_w$ ). If the between groups variability is significantly greater than within groups variability, it suggests that at least one group mean is different.

New cards

Between Groups Variability ( $SS_b$ )

The variability in scores that is attributed to the differences among group means in an ANOVA.

New cards

Within Groups Variability ( $SS_w$ )

The variability in scores that is attributed to differences within individual scores in the same group in an ANOVA.

New cards

F-statistic

The ratio of mean square between groups to mean square within groups: $F = \frac{MS_b}{MS_w}$ (means are more different when this value is larger and small when the null is true)

New cards

Two-way ANOVA

An extension of ANOVA that evaluates the effect of two independent variables on a dependent variable, allowing for interaction effects between the variables. Results are different from running two separate one-way ANOVAs as residuals are different ( $SS_{R}$ ).

New cards

Residual Sum of Squares ( $SS_{R}$ )

The total variation in the dependent variable that is not explained by the independent variables in an ANOVA analysis. It reflects the variability within the groups after accounting for the effects of the independent variables.

New cards

Interaction Sum of Squares ( $SS_{A:B}$ )

The portion of total variation in an ANOVA that is attributed to the interaction between two independent variables. It assesses how the interaction influences the dependent variable beyond the individual effects of the variables.

New cards

Eta Squared ( $\eta^2$ )

A measure of effect size in ANOVA representing the proportion of total variance attributable to a factor, calculated by dividing ( $SS_{B}$ ) by ( $SS_{tot}$ ).

New cards

Holm Correction

A method to control the family-wise Type I error rate by sorting p-values and adjusting them sequentially.

New cards

Bonferroni Correction

A method to control the family-wise Type I error rate by multiplying all original p-values by the number of tests (tends to lose a lot of power).

New cards

Post Hoc Tests

Statistical tests applied after ANOVA to determine which specific group means are significantly different from each other, and for which there are no particular hypotheses.

New cards

Linear Regression Model

A mathematical relationship expressed as: $Y_{i} = b_{1}X + b_{0} + \epsilon_{i}$ , where $b_1$ is the slope, $b_0$ is the intercept, and $\epsilon$ is the residual.

New cards

ANOVA Assumptions

Residuals are normally distributed (i.e. within-groups variance)
Homogeneity of variance across all groups
Independence

New cards

Residual Normality Violated (ANOVA)

Use Shapiro-Wilk Test on residuals

New cards

Akaike Information Criterion (AIC)

A measure for model selection that penalizes model complexity: $AIC = \frac{SS_{res}}{\sigma^2} + 2K$

New cards

Cook’s Distance

A metric quantifying the influence of a data point by combining its 'outlier-ness' and its leverage.

New cards

Variance Inflation Factor (VIF)

A measure used to quantify the extent of collinearity among predictors in a regression model.

New cards

Measures of central tendency

Mean, median, mode

New cards

Measures of spread

Range, interquartile range, standard deviation

New cards

Interquartile range

A measure of spread that describes the difference between the first and third quartiles in a dataset, indicating the range of the middle 50% of values

New cards

Theoretical constructs

Concepts or models used in statistical analysis to represent phenomena, often not directly observable (e.g. attitudes, beliefs, information processing speeds)

New cards

Measure

Tool for getting people to produce data that is informative about the construct (e.g. survey items, reaction times)