Multivariate W1-3

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/51

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 7:26 PM on 4/20/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

52 Terms

1
New cards

Measure the strength of patterns in our data

One of two main goals of multivariate analysis.

2
New cards

Determine if this pattern is strong enough to be believed

The second of two main goals of multivariate analysis.

3
New cards

Data

Information collected to gain knowledge about a field or to answer a question of interest.

4
New cards

Design (in statistics)

Choosing subjects.

5
New cards

Description (in statistics)

Summarizing data.

6
New cards

Inference (in statistics)

Making predictions about a population based on a sample.

7
New cards

Population (statistical definition)

Total set of subjects of interest.

8
New cards

Sample (statistical definition)

Subset of the population on which a study collects its data.

9
New cards

Parameter

Numerical summary of a population. Example: percentage of all adult Americans.

10
New cards

Statistic

Numerical summary of a sample. Example: percentage of adult Americans in a specific location.

11
New cards

Nominal level of measurement

Data that consist of names, labels, or categories only; qualitative and cannot be ranked or ordered.

12
New cards

Ordinal level of measurement

Qualitative data that can be arranged in some order (low to high); no quantitatively fixed space between items.

13
New cards

Interval level of measurement

Quantitative data where intervals are meaningful but ratios are not; has an arbitrary zero point.

14
New cards

Mean

Average value; sum of all values divided by total number of values.

15
New cards

Advantage of mean

Easy to interpret; foundational statistic usable in many procedures; sample mean is best estimate of population mean.

16
New cards

Disadvantage of mean

Requires interval or ratio level data; can be highly influenced by outliers, especially in small sample sizes.

17
New cards

Median

Exact middle value in a sorted data set.

18
New cards

Data requirement for median

Requires ordinal data.

19
New cards

Advantage of median

True model of central tendency because it is always in the middle; not strongly influenced by outliers.

20
New cards

Mode

The most common value in a data set.

21
New cards

Standard deviation

A measure of variability for interval variables equal to the square root of the variance.

22
New cards

Best summary for nominal data

Mode.

23
New cards

Best summary for ordinal data

Mode, Median, and Range.

24
New cards

Best summary for interval-ratio data

Mean, Median, Mode, Range, and Standard Deviation.

25
New cards

Normal distribution characteristics

Symmetrical/even distribution.

26
New cards

Central limit theorem

Things that start as not normal (bi-modal) can become normal with enough added data; sample looks like population as you collect more data.

27
New cards

Z-score

A way to identify one's position and location in a distribution of data; changes a raw score to a standardized or normalized score.

28
New cards

Z-score formula

The individuals data (Xi) - the sample mean (x̅), divided by the samples data’s standard deviation (sx)

29
New cards

Purpose of z-scores

Crucial to make comparisons across variables with different units (income, education years, number of children).

30
New cards

Inferential statistics definition

We rarely observe population parameters, only sample statistics; to draw inferences, samples must be representative.

31
New cards

Sampling error

Almost every sample will miss the true population parameter by a little.

32
New cards

Sampling distribution

Theoretical distribution of a statistic for all possible samples of a given size (N); does not exist empirically.

33
New cards

Sampling with replacement

Choosing a subject randomly as first member, then putting them back before choosing the next member.

34
New cards

Mean of sampling distribution

Equal to the true population mean (

35
New cards

Standard error

Standard deviation of the sampling distribution; equals population standard deviation divided by square root of N (

36
New cards

Population distribution

A real distribution representing characteristics of all members of the population of interest.

37
New cards

Sampling distribution (summary)

A theoretical probability distribution representing results of all possible samples drawn from the population.

38
New cards

Sample distribution

A real distribution describing characteristics of a sample.

39
New cards

Central Limit Theorem summary

Most random samples with relatively large N are close to true population mean; larger N means smaller standard error and closer clustering to true parameter.

40
New cards

Point estimate

Sample statistic used to estimate the exact value of a population parameter (mean, proportion, etc.).

41
New cards

Confidence interval

Range built around a sample statistic within which the population parameter is likely to fall.

42
New cards

Confidence interval width principle

Confidence in a range goes up the wider it is because it can account for more possible values.

43
New cards

Default confidence interval used in class

95 percent confidence interval.

44
New cards

When to reject null hypothesis

p<0.05, it is unlikely the observed difference is by random chance.

45
New cards

Two sample t-test

Compare differences between two sample statistics from two mutually exclusive, independently and randomly selected groups.

46
New cards

Example of two sample t-test

Gender wage gap between men and women.

47
New cards

ANOVA (Analysis of Variance)

Compares differences across more than two groups; essentially a 3+ group t-test.

48
New cards

ANOVA logic

Uses a sampling distribution of variation in means; decomposes variance to compare between-group differences to within-group differences.

49
New cards

ANOVA example age and capital punishment

Different age groups should have different support levels; a particular age group should have similar support levels.

50
New cards

Three steps of quantitative research

  1. Have research idea and hypothesize questions; 2. Find appropriate data/variables; 3. Choose statistical analysis and interpret findings.

51
New cards

Three bivariate associations

X1→Y (direct cause); X1→X2 (cause to mediator); X2→Y (mediator to outcome).

52
New cards

Mediating variable example

Does racism cause educational barriers which then causes Y.