Stats Vocab

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/56

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 8:18 PM on 5/31/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

57 Terms

1
New cards

pth percentile

value with p% of the observations less than it

2
New cards

5 number summary

minimum, Q1, median, Q3, maximum

3
New cards

outlier

any data value >= Q3 + 1.5IQR, or

4
New cards

interquartile range (IQR)

Q3 - Q1

5
New cards

standardized value / z-score

given an observation x from a distribution with mean m and standard deviation s,

6
New cards

z = (z - m)/s

7
New cards

explanatory variable

independent variable; influences changes in the response variable

8
New cards

response variable

dependent variable; depends on the explanatory variable

9
New cards

parameter

a numerical characteristic of a whole population

10
New cards

statistic

a numerical measurement of a sample (rather than the population)

11
New cards

correlation coefficient (r)

measures the direction and strength of the linear association between two quantitative variables (-1 <= r <= 1)

12
New cards

regression line

straight line that describes how a response variable changes as the explanatory variable changes

13
New cards

square of the correlation (r^2)

the fraction of the variation in the response variable that is explained by the least-squares regression

14
New cards

residual

difference between an observed value of the response variable and the predicted value by the regression line:

15
New cards

residual = observed y - predicted y

16
New cards

lurking variable

a variable that is not among the explanatory or response variables but may influence the relationships among the variables

17
New cards

variance

the square of the standard deviation

18
New cards

Simpson's Paradox

the reversal of an apparent association when some variables are (dis)aggregated/combined

19
New cards

confounding

variables whose effects on a response variable cannot be distinguished from each other

20
New cards

anecdotal data

represent individual cases and are not representative of any larger group of cases

21
New cards

available data

data that are easily accessible/available

22
New cards

sample survey

a study that collects data from a sample representing the larger population

23
New cards

census

a study that collects data from all cases in the population of interest

24
New cards

observational study

a study that observes individuals without influencing the responses

25
New cards

experiment

a study that imposes treatments on experimental units to observe their responses

26
New cards

experimental units / test subjects

the individuals on which the experiment is done

27
New cards

treatment

a change in the explanatory variable

28
New cards

outcome

measured variables that are used to compare to the treatments

29
New cards

elements of experimental design

control, randomization, repetition

30
New cards

block design

forming blocks of experimental units that are similar in some way (ex: gender), then randomizing within each block

31
New cards

matched pair design

an special case of block design, where the blocks consist of two experimental units that share as many attributes as possible (ex: twins)

32
New cards

double-blind study

a study in which neither the experimenters nor the subjects know which treatment any subject has received

33
New cards

simple random sample (SRS)

sampling method where all groups of size n from the population have an equal probability of being chosen

34
New cards

stratified random sample

sampling method that separates the population into different strata based on some attribute (ex: socioeconomic level), chooses an SRS from each strata, and combines these samples according to the makeup of the population

35
New cards

multistage random sample

sampling method that selects successively smaller groups within the population in stages (ex: states -> cities -> districts -> schools -> students)

36
New cards

sample proportion (p̂)

proportion of 'successes' in a sample; a statistic

37
New cards

population proportion (p)

proportion of 'successes' in a population; a parameter that we will never know

38
New cards

bias

center of the sampling distribution is not equal to the true value of the parameter

39
New cards

variability

the spread of the sampling distribution

40
New cards

institutional review board

a board that reviews all planned studies in advance to protect subjects from possible harm

41
New cards

informed consent

giving potential participants enough information about a study to enable them to choose whether they wish to participate

42
New cards

confidentiality

only the researchers can identify responses of individual subjects

43
New cards

anonymity

subjects are anonymous, so their names are not known even to the director of the study

44
New cards

random

individual outcomes of a phenomenon are uncertain, but these outcomes occur a predictable percentage of the time in large numbers of independent trials

45
New cards

probability

the proportion of times the event occurs in many repeated trials of a random phenomenon

46
New cards

independent

the outcome of one trial does not influence the outcome of any other trial

47
New cards

sample space

the set of all possible outcomes of the random phenomenon

48
New cards

event

a set of outcomes

49
New cards

complement of an event A

exactly the outcomes that are not in A

50
New cards

binomial experiment

an experiment with a fixed number of trials for which each trial results in either a success or failure

51
New cards

Central Limit Theorem

a simple random sample of size n from a population with mean μ and standard deviation σ is drawn. when n is large, the sampling distribution of the sample mean x̄ is approximately N(μ, σ/sqrt(n))

52
New cards

standard error

the statistic s / sqrt(n), where s is the sample standard deviation and n is the size of the population

53
New cards

level C confidence interval

an interval centered around the sample mean that is computed from a sample. any level C confidence interval will have probability C of containing the population proportion

54
New cards

Type I error

error occurring when we reject the null hypothesis when in fact it is true

55
New cards

Type II error

error occurring when we fail to reject the null hypothesis when it is in fact false

56
New cards

power of a statistical test

1 - probability of a Type II error (given that the null hypothesis is false)

57
New cards