Probability and Statistics Midterm Vocab

0.0(0)
studied byStudied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/66

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 1:12 AM on 2/8/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

67 Terms

1
New cards

Anecdotal evidence

one or a few specific cases/stories

2
New cards

Non-response bias

only a small fraction of the randomly sampled people choose to respond, the survey may no longer be representative of the population

3
New cards

Voluntary response

sample consists of people who volunteer to respond because they have strong opinions on the subject

4
New cards

Convenience sample

individuals who are easily accessible are more likely to be included in the sample

5
New cards

Perspective study

identifies individuals and collects information as events unfold

6
New cards

Retrospective studies

collect data after events have taken place

7
New cards

Simple sampling

randomly select cases from population, no implied connection between points selected

8
New cards

Stratified sample

strata are made up of similar observations. we take a simple random sample from each strata

9
New cards

Cluster sample

clusters are not made up of homogeneous observations, we take a simple random sample of clusters and then sample all observations in that cluster

10
New cards

Multistage sample

clusters are usually not made up of homogeneous observations. we take a random sample of clusters and then take a random sample of observations from sampled clusters

11
New cards

Principles of experimental design

control, randomize, replicate, block

12
New cards

Block

if there are variables that are known or suspected to affect the response variable. first group subjects into blocks based on these variables, and then randomize cases within each block to treatment groups

13
New cards

Double-blind

both subject and researchers don’t know what group subjects are in

14
New cards

P-value

probability value. The probability that we would get results this extreme assuming the null hypothesis is true

15
New cards

One-sided hypothesis

focuses on one specific value

16
New cards

p (hat)

sample proportion

17
New cards

Null hypothesis

“There is nothing going on.” The variables are independent, observed differences are due to chance

18
New cards

Alternative hypothesis

“There is something going on.” Variables are dependent, not due to chance.

19
New cards

Either reject or ____ the null hypothesis

fail to reject

20
New cards

Convincing evidence

allows us to reject the null hypothesis

21
New cards

Type 1 error

rejecting the null hypothesis when its true

22
New cards

Type 2 error

don’t reject the null hypothesis when the alternative hypothesis is true

23
New cards

two-sided

looking for extreme results on either side of the distribution

24
New cards

Confirmation bias

we only look for data that supports our own idea

25
New cards

Random process

is a situation in which we know what outcomes could happen, but we don’t know what particular outcome we will get

26
New cards

Frequentist interpretation

the probability of an outcome is the proportion of times the outcome would occur if we observed the random process an infinite number of times

27
New cards

Bayesian interpretation

interprets probability as a subjective degree of belief. for the same events, two separate people could have different viewpoints

28
New cards

Law of large numbers

states that as more observations are collected, the proportion of occurrences with a particular outcome, p(hat)n, converges to the probability of that outcome, p (population proportion)

29
New cards

Gambler’s fallacy

random processes are supposed to compensate for whatever happened in the past (ex. 10 heads, next one should be tails)

30
New cards

Disjoint (mutually exclusive) outcomes

cannot happen at the same time (can’t get heads and tails on a single coin toss)

31
New cards

Non-disjoint outcomes

can happen at the same time (can’t get heads and tails on a single coin toss)

32
New cards

General addition rule

P(A or B) = P(A) + P(B) - P(A and B) for disjoint events: P(A or B) = P(A) + P(B)

33
New cards

Probability distribution

lists all possible events and the probabilities with which they occur

34
New cards

Rules for probability distributions:

  1. the events listed must be disjoint

  2. each probability must be between 0 and 1

  3. the probabilities must total 1

35
New cards

Sample space

all the possible outcomes of a trial

36
New cards

Complementary events

two mutually exclusive events whose probabilities add up to 1

37
New cards

Independent

knowing the outcome of one provides no useful info about the outcome of the other

38
New cards

Associated or dependent

The outcome of one affects the outcome of another

39
New cards

Product rule for independent events

if A and B are independent events, then P(A and B) = P(A) x P(B)

40
New cards

Complementary events

are opposite, but always add up to 1 (like heads and tails)

41
New cards

Conditional probability

P(A|B) = P(A and B)/P(B) (A given B is true)

42
New cards

General multiplication rule

If A and B represent two outcomes or events, then P(A and B) = P(A|B) x P(B)

43
New cards

Rule of thumb

if we are sampling without replacement, then the rule of thumb is if we sample less than 10% of the population, the effect on conditional probability is small, so we assume independence

44
New cards

The probability of 1 specific value amongst infinite values is ______

effectively zero

45
New cards

Normal distribution

Unimodal and symmetric, bell shaped curve

46
New cards

Measures of spread

variance, standard deviation, and range

47
New cards

z score

how many SD we are from the mean (only for normal!)

48
New cards

percentile

percentage of observations that fall below a given data point

49
New cards

Unusual

observations that are more than 2 SDs away from the mean

50
New cards

Statistic

any number that’s used to represent something

51
New cards

One version of scientific methodology:

observe data, form hypothesis, gather more data, test/evaluate/adjust hypothesis, gather more data, etc.

52
New cards

explanatory variable

variable that is manipulated or changed

53
New cards

Response variable

outcome (yes or no)

54
New cards

Population

all those who could be tested

55
New cards

Sample

subgroup or subset of the population

56
New cards

Numerical variable

continuous (number line) or discrete (gaps between numbers)

57
New cards

Categorical variable

ex. (T-shirt size) nominal or ordinal (natural order)

58
New cards

68-95-99.7 Rule

for nearly normally distributed data:

  • about 68% falls within 1 SD of mean

  • about 95% falls within 2 SD of the mean

  • about 99.7% falls within 3 SD of the mean

59
New cards

Population proportion

percentage of a population that meets some specific criterion

60
New cards

Sample proportion

how many in the sample meet that criterion divided by the total amount in the sample (p hat)

61
New cards

Plurality

not the majority, but the highest number

62
New cards

Population parameter of interest

the # we want to know

63
New cards

Sample statistic

usually a point estimate, the # we get from a sample

64
New cards

Central Limit Theorem (proportion version)

sample proportions will be nearly normally distributed with mean equal to the population proportion, p, and standard error equal to ______

65
New cards

CLT conditions

  • Independence - n<10% of population

  • Sample size - there should be at least 10 expected successes and 10 expected failures

66
New cards

Confidence interval

a plausible range of values for the population parameter

67
New cards

significance level (alpha)

probability of a Type 1 error, usually 0.05