Stats Vocab - Chapter 1

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/52

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

53 Terms

1
New cards

Quantitative

Quantity - Numbers

2
New cards

Qualitative

Qualities - Words / No mathematical value

3
New cards

Continuous data

Data you can measure, like height, weight, temperature, or time.

You can get decimal values (e.g., 5.7 cm, 98.6°F), and there are infinite possibilities between any two values

4
New cards

Discrete

Countable (whole numbers only)

Fx: 3 kids, 10 pencils, 25 students

5
New cards

Nominal

Aka: Categorical

Name-only / Used to classify or group things

FX: Eye color (blue, green, brown)

Type of fruit (apple, banana, orange)

Gender (male, female, nonbinary)

Zip code (just a label, not a value)

6
New cards

Ordinal or Rank

Shows which comes first, second, third, etc. - but doesn’t tell you how much better or worse one is from the other


Fx: Ranked choice voting

Fx2:

Survey answers: "Strongly agree," "Agree," "Neutral," "Disagree"

T-shirt sizes: Small, Medium, Large

Contest results: 1st place, 2nd place, 3rd place

Military ranks: Private, Corporal, Sergeant

7
New cards

A variable

Something that varies from individual to individual in your dataset — for example, each person has a different age or height

8
New cards

An individual

Any person, animal or thing described in a set of data

9
New cards

A variable is

Any attribute that can take different values for different individuals

10
New cards

Categorical

No math numbers

Can still be number but used as a group. Like were you born in the 80s or 90s. Or what year a movie came out

11
New cards

2-4, 10-19

Categorical
-Because you can’t do meaningful math with them

12
New cards

2, 10, 19, 30

Quantitative
-Because you can calculate averages and add and subtract them in formulas

13
New cards

Inference

To draw conclusions about a population based on simple data

14
New cards

Order or Rank Data

Data that has a clear order or ranking, but the differences between the values are not exact or evenly spaced

Ex: 1st place, 2nd place, 3rd place (we know the order, but not how much faster one was than the other)

"Very satisfied," "Satisfied," "Neutral," "Dissatisfied," "Very dissatisfied"

15
New cards

Interval data

Numerical data where the order matters and the differences between values are meaningful and equal, but there is no true zero

Fx: Temperature in Celsius or Fahrenheit

(20°C is hotter than 10°C, and the difference is 10 degrees—but 0°C doesn’t mean “no temperature”)

Dates on a calendar (e.g., the year 2000 vs. 2010)

What the heck is year 0
The guys who invented ferenhiet was 32 when he invented it

16
New cards

Ratio data

The order matters

The differences are meaningful

There is a true zero — which means none of the quantity is present

You can add, subtract, multiply, and divide

Fx: Weight (0 kg means no weight)

Height (0 cm means no height)

Money ($0 means no money)

Time (0 seconds = no time passed)

🧠 Key Idea:

You can say things like “twice as much” or “half as long.”

17
New cards

Nonresponse bias

People in the sample did not reply

18
New cards

Undercoverage bias

The sample itself misses key parts of the population

19
New cards

Multicenter study

Cluster study = Same

20
New cards

Observational study

When researchers watch and record what happens without interfering or changing anything. (Observe subjects in their natural setting)

21
New cards

Experimental study

When researchers actively change something (apply a treatment) and measure the results

Fx: Researchers assign groups (e.g., treatment vs. control), control variables. They can show cause and effect

22
New cards

Cross-sectional study

Snapshot study. Looks at data from a group of people at one single point in time — like a snapshot

23
New cards

Retrospective study

Looks backward in time — it uses past data to find patterns or connections

24
New cards

Prospective (aka Longitudinal) study

A __________ study looks forward in time — researchers start now and follow people into the future to see what happens.

"Start now, watch what happens later."

25
New cards
26
New cards

Confounding

Occurs when two variables are associated in such a way that their effects on a response variable cannot be distinguished from each other

A __________ variable is a hidden or third variable that:

Affects both the independent variable

27
New cards

Blocks

Subjects are grouped into blocks based on something they have in common (like age, gender, or location)

"Group first, then randomize — apples with apples, oranges with oranges."

28
New cards
29
New cards

Random sample

A way of picking people or items by chance, so the choice isn’t biased. - Several mini buckets. You go to 4 random classrooms and pick 2 students from each.

Still random, but not everyone in the school had the same chance. Some classes could have 30 students while others 10 students.

30
New cards

Simple random sample

A type of random sample where everyone has the same chance of being picked. - One big hat. You write all 100 names on slips of paper, put them in a hat, mix well, and draw 10 names.

31
New cards

Probability sample

Means any sampling method where each person has a known chance of being selected

Fx: This includes random sampling, stratified sampling, cluster sampling, etc.

32
New cards

Systematic sampling

Picking every kth person from a list after choosing a random starting point.

Fx: Like picking every 5th name on a list, starting at a random spot.

33
New cards

Convenience sampling

Consists of individuals from the population who are easy to reach

34
New cards

Multi-stage sampling

Uses more than one step or method to select the sample — like combining clusters and random sampling

35
New cards

Self-response or Volunteer sampling

Consists of people who choose to be in a sample by responding to a general invitation

36
New cards

Stratified sampling

Divide the population into groups (strata) that share something in common — and then randomly select people from each group.

Fx: A researcher wants to survey 100 college students.

The college is 70% undergrads and 30% grad students.

So they randomly select 70 undergrads and 30 grad students

37
New cards

Cluster sampling

When you split the population into groups — then randomly pick some whole clusters and survey everyone in them.

38
New cards

Measurement errors

Mistakes or differences between the true value and what you actually record or observe. “Wrong number by accident.”

Often more subjective, “ Do you not not like abortion?” Subjective and the question is guided, leading to bias.

39
New cards

Sampling errors

You survey 100 students and 60% like pizza. But if you asked all 1,000 students, maybe only 55% do. That 5% difference is the _____________.

The difference between the result from your sample and the real result from the whole population. “Close, but not exact — because it's just a sample.”

40
New cards

Logic errors

Mistakes in how you think about or set up your study — leading to wrong conclusions. “The method or reasoning is flawed.”

Fx: New Coke example

41
New cards

Precision errors

When your measurements are not consistent or repeatable — the values jump around even if nothing changes. “All over the place.”

You weigh yourself 3 times in a row and get:

150 lbs, 154 lbs, 148 lbs.

The scale has ________ error — it's not giving consistent results.

42
New cards

Reliability error

When a measurement tool doesn’t give consistent results every time you use it.

“It gives different answers even if nothing has changed.”

=Not repeatable

43
New cards

Validity

How well a test or method measures what it’s supposed to measure. “Are we measuring the right thing?”

You want to measure math skills, but your test mostly checks reading — it’s not valid.

Even if people get consistent scores (reliable), the test doesn’t measure the right thing — so it’s not valid.

44
New cards

Accuracy

How close your result or measurement is to the true value

45
New cards

Accuracy error

When your measurement is consistently off from the true value — even if it's always the same.

You hit the same spot every time, but it’s the wrong spot.

Example:

If a scale always says 120 lbs when the true weight is 130 lbs, it’s accurate error — it’s off by 10 lbs every time.

46
New cards

Validity error

When your measurement doesn’t actually measure what it’s supposed to measure.

You're measuring something — just not the right thing.

47
New cards

Uncertainty versus Variability

Variability = Natural differences in the world

Uncertainty = How sure we are about our conclusions

“Variability is in the data; uncertainty is in the knowledge.”

48
New cards

Measurements versus numbers

"Measurements have error; numbers have differences."

49
New cards

Measurements

Values taken using a tool or process to quantify something (like weight, height, temperature).

Often associated with: Uncertainty because tools and techniques aren’t perfect.

Example: A thermometer reads 98.4°F — but is it exactly that? Maybe not.

50
New cards

Numbers (Data Values)

The actual values or observations recorded (could come from measurement or counts).

Often associated with: Variability when you’re comparing different data points.

Example: Five people have incomes of $40K, $50K, $55K, $70K, and $90K — those differences show variability.

51
New cards

Stages statistical studies

PPDAC (Planned Parenthood Die All Conservatives)

(Problem) · Step 2 (Plan) · Step 3 (Data) · Step 4 (Analysis) · Step 5 (Conclusion)

52
New cards

Goals of statistical studies

1. Ask a Question

Identify a research question or objective

2. Plan the Study

Design the study / Decide what data to collect

3. Collect the Data

Gather data / Conduct the study

4. Analyze the Data

Summarize and make sense of the data

5. Draw Conclusions

Interpret the results / Make inferences

53
New cards

Strata

Groups of individuals in a population that share characteristics thought to be associated with the variables being measured in a study