Stats Final Exam

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/260

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 4:16 PM on 6/24/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

261 Terms

1
New cards

What is statistics?

The process of collecting, analyzing, and presenting data.

2
New cards

What are descriptive statistics?

Methods that summarize collected data using graphs and calculations.

3
New cards

What are inferential statistics?

Methods that use sample data to make conclusions or predictions about a larger population.

4
New cards

What procedure should you use for one quantitative variable?

Mean, median, range, quartiles, IQR, standard deviation, histogram, dot plot, or boxplot.

5
New cards

What procedure should you use for one categorical variable?

Frequency table, relative frequency, bar chart, or pie chart.

6
New cards

What procedure should you use for two categorical variables?

Two-way table, conditional probability, independence/dependence, or segmented bar chart.

7
New cards

What procedure should you use for two quantitative variables?

Scatterplot, correlation, regression, prediction, residuals, r, or r^2.

8
New cards

What procedure should you use when a problem says success/failure with fixed n?

Binomial distribution.

9
New cards

What procedure should you use when a continuous variable is normal with mean and SD?

Normal distribution, z-score, NORM.DIST, or NORM.INV.

10
New cards

What procedure should you use to estimate a population mean or proportion?

Confidence interval.

11
New cards

What procedure should you use to test a claim about a population mean or proportion?

Hypothesis test.

12
New cards

Fastest way to avoid wrong answers

Before calculating, identify the data type and the goal of the problem.

13
New cards

What is a variable?

A characteristic, number, or quantity that can be measured or counted.

14
New cards

Quantitative variable

A numeric variable where an average makes sense, such as age, GPA, weight, or exam score.

15
New cards

Categorical variable

A group, quality, or label where an average usually does not make sense, such as color, major, or yes/no.

16
New cards

Discrete quantitative variable

A countable/listable integer-valued variable, such as number of siblings or number correct.

17
New cards

Continuous quantitative variable

A variable that can take values in a range, including decimals, such as height, weight, time, or temperature.

18
New cards

Nominal categorical variable

Categories with no natural order, such as colors, yes/no, or dog/cat/bird.

19
New cards

Ordinal categorical variable

Categories with a meaningful order, such as low/medium/high or first/second/third.

20
New cards

Why are ZIP codes and ID numbers usually not quantitative?

They are labels, not measurements where an average makes sense.

21
New cards

Mean formula

Mean = sum of values / number of values.

22
New cards

Excel command for mean

=AVERAGE(range)

23
New cards

Median

The middle value after sorting; the 50th percentile.

24
New cards

Excel command for median

=MEDIAN(range)

25
New cards

When should you use the median instead of the mean?

When the distribution has outliers or is strongly skewed.

26
New cards

What happens if the mean is higher than the median?

High values/outliers are pulling the mean up; the distribution may be right-skewed.

27
New cards

What happens if the mean is lower than the median?

Low values/outliers are pulling the mean down; the distribution may be left-skewed.

28
New cards

Range formula

Range = maximum - minimum.

29
New cards

Excel command for range

=MAX(range)-MIN(range)

30
New cards

Q1 meaning

The first quartile; about 25% of data are at or below Q1.

31
New cards

Q3 meaning

The third quartile; about 75% of data are at or below Q3.

32
New cards

Excel command for Q1

=QUARTILE.INC(range,1)

33
New cards

Excel command for Q3

=QUARTILE.INC(range,3)

34
New cards

IQR formula

IQR = Q3 - Q1.

35
New cards

What does IQR measure?

The spread of the middle 50% of the data.

36
New cards

Lower outlier fence formula

Lower fence = Q1 - 1.5(IQR).

37
New cards

Upper outlier fence formula

Upper fence = Q3 + 1.5(IQR).

38
New cards

Outlier rule

Any value below the lower fence or above the upper fence is considered an outlier.

39
New cards

Standard deviation meaning

The typical/average distance values are from the mean; larger SD means more spread out.

40
New cards

Excel command for population standard deviation

=STDEV.P(range)

41
New cards

Excel command for sample standard deviation

=STDEV.S(range)

42
New cards

What is the standard deviation if all values are identical?

0, because every value is exactly 0 away from the mean.

43
New cards

Dot plot is best for…

Small-to-medium quantitative data sets where individual values should be visible.

44
New cards

Histogram is best for…

Quantitative data grouped into intervals; good for shape, center, spread, gaps, and skew.

45
New cards

Boxplot is best for…

Quantitative data, especially when comparing groups using the five-number summary.

46
New cards

What does a boxplot show?

Minimum, Q1, median, Q3, maximum, and possible outliers.

47
New cards

Segmented bar chart is best for…

Comparing proportions for two categorical variables.

48
New cards

When describing a distribution, mention…

Shape, center, spread, and outliers; also clusters or gaps if present.

49
New cards

Right-skewed distribution

Long tail to the right; mean is usually greater than median.

50
New cards

Left-skewed distribution

Long tail to the left; mean is usually less than median.

51
New cards

When comparing two distributions, compare…

Center, spread, shape, and outliers - not only averages.

52
New cards

Frequency

A count of how many times a category occurs.

53
New cards

Relative frequency formula

Relative frequency = count / total.

54
New cards

What does a two-way table show?

Counts or proportions for two categorical variables at the same time.

55
New cards

What words signal conditional probability?

Given, among, of those, or within.

56
New cards

Conditional proportion formula from a table

Category count within the given group / total of the given group.

57
New cards

Denominator for a one-event probability

Usually the grand total.

58
New cards

Denominator for a conditional probability

The total for the GIVEN group.

59
New cards

Why use percentages instead of raw counts in segmented bar charts?

Percentages allow fair comparison when group sizes are different.

60
New cards

Bivariate data

Data with two variables.

61
New cards

In regression/correlation, what kind of variables are x and y?

Both variables should be quantitative.

62
New cards

Explanatory or predictor variable

The x variable; the variable used to make a prediction.

63
New cards

Response variable

The y variable; the variable being predicted or explained.

64
New cards

Why look at the scatterplot first?

To check form, direction, strength, and unusual points before calculating correlation/regression.

65
New cards

Scatterplot form options

Linear, curvilinear, or no relationship.

66
New cards

Scatterplot direction options

Positive, negative, or none.

67
New cards

Scatterplot strength options

Strong, moderate, or weak.

68
New cards

Positive association

As x increases, y tends to increase.

69
New cards

Negative association

As x increases, y tends to decrease.

70
New cards

Correlation r

Measures strength and direction of a linear relationship; ranges from -1 to 1.

71
New cards

Excel command for correlation

=CORREL(x_range,y_range)

72
New cards

What does r close to 1 mean?

A strong positive linear relationship.

73
New cards

What does r close to -1 mean?

A strong negative linear relationship.

74
New cards

What does r close to 0 mean?

Little or no linear relationship.

75
New cards

Correlation caution

Correlation does not prove causation; lurking/confounding variables may explain the association.

76
New cards

Coefficient of determination r^2

The percent of variation in y described by the regression line using x.

77
New cards

How do you interpret r^2 = 0.588?

58.8% of the variation in y is described by the regression line using x.

78
New cards

Regression equation form

y-hat = a + bx, where a is the intercept and b is the slope.

79
New cards

What is y-hat?

The predicted value of y.

80
New cards

Slope interpretation

For every 1-unit increase in x, the predicted y changes by b units.

81
New cards

Positive slope

Predicted y increases as x increases.

82
New cards

Negative slope

Predicted y decreases as x increases.

83
New cards

Y-intercept interpretation

When x = 0, the predicted y is the intercept value; only interpret if x = 0 makes sense in context.

84
New cards

How do you make a regression prediction?

Plug the given x-value into the regression equation and solve for y-hat.

85
New cards

Interpolation

Predicting within the range of observed x-values; usually more reliable.

86
New cards

Extrapolation

Predicting outside the range of observed x-values; riskier and less reliable.

87
New cards

Regression residual formula

Residual = actual y - predicted y.

88
New cards

Positive residual

Actual y is above the regression line; the model underestimated y.

89
New cards

Negative residual

Actual y is below the regression line; the model overestimated y.

90
New cards

Regression outlier

A point with a large vertical gap from the line of best fit.

91
New cards

Can outliers simply be removed?

No. They must be investigated to determine whether they are valid data.

92
New cards

Probability

A long-run value: as trials increase, the proportion of an outcome tends to stabilize.

93
New cards

P(A)

The probability that event A occurs.

94
New cards

Complement of A

The event that A does not occur, written A′ or Ac.

95
New cards

Complement rule

P(A′) = 1 - P(A).

96
New cards

Intersection

A and B; the outcomes where both events occur.

97
New cards

Union

A or B; the outcomes where A occurs, B occurs, or both occur.

98
New cards

Union formula

P(A or B) = P(A) + P(B) - P(A and B).

99
New cards

Conditional probability formula

P(A | B) = P(A and B) / P(B).

100
New cards

What does the vertical bar | mean in probability?

Given.