Unit 1 AP Stats Review

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/104

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:55 AM on 4/22/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

105 Terms

1
New cards

What is the main focus of Unit 1 in statistics?

One-variable data, where each individual contributes one measurement.

2
New cards

Define 'individual' in the context of statistics.

An individual is the 'who' the data describe, such as a person, school, or game.

3
New cards

What is a variable?

A variable is a characteristic measured on each individual, such as height or GPA.

4
New cards

What are data in statistics?

Data are the actual recorded values of a variable.

5
New cards

Differentiate between categorical and quantitative variables.

Categorical variables take category names (e.g., blood type), while quantitative variables take numerical values (e.g., age).

6
New cards

What is a common mistake when identifying individuals and variables?

Mixing up individuals (e.g., 'students') with variables (e.g., 'GPA').

7
New cards

What are discrete quantitative variables?

Discrete quantitative variables take a finite or countable number of values with noticeable gaps.

8
New cards

What are continuous quantitative variables?

Continuous quantitative variables can take infinitely many values with no gaps.

9
New cards

What does a distribution show?

A distribution shows what values a variable takes and how often it takes them.

10
New cards

How should you describe a distribution?

Always tie your description to context, interpreting what the shape means in real situations.

11
New cards

What is a frequency table?

A frequency table lists each category and its count (frequency).

12
New cards

What is a relative frequency table?

A relative frequency table lists each category and its proportion of the total.

13
New cards

What is the advantage of relative frequencies?

Relative frequencies allow comparisons across groups of different sizes.

14
New cards

What is a bar chart?

A bar chart displays categories on one axis and frequencies on the other, with bars separated.

15
New cards

What is a pie chart?

A pie chart shows relative frequencies as slices of a circle, emphasizing parts of a whole.

16
New cards

What is a dot plot?

A dot plot places a dot for each data value along a number line, useful for small-to-moderate data sets.

17
New cards

What is a stemplot?

A stemplot splits each number into a stem and a leaf, preserving exact data values while showing distribution shape.

18
New cards

What is a histogram?

A histogram groups quantitative data into intervals (bins) and uses bars to show how many values fall in each bin.

19
New cards

What is a common pitfall when creating stemplots?

Not including a key to clarify what the stems and leaves represent.

20
New cards

What is the difference between a bar chart and a histogram?

Bar charts display categorical data with separated bars, while histograms display quantitative data with touching bars.

21
New cards

What is the purpose of using frequency and relative frequency tables?

To organize and summarize data for better understanding and comparison.

22
New cards

What is a common mistake when interpreting pie charts?

Over-interpreting tiny differences in pie chart slices.

23
New cards

What should you do when describing a graph?

Mention what the values represent, not just the graph type.

24
New cards

What is the importance of context when describing distributions?

Context helps interpret the meaning behind statistical shapes and values.

25
New cards

What is a common error when working with categorical data?

Using a histogram instead of a bar chart for categorical data.

26
New cards

How can you interpret a relative frequency as a probability-like statement?

By expressing it in terms of percentage, such as 'about 30 percent'.

27
New cards

What is the relative frequency of students taking 2 AP classes?

0.41 (900 out of 2200 students)

28
New cards

What type of plot is used to graph a quantitative variable measured over time?

Time plot

29
New cards

What are the three key aspects to look for in a time plot?

Overall trend, seasonality, unusual spikes or drops

30
New cards

What does a cumulative relative frequency plot show?

How counts or proportions accumulate as you move from smaller to larger values

31
New cards

What type of graph is best for test scores of 25 students?

Dotplot or stemplot

32
New cards

What is the purpose of SOCS in describing quantitative distributions?

To capture Shape, Outliers, Center, and Spread

33
New cards

What does the term 'unimodal' refer to in distribution shape?

A distribution with one clear peak

34
New cards

What does a right-skewed distribution indicate about the mean and median?

The mean is usually greater than the median

35
New cards

How do you calculate the interquartile range (IQR)?

IQR = Q3 - Q1

36
New cards

What is the 1.5·IQR rule used for?

To flag outliers in a data set

37
New cards

What does a bimodal distribution indicate?

The presence of two distinct subgroups within the data

38
New cards

What is the difference between descriptive and inferential statistics?

Descriptive statistics summarize data, while inferential statistics draw conclusions from data

39
New cards

What is the formula for calculating the sample mean?

x̄ = (Σx_i) / n

40
New cards

What does the term 'spread' refer to in statistics?

The scope of values from the smallest to the largest

41
New cards

What is a common mistake when interpreting histograms?

Calling a histogram a bar chart and treating bins like categories

42
New cards

What does a cumulative frequency plot help to estimate?

Medians and quartiles

43
New cards

What is the meaning of 'skewed left' in a distribution?

The distribution spreads far and thinly toward lower values (long left tail)

44
New cards

What is the significance of clusters in a distribution?

They suggest natural subgroups within the data

45
New cards

What is the median in a data set?

The middle value when the data is sorted

46
New cards

What does a uniform distribution look like?

The histogram is approximately a horizontal line with roughly equal frequencies across bins

47
New cards

What is the purpose of using the median instead of the mean?

The median is less sensitive to outliers

48
New cards

What does the term 'gaps' refer to in a distribution?

Holes where no values fall within the data set

49
New cards

What is the importance of context in describing a distribution?

It ties the statistical description to real-world significance

50
New cards

What is a common mistake when using dotplots for large data sets?

They become unreadable when the data set is very large

51
New cards

What does a bell-shaped distribution indicate?

It is symmetric with a central mound and two sloping tails

52
New cards

What is the role of outliers in a data set?

They can indicate unusual values that may require further investigation

53
New cards

What is the difference between a population and a sample?

A population is the entire group of interest; a sample is a subset of that population.

54
New cards

What symbol represents the population mean?

μ (mu)

55
New cards

What symbol represents the sample mean?

x̄ (x-bar)

56
New cards

What is the median?

The median is the middle value of a sorted dataset; if the number of values is odd, it's the middle one; if even, it's the average of the two middle values.

57
New cards

What are quartiles?

Quartiles split ordered data into four equal parts: Q1 (25th percentile), Q2 (median or 50th percentile), and Q3 (75th percentile).

58
New cards

What does IQR stand for and how is it calculated?

IQR stands for Interquartile Range, calculated as IQR = Q3 - Q1.

59
New cards

What is the range in statistics?

The range is the difference between the maximum and minimum values in a dataset.

60
New cards

What is variance?

Variance is the average of the squared differences from the mean, indicating how much data points differ from the mean.

61
New cards

What is the standard deviation?

The standard deviation is the square root of the variance, representing the typical distance of data points from the mean.

62
New cards

What is the five-number summary?

The five-number summary consists of the minimum, Q1, median, Q3, and maximum of a dataset.

63
New cards

What is a z-score?

A z-score indicates how many standard deviations a value is from the mean, calculated as z = (x - μ) / σ.

64
New cards

What is the purpose of a boxplot?

A boxplot visually summarizes the distribution of a dataset, showing the median, quartiles, and potential outliers.

65
New cards

How do you determine if a value is an outlier using the IQR?

A value is considered an outlier if it is below Q1 - 1.5IQR or above Q3 + 1.5IQR.

66
New cards

What effect do outliers have on the mean?

Outliers can significantly pull the mean in their direction, making it less representative of the dataset.

67
New cards

What is the difference between mean and median in a skewed distribution?

In a skewed distribution, the mean is affected by outliers and may differ from the median, which is more resistant to extreme values.

68
New cards

What is a percentile?

A percentile indicates the percentage of observations that fall below a certain value in a dataset.

69
New cards

What is the formula for calculating the sample standard deviation?

s = √(Σ(x_i - x̄)² / (n - 1)), where s is the sample standard deviation.

70
New cards

What does it mean if a score is at the 90th percentile?

It means that 90% of the scores are at or below that value.

71
New cards

What is the significance of the median in a dataset?

The median provides a measure of central tendency that is not skewed by outliers.

72
New cards

How does increasing all values in a dataset by a constant affect the mean?

Increasing all values by a constant adds that constant to the mean.

73
New cards

How does multiplying all values in a dataset by a constant affect the mean?

Multiplying all values by a constant multiplies the mean by that same constant.

74
New cards

What is the purpose of calculating the IQR?

The IQR measures the spread of the middle 50% of data, providing a robust measure of variability.

75
New cards

What does a boxplot reveal about skewness?

The length of the whiskers and the position of the median indicate the skewness of the data distribution.

76
New cards

What is the common mistake regarding standard deviation?

A common mistake is interpreting standard deviation as an average value instead of a measure of variability.

77
New cards

What are the key components to compare when analyzing two distributions?

Center, Spread, Shape, and Context.

78
New cards

How do you determine which group has larger typical values?

By comparing the medians or means, depending on appropriateness.

79
New cards

What measures can be used to assess variability between two groups?

Interquartile Range (IQR), Standard Deviation (SD), and Range.

80
New cards

What is the importance of context in comparing distributions?

It helps interpret differences in real-world terms.

81
New cards

What types of graphs can be used for comparing distributions?

Back-to-back stemplots, side-by-side histograms, parallel boxplots, and cumulative frequency plots.

82
New cards

Why is it important to keep scales consistent when comparing histograms?

Different scales can mislead the interpretation of data.

83
New cards

When comparing categorical distributions, what should be compared?

Relative frequencies, not counts.

84
New cards

What is the effect of adding a constant to every value in a dataset?

Measures of center increase by that constant, but measures of spread remain unchanged.

85
New cards

What happens to measures of center when multiplying every value by a constant?

Measures of center are multiplied by that constant.

86
New cards

What does a z-score represent?

How many standard deviations a value is from the mean.

87
New cards

What is the formula for calculating a z-score using sample summaries?

z = (x - x̄) / s

88
New cards

What does a negative z-score indicate?

The value is below the mean.

89
New cards

What is the shape of a Normal distribution?

Bell-shaped and symmetric.

90
New cards

What are the parameters of a Normal distribution?

Mean (μ) and standard deviation (σ).

91
New cards

What is the standard Normal distribution?

A Normal distribution with μ = 0 and σ = 1.

92
New cards

What does the 68-95-99.7 rule state about a Normal distribution?

About 68% of observations lie within 1 standard deviation, 95% within 2, and 99.7% within 3 standard deviations of the mean.

93
New cards

How do you find the proportion of a Normal distribution for a given value?

Standardize the value using z = (a - μ) / σ and use technology or a standard Normal table.

94
New cards

What is the effect of multiplying by a negative constant on a distribution?

It reflects the distribution on the number line, reversing the direction of skew.

95
New cards

What is the purpose of density curves?

To model a distribution with a smooth curve rather than raw data.

96
New cards

What does the area under a density curve represent?

The proportion of observations.

97
New cards

What is the median of a density curve?

The point with half the area to the left.

98
New cards

What is the balance point of a density curve?

The mean.

99
New cards

What should you check when comparing categorical data?

Ensure you compare proportions rather than raw counts.

100
New cards

What common mistake is made when comparing distributions?

Comparing counts instead of proportions for categorical data.