AP Statistics Final Exam Review

0.0(0)

Studied by 0 people

0.0(0)

Call with Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/508

Earn XP

Description and Tags

im cooked

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No study sessions yet.

509 Terms

New cards

Why is it important to look at the shape of a column of data before interpreting any statistics?

-Because shape determines the location and the spread of the column of data.
-Because graphs for shape come first in the computer output.
-To see if the data is unimodal, symmetrical, and without any exceptions.
-To see that all data values were included in the statistical analysis.

To see if the data is unimodal, symmetrical, and without any exceptions.

New cards

Why look at a chart to get shape information instead of looking at the data values themselves?

-A chart is needed to summarize the shape characteristic so that we can see it.
-Statistical computer output will give charts, but not the column of data.
-Charts are much easier to look at than columns of data.
-This statement is false, it is always better to look at the data values directly to get shape information.

A chart is needed to summarize the shape characteristic so that we can see it.

New cards

Match the two meanings below with their appropriate column titles in a frequency table.

A. Relative frequency.
B. Frequency.

-Count of the data values in a category.
-Proportion of the data values in a category.

Count of the data values in a category. : Frequency.
Proportion of the data values in a category. : Relative frequency.

New cards

In the frequency table below, what is the relative frequency for the category Mar? (answer to three decimal places, ex. 1.089)

0.156

New cards

Which of the following answers is NOT one of the first three columns in a frequency table?

-Relative Frequency
-Cumulative Frequency.
-Category.
-Frequency.

Cumulative Frequency.

New cards

In the frequency table below, what is the frequency for the category May? (answer to zero decimal places, ex. 1.)

New cards

In the frequency table below, what is the cumulative frequency and the cumulative relative frequency for the category Jun?

100% and 100%
160.0 and 1.0
1.0 and 1.0
35.0 and 0.21875

160.0 and 1.0

New cards

What type of graph is most appropriate to use to display the shape of qualitative data?

-A pie chart where the slices of the pie touch one each other.
-A histogram where the bars do touch each another.
-A bar chart where the bars do touch each other.
-A bar chart where the bars do not touch each other.

A bar chart where the bars do not touch each other.

New cards

A Pareto chart just a bar chart with the bars rearranged from the highest bar to the lowest bar.

TRUE or FALSE

TRUE

New cards

What type of graph is most appropriate to use to display the shape of discrete data?

-A mathematical curve such as a bell-shaped curve.
-A pie chart where the slices of the pie touch one each other.
-A bar chart where the bars do not touch each other.
-A histogram where the bars do touch each another.

A histogram where the bars do touch each another.

New cards

In a histogram for discrete data, what characteristic is shown on the x-axis, and what characteristic is shown on the y-axis?

-The data values in the data set on the x-axis, and the frequency of the data values on the y-axis.
-Bins of the data values, and bins of the frequency of the data values.
-Bins of the data values, and the frequency of the data values.
-The frequency of the data values, and the data values in the data set.

The data values in the data set on the x-axis, and the frequency of the data values on the y-axis.

New cards

In a histogram for continuous data, what characteristic is shown on the x-axis, and what characteristic is shown on the y-axis?

-The frequency of the data values, and the data values in the data set.
-Bins of the data values, and bins of the frequency of the data values.
-Bins of the data values on the x-axis, and the frequency of the data values in the bins on the y-axis.
-The data values in the data set, and the frequency of the data values.

Bins of the data values on the x-axis, and the frequency of the data values in the bins on the y-axis.

New cards

When making a histogram for continuous data, a bin is just a range of possible data values.

TRUE or FALSE

TRUE

New cards

A boxplot is used to display the shape of qualitative data.

TRUE or FALSE

FALSE

New cards

A boxplot shows location and spread information as does a histogram.

TRUE or FALSE

TRUE

New cards

In a boxplot of continuous data, what percent of the data values lie inside the box, and what percent of the data values lie outside the box?

75%, and 25%.
100%, and 0%.
25%, and 25%.
50%, and 50%.

50%, and 50%.

New cards

In the boxplot below, what is the value of the median?

10
36.25
27.5
40
17.5

27.5

New cards

In any boxplot (an example is shown below), the width of the box shows what characteristic of the data values?

-The location of the data values.
-The count of the data values.
-The spread of the data values.
-The shape of the data values.

The spread of the data values.

New cards

What is the general approach to analyzing the information in a histogram?

-First look for the mean, then look for the standard deviation.
-First look at the overall shape, then look for exceptions.
-First look at the modality, then look at the symmetry.
-First look the peak of the histogram, then look at the tails.

First look at the overall shape, then look for exceptions.

New cards

Which of the following answers is NOT one of our overall shapes?

-A symmetric shape.
-A skewed shape.
-A uniform shape.
-A modeless shape.

A modeless shape

New cards

A skewed left shape means that the peak of the histogram is on the left side of the histogram.

TRUE or FALSE

FALSE

New cards

Skewness in a histogram is a property of what in the histogram?

-The peak.
-The number of peaks.
-The tails.
-The extreme values.

The tails.

New cards

Which of the following answers is NOT an exception when analyzing a histogram?

-Any bimodal shapes.
-Any gaps or peaks.
-Any patterns or grouping
-Any extreme values.

Any bimodal shapes.

New cards

How is a gap distinguished from an extreme value?

-A gap is narrow, an extreme value is wide.
-There can be only one gap, while there can be many extreme values
-A gap is close to the peak. An extreme value is far from the peak.
-A gap is fits as part of the overall shape. An extreme value is outside the overall shape.

A gap is fits as part of the overall shape. An extreme value is outside the overall shape.

New cards

Match the appropriate set of statistics to the descriptions below.

A. Resistant statistics.
B. Efficient statistics.

-Extract the most information from a column of data.
-Less affected by extreme values.

Extract the most information from a column of data. : Efficient statistics.
Less affected by extreme values. : Resistant statistics.

New cards

Two data sets have the same size, but Data set A has a sum of squares of 97 and Data set B has a sum of squares of 197. In which data set are the data values more widely spread out?

-Data set B, because 197 can have more variation than 97.
-Data set A, because a sum of squares of 97 is smaller than a sum of squares of 197.
-Data set A, because 97 is closer to zero than 197.
-Data set B, because a sum of squares of 197 is bigger than a sum of squares of 97.

Data set B, because a sum of squares of 197 is bigger than a sum of squares of 97.

New cards

What is the formula for the sum of squares (SS)?

Sumofsquares=(x−xbar)2
Sumofsquares=(x−xbar)
Sumofsquares=∑(x)2−∑(xbar)2
Sumofsquares=∑(x−xbar)2

𝑆𝑢𝑚𝑜𝑓𝑠𝑞𝑢𝑎𝑟𝑒𝑠=∑(𝑥−𝑥𝑏𝑎𝑟)2

New cards

A deviation can never equal zero.

TRUE or FALSE

FALSE

New cards

Is the average deviation better than the standard deviation (σ)?

-No, because the standard deviation is a standardized version of the average deviation.
-Yes, because the average is the most representative value.
-No, because the average deviation always equals zero.
-Yes, because the average deviation is much quicker to calculate and much easier to use.

No, because the average deviation always equals zero.

New cards

Can the standard deviation (σ) be used to find probability?

-Yes, it can with a normal curve.
-Yes, because it is part of a distribution.
-No, because the average deviation is used.
-No, because spread is not needed to find probability.

Yes, it can with a normal curve.

New cards

What information about a data value is given by its deviation?

-The statistical distance and direction from the mean.
-The mathematical distance and direction from the mean.
-The magnitude (absolute value) of the difference between the data value and the median.
-How different the data value is from the mean and median.

The mathematical distance and direction from the mean.

New cards

A data value close to the mean has a?

Large deviation.
Small deviation.
No deviation.
Middle deviation.

Small deviation.

New cards

Match the appropriate statistic for a distribution of efficient statistics.
.
A. Histogram.
B. Standard deviation.
C. Mean.

Shape.
Location.
Spread.

Shape. : Histogram.
Location. : Mean.
Spread. : Standard deviation.

New cards

In statistics, a deviation applies to only one data value.

TRUE or FALSE

TRUE

New cards

To calculate variance, statistics does not average the deviations, instead it averages the?

The positive and negative deviations separately.
Squared deviations.
The square-root of the squared deviations.
The absolute value of the deviations.

Squared deviations

New cards

Does the sum of squares (SS) usually get bigger as more data values are added to the data set?

Yes, because there are more numbers in the data set to sum.
Yes, because bigger data sets must have bigger sum of squares than smaller data sets.
No, because the new data values could have negative squares.
No, because the sum of squares is adjusted for the number of data values in the data set.

Yes, because there are more numbers in the data set to sum.

New cards

The efficient measure of spread for a column of data is the?

Average deviation.
Standard deviation.
Middle deviation.
The average of all the deviation in the data set.

Standard deviation.

New cards

What is the statistical term for all the squared deviations added together?

Sum of squares.
Variance.
Standard deviation.
Root mean square.

Sum of squares.

New cards

Variance (σ2) is what type of measure of spread?

A raw measure of spread.
A measure of spread normalized for shape.
A measure of spread adjusted for bias.
A measure of spread standardized for the number of data values.

A measure of spread standardized for the number of data values.

New cards

A data value less than the mean has a?

Positive deviation.
Left deviation.
Negative deviation.
None of the above.

Negative deviation.

New cards

What is the appropriate denotation for variance?

φ¯ for population, v¯ for sample
μ2 for population, s2 for sample
σ2 for population, s2 for sample
VAR for population, var for sample

σ2 for population, s2 for sample

New cards

What is the appropriate denotation for standard deviation?

s population, σ for sample.
σ for population, std for sample.
µ for population, s for sample
σ for population, s for sample.

σ for population, s for sample.

New cards

The mean (µ) is NOT needed to calculate a deviation.

TRUE or FALSE

FALSE

New cards

The sum of squares (SS) is what type of measure of spread?

A standardized measure of spread.
A normalized measure of spread.
A raw measure of spread.
An adjusted measure of spread.

A raw measure of spread.

New cards

How is the sum of squares (SS) standardized into variance (σ2)?

Divide by the number of data values.
Divide by the degrees of freedom.
Take the square-root of the sum of squares.
Sum of squares is the same value as the variance.

Divide by the degrees of freedom.

New cards

Does the average deviation always equal zero for every data set?

Yes, because the deviations cluster about the mean.
Yes, because the positive and negative deviation cancel each other.
No, because this is not true.
No, because the average deviation equals zero for some data sets, but not for all datasets.

Yes, because the positive and negative deviation cancel each other.

New cards

The mean (µ) is used when thinking about the data values in a column of data because? (select two of the answers below)

The mean is easier to calculate than the median.
The mean can be used in advanced statistical methods.
A single number is easy to think about.
The mean best represents the values in the column of data.

A single number is easy to think about.
The mean best represents the values in the column of data.

New cards

Which one of the following answers is NOT correct about standard deviation?

Standarddeviation=√σ2
Standarddeviation=√Variance
Standarddeviation2=Variance−
Standarddeviation=Variance2

𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛=𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒2

New cards

A bigger standard deviation for a data set means the data values are?

Spread wider apart.
Spread narrower together.
Spread more evenly about the mean.
Have a more symmetrical spread.

Spread wider apart.

New cards

Match the appropriate statistic for a distribution of resistant statistics.

A. Boxplot.
B. Median.
C. Inter-quartile range.

Shape.
Location.
Spread.

Shape. : Boxplot.
Location. : Median.
Spread. : Inter-quartile range.

New cards

What are resistant statistics resistant to?

Any possible bias in the data values.
The presence of extreme values.
The presence of gaps in the shape .
Any minor recording errors in the data set.

The presence of extreme values.

New cards

What makes resistant statistics work?

They weight the data values lower when far from the mean.
The data values are divided by their deviation.
They look at the position of the data values, and not at their values.
They remove the high and low data values to eliminate any extreme values.

They look at the position of the data values, and not at their values.

New cards

To find resistant statistics, what must be true of the data set?

The data values must be ranked from lowest to highest.
The position of the data values must be found in the initial data set.
The data set must be checked to make certain that no two data values occupy the same position.
The data values must be ranked from highest to lowest.

The data values must be ranked from lowest to highest.

New cards

Why do positional statistics work for a data set containing extreme values?

Because extreme values occur near the mean where the positions of the data values are close together.
Because the range of position is (0, n), but the range of the values is (-∞, ∞).
Because positional statistics are not a very sensitive type of statistics.
Because in the tails, a big change in value is usually a small change in position.

Because in the tails, a big change in value is usually a small change in position.

New cards

Percentiles are NOT positional statistics.

TRUE or FALSE

FALSE

New cards

When denoting a percentile (Pk) what does the k stand for?

The percentage of data values greater than (to the right of) the percentile.
This is archaic nomenclature, now k is no longer used.
That the data set has been ranked for lowest to highest.
Which percentile is desired, the 0th percentile up through the 100th percentile.

Which percentile is desired, the 0th percentile up through the 100th percentile.

New cards

The value of the third quartile (Ǫ3) can be less than the value of the first quartile (Ǫ1).

TRUE or FALSE

FALSE

New cards

How many data values are less than the value of the third quartile (Ǫ3)?

Three quarters (75%)of the data values.
One third (33%) of the data values.
One quarter (25%) of the data values.
The middle half (50%) of the data values.

Three quarters (75%)of the data values.

New cards

Percentiles (Pk) (or quartiles (Ǫk) must always be a data value in the data set.

TRUE or FALSE

FALSE

New cards

Which of the answers below are NOT one of the steps to find any percentile (Pk)?

Step 3: Find the Value of the percentile from the ranked data set.
None of the other choices.
Step 1: Calculate the Index.
Step 2: Move to the correct Position.

None of the other choices.

New cards

The appropriate equation to use in Step 1: Calculate the Index of finding a percentile (Pk)is shown below.

i=k100×n

TRUE or FALSE

TRUE

New cards

In Step 2: Move to the correct position of finding a percentile (Pk), how is the appropriate move decided?

-If i is an integer, move up to the next higher data value. If i has a decimal, average that and the next higher data values .
-If i is an integer, average that and the next lower data values. If i has a decimal, move down to the next lower data value.
-If k/100 is less than 0.5, move down to the next lower data value. If k/100 is greater than 0.5, move up to the next higher data value.
-If i is an integer, average that and the next higher data values. If i has a decimal, move up to the next higher data value.

If i is an integer, average that and the next higher data values. If i has a decimal, move up to the next higher data value.

New cards

What is the 40th percentile (P40) in the following ranked set of data (n = 15)? (to one decimal place = 00.0)

9, 13, 14, 14, 15 18, 19, 24, 30, 37 40, 41, 44, 44, 193

18.5

New cards

What is the first quartile (Ǫ1) / third quartile (Ǫ3) in the following ranked set of data (n = 15)? (to one decimal place = 00.0)
9,13, 14, 14, 15 18, 19, 24, 30, 37 40, 41, 44, 44, 193

14 / 40
13.5 / 40.5
14 / 41
13 / 40

14 / 41

New cards

What is the 60th percentile (P60) in the following ranked set of data (n = 15)? (to one decimal place = 00.0)

9, 13, 14, 14, 15 18, 19, 24, 30, 37 40, 41, 44, 44, 193

33.5

New cards

What is the 90th percentile (P90) in the following ranked set of data (n = 15)? (to one decimal place = 00.0)

9, 13, 14, 14, 15 18, 19, 24, 30, 37 40, 41, 44, 44, 193

New cards

What is sum of the data values less than the 20th percentile (P20) the following ranked set of data (n = 15)? (to one decimal place = 00.0)

9, 13, 14, 15, 15 18, 19, 24, 30, 37 40, 41, 44, 44, 193

New cards

What is the value of the median (M) in the following ranked set of data (n = 15)? (to one decimal place = 00.0)

9, 13, 14, 14, 15 18, 19, 24, 30, 37 40, 41, 44, 44, 193

New cards

What is the value of the interquartile range (IQR)in the following ranked set of data (n = 15)? (to one decimal place = 00.0)

9, 13, 14, 14, 15 18, 19, 24, 30, 37 40, 41, 44, 44, 193

New cards

The interquartile range (IQR) can never be negative.

TRUE or FALSE

TRUE

New cards

The range is a reliable measure of spread.

TRUE or FALSE

FALSE

New cards

The interquartile range (IQR) measures the spread of what part of the data set?

Middle half (50%)
Upper half (50%)
Middle third (33%)
Middle majority (90%)

Middle half (50%)

New cards

What does a larger value for the interquartile range (IQR) mean about the data values?

The values of the data values are spread wider apart.
The median is larger.
The values of the data values are spread narrower together.
There are fewer data values less than the first quartile (Ǫ1) and greater than the third quartile (Ǫ3).

The values of the data values are spread wider apart.

New cards

What information is given by a five number summary?

All of these other answers.
It gives the median as a measure of location.
It gives the min and the max as a non-resistant measure of spread.
It gives the quartiles as a resistant measure of spread

All of these other answers.

New cards

Are there any extreme values in the ranked set of data below (n = 15)?
-19, -3, 11, 14, 15 18, 19, 24, 30, 37 40, 41, 44, 44, 90

There are no extreme values in this data set because all the data values are inside the fences.
Both -19 and 90 are extreme values.
90 is an extreme value because the upper fence is 81.5.
-19 is an extreme value because the lower fence is -27.5.

90 is an extreme value because the upper fence is 81.5.

New cards

Regarding a five number summary, what are the fences used for?

To find the most appropriate min and max.
To give a lower limit and an upper limit on the data values to detect extreme values.
To provide limits in which to use efficient statistics.
To keep the data values together for analysis.

To give a lower limit and an upper limit on the data values to detect extreme values.

New cards

What is the value of the lower fence, and of the upper fence, in the five number summary below?
{0.2, 6.05, 6.45, 6.95, 8.2}

0.2, and 8.2
5.15, and 7.85
-1.15, and 9.55
4.70, and 8.30

4.70, and 8.30

New cards

Are there any extreme values in the data set that has the five number summary below?
{0.2, 6.05, 6.45, 6.95, 8.2}

A histogram of this data set does not show any extreme values.
0.2 and 8.2 are both extreme values as they are outside of the fences.
0.2 because the lower fence is 4.70.
No extreme values because the fences are -1.15 and 9.55,

0.2 because the lower fence is 4.70.

New cards

Why is the distribution of a data set so important?

Because the distribution describes the data values in the data set.
Because the distribution tells how the data values are distributed.
Because the distribution can be used to find the probability of an event.
Because distribution is an easy way to look at the shape, location, and spread of the data values.

Because the distribution can be used to find the probability of an event.

New cards

What information does an area under the normal curve give?

The event being studied.
The size of the event of interest.
The probability of an event.
The range of the probability of an event.

The probability of an event.

New cards

Why is the probability of an event so important in the science of statistics?

Probability of an event gives the likelihood of the event
Probability of an event can be used in advanced statistical methods.
Probability of an event can be used to make a bet.
Probability of an event can be used to make a better decision.

Probability of an event can be used to make a better decision.

New cards

The probability of an event can mean two things:

The proportion of the population described by the event.
The chance an individual will be randomly selected.

TRUE or FALSE

FALSE

New cards

What are the critical parameters of a normal curve?

Mean (µ) and standard deviation (σ).
x and y.
Uni-modal and symmetrical.
Bell shape and an area of 1.

Mean (µ) and standard deviation (σ).

New cards

Only one critical parameter is needed to calculate probability with a normal curve.

TRUE or FALSE

FALSE

New cards

A normal curve can be described with one specific mathematical equation, if the values of the variables are known.

TRUE or FALSE

TRUE

New cards

The curve below is a normal curve.

TRUE or FALSE

FALSE

New cards

On a real number line, a change in the mean shifts a normal curve horizontally, and a change in the standard deviation shifts the normal curve vertically.

TRUE or FALSE

FALSE

New cards

Standard deviation (σ) is another way to give the width of a normal curve

TRUE or FALSE

TRUE

New cards

If the area under a normal curve less than z = -1.75 equals 0.04, what is the area under the curve to the right of z = -1.75?

0.96

New cards

If the area under a normal curve less than z = -1.75 equals 0.04, what is the area under the curve greater than z = +1.75?

0.04

New cards

If the area under a normal curve less than z = -1.75 equals 0.04, what is the area under the curve between z = -1.75 and z = +1.75?

0.92

New cards

For the Shapiro-Wilk test for normality, a P-value = 0.25 means that the data follow a normal shape.

TRUE or FALSE

TRUE

New cards

Which of the following answers is NOT true of a statistical event.

An event is an interval on the x-axis.
An event lies between -∞ and +∞.
An event can be a in the tail, in the body, or on the side of a normal curve.
An event is an area under a normal curve.

An event is an area under a normal curve.

New cards

What is the appropriate mathematics to use to find a left tail area?

An area from the z-table - 1 .
An area from the z-table - an area from the z-table.
An area from the z-table.
1 - an area from the z-table.

An area from the z-table.

New cards

What is the appropriate mathematics to use to find a two-tail area?

An area from the z-table + an area from the z-table.
An area from the z-table / an area from the z-table .
An area from the z-table - an area from the z-table.
(An area from the z-table) * 2

An area from the z-table + an area from the z-table.

New cards

What is the appropriate mathematics to use to find a right body area?

An area from the z-table - an area from the z-table.
An area from the z-table.
1 - An area from the z-table.
An area from the z-table - 1.

1 - An area from the z-table.

New cards

What is the appropriate mathematics to use to find a middle body area?

A left area from the z-table - a right area from the z-table.
A left area from the z-table + a right area from the z-table - 1.
1 - a left area from the z-table - a right area from the z-table.
A left area and a right area from the z-table.

1 - a left area from the z-table - a right area from the z-table.

New cards

What is the appropriate mathematics to use to find a right-side area?

A left area from the z-table + a right area from the z-table - 1.
The larger area from the z-table + the smaller area from the z-table.
1 - the larger area from the z-table.
The larger area from the z-table - the smaller area from the z-table.

The larger area from the z-table - the smaller area from the z-table.

New cards

The areas defined in the Empirical Rule of statistics are valid for all normal curves.

TRUE or FALSE

TRUE

100

New cards

What is the area under a normal curve between the mean plus one standard deviation (µ+1σ), and the mean plus two standard deviations (µ+2σ)?

13.5