AP Statistics: Chapter 1 Notes - Quantitative Variables & Distribution
Overview of Quantitative Variables
Transition from categorical to quantitative variables.
Importance of organizing and describing quantitative data.
Dot Plots
Definition: Simple representation of quantitative data where each data value is a dot on a line.
Example: Goals scored by US women's Olympic soccer team.
Most values between 1-4 goals, with notable outliers (13 and 14 goals).
Construction:
Draw horizontal axis and label (e.g., number of goals scored).
Scale axis from minimum to maximum data values.
Place a dot for each data point over its respective value.
Describing Distributions with CUSP
CUSP Acronym:
Center: Where the middle of the data lies.
Unusual Features: Identifying outliers that deviate from the overall data trend.
Shape: Whether distribution is symmetric, skewed left, or skewed right.
Pread: Range of the data points.
Importance of outliers and skewness in data analysis.
Shape Analysis
Symmetric vs. Skewed:
Symmetric: Roughly equal distribution on both sides of the center.
Skewed Left: More data points on the right; outliers on the left.
Skewed Right: More data points on the left; outliers on the right.
Use words like "roughly" and "approximately" to describe distribution shapes to allow for minor deviations.
Comparing Distributions
Use CUSP to compare distributions:
Example: South Africa vs UK household sizes.
Center: South Africa's average around 6-7; UK around 4-5.
Outliers: South Africa has high-number outliers (e.g., 26 or 27).
Shape: South Africa skewed right; UK roughly symmetric.
Spread: South Africa has larger spread with values ranging widely.
Stem-and-Leaf Plots
Definition: Visual representation that maintains actual numerical values of the data.
Construction:
Divide data into stems (all but final digit) and leaves (final digit).
Organize stems in a vertical column; leaves to the right.
Provide a clear key explaining the data representation.
Example: Responses about shoe ownership.
Key: A stem of '4' and a leaf of '9' means 49 pairs of shoes.
Histograms
Most common way to display quantitative variables:
Definition: A bar graph representation where bars touch to show frequency distribution.
Construction Steps:
Divide data range into equal-width classes.
Count frequencies for each range.
Scale axes properly: x-axis (data ranges), y-axis (frequencies).
Frequency vs. Relative Frequency Histograms:
Frequency: Counts of how many fall into each class range.
Relative Frequency: Percentages based on total counts.
Shape, center, and spread can also be described, following the CUSP method.
Creating Histograms on a Calculator (TI-84)
Input data using the
Statbutton and selectEdit.Enter data into lists.
Activate
Stat Plot(above y=):Turn
Onthe desired plot.Choose histogram as the type.
Set x-list according to where data is stored.
Use
zoomoption to fit window to data withZoom Stat(Option 9).Analyze via
Traceto view numerical values of each histogram bar, including data range and counts.Ensure axes are labeled, and provide a title for clarity.
Conclusion
Important to organize data visually to analyze effectively.
Each type of graph presents unique insights into the data distribution.
Next discussions will build on understanding and analyzing distributions statistically and making inferences.