AP Statistics: Chapter 1 Notes - Quantitative Variables & Distribution

Overview of Quantitative Variables
  • Transition from categorical to quantitative variables.

  • Importance of organizing and describing quantitative data.

Dot Plots
  • Definition: Simple representation of quantitative data where each data value is a dot on a line.

  • Example: Goals scored by US women's Olympic soccer team.

    • Most values between 1-4 goals, with notable outliers (13 and 14 goals).

  • Construction:

    • Draw horizontal axis and label (e.g., number of goals scored).

    • Scale axis from minimum to maximum data values.

    • Place a dot for each data point over its respective value.

Describing Distributions with CUSP
  • CUSP Acronym:

    • Center: Where the middle of the data lies.

    • Unusual Features: Identifying outliers that deviate from the overall data trend.

    • Shape: Whether distribution is symmetric, skewed left, or skewed right.

    • Pread: Range of the data points.

  • Importance of outliers and skewness in data analysis.

Shape Analysis
  • Symmetric vs. Skewed:

    • Symmetric: Roughly equal distribution on both sides of the center.

    • Skewed Left: More data points on the right; outliers on the left.

    • Skewed Right: More data points on the left; outliers on the right.

  • Use words like "roughly" and "approximately" to describe distribution shapes to allow for minor deviations.

Comparing Distributions
  • Use CUSP to compare distributions:

    • Example: South Africa vs UK household sizes.

    • Center: South Africa's average around 6-7; UK around 4-5.

    • Outliers: South Africa has high-number outliers (e.g., 26 or 27).

    • Shape: South Africa skewed right; UK roughly symmetric.

    • Spread: South Africa has larger spread with values ranging widely.

Stem-and-Leaf Plots
  • Definition: Visual representation that maintains actual numerical values of the data.

  • Construction:

    • Divide data into stems (all but final digit) and leaves (final digit).

    • Organize stems in a vertical column; leaves to the right.

    • Provide a clear key explaining the data representation.

  • Example: Responses about shoe ownership.

    • Key: A stem of '4' and a leaf of '9' means 49 pairs of shoes.

Histograms
  • Most common way to display quantitative variables:

    • Definition: A bar graph representation where bars touch to show frequency distribution.

  • Construction Steps:

    • Divide data range into equal-width classes.

    • Count frequencies for each range.

    • Scale axes properly: x-axis (data ranges), y-axis (frequencies).

  • Frequency vs. Relative Frequency Histograms:

    • Frequency: Counts of how many fall into each class range.

    • Relative Frequency: Percentages based on total counts.

  • Shape, center, and spread can also be described, following the CUSP method.

Creating Histograms on a Calculator (TI-84)
  1. Input data using the Stat button and select Edit.

  2. Enter data into lists.

  3. Activate Stat Plot (above y=):

    • Turn On the desired plot.

    • Choose histogram as the type.

    • Set x-list according to where data is stored.

  4. Use zoom option to fit window to data with Zoom Stat (Option 9).

  5. Analyze via Trace to view numerical values of each histogram bar, including data range and counts.

  6. Ensure axes are labeled, and provide a title for clarity.

Conclusion
  • Important to organize data visually to analyze effectively.

  • Each type of graph presents unique insights into the data distribution.

  • Next discussions will build on understanding and analyzing distributions statistically and making inferences.