Chapter 2: Modeling Distributions of Quantitative Data
2.1 Describing Location in a Distribution
- The pth percentile is the percent of values in a distribution that are less than or equal to the individual’s data value.
- Ex: 4 students of 25 earned a higher test score than Emily. 21/25 ppl have scores less than or equal to her, so she’s in the 84th percentile.
- Ex: The table reveals that 20.0% of U.S. presidents took office by the time they turned 50. In other words, the 20th percentile of the distribution of inauguration age is 50.000 years.
- A Cumulative Relative Frequency Graph plots a point corresponding to the cumulative relative frequency
- Make one by
- Useful to measure percentiles in a distribution
- The standardized score (z-score) tells us how many standard deviations a value is from the mean
- z = (value - mean) / (standard deviation)
- Values larger than mean have positive z-scores

- Transformations (a & b are constants):
- Neither transformations will change the shape.
- If data is being added/subtracted by a constant (a), all the values in a data set, measures of center and location—mean, five-number summary, percentiles—increase (decrease) by 𝑎
- Measures of variability—range, IQR, SD—do not change.
- When you multiply (divide) all the values in a data set by a positive constant 𝑏 everything changes: measures of center, location, and variability are multiplied (divided) by 𝑏.
- It’s possible to have multiple transformations of a and b in one distribution
2.2 Density Curves and Normal Distributions:
- Density Curves always add up to 1 or 100%
- Its basically a dot plot with a shit ton of calculus (area under the dot plot)
- If a density curve is symmetric both the mean and the median will lie at the same point of symmetry.
- If a curve is skewed, however, the mean is closer to tail, median is farther from tail

- Normal Density Curve is drawn over a histogram (calculus shit again)
- Shape: All Normal distributions have the same overall shape: symmetric, single-peaked (unimodal), and bell-shaped.
- Center: The mean μ is located at the midpoint of the symmetric density curve and is the same as the median.
- Variability: The standard deviation σ measures the variability (width) of a Normal distribution.

- The Empirical rule states that: 68% for [-1, 1] SD, 95% for [-2,2] SD, 99.7 for [-3,3] SD

- Reminder: You need to draw the normal graph & the scaled graph:

- Make sure to write N=(Mean, SD) for left graph, and N=(0,1) for right normalized graph
- To convert a percentile to a z score do backwards stuff:
- Look in the body of Table A for the value closest to 0.20. A 𝑧-score of –0.84 gives the closest value (0.2005).
- −0.84 = (x-5.3) / 0.9 → x = 4.544mins