Lecture 2

Key Updates

  • Reminder about midterm exam approaching.

  • Recommended to finish lectures 1-3 and quizzes 1-3 before the exam.

  • Content will be posted early for review preparation.

Statistics Basics

Key Definitions

  • Mode: The most frequently occurring value in a dataset (e.g., in the dataset 2, 3, 3, 5, 7, 10, the mode is 3).

  • Median: The middle value in an ordered dataset. For 2, 3, 3, 5, 7, 10, the median is 4 (average of 3 and 5).

  • Mean: The average value calculated by summing all values and dividing by the number of values. For the same dataset, the mean is 5 (30/6).

  • Frequency: The number of times a number appears in a dataset.

Frequency Table Example

  • Example using a taste test with a scale from 1 (bad) to 5 (excellent) given by 20 people.

Binning Data

  • Useful for large datasets.

  • Example: Grouping revenues into bins (e.g., $0-$50 million, $50-$100 million).

Relative Frequency

  • Calculated by dividing the frequency of a category by the total frequency.

  • Example: If 4 students received an A out of 25, the relative frequency is 0.16 (4/25).

Range

  • Difference between highest and lowest values in a dataset. Example: Quiz 1's range is 9, while Quiz 2's range is 8.

Graphing Essentials

Key Components of Good Graphs

  • Title/caption, source of data, vertical and horizontal scales/labels, and legends for clarity.

  • Considerations for presenting multiple datasets.

Pareto Chart

  • A special type of bar graph where bars are arranged in frequency order.

Deceptive Graphs

  • Look for irrelevant visuals and inconsistent intervals when analyzing graphs.

  • Example reviewed: Misleading scales can distort data interpretation.

Skewness & Symmetry

  • Symmetrical distributions have mirror-image left and right halves.

  • Non-symmetrical distributions affect mean, median, and mode interpretations.

Standard Deviation

Importance

  • Key measure of variation in data.

  • Often represented as a single number for ease of understanding.

Calculation Steps

  1. Compute the mean of the dataset.

  2. Find deviations from the mean.

  3. Square the deviations.

  4. Calculate the variance by averaging the squared deviations.

  5. Standard deviation = square root of variance.

Example Calculation

  • Given values: 3, 5, 7. Mean = 5, deviations = [-2, 0, 2], squares = [4, 0, 4]. Variance = 2.67, Standard Deviation ≈ 1.63.

Conclusion

  • Review assigned chapters fully and complete quizzes to prepare for the exam.