Statistics Study Notes
Introduction
- This chapter discusses statistics and its importance, covering various foundational topics essential for understanding statistical concepts.
- Sections include:
- A. What are Statistics?
- B. Importance of Statistics
- C. Descriptive Statistics
- D. Inferential Statistics
- E. Variables
- F. Percentiles
- G. Measurement
- H. Levels of Measurement
- I. Distributions
- J. Summation Notation
- K. Linear Transformations
- L. Logarithms
- M. Exercises
Variables by Heidi Ziemer
- Prerequisites: None
- Learning Objectives:
- Define and distinguish between independent and dependent variables.
- Define and distinguish between discrete and continuous variables.
- Define and distinguish between qualitative and quantitative variables.
Independent and Dependent Variables
- Definition of Variables:
- Properties or characteristics that can vary in value; unlike constants such as π.
- Independent Variables: Manipulated by the researcher to observe effects.
- Dependent Variables: The outcomes measured to see the effects of changes in the independent variable.
Examples:
Blueberries and Aging:
- Independent Variable: Type of dietary supplement (none, blueberry, strawberry, spinach).
- Dependent Variables: Memory test and motor skills test.
Beta-Carotene and Cancer:
- Independent Variable: Supplements (beta-carotene vs. placebo).
- Dependent Variable: Occurrence of cancer rates.
Brightness of Brake Lights:
- Independent Variable: Brightness of brake lights.
- Dependent Variable: Time to react and hit brakes.
Levels of an Independent Variable
- Treatment comparisons can have two levels (experimental and control) or more depending on the number of experimental conditions; e.g., comparing five types of diets yields 5 levels.
Qualitative and Quantitative Variables
- Qualitative Variables: Attribute-based (e.g., hair color, gender); no numerical ordering.
- Quantitative Variables: Numeric-based (e.g., height, weight).
- Example in diets: type of supplement is qualitative; memory test scores are quantitative.
Discrete and Continuous Variables
- Discrete Variables: Countable values; e.g., number of children (cannot have fractions).
- Continuous Variables: Measured on a continuum; e.g., time to respond to a question (can have fractions).
Percentiles by David Lane
- Prerequisites: None
- Learning Objectives:
- Define percentiles.
- Use formulas for computing percentiles.
Definition of Percentiles
- A score that ranks another score relative to a distribution;
- Example: A score at the 65th percentile means 65% of scores are lower.
Different Definitions of Percentiles
- Definition 1: Lowest score greater than 65% of scores.
- Definition 2: Smallest score that is greater than or equal to 65% of scores.
- Weighted Average Definition: Combines results from Definitions 1 and 2 while handling rounding issues elegantly.
Example Calculations:
- Calculating the 25th Percentile:
- Formula to find rank (R) of Pth percentile: R = \frac{P}{100} \times (N + 1)
- If R is an integer, the percentile is the corresponding ranked score. If not, interpolate between scores.
- Interpolation Formula Example: For 2nd score at rank 2 and 3rd at rank 3, interpolate using fractional rank.
Distributions by David M. Lane and Heidi Ziemer
- Prerequisites: Chapter 1: Variables
- Learning Objectives:
- Define “distribution.”
- Interpret frequency distributions.
- Distinguish between frequency and probability distributions.
- Construct a grouped frequency distribution for continuous variables.
- Identify skewness of distributions.
- Identify bimodal, leptokurtic, and platykurtic distributions.
Distributions of Discrete Variables
- Example: Counting M&M colors leads to a frequency table depicting their distributions (counts of each color).
Continuous Variables
- Use grouped frequency distributions for continuous data, e.g., response times in milliseconds.
- Group scores within intervals to summarize data effectively and construct histograms.
Probability Densities
- Continuous distributions represented visually via probability density graphs; normally distributed (bell-shaped) curves.
Key Points:
- The area under the curve = 1.
- The chance of obtaining an exact measure is zero.
- Probabilities are represented by areas under the curve between two points.
Shapes of Distributions
- Positive Skew: Longer tail on the right, mean > median.
- Negative Skew: Longer tail on the left, mean < median.
- Bimodal Distribution: Two peaks.
- Leptokurtic: Taller peak, fatter tails.
- Platykurtic: Flatter peak, thinner tails.
Summation Notation by David M. Lane
- Learning Objectives:
- Use summation notation for sums of all numbers.
- Use notation for subsets of numbers and sum of squares.
Summation Basics
- Greek letter \Sigma signifies summation; example for grapes:
- \Sigma{i=1}^{n} Xi = X1 + X2 + \ldots + X_n.
- Notation for summing squares exemplified and requires careful interpretation (difference between total sum squaring and squaring totals).
Exercises
- Practice Problems for reinforcement, covering topics from chapters:
- Inferential vs. descriptive statistics application.
- Identifying independent/dependent variables.
- Interpreting percentiles and computational exercises.