AP Stats: Chapter 6.1

Introduction

A random variable takes numerical values that describe the outcomes of a chance process
- Use capital letters to designate random variables
The probability distribution of a random variable gives it possible values and their probabilities
- Two main types: discrete and continuous

Discrete Random Variables

A discrete random variable, X, takes a fixed set of possible values with gaps between them
Probability Distribution for a Discrete Random Variable
- ^^The probability distribution of a discrete random variable, X, list the values x, and their probabilities p:^^

value	x⌄1	x⌄2	x⌄3	…
probability	p⌄1	p⌄2	p⌄3	…

^^For the probability distribution to be valid, the probabilities must satisfy two requirements:^^
- ^^every probability, p, is a number between 1 and 0^^
- ^^the sum of the probabilities is 1^^

Analyzing Discrete Random Variables: Describing Change

Display probability distribution with a histogram
- Values of variable go on the horizontal axis
- Probability goes on the vertical axis
A probability distribution histogram is really just a relative frequency histogram because probabilities are long-run frequencies.

Measuring Center: The Mean (Expected Value) of a Discrete Random Variable

The mean x̄ of a quantitative data set with n observations is
- x̄ = sum of data values/number of data values
The mean of any discrete random variable is an average of the possible outcomes, but a weighted average in which each outcome is weighted by its probability
The mean (expected value) of a discrete random variable is its average value over many, many repetitions of the same chance process.
- Suppose X is a discrete random variable with probability distribution:

value	x⌄1	x⌄2	x⌄3	…
probability	p⌄1	p⌄2	p⌄3	…

To find the mean (expected value) of X, multiply each possible value of X by its probability then add all of the products.

Measuring Variability: The Standard Deviation (and Variance) of a Discrete Random Variable

The standard deviation of a discrete random variable measures how much the values of the variable typically vary from the mean.
- Suppose that X is a discrete random variable with the probability distribution
- and that μ⌄x is the mean of X. The variance of X is
- Var(X)=σ^2⌄x = (*X⌄1 - μ⌄x)^2* *p⌄1 + (X*⌄2 - μ⌄x)^2* *p⌄2 + (X⌄3 - μ⌄x)^2* p⌄3 + …
- The standard of deviation of X is the square root of the variance.
Using a Calculator to Analyze Discrete Random Variables: TI-83/84
- Enter the values of the random variable list in L1 and the corresponding probabilities in list L2
- To graph a histogram of the probability distribution:
- In the stat plot menu define Plot 1 to be a histogram with Xlist: L1 and Freq: L2
- Adjust your window settings as follows: Xmin = -1, Xmax = 11, Xscl = 1, Ymin = -0.1, Ymax = 0.5, Yscl = 0.1.
- Press GRAPH
- To calculate the mean and standard deviation of the random variable, use one-variable statistics with the values in L1 and the probabilities (relative frequencies) in L2. Press STAT, arrow to the CALC menu, and choose 1-Var Stats.
- OS 2.55 or later: In the dialog box, specify List: L1 and FreqList: L2. Then choose Calculate.
- Older OS: Execute the command 1-Var Stats L1,L2.

Continuous Random Variables

A continuous random variable can take any value in an interval on the number line
^^How to Find Probabilities for a Continuous Random Variable^^
- ^^The probability of any event involving a continuous random variable is the area under the density curve and directly above the values on the horizontal axis that make up the event^^
The probability for a continuous random variable assigns probabilities to intervals of outcomes rather than to individual outcomes

Summary

A random variable takes numerical values determined by the outcome of a chance process. The probability distribution of a random variable gives its possible values and their probabilities. There are two types of random variables: discrete and continuous.
A discrete random variable has a fixed set of possible values with gaps between them.
- A valid probability distribution assigns each of these values a probability between 0 and 1 such that the sum of all the probabilities is exactly 1.
- The probability of any event is the sum of the probabilities of all the values that make up the event.
- We can display the probability distribution as a histogram, with the values of the variable on the horizontal axis and the probabilities on the vertical axis.
A continuous random variable can take any value in an interval on the number line.
- A valid probability distribution for a continuous random variable is described by a density curve with area 1 underneath.
- The probability of any event is the area under the density curve directly above the values on the horizontal axis that make up the event.
We can describe the shape of a probability distribution histogram or density curve in the same way as we did a distribution of quantitative data—by identifying symmetry or skewness and any major peaks.
Use the mean to summarize the center of a probability distribution. The mean of a random variable μX is the balance point of the probability distribution histogram or density curve.
- The mean is the long-run average value of the variable after many repetitions of the chance process. It is also known as the expected value of the random variable, E(X).
- If X is a discrete random variable, the mean is the average of the values of X, each weighted by its probability:
- μX=E(X)= ∑ x⌄i * p⌄i = x⌄1 p⌄1 + x⌄*2 * p⌄2 + x⌄3 * *p⌄3 + ···
Use the standard deviation to summarize the variability of a probability distribution. The standard deviation of a random variable σX measures how much the values of the variable typically vary from the mean.
- If X is a discrete random variable, the variance of X is the “average” squared deviation of the values of the variable from their mean:
- σX2=∑ (x⌄i−μ⌄X)^2*p⌄i* = (x⌄1−μ⌄X)^2 * *p⌄1 + (x⌄2−μ⌄X)^2 *p⌄2 + (xV3−μ⌄X)^2* * p⌄3 + …
The standard deviation of X is the square root of the variance.