Study Notes for STAT 1000: Density Curves & Normal Distributions
Density Curves
Definition: Density curves provide a model for the distribution of a continuous random variable.
They offer an overall picture without considering small irregularities or outliers.
Smooth curves are easier to work with than histograms.
Histogram Example
A histogram of test scores for high school students taking a provincial math exam illustrates the distribution of scores.
Observations:
Scores exhibit a fairly regular, symmetric distribution.
Pattern descends smoothly from the center; no gaps or outliers present.
Fitting a Curve: A smooth curve can be fitted to approximate the histogram's distribution, representing a mathematical model of the data.
Area Representation
The total area of the curve represents proportions of observations.
Like histograms, the area underneath the density curve can be scaled to equal one, indicating relative frequencies or proportions within intervals.
For example, the proportion of students scoring less than can be calculated as follows:
Representing students: .
Properties of Density Curves
All density curves have three key properties:
They lie entirely above the -axis.
The area under the curve is equal to one.
They represent a proper function: each value of corresponds to a unique value of .
Large Populations
When dealing with large populations (e.g., heights, incomes, GPAs):
Continuous populations lead to smoother histograms approximating density curves.
For infinitely large populations, histograms can have as many intervals as needed, producing a density curve.
Uniform Distribution
Definition: A uniform distribution is one in which all values in a range are equally likely.
Mathematical Properties:
Area under the curve equals the area of a rectangle defined by base and height.
For example, in the interval : Area , confirming it's a valid density curve.
For an interval : Proportion .
Finding Proportions in a Uniform Distribution
Calculation of proportions falling within intervals is straightforward due to the shape of uniform distribution:
For any range , the calculation is .
Example Problems
Area between values: Calculate the area (proportion of observations) between and .
Result using the uniform distribution height will yield P(1.6 < X < 3.3) = (3.3 - 1.6) \cdot \text{height} = 1.7 \cdot 0.25 = 0.425.
Solving Percentiles: Find values corresponding to a proportion (e.g., of observations in a uniform distribution).
Triangular Distribution
This type of distribution can be defined with its maximum height by ensuring that the area sums to . For a triangle:
Area . For the triangle with a defined base of .
The height must equal for normalization.
Parameters & Statistics
Today’s statistics class defined key terms:
Sample mean vs. population mean .
Sample standard deviation vs. population standard deviation .
Distinction of parameters (describes populations) vs. statistics (from samples).
Example Problem
MGD Beer example illustrating sample statistics (average content of cans) and indicating whether the values are parameters or statistics.
The Normal Distribution
A specific type of density curve, known for its bell shape and symmetric distribution. Key characteristics include:
Defined by parameters: population mean and standard deviation .
The distribution is symmetric and has a total area under the curve equal to one.
Notation: The normal distribution is denoted as .
Properties of Normal Distribution
Two main parameters:
Mean (): Location of the center of the distribution.
Standard deviation (): Measure of spread; must be positive.
Relationship Between Parameters
The mean indicates central location, while the standard deviation reflects the spread:
For an example with and , the normal distribution is defined by the equation: .
68-95-99.7 Rule
Key statistical rule for normal distributions:
Approximately:
of values fall within one standard deviation of the mean ().
of values fall within two standard deviations ().
of values fall within three standard deviations ().
Example Application
If , the proportion between and falls, according to the rule, within .
The proportion for values between and can be calculated as , indicating coverage within standard deviations.
Standard Normal Distribution
A special type of normal distribution with:
Mean , standard deviation .
Denoted by the variable , where .
Transforming To Standard Normal
To convert a normal variable into a standard normal variable , calculate the z-score as follows:
.
Example Calculations
For a height of cm when :
. This means the height is standard deviations above the population mean.
Proportion Calculations with Standard Normal Distribution
Techniques for finding proportions using the transformation process:
Sketch the normal curve, shading the area required for the computation. Use properties and known values from the standard normal table to derive answers.
Example problems
For P(-1 < Z < 1) the probability is approximately .
For P(Z > 2), details of symmetry can be used for computation leading to through the left area corresponding to Z < -2.
Backward Normal: Finding z-values
Methods of determining values corresponding to known proportions: find desired values in tables and transforming as necessary based on the projected proportions.
Finding Percentiles & Quantile Ranges
Steps to find specific values corresponding to given percentiles, involving locating values through known standard proportion tables and computation adjustments due to existing symmetry.
Example: Determine the interquartile range of a standard normal distribution by finding and .
Conclusion
Summary of how to approach finding areas under the normal curve, how to transform and find corresponding mean and standard deviations, as well as predicting values relative to targeted proportions. Examples provided illustrate practical applications of these concepts within various contexts, including the evaluation of normal variables methodology outlined.
Next Steps: Unit 05 - Probability & Sampling Distribution
Probability (Unit 05)
Definition: Probability quantifies the likelihood of an event occurring. It is a value between and (or and ).
Key Concepts:
Experiment: A process that leads to well-defined outcomes.
Outcome: A single possible result of an experiment.
Sample Space (): The set of all possible outcomes of an experiment.
Event: A subset of the sample space; a collection of one or more outcomes.
Rules of Probability
Rule 1: Probability Range: For any event , .
Rule 2: Sum of Probabilities: The sum of probabilities of all possible outcomes in a sample space is . .
Rule 3: Complement Rule: The probability that an event does not occur is .
Rule 4: Addition Rule for Disjoint Events: If two events and are disjoint (mutually exclusive), meaning they cannot occur at the same time, then .
Example: Rolling a or a on a single die roll.
Rule 5: General Addition Rule: For any two events and (not necessarily disjoint), .
is the probability that both and occur.
Rule 6: Multiplication Rule for Independent Events: If two events and are independent, meaning the occurrence of one does not affect the occurrence of the other, then .
Example: Flipping a coin twice and getting heads both times.
Rule 7: Conditional Probability: The probability of event occurring given that event has already occurred is denoted as .
, provided P(A) > 0.
Rule 8: General Multiplication Rule: For any two events and (not necessarily independent), .
Random Variables
Definition: A random variable is a numerical outcome of a random phenomenon.
Types of Random Variables:
Discrete Random Variable: A random variable that can take on a finite or countably infinite number of values. These values are often integers.
Examples: Number of heads in 3 coin flips (), number of cars passing a certain point in an hour.
Continuous Random Variable: A random variable that can take on any value within a given range.
Examples: Height, weight, temperature, time. (As discussed with density curves).
Probability Distributions for Discrete Random Variables
A probability distribution for a discrete random variable lists all possible values the variable can take and their corresponding probabilities.
Properties:
for each possible value .
(The sum of