Unit 6: Anticipating Patterns

Terms and Concepts

  • Probability: the chance of the outcome of an event

  • Sample space: a set of all possible outcomes

  • Tree diagram: representation is useful in determining the sample space for an experiment, especially if there are relatively few possible outcomes.

Basic Probability Rules and Terms

  • Rule 1: For any event A, the probability of A is always greater than or equal to 0 and less than or equal to 1

  • Rule 2: The sum of the probabilities for all possible outcomes in a sample space is always 1

  • Impossible event: If an event can never occur, its probability is 0

  • Sure event: Of an event must occur every time, its probability is 1

  • “Odds in favor of an event”: ratio of the probability of the occurrence of an event to the probability of the nonoccurrence of that event.

    • Odds in favor of an event = P(Event A occurs) / P(Event A does not occur) or P(Event A occurs) : P(Event A does not occur)

  • Complement: the set of all possible outcomes in a sample space that do not lead to the event

  • Disjoint or mutually exclusive events: events that have no outcome in common. In other words, they cannot occur together.

  • Union: events A and B is the set of all possible outcomes that lead to at least one of the two events A and B

  • Intersection: events A and B is the set of all possible outcomes that lead to both events A and B

  • Conditional Events: A given B is a set of outcomes for event A that occurs if B has occurred

Random Variables and Their Probability Distribution

  • Variable: quantity whose value varies from subject to subject

  • Probability experiment: an experiment whose possible outcomes may be known but whose exact outcome is a random event and cannot be predicted with certainty in advance

  • Random variables: The outcome of a probability experiment takes a numerical value

  • Discrete random variable: quantitative variable that takes a countable number of values

  • Continuous random variable: a quantitative variable that can take all the possible values in a given range

Discrete Random Variable

  • Expected value: Computed by multiplying each value of the random variable by its probability and then adding over the sample space

  • Variance: sum of the product of squared deviation of the values of the variable from the mean and the corresponding probabilities

Combinations

  • Combination: the number of ways r items can be selected out of n items if the order of selection is not important.

Binomial Distribution

  • 3 Characteristics of a binomial experiment

    • There are a fixed number of trials

    • There are only 2 possible outcomes

    • The n trials are independent and are repeated using identical conditions

  • Binomial probability distribution:

    • Mean: μ = np

    • Variance: σ2 = npq

    • Standard deviation: σ = √npq

Geometric Distribution

  • 3 Characteristics of a geometric experiment

    • There are one or more Bernoulli trials with all failures except the last one, which is a success. In other words, you keep repeating what you are doing until the first success.

    • In theory, the number of trials could go on forever. There must be at least one trial.

    • The probability, p, of a success and the probability, q, of a failure is the same for each trial. p + q = 1 and q = 1 − p.

  • X = the number of independent trials until the first success

  • Mean: μ = 1/p

  • Standard Deviation: σ = √1/𝑝(1/𝑝−1)

The Probability Distribution of Continuous Random Variables

  • The continuous probability distribution (cdf): graph or a formula giving all possible values taken by a random variable and the corresponding probabilities

Let X be a continuous random variable taking values in the range (a, b)

  • The area under the density curve is equal to the probability

  • P(L < X < U) = the area under the curve between L and U, where a ≤ L ≤ U ≤ b

  • The total probability under the curve = 1

  • The probability that X takes a specific value is equal to 0, i.e., P(X = x0) = 0

Sampling Distribution

  • Parameter: a numerical measurement describing some characteristic of a population.

  • Statistic: a numerical measurement describing some characteristic of a sample.

  • Sampling distribution: the probability distribution of all possible values of a statistic, different samples of the same size from the same population will result in different statistical values

  • Standard error: standard deviation of the distribution of the statistics.

Central Limit Theorem

  • Central limit theorem: If the sample size is large enough then we can assume it has an approximately normal distribution.

    • The sample size has to be greater than 30 to assume an approximately normal distribution

  • The shape of the distribution of “X bar” becomes more symmetrical and bell-shaped

  • The center of the distribution of “X bar” remains at μ

  • The spread of the distribution “X bar” decreases, and the distribution becomes more peaked

Calculator Steps

Probabilities for means on the calculator

  • 2nd DISTR

  • 2:normalcdf

  • normalcdf (lower value of the area, upper value of the area, mean, standard deviation / √sample size)

  • where

    • mean is the mean of the original distribution

    • standard deviation is the standard deviation of the original distribution

    • sample size = n

Percentiles for means on the calculator

  • 2nd DISTR

  • 3:InvNorm

  • k = invNorm (area to the left of 𝑘, mean, standard deviation / √sample size)

  • Where→

    • k = the kth percentile

    • mean is the mean of the original distribution

    • standard deviation is the standard deviation of the original distribution

    • sample size = n

Probabilities for sums on the calculator

  • 2nd DISTR

  • 2: normalcdf (lower value of the area, upper value of the area, (n)(mean), (√n)(standard deviation))

  • where:

    • mean is the mean of the original distribution

    • standard deviation is the standard deviation of the original distribution

    • sample size = n

Percentiles for sums on the calculator

  • 2nd DIStR

  • 3:invNorm

  • k = invNorm (area to the left of k, (n)(mean), (√n)(standard deviation)

  • where:

    • k is the kth percentile

    • mean is the mean of the original distribution

    • standard deviation is the standard deviation of the original distribution

    • sample size = n

robot