Lecture 5: Probability: Binomial Distribution, Normal Distribution, standard normal distribution, and Z-Tables

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/16

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

17 Terms

1
New cards

Independence

Two events, A and B, are independent if P(A | B) = P(A) or if P(B | A) = P(B)

2
New cards

To calculate probability…

  • Typically, count the number of participants that had the characteristic of interest and divide by the population size

  • For conditional probabilities, the population size (denominator) was modified to reflect the subpopulation of interest

  • P(A and B) = P(A) x P(B) if A and B happen simultaneously

  • P (C and D) = P(C) + P(D) if C and D are mutually exclusive

3
New cards

Binomial distribution

  • Model for dichotomous outcome

  • The binomial formula generates the probability of observing exactly x successes out of n

  • We typically do not talk about mean and variance for dichotomous variables, but we can quantify mean and variance for every probability distribution

  • Mean and variance of (random variables generated from) the binomial distribution

4
New cards

Binomial distribution: Model for dichotomous outcome

  1. Two possible values (responses) for each data point: success and failure

  2. Replications of the process are independent

  3. P(success) is constant for each replication

5
New cards

Binomial distribution: The binomial formula generates the probability of observing exactly x successes out of n

P(x success) = (n! / x! (n-x)!) px (1-p)n-x

6
New cards

Binomial Distribution: Notation

  • n = number of times the process is repeated

  • p = P(succes) where success is outcome of interest

  • x = number of successes of interest 0 ≤ x ≤ n

  • ! = factorial; k! = k(k-1)(k-2)… 1

7
New cards

What the binomial formula does in one step

The binomial coefficient (the fraction with factorials) figures out how many such orderings are possible and then multiply by the common probability

8
New cards

Binomial distribution: Mean and variance of (random variables generated from) the binomial distribution

  • Mean or expected number of successes: μ = np

  • Variance: σ2 = np(1-p)

  • Standard deviation: σ

9
New cards

Normal distribution

  • Aka Gaussian distribution

  • Model for a continuous outcome when the distribution is well described by a bell-shaped curve

  • Notation: μ = distribution mean and σ = distribution standard deviation

  • x-axis is used to display the scale of the characteristic/variable being analyzed (e.g., height, weight, systolic blood pressure)

  • y-axis reflects the probability density (relative likelihood) of observing each value

  • Curve is highest in the middle, suggesting that the values near the middle have higher probabilities or are more likely to occur; values at either extreme are much less likely to occur

  • Mean = median = mode (hump/most frequent value)

  • Area under the density curve before X=a represent P(X ≤ a)

10
New cards

Properties of normal distribution

  1. The normal distribution is symmetric about the mean i.e. P(X > μ) = P(X < μ) = 0.5.

    1. The mean = the median = the mode

  2. The mean and variance, μ and σ2, completely characterize the normal distribution

  3. P(a < X < b) = the area under the normal density curve from a to b

    1. 𝑃 (𝜇 − 𝜎 < 𝑋 < 𝜇 + 𝜎) = 0.68
      𝑃 (𝜇 − 2𝜎 < 𝑋 < 𝜇 + 2𝜎) = 0.95
      𝑃 (𝜇 − 3𝜎 < 𝑋 < 𝜇 + 3𝜎) = 0.99

  4. The probability density function of a normal distribution is given by p(x) = (1 / σ√2π) e-(x-μ)² / 2𝜎²

11
New cards

Standard normal distribution Z

  • Normal distribution with μ = 0 and 𝜎 = 1

  • P(-1 < X < 1) = 0.68

  • P(-2 < X < 2) = 0.95

  • P(-3 < X < 3) = 0.99

  • Z = x-μ / 𝜎

12
New cards

Normal distribution calculate probability

  • Template: Computing probabilities about normal distributions

  • For the normal distribution, and for other distributions for any continuous variable, there is no area in a single line, and thus the absolute likelihood P(x = specific value) is defined as 0

13
New cards

Normal distribution calculate probability: Template: Computing probabilities about normal distributions

  • First standardize or convert a problem about a normal distribution (X) into a problem about the standard normal distribution (Z)

  • Then use the Z table to compute the desired probability

14
New cards

Normal distribution calculate probability: For the normal distribution, and for other distributions for any continuous variable, there is no area in a single line, and thus the absolute likelihood P(x = specific value) is defined as 0

This is not true for the binomial distribution and for other probability distributions for discrete/categorical/ordinal variables

15
New cards

Percentiles of the normal distribution

  • The kth percentile is defined as the score that holds k percent of the scores below it

  • 90th percentile is the score that holds 90% of the scores below it

16
New cards

Percentiles of the normal distribution: 90th percentile is the score that holds 90% of the scores below it

  • Q1 = 25th percentile

  • Median = 50th percentile

  • Q3 = 75th percentile

17
New cards

For the normal distribution, the following is used to compute percentiles

x = μ + z 𝜎

z = x - μ / 𝜎

where

μ = mean of the random variable X

𝜎 = standard deviation

z = value from the standard distribution for the desired percentile