Lecture 5: Probability: Binomial Distribution, Normal Distribution, standard normal distribution, and Z-Tables

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 16

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

17 Terms

1

Independence

Two events, A and B, are independent if P(A | B) = P(A) or if P(B | A) = P(B)

New cards
2

To calculate probability…

  • Typically, count the number of participants that had the characteristic of interest and divide by the population size

  • For conditional probabilities, the population size (denominator) was modified to reflect the subpopulation of interest

  • P(A and B) = P(A) x P(B) if A and B happen simultaneously

  • P (C and D) = P(C) + P(D) if C and D are mutually exclusive

New cards
3

Binomial distribution

  • Model for dichotomous outcome

  • The binomial formula generates the probability of observing exactly x successes out of n

  • We typically do not talk about mean and variance for dichotomous variables, but we can quantify mean and variance for every probability distribution

  • Mean and variance of (random variables generated from) the binomial distribution

New cards
4

Binomial distribution: Model for dichotomous outcome

  1. Two possible values (responses) for each data point: success and failure

  2. Replications of the process are independent

  3. P(success) is constant for each replication

New cards
5

Binomial distribution: The binomial formula generates the probability of observing exactly x successes out of n

P(x success) = (n! / x! (n-x)!) px (1-p)n-x

New cards
6

Binomial Distribution: Notation

  • n = number of times the process is repeated

  • p = P(succes) where success is outcome of interest

  • x = number of successes of interest 0 ≤ x ≤ n

  • ! = factorial; k! = k(k-1)(k-2)… 1

New cards
7

What the binomial formula does in one step

The binomial coefficient (the fraction with factorials) figures out how many such orderings are possible and then multiply by the common probability

New cards
8

Binomial distribution: Mean and variance of (random variables generated from) the binomial distribution

  • Mean or expected number of successes: μ = np

  • Variance: σ2 = np(1-p)

  • Standard deviation: σ

New cards
9

Normal distribution

  • Aka Gaussian distribution

  • Model for a continuous outcome when the distribution is well described by a bell-shaped curve

  • Notation: μ = distribution mean and σ = distribution standard deviation

  • x-axis is used to display the scale of the characteristic/variable being analyzed (e.g., height, weight, systolic blood pressure)

  • y-axis reflects the probability density (relative likelihood) of observing each value

  • Curve is highest in the middle, suggesting that the values near the middle have higher probabilities or are more likely to occur; values at either extreme are much less likely to occur

  • Mean = median = mode (hump/most frequent value)

  • Area under the density curve before X=a represent P(X ≤ a)

New cards
10

Properties of normal distribution

  1. The normal distribution is symmetric about the mean i.e. P(X > μ) = P(X < μ) = 0.5.

    1. The mean = the median = the mode

  2. The mean and variance, μ and σ2, completely characterize the normal distribution

  3. P(a < X < b) = the area under the normal density curve from a to b

    1. 𝑃 (𝜇 − 𝜎 < 𝑋 < 𝜇 + 𝜎) = 0.68
      𝑃 (𝜇 − 2𝜎 < 𝑋 < 𝜇 + 2𝜎) = 0.95
      𝑃 (𝜇 − 3𝜎 < 𝑋 < 𝜇 + 3𝜎) = 0.99

  4. The probability density function of a normal distribution is given by p(x) = (1 / σ√2π) e-(x-μ)² / 2𝜎²

New cards
11

Standard normal distribution Z

  • Normal distribution with μ = 0 and 𝜎 = 1

  • P(-1 < X < 1) = 0.68

  • P(-2 < X < 2) = 0.95

  • P(-3 < X < 3) = 0.99

  • Z = x-μ / 𝜎

New cards
12

Normal distribution calculate probability

  • Template: Computing probabilities about normal distributions

  • For the normal distribution, and for other distributions for any continuous variable, there is no area in a single line, and thus the absolute likelihood P(x = specific value) is defined as 0

New cards
13

Normal distribution calculate probability: Template: Computing probabilities about normal distributions

  • First standardize or convert a problem about a normal distribution (X) into a problem about the standard normal distribution (Z)

  • Then use the Z table to compute the desired probability

New cards
14

Normal distribution calculate probability: For the normal distribution, and for other distributions for any continuous variable, there is no area in a single line, and thus the absolute likelihood P(x = specific value) is defined as 0

This is not true for the binomial distribution and for other probability distributions for discrete/categorical/ordinal variables

New cards
15

Percentiles of the normal distribution

  • The kth percentile is defined as the score that holds k percent of the scores below it

  • 90th percentile is the score that holds 90% of the scores below it

New cards
16

Percentiles of the normal distribution: 90th percentile is the score that holds 90% of the scores below it

  • Q1 = 25th percentile

  • Median = 50th percentile

  • Q3 = 75th percentile

New cards
17

For the normal distribution, the following is used to compute percentiles

x = μ + z 𝜎

z = x - μ / 𝜎

where

μ = mean of the random variable X

𝜎 = standard deviation

z = value from the standard distribution for the desired percentile

New cards
robot