Probability and Distributions Exam Notes

Probability Density Function

  • A function f(x)f(x) is a probability density function for a continuous random variable XX if the total area under its curve is equal to 1.

Joint Probability Distribution

  • In a joint probability distribution, the mean, median, and mode are not necessarily equal.

Probability Calculation: Selecting Boys and Girls

  • Problem: In a group of 5 boys and 3 girls, two are selected at random. Let xx be the number of boys and yy be the number of girls. Find the probability of x=2x = 2.

  • Solution:

    • Total number of ways to select 2 people from 8: 8<br>choose2=8!2!6!=28{8 <br>choose 2} = \frac{8!}{2!6!} = 28

    • Number of ways to select 2 boys from 5: 5<br>choose2=5!2!3!=10{5 <br>choose 2} = \frac{5!}{2!3!} = 10

    • Probability of selecting 2 boys: 1028=514\frac{10}{28} = \frac{5}{14}

Empirical Rule of Normal Distribution

  • In the empirical rule of normal distribution, approximately 68% of values are within 1 standard deviation from the mean.

Probability Distribution

  • A probability distribution is a representation of different values of a random variable with their corresponding probabilities.

Standard Normal Distribution (Z-Distribution)

  • The standard normal distribution, also called the z-distribution, is a special normal distribution where the mean is 0 and the standard deviation is 1.

Probability Calculation with Density Function

  • Problem: A continuous random variable XX can assume values between x=2x = 2 and x=4x = 4 and has a density function. Find P(2.4 < x < 3.5).

  • Note: The density function f(x)f(x) is not provided in the question, so a numerical answer cannot be computed without this information. If f(x)f(x) was provided, the probability P(2.4 < x < 3.5) would be found by calculating the definite integral of f(x)f(x) from 2.4 to 3.5: 2.43.5f(x)dx\int_{2.4}^{3.5} f(x) dx

Continuous vs. Discrete Random Variables

  • The number of students in a classroom is an example of a discrete random variable, not a continuous random variable.

Continuous Probability Distribution

  • Continuous probability distributions deal with intervals rather than point values.

Gaussian Distribution Probability Density Function

  • Problem: If the value of random variable is 2, mean is 5 and the standard deviation is 4, then find the probability density function of the gaussian distribution.

  • Solution: The probability density function (PDF) of a Gaussian (normal) distribution is given by: f(x)=1σ2πe12(xμσ)2f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{1}{2} (\frac{x - \mu}{\sigma})^2} where:

    • xx is the value of the random variable (2 in this case)

    • μ\mu is the mean (5 in this case)

    • σ\sigma is the standard deviation (4 in this case)
      Plugging in the values: f(2)=142πe12(254)2f(2) = \frac{1}{4 \sqrt{2\pi}} e^{-\frac{1}{2} (\frac{2 - 5}{4})^2}
      f(2)=142πe12(34)2f(2) = \frac{1}{4 \sqrt{2\pi}} e^{-\frac{1}{2} (\frac{-3}{4})^2}
      f(2)=142πe12(0.5625)f(2) = \frac{1}{4 \sqrt{2\pi}} e^{-\frac{1}{2} (0.5625)}
      f(2)=142πe0.28125f(2) = \frac{1}{4 \sqrt{2\pi}} e^{-0.28125}
      f(2)14×2.5066×0.7540.0997f(2) \approx \frac{1}{4 \times 2.5066} \times 0.754 \approx 0.0997

Probability Calculation with Given Density Function

  • Problem: A continuous random variable XX can assume values between x=2x = 2 and x=4x = 4 and has a density function given by: f(x)=x+18f(x) = \frac{x+1}{8}. Find P(2 < x < 4).

  • Solution: To find P(2 < x < 4), integrate the density function f(x)f(x) from 2 to 4:
    P(2 < x < 4) = \int{2}^{4} \frac{x+1}{8} dx P(2 < x < 4) = \frac{1}{8} \int{2}^{4} (x+1) dx
    P(2 < x < 4) = \frac{1}{8} [\frac{x^2}{2} + x]_{2}^{4}
    P(2 < x < 4) = \frac{1}{8} [(\frac{4^2}{2} + 4) - (\frac{2^2}{2} + 2)]
    P(2 < x < 4) = \frac{1}{8} [(8 + 4) - (2 + 2)]
    P(2 < x < 4) = \frac{1}{8} [12 - 4]
    P(2 < x < 4) = \frac{1}{8} [8]
    P(2 < x < 4) = 1.0

Expected Value and Mean

  • The mean also refers to the expected value of a distribution.

Distribution or Spread of Data

  • Standard deviation is the distribution or spread of the numbers in data set.

Probability within a Range

  • The probability distribution is used to define the random variable's probability coming within a distinct range of values.

Probability Calculation: Normally Distributed Time

  • Problem: The time a student spends learning a computer software package is normally distributed with a mean of 8 hours and a standard deviation of 1.5 hours. A student is selected at random. What is the probability that the student spends less than 6 hours learning?

    • Solution: To solve this, calculate the Z-score: Z=XμσZ = \frac{X - \mu}{\sigma} Where:

      • X=6X = 6 (the value we're interested in)

      • μ=8\mu = 8 (the mean)

      • σ=1.5\sigma = 1.5 (the standard deviation)
        Z=681.5=21.51.33Z = \frac{6 - 8}{1.5} = \frac{-2}{1.5} \approx -1.33

    • Use a Z-table or calculator to find the probability that Z < -1.33. This value is approximately 0.0918.

    • Therefore, the probability that a student spends less than 6 hours learning the software is approximately 0.09.