Fundamental Concepts and Applications of Probability and Statistics

Fundamentals of Probability Theory

Probability theory is centered on the study of random variables and their potential outcomes. The sample space, denoted as SS, represents the comprehensive set of all possible outcomes of a random variable. An outcome specifically refers to the result obtained from a single trial or observation. An event, designated as EE, is defined as a collection consisting of one or more specific outcomes from the sample space. The mathematical definition for the probability of an event happening is expressed as P(E)=Successful outcomesTotal possible outcomesP(E) = \frac{\text{Successful outcomes}}{\text{Total possible outcomes}}.

Properties of Probability and Complementary Events

Probability values are governed by several fundamental mathematical properties. The probability of the sample space, P(S)P(S), is equal to 11, signifying a sure event. Conversely, the probability of a null or empty set, P()P(\emptyset), is equal to 00, signifying an impossible event. All probability values must fall within the range of 0P(A)10 \le P(A) \le 1, meaning probability is always non-negative.

Complementary events are defined by the occurrence of an event AA versus the non-occurrence of that event, denoted as AcA^c or "not A". The relationship between the number of outcomes in event AA and its complement relative to the sample space is given by n(A)+n(Ac)=n(S)n(A) + n(A^c) = n(S). When normalized by the total number of outcomes in the sample space, this leads to the formula n(A)n(S)+n(Ac)n(S)=1\frac{n(A)}{n(S)} + \frac{n(A^c)}{n(S)} = 1, which represents the principle that the probability of success plus the probability of failure equals one: P(A)+P(Ac)=1P(A) + P(A^c) = 1. Consequently, the probability of an event can be calculated as P(A)=1P(Ac)P(A) = 1 - P(A^c).

The Addition Rules of Probability

The addition rule is applied to determine the probability that at least one of two or more events will occur in a single trial. These rules are categorized based on whether the events are mutually exclusive or non-mutually exclusive. Mutually exclusive events are two or more events that can never happen simultaneously in the same trial. For these events, the probability of EE or FF occurring is the sum of their individual probabilities: P(E or F)=P(E)+P(F)P(E \text{ or } F) = P(E) + P(F).

Non-mutually exclusive events are two or more events that can happen at the same time in the same trial. To calculate the probability of EE or FF occurring in this scenario, one must subtract the probability of both events occurring simultaneously to avoid double-counting. The formula is: P(E or F)=P(E)+P(F)P(E and F)P(E \text{ or } F) = P(E) + P(F) - P(E \text{ and } F).

Probability Problem Solving: Discrete Objects

Consider a box containing 88 red balls, 1010 white balls, and 1212 blue balls. If one ball is drawn at random, the probability that it is not blue can be calculated in two ways. First, by calculating the success outcomes (red and white) over the total: P(not blue)=8+1030=1830=0.6P(\text{not blue}) = \frac{8 + 10}{30} = \frac{18}{30} = 0.6. Alternatively, using the complement rule: P(not blue)=1P(blue)=11230=1830=0.6P(\text{not blue}) = 1 - P(\text{blue}) = 1 - \frac{12}{30} = \frac{18}{30} = 0.6.

Experiments involving sequential draws, such as drawing from a bag containing 33 white balls and 55 red balls, highlight the difference between sampling with and without replacement. If two balls are drawn without replacement and we seek the probability that both are red, the probability of the first being red is 58\frac{5}{8} and the second is 47\frac{4}{7}. The joint probability is 58×47=20560.357\frac{5}{8} \times \frac{4}{7} = \frac{20}{56} \approx 0.357. If sampling with replacement, the probability for each draw remains 58\frac{5}{8}, resulting in a joint probability of 58×58=25640.391\frac{5}{8} \times \frac{5}{8} = \frac{25}{64} \approx 0.391.

When seeking the probability of drawing one red and one white ball (in any order) without replacement from the same bag, we calculate the sum of the sequences: (R1 then W2)(R_1 \text{ then } W_2) and (W1 then R2)(W_1 \text{ then } R_2). This is represented as (58×37)+(38×57)=1556+1556=30560.536(\frac{5}{8} \times \frac{3}{7}) + (\frac{3}{8} \times \frac{5}{7}) = \frac{15}{56} + \frac{15}{56} = \frac{30}{56} \approx 0.536. With replacement, the calculation becomes (58×38)+(38×58)=1564+1564=30640.469(\frac{5}{8} \times \frac{3}{8}) + (\frac{3}{8} \times \frac{5}{8}) = \frac{15}{64} + \frac{15}{64} = \frac{30}{64} \approx 0.469.

Probability in Card Games, Committees, and Dice

In a standard deck of 5252 playing cards (including 1313 of each suit: clubs, diamonds, hearts, and spades), finding the probability of drawing a number card (defined here as Ace to 1010, totaling 4040 cards) or a club (1313 cards) requires the non-mutually exclusive addition rule because some clubs are also number cards. There are 1010 number cards that are clubs (Ace through 1010 of clubs). Thus, P(number or club)=4052+13521052=43520.827P(\text{number or club}) = \frac{40}{52} + \frac{13}{52} - \frac{10}{52} = \frac{43}{52} \approx 0.827.

When conducting an experiment across three separate 5252-card decks, if we want a diamond from the first, an ace from the second, and the ace of hearts from the third, the probability is the product of individual events: P=1352×452×152=14×113×152=127040.00037P = \frac{13}{52} \times \frac{4}{52} \times \frac{1}{52} = \frac{1}{4} \times \frac{1}{13} \times \frac{1}{52} = \frac{1}{2704} \approx 0.00037.

Combinatorial analysis is often required for selection problems. For a committee of 33 formed from 55 men and 44 women, the probability that exactly two are men is calculated using combinations: P=(52)×(41)(93)=10×484=40840.47619P = \frac{\binom{5}{2} \times \binom{4}{1}}{\binom{9}{3}} = \frac{10 \times 4}{84} = \frac{40}{84} \approx 0.47619. Similarly, in a class of 1212 boys and 44 girls, the probability of selecting 33 boys at random can be found via consecutive probabilities: 1216×1115×1014=11280.39286\frac{12}{16} \times \frac{11}{15} \times \frac{10}{14} = \frac{11}{28} \approx 0.39286, or via combinations: P=(123)(163)=220560=11280.39286P = \frac{\binom{12}{3}}{\binom{16}{3}} = \frac{220}{560} = \frac{11}{28} \approx 0.39286.

In independent events, such as Juan solving 90%90\% (0.90.9) and Pedro solving 70%70\% (0.70.7) of tasks, the probability at least one solves a problem is P(J)+P(P)P(J and P)=0.9+0.7(0.9×0.7)=0.97P(J) + P(P) - P(J \text{ and } P) = 0.9 + 0.7 - (0.9 \times 0.7) = 0.97. For contradiction cases, such as A telling the truth in 75%75\% of cases and B in 80%80\%, the probability they contradict each other is the sum of the probabilities that one tells the truth while the other lies: (0.75×0.20)+(0.25×0.80)=0.15+0.20=0.35(0.75 \times 0.20) + (0.25 \times 0.80) = 0.15 + 0.20 = 0.35.

For dice-rolling, a pair of dice has 3636 possible sum outcomes ranging from 22 to 1212. The probability of obtaining a sum of 55 (which can occur in 44 ways: 14,41,23,321-4, 4-1, 2-3, 3-2) or a sum of 1010 (which can occur in 33 ways: 46,64,554-6, 6-4, 5-5) is P=436+336=7360.19444P = \frac{4}{36} + \frac{3}{36} = \frac{7}{36} \approx 0.19444.

In a set of 5050 tickets numbered 11 to 5050, the probability of drawing a ticket that is odd (numbered 1,3,,491, 3, \dots, 49 for 2525 total) or greater than 2525 (numbered 26,27,,5026, 27, \dots, 50 for 2525 total) involves overlapping outcomes. There are 1212 odd numbers greater than 2525 (27,29,31,33,35,37,39,41,43,45,47,4927, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49—note: calculation error in transcript page 15 implies P=0.75P = 0.75 but shows a fraction suggesting a different setup). Following the transcript's logic: P=2550+2550(25× unclear fraction)=0.75P = \frac{25}{50} + \frac{25}{50} - (25 \times \text{ unclear fraction}) = 0.75.

Repeated Probability (Binomial Distribution)

Repeated probability, specifically the Binomial Distribution, describes the probability that an event will occur exactly rr times out of nn trials. The formula is expressed as P=(nr)SrFnrP = \binom{n}{r} S^r F^{n-r}, where nn is the number of trials, rr is the number of successes, SS is the probability of success, and FF is the probability of failure. The identity S+F=1S + F = 1 must always hold.

In a scenario where a couple has 33 children (n=3n=3), the probability of having at least one boy (where S=0.5S=0.5 and F=0.5F=0.5) can be calculated by summing the probabilities of having exactly 11, 22, or 33 boys: ((31)×0.51×0.52)+((32)×0.52×0.51)+((33)×0.53×0.50)=78=0.875(\binom{3}{1} \times 0.5^1 \times 0.5^2) + (\binom{3}{2} \times 0.5^2 \times 0.5^1) + (\binom{3}{3} \times 0.5^3 \times 0.5^0) = \frac{7}{8} = 0.875. A more efficient method for "at least one" is using the complement rule: P(at least 1)=1Fn=10.53=0.875P(\text{at least 1}) = 1 - F^n = 1 - 0.5^3 = 0.875.

A tactical example involves NATO forces in Syria using a missile with a hit probability of S=0.3S = 0.3 (F=0.7F = 0.7). To determine the number of missiles (nn) required to reach at least an 80%80\% probability of hitting the target, we use the formula 0.8=10.7n0.8 = 1 - 0.7^n. Solving for nn via logarithms: 0.7n=0.20.7^n = 0.2. This results in n=4.512n = 4.512, which implies that approximately 55 shots are necessary.

Statistics: Measures of Central Tendency and Dispersion

In statistics, the mean refers to the average, calculated as the sum of all scores divided by the number of values in the set (xˉ=xn\bar{x} = \frac{\sum x}{n}). The median is the middle number in a set when the values are arranged in ascending or descending order. For an odd number of data points, the location is n+12\frac{n+1}{2}. For an even number of data points, the median is the average of the two central numbers.

The mode is the value that occurs most frequently in a given set. A set can be unimodal (one mode), bimodal (two modes), or multimodal (more than two modes). The range measures dispersion and is defined as the difference between the largest value (Highest Value, HVHV) and the smallest value (Lowest Value, LVLV) in the set: R=HVLVR = HV - LV.

Statistical Case Study Analysis

Consider a dataset based on 100100 random samples with the following characteristics:

  • The sum of all 100100 measurements is 81.17081.170.
  • 4545 measurements occur between 0.7590.759 and 0.8000.800.
  • Smallest value (1st1^{\text{st}} measurement) is 0.7590.759.
  • Largest value (100th100^{\text{th}} measurement) is 0.8580.858.
  • Specific observations include: 0.8010.801 (observed three times), 0.8020.802 (once), 0.8030.803 (twice), and 0.8040.804 (four times).
  • High-end distribution: 4545 measurements are between 0.8050.805 and 0.8580.858.

For this dataset, the mean is calculated as 81.17100=0.8117\frac{81.17}{100} = 0.8117. To find the median for n=100n=100, the location involves the 50th50^{\text{th}} and 51st51^{\text{st}} terms. Counting from the bottom, the first 4545 terms are 0.800\le 0.800. The 46th46^{\text{th}}, 47th47^{\text{th}}, and 48th48^{\text{th}} terms are 0.8010.801. The 49th49^{\text{th}} term is 0.8020.802. The 50th50^{\text{th}} and 51st51^{\text{st}} terms are both 0.8030.803. Thus, the median is 0.8030.803. The mode is the most frequent value, which is 0.8040.804 as it was observed four times. The range is computed as 0.8580.759=0.0990.858 - 0.759 = 0.099.