Visual Display of Data & Statistics

Visual Display of Data

Frequency Distribution

  • Definition: A frequency distribution is a method of organizing data to show how often each value or group of values occurs. Data is categorized into intervals, and the frequency (number of times each category appears) is recorded, moving data away from raw lists.

Types of Frequency Distribution

Tabular Form
  • Data is presented in a table showing categories (or intervals) and their corresponding frequencies.
  • Example: Test scores of 20 students
    • Score Range: 0–10, Frequency: 2
    • Score Range: 11–20, Frequency: 4
    • Score Range: 21–30, Frequency: 6
    • Score Range: 31–40, Frequency: 5
    • Score Range: 41–50, Frequency: 3
    • This table indicates, for instance, that 66 students scored between 2121 and 3030.
Graphical Form
  • Frequencies are displayed using visual representations.
Bar Chart
  • A bar chart graphically displays a frequency distribution using rectangular bars of equal width.
  • The height (or length) of each bar represents the frequency of a category or group.
  • Best suited for categorical data (e.g., favorite colors, fruits, movie genres).
  • Categories are placed along the x-axis, and frequencies are shown on the y-axis.
  • Bars are separated by gaps, unlike histograms.
  • Example: Favorite fruits of 10 students
    • Survey list: Apple, Orange, Banana, Apple, Mango, Apple, Banana, Mango, Apple, Orange
    • Frequency Table:
      • Fruit: Apple, Frequency: 4
      • Fruit: Orange, Frequency: 2
      • Fruit: Banana, Frequency: 2
      • Fruit: Mango, Frequency: 2
    • A bar chart would show four distinct bars for Apple (44), Orange (22), Banana (22), and Mango (22) with gaps between them.
Histogram
  • A histogram graphically displays a frequency distribution for numerical data.
  • It groups numbers into intervals (called bins or classes), and the bins touch each other to emphasize the continuous nature of the data.
  • The x-axis represents the intervals (e.g., score ranges).
  • The y-axis represents the frequency (the number of data points within each interval).
  • Bins are adjacent (no gaps).
  • How to construct bin size or range for a Histogram:
    1. Decide on the number of classes (bins):
      • For large datasets: 1010 to 2020 classes.
      • For small datasets: 44 to 66 classes.
      • Thumb rule: Number of bins =extnumberofobservations/extdesiredclasssize(atleast4)= ext{number of observations} / ext{desired class size (at least 4)}.
    2. Compute the width of each class:
      • extClasswidth=extRangeofdataextNumberofclassesfromStep1ext{Class width} = \frac{ ext{Range of data}}{ ext{Number of classes from Step 1}}
      • Always round this result up to a convenient number.
    3. Select lower limits:
      • Choose the smallest data value (or a convenient smaller value) as the lower limit of the first class.
      • Add multiples of the class width (from Step 2) to generate the lower limits of the remaining classes.
    4. Find upper class limits:
      • Rule: Upper limit == Lower limit ++ width smallest significant unit in the data (e.g., for whole numbers, subtract 11).
      • This prevents overlap of class intervals.
    5. Define class boundaries:
      • Take the midpoint between the upper limit of one class and the lower limit of the next class (e.g., apply a ext±0.5ext{±}0.5 rule for whole numbers).
      • This ensures intervals touch and all data values are included without ambiguity.
  • Example: Test scores of 20 students (Dataset: 12,25,33,41,27,38,45,21,29,19,10,30,22,36,40,14,18,32,47,2412, 25, 33, 41, 27, 38, 45, 21, 29, 19, 10, 30, 22, 36, 40, 14, 18, 32, 47, 24)
    • Step 1: Number of classes. There are n=20n = 20 observations. Choose 55 classes (since the data set is small).
    • Step 2: Class width.
      • Range =4710=37= 47 - 10 = 37.
      • Class width =375=7.4= \frac{37}{5} = 7.4. Round up to 88.
    • Step 3: Lower limits. Starting at the minimum value 1010, successive lower limits are: 10,18,26,34,4210, 18, 26, 34, 42.
    • Step 4: Upper class limits. Using the rule (lower limit ++ width 1− 1):
      • 10
        ightarrow 17 (10+81=1710 + 8 - 1 = 17)
      • 18
        ightarrow 25 (18+81=2518 + 8 - 1 = 25)
      • 26
        ightarrow 33 (26+81=3326 + 8 - 1 = 33)
      • 34
        ightarrow 41 (34+81=4134 + 8 - 1 = 41)
      • 42
        ightarrow 49 (42+81=4942 + 8 - 1 = 49)
    • Step 5: Class boundaries. Applying the ext±0.5ext{±}0.5 rule (halfway between limits):
      • [9.5,17.5)[9.5, 17.5) (data points 10,12,14,18,19,21,22,24,25,27,29,30,32,33,36,38,40,41,45,4710, 12, 14, 18, 19, 21, 22, 24, 25, 27, 29, 30, 32, 33, 36, 38, 40, 41, 45, 47)
      • [17.5,25.5)[17.5, 25.5) (e.g., 17.517.5 is the midpoint between 1717 and 1818)
      • [25.5,33.5)[25.5, 33.5)
      • [33.5,41.5)[33.5, 41.5)
      • [41.5,49.5)[41.5, 49.5)
    • Frequency Table:
      • Class Interval (limits): 10–17, Class Boundaries: 9.5–17.5, Frequency: 3
      • Class Interval (limits): 18–25, Class Boundaries: 17.5–25.5, Frequency: 6
      • Class Interval (limits): 26–33, Class Boundaries: 25.5–33.5, Frequency: 5
      • Class Interval (limits): 34–41, Class Boundaries: 33.5–41.5, Frequency: 4
      • Class Interval (limits): 42–49, Class Boundaries: 41.5–49.5, Frequency: 2
    • A histogram would show these bins with their corresponding frequencies, with no gaps between bars.
Scatter Plots
  • A scatter plot displays the relationship between two variables.
  • Each data point is represented as a dot on a coordinate plane, with the x-axis for one variable and the y-axis for the other.
  • The pattern of points reveals if the variables are related.
  • Uses of scatter plots:
    • To check for a positive relationship (one variable increases as the other increases).
    • To check for a negative relationship (one variable increases as the other decreases).
    • To detect no clear relationship (points scattered randomly).
    • To identify possible outliers (points significantly far from the rest).

Linear Regression

Bivariate Data

  • Definition: Bivariate data consists of two variables measured on the same individual, object, or event. It is used to analyze the relationship between these two variables, often through graphs, correlation, or regression.
  • If both variables are numerical, they are typically plotted on a scatter diagram to study their relationship (positive, negative, or no correlation).
  • If one variable depends on the other, linear regression is often used.
  • General Representation of Bivariate Data: For nn observations, data is represented as a set of ordered pairs: ext(x<em>1,y</em>1),(x<em>2,y</em>2),(x<em>3,y</em>3),,(x<em>n,y</em>n)ext{{(x<em>1, y</em>1), (x<em>2, y</em>2), (x<em>3, y</em>3), …, (x<em>n, y</em>n)}}. For i=1,2,,ni = 1, 2, …, n:
    • xix_i is the value of the first variable (independent variable).
    • yiy_i is the corresponding value of the second variable (dependent variable).
  • Tabular Representation: Can also be shown in a two-column table (Variable 1: x<em>ix<em>i, Variable 2: y</em>iy</em>i).
  • Example: Hours studied (x) and test score (y)
    • Hours Studied (x): 2, Test Score (y): 55
    • Hours Studied (x): 4, Test Score (y): 65
    • Hours Studied (x): 6, Test Score (y): 72
    • Hours Studied (x): 8, Test Score (y): 85
    • Hours Studied (x): 10, Test Score (y): 92
    • This data checks if increased study hours lead to increased test scores.

Regression and Linear Regression

  • Definition of Regression: A statistical method to study the relationship between a dependent variable and one or more independent variables. It helps predict the dependent variable's value based on independent variable values.
  • Definition of Linear Regression: The simplest form of regression, assuming a linear relationship between the dependent variable yy and the independent variable xx. It is modeled by the equation: y=mx+by = mx + b
    • mm is the slope of the line, indicating the rate of change of yy with respect to xx.
    • bb is the intercept, representing the value of yy when x=0x = 0.
  • Graphically, data points are plotted on a scatter plot, and the straight line y=mx+by = mx + b is drawn to best describe the data trend.
Examples of Linear Relationship
  • Example 1: Positive Linear Relationship
    • Dataset: ext(1,3),(2,5),(3,7),(4,9),(5,11)ext{{(1, 3), (2, 5), (3, 7), (4, 9), (5, 11)}}
    • Variables increase together.
    • Best-fit line: y=2x+1y = 2x + 1 (slope m=2m = 2, intercept b=1b = 1).
  • Example 2: Negative Linear Relationship
    • Dataset: ext(1,10),(2,8),(3,6),(4,4),(5,2)ext{{(1, 10), (2, 8), (3, 6), (4, 4), (5, 2)}}
    • As xx increases, yy decreases.
    • Best-fit line: y=2x+12y = -2x + 12 (slope m=2m = -2, intercept b=12b = 12).

Slope of a Line Through Two Points

  • If a straight line passes through two distinct points (x<em>1,y</em>1)(x<em>1, y</em>1) and (x<em>2,y</em>2)(x<em>2, y</em>2), its slope is given by:
    • m=y<em>2y</em>1x<em>2x</em>1m = \frac{y<em>2 - y</em>1}{x<em>2 - x</em>1} (where x<em>1x</em>2x<em>1 \neq x</em>2).
  • Once mm is found, the equation of the line can be written using the point-slope form:
    • yy<em>1=m(xx</em>1)y - y<em>1 = m(x - x</em>1)
  • This can be rearranged into the slope-intercept form:
    • y=mx+by = mx + b where b=y<em>1mx</em>1b = y<em>1 - mx</em>1.

Residuals in Linear Regression

  • Definition: A residual is the difference between the observed value of the dependent variable (y<em>iy<em>i) and the value predicted by the regression line (exty^</em>iext{\hat{y}}</em>i) for a data point (x<em>i,y</em>i)(x<em>i, y</em>i).
    • extResidual=y<em>iexty^</em>iext{Residual} = y<em>i - ext{\hat{y}}</em>i
  • Residuals measure how far each data point lies from the fitted line.
  • Graphical Representation: The residuals are the vertical distances from each observed data point to the regression line.
  • Note: When the line represents the linear regression line, the slope mm and the y-intercept bb are known as regression coefficients.
  • Example: Farmer's Fertilizer and Crop Yield
    • Data from 6 plots of land for fertilizer used (x, in kg) and crop yield (y, in quintals):
      • x: 2, y (Observed Yield): 40, exty^ext{\hat{y}} (Predicted Yield): 39.7, Residual ei=yexty^e_i = y - ext{\hat{y}}: 0.3
      • x: 4, y (Observed Yield): 55, exty^ext{\hat{y}} (Predicted Yield): 50.1, Residual ei=yexty^e_i = y - ext{\hat{y}}: 4.9
      • x: 6, y (Observed Yield): 65, exty^ext{\hat{y}} (Predicted Yield): 60.5, Residual ei=yexty^e_i = y - ext{\hat{y}}: 4.5
      • x: 8, y (Observed Yield): 70, exty^ext{\hat{y}} (Predicted Yield): 70.9, Residual ei=yexty^e_i = y - ext{\hat{y}}: -0.9
      • x: 10, y (Observed Yield): 85, exty^ext{\hat{y}} (Predicted Yield): 81.3, Residual ei=yexty^e_i = y - ext{\hat{y}}: 3.7
      • x: 12, y (Observed Yield): 95, exty^ext{\hat{y}} (Predicted Yield): 91.7, Residual ei=yexty^e_i = y - ext{\hat{y}}: 3.3
  • Note on Residual Squares:
    • For each data point (x<em>i,y</em>i)(x<em>i, y</em>i), the residual is e<em>i=y</em>iexty^ie<em>i = y</em>i - ext{\hat{y}}_i.
    • The residual square is e<em>i2=(y</em>iexty^i)2e<em>i^2 = (y</em>i - ext{\hat{y}}_i)^2.
    • The sum of these across all points gives the Residual Sum of Squares (R2): R2=ext<em>i=1ne</em>i2R^2 = ext{\sum}<em>{i=1}^n e</em>i^2.

Interpolation, Extrapolation, and Correlation

Interpolation and Extrapolation

  • Definition (Interpolation): The process of estimating or predicting the value of a dependent variable for an independent variable that lies within the range of the observed data points.
  • Definition (Extrapolation): The process of estimating or predicting the value of a dependent variable for an independent variable that lies outside the range of the observed data points.
  • Example: Farmer's Fertilizer and Crop Yield (using the fitted regression line y=5.29x+31.33y = 5.29x + 31.33)
    • Data:
      • x (Fertilizer, kg): 2, y (Observed Yield): 40
      • x (Fertilizer, kg): 4, y (Observed Yield): 55
      • x (Fertilizer, kg): 6, y (Observed Yield): 65
      • x (Fertilizer, kg): 8, y (Observed Yield): 70
      • x (Fertilizer, kg): 10, y (Observed Yield): 85
      • x (Fertilizer, kg): 12, y (Observed Yield): 95
    • The regression coefficients for the least squares fitted line are: mext5.29m ext{\approx} 5.29, bext31.33b ext{\approx} 31.33. The fitted line is y=5.29x+31.33y = 5.29x + 31.33.
    • Question 1: Interpolate the yield when the farmer uses x=7x = 7 kg of fertilizer.
      • y(7)=5.29(7)+31.33=37.03+31.33=68.36y(7) = 5.29(7) + 31.33 = 37.03 + 31.33 = 68.36
      • The required yield is 68.3668.36 quintals.
    • Question 2: Extrapolate the yield when the farmer uses x=15x = 15 kg of fertilizer.
      • y(15)=5.29(15)+31.33=79.35+31.33=110.68y(15) = 5.29(15) + 31.33 = 79.35 + 31.33 = 110.68
      • The required yield is 110.68110.68 quintals.
    • Question 3: If the farmer obtains a crop yield of y=90y = 90 quintals, estimate the amount of fertilizer used (x).
      • 90=5.29x+31.3390 = 5.29x + 31.33
      • 5.29x=9031.335.29x = 90 - 31.33
      • 5.29x=58.675.29x = 58.67
      • x=58.675.29ext11.09x = \frac{58.67}{5.29} ext{\approx} 11.09
      • The estimated amount of fertilizer used is approximately 11.0911.09 kg.
  • Note: Interpolation is generally more reliable than extrapolation because interpolation predicts values within the observed data range (where the trend is established), while extrapolation predicts values outside this range (where the trend may not hold).

Correlation and Correlation Coefficient

  • Definition (Correlation): A statistical measure describing the strength and direction of the linear relationship between two variables. It indicates whether an increase in one variable is consistently associated with an increase (positive), decrease (negative), or no consistent change (no correlation) in the other variable.
  • Definition (Correlation Coefficient): Denoted by ω\omega (or rr), it is a numerical value that quantifies the degree of linear correlation between two variables.
  • Properties:
    • The correlation coefficient ranges from 1-1 to 11: 1extextωext1-1 ext{\leq} ext{\omega} ext{\leq} 1.
    • ω=1\omega = 1: Perfect positive correlation.
    • ω=1\omega = -1: Perfect negative correlation.
    • ω=0\omega = 0: No linear correlation.
    • Generally, an absolute value less than 0.50.5 is considered too weak to suggest a meaningful correlation.
  • Rule of Thumb for Strength of Correlation:
    • Weak: |\omega| < 0.5
    • Moderate: 0.5 ext{\leq} |\omega| < 0.7
    • Strong: ωext0.7|\omega| ext{\geq} 0.7
    • This can be visualized on a scale from 1-1 (Perfect Negative) through 00 (No Correlation) to 11 (Perfect Positive).
  • Graphical Representation of Correlation Coefficients: Scatter plots can visually depict strong positive (\omega \text{\approx} 0.9), weak positive (\omega \text{\approx} 0.3), strong negative (\omega \text{\approx} -0.9), and no correlation (\omega \text{\approx} 0).
  • Relation Between Correlation Coefficient and Slope:
    • The slope mm of the regression line and the correlation coefficient ω\omega are related in terms of sign only:
    • If \omega > 0, then the slope m > 0 (the line rises from left to right).
    • If \omega < 0, then the slope m < 0 (the line falls from left to right).
    • If ω=0\omega = 0, then the slope m \text{\approx} 0 (no linear relationship).
    • Important: The magnitude (value) of the slope is not related to the value of the correlation coefficient. Correlation measures the strength of linear association, while slope measures the rate of change.
  • Correlation vs. Causation:
    • Correlation measures the strength and direction of a linear relationship.
    • Causation means that changes in one variable directly cause changes in the other.
    • Correlation does not imply causation.
    • Examples:
      • Ice cream sales and drowning incidents may be positively correlated, but both are caused by hot summer weather, not a direct causal link between ice cream and drowning.
      • Shoe size and reading ability in children may be correlated, but the common underlying cause is age.
    • Therefore, while correlation and regression suggest patterns, they should not be interpreted as proof of cause-and-effect without further evidence.

Outliers

  • Definition (Outlier): A data point that lies significantly far from the overall pattern of the data.
  • Outliers can result from unusual conditions, measurement errors, or genuinely rare events, and they can substantially affect correlation and regression analysis.
  • Example (Car Age vs. Resale Value):
    • x (Car Age, years): 1, y (Resale Value, \$1000): 25
    • x (Car Age, years): 2, y (Resale Value, \$1000): 22
    • x (Car Age, years): 3, y (Resale Value, \$1000): 20
    • x (Car Age, years): 4, y (Resale Value, \$1000): 18
    • x (Car Age, years): 5, y (Resale Value, \$1000): 15
    • x (Car Age, years): 6, y (Resale Value, \$1000): 13
    • x (Car Age, years): 7, y (Resale Value, \$1000): 11
    • x (Car Age, years): 8, y (Resale Value, \$1000): 50 (Outlier)
    • Here, the 8-year-old car's resale value of \$50,000 is unusually high compared to the general decreasing trend, suggesting it might be an outlier (e.g., a rare vintage model).

Exponential, Logarithms, and Half-Life

Exponential Function

  • Definition: An exponential function is of the form y=axy = a^x, where the base aa is a positive real number (a > 0, a \neq 1) and the exponent xx is a real number.
  • Types of Exponential Functions:
    • Exponential Growth: If a > 1, the function y=axy = a^x increases rapidly as xx increases.
    • Exponential Decay: If 0 < a < 1, the function y=axy = a^x decreases rapidly as xx increases.
    • Note: If a=1a = 1, the function becomes y=1x=1y = 1^x = 1, which is a constant function (a horizontal line at y=1y = 1) and therefore not considered exponential growth or decay.
  • Real-World Examples of Exponential Functions:
    • Finance: Calculating compound interest over time.
    • Biology: Studying population growth of bacteria or viruses.
    • Chemistry: Measuring acidity levels using the pH scale.
    • Physics: Analyzing radioactive decay of unstable elements.
  • Laws of Exponents:
    1. ax+y=axaya^{x+y} = a^x \cdot a^y
    2. axy=axay=axaya^{x-y} = a^x \cdot a^{-y} = \frac{a^x}{a^y}
    3. (ax)y=axy(a^x)^y = a^{xy}
    4. axbx=(ab)xa^x b^x = (ab)^x
    5. a0=1a^0 = 1 (provided a0a \neq 0)
    6. ax=1axa^{-x} = \frac{1}{a^x}
  • Note: The general form of an exponential function is y=baxy = b \cdot a^x, where b > 0 is the initial value and a > 0, a \neq 1 is the base.
  • Examples:
    • Population growth: P(t)=P0ertP(t) = P_0 \cdot e^{rt}, where r > 0 is the growth constant.
    • Radioactive decay: N(t)=N0eωtN(t) = N_0 \cdot e^{-\omega t}, where \omega > 0 is the decay constant.

Logarithm

  • Definition: The logarithm of a number xx to the base aa (with a > 0, a \neq 1) is the exponent yy such that ay=xa^y = x. It is written as logax=y\log_a x = y.
  • Note: The logarithm is the inverse of the exponential function. That is, if y=axy = a^x, then x=logayx = \log_a y.
  • Laws of Logarithms:
    1. log<em>a(xy)=log</em>ax+logay\log<em>a(xy) = \log</em>a x + \log_a y
    2. log<em>a(xy)=log</em>axlogay\log<em>a(\frac{x}{y}) = \log</em>a x - \log_a y
    3. log<em>a(xk)=klog</em>ax\log<em>a(x^k) = k \log</em>a x (also valid if kk is a variable)
    4. loga(a)=1\log_a(a) = 1
    5. log<em>ax=log</em>10xlog10a\log<em>a x = \frac{\log</em>{10} x}{\log_{10} a} (change of base formula, base 1010)
    6. logax=lnxlna\log_a x = \frac{\ln x}{\ln a} (change of base formula, base ee)
  • Inverse Function Property: If f(x)f(x) and g(x)g(x) are inverse functions, then f(g(x))=g(f(x))=xf(g(x)) = g(f(x)) = x.
    • For example, let f(x)=axf(x) = a^x and g(x)=log<em>axg(x) = \log<em>a x. Then, alog</em>ax=loga(ax)=xa^{\log</em>a x} = \log_a(a^x) = x.
  • Examples:
    1. Solve 52x=0.235^{2x} = 0.23 for xx.
      • Take the natural logarithm of both sides: ln(52x)=ln(0.23)\ln(5^{2x}) = \ln(0.23)
      • Apply logarithm law log<em>a(xk)=klog</em>ax\log<em>a(x^k) = k \log</em>a x: 2xln(5)=ln(0.23)2x \ln(5) = \ln(0.23)
      • Solve for xx: x=ln(0.23)2ln(5)=1.469721.60941.46973.21880.4566x = \frac{\ln(0.23)}{2 \ln(5)} = \frac{-1.4697}{2 \cdot 1.6094} \approx \frac{-1.4697}{3.2188} \approx -0.4566
    2. If log<em>ax=2.1\log<em>a x = 2.1 and log</em>ay=0.45\log</em>a y = 0.45, compute loga(x3y)\log_a(x^3y).
      • Apply logarithm law log<em>a(xy)=log</em>ax+log<em>ay\log<em>a(xy) = \log</em>a x + \log<em>a y: log</em>a(x3y)=log<em>a(x3)+log</em>ay\log</em>a(x^3y) = \log<em>a(x^3) + \log</em>a y
      • Apply logarithm law log<em>a(xk)=klog</em>ax\log<em>a(x^k) = k \log</em>a x: =3log<em>ax+log</em>ay= 3 \log<em>a x + \log</em>a y
      • Substitute given values: =3(2.1)+0.45=6.3+0.45=6.75= 3(2.1) + 0.45 = 6.3 + 0.45 = 6.75
    3. Solve log4(3x)=1.4\log_4(3x) = 1.4 for xx.
      • Convert to exponential form (logay=x    ax=y\log_a y = x \implies a^x = y): 3x=41.43x = 4^{1.4}
      • Calculate 41.44^{1.4}: 3x6.96443x \approx 6.9644
      • Solve for xx: x=6.964432.3215x = \frac{6.9644}{3} \approx 2.3215
    4. Simplify: log<em>28+log</em>24\log<em>2 8 + \log</em>2 4.
      • Method 1 (using log<em>a(xy)=log</em>ax+logay\log<em>a(xy) = \log</em>a x + \log_a y):
        • log<em>2(84)=log</em>2(32)\log<em>2(8 \cdot 4) = \log</em>2(32)
        • Since 25=322^5 = 32, then log2(32)=5\log_2(32) = 5.
      • Method 2 (evaluating each logarithm):
        • Since 23=82^3 = 8, log28=3\log_2 8 = 3.
        • Since 22=42^2 = 4, log24=2\log_2 4 = 2.
        • 3+2=53 + 2 = 5.

Half-Life and Doubling Time

Half-Life
  • Definition: The half-life (T1/2T_{1/2}) of a substance is the time required for its quantity to decrease to half of its initial value.
  • If the decay is exponential, the half-life is given by: T1/2=ln(2)ωT_{1/2} = \frac{\ln(2)}{\omega}, where \omega > 0 is the decay constant.
  • Example: The half-life of Carbon-14 is about 57305730 years, meaning that after 57305730 years, only half of the initial Carbon-14 atoms remain.
Doubling Time
  • Definition: The doubling time (TdT_d) is the time required for a quantity to double its initial value under exponential growth.
  • If the growth is exponential, the doubling time is given by: Td=ln(2)rT_d = \frac{\ln(2)}{r}, where r > 0 is the growth rate.
  • Example: If a population of bacteria doubles every 3030 minutes, its doubling time is Td=30T_d = 30 minutes.
Example: Drug Decay in the Bloodstream
  • Let C(t)C(t) be the amount of drug (in milligrams) at time tt (in days), and C<em>0C<em>0 be the initial amount. The decay is modeled by C(t)=C</em>0ektC(t) = C</em>0 e^{-kt}, where k > 0 is the decay constant.
    • (a) If the drug has a half-life of 10 days, what is the value of kk?
      • At half-life, C(T<em>1/2)=12C</em>0C(T<em>{1/2}) = \frac{1}{2} C</em>0.
      • 12C<em>0=C</em>0ek(10)\frac{1}{2} C<em>0 = C</em>0 e^{-k(10)}
      • 12=e10k\frac{1}{2} = e^{-10k}
      • Take natural logarithm: ln(12)=10k\ln(\frac{1}{2}) = -10k
      • ln(2)=10k- \ln(2) = -10k
      • k=ln(2)100.6931100.06931k = \frac{\ln(2)}{10} \approx \frac{0.6931}{10} \approx 0.06931
      • The decay constant kk is approximately 0.06931extdays10.06931 ext{ days}^{-1}.
    • (b) What percent of the administered amount of drug remains in the bloodstream after 4 hours?
      • First, convert 44 hours to days: 4exthours=424extdays=16extdays0.1667extdays4 ext{ hours} = \frac{4}{24} ext{ days} = \frac{1}{6} ext{ days} \approx 0.1667 ext{ days}.
      • Use the decay function: C(t)=C0ektC(t) = C_0 e^{-kt}.
      • C(16)=C0e(0.06931)(16)C(\frac{1}{6}) = C_0 e^{-(0.06931)(\frac{1}{6})}
      • C(16)=C0e0.01155C(\frac{1}{6}) = C_0 e^{-0.01155}
      • C(16)C0(0.9885)C(\frac{1}{6}) \approx C_0 (0.9885)
      • The percentage remaining is approximately 0.9885×100%=98.85%0.9885 \times 100\% = 98.85\%. Approximately 99%99\% remains.
Example: Oxygen Consumption of Salmon
  • Oxygen consumption of yearling salmon increases exponentially with swimming speed according to f(x)=100e0.6xf(x) = 100e^{0.6x}, where xx is speed in ft/s.
    • (a) What is the amount of oxygen consumption when the fish are not moving?
      • Not moving means x=0x = 0 ft/s.
      • f(0)=100e0.6(0)=100e0=100(1)=100f(0) = 100e^{0.6(0)} = 100e^0 = 100(1) = 100
      • Oxygen consumption is 100100 mg.
    • (b) What is the oxygen consumption at a speed of 2 ft/s?
      • f(2)=100e0.6(2)=100e1.2f(2) = 100e^{0.6(2)} = 100e^{1.2}
      • f(2)100(3.3201)332.01f(2) \approx 100(3.3201) \approx 332.01
      • Oxygen consumption is approximately 332.01332.01 mg.
    • (c) If a salmon is swimming at 2 ft/s, how much faster does it need to swim in order to double its oxygen consumption?
      • Current consumption at 22 ft/s is 332.01332.01 mg (from part b).
      • Double consumption would be 2×332.01=664.022 \times 332.01 = 664.02 mg.
      • Set f(x)=664.02f(x) = 664.02
      • 664.02=100e0.6x664.02 = 100e^{0.6x}
      • 6.6402=e0.6x6.6402 = e^{0.6x}
      • Take natural logarithm: ln(6.6402)=0.6x\ln(6.6402) = 0.6x
      • 1.89310.6x1.8931 \approx 0.6x
      • x=1.89310.63.155 ft/sx = \frac{1.8931}{0.6} \approx 3.155 \text{ ft/s}
      • The additional speed needed is 3.1552=1.1553.155 - 2 = 1.155 ft/s.

Allometric or Power Laws, Rescaling, and Log Plots

Power Law or Allometry

  • Definition: A power law function (or allometric function in biology) is of the form y=axky = ax^k, where:
    • a > 0 is a constant of proportionality.
    • kRk \in \mathbb{R} is the power (or scaling exponent).
    • x > 0 is the independent variable.
  • Note: yy is an allometric function of xx, meaning xx and yy are allometrically related.
  • Properties:
    • If k > 1, the function grows faster than linear (superlinear growth).
    • If 0 < k < 1, the function grows slower than linear (sublinear growth).
    • If k=1k = 1, the function reduces to a linear function y=axy = ax.
    • If k < 0, the function represents a decreasing relationship, such as inverse proportionality.
  • Real-World Examples of Power Laws (Allometry):
    • Biology (Allometry): Metabolic rate of animals often scales as body mass to the power of 3/43/4 (e.g., B=aM3/4B = aM^{3/4}).
    • Physics: Gravitational force follows an inverse-square law (F=Gm<em>1m</em>2r2F = G\frac{m<em>1 m</em>2}{r^2}).
    • Economics: Wealth distributions often follow a Pareto power law.
    • Engineering: Stress or fracture strength scaling with material size.
  • Example: Elephant Surface Area (Allometry)
    • Surface area (SS) of an African elephant's body is an allometric function of trunk length (LL) with an exponent of 0.740.74. So, S=aL0.74S = aL^{0.74}.
    • An elephant has a surface area of 200extft2200 ext{ ft}^2 and a trunk length of 6extft6 ext{ ft}.
    • Find aa: 200=a(6)0.74200 = a(6)^{0.74}
      • 200=a(3.765)200 = a(3.765)
      • a=2003.76553.12a = \frac{200}{3.765} \approx 53.12
    • So, the specific allometric equation is: S=53.12L0.74S = 53.12L^{0.74}.
    • What is the expected surface area of an elephant with a trunk length of 7 ft?
      • S(7)=53.12(7)0.74S(7) = 53.12(7)^{0.74}
      • S(7)=53.12(4.280)S(7) = 53.12(4.280) (since 70.744.2807^{0.74} \approx 4.280)
      • S(7)227.35S(7) \approx 227.35 (or approximately 227.16227.16 from original calculation if aa is kept more precise as 200/60.74200 / 6^{0.74})
      • The expected surface area is approximately 227.35extft2227.35 ext{ ft}^2.

Rescaling Data

  • Used for biological variables xx and yy.
  • Definition (Log-Log Graph): A graph where the horizontal axis is labeled as ln(x)\ln(x) and the vertical axis is labeled as ln(y)\ln(y).
  • Definition (Semi-Log Graph): A graph where the horizontal axis is labeled as xx and the vertical axis is labeled as ln(y)\ln(y).
  • Note: Rescaling data using log or semi-log axes is particularly useful for:
    • Exponential functions: They appear as straight lines on a semi-log plot.
    • Power-law (allometric) functions: They appear as straight lines on a log-log plot.
  • Transformation of functions by taking natural logarithm:
    • For an exponential function f(x)=abxf(x) = ab^x:
      • ln(f(x))=ln(abx)\ln(f(x)) = \ln(ab^x)
      • ln(f(x))=ln(a)+ln(bx)\ln(f(x)) = \ln(a) + \ln(b^x)
      • ln(f(x))=ln(a)+xln(b)\ln(f(x)) = \ln(a) + x\ln(b)
      • This is in the form Y=A+BxY = A + Bx (where Y=ln(f(x))Y = \ln(f(x)), A=ln(a)A = \ln(a), B=ln(b)B = \ln(b)), which is a linear equation with respect to xx and ln(f(x))\ln(f(x)). So it's linear on a semi-log plot.
    • For a power-law function g(x)=cxkg(x) = cx^k:
      • ln(g(x))=ln(cxk)\ln(g(x)) = \ln(cx^k)
      • ln(g(x))=ln(c)+ln(xk)\ln(g(x)) = \ln(c) + \ln(x^k)
      • ln(g(x))=ln(c)+kln(x)\ln(g(x)) = \ln(c) + k\ln(x)
      • This is in the form Y=A+BXY = A + BX (where Y=ln(g(x))Y = \ln(g(x)), A=ln(c)A = \ln(c), B=kB = k, X=ln(x)X = \ln(x)), which is a linear equation with respect to ln(x)\ln(x) and ln(g(x))\ln(g(x)). So it's linear on a log-log plot.

Examples: Rescaling Data

Exponential Function (Semi-Log Plot)
  • Consider the function y=2e0.5xy = 2e^{0.5x} for x=0,1,2,3,4,5x = 0, 1, 2, 3, 4, 5.
  • Data values:
    • x: 0, y: 2.00
    • x: 1, y: 3.30
    • x: 2, y: 5.44
    • x: 3, y: 8.96
    • x: 4, y: 14.78
    • x: 5, y: 24.36
  • Observation: On ordinary axes, the curve rises exponentially. On a semi-log plot (x vs. ln(y)\ln(y)), the points will fall on a straight line.
Allometric Function (Log-Log Plot)
  • Consider the function y=3x0.75y = 3x^{0.75} for x=1,2,3,4,5,6x = 1, 2, 3, 4, 5, 6.
  • Data values:
    • x: 1, y: 3.00
    • x: 2, y: 5.04
    • x: 3, y: 6.84
    • x: 4, y: 8.48
    • x: 5, y: 9.96
    • x: 6, y: 11.34
  • Observation: On ordinary axes, the curve increases sublinearly. On a log-log plot (ln(x)\ln(x) vs. ln(y)\ln(y)), the points will fall on a straight line with a slope of 0.750.75.

Basic Descriptive Statistics

Types of Data

Ratio Scale
  • Definition: A measurement scale with a constant interval size and a true zero point (meaning the absence of the quantity).
  • Examples: Age, height, distance, weight. (e.g., 00 height means no height).
Interval Scale
  • Definition: A measurement scale with a constant interval size but no true zero point.
  • Examples: Temperature (Celsius/Fahrenheit), dates on a calendar, time on a watch. (e.g., 0extoextC0^ ext{o} ext{C} does not mean no temperature).
Ordinal Scale
  • Definition: Data can be ordered or ranked according to some measurement, but the intervals between ranks may not be equal or meaningful.
  • Examples: Education level (primary, secondary, post-secondary), income levels (low, middle, high), satisfaction ratings (poor, good, excellent).
Nominal Scale
  • Definition: Data is classified by an attribute or category rather than by a quantity measurement. Categories have no inherent order.
  • Examples: Grade scale (A, B, C, D), gender (Female, Male), blood group (A, B, AB, O), species (bird, mammal).
Continuous and Discrete Data
  • Continuous data: Can take any value within a given range (e.g., decimals).
    • Examples: Height (e.g.,1.75extm)(e.g., 1.75 ext{m}), temperature (e.g.,25.3extoextC)(e.g., 25.3^ ext{o} ext{C}).
  • Discrete data: Consists of distinct, separate values that can be counted (usually whole numbers).
    • Examples: Number of students in a classroom, number of cars in a parking lot, books on a shelf.

Tools used to describe and summarize data

Measures of Central Tendency
  • These summarize a data set by a single value point, typically representing the center of the data.
  • Arithmetic Mean (Mean):
    • Let \text{{x}1, x2, …, x_n}} denote a data set with nn data points. The arithmetic mean is defined as:
      • xˉ=x<em>1+x</em>2++x<em>nn=</em>i=1nxin\bar{x} = \frac{x<em>1 + x</em>2 + \dots + x<em>n}{n} = \frac{\sum</em>{i=1}^n x_i}{n}.
    • Example: For marks ext50,60,70,80,90ext{{50, 60, 70, 80, 90}}, the mean is:
      • xˉ=50+60+70+80+905=3505=70\bar{x} = \frac{50 + 60 + 70 + 80 + 90}{5} = \frac{350}{5} = 70.
  • Median:
    • The median is the middle value of an ordered dataset.
    • If the number of data points nn is odd, the median is the value at position (n+1)2\frac{(n+1)}{2}.
    • If nn is even, the median is the average of the values at positions n2\frac{n}{2} and (n2)+1(\frac{n}{2}) + 1.
    • Example (odd n): Dataset ext3,13,2,34,11,26,47ext{{3, 13, 2, 34, 11, 26, 47}} (n=7n=7).
      • Ordered set: ext2,3,11,13,26,34,47ext{{2, 3, 11, 13, 26, 34, 47}}.
      • Position: (7+1)2=4extth\frac{(7+1)}{2} = 4^ ext{th}. The median is 1313.
    • Example (even n): Dataset ext3,13,2,34,11,17,27,47ext{{3, 13, 2, 34, 11, 17, 27, 47}} (n=8n=8).
      • Ordered set: ext2,3,11,13,17,27,34,47ext{{2, 3, 11, 13, 17, 27, 34, 47}}.
      • Positions: 82=4extth\frac{8}{2} = 4^ ext{th} (1313) and (82)+1=5extth(\frac{8}{2})+1 = 5^ ext{th} (1717).
      • The median is the average: 13+172=302=15\frac{13+17}{2} = \frac{30}{2} = 15.
  • Difference between mean and median:
    • Example: Dataset ext0,0,0,1,1,2,10,10ext{{0, 0, 0, 1, 1, 2, 10, 10}}.
      • Mean: xˉ=0+0+0+1+1+2+10+108=248=3\bar{x} = \frac{0+0+0+1+1+2+10+10}{8} = \frac{24}{8} = 3.
      • Median: Ordered set is already given. n=8n=8 (even).
        • n2=4extth\frac{n}{2} = 4^ ext{th} position (value is 11).
        • (n2)+1=5extth(\frac{n}{2})+1 = 5^ ext{th} position (value is 11).
        • Median =1+12=1= \frac{1+1}{2} = 1.
      • In this case, the mean (33) is higher than the median (11) due to the presence of larger values (10,1010, 10) tugging the mean upwards, illustrating the mean's sensitivity to outliers/skewness.
  • Mode:
    • The mode is the value (or values) that occurs most frequently in a dataset.
    • A dataset can have:
      • One mode (unimodal): e.g., ext2,4,4,6,7ext{{2, 4, 4, 6, 7}} has mode 44.
      • Two modes (bimodal): e.g., ext1,2,2,3,3,4ext{{1, 2, 2, 3, 3, 4}} has modes 22 and 33.
      • More than two modes (multimodal): e.g., ext5,6,6,7,7,8,8ext{{5, 6, 6, 7, 7, 8, 8}} has modes 6,7,86, 7, 8.
      • No mode: If all values occur with the same frequency.
  • Midrange:
    • The midrange is the value halfway between the minimum and maximum data values.
    • extMidrange=extMinimumvalue+extMaximumvalue2ext{Midrange} = \frac{ ext{Minimum value} + ext{Maximum value}}{2}.
    • Example: For dataset ext3,7,10,15,18ext{{3, 7, 10, 15, 18}}, Midrange =3+182=212=10.5= \frac{3+18}{2} = \frac{21}{2} = 10.5.
  • Geometric Mean (GM):
    • For a dataset of nn positive numbers \text{{x}1, x2, …, x_n}, the geometric mean is the nextthn^ ext{th} root of the product of those nn points.
    • GM=(x<em>1x</em>2x<em>n)1n=(</em>i=1nxi)1nGM = (x<em>1 \cdot x</em>2 \cdot \dots \cdot x<em>n)^{\frac{1}{n}} = (\prod</em>{i=1}^n x_i)^{\frac{1}{n}}.
    • Example: For dataset ext4,16ext{{4, 16}}, GM=(416)12=(64)12=8GM = (4 \cdot 16)^{\frac{1}{2}} = (64)^{\frac{1}{2}} = 8.
    • Example: For dataset ext2,8,18ext{{2, 8, 18}} (n=3n=3), GM=(2818)13=(288)136.60GM = (2 \cdot 8 \cdot 18)^{\frac{1}{3}} = (288)^{\frac{1}{3}} \approx 6.60.
  • Harmonic Mean (HM):
    • For a dataset of nn positive numbers \text{{x}1, x2, …, x_n}, the harmonic mean is the reciprocal of the arithmetic mean of the reciprocals of the data points.
    • HM=n1x<em>1+1x</em>2++1x<em>n=n</em>i=1n1xiHM = \frac{n}{\frac{1}{x<em>1} + \frac{1}{x</em>2} + \dots + \frac{1}{x<em>n}} = \frac{n}{\sum</em>{i=1}^n \frac{1}{x_i}}.
    • Example: For dataset ext4,8,16ext{{4, 8, 16}} (n=3n=3), HM=314+18+116=3416+216+116=3716=3167=4876.86HM = \frac{3}{\frac{1}{4} + \frac{1}{8} + \frac{1}{16}} = \frac{3}{\frac{4}{16} + \frac{2}{16} + \frac{1}{16}} = \frac{3}{\frac{7}{16}} = 3 \cdot \frac{16}{7} = \frac{48}{7} \approx 6.86.
Measures of Dispersion
  • These describe the spread of data points around the central tendency.
  • Range:
    • The range of a dataset is the difference between the maximum and minimum values.
    • extRange=extMaximumvalueextMinimumvalueext{Range} = ext{Maximum value} - ext{Minimum value}.
    • Example: For dataset ext5,8,12,20,25ext{{5, 8, 12, 20, 25}}, Range =255=20= 25 - 5 = 20.
  • Variance:
    • Measures the average of the squared deviations from the mean.
    • For a sample dataset, variance (s2s^2) is computed using (n1)(n-1) in the denominator (Bessel's correction) to provide an unbiased estimate of the population variance.
    • s2=1(n1)<em>i=1n(x</em>ixˉ)2s^2 = \frac{1}{(n-1)} \sum<em>{i=1}^n (x</em>i - \bar{x})^2.
    • Example: For dataset ext2,4,6ext{{2, 4, 6}} (n=3n=3):
      • Mean: xˉ=2+4+63=123=4\bar{x} = \frac{2+4+6}{3} = \frac{12}{3} = 4.
      • Deviations: (24)=2(2-4) = -2, (44)=0(4-4) = 0, (64)=2(6-4) = 2.
      • Squared deviations: (2)2=4(-2)^2 = 4, (0)2=0(0)^2 = 0, (2)2=4(2)^2 = 4.
      • Sum of squared deviations: 4+0+4=84 + 0 + 4 = 8.
      • Variance: s2=1(31)(8)=82=4s^2 = \frac{1}{(3-1)} (8) = \frac{8}{2} = 4.
  • Standard Deviation:
    • Indicates how much data values deviate, on average, from the mean. It is the square root of the variance.
    • s=1(n1)<em>i=1n(x</em>ixˉ)2s = \sqrt{\frac{1}{(n-1)} \sum<em>{i=1}^n (x</em>i - \bar{x})^2}.
    • Example: For dataset ext2,4,6ext{{2, 4, 6}}, with xˉ=4\bar{x} = 4 and s2=4s^2 = 4, the standard deviation is s=4=2s = \sqrt{4} = 2.
  • Coefficient of Variation (CV):
    • A standardized measure of dispersion that expresses the standard deviation as a percentage of the mean. It allows comparison of variability between datasets with different units or vastly different means.
    • CV=sxˉ×100%CV = \frac{s}{\bar{x}} \times 100\%, where ss is the standard deviation and xˉ\bar{x} is the arithmetic mean.
    • Example: For dataset ext2,4,6ext{{2, 4, 6}}, with xˉ=4\bar{x} = 4 and s=2s = 2, CV =24×100%=0.5×100%=50%= \frac{2}{4} \times 100\% = 0.5 \times 100\% = 50\%. (The incorrect calculation in the transcript example, resulting in 40%, seems to have used a population standard deviation or a different nn value, but the provided solution for this example s=2,xˉ=4s=2, \bar{x}=4 correctly yields 50%50\%).