The linear statistical model is defined as follows: Y<em>ij=μ+τ</em>i+ϵij, where i=1,2,⋯,a and j=1,2,⋯,n.
τi represents the treatment effects.
ϵij represents the errors.
Treatment effects and errors are independent random variables.
The model's structure is identical to the fixed-effects case, but the parameters have a different interpretation.
If the variance of the treatment effects τ<em>i is σ</em>τ2, then the variance strength of the response is
V(Y<em>ij)=σ</em>τ2+σ2
The variance components στ2 and σ2 are called variance components.
The model in Eq. (10.19) is called the components of variance model or the random-effects model.
To test hypotheses in the random-effects model:
The errors ϵij are normally and independently distributed with a mean of zero and variance σ2.
The treatment effects τ<em>i are normally and independently distributed with a mean of zero and variance σ</em>τ2.
The {τ<em>i} are independent random variables, which implies that the usual assumption of ∑</em>i=1nτi=0 from the fixed-effects model does not apply to the random-effects model.
Testing the hypothesis that the individual treatment effects are zero is meaningless for the random-effects model.
It is more appropriate to test hypotheses about στ2.
If στ2=0, all treatments are identical.
If {\sigma_\tau}^2 > 0, there is variability between treatments.
The ANOVA decomposition of total variability is still valid:
SST=SSTTreatments+SSE
The expected values of the mean squares for treatments and error are different than in the fixed-effects case.
In the random-effects model for a single-factor, completely randomized experiment:
E(MST<em>Treatments)=E(a−1SST</em>Treatments)=σ2+nστ2
E(MSE)=E(a(n−1)SSE)=σ2
Both MSE and MST<em>Treatments estimate σ2 when H</em>0:στ2=0 is true.
MSE and MSTTreatments are independent.
The ratio F<em>0=MSEMST</em>Treatments is an F random variable with a−1 and a(n−1) degrees of freedom when H0 is true.
The null hypothesis is rejected at the α-level of significance if the computed test statistic value is f0 > f{\alpha, a-1, a(n-1)}.
The computational procedure and construction of the ANOVA table for the random-effects model are identical to the fixed-effects case.
We usually want to estimate the variance components (σ2 and στ2) in the model.
The procedure to estimate σ2 and στ2 is called the analysis of variance method, because it uses the information in the ANOVA table.
It does not require the normality assumption on the observations.
The procedure consists of equating the expected mean squares to their observed values in the ANOVA table and solving for the variance components.
When equating observed and expected mean squares in the one-way classification random-effects model, we obtain:
MST<em>Treatments=σ2+nσ</em>τ2 & MSE=σ2
Therefore, the estimators of the ANOVA variance components are:
σ^2=MSE
σ<em>τ^2=nMST</em>Treatments−MSE
Sometimes, the analysis of variance method produces a negative estimate of a variance component when using the above equations.
In that case, we set it to zero, re-analyze the model experiment, or assume the linear model does not apply and modify it.
In textile manufacturing, the undertaking "Design and Analysis of Experiments" is used to study a single-factor experiment involving the random-effects model.
A textile manufacturing company weaves a fabric on a large number of looms.
The company is interested in loom-to-loom variability in tensile strength.
To investigate this variability, the engineer selects four looms (a=4 treatments) at random and makes four strength determinations (n=4 samples) on fabric samples chosen at random from each loom.
The data are shown in Table 10.7, and the ANOVA is summarized in Table 10.8.
Data used in the illustrative example of tensile strength is given in Table 10.7.
y<em>..=1527;N=16,y</em>i.=390,366,383&388
a=1,2,3,4: four treatments, n=4 (samples), N=a×n=4×4=16
This example depicts an experiment that is a completely randomized design.
Use the analysis of variance (ANOVA) to test the hypothesis that loom-to-loom mean tensile strength. Use α=0.01.
H<em>0:τ</em>1=τ<em>2=τ</em>3=τ<em>4=0 & H</em>1:τi=0 for at least one i
The sums of squares for the analysis of variance are computed from Eqs. (10.8), (10.9) & (10.10) as follows:
SST=∑<em>i=1a∑</em>j=1ny<em>ij2−Ny</em>..2=∑<em>i=14∑</em>j=14y<em>ij2−Ny</em>..2=∑<em>i=14y</em>i12+y<em>i22+y</em>i32+y<em>i42−Ny</em>..2
Sum up the squares for row by row or treatment.
SST=982+972+992+962+912+902+932+922+962+952+972+952+952+962+992+982−1615272
=38030+33494+36675+37646−1615272=145845−145733.063
SST=111.9375
y<em>i.=Totals=390,366,383,388 and a=4 SST</em>Treatments=∑<em>i=1any</em>i.2−Ny..2
Thus,
SST<em>Treatments=∑</em>i=14ny<em>i.2−Ny</em>..2=ny<em>1.2+y</em>2.2+y<em>3.2+y</em>4.2−Ny<em>..2=43902+3662+3832+3882−2415272 =4583289−1615272=89.1875 SST</em>Treatments=89.1875
From Eq. (10.10), we have SSE=SST−SSTTreatments
SSE=<atarget="blank"rel="noopenernoreferrernofollow"class="link"href="tel:111.9375"data−prevent−progress="true">111.9375</a>−89.1875=22.7500
From Eq. (10.7), we calculate the test statistic as F<em>0=SSE/a(n−1)SST</em>Treatments/a−1=MSEMST<em>Treatments F</em>0=4−189.1875=29.7291667≈29.73
MSE=a(n−1)SSE=4(4−1)22.7500=1.89583333=σ2
MSE=σ2=1.89583333
From Eq. (10.7) we calculate the test statistic as
f<em>0=MSEMST</em>Treatments=1.8958333329.7291667=15.6813187
We note that f0 is an F-distribn with a−1=4−1=3 and a(n−1)=4(4−1)=12 degrees of freedom.
Find f<em>0.01,3,12 from Table VI ~ α=0.01:f</em>α,a−1,a(n−1)=f0.01,3,12=5.95
Reject H<em>0 if f0 = 11.99 > f{\alpha, a-1, a(n-1)} = f{0.01,3,12} = 5.95
P-value for this test statistic (via software) is P = P(F_{3,12} > 15.6813187) = 1.8 \times 10^{-4}
Because P≃2.33361×10−4 is considerably smaller than α=0.01, we have strong evidence to conclude that H0 is not true.
With mean square error, MSE=σ2=1.8958333
σ^2=MSE, via Eq. (10.25) we have σ<em>τ^2=nMST</em>Treatments−MSE=429.7291667−1.89583333=6.95833334
στ^2≅6.96
From the analysis of variance, we conclude that the looms in the plant differ significantly in their ability to produce fabric of uniform strength.
The variance components are estimated via Eqs. (10.24) & (10.25) to be σ^2=MSE=1.8958⋯≅1.90 & στ^2≅6.96
Therefore, the variance of strength in the manufacturing process is estimated by Eq. (10.19) as V(Y<em>ij)=σ</em>τ2+σ2≈8.86
Most of the variability in strength in the output product is attributable to differences between looms.
ANOVA for the Tensile Strength of looms via software:
SST=<atarget="blank"rel="noopenernoreferrernofollow"class="link"href="tel:111.9375"data−prevent−progress="true">111.9375</a>,SSTTreatments=89.1875
MSTTreatments=29.7291667,MSE=1.89583333
(numerator) dF=a−1=3, (denominator) dF=a(n−1)=12, (total) dF=15
*Follow the definitions especially defining expressions (the equations).
*Complicated problems are what engineers solve.
*Most problems will follow examples on how to use the equations.
In experimental design problems, it is necessary to design the experiment so that the variability arising from a nuisance factor can be controlled.
The paired t-test is a procedure for comparing two treatment means when all experimental runs cannot be made under homogeneous conditions.
The paired t-test is viewed as a method for reducing the background noise in the experiment by blocking out a nuisance factor effect.
The randomized block design is an extension of the paired t-test to situations where the factor of interest has more than two levels; i.e., more than two treatments must be compared.
For example, suppose that three methods could be used to evaluate the strength readings on steel plate girders (Z-like structural elements).
These are considered as three treatments, say t<em>1,t</em>2, and t3.
If we use four girders as the experimental units, a randomized complete block design (RCBD) would appear as shown in Fig. 10.9.
The design is called a RCBD because:
Experiment data is recorded in a table, such as is shown in Table 10.9.
The observations in this table, say, yij, represent the response obtained when method i is used on girder j.
The general procedure for a RCBD consists of selecting b blocks and running a complete replicate of the experiment in each block.
The data that result from running a RCBD for investigating a single factor with a levels and b blocks are shown in Table 10.10.
There are a observations (one per factor level) in each block.
The order in which these observations are run is randomly assigned within the block.
The statistical analysis for the RCBD:
Suppose that a single factor with a levels is of interest and that the experiment is run in b blocks.
The observations are here represented by the linear statistical model: Y<em>ij=μ+τ</em>i+β<em>j+ϵ</em>ij, where i=1,2,⋯,a and j=1,2,⋯,b
μ is an overall mean.
τi is the effect of the ith treatment.
βj is the effect of the jth block.
ϵij is the random error term, which is assumed to be normally and independently distributed with mean zero and variance σ2.
Furthermore, the treatment and block effects are defined as deviations from the overall mean, so ∑<em>i=1aτ</em>i=0 and ∑<em>j=1bβ</em>j=0.
This was the same type of definition used for completely randomized experiments.
It is assumed that treatments and blocks do not interact.
The effect of treatment i is the same regardless of which block (or blocks) in which it is tested.
It is of interest to test the equality of the treatment effects.
H<em>0:τ</em>1=τ<em>2=⋯=τ</em>a
H<em>1:τ</em>i=0 for at least one
The analysis of variance can be extended to the RCBD.
The procedure uses a sum of squares identity that partitions the total sum of squares into three components.
ANOVA Sums of Squares Identity: Randomized Complete Block Experiment
The sum of squares identity for the randomized complete block design is:
∑<em>i=1a∑</em>j=1b(y<em>ij−yˉ</em>..)2=b∑<em>i=1a(yˉ</em>i.−yˉ<em>..)2+a∑</em>j=1b(yˉ<em>.j−yˉ</em>..)2+∑<em>i=1a∑</em>j=1b(y<em>ij−yˉ</em>.j−yˉ<em>i.+yˉ</em>..)2
SST=SST<em>Treatments+SSB</em>Blocks+SSE
The degrees of freedom corresponding to these sums of squares are:
ab−1=a−1+b−1+(a−1)(b−1)
For the randomized block design, the relevant mean squares are:
MST<em>Treatments=a−1SST</em>Treatments
MSB<em>Blocks=b−1SSB</em>Blocks
MSE=(a−1)(b−1)SSE
The expected values of these mean squares can be shown to be as follows:
E(MST<em>Treatments)=σ2+a−1b∑</em>i=1aτ<em>i2 E(MSB</em>Blocks)=σ2+a−1a∑<em>i=1bβ</em>j2
E(MSE)=σ2
If the null hypothesis H<em>0 is true, so that all treatment effects τ</em>i=0, MSTTreatments is an unbiased estimator of σ2.
If H<em>0 is false, MST</em>Treatments overestimates σ2.
The mean square for error is always an unbiased estimate of σ2.
To test the null hypothesis that the treatment effects are all zero, we use the ratio
F<em>0=MSEMST</em>Treatments
F0 has an F-distribn with a−1 and (a−1)(b−1) degrees of freedom if the null hypothesis is true.
Reject the null hypothesis at the α-level of significance if the computed value of the test statistic is f0 > f{\alpha, a-1, (a-1)(b-1)}.
Compute SST,SST<em>Treatments and SSB</em>Blocks to obtain the error sum of squares SSE by subtraction.
Computing Formulas for ANOVA: Randomized Block Experiment (RCBD)
SST=∑<em>i=1a∑</em>j=1by<em>ij2−aby</em>..2
SST<em>Treatments=b1∑</em>i=1ay<em>i.2−aby</em>..2
SSB<em>Blocks=a1∑</em>j=1by<em>j.2−aby</em>..2
SSE=SST−SST<em>Treatments−SSB</em>Blocks
The computations are usually arranged in an ANOVA table.
Computer software is used to perform the analysis of variance for a RCBD because it is easier at data handling.
An experiment was performed to determine the effect of four different chemicals on the strength of a fabric.
The chemicals are used as part of the permanent-press finishing process.
Five fabric samples were selected, and a RCBD was run by testing each chemical type once in random order on each fabric sample.
It is required to test for differences in means using an ANOVA with α=0.01 significance level.
Data analysis and interpretation:
y<em>..=39.2;yˉ</em>..=1.96
y<em>i.=5.7,8.8,6.9&17.8;yˉ</em>i.=1.14,1.76,1.38&3.56
y.j=9.2,10.1,3.5,8.8&7.6
yˉ.j=2.3,2.525,0.875,2.2&1.9
N=20,a=1,2,3,4: four treatments; b=5 (blocks, samples), N=a×b=4×5=20
dF:a−1=3 (numerator) & (a−1)(b−1)=12 (denominator)
This example depicts an experiment that is a completely randomized design.
Use the analysis of variance (ANOVA) to test the hypothesis on the means using α=0.01.
H<em>0:τ</em>1=τ<em>2=τ</em>3=τ<em>4=0 H</em>1:τi=0 for at least one i
The sums of squares for the analysis of variance are computed from:
SST=∑<em>i=14∑</em>j=15y<em>ij2−aby</em>..2
Thus,
SST=1.32+1.62+0.52+1.22+1.12+2.22+2.42+0.42+2.02+1.82+1.82+1.72+0.62+1.52+1.32+3.92+4.42+2.02+4.12+3.42−2039.22
=7.15+18.00+10.43+66.94−2039.22
SST=102.52−76.832=25.688
y<em>i.=Totals=5.7,8.8,6.9,17.8 SST</em>Treatments=∑<em>i=1aby</em>i.2−aby<em>..2 SST</em>Treatments=∑<em>i=14by</em>i.2−aby<em>..2=5y</em>1.2+y<em>2.2+y</em>3.2+y<em>4.2−aby</em>..2=55.72+8.82+6.92+17.82−2039.22
=5474.38−2039.22=18.044
SSTTreatments=18.044
With y.j=9.2,10.1,3.5,8.8&7.6
SSB<em>Blocks=∑</em>j=15ay<em>.j2−aby</em>..2=5y<em>.12+y</em>.22+y<em>.32+y</em>.42+y<em>.52−aby</em>..2=49.22+10.12+3.52+8.82+7.62−2039.22
=4334.10−2039.22=6.693
SSBBlocks=6.693
SSE=SST−SSB<em>Blocks−SST</em>Treatments=25.688−6.693−18.044=0.951
From Eq. (10.7) we calculate the test statistic:
F<em>0=SSE/(a−1)(b−1)SST</em>Treatments/a−1=MSEMST<em>Treatments F</em>0=4−118.044=(4−1)(5−1)0.951=75.89484753
Note that f0 is an F-distribn with a−1=4−1=3 and a−1b−1=(4−1)(5−1)=12 degrees of freedom
Find f<em>0.01,3,12 from Table VI ~ α=0.01:f</em>α,a−1,a(n−1)=f0.01,3,12=5.95
Reject H<em>0 if f0 = 75.89 > f{\alpha, a-1, a(n-1)} = f{0.01,3,12} = 5.95 where f<em>0 is the computed value of F</em>0 from Eq. (10.7).
Find a P-value for this test statistic:
P = P(F_{3,12} > 75.89) = 4.5 \times 10^{-8}
Because P≃4.5×10−8 (calculator) is considerably smaller than α=0.01, we conclude that there is a significant difference in the chemical types so far as their effect on strength is concerned.
MathML
MST<em>Treatments=a−1SST</em>Treatments=4−118.044=6.0147
MSE=(a−1)(b−1)SSE=120.951=0.07925
MSB<em>Blocks=b−1SSB</em>Blocks=5−16.693=1.67325
*Practice problems at the end of section 13.4 in the textbook.