Owning Palette: Analysis of Variance VIs
Requires: Full Development System
Takes an array of experimental observations made at various levels of three factors and performs a three-way analysis of variance.
Add to the block diagram | Find on the palette |
Levels is a cluster of three numeric values corresponding to number of levels in the A, B, and C factors, as well as the effects of the A, B, and C factors (fixed or random).
| |||||||||||||||
X contains all the observational data. | |||||||||||||||
Index A contains the level of factor A to which the corresponding observation belongs. | |||||||||||||||
Index B contains the level of factor B to which the corresponding observation belongs. | |||||||||||||||
Index C contains the level of factor C to which the corresponding observation belongs. | |||||||||||||||
observations per cell is the number of observations in each cell. It is the same for all cells. | |||||||||||||||
Info is an 8 by 4 matrix organized where the first column corresponds to the sums of squares associated with the respective factors (A, B, C), the respective interactions (AB, AC, BC, ABC), and residual error. The second column corresponds to the respective degrees of freedom. The third column corresponds to the respective mean squares. The fourth column corresponds to the respective F values. |
|||||||||||||||
Significance is a cluster of seven numerical values corresponding to the significance levels.
| |||||||||||||||
error returns any error or warning from the VI. You can wire error to the Error Cluster From Error Code VI to convert the error code or warning into an error cluster. |
In any ANOVA, you look for evidence that the factors or interactions among factors have a significant effect on experimental outcomes. What varies with each model is the method used to do this.
A factor is a random effect if it has a large population of levels about which you want to draw conclusions but such that you cannot sample from all levels. You thus pick levels at random and hope to generalize about all levels. A factor is a fixed effect if you can sample from all levels about which you want to draw conclusions.
Let xpqrs equation be the sth observation at the pth, qth, and rth levels of A, B, and C respectively, where s = 0, 1, ..., L – 1. Express each observation as the sum of eight components. Thus,
xpqrs = µ + p + q + r + ()pq + ()pr + ()qr + ()pqr + pqrs
where
Each of the following hypotheses is a different way of saying that a factor or an interaction among factors has no effect on experimental outcomes. This VI assumes that there are no effects and then seeks evidence to contradict these assumptions. The following are the seven hypotheses:
The 3D ANOVA VI makes the following assumptions:
In each of the models, the VI breaks up the total sum of squares, tss, a measure of the total variation of the data from the overall population mean, into a number of component sums of squares.
tss = ssa + ssb + ssc + ssab + ssac + ssbc + ssabc + sse
Each component in the sum tss is a measure of variation attributed to a certain factor or interaction among the factors. Here ssa is a measure of the variation due to factor A; ssb is a measure of the variation due to factor B; ssc is a measure of the variation due to factor c; ssab is a measure of the variation due to the interaction between factors A and B; and so on for ssac, ssbc, and ssabc. Also, sse is a measure of the variation due to random fluctuation. The VI divides each by its own degrees of freedom to obtain the corresponding averages msa, msb, msc, msab, msac, msbc, msabc, and mse. For example, if factor A has a strong effect on the experimental observations, then msa will be relatively large.
For each hypothesis, the VI computes number f that is used to calculate the associated sig probability. For example, for the hypothesis (A), that (p = 0 for all the levels p), (fixed A), the VI computes
then
sigA = Prob{Fa – 1, abc(L – 1) > fa}
where
Fa – 1, abc(L – 1)
is an F distribution with degrees of freedom a – 1 and abc(L – 1). You then can use the probabilities sigA, sigB, sigC, sigAB, …, sig ABC to determine when you should reject the associated hypotheses (A), (B), (C), (AB), …, (ABC).
How do you know when to reject the null hypothesis? For each hypothesis, you choose a level of significance. This level of significance is how likely you want it to be that you mistakenly reject the hypothesis (a common choice is 0.05). Compare your chosen level of significance with the associated sig probability output. If the sig probability is less than your chosen level of significance, you should reject the null hypothesis. For example, if A is a random effect, your level of significance is 0.05, and sigA = 0.03, you must reject the hypothesis that A2 = 0 and conclude that factor A has an effect on the experimental observations.
With some models there are no appropriate tests for certain hypotheses. If such is the case, the output parameters directly involved with the testing of these hypotheses are –1.0.
Let xpqrs be the sth observation at the pth, qth, and rth levels of A, B, and C respectively, where s = 0, 1, ..., L – 1.
Let
a = |A levels|
b = |B levels|
c = |C levels|
then