Scheffé's method
In statistics, Scheffé's method, named after American statistician Henry Scheffé, is a method for adjusting significance levels in a linear regression analysis to account for multiple comparisons. It is particularly useful in analysis of variance, and in constructing simultaneous confidence bands for regressions involving basis functions.
Scheffé's method is a single-step multiple comparison procedure which applies to the set of estimates of all possible contrasts among the factor level means, not just the pairwise differences considered by the Tukey–Kramer method. It works on similar principles as the Working–Hotelling procedure for estimating mean responses in regression, which applies to the set of all possible factor levels.
The method
Let be the means of some variable in disjoint populations.An arbitrary contrast is defined by
where
If are all equal to each other, then all contrasts among them are. Otherwise, some contrasts differ from.
Technically there are infinitely many contrasts. The simultaneous confidence coefficient is exactly, whether the factor level sample sizes are equal or unequal.
We estimate by
for which the estimated variance is
where
- is the size of the sample taken from the th population, and
- is the estimated variance of the errors.
are simultaneously correct, where as usual is the size of the whole population. Norman R. Draper and Harry Smith, in their 'Applied Regression Analysis', indicate that should be in the equation in place of. The slip with is a result of failing to allow for the additional effect of the constant term in many regressions. That the result based on is wrong is readily seen by considering, as in a standard simple linear regression. That formula would then reduce to one with the usual -distribution, which is appropriate for predicting/estimating for a single value of the independent variable, not for constructing a confidence band for a range of values of the independent value. Also note that the formula is for dealing with the mean values for a range of independent values, not for comparing with individual values such as individual observed data values.