Chauvenet's criterion
In statistical theory, Chauvenet's criterion is a means of assessing whether one piece of experimental data from a set of observations is likely to be spurious – an outlier.
Derivation
The idea behind Chauvenet's criterion finds a probability band that reasonably contains all n samples of a data set, centred on the mean of a normal distribution. By doing this, any data point from the n samples that lies outside this probability band can be considered an outlier, removed from the data set, and a new mean and standard deviation based on the remaining values and new sample size can be calculated. This identification of the outliers will be achieved by finding the number of standard deviations that correspond to the bounds of the probability band around the mean and comparing that value to the absolute value of the difference between the suspected outliers and the mean divided by the sample standard deviation.where
- is the maximum allowable deviation,
- is the absolute value,
- is the value of suspected outlier,
- is sample mean, and
- is sample standard deviation.
where
- is the probability band centered on the sample mean and
- is the sample size.
where
- is probability represented by one tail of the normal distribution and
- = sample size.
where
- is the -score,
- is the sample value,
- is the mean of standard normal distribution, and
- is the standard deviation of standard normal distribution.
Calculation
To apply Chauvenet's criterion, first calculate the mean and standard deviation of the observed data. Based on how much the suspect datum differs from the mean, use the normal distribution function to determine the probability that a given data point will be at the value of the suspect data point. Multiply this probability by the number of data points taken. If the result is less than 0.5, the suspicious data point may be discarded, i.e., a reading may be rejected if the probability of obtaining the particular deviation from the mean is less than.Example
For instance, suppose a value is measured experimentally in several trials as 9, 10, 10, 10, 11, and 50, and we want to find out if 50 is an outlier.First, we find.
Then we find by plugging into the Quantile Function.
Then we find the z-score of 50.
From there we see that and can conclude that 50 is an outlier according to Chauvenet's Criterion.