Birnbaum's theorem

Birnbaum's theorem is a pivotal result in the foundations of statistics, formulated by the American statistician Allan Birnbaum in 1962. The theorem formally demonstrates that the likelihood principle is logically equivalent to the combination of two more widely accepted statistical principles: the sufficiency principle and the conditionality principle.
The publication of the theorem in the Journal of the American Statistical Association was a landmark event that sparked intense debate between frequentist and Bayesian statisticians, because many standard frequentist methods, such as significance tests and confidence intervals, depend on more than the likelihood function and therefore conflict with the likelihood principle.

Definitions and principles

Birnbaum's theorem concerns the "evidential meaning" of an experimental outcome, denoted $\operatorname{Ev}(E, x)$, where $E$ is the experiment and $x$ is the observed data.

Sufficiency principle (S)

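The sufficiency principle states that if $T$ is a sufficient statistic for a parameter $\theta$, then the evidential meaning of the data is the same as the evidential meaning of the statistic. Formally: $\operatorname{Ev}(E, x) = \operatorname{Ev}(E', T(x))$, where $E'$ denotes the experiment in which only the value $T(x)$ is recorded.
This principle is accepted by almost all statistical schools of thought.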

Conditionality principle (C)

The conditionality principle states that if an experiment is chosen by a random mechanism that does not depend on the parameter, then the evidence provided by the result depends only on the experiment actually performed. For example, if a researcher decides to perform either experiment $E_1$ or $E_2$ based on a fair coin toss, and $E_1$ is chosen, the evidence should not be affected by the fact that $E_2$ "could have" been performed.
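
The coin-toss example can be written compactly. The notation here is an assumption introduced for illustration: $E^{*}$ is the mixture experiment (toss the coin, then run the selected experiment), its outcome is the pair $(i, x_i)$ recording which experiment was run and what it produced, and $\operatorname{Ev}$ denotes evidential meaning. Under those conventions, the principle reads roughly as:

```latex
\operatorname{Ev}\bigl(E^{*}, (i, x_i)\bigr) = \operatorname{Ev}(E_i, x_i),
\qquad i \in \{1, 2\}.
```

In words: once it is known which experiment was actually run, the mixture adds nothing to the evidence.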

Likelihood principle (L)

The likelihood principle states that all the information about $\theta$ from an experiment is contained in the likelihood function. Two different experiments yielding the same likelihood function, up to a constant factor that does not depend on $\theta$, should result in the same inference about $\theta$.
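
A small numerical sketch may make this concrete. The data and function names below are hypothetical and chosen only for illustration: a coin is tossed and 9 heads and 3 tails are observed, either under a design with a fixed 12 tosses (binomial) or under a design that stops at the third tail (negative binomial). The two likelihood functions differ only by a constant factor, which is the situation the likelihood principle addresses.

```python
from math import comb

# Hypothetical data: 9 heads and 3 tails observed.

def binomial_likelihood(theta, n=12, heads=9):
    """Likelihood when the number of tosses n is fixed in advance."""
    return comb(n, heads) * theta**heads * (1 - theta)**(n - heads)

def negative_binomial_likelihood(theta, tails=3, heads=9):
    """Likelihood when tossing stops at the `tails`-th tail."""
    return comb(heads + tails - 1, heads) * theta**heads * (1 - theta)**tails

# The ratio of the two likelihoods should not depend on theta.
thetas = [0.1, 0.3, 0.5, 0.7, 0.9]
ratios = [binomial_likelihood(t) / negative_binomial_likelihood(t) for t in thetas]
print(ratios)  # each entry is 4 (up to floating-point rounding)
```

Because the ratio is constant in $\theta$, the likelihood principle treats the two outcomes as carrying the same evidence about $\theta$.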

The theorem

Birnbaum's theorem states:

The likelihood principle is equivalent to the conjunction of the sufficiency principle and the conditionality principle.

Symbolically: $(S \land C) \iff L$.
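
The less obvious direction, that sufficiency and conditionality together imply the likelihood principle, can be sketched as follows. This is an outline of the standard textbook argument, not a reproduction of Birnbaum's original wording, and the notation is an assumption: $E_1$ and $E_2$ are two experiments about the same parameter $\theta$ with densities $f_1$ and $f_2$, $x_1^{*}$ and $x_2^{*}$ are outcomes with proportional likelihoods, and $E^{*}$ is the mixture experiment that selects $E_1$ or $E_2$ by a fair coin toss.

```latex
\begin{align*}
&\text{Assume } f_1(x_1^{*};\theta) = c\, f_2(x_2^{*};\theta)
  \text{ for all } \theta, \text{ with } c > 0 \text{ free of } \theta. \\
&\text{Define } T(i, x_i) =
  \begin{cases}
    (1, x_1^{*}) & \text{if } (i, x_i) = (2, x_2^{*}), \\
    (i, x_i)     & \text{otherwise.}
  \end{cases} \\
&T \text{ is sufficient for } \theta \text{ in } E^{*}, \text{ and }
  T(1, x_1^{*}) = T(2, x_2^{*}). \\
&\text{By } S:\quad
  \operatorname{Ev}\bigl(E^{*}, (1, x_1^{*})\bigr)
  = \operatorname{Ev}\bigl(E^{*}, (2, x_2^{*})\bigr). \\
&\text{By } C:\quad
  \operatorname{Ev}\bigl(E^{*}, (i, x_i^{*})\bigr)
  = \operatorname{Ev}(E_i, x_i^{*}), \quad i = 1, 2. \\
&\text{Hence}\quad
  \operatorname{Ev}(E_1, x_1^{*}) = \operatorname{Ev}(E_2, x_2^{*}),
  \text{ which is } L.
\end{align*}
```

The step on which most later criticism focuses is the joint use of $S$ and $C$ on the same mixture experiment $E^{*}$.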

Significance

The theorem is considered a paradox by many frequentists. While $S$ and $C$ are viewed as intuitively obvious and "safe" principles of scientific practice, their logical consequence $L$ conflicts with most frequentist techniques. For instance, $L$ implies that the stopping rule of an experiment should not affect the final inference, a direct contradiction of how p-values are calculated.
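
The stopping-rule conflict can be illustrated with a short calculation. The scenario and function names are hypothetical: 9 heads and 3 tails are observed, and the null hypothesis of a fair coin is tested against a bias toward heads. The sketch computes the one-sided p-value under two stopping rules that give proportional likelihoods for these data.

```python
from math import comb

def p_value_fixed_n(heads=9, n=12, theta0=0.5):
    """P(at least `heads` heads in n tosses) under theta0 (fixed-n design)."""
    return sum(
        comb(n, k) * theta0**k * (1 - theta0)**(n - k)
        for k in range(heads, n + 1)
    )

def p_value_stop_at_tails(heads=9, tails=3, theta0=0.5):
    """P(at least `heads` heads before the `tails`-th tail) under theta0."""
    fewer_heads = sum(
        comb(k + tails - 1, k) * theta0**k * (1 - theta0)**tails
        for k in range(heads)
    )
    return 1 - fewer_heads

print(round(p_value_fixed_n(), 4))        # 0.073
print(round(p_value_stop_at_tails(), 4))  # 0.0327
```

At a 5% significance level the same 9 heads and 3 tails would be judged not significant under the fixed-n design but significant under the stop-at-third-tail design, even though the likelihood principle treats the two results as evidentially equivalent.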

Criticisms

Following Birnbaum's original paper, several statisticians challenged the proof.

Deborah Mayo argued in 2004 that Birnbaum's application of the conditionality principle was flawed because it assumes the existence of an "evidential" framework that may not be compatible with frequentist goals of error control.
Michael Evans and others have revisited the proof using different categorical frameworks, generally upholding Birnbaum's logic while noting that the "evidence" must be carefully defined to avoid mathematical trivialities.