Rademacher distribution
In probability theory and statistics, the Rademacher distribution is a discrete probability distribution where a random variate X has a 50% chance of being +1 and a 50% chance of being −1.
A series of Rademacher distributed variables can be regarded as a simple symmetrical random walk where the step size is 1.
Mathematical formulation
The probability mass function of this distribution isIn terms of the Dirac delta function, as
Bounds on sums of independent Rademacher variables
There are various results in probability theory around analyzing the sum of i.i.d. Rademacher variables, including concentration inequalities such as Bernstein inequalities as well as anti-concentration inequalities like Tomaszewski's conjecture.Concentration inequalities
Let be a set of random variables with a Rademacher distribution. Let be a sequence of real numbers. Thenwhere ||a||2 is the Euclidean norm of the sequence, t > 0 is a real number and Pr is the probability of event Z.
Let Y = Σ xiai and let Y be an almost surely convergent series in a Banach space. The for t > 0 and s ≥ 1 we have
for some constant c.
Let p be a positive real number. Then the Khintchine inequality says that
where c1 and c2 are constants dependent only on p.
For p ≥ 1,
Tomaszewski’s conjecture
In 1986, Bogusław Tomaszewski proposed a question about the distribution of the sum of independent Rademacher variables. A series of works on this question culminated in a proof in 2020 by Nathan Keller and Ohad Klein of the following conjecture.Conjecture. Let, where and the 's are independent Rademacher variables. Then
For example, when, one gets the following bound, first shown by Van Zuijlen.
The bound is sharp and better than that which can be derived from the normal distribution.
Applications
The Rademacher distribution has been used in bootstrapping. See Chapter 17 of Testing Statistical Hypotheses for example.The distribution is particularly useful in high-dimensional statistics.
The Rademacher distribution can be used to show that normally distributed and uncorrelated does not imply independent.
Random vectors with components sampled independently from the Rademacher distribution are useful for various stochastic approximations, for example:
- The Hutchinson trace estimator, which can be used to efficiently approximate the trace of a matrix of which the elements are not directly accessible, but rather implicitly defined via matrix-vector products.
- SPSA, a computationally cheap, derivative-free, stochastic gradient approximation, useful for numerical optimization.
Related distributions
- Bernoulli distribution: If X has a Rademacher distribution, then has a Bernoulli distribution.
- Laplace distribution: If X has a Rademacher distribution and Y ~ Exp is independent from X, then XY ~ Laplace.