Relationships among probability distributions


In probability theory and statistics, there are several relationships among probability distributions. These relations can be categorized in the following groups:
  • One distribution is a special case of another with a broader parameter space
  • Transforms ;
  • Combinations ;
  • Approximation relationships;
  • Compound relationships ;
  • Duality;
  • Conjugate priors.

Special case of distribution parametrization

Transform of a variable

Multiple of a random variable

Multiplying the variable by any positive real constant yields a scaling of the original distribution.
Some are self-replicating, meaning that the scaling yields the same family of distributions, albeit with a different parameter:
normal distribution, gamma distribution, Cauchy distribution, exponential distribution, Erlang distribution, Weibull distribution, logistic distribution, error distribution, power-law distribution, Rayleigh distribution.
Example:
  • If X is a gamma random variable with shape and rate parameters, then Y = aX is a gamma random variable with parameters.
  • If X is a gamma random variable with shape and scale parameters, then Y = aX is a gamma random variable with parameters.

Linear function of a random variable

The affine transform ax + b yields a relocation and scaling of the original distribution. The following are self-replicating:
Normal distribution, Cauchy distribution, Logistic distribution, Error distribution, Power distribution, Rayleigh distribution.
Example:
  • If Z is a normal random variable with parameters, then X = aZ + b is a normal random variable with parameters.

Reciprocal of a random variable

The reciprocal 1/X of a random variable X, is a member of the same family of distribution as X, in the following cases:
Cauchy distribution, F distribution, log logistic distribution.
Examples:
  • If X is a Cauchy random variable, then 1/X is a Cauchy random variable where C = μ2 + σ2.
  • If X is an F random variable then 1/X is an F random variable.

Other cases

Some distributions are invariant under a specific transformation.
Example:
  • If X is a beta random variable then is a beta random variable.
  • If X is a binomial random variable then is a binomial random variable.
  • If X follows a continuous uniform distribution on, and FX is its cumulative distribution function, then the random variable U=FX follows a standard uniform distribution on .
Some distributions are variant under a specific transformation.
  • If X is a normal random variable then eX is a lognormal random variable.
  • If X is an exponential random variable with mean β, then X1/γ is a Weibull random variable.
  • The square of a standard normal random variable has a chi-squared distribution with one degree of freedom.
  • If X is a Student’s t random variable with ν degree of freedom, then X2 is an F random variable.
  • If X is a double exponential random variable with mean 0 and scale λ, then |X| is an exponential random variable with mean λ.
  • A geometric random variable is the floor of an exponential random variable.
  • A rectangular random variable is the floor of a uniform random variable.
  • A reciprocal random variable is the exponential of a uniform random variable.

Functions of several variables

Sum of variables

The distribution of the sum of independent random variables is the convolution of their distributions. Suppose is the sum of independent random variables each with probability mass functions. ThenIf it has a distribution from the same family of distributions as the original variables, that family of distributions is said to be closed under convolution. Often these distributions are also stable distributions.
Examples of such univariate distributions are: normal distributions, Poisson distributions, binomial distributions, negative binomial distributions, gamma distributions, chi-squared distributions, Cauchy distributions, hyperexponential distributions.
Examples:
  • *If X1 and X2 are Poisson random variables with means μ1 and μ2 respectively, then X1 + X2 is a Poisson random variable with mean μ1 + μ2.
  • * The sum of gamma random variables has a gamma distribution.
  • *If X1 is a Cauchy random variable and X2 is a Cauchy, then X1 + X2 is a Cauchy random variable.
  • *If X1 and X2 are chi-squared random variables with ν1 and ν2 degrees of freedom respectively, then X1 + X2 is a chi-squared random variable with ν1 + ν2 degrees of freedom.
  • *If X1 is a normal random variable and X2 is a normal random variable, then X1 + X2 is a normal random variable.
  • *The sum of N chi-squared random variables has a chi-squared distribution with N degrees of freedom.
Other distributions are not closed under convolution, but their sum has a known distribution:
  • The sum of n Bernoulli random variables is a binomial random variable.
  • The sum of n geometric random variables with probability of success p is a negative binomial random variable with parameters n and p.
  • The sum of n exponential random variables is a gamma random variable. Since n is an integer, the gamma distribution is also a Erlang distribution.
  • The sum of the squares of N standard normal random variables has a chi-squared distribution with N degrees of freedom.

Product of variables

The product of independent random variables X and Y may belong to the same family of distribution as X and Y: Bernoulli distribution and log-normal distribution.
Example:
  • If X1 and X2 are independent log-normal random variables with parameters and respectively, then X1 X2 is a log-normal random variable with parameters.

Minimum and maximum of independent random variables

For some distributions, the minimum value of several independent random variables is a member of the same family, with different parameters:
Bernoulli distribution, Geometric distribution, Exponential distribution, Extreme value distribution, Pareto distribution, Rayleigh distribution, Weibull distribution.
Examples:
  • If X1 and X2 are independent geometric random variables with probability of success p1 and p2 respectively, then min is a geometric random variable with probability of success p = p1 + p2p1 p2. The relationship is simpler if expressed in terms probability of failure: q = q1 q2.
  • If X1 and X2 are independent exponential random variables with rate μ1 and μ2 respectively, then min is an exponential random variable with rate μ = μ1 + μ2.
Similarly, distributions for which the maximum value of several independent random variables is a member of the same family of distribution include:
Bernoulli distribution, Power law distribution.

Other

  • If X and Y are independent standard normal random variables, X/''Y is a Cauchy random variable.
  • If X''1 and X2 are independent chi-squared random variables with ν1 and ν2 degrees of freedom respectively, then / is an F random variable.
  • If X is a standard normal random variable and U is an independent chi-squared random variable with ν degrees of freedom, then is a Student's t random variable.
  • If X1 is a gamma random variable and X2 is an independent gamma random variable then X1/ is a beta random variable. More generally, if X1 is a gamma random variable and X2 is an independent gamma random variable then β2 X1/ is a beta random variable.
  • If X and Y are independent exponential random variables with mean μ, then XY is a double exponential random variable with mean 0 and scale μ.
  • If Xi are independent Bernoulli random variables then their parity is a Bernoulli variable described by the piling-up lemma.

Approximate (limit) relationships

Approximate or limit relationship means
  • either that the combination of an infinite number of iid random variables tends to some distribution,
  • or that the limit when a parameter tends to some value approaches to a different distribution.
Combination of iid random variables:
Special case of distribution parametrization: X is a hypergeometric random variable. If n and m are large compared to N, and p = m/''N is not close to 0 or 1, then X'' approximately has a Binomial distribution.X is a beta-binomial random variable with parameters. Let p = α/ and suppose α + β is large, then X approximately has a binomial distribution.
  • If X is a binomial random variable and if n is large and np is small then X approximately has a Poisson distribution.
  • If X is a negative binomial random variable with r large, P near 1, and r = λ, then X approximately has a Poisson distribution with mean λ.
Consequences of the CLT:
  • If X is a Poisson random variable with large mean, then for integers j and k, P approximately equals to P where Y is a normal distribution with the same mean and variance as X.
  • If X is a binomial random variable with large np and n, then for integers j and k, P approximately equals to P where Y is a normal random variable with the same mean and variance as X, i.e. np and np.
  • If X is a beta random variable with parameters α and β equal and large, then X approximately has a normal distribution with the same mean and variance, i. e. mean α/ and variance αβ/2).
  • If X is a gamma random variable and the shape parameter α is large relative to the scale parameter β, then X approximately has a normal random variable with the same mean and variance.
  • If X is a Student's t random variable with a large number of degrees of freedom ν'' then X approximately has a standard normal distribution.
  • If X is an F random variable with ω'' large, then νX is approximately distributed as a chi-squared random variable with ν degrees of freedom.

Compound (or Bayesian) relationships

When one or more parameter of a distribution are random variables, the compound distribution is the marginal distribution of the variable.
Examples:
  • If X | N is a binomial random variable, where parameter N is a random variable with negative-binomial distribution, then X is distributed as a negative-binomial.
  • If X | N is a binomial random variable, where parameter N is a random variable with Poisson distribution, then X is distributed as a Poisson.
  • If X | μ is a Poisson random variable and parameter μ is random variable with gamma distribution, then X is distributed as a negative-binomial, sometimes called gamma-Poisson distribution.
Some distributions have been specially named as compounds:
beta-binomial distribution, Beta negative binomial distribution, gamma-normal distribution.
Examples: