# Natural exponential family

In probability and statistics, a natural exponential family (NEF) is a class of probability distributions that is a special case of an exponential family (EF). Every distribution possessing a moment-generating function is a member of a natural exponential family, and the use of such distributions simplifies the theory and computation of generalized linear models.

## Definition

### Probability distribution function (PDF) of the univariate case (scalar domain, scalar parameter)

The natural exponential family (NEF) is a subset of the exponential family. NEF is an exponential family in which the natural parameter η and the natural statistic T(x) are both the identity. A distribution in the exponential family with parameter θ can be written with probability density function (PDF)

${\displaystyle f_{X}(x|\theta )=h(x)\ \exp {\Big (}\ \eta (\theta )T(x)-A(\theta )\ {\Big )}\,\!,}$

where ${\displaystyle h(x)}$ and ${\displaystyle A(\theta )}$ are known functions. A distribution in the natural exponential family with parameter θ can thus be written with PDF

${\displaystyle f_{X}(x|\theta )=h(x)\ \exp {\Big (}\ \theta x-A(\theta )\ {\Big )}\,\!.}$

[Note that slightly different notation is used by the originator of the NEF, Carl Morris.[1] Morris uses ω instead of η and ψ instead of A.]

### Probability distribution function (PDF) of the general case (multivariate domain and/or parameter)

Suppose that ${\displaystyle \mathbf {x} \in {\mathcal {X}}\subseteq \mathbb {R} ^{p}}$, then a natural exponential family of order p has density or mass function of the form:

${\displaystyle f_{X}(\mathbf {x} |{\boldsymbol {\theta }})=h(\mathbf {x} )\ \exp {\Big (}{\boldsymbol {\theta }}^{\rm {T}}\mathbf {x} -A({\boldsymbol {\theta }})\ {\Big )}\,\!,}$

where in this case the parameter ${\displaystyle {\boldsymbol {\theta }}\in \mathbb {R} ^{p}.}$

### Moment and cumulant generating function

A member of a natural exponential family has moment generating function (MGF) of the form

${\displaystyle M_{X}(\mathbf {t} )=\exp {\Big (}\ A({\boldsymbol {\theta }}+\mathbf {t} )-A({\boldsymbol {\theta }})\ {\Big )}\,.}$

The cumulant generating function is by definition the logarithm of the MGF, so it is

${\displaystyle K_{X}(\mathbf {t} )=A({\boldsymbol {\theta }}+\mathbf {t} )-A({\boldsymbol {\theta }})\,.}$

## Examples

The five most important univariate cases are:

These five examples – Poisson, binomial, negative binomial, normal, and gamma – are a special subset of NEF, called NEF with quadratic variance function (NEF-QVF) because the variance can be written as a quadratic function of the mean. NEF-QVF are discussed below.

Distributions such as the exponential, chi-squared, Rayleigh, Weibull, Bernoulli, and geometric distributions are special cases of the above five distributions. Many common distributions are either NEF or can be related to the NEF. For example: the chi-squared distribution is a special case of the gamma distribution. The Bernoulli distribution is a binomial distribution with n = 1 trial. The exponential distribution is a gamma distribution with shape parameter α = 1 (or k = 1 ). The Rayleigh and Weibull distributions can each be written in terms of an exponential distribution.

Some exponential family distributions are not NEF. The lognormal and Beta distribution are in the exponential family, but not the natural exponential family.

The parameterization of most of the above distributions has been written differently from the parameterization commonly used in textbooks and the above linked pages. For example, the above parameterization differs from the parameterization in the linked article in the Poisson case. The two parameterizations are related by ${\displaystyle \theta =\log(\lambda )}$, where λ is the mean parameter, and so that the density may be written as

${\displaystyle f(k;\theta )={\frac {1}{k!}}\exp {\Big (}\ \theta \ k-\exp(\theta )\ {\Big )}\ ,}$
${\displaystyle h(k)={\frac {1}{k!}}\ }$, and ${\displaystyle A(\theta )=\exp(\theta )\ .}$

This alternative parameterization can greatly simplify calculations in mathematical statistics. For example, in Bayesian inference, a posterior probability distribution is calculated as the product of two distributions. Normally this calculation requires writing out the probability distribution functions (PDF) and integrating; with the above parameterization, however, that calculation can be avoided. Instead, relationships between distributions can be abstracted due to the properties of the NEF described below.

An example of the multivariate case is the multinomial distribution with known number of trials.

## Properties

The properties of the natural exponential family can be used to simplify calculations involving these distributions.

### Univariant case

1. The cumulants of an NEF can be calculated as derivatives of the NEF's cumulant generating function. The nth cumulant is the nth derivative with respect to θ of the cumulant generating function.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B= {{#invoke:Category handler|main}}{{#invoke:Category handler|main}}[citation needed] }} ${\displaystyle K_{X}(t)=A(\theta +t)-A(\theta )\,.}$ The first cumulant is ${\displaystyle K_{1}={\frac {d}{d\theta }}A(t)\,.}$ The mean is the first moment and always equal to the first cumulant, so ${\displaystyle \mu _{1}'=\kappa _{1}=\mathrm {E} [X]=K'_{X}(0)=A'(\theta )\,.}$ The variance is always the second moment, and it is always related to the first and second cumulants by ${\displaystyle \mathrm {Var} [X]=\mu _{2}'=\kappa _{2}+\kappa _{1}^{2}\,,}$ so that ${\displaystyle \mathrm {Var} [X]=K''_{X}(0)=A''(\theta )\,.}$ The nth cumulant is ${\displaystyle K_{n}={\frac {d^{(n)}}{d\theta ^{(n)}}}A(t)\,.}$ 2. Natural exponential families (NEF) are closed under convolution.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B= {{#invoke:Category handler|main}}{{#invoke:Category handler|main}}[citation needed] }}

Given independent identically distributed (iid) ${\displaystyle X_{1},\ldots ,X_{n}}$ with distribution from an NEF, then ${\displaystyle \sum _{i=1}^{n}X_{i}\,}$ is an NEF, although not necessarily the original NEF. This follows from the properties of the cumulant generating function.

3. The variance function for random variables with an NEF distribution can be written in terms of the mean.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B= {{#invoke:Category handler|main}}{{#invoke:Category handler|main}}[citation needed] }} ${\displaystyle Var(X)=V(\mu ).}$ 4. The first two moments of a NEF distribution uniquely specify the distribution within that family of distributions.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B= {{#invoke:Category handler|main}}{{#invoke:Category handler|main}}[citation needed] }}

${\displaystyle X\sim NEF[\mu ,V(\mu )].}$

### Properties of NEF-QVF

The properties of NEF-QVF can simplify calculations that use these distributions.

1. Natural exponential families with quadratic variance functions (NEF-QVF) are closed under convolutions of a linear transformation.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B= {{#invoke:Category handler|main}}{{#invoke:Category handler|main}}[citation needed] }} That is, a convolution of a linear transformation of an NEF-QVF is also an NEF-QVF, although not necessarily the original one. Given independent identically distributed (iid) ${\displaystyle X_{1},\ldots ,X_{n}}$ with distribution from a NEF-QVF. A convolution of a linear transformation of an NEF-QVF is also an NEF-QVF. Let ${\displaystyle Y=\sum _{i=1}^{n}(X_{i}-b)/c\,}$ be the convolution of a linear transformation of X. The mean of Y is ${\displaystyle \mu ^{*}=n(\mu -b)/c\,}$. The variance of Y can be written in terms of the variance function of the original NEF-QVF. If the original NEF-QVF had variance function ${\displaystyle Var(X)=V(\mu )=\nu _{0}+\nu _{1}\mu +\nu _{2}\mu ^{2},}$ then the new NEF-QVF has variance function ${\displaystyle Var(Y)=V^{*}(\mu ^{*})=\nu _{0}^{*}+\nu _{1}^{*}\mu +\nu _{2}^{*}\mu ^{2},}$ where ${\displaystyle \nu _{0}^{*}=nV(b)/c^{2}\,,}$ ${\displaystyle \nu _{1}^{*}=V'(b)/c\,,}$ ${\displaystyle \nu _{2}^{*}/n=\nu _{2}/n\,.}$ 2. Let ${\displaystyle X_{1}}$ and ${\displaystyle X_{2}}$ be independent NEF with the same parameter θ and let ${\displaystyle Y=X_{1}+X_{2}}$. Then the conditional distribution of ${\displaystyle X_{1}}$ given Y ${\displaystyle f(X_{1}|Y)}$ has quadratic variance in Y if and only if ${\displaystyle X_{1}}$ and ${\displaystyle X_{2}}$ are NEF-QVF. Examples of conditional distributions ${\displaystyle f(X_{1}|Y)}$ are the normal, binomial, beta, hypergeometric and geometric distributions, which are not all NEF-QVF.[1] 3. NEF-QVF have conjugate prior distributions on μ in the Pearson system of distributions (also called the Pearson distribution although the Pearson system of distributions is actually a family of distributions rather than a single distribution.) Examples of conjugate prior distributions of NEF-QVF distributions are the normal, gamma, reciprocal gamma, beta, F-, and t- distributions. Again, these conjugate priors are not all NEF-QVF.[1] 4. If ${\displaystyle X|\mu }$ has an NEF-QVF distribution and μ has a conjugate prior distribution then the marginal distributions are well-known distributions.[1] These properties together with the above notation can simplify calculations in mathematical statistics that would normally be done using complicated calculations and calculus. Template:One source {{ safesubst:#invoke:Unsubst||$N=Refimprove |date=__DATE__ |\$B= {{#invoke:Message box|ambox}} }}

## References

1. Morris C. (2006) "Natural exponential families", Encyclopedia of Statistical Sciences.
2. Morris C. (1982) "Natural exponential families with quadratic variance functions". Ann. Statist., 10(1), 65–80.
• Morris C. (1982) Natural exponential families with quadratic variance functions: statistical theory. Dept of mathematics, Institute of Statistics, University of Texas, Austin.
1. REDIRECT Template:Probability distributions