Well-behaved statistic: Difference between revisions

From formulasearchengine
Jump to navigation Jump to search
en>BattyBot
 
en>BDD
m multiple issues template
 
Line 1: Line 1:
In [[probability theory]] and [[statistics]], there are several relationships among [[probability distributions]]. These relations can be categorized in the following groups:
My name is Dominik (42 years old) and my hobbies are Taxidermy and Board sports.<br><br>My web site; [http://tinyurl.com/kecvhhb http://tinyurl.com/kecvhhb]
*One distribution is a special case of another with a broader parameter space
*Transforms (function of a random variable);
*Combinations (function of several variables);
*Approximation (limit) relationships;
*Compound relationships (useful for Bayesian inference);
*[[Duality (mathematics)|Duality]];
*[[Conjugate prior]]s.
 
[[File:Relationships among some of univariate probability distributions.jpg|thumb|700px|center|Relationships among some of univariate probability distributions are illustrated with connected lines. dashed lines means approximate relationship. more info:<ref>{{cite journal|last=LEEMIS|first=Lawrence M.|coauthors=Jacquelyn T. MCQUESTON|title=Univariate Distribution Relationships|journal=American Statistician|date=February 2008|volume=62|issue=1|pages=45–53|url=http://www.math.wm.edu/~leemis/2008amstat.pdf}}</ref>]]
 
==Special case of distribution parametrization==
 
* A [[binomial distribution|binomial]] (n, p) random variable with n  = 1, is a [[Bernoulli distribution|Bernoulli]] (p) random variable.
* A [[negative binomial distribution]] with r = 1 is a [[geometric distribution]].
* A [[gamma distribution]] with shape parameter α = 1 and scale parameter β is an [[exponential distribution|exponential]] (β) distribution.
* A [[gamma distribution|gamma]] (α, β) random variable with α = ν/2 and β = 2, is a [[chi-squared distribution|chi-squared]] random variable with ν [[degrees of freedom]].
* A [[chi-squared distribution]] with 2 degrees of freedom is an [[exponential distribution]] with mean 2 and vice versa.
* A [[Weibull distribution|Weibull]] (1, β) random variable is an [[exponential distribution|exponential]] random variable with mean β.
* A [[beta distribution|beta]] random variable with parameters α = β = 1 is a [[uniform distribution (continuous)|uniform]] random variable.
* A [[beta-binomial distribution|beta-binomial]] (n, 1, 1) random variable is a discrete uniform random variable over the values 0 ... n.
* A random variable with a [[Student's t-distribution|t distribution]] with one degree of freedom is a [[Cauchy distribution|Cauchy]](0,1) random variable.
 
== Transform of a variable ==
 
=== Multiple of a random variable ===
 
Multiplying the variable by any positive real constant yields a '''scaling''' of the original distribution.
Some are self-replicating, meaning that the scaling yields the same family of distributions, albeit with a different parameter:
[[Normal distribution]], [[Gamma distribution]], [[Cauchy distribution]], [[Exponential distribution]], [[Erlang distribution]], [[Weibull distribution]],  [[Logistic distribution]], [[Error distribution]], [[Power distribution]], [[Rayleigh distribution]].
 
'''Example: '''
* If X is a gamma random variable with parameters (r, &lambda;), then Y=aX is a gamma random variable with parameters (r, a&lambda;).
 
=== Linear function of a random variable ===
 
The affine transfom ''ax + b'' yields  a '''relocation and scaling''' of the original distribution. The following are self-replicating:
[[Normal distribution]], [[Cauchy distribution]], [[Logistic distribution]], [[Error distribution]], [[Power distribution]], [[Rayleigh distribution]].
 
'''Example: '''
* If Z is a normal random variable with parameters (μ=m, σ<sup>2</sup>=s<sup>2</sup>), then X=aZ+b is a normal random variable  with parameters (μ=am+b, σ<sup>2</sup>=a<sup>2</sup>s<sup>2</sup>).
 
=== Reciprocal of a random variable ===
 
The reciprocal ''1/X'' of a random variable ''X'', is a member of the same family of distribution as ''X'', in the following cases:
[[Cauchy distribution]], [[F distribution]], [[log logistic distribution]].
 
'''Examples: '''
* If X is a Cauchy (μ, σ) random variable, then 1/X is a Cauchy (μ/C, σ/C) random variable where C = μ<sup>2</sup> + σ<sup>2</sup>.
* If X is an F(ν<sub>1</sub>, ν<sub>2</sub>) random variable then 1/X is an F(ν<sub>2</sub>, ν<sub>1</sub>) random variable.
 
=== Other cases ===
Some distributions are invariant under a specific transformation.
 
'''Example: '''
* If X is a '''beta''' (α, β) random variable then (1 - X) is a '''beta''' (β, α) random variable.
* If X is a '''binomial''' (n, p) random variable then (n - X) is a ''binomial''' (n, 1-p) random variable.
 
* If ''X'' has [[cumulative distribution function]] ''F''<sub>''X''</sub>, then ''F''<sub>''X''</sub>(''X'') is a standard '''uniform''' (0,1) random variable
* If ''X'' is a '''normal''' (μ, σ<sup>2</sup>) random variable then ''e''<sup>''X''</sup> is a '''lognormal''' (μ, σ<sup>2</sup>) random variable.
:Conversely, if ''X'' is a lognormal (μ, σ<sup>2</sup>) random variable then log ''X'' is a normal (μ, σ<sup>2</sup>) random variable.
* If ''X'' is an '''exponential''' random variable with mean β, then ''X''<sup>1/γ</sup> is a '''Weibull''' (γ, β) random variable.
* The square of a '''standard normal''' random variable has a '''chi-squared''' distribution with one degree of freedom.
* If ''X'' is a '''[[Student’s t-distribution|Student’s t]]''' random variable with ν degree of freedom, then ''X''<sup>2</sup> is an '''''F''''' (1,ν) random variable.
* If ''X'' is a '''double exponential''' random variable with mean 0 and scale λ, then |''X''| is an '''exponential''' random variable with mean λ.
* A '''geometric''' random variable is the [[Floor and ceiling functions|floor]] of an '''exponential''' random variable.
* A '''[[rectangular distribution|rectangular]]''' random variable is the floor of a '''uniform''' random variable.
 
== Functions of several variables ==
 
=== Sum of variables ===
The distribution of the sum of independent random variables is called the [[Convolution of probability distributions|convolution]] of the primal distribution.
 
*If it has a distribution from the same family of distributions as the original variables, that family of distributions is said to be ''closed under convolution''.
 
Examples of such [[univariate distribution]]s are:
[[Normal distribution]], [[Poisson distribution]], [[Binomial distribution]] (with common success probability), , [[Negative binomial distribution]] (with common success probability), [[Gamma distribution]](with common rate parameter), [[Chi-squared distribution]], [[Cauchy distribution]], [[Hyper-exponential distribution]].
 
'''Examples:<ref>{{cite web|last=Cook|first=John D.|title=Diagram of distribution relationships|url=http://www.johndcook.com/distribution_chart.html}}</ref>'''{{citation needed|date=January 2013|reason=no proofs in Cook ref}}
**If X<sub>1</sub> and X<sub>2</sub> are '''Poisson''' random variables with means μ<sub>1</sub> and μ<sub>2</sub> respectively, then X<sub>1</sub> + X<sub>2</sub> is a '''Poisson''' random variable with mean μ<sub>1</sub> + μ<sub>2</sub>.
** The sum of '''gamma''' (''n''<sub>i</sub> , β) random variables has a '''gamma '''(&Sigma;''n''<sub>i</sub> , β) distribution.
**If X<sub>1</sub> is a '''Cauchy''' (μ<sub>1</sub>, σ<sub>1</sub>) random variable and X<sub>2</sub> is a Cauchy (μ<sub>2</sub>, σ<sub>2</sub>), then X<sub>1</sub> + X<sub>2</sub> is a '''Cauchy''' (μ<sub>1</sub> + μ<sub>2</sub>, σ<sub>1</sub> + σ<sub>2</sub>) random variable.
**If X<sub>1</sub> and X<sub>2</sub> are '''chi-squared''' random variables with ν<sub>1</sub> and ν<sub>2</sub> degrees of freedom respectively, then X<sub>1</sub> + X<sub>2</sub> is a chi-squared random variable with ν<sub>1</sub> + ν<sub>2</sub> degrees of freedom.
**If X<sub>1</sub> is a '''normal''' (μ<sub>1</sub>, σ<sub>1</sub><sup>2</sup>) random variable and X<sub>2</sub> is a normal (μ<sub>2</sub>, σ<sub>2</sub><sup>2</sup>) random variable, then X<sub>1</sub> + X<sub>2</sub> is a normal (μ<sub>1</sub> + μ<sub>2</sub>, σ<sub>1</sub><sup>2</sup> + σ<sub>2</sub><sup>2</sup>) random variable.
**The sum of N chi-squared (1) random variables has a chi-squared distribution with N degrees of freedom.
 
Other distributions are not closed under convolution, but their sum has a known distribution:
* The sum of ''n'' '''Bernoulli''' (p) random variables is a '''binomial''' (''n'', p) random variable.
* The sum of ''n'' '''geometric''' random variable with probability of success ''p'' is a '''negative binomial''' random variable with parameters ''n'' and ''p''.
* The sum of ''n'' '''exponential''' (β) random variables is a '''gamma''' (''n'', β) random variable.
*The sum of the squares of N '''standard normal''' random variables has a '''chi-squared''' distribution with N degrees of freedom.
 
=== Product of variables ===
 
The product of independent random variables ''X'' and ''Y'' may belong to the same family of distribution as ''X'' and ''Y'':
[[Bernoulli distribution]] and [[Log-normal distribution]].
 
'''Example: '''
* If X<sub>1</sub> and X<sub>2</sub> are independent '''log-normal''' random variables with parameters (μ<sub>1</sub>, σ<sub>1</sub><sup>2</sup>) and (μ<sub>2</sub>, σ<sub>2</sub><sup>2</sup>) respectively, then X<sub>1</sub> X<sub>2</sub> is a '''log-normal''' random variable with parameters (μ<sub>1</sub> + μ<sub>2</sub>, σ<sub>1</sub><sup>2</sup> + σ<sub>2</sub><sup>2</sup>).
 
=== Minimum and maximum of independent random variables ===
 
For some distributions, the '''minimum''' value of several independent random variables is a member of the same family, with different parameters:
[[Bernoulli distribution]], [[Geometric distribution]], [[Exponential distribution]], [[Extreme value distribution]], [[Pareto distribution]], [[Rayleigh distribution]], [[Weibull distribution]].
 
'''Examples: '''
* If X<sub>1</sub> and X<sub>2</sub> are independent '''geometric''' random variables with probability of success p<sub>1</sub> and p<sub>2</sub> respectively, then min(X<sub>1</sub>, X<sub>2</sub>) is a geometric random variable with probability of success p = p<sub>1</sub> + p<sub>2</sub> - p<sub>1</sub> p<sub>2</sub>. The relationship is simpler if expressed in terms probability of failure: q = q<sub>1</sub> q<sub>2</sub>.
* If X<sub>1</sub> and X<sub>2</sub> are independent '''exponential''' random variables with mean μ<sub>1</sub> and μ<sub>2</sub> respectively, then min(X<sub>1</sub>, X<sub>2</sub>) is an exponential random variable with mean μ<sub>1</sub> μ<sub>2</sub>/(μ<sub>1</sub> + μ<sub>2</sub>).
 
Similarly, distributions for which the '''maximum''' value of several independent random variables is a member of the same family of distribution include:
[[Bernoulli distribution]], [[Power distribution]].
 
=== Other ===
 
* If ''X'' and ''Y'' are independent '''standard normal''' random variables, ''X''/''Y'' is a '''Cauchy''' (0,1) random variable.
* If X<sub>1</sub> and X<sub>2</sub> are '''chi-squared''' random variables with ν<sub>1</sub> and ν<sub>2</sub> degrees of freedom respectively, then (X<sub>1</sub>/ν<sub>1</sub>)/(X<sub>2</sub>/ν<sub>2</sub>) is an '''''F'''''(ν<sub>1</sub>, ν<sub>2</sub>) random variable.
* If X is a '''standard normal''' random variable and U is a '''chi-squared''' random variable with ν degrees of freedom, then <math>\frac{X}{\sqrt{(U/\nu)}} </math> is a '''Student's ''t''''' (ν) random variable.
* If X<sub>1</sub> is '''gamma''' (α<sub>1</sub>, 1) random variable and X<sub>2</sub> is a gamma (α<sub>2</sub>, 1) random variable then X<sub>1</sub>/(X<sub>1</sub> + X<sub>2</sub>) is a '''beta'''(α<sub>1</sub>, α<sub>2</sub>) random variable. More generally, if X<sub>1</sub>is gamma(α<sub>1</sub>, β<sub>1</sub>) random variable and X<sub>2</sub> is gamma(α<sub>2</sub>, β<sub>2</sub>) random variable then β<sub>2</sub> X<sub>1</sub>/(β<sub>2</sub> X<sub>1</sub> + β<sub>1</sub> X<sub>2</sub>) is a beta(α<sub>1</sub>, α<sub>2</sub>) random variable.
* If ''X'' and ''Y'' are '''exponential''' random variables with mean μ, then ''X''-''Y'' is a '''[[Gumbel distribution|double exponential]]''' random variable with mean 0 and scale μ.
 
== Approximate (limit) relationships ==
 
Approximate or limit relationship means
*either that the combination of an infinite number of ''iid'' random variables tends to some distribution,
*or that the limit when a parameter tends to some value approaches to a different distribution.
 
'''Combination of ''iid'' random variables: '''
 
* Given certain conditions, the sum (hence the average) of a sufficiently large number of iid random variables, each with finite mean and variance, will be approximately normally distributed.(This is [[central limit theorem]] (CLT)).
 
'''Special case of distribution parametrization: '''
 
* X is a '''Hypergeometric''' (m, N, n) random variable. If ''n'' and ''m'' are large compared to ''N'', and ''p = m / N'' is not close to 0 or 1, then X approximately has a '''Binomial'''(n, p) Distribution.
* X is a '''beta-binomial''' random variable with parameters (n, α, β). Let p = α/(α + β) and suppose α + β is large, then X approximately has a '''binomial'''(n, p) distribution.
* If X is a '''binomial''' (n, p) random variable and if n is large and np is small then X approximately has a '''Poisson'''(np) distribution.
* If X is a '''negative binomial''' random variable with r large, P near 1, and r(1-P) = λ, then X approximately has a '''Poisson''' distribution with mean λ.
 
Consequences of the CLT:
* If X is a '''Poisson''' random variable with large mean, then for integers j and k, P(j ≤ X ≤ k) approximately equals to P(j - 1/2 ≤ Y ≤ k + 1/2) where Y is a '''normal''' distribution with the same mean and variance as X.
* If X is a '''binomial'''(n, p) random variable with large n and np, then for integers j and k, P(j ≤ X ≤ k) approximately equals to P(j - 1/2 ≤ Y ≤ k + 1/2) where Y is a '''normal''' random variable with the same mean and variance as X, i. e. np and np(1-p).
* If X is a '''beta''' random variable with parameters α and β equal and large, then X approximately has a '''normal''' distribution with the same mean and variance, i. e. mean α/(α + β) and variance αβ/((α+β)<sup>2</sup>(α + β + 1)).
* If X is a '''gamma'''(α, β) random variable and the shape parameter α is large relative to the scale parameter β, then X approximately has a '''normal''' random variable with the same mean and variance.
* If X is a '''Student's ''t''''' random variable with a large number of degrees of freedom ν then X approximately has a '''standard normal''' distribution.
* If X is an '''''F'''''(ν, ω) random variable with ω large, then ν X is approximately distributed As a '''chi-squared''' random variable with ν degrees of freedom.
 
== Compound (or Bayesian) relationships ==
 
When one or more parameter(s) of a distribution are random variables, the [[Compound probability distribution|compound]] distribution is the marginal distribution of the variable.
 
'''Examples: '''
 
* If X|N is a '''binomial''' (N,p) random variable, where parameter N is a random variable with negative-binomial (m, r) distribution, then X is distributed as a '''negative-binomial''' (m, r/(p+qr)).
* If X|N is a '''binomial''' (N,p) random variable, where parameter N is a random variable with Poisson (μ) distribution, then X is distributed as a '''Poisson''' (μp).
* If X|μ is a '''Poisson''' (μ) random variable and parameter μ is random variable with gamma (m, β) distribution, then X is distributed as a '''negative-binomial''' (m, μβ/(μ+β)), sometimes called [[Gamma-Poisson distribution]] if ''m'' is not integer.
 
Some distributions have been specially named as compounds:
[[Beta-Binomial distribution]], [[Beta-Pascal distribution]], [[Gamma-Normal distribution]].
 
'''Examples: '''
 
* If X is a Binomial (n,p) random variable, and parameter p is a random variable with beta (α, β) distribution, then X is distributed as a Beta-Binomial(α, β,n).
* If X is a negative-binomial (m,p) random variable, and parameter p is a random variable with beta (α, β) distribution, then X is distributed as a Beta-Pascal(α, β,m).
 
==See also==
 
*[[Central limit theorem]]
*[[Compound probability distribution]]
 
== References ==
 
{{Reflist}}
 
== External links ==
 
* Interactive graphic: [http://www.math.wm.edu/~leemis/chart/UDR/UDR.html Univariate Distribution Relationships]
 
{{Common univariate probability distributions}}
 
[[Category:Probability distributions]]

Latest revision as of 21:04, 29 December 2014

My name is Dominik (42 years old) and my hobbies are Taxidermy and Board sports.

My web site; http://tinyurl.com/kecvhhb