# Bose–Einstein statistics

In quantum statistics, Bose–Einstein statistics (or more colloquially B–E statistics) is one of two possible ways in which a collection of non-interacting indistinguishable particles may occupy a set of available discrete energy states, at thermodynamic equilibrium. The aggregation of particles in the same state, which is a characteristic of particles obeying Bose–Einstein statistics, accounts for the cohesive streaming of laser light and the frictionless creeping of superfluid helium. The theory of this behaviour was developed (1924–25) by Satyendra Nath Bose, who recognized that a collection of identical and indistinguishable particles can be distributed in this way. The idea was later adopted and extended by Albert Einstein in collaboration with Bose.

The Bose–Einstein statistics apply only to those particles not limited to single occupancy of the same state—that is, particles that do not obey the Pauli exclusion principle restrictions. Such particles have integer values of spin and are named bosons, after the statistics that correctly describe their behaviour. There must also be no significant interaction between the particles.

## Concept

At low temperatures, bosons behave differently from fermions (which obey Fermi–Dirac statistics) in that an unlimited number of them can "condense" into the same energy state. This apparently unusual property also gives rise to a special state of matter, the Bose–Einstein condensate. Fermi–Dirac and Bose–Einstein statistics apply when quantum effects are important and the particles are "indistinguishable". Quantum effects appear if the concentration of particles satisfies

${\displaystyle {\frac {N}{V}}\geq n_{q}}$

where N is the number of particles, V is the volume, and ${\displaystyle n_{q}}$ is the quantum concentration, for which the interparticle distance is equal to the thermal de Broglie wavelength, so that the wavefunctions of the particles are barely overlapping. Fermi–Dirac statistics apply to fermions (particles that obey the Pauli exclusion principle), and Bose–Einstein statistics apply to bosons. As the quantum concentration depends on temperature, most systems at high temperatures obey the classical (Maxwell–Boltzmann) limit unless they have a very high density, as for a white dwarf. Both Fermi–Dirac and Bose–Einstein statistics reduce to Maxwell–Boltzmann statistics at high temperature or at low concentration.
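The criterion above can be checked numerically. The sketch below assumes the standard definition of the quantum concentration as the inverse cube of the thermal de Broglie wavelength, ${\displaystyle n_{q}=(mkT/2\pi \hbar ^{2})^{3/2}}$, which is not written out in the text; the helium-4 mass and the gas density are illustrative inputs, and the function names are hypothetical.

```python
import math

# Physical constants in SI units.
HBAR = 1.054571817e-34      # reduced Planck constant, J s
K_B = 1.380649e-23          # Boltzmann constant, J/K


def quantum_concentration(mass_kg: float, temperature_k: float) -> float:
    """n_q = (m k T / (2 pi hbar^2))^(3/2), in particles per cubic metre.

    Assumed definition: 1 / lambda^3, with lambda the thermal de Broglie
    wavelength h / sqrt(2 pi m k T).
    """
    return (mass_kg * K_B * temperature_k / (2.0 * math.pi * HBAR**2)) ** 1.5


def is_quantum_regime(number_density: float, mass_kg: float,
                      temperature_k: float) -> bool:
    """True when N/V >= n_q, the criterion quoted in the text."""
    return number_density >= quantum_concentration(mass_kg, temperature_k)


# Illustrative inputs (assumptions, not taken from the article).
M_HE4 = 6.6464731e-27       # mass of a helium-4 atom, kg
GAS_DENSITY = 2.5e25        # m^-3, roughly an ideal gas at 300 K and 1 atm

print(is_quantum_regime(GAS_DENSITY, M_HE4, 300.0))  # dilute, hot gas
print(is_quantum_regime(GAS_DENSITY, M_HE4, 1e-3))   # same density at 1 mK
```

Because ${\displaystyle n_{q}}$ grows as ${\displaystyle T^{3/2}}$, the same density that is classical at room temperature satisfies the quantum criterion once the gas is cold enough.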

B–E statistics was introduced for photons in 1924 by Bose and generalized to atoms by Einstein in 1924–25.

The expected number of particles in an energy state ${\displaystyle i}$ for B–E statistics is

${\displaystyle n_{i}(\varepsilon _{i})={\frac {g_{i}}{e^{(\varepsilon _{i}-\mu )/kT}-1}}}$

with ${\displaystyle \varepsilon _{i}>\mu }$, where ${\displaystyle n_{i}}$ is the number of particles in state i, ${\displaystyle g_{i}}$ is the degeneracy of state i, ${\displaystyle \varepsilon _{i}}$ is the energy of the ith state, μ is the chemical potential, k is the Boltzmann constant, and T is the absolute temperature. For comparison, the average number of fermions with energy ${\displaystyle \varepsilon _{i}}$ given by the Fermi–Dirac particle-energy distribution has a similar form,

${\displaystyle {\bar {n}}_{i}(\varepsilon _{i})={\frac {g_{i}}{e^{(\varepsilon _{i}-\mu )/kT}+1}}}$

B–E statistics reduces to the Rayleigh–Jeans Law distribution for ${\displaystyle kT\gg \varepsilon _{i}-\mu }$, namely ${\displaystyle n_{i}={\frac {g_{i}kT}{\varepsilon _{i}-\mu }}}$.
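The distributions quoted in this section are easy to compare numerically. The following is a minimal sketch, with hypothetical function names, in which energies are measured in units of kT; the Maxwell–Boltzmann form ${\displaystyle g_{i}e^{-(\varepsilon _{i}-\mu )/kT}}$ is the standard classical limit and is included only for comparison.

```python
import math


def bose_einstein(eps, mu, kT, g=1.0):
    """Mean occupation g / (exp((eps - mu)/kT) - 1); requires eps > mu."""
    if eps <= mu:
        raise ValueError("Bose-Einstein occupation requires eps > mu")
    # expm1 keeps precision when (eps - mu)/kT is small.
    return g / math.expm1((eps - mu) / kT)


def fermi_dirac(eps, mu, kT, g=1.0):
    """Mean occupation g / (exp((eps - mu)/kT) + 1)."""
    return g / (math.exp((eps - mu) / kT) + 1.0)


def maxwell_boltzmann(eps, mu, kT, g=1.0):
    """Classical limit g * exp(-(eps - mu)/kT)."""
    return g * math.exp(-(eps - mu) / kT)


def rayleigh_jeans(eps, mu, kT, g=1.0):
    """Limit of the Bose-Einstein form for kT >> eps - mu."""
    return g * kT / (eps - mu)


# Scan (eps - mu)/kT from the Rayleigh-Jeans regime to the classical tail.
for x in (0.01, 1.0, 10.0):
    print(x,
          bose_einstein(x, 0.0, 1.0),
          rayleigh_jeans(x, 0.0, 1.0),
          maxwell_boltzmann(x, 0.0, 1.0),
          fermi_dirac(x, 0.0, 1.0))
```

For small ${\displaystyle (\varepsilon _{i}-\mu )/kT}$ the Bose–Einstein value approaches the Rayleigh–Jeans value, while for large values all three statistics converge.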

## History

While presenting a lecture at the University of Dhaka on the theory of radiation and the ultraviolet catastrophe, Satyendra Nath Bose intended to show his students that the contemporary theory was inadequate, because it predicted results not in accordance with experimental results. During this lecture, Bose committed an error in applying the theory, which unexpectedly gave a prediction that agreed with the experiment. The error was a simple mistake, similar to arguing that flipping two fair coins will produce two heads one-third of the time, that would appear obviously wrong to anyone with a basic understanding of statistics (remarkably, this error resembled the famous blunder by d'Alembert known from his "Croix ou Pile" article). However, the results it predicted agreed with experiment, and Bose realized it might not be a mistake after all. For the first time, he took the position that the Maxwell–Boltzmann distribution would not be true for microscopic particles, where fluctuations due to Heisenberg's uncertainty principle will be significant. Thus he stressed the probability of finding particles in the phase space, each state having volume ${\displaystyle h^{3}}$, and discarded the distinct position and momentum of the particles.

Bose adapted this lecture into a short article called "Planck's Law and the Hypothesis of Light Quanta"[1][2] and submitted it to the Philosophical Magazine. However, the referee's report was negative, and the paper was rejected. Undaunted, he sent the manuscript to Albert Einstein requesting publication in the Zeitschrift für Physik. Einstein immediately agreed, personally translated the article into German (Bose had earlier translated Einstein's article on the theory of General Relativity from German to English), and saw to it that it was published. Bose's theory achieved respect when Einstein sent his own paper in support of Bose's to Zeitschrift für Physik, asking that they be published together. This was done in 1924.

The reason Bose produced accurate results was that, since photons are indistinguishable from each other, one cannot treat any two photons having equal energy as being two distinct identifiable photons. By analogy, if in an alternate universe coins were to behave like photons and other bosons, the probability of producing two heads would indeed be one-third, and so would the probability of getting one head and one tail, which equals one-half for conventional (classical, distinguishable) coins. Bose's "error" led to what is now called Bose–Einstein statistics.
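The coin analogy can be made concrete by listing the equally likely outcomes under each counting rule. This is only an illustrative sketch (the `probability` helper is hypothetical): distinguishable coins have four ordered outcomes, while "boson-like" coins have three unordered ones.

```python
from fractions import Fraction
from itertools import combinations_with_replacement, product

FACES = ("H", "T")

# Distinguishable coins: the ordered outcomes HH, HT, TH, TT are equally likely.
classical = list(product(FACES, repeat=2))

# Boson-like coins: only the unordered content matters, giving HH, HT, TT.
bosonic = list(combinations_with_replacement(FACES, 2))


def probability(states, target):
    """Fraction of equally weighted states whose content matches target."""
    hits = sum(1 for s in states if sorted(s) == sorted(target))
    return Fraction(hits, len(states))


print(probability(classical, "HH"))  # 1/4
print(probability(bosonic, "HH"))    # 1/3
print(probability(classical, "HT"))  # 1/2
print(probability(bosonic, "HT"))    # 1/3
```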

Bose and Einstein extended the idea to atoms, and this led to the prediction of a phenomenon that became known as the Bose–Einstein condensate, a dense collection of bosons (particles with integer spin, named after Bose), which was demonstrated to exist by experiment in 1995.

## Two derivations of the Bose–Einstein distribution

### Derivation from the grand canonical ensemble

The Bose–Einstein distribution, which applies only to a quantum system of non-interacting bosons, is easily derived from the grand canonical ensemble.[3] In this ensemble, the system is able to exchange energy and exchange particles with a reservoir (temperature T and chemical potential µ fixed by the reservoir).

Because the particles do not interact, each available single-particle level (with energy ϵ) forms a separate thermodynamic system in contact with the reservoir. In other words, each single-particle level is a separate, tiny grand canonical ensemble. With bosons there is no limit on the number of particles N in the level, but due to indistinguishability each possible N corresponds to only one microstate (with energy Nϵ). The resulting partition function for that single-particle level therefore forms a geometric series:

${\displaystyle {\begin{aligned}{\mathcal {Z}}&=\sum _{N=0}^{\infty }\exp(N(\mu -\epsilon )/k_{B}T)=\sum _{N=0}^{\infty }[\exp((\mu -\epsilon )/k_{B}T)]^{N}\\&={\frac {1}{1-\exp((\mu -\epsilon )/k_{B}T)}}\end{aligned}}}$

and the average particle number for that single-particle substate is given by

${\displaystyle \langle N\rangle =k_{B}T{\frac {1}{\mathcal {Z}}}\left({\frac {\partial {\mathcal {Z}}}{\partial \mu }}\right)_{V,T}={\frac {1}{\exp((\epsilon -\mu )/k_{B}T)-1}}}$

This result applies for each single-particle level and thus forms the Bose–Einstein distribution for the entire state of the system.[3] [4]

The variance in particle number (due to thermal fluctuations) may also be derived:

${\displaystyle \langle (\Delta N)^{2}\rangle =\langle N^{2}\rangle -\langle N\rangle ^{2}=k_{B}T\left({\frac {d\langle N\rangle }{d\mu }}\right)_{V,T}=\langle N\rangle +\langle N\rangle ^{2}}$

This level of fluctuation is much larger than for distinguishable particles, which would instead show Poisson statistics (${\displaystyle \langle (\Delta N)^{2}\rangle =\langle N\rangle }$). This is because the probability distribution for the number of bosons in a given energy level is a geometric distribution, not a Poisson distribution.
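The single-level results can be checked by summing the geometric series directly. The sketch below is an illustration under stated assumptions: the function name is hypothetical, the series is truncated at `n_max` terms (adequate when the ratio is well below 1), and the closed-form variance ${\displaystyle \langle N\rangle +\langle N\rangle ^{2}}$ is the standard variance of a geometric distribution.

```python
import math


def level_statistics(eps, mu, kT, n_max=400):
    """Partition function, mean and variance of N for one bosonic level,
    computed by brute-force summation over N = 0 .. n_max."""
    x = math.exp((mu - eps) / kT)   # ratio of the geometric series
    if not 0.0 < x < 1.0:
        raise ValueError("the series converges only for eps > mu")
    weights = [x**n for n in range(n_max + 1)]
    z = sum(weights)
    mean = sum(n * w for n, w in enumerate(weights)) / z
    second_moment = sum(n * n * w for n, w in enumerate(weights)) / z
    return z, mean, second_moment - mean**2


eps, mu, kT = 0.5, 0.0, 1.0
z, mean, var = level_statistics(eps, mu, kT)

# Closed forms to compare against.
z_closed = 1.0 / (1.0 - math.exp((mu - eps) / kT))
mean_closed = 1.0 / math.expm1((eps - mu) / kT)
var_closed = mean_closed + mean_closed**2

print(z, z_closed)
print(mean, mean_closed)
print(var, var_closed)   # larger than the Poisson value, which would equal the mean
```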

### Derivation in the canonical approach

It is also possible to derive approximate Bose–Einstein statistics in the canonical ensemble. These derivations are lengthy and only yield the above results in the asymptotic limit of a large number of particles. The reason is that the total number of bosons is fixed in the canonical ensemble. That contradicts the implication in Bose–Einstein statistics that each energy level is filled independently from the others (which would require the number of particles to be flexible).

Suppose we have a number of energy levels, labeled by index ${\displaystyle \displaystyle i}$, each level having energy ${\displaystyle \displaystyle \varepsilon _{i}}$ and containing a total of ${\displaystyle \displaystyle n_{i}}$ particles. Suppose each level contains ${\displaystyle \displaystyle g_{i}}$ distinct sublevels, all of which have the same energy, and which are distinguishable. For example, two particles may have different momenta, in which case they are distinguishable from each other, yet they can still have the same energy. The value of ${\displaystyle \displaystyle g_{i}}$ associated with level ${\displaystyle \displaystyle i}$ is called the "degeneracy" of that energy level. Any number of bosons can occupy the same sublevel.

Let ${\displaystyle \displaystyle w(n,g)}$ be the number of ways of distributing ${\displaystyle \displaystyle n}$ particles among the ${\displaystyle \displaystyle g}$ sublevels of an energy level. There is only one way of distributing ${\displaystyle \displaystyle n}$ particles with one sublevel, therefore ${\displaystyle \displaystyle w(n,1)=1}$. It is easy to see that there are ${\displaystyle \displaystyle (n+1)}$ ways of distributing ${\displaystyle \displaystyle n}$ particles in two sublevels which we will write as:

${\displaystyle w(n,2)={\frac {(n+1)!}{n!1!}}.}$

With a little thought (see Notes below) it can be seen that the number of ways of distributing ${\displaystyle \displaystyle n}$ particles in three sublevels is

${\displaystyle w(n,3)=w(n,2)+w(n-1,2)+\cdots +w(1,2)+w(0,2)}$

so that

${\displaystyle w(n,3)=\sum _{k=0}^{n}w(n-k,2)=\sum _{k=0}^{n}{\frac {(n-k+1)!}{(n-k)!1!}}={\frac {(n+2)!}{n!2!}}}$

where we have used the following theorem involving binomial coefficients:

${\displaystyle \sum _{k=0}^{n}{\frac {(k+a)!}{k!a!}}={\frac {(n+a+1)!}{n!(a+1)!}}.}$

Continuing this process, we can see that ${\displaystyle \displaystyle w(n,g)}$ is just a binomial coefficient (See Notes below)

${\displaystyle w(n,g)={\frac {(n+g-1)!}{n!(g-1)!}}.}$

For example, the population numbers for two particles in three sublevels are 200, 110, 101, 020, 011, or 002 for a total of six which equals 4!/(2!2!). The number of ways that a set of occupation numbers ${\displaystyle \displaystyle n_{i}}$ can be realized is the product of the ways that each individual energy level can be populated:

${\displaystyle W=\prod _{i}w(n_{i},g_{i})=\prod _{i}{\frac {(n_{i}+g_{i}-1)!}{n_{i}!(g_{i}-1)!}}\approx \prod _{i}{\frac {(n_{i}+g_{i})!}{n_{i}!(g_{i}-1)!}}}$

where the approximation assumes that ${\displaystyle n_{i}\gg 1}$.
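The counting formula for ${\displaystyle w(n,g)}$ and the product ${\displaystyle W}$ can be verified by brute force for small cases. This is a minimal sketch with hypothetical function names; it enumerates occupation-number tuples directly, so it is only practical for small n and g.

```python
from itertools import product
from math import comb, prod


def occupations(n, g):
    """All occupation-number tuples (n_1, ..., n_g) that sum to n."""
    return [occ for occ in product(range(n + 1), repeat=g) if sum(occ) == n]


def w(n, g):
    """Closed form (n + g - 1)! / (n! (g - 1)!)."""
    return comb(n + g - 1, n)


def total_ways(levels):
    """W = product of w(n_i, g_i) over levels given as (n_i, g_i) pairs."""
    return prod(w(n_i, g_i) for n_i, g_i in levels)


# Two particles in three sublevels: the six patterns listed in the text.
patterns = ["".join(map(str, occ)) for occ in occupations(2, 3)]
print(sorted(patterns, reverse=True))
print(len(patterns), w(2, 3))

# Example with two energy levels: (n=2, g=3) and (n=1, g=2).
print(total_ways([(2, 3), (1, 2)]))
```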

Following the same procedure used in deriving the Maxwell–Boltzmann statistics, we wish to find the set of ${\displaystyle \displaystyle n_{i}}$ for which W is maximised, subject to the constraint that there be a fixed total number of particles, and a fixed total energy. The maxima of ${\displaystyle \displaystyle W}$ and ${\displaystyle \displaystyle \ln(W)}$ occur at the same value of ${\displaystyle \displaystyle n_{i}}$ and, since it is easier to accomplish mathematically, we will maximise the latter function instead. We constrain our solution using Lagrange multipliers forming the function:

${\displaystyle f(n_{i})=\ln(W)+\alpha (N-\sum n_{i})+\beta (E-\sum n_{i}\varepsilon _{i})}$

Using the ${\displaystyle n_{i}\gg 1}$ approximation and using Stirling's approximation for the factorials ${\displaystyle \left(x!\approx x^{x}\,e^{-x}\,{\sqrt {2\pi x}}\right)}$ gives

${\displaystyle f(n_{i})=\sum _{i}\left[(n_{i}+g_{i})\ln(n_{i}+g_{i})-n_{i}\ln(n_{i})\right]+\alpha \left(N-\sum n_{i}\right)+\beta \left(E-\sum n_{i}\varepsilon _{i}\right)+K,}$

where K is the sum of a number of terms which are not functions of the ${\displaystyle n_{i}}$. Taking the derivative with respect to ${\displaystyle \displaystyle n_{i}}$, setting the result to zero, and solving for ${\displaystyle \displaystyle n_{i}}$ yields the Bose–Einstein population numbers:

${\displaystyle n_{i}={\frac {g_{i}}{e^{\alpha +\beta \varepsilon _{i}}-1}}.}$
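The differentiation step leading to these population numbers can be spelled out as follows. This is a sketch that keeps only the leading Stirling terms, so that each summand of ${\displaystyle f}$ is ${\displaystyle (n_{i}+g_{i})\ln(n_{i}+g_{i})-n_{i}\ln(n_{i})}$ and all terms independent of ${\displaystyle n_{i}}$ are absorbed in K.

```latex
\begin{aligned}
\frac{\partial f}{\partial n_i}
  &= \ln(n_i+g_i) + 1 - \ln(n_i) - 1 - \alpha - \beta\varepsilon_i \\
  &= \ln\frac{n_i+g_i}{n_i} - \alpha - \beta\varepsilon_i = 0 \\
\Longrightarrow\quad \frac{n_i+g_i}{n_i} &= e^{\alpha+\beta\varepsilon_i}
\quad\Longrightarrow\quad
n_i = \frac{g_i}{e^{\alpha+\beta\varepsilon_i}-1}
\end{aligned}
```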

By a process similar to that outlined in the Maxwell–Boltzmann statistics article, it can be seen that:

${\displaystyle d\ln W=\alpha \,dN+\beta \,dE}$

which, using Boltzmann's famous relationship ${\displaystyle S=k\,\ln W}$ becomes a statement of the second law of thermodynamics at constant volume, and it follows that ${\displaystyle \beta ={\frac {1}{kT}}}$ and ${\displaystyle \alpha =-{\frac {\mu }{kT}}}$ where S is the entropy, ${\displaystyle \mu }$ is the chemical potential, k is Boltzmann's constant and T is the temperature, so that finally:

${\displaystyle n_{i}={\frac {g_{i}}{e^{(\varepsilon _{i}-\mu )/kT}-1}}.}$

Note that the above formula is sometimes written:

${\displaystyle n_{i}={\frac {g_{i}}{e^{\varepsilon _{i}/kT}/z-1}},}$

where ${\displaystyle \displaystyle z=\exp(\mu /kT)}$ is the absolute activity, as noted by McQuarrie.[5]

Also note that when the particle numbers are not conserved, removing the conservation of particle numbers constraint is equivalent to setting ${\displaystyle \alpha }$ and therefore the chemical potential ${\displaystyle \mu }$ to zero. This will be the case for photons and massive particles in mutual equilibrium and the resulting distribution will be the Planck distribution.

A much simpler way to think of the Bose–Einstein distribution function is to consider that n particles are denoted by identical balls and g shells are marked by g-1 line partitions. It is clear that the permutations of these n balls and g-1 partitions will give different ways of arranging bosons in different energy levels. For example, for n = 3 particles and g = 3 shells, so that g-1 = 2, the arrangement might be |●●|●, or ||●●●, or |●|●●, etc. Hence the number of distinct permutations of n + (g-1) objects which have n identical items and (g-1) identical items will be:

${\displaystyle {\frac {(n+g-1)!}{n!(g-1)!}}}$

The purpose of these notes is to clarify some aspects of the derivation of the Bose–Einstein (B–E) distribution for beginners. The enumeration of cases (or ways) in the B–E distribution can be recast as follows. Consider a game of dice throwing in which there are ${\displaystyle \displaystyle n}$ dice, with each die taking values in the set ${\displaystyle \displaystyle \left\{1,\dots ,g\right\}}$, for ${\displaystyle g\geq 1}$. The constraints of the game are that the value of a die ${\displaystyle \displaystyle i}$, denoted by ${\displaystyle \displaystyle m_{i}}$, has to be greater than or equal to the value of die ${\displaystyle \displaystyle (i-1)}$, denoted by ${\displaystyle \displaystyle m_{i-1}}$, in the previous throw, i.e., ${\displaystyle m_{i}\geq m_{i-1}}$. Thus a valid sequence of die throws can be described by an n-tuple ${\displaystyle \displaystyle \left(m_{1},m_{2},\dots ,m_{n}\right)}$, such that ${\displaystyle m_{i}\geq m_{i-1}}$. Let ${\displaystyle \displaystyle S(n,g)}$ denote the set of these valid n-tuples:

${\displaystyle \displaystyle S(n,g)=\left\{\left(m_{1},m_{2},\dots ,m_{n}\right)\ {\Big |}\ m_{i}\in \left\{1,\dots ,g\right\},\ m_{i}\geq m_{i-1}\ {\rm {for}}\ i=2,\dots ,n\right\}}$

Then the quantity ${\displaystyle \displaystyle w(n,g)}$ (defined above as the number of ways to distribute ${\displaystyle \displaystyle n}$ particles among the ${\displaystyle \displaystyle g}$ sublevels of an energy level) is the cardinality of ${\displaystyle \displaystyle S(n,g)}$, i.e., the number of elements (or valid n-tuples) in ${\displaystyle \displaystyle S(n,g)}$. Thus the problem of finding an expression for ${\displaystyle \displaystyle w(n,g)}$ becomes the problem of counting the elements in ${\displaystyle \displaystyle S(n,g)}$.

Example n = 4, g = 3:

${\displaystyle S(4,3)=\left\{\underbrace {(1111),(1112),(1113)} _{(a)},\underbrace {(1122),(1123),(1133)} _{(b)},\underbrace {(1222),(1223),(1233),(1333)} _{(c)},\right.}$
${\displaystyle \left.\underbrace {(2222),(2223),(2233),(2333),(3333)} _{(d)}\right\}}$
${\displaystyle \displaystyle w(4,3)=15}$ (there are ${\displaystyle \displaystyle 15}$ elements in ${\displaystyle \displaystyle S(4,3)}$)

Each element of ${\displaystyle \displaystyle S(4,3)}$ can be thought of as a multiset of cardinality ${\displaystyle \displaystyle n=4}$; the elements of such multiset are taken from the set ${\displaystyle \displaystyle \left\{1,2,3\right\}}$ of cardinality ${\displaystyle \displaystyle g=3}$, and the number of such multisets is the multiset coefficient

${\displaystyle \displaystyle \left\langle {\begin{matrix}3\\4\end{matrix}}\right\rangle ={3+4-1 \choose 3-1}={3+4-1 \choose 4}={\frac {6!}{4!2!}}=15}$
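The set ${\displaystyle S(n,g)}$ of non-decreasing die sequences can be generated directly, which gives a quick check of the count above. This is a sketch with hypothetical function names; it relies on `itertools.combinations_with_replacement`, which emits exactly the non-decreasing tuples when the input is sorted.

```python
from collections import Counter
from itertools import combinations_with_replacement
from math import comb


def valid_tuples(n, g):
    """The set S(n, g): non-decreasing n-tuples with entries in {1, ..., g}."""
    return list(combinations_with_replacement(range(1, g + 1), n))


def to_occupation_numbers(sequence, g):
    """Map a die sequence to sublevel occupation numbers (n_1, ..., n_g)."""
    counts = Counter(sequence)
    return tuple(counts.get(value, 0) for value in range(1, g + 1))


s43 = valid_tuples(4, 3)
print(len(s43), comb(3 + 4 - 1, 4))          # both counts should agree
print(s43[:3])                               # first few elements of S(4, 3)
print(to_occupation_numbers((1, 1, 2, 3), 3))
```

Each multiset corresponds to one assignment of occupation numbers to the g sublevels, which is why the dice game counts the same configurations as ${\displaystyle w(n,g)}$.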

More generally, each element of ${\displaystyle \displaystyle S(n,g)}$ is a multiset of cardinality ${\displaystyle \displaystyle n}$ (number of dice) with elements taken from the set ${\displaystyle \displaystyle \left\{1,\dots ,g\right\}}$ of cardinality ${\displaystyle \displaystyle g}$ (number of possible values of each die), and the number of such multisets, i.e., ${\displaystyle \displaystyle w(n,g)}$, is the multiset coefficient

${\displaystyle \displaystyle w(n,g)=\left\langle {\begin{matrix}g\\n\end{matrix}}\right\rangle ={g+n-1 \choose g-1}={g+n-1 \choose n}={\frac {(g+n-1)!}{n!(g-1)!}}}$

which is exactly the same as the formula for ${\displaystyle \displaystyle w(n,g)}$, as derived above with the aid of a theorem involving binomial coefficients, namely

${\displaystyle \sum _{k=0}^{n}{\frac {(k+a)!}{k!a!}}={\frac {(n+a+1)!}{n!(a+1)!}}.}$

To understand the decomposition

${\displaystyle \displaystyle w(4,3)=w(4,2)+w(3,2)+w(2,2)+w(1,2)+w(0,2),}$

let us rearrange the elements of ${\displaystyle \displaystyle S(4,3)}$ as follows

${\displaystyle S(4,3)=\left\{\underbrace {(1111),(1112),(1122),(1222),(2222)} _{(\alpha )},\underbrace {(111{\color {Red}{\underset {=}{3}}}),(112{\color {Red}{\underset {=}{3}}}),(122{\color {Red}{\underset {=}{3}}}),(222{\color {Red}{\underset {=}{3}}})} _{(\beta )},\right.}$
${\displaystyle \left.\underbrace {(11{\color {Red}{\underset {==}{33}}}),(12{\color {Red}{\underset {==}{33}}}),(22{\color {Red}{\underset {==}{33}}})} _{(\gamma )},\underbrace {(1{\color {Red}{\underset {===}{333}}}),(2{\color {Red}{\underset {===}{333}}})} _{(\delta )}\underbrace {({\color {Red}{\underset {====}{3333}}})} _{(\omega )}\right\}.}$

Clearly, the subset ${\displaystyle \displaystyle (\alpha )}$ of ${\displaystyle \displaystyle S(4,3)}$ is the same as the set

${\displaystyle \displaystyle S(4,2)=\left\{(1111),(1112),(1122),(1222),(2222)\right\}}$.

By deleting the index ${\displaystyle \displaystyle m_{4}=3}$ (shown in red with double underline) in the subset ${\displaystyle \displaystyle (\beta )}$ of ${\displaystyle \displaystyle S(4,3)}$, one obtains the set

${\displaystyle \displaystyle S(3,2)=\left\{(111),(112),(122),(222)\right\}}$.

In other words, there is a one-to-one correspondence between the subset ${\displaystyle \displaystyle (\beta )}$ of ${\displaystyle \displaystyle S(4,3)}$ and the set ${\displaystyle \displaystyle S(3,2)}$. We write

${\displaystyle \displaystyle (\beta )\longleftrightarrow S(3,2)}$.

Similarly, it is easy to see that

${\displaystyle \displaystyle (\gamma )\longleftrightarrow S(2,2)=\left\{(11),(12),(22)\right\}}$
${\displaystyle \displaystyle (\delta )\longleftrightarrow S(1,2)=\left\{(1),(2)\right\}}$
${\displaystyle \displaystyle (\omega )\longleftrightarrow S(0,2)=\varnothing }$ (empty set).

Thus we can write

${\displaystyle \displaystyle S(4,3)=\bigcup _{k=0}^{4}S(4-k,2)}$

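or more generally,

${\displaystyle \displaystyle S(n,g)=\bigcup _{k=0}^{n}S(n-k,g-1)}$

and since the sets

${\displaystyle \displaystyle S(i,g-1)\ ,\ {\rm {for}}\ i=0,\dots ,n}$

are non-intersecting, we thus have

${\displaystyle \displaystyle w(n,g)=\sum _{k=0}^{n}w(n-k,g-1)=w(n,g-1)+w(n-1,g-1)+\cdots +w(1,g-1)+w(0,g-1)}$

with the convention that

${\displaystyle \displaystyle w(0,g)=1\quad \forall g.}$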

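Continuing the process down to a single sublevel, we arrive at the following formula

${\displaystyle \displaystyle w(n,g)=\sum _{k_{1}=0}^{n}\sum _{k_{2}=0}^{n-k_{1}}w(n-k_{1}-k_{2},g-2)=\sum _{k_{1}=0}^{n}\sum _{k_{2}=0}^{n-k_{1}}\cdots \sum _{k_{g-1}=0}^{n-\sum _{j=1}^{g-2}k_{j}}w(n-\sum _{i=1}^{g-1}k_{i},1).}$

Using ${\displaystyle \displaystyle w(n,1)=1}$ for every ${\displaystyle \displaystyle n}$ (there is only one way of distributing the particles with one sublevel), we obtain the formula

${\displaystyle \displaystyle w(n,g)=\sum _{k_{1}=0}^{n}\sum _{k_{2}=0}^{n-k_{1}}\cdots \sum _{k_{g-1}=0}^{n-\sum _{j=1}^{g-2}k_{j}}1,}$

keeping in mind that for ${\displaystyle \displaystyle q}$ and ${\displaystyle \displaystyle p}$ being constants, we have

${\displaystyle \displaystyle \sum _{k=0}^{q}p=(q+1)p.}$

It can then be verified that this nested sum and the closed form ${\displaystyle \displaystyle {\frac {(n+g-1)!}{n!(g-1)!}}}$ give the same result for ${\displaystyle \displaystyle w(4,3)}$, ${\displaystyle \displaystyle w(3,3)}$, ${\displaystyle \displaystyle w(3,2)}$, etc.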
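The agreement between the recursive decomposition and the closed form can be checked mechanically. The sketch below (hypothetical function names) implements the recurrence ${\displaystyle w(n,g)=\sum _{k=0}^{n}w(n-k,g-1)}$ and terminates it at a single sublevel, where ${\displaystyle w(n,1)=1}$ as stated at the start of the derivation.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def w_recursive(n, g):
    """w(n, g) from the recurrence sum_{k=0}^{n} w(n - k, g - 1)."""
    if g == 1:
        return 1          # one way to put n particles in a single sublevel
    return sum(w_recursive(n - k, g - 1) for k in range(n + 1))


def w_closed(n, g):
    """Closed form (n + g - 1)! / (n! (g - 1)!)."""
    return comb(n + g - 1, g - 1)


for n, g in [(4, 3), (3, 3), (3, 2)]:
    print((n, g), w_recursive(n, g), w_closed(n, g))
```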

## Interdisciplinary applications

Viewed as a pure probability distribution, the Bose–Einstein distribution has found application in other fields:

• Bose–Einstein statistics have also been used as a method for term weighting in information retrieval. The method is one of a collection of DFR ("Divergence From Randomness") models,[6] the basic notion being that Bose–Einstein statistics may be a useful indicator in cases where a particular term and a particular document have a significant relationship that would not have occurred purely by chance. Source code for implementing this model is available from the Terrier project at the University of Glasgow.
• The evolution of many complex systems, including the World Wide Web, business, and citation networks, is encoded in the dynamic web describing the interactions between the system's constituents. Despite their irreversible and nonequilibrium nature, these networks follow Bose statistics and can undergo Bose–Einstein condensation. Addressing the dynamical properties of these nonequilibrium systems within the framework of equilibrium quantum gases predicts that the "first-mover-advantage," "fit-get-rich (FGR)," and "winner-takes-all" phenomena observed in competitive systems are thermodynamically distinct phases of the underlying evolving networks.[6]

## Notes

1. See p. 14, note 3, of the Ph.D. Thesis entitled Bose–Einstein condensation: analysis of problems and rigorous results, presented by Alessandro Michelangeli to the International School for Advanced Studies, Mathematical Physics Sector, October 2007 for the degree of Ph.D. See: http://digitallibrary.sissa.it/handle/1963/5272?show=full, and download from http://digitallibrary.sissa.it/handle/1963/5272
3. Chapter 7 of [unresolved ISBN citation].
4. The BE distribution can be derived also from thermal field theory.
5. See McQuarrie in citations
6. Amati, G.; C. J. Van Rijsbergen (2002). "Probabilistic models of information retrieval based on measuring the divergence from randomness". ACM TOIS 20 (4):357–389.

## References

• Bose (1924). "Plancks Gesetz und Lichtquantenhypothese", Zeitschrift für Physik 26:178–181. doi:10.1007/BF01327326 (Einstein's translation into German of Bose's paper on Planck's law).