# History of Lorentz transformations

The Lorentz transformations relate the space-time coordinates, (which specify the position x, y, z and time t of an event) relative to a particular inertial frame of reference (the "rest system"), and the coordinates of the same event relative to another coordinate system moving in the positive x-direction at a constant speed v, relative to the rest system. It was devised as a theoretical transformation which makes the velocity of light invariant between different inertial frames. The coordinates of the event in this "moving system" are denoted x′, y′, z′ and t′. The rest system was sometimes identified with the luminiferous aether, the postulated medium for the propagation of light, and the moving system was commonly identified with the earth as it moved through this medium. Early approximations of the transformation were published by Voigt (1887) and Lorentz (1895). They were completed by Larmor (1897, 1900) and Lorentz (1899, 1904) and were brought into their modern form by Poincaré (1905), who gave the transformation the name of Lorentz. Eventually, Einstein (1905) showed in his development of special relativity that the transformations follow from the principle of relativity and the constant light speed alone, without requiring a mechanical aether, and are changing the traditional concepts of space and time. Subsequently, Minkowski used them to argue that space and time are inseparably connected as spacetime.

In this article the historical notations are replaced with modern notations, with the Lorentz transformation,

$x^{\prime }=\gamma (x-vt),\quad y^{\prime }=y,\quad z^{\prime }=z,\quad t^{\prime }=\gamma \left(t-x{\frac {v}{c^{2}}}\right)$ ,

and the Lorentz factor,

$\gamma ={\frac {1}{\sqrt {1-v^{2}/c^{2}}}}$ v being the relative velocity of the two reference frames, and c the speed of light.

## Sphere geometry in the 19th century

{{#invoke:main|main}}

One of the defining properties of the Lorentz transformation is its group structure which leaves the expression $x^{2}+y^{2}+z^{2}-c^{2}t^{2}$ invariant. So a spherical wave in one frame remains spherical in another one, which is often used to derive the Lorentz transformation. However, long before experiments and physical theories made the introduction of the Lorentz transformation necessary, transformation groups and sphere geometries transforming sphere into spheres have been discussed, such as the Transformation by reciprocal radii within Möbius geometry, and the Transformation by reciprocal directions within Laguerre geometry. Both can be seen as special cases of Lie sphere geometry. The connections of these transformations to Maxwell's equations and the laws of physics were discovered, however, only after 1905 when the Lorentz transformation was already derived in a different way by physicists.

In several papers between 1847 and 1850 it was shown by Joseph Liouville[A 1] that the relation $\lambda \left(\delta x^{2}+\delta y^{2}+\delta z^{2}\right)$ is invariant under the group of conformal transformations or the "Transformation by reciprocal radii" which transforms spheres into spheres. This theorem was extended to all dimensions by Sophus Lie (1871) so that $\lambda \left(\delta x_{1}^{2}+\dots +\delta x_{n}^{2}\right)$ is invariant too.[A 2] In 1909, Harry Bateman and Ebenezer Cunningham showed that not only the quadratic form but also Maxwells equations are covariant with respect to conformal transformation, irrespective of the choice of $\lambda$ . This variant of conformal transformations were called spherical wave transformations by him.[A 3][A 4] However, this covariance is restricted to certain areas such as electrodynamics, whereas the totality of natural laws in inertial frames is covariant under the Lorentz group.[A 5]

Albert Ribaucour (1870)[A 6] and in particular Edmond Laguerre (1880-1885)[A 7][A 8] employed another variant, namely the "transformation by reciprocal directions" or „Laguerre inversion/transformation“ which transforms spheres into spheres and planes into planes.[A 7] Laguerre explicitly wrote down the corresponding transformation formulas in 1882, with Gaston Darboux (1887) presenting them in respect to coordinates $x,y,z,R$ (R being the radius):[A 9]

{\begin{aligned}x'&=x,\quad &z'&={\frac {1+k^{2}}{1-k^{2}}}z-{\frac {2kR}{1-k^{2}}},\\y'&=y,&R'&={\frac {2kz}{1-k^{2}}}-{\frac {1+k^{2}}{1-k^{2}}}R,\end{aligned}} producing the following relation:

$x^{\prime 2}+y^{\prime 2}+z^{\prime 2}-R^{\prime 2}=x^{2}+y^{2}+z^{2}-R^{2}$ .
${\frac {1-k^{2}}{1+k^{2}}}={\sqrt {1-v^{2}}}={\frac {1}{\gamma }},\quad {\frac {2k}{1-k^{2}}}=v\gamma ,$ $x'=x,\quad y'=y,\quad z'=\gamma (z-vt),\quad t'=\gamma (vz-t)$ Furthermore, the group isomorphism between the Laguerre group and Lorentz group was pointed out by Élie Cartan, Henri Poincaré and others (see Laguerre group isomorphic to Lorentz group).[A 12]

## Voigt (1887)

Woldemar Voigt (1887)[A 13] developed a transformation in connection with the Doppler effect and an incompressible medium, being in modern notation:

$x^{\prime }=x-vt,\quad y^{\prime }={\frac {y}{\gamma }},\quad z^{\prime }={\frac {z}{\gamma }},\quad t^{\prime }=t-x{\frac {v}{c^{2}}}$ .

If the right-hand sides of his equations are multiplied by $\gamma$ they are the modern Lorentz transformation. In Voigt's theory the speed of light is invariant, but his transformations mix up a relativistic boost together with a rescaling of space-time. Optical phenomena in free space are scale, conformal (using the factor $\lambda$ discussed above), and Lorentz invariant, so the combination is invariant too. For instance, Lorentz transformations can be extended by using $l={\sqrt {\lambda }}$ :[A 14][A 15]

$x^{\prime }=\gamma l\left(x-vt\right),\quad y^{\prime }=ly,\quad z^{\prime }=lz,\quad t^{\prime }=\gamma l\left(t-x{\frac {v}{c^{2}}}\right)$ .

$l=1/\gamma$ gives the Voigt transformation, $l=1$ the Lorentz transformation. But scale transformations are not a symmetry of all the laws of nature, only of electromagnetism, so these transformations cannot be used to formulate a principle of relativity in general. It was demonstrated by Poincaré and Einstein that one has to set $l=1$ in order to make the above transformation symmetric and to form a group as required by the relativity principle, therefore the Lorentz transformation is the only viable choice.

Voigt sent his 1887 paper to Lorentz in 1908, and that was acknowledged in 1909:

In a paper „Über das Doppler'sche Princip“, published in 1887 (Gött. Nachrichten, p. 41) and which to my regret has escaped my notice all these years, Voigt has applied to equations of the form (6) (§ 3 of this book) [namely $\Delta \Psi -{\tfrac {1}{c^{2}}}{\tfrac {\partial ^{2}\Psi }{\partial t^{2}}}=0$ ] a transformation equivalent to the formulae (287) and (288) [namely $x^{\prime }=\gamma l\left(x-vt\right),\ y^{\prime }=ly,\ z^{\prime }=lz,\ t^{\prime }=\gamma l\left(t-{\tfrac {v}{c^{2}}}x\right)$ ]. The idea of the transformations used above (and in § 44) might therefore have been borrowed from Voigt and the proof that it does not alter the form of the equations for the free ether is contained in his paper.[A 15]

Also Hermann Minkowski said in 1908 that the transformations which play the main role in the principle of relativity were first examined by Voigt in 1887. Voigt responded in the same paper by saying that his theory was based on an elastic theory of light, not an electromagnetic one. However, he concluded that some results were actually the same.[A 16]

## Heaviside (1888), Thomson (1889), Searle (1896)

In 1888, Oliver Heaviside[A 17] investigated the properties of charges in motion according to Maxwell's electrodynamics. He calculated, among other things, anisotropies in the electric field of moving bodies represented by this formula:

$\mathrm {E} =\left({\frac {q\mathrm {r} }{r^{2}}}\right)\left(1-{\frac {v^{2}\sin ^{2}\theta }{c^{2}}}\right)^{-3/2}$ .

Consequently, Joseph John Thomson (1889)[A 18] found a way to substantially simplify calculations concerning moving charges by using the following mathematical transformation:

$x^{\prime }=\gamma x$ .

Thereby, inhomogeneous electromagnetic wave equations are transformed into a Poisson equation. Eventually, George Frederick Charles Searle[A 19] noted in (1896) that Heaviside's expression leads to a deformation of electric fields which he called "Heaviside-Ellipsoid" of axial ratio $1/\gamma :1:1\!$ .

## Lorentz (1892, 1895)

In order to explain the aberration of light and the result of the Fizeau experiment in accordance with Maxwell's equations, Lorentz in 1892 developed a model ("Lorentz ether theory")[A 20] in which the aether is completely motionless, and the speed of light in the aether is constant in all directions. In order to calculate the optics of moving bodies, Lorentz introduced the following quantities to transform from the aether system into a moving system (it's unknown whether he was influenced by Voigt, Heaviside, and Thomson).

$x^{\prime }=\gamma x^{*},\quad y^{\prime }=y,\quad z^{\prime }=z,\quad t^{\prime }=t-\gamma ^{2}x^{*}{\frac {v}{c^{2}}}$ where x* is the Galilean transformation x-vt. While t is the "true" time for observers resting in the aether, t' is an auxiliary variable only for calculating processes for moving systems. It is also important that Lorentz and later also Larmor formulated this transformation in 2 steps. At first the Galilean transformation - and later the expansion into the "fictitious" electromagnetic system with the aid of the Lorentz transformation. In order to explain the negative result of the Michelson–Morley experiment, he (1892b)[A 21] introduced the additional hypothesis that also intermolecular forces are affected in a similar way and introduced length contraction in his theory (without proof as he admitted). The same hypothesis was already made by George FitzGerald in 1889 based on Heaviside's work. While length contraction was a real physical effect for Lorentz, he considered the time transformation only as a heuristic working hypothesis and a mathematical stipulation.

In 1895,[A 22] Lorentz further elaborated on his theory and introduced the "theorem of corresponding states". This theorem states that a moving observer (relative to the ether) in his „fictitious“ field makes the same observations as a resting observers in his „real“ field for velocities to first order in v/c. Lorentz showed that the dimensions of electrostatic systems in the ether and a moving frame are connected by this transformation:

$x^{\prime }=\gamma x^{*},\quad y^{\prime }=y,\quad z^{\prime }=z,\quad t^{\prime }=t$ For solving optical problems Lorentz used the following transformation, whereby for the time variable he used the expression "local time" (Ortszeit):

$x^{\prime }=x^{*},\quad y^{\prime }=y,\quad z^{\prime }=z,\quad t^{\prime }=t-x^{*}{\frac {v}{c^{2}}}$ With this concept Lorentz could explain the Doppler effect, the aberration of light, and the Fizeau experiment.

## Larmor (1897, 1900)

Larmor in 1897[A 23] and 1900[A 24] presented the transformations in two parts. Similar to Lorentz, he considered first the transformation from a rest system (xyzt) to a moving system (x′, y′, z′, t′)

$x'=x-vt,\quad y'=y',\quad z'=z,\quad t'=t-\gamma ^{2}vx^{*}/c^{2}$ This transformation is just the Galilean transformation for the xyz coordinates but contains Lorentz’s "local time". Larmor knew that the Michelson–Morley experiment was accurate enough to detect an effect of motion depending on the factor v²/c², and so he sought the transformations which were "accurate to second order" (as he put it). Thus he wrote the final transformations (where x* = x − vt) as:

$x^{\prime }=\gamma x^{*},\quad y^{\prime }=y,\quad z^{\prime }=z,\quad t^{\prime }={\frac {t}{\gamma }}-\gamma x^{*}{\frac {v}{c^{2}}}$ Larmor showed that Maxwell's equations were invariant under this two-step transformation, "to second order in v/c", as he put it. Larmor noted that if it is assumed that the constitution of molecules is electrical then the FitzGerald-Lorentz contraction is a consequence of this transformation. It's notable that Larmor was the first who recognized that some sort of time dilation is a consequence of this transformation as well, because individual electrons describe corresponding parts of their orbits in times shorter for the [rest] system in the ratio 1/γ.

Larmor gave credit to Lorentz in two papers published in 1904, in which he used the term "Lorentz transformation" for Lorentz's first order transformations of coordinates and field configurations:

p. 583: [..] Lorentz's transformation for passing from the field of activity of a stationary electrodynamic material system to that of one moving with uniform velocity of translation through the aether.[A 25]

p. 585: [..] the Lorentz transformation has shown us what is not so immediately obvious [..][A 25]
p. 622: [..] the transformation first developed by Lorentz: namely, each point in space is to have its own origin from which time is measured, its "local time" in Lorentz's phraseology, and then the values of the electric and magnetic vectors [..] at all points in the aether between the molecules in the system at rest, are the same as those of the vectors [..] at the corresponding points in the convected system at the same local times.[A 26]

## Lorentz (1899, 1904)

Also Lorentz, by extending his theorem of corresponding states, derived in 1899 the complete transformations. However, he used the undetermined factor $l$ as an arbitrary function of $v$ . Like Larmor, Lorentz noticed in 1899[A 27] also some sort of time dilation effect in relation to the frequency of oscillating electrons "that in $S$ the time of vibrations be $l\gamma$ times as great as in $S_{0}$ ", where $S_{0}$ is the aether frame, Subsequently, in 1904 he wrote the equations in the following form (as mentioned above $x^{\ast }$ must be replaced by $x-vt$ ):

$x^{\prime }=\gamma lx^{\ast },\quad y^{\prime }=ly,\quad z^{\prime }=lz,\quad t^{\prime }={\frac {l}{\gamma }}t-\gamma lx^{\ast }{\frac {v}{c^{2}}}$ Under the assumption that $l=1$ when $v=0$ , he demonstrated that $l=1$ must be the case at all velocities, therefore length contraction can only arise in the line of motion. So by setting the factor $l$ to unity, Lorentz's transformation now assumed the same form as Larmor's. Unlike Larmor, who restricted himself to show the covariance of Maxwell's equations to second order, Lorentz tried to widen its covariance to all orders in $v/c$ . He also derived the correct formulas for the velocity dependence of electromagnetic mass, and concluded that the transformation formulas must apply to all forces of nature, not only electrical ones.[A 28] However, he didn't achieve full covariance of the transformation equations for charge density and velocity. When the 1904 paper was reprinted in 1913, Lorentz therefore added the following remark:

One will notice that in this work the transformation equations of Einstein’s Relativity Theory have not quite been attained. [..] On this circumstance depends the clumsiness of many of the further considerations in this work.

## Poincaré (1900, 1905)

### Local time

Neither Lorentz or Larmor gave a clear physical interpretation of the origin of local time. However, Henri Poincaré[A 29][A 30] in 1900 commented on the origin of Lorentz’s “wonderful invention” of local time. He remarked that it arose when clocks in a moving reference frame are synchronised by exchanging signals which are assumed to travel with the same speed c in both directions, which lead to what is nowadays called relativity of simultaneity, although Poincaré's calculation does not involve length contraction or time dilation. In order to synchronise the clocks here on Earth (the x*, t* frame) a light signal from one clock (at the origin) is sent to another (at x*), and is sent back. It's supposed that the Earth is moving with speed v in the x-direction (= x*-direction) in some rest system (x,t) (i.e. the luminiferous aether system for Lorentz and Larmor). The time of flight outwards is

$\delta t_{o}={\frac {x^{*}}{\left(c-v\right)}}$ and the time of flight back is

$\delta t_{b}={\frac {x^{*}}{\left(c+v\right)}}\cdot$ The elapsed time on the clock when the signal is returned is δto + δtb and the time t* = (δto + δtb)/2 is ascribed to the moment when the light signal reached the distant clock. In the rest frame the time t = δto is ascribed to that same instant. Some algebra gives the relation between the different time coordinates ascribed to the moment of reflection. Thus

$t^{*}=t-{\frac {\epsilon vx^{*}}{c^{2}}}\cdot$ Poincaré gave the result t* = t − vx*/c2, which is the form used by Lorentz in 1895. Poincaré dropped the factor ε ≅ 1 under the assumption that

${\frac {v^{2}}{c^{2}}}\ll 1.\,$ Similar physical interpretations of local time were later given by Emil Cohn (1904)[A 31] and Max Abraham (1905).[A 32]

### Lorentz transformation

On June 5, 1905 (published June 9)[A 14] Poincaré simplified the equations which are algebraically equivalent to those of Larmor and Lorentz and gave them the modern form. Apparently Poincaré was unaware of Larmor's contributions, because he only mentioned Lorentz and therefore used for the first time the name "Lorentz transformation".

$x^{\prime }=\gamma l(x-vt),\quad y^{\prime }=ly,\quad z^{\prime }=lz,\quad t^{\prime }=\gamma l\left(t-vx\right)$ .

Poincaré set the speed of light to unity, pointed out the group characteristics of the transformation by setting $l=1$ , and modified/corrected Lorentz's derivation of the equations of electrodynamics in some details in order to fully satisfy the principle of relativity, i.e. making them fully Lorentz covariant.

In July 1905 (published in January 1906)[A 33] Poincaré showed in detail how the transformations and electrodynamic equations are a consequence of the principle of least action; he demonstrated in more detail the group characteristics of the transformation, which he called Lorentz group, and he showed that the combination x2 + y2 + z2 − c2t2 is invariant. He noticed that the Lorentz transformation is merely a rotation in four-dimensional space about the origin by introducing ct−1 as a fourth imaginary coordinate, and he used an early form of four-vectors.

## Einstein (1905)

On June 30, 1905 (published September 1905) Einstein[A 34] published what is now called special relativity and gave a new derivation of the transformation, which was based only on the principle on relativity and the principle of the constancy of the speed of light. While Lorentz considered "local time" to be a mathematical stipulation device for explaining the Michelson-Morley experiment, Einstein showed that the coordinates given by the Lorentz transformation were in fact the inertial coordinates of relatively moving frames of reference. For quantities of first order in v/c this was also done by Poincaré in 1900, while Einstein derived the complete transformation by this method. Unlike Lorentz and Poincaré who still distinguished between real time in the aether and apparent time for moving observers, Einstein showed that the transformations concern the nature of space and time.

The notation for this transformation is identical to Poincaré's of 1905, except that Einstein didn't set the speed of light to unity:

$x^{\prime }=\gamma (x-vt),\quad y^{\prime }=y,\quad z^{\prime }=z,\quad t^{\prime }=\gamma \left(t-x{\frac {v}{c^{2}}}\right)$ ## Minkowski (1907–1908)

The work on the principle of relativity by Lorentz, Einstein, Planck, together with Poincaré's four-dimensional approach, were further elaborated by Hermann Minkowski in 1907 and 1908.[A 35][A 36][A 37] Minkowski particularly reformulated electrodynamics in a four-dimensional way (Minkowski spacetime). For instance, he wrote $x,y,z,it$ in the form $x_{1},x_{2},x_{3},x_{4}$ . By defining $\psi$ as the angle of rotation around the $z$ -axis, the Lorentz transformation assumes the form[A 36]

$x'_{1}=x_{1},\quad x'_{2}=x_{2},\quad x'_{3}=x_{3}\cos i\psi +x_{4}\sin i\psi ,\quad x'_{4}=-x_{3}\sin i\psi +x_{4}\cos i\psi ,$ ## Ignatowski (1910)

While earlier derivations and formulations of the Lorentz transformation relied from the outset on optics, electrodynamics, or the invariance of the speed of light, Vladimir Ignatowski (1910) showed that it is possible to use the principle of relativity (and related group theoretical principles) alone, in order to derive the following transformation between two inertial frames:[A 38][A 39][A 40]

$x^{\prime }=p(x-vt),\quad y^{\prime }=y,\quad z^{\prime }=z,\quad t^{\prime }=p(t-nvx)$ where $p=1/{\sqrt {1-nv^{2}}}$ . The variable $n$ can be seen as a space-time constant whose value has to be determined by experiment or taken from a known physical law such as electrodynamics. For that purpose, Ignatowski used the above mentioned Heaviside ellipsoid representing a contraction of electrostatic fields by $x/\gamma$ in the direction of motion. It can be seen that this is only consistent with Ignatowski's transformation when $n=1/c^{2}$ , resulting into $p=\gamma$ and the Lorentz transformation. With $n=0$ , no length changes arise and the Galilean transformation follows. Ignatowski's method was further developed and improved by Philipp Frank and Hermann Rothe (1911, 1912)[A 41][A 42], with various authors developing similar methods in subsequent years.