In theoretical physics, the composition of two non-collinearLorentz boosts results in a Lorentz transformation that is not a pure boost but is the composition of a boost and a rotation. This rotation is called Thomas rotation, Thomas–Wigner rotation or Wigner rotation. The rotation was discovered and proved by Ludwik Silberstein in his 1914 book 'Relativity', rediscovered by Llewellyn Thomas in 1926, and rederived by Wigner in 1939. Wigner acknowledged Silberstein. If a sequence of non-collinear boosts returns an object to its initial velocity, then the sequence of Wigner rotations can combine to produce a net rotation called the Thomas precession.
There are still ongoing discussions about the correct form of equations for the Thomas rotation in different reference systems with contradicting results.Goldstein:
The spatial rotation resulting from the successive application of two non-collinear Lorentz transformations have been declared every bit as paradoxical as the more frequently discussed apparent violations of common sense, such as the twin paradox.
Einstein's principle of velocity reciprocity (EPVR) reads
We postulate that the relation between the coordinates of the two systems is linear. Then the inverse transformation is also linear and the complete non-preference of the one or the other system demands that the transformation shall be identical with the original one, except for a change of v to −v
With less careful interpretation, the EPVR is seemingly violated in some models. There is, of course, no true paradox present.
Let it be u the velocity in which the lab reference frame moves respect an object called A and let it be v the velocity in which another object called B is moving, measured from the lab reference frame. If u and v are not aligned the relative velocities of these two bodies will not be opposite, that is since there is a rotation between them
The velocity that A will measure on B will be:
The Lorentz factor for the velocities that either A sees on B or B sees on A:
The angle of rotation can be calculated in two ways:
And the axis of rotation is:
Setup of frames and relative velocities between themEdit
Velocity composition and Thomas rotation in xy plane, velocities u and v separated by angle θ. Left: As measured in Σ′, the orientations of Σ and Σ′′ appear parallel to Σ′. Centre: In frame Σ, Σ′′ is rotated through angle ε about an axis parallel to u×v and then moves with velocity wd relative to Σ. Right: In frame Σ′′, Σ moves with velocity −wd relative to Σ′′ and then moves with velocity wd relative to Σ.
Velocity composition and Thomas rotation in xy plane, velocities −u and −v separated by angle θ. Left: As measured in Σ′, the orientations of Σ and Σ′′ appear parallel to Σ′. Centre: In frame Σ′′, Σ is rotated through angle ε about an axis parallel to −(u×v) and then moves with velocity −wi relative to Σ′′. Right: In frame Σ, Σ′′ moves with velocity wi relative to Σ and then is rotated through angle ε about an axis parallel to u×v.
Comparison of velocity compositions wd and wi. Notice the same magnitudes but different directions.
Two general boostsEdit
When studying the Thomas rotation at the fundamental level, one typically uses a setup with three coordinate frames, Σ, Σ′ Σ′′. Frame Σ′ has velocity u relative to frame Σ, and frame Σ′′ has velocity v relative to frame Σ′.
The axes are, by construction, oriented as follows. Viewed from Σ′, the axes of Σ′ and Σ are parallel (the same holds true for the pair of frames when viewed from Σ.) Also viewed from Σ′, the spatial axes of Σ′ and Σ′′ are parallel (and the same holds true for the pair of frames when viewed from Σ′′.) This is an application of EVPR: If u is the velocity of Σ′ relative to Σ, then u′ = −u is the velocity of Σ relative to Σ′. The velocity 3-vectoru makes the same angles with respect to coordinate axes in both the primed and unprimed systems. This does not represent a snapshot taken in any of the two frames of the combined system at any particular time, as should be clear from the detailed description below.
This is possible, since a boost in, say, the positive z-direction, preserves orthogonality of the coordinate axes. A general boost B(w) can be expressed as L = R−1(ez, w)Bz(w)R(ez, w), where R(ez, w) is a rotation taking the z-axis into the direction of w and Bz is a boost in the new z-direction. Each rotation retains the property that the spatial coordinate axes are orthogonal. The boost will stretch the (intermediate) z-axis by a factor γ, while leaving the intermediate x-axis and y-axis in place. The fact that coordinate axes are non-parallel in this construction after two consecutive non-collinear boosts is a precise expression of the phenomenon of Thomas rotation.[nb 1]
is the Lorentz factor of the velocity u (the vertical bars |u| indicate the magnitude of the vector). The velocity u can be thought of the velocity of a frame Σ′ relative to a frame Σ, and v is the velocity of an object, say a particle or another frameΣ′′ relative to Σ′. In the present context, all velocities are best thought of as relative velocities of frames unless otherwise specified. The result w = u ⊕ v is then the relative velocity of frame Σ′′ relative to a frame Σ.
Although velocity addition is nonlinear, non-associative, and non-commutative, the result of the operation correctly obtains a velocity with a magnitude less than c. If ordinary vector addition was used, it would be possible to obtain a velocity with a magnitude larger than c. The Lorentz factorγ of both composite velocities are equal,
and the norms are equal under interchange of velocity vectors
Since the two possible composite velocities have equal magnitude, but different directions, one must be a rotated copy of the other. More detail and other properties of no direct concern here can be found in the main article.
Consider the reversed configuration, namely, frame Σ moves with velocity −u relative to frame Σ′, and frame Σ′, in turn, moves with velocity −v relative to frame Σ′′. In short, u → − u and v → −v by EPVR. Then the velocity of Σ relative to Σ′′ is (−v) ⊕ (−u) ≡ −v ⊕ u. By EPVR again, the velocity of Σ′′ relative to Σ is then wi = v ⊕ u. (A)
One finds wd ≠ wi. While they are equal in magnitude, there is an angle between them. For a single boost between two inertial frames, there is only one unambiguous relative velocity (or its negative). For two boosts, the peculiar result of two inequivalent relative velocities instead of one seems to contradict the symmetry of relative motion between any two frames. Which is the correct velocity of Σ′′ relative to Σ? Since this inequality may be somewhat unexpected and potentially breaking EPVR, this question is warranted.[nb 2]
Formulation in terms of Lorentz transformationsEdit
A frame Σ′′ is boosted with velocity v relative to another frame Σ′, which is boosted with velocity u relative to another frame Σ.
A frame Σ is boosted with velocity −u relative to another frame Σ′, which is boosted with velocity −v relative to another frame Σ′′ .
Original configuration with exchanged velocities u and v.
Inverse of exchanged configuration.
Two boosts equals a boost and rotationEdit
The answer to the question lies in the Thomas rotation, and that one must be careful in specifying which coordinate system is involved at each step. When viewed from Σ, the coordinate axes of Σ and Σ′′ are not parallel. While this can be hard to imagine since both pairs (Σ, Σ′) and (Σ′, Σ′′) have parallel coordinate axes, it is easy to explain mathematically.
Velocity addition does not provide a complete description of the relation between the frames. One must formulate the complete description in terms of Lorentz transformations corresponding to the velocities. A Lorentz boost with any velocity v (magnitude less than c) is given symbolically by
where the coordinates and transformation matrix are compactly expressed in block matrix form
It is clear that to each admissible velocity v there corresponds a pure Lorentz boost,
Velocity addition u⊕v corresponds to the composition of boosts B(v)B(u) in that order. The B(u) acts on X first, then B(v) acts on B(u)X. Notice succeeding operators act on the left in any composition of operators, so B(v)B(u) should be interpreted as a boost with velocities u then v, not v then u. Performing the Lorentz transformations by block matrix multiplication,
Here γ is the composite Lorentz factor, and a and b are 3×1 column vectors proportional to the composite velocities. The 3×3 matrix M will turn out to have geometric significance.
The inverse transformations are
and the composition amounts to a negation and exchange of velocities,
If the relative velocities are exchanged, looking at the blocks of Λ, one observes the composite transformation to be the matrix transpose of Λ. This is not the same as the original matrix, so the composite Lorentz transformation matrix is not symmetric, and thus not a single boost. This, in turn, translates to the incompleteness of velocity composition from the result of two boosts; symbolically,
To make the description complete, it is necessary to introduce a rotation, before or after the boost. This rotation is the Thomas rotation. A rotation is given by
where the 4×4 rotation matrix is
and R is a 3×3 rotation matrix.[nb 3] In this article the axis-angle representation is used, and θ = θe is the "axis-angle vector", the angle θ multiplied by a unit vector e parallel to the axis. Also, the right-handed convention for the spatial coordinates is used (see orientation (vector space)), so that rotations are positive in the anticlockwise sense according to the right-hand rule, and negative in the clockwise sense. With these conventions; the rotation matrix rotates any 3d vector about the axis e through angle θ anticlockwise (an active transformation), which has the equivalent effect of rotating the coordinate frame clockwise about the same axis through the same angle (a passive transformation).
The rotation matrix is an orthogonal matrix, its transpose equals its inverse, and negating either the angle or axis in the rotation matrix corresponds to a rotation in the opposite sense, so the inverse transformation is readily obtained by
A boost followed or preceded by a rotation is also a Lorentz transformation, since these operations leave the spacetime interval invariant. The same Lorentz transformation has two decompositions for appropriately chosen rapidity and axis-angle vectors;
and if these are two decompositions are equal, the two boosts are related by
It turns out the equality between two boosts and a rotation followed or preceded by a single boost is correct: the rotation of frames matches the angular separation of the composite velocities, and explains how one composite velocity applies to one frame, while the other applies to the rotated frame. The rotation also breaks the symmetry in the overall Lorentz transformation making it nonsymmetric. For this specific rotation, let the angle be ε and the axis be defined by the unit vector e, so the axis-angle vector is ε = εe.
Altogether, two different orderings of two boosts means there are two inequivalent transformations. Each of these can be split into a boost then rotation, or a rotation then boost, doubling the number of inequivalent transformations to four. The inverse transformations are equally important; they provide information about what the other observer perceives. In all, there are eight transformations to consider, just for the problem of two Lorentz boosts. In summary, with subsequent operations acting on the left, they are
...split into a boost then rotation...
...or split into a rotation then boost.
Matching up the boosts followed by rotations, in the original setup, an observer in Σ notices Σ′′ to move with velocity u⊕v then rotate clockwise (first diagram), and because of the rotation an observer in Σ′′ notices Σ to move with velocity −v⊕u then rotate anticlockwise (second diagram). If the velocities are exchanged an observer in Σ notices Σ′′ to move with velocity v⊕u then rotate anticlockwise (third diagram), and because of the rotation an observer in Σ′′ notices Σ to move with velocity −u⊕v then rotate clockwise (fourth diagram).
The cases of rotations then boosts are similar (no diagrams are shown). Matching up the rotations followed by boosts, in the original setup, an observer in Σ notices Σ′′ to rotate clockwise then move with velocity v⊕u, and because of the rotation an observer in Σ′′ notices Σ to rotate anticlockwise then move with velocity −u⊕v. If the velocities are exchanged an observer in Σ notices Σ′′ to rotate anticlockwise then move with velocity u⊕v, and because of the rotation an observer in Σ′′ notices Σ to rotate clockwise then move with velocity −u⊕v.
Finding the axis and angle of the Thomas rotationEdit
The above formulae constitute the relativistic velocity addition and the Thomas rotation explicitly in the general Lorentz transformations. Throughout, in every composition of boosts and decomposition into a boost and rotation, the important formula
holds, allowing the rotation matrix to be defined completely in terms of the relative velocities u and v. The angle of a rotation matrix in the axis–angle representation can be found from the trace of the rotation matrix, the general result for any axis is tr(R) = 1 + 2 cos ε. Taking the trace of the equation gives
The angle ε between a and b is not the same as the angle α between u and v.
In both frames Σ and Σ′′, for every composition and decomposition, another important formula
holds. The vectors a and b are indeed related by a rotation, in fact by the same rotation matrix R which rotates the coordinate frames. Starting from a, the matrix R rotates this into b anticlockwise, it follows their cross product (in the right-hand convention)
defines the axis correctly, therefore the axis is also parallel to u×v. The magnitude of this pseudovector is neither interesting nor important, only the direction is, so it can be normalized into the unit vector
which still completely defines the direction of the axis without loss of information.
The rotation is simply a "static" rotation and there is no relative rotational motion between the frames, there is relative translational motion in the boost. However, if the frames accelerate, then the rotated frame rotates with an angular velocity. This effect is known as the Thomas precession, and arises purely from the kinematics of successive Lorentz boosts.
Finding the Thomas rotationEdit
The decomposition process described (below) can be carried through on the product of two pure Lorentz transformations to obtain explicitly the rotation of the coordinate axes resulting from the two successive "boosts". In general, the algebra involved is quite forbidding, more than enough, usually, to discourage any actual demonstration of the rotation matrix
In principle, it is pretty easy. Since every Lorentz transformation is a product of a boost and a rotation, the consecutive application of two pure boosts is a pure boost, either followed by or preceded by a pure rotation. Thus, suppose
The task is to glean from this equation the boost velocity w and the rotation R from the matrix entries of Λ. The coordinates of events are related by
Inverting this relation yields
Set x′ = (ct′, 0, 0, 0). Then xν will record the spacetime position of the origin of the primed system,
Multiplying this matrix with a pure rotation will not affect the zeroth columns and rows, and
which could have been anticipated from the formula for a simple boost in the x-direction, and for the relative velocity vector
Thus given with Λ, one obtains β and w by little more than inspection of Λ−1. (Of course, w can also be found using velocity addition per above.) From w, construct B(−w). The solution for R is then
With the ansatz
one finds by the same means
Finding a formal solution in terms of velocity parameters u and v involves first formally multiplying B(v)B(u), formally inverting, then reading off βw form the result, formally building B(−w) from the result, and, finally, formally multiplying B(−w)B(v)B(u). It should be clear that this is a daunting task, and it is difficult to interpret/identify the result as a rotation, though it is clear a priori that it is. It is these difficulties that the Goldstein quote at the top refers to. The problem has been thoroughly studied under simplifying assumptions over the years.
Group theoretical originEdit
Another way to explain the origin of the rotation is by looking at the generators of the Lorentz group.
Boosts from velocitiesEdit
The passage from a velocity to a boost is obtained as follows. An arbitrary boost is given by
where ζ is a triple of real numbers serving as coordinates on the boost subspace of the Lie algebra so(3, 1) spanned by the matrices
is called the boost parameter or boost vector, while its norm is the rapidity. Here β is the velocity parameter, the magnitude of the vector β = u/c.
While for ζ one has 0 ≤ ζ < ∞, the parameter β is confined within 0 ≤ β < 1, and hence 0 ≤ u < c. Thus
The set of velocities satisfying 0 ≤ u < c is an open ball in ℝ3 and is called the space of admissible velocities in the literature. It is endowed with a hyperbolic geometry described in the linked article.
The generators of boosts, K1, K2, K3, in different directions do not commute. This has the effect that two consecutive boosts is not a pure boost in general, but a rotation preceding a boost.
Consider a succession of boosts in the x direction, then the y direction, expanding each boost to first order
Returning to the group commutator, the commutation relations of the boost generators imply for a boost along the x then y directions, there will be a rotation about the z axis. In terms of the rapidities, the rotation angle θ is given by
equivalently expressible as
Spacetime diagrams for non-collinear boostsEdit
The familiar notion of vector addition for velocities in the Euclidean plane can be done in a triangular formation, or since vector addition is commutative, the vectors in both orderings geometrically form a parallelogram (see "parallelogram law"). This does not hold for relativistic velocity addition; instead a hyperbolic triangle arises whose edges are related to the rapidities of the boosts. Changing the order of the boost velocities, one does not find the resultant boost velocities to coincide.
^This preservation of orthogonality of coordinate axes should not be confused with preservation of angles between spacelike vectors taken at one and the same time in one system, which, of course, does not hold. The coordinate axes transform under the passive transformation presented, while the vectors transform under the corresponding active transformation.
^This is sometimes called the "Mocanu paradox". Mocanu himself didn't name it a paradox, but rather a "difficulty" within the framework of relativistic electrodynamics in a 1986 paper. He was also quick to acknowledge that the problem is explained by Thomas precession Mocanu (1992), but the name lingers on.
In the literature, the 3d rotation matrix R may be denoted by other letters, others use a name and the relative velocity vectors involved; e.g., tom[u, v] for "Thomas rotation" or gyr[u, v] for "gyration" (see gyrovector space). Correspondingly the 4d rotation matrix R (non-bold italic) in this article may be denoted
Macfarlane, A. J. (1962). "On the Restricted Lorentz Group and Groups Homomorphically Related to It". Journal of Mathematical Physics. 3 (6): 1116–1129. Bibcode:1962JMP.....3.1116M. doi:10.1063/1.1703854. hdl:2027/mdp.39015095220474.
Ben-Menahem, A. (1985). "Wigner's rotation revisited". Am. J. Phys. 53 (1): 62–66. Bibcode:1985AmJPh..53...62B. doi:10.1119/1.13953.
Ben-Menahem, S. (1986). "The Thomas precession and velocityspace curvature". J. Math. Phys. 27 (5): 1284–1286. Bibcode:1986JMP....27.1284B. doi:10.1063/1.527132.
Cushing, J. T. (1967). "Vector Lorentz Transformations". Am. J. Phys. 35 (9): 858–862. Bibcode:1967AmJPh..35..858C. doi:10.1119/1.1974267.
Ferraro, R., & Thibeault, M. (1999). "Generic composition of boosts: an elementary derivation of the Wigner rotation". European journal of physics20(3):143.
Mocanu, C.I. (1992). "On the relativistic velocity composition paradox and the Thomas rotation". Found. Phys. Lett. 5 (5): 443–456. Bibcode:1992FoPhL...5..443M. doi:10.1007/BF00690425. ISSN 0894-9875. S2CID 122472788.
Rebilas, K. (2013). "Comment on Elementary analysis of the special relativistic combination of velocities, Wigner rotation and Thomas precession". Eur. J. Phys. 34 (3): L55–L61. Bibcode:2013EJPh...34L..55R. doi:10.1088/0143-0807/34/3/L55. S2CID 122527454. (free access)
Rhodes, J. A.; Semon, M. D. (2005). "Relativistic velocity space, Wigner rotation and Thomas precession". Am. J. Phys. 72 (7): 943–960. arXiv:gr-qc/0501070v1. Bibcode:2005APS..NES..R001S. doi:10.1119/1.1652040. S2CID 14764378.
Thomas, L. H. (1926). "Motion of the spinning electron". Nature. 117 (2945): 514. Bibcode:1926Natur.117..514T. doi:10.1038/117514a0. S2CID 4084303.
Ungar, A. A. (1988). "Thomas rotation and parameterization of the Lorentz group". Foundations of Physics Letters. 1 (1): 57–81. Bibcode:1988FoPhL...1...57U. doi:10.1007/BF00661317. ISSN 0894-9875. S2CID 121240925.