In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all massive spin-1/2 particles, called "Dirac particles", such as electrons and quarks, for which parity is a symmetry. It is consistent with both the principles of quantum mechanics and the theory of special relativity,^{[1]} and was the first theory to account fully for special relativity in the context of quantum mechanics. It was validated by accounting for the fine structure of the hydrogen spectrum in a completely rigorous way.
The equation also implied the existence of a new form of matter, antimatter, which was previously unsuspected and unobserved and was experimentally confirmed several years later. It also provided a theoretical justification for the introduction of several-component wave functions in Pauli's phenomenological theory of spin. The wave functions in the Dirac theory are vectors of four complex numbers (known as bispinors), two of which resemble the Pauli wave function in the non-relativistic limit, in contrast to the Schrödinger equation, which describes wave functions of only one complex value. Moreover, in the limit of zero mass, the Dirac equation reduces to the Weyl equation.
Although Dirac did not at first fully appreciate the importance of his results, the entailed explanation of spin as a consequence of the union of quantum mechanics and relativity, together with the eventual discovery of the positron, represents one of the great triumphs of theoretical physics. This accomplishment has been described as fully on a par with the works of Newton, Maxwell, and Einstein before him.^{[2]} It has been deemed by some physicists to be the "real seed of modern physics".^{[3]} In the context of quantum field theory, the Dirac equation is reinterpreted to describe quantum fields corresponding to spin-1/2 particles.
The Dirac equation is inscribed upon a plaque on the floor of Westminster Abbey. Unveiled on 13 November 1995, the plaque commemorates Dirac's life.^{[4]}
In its modern formulation for field theory, the Dirac equation is written in terms of a Dirac spinor field $\psi$ taking values in a complex vector space described concretely as $\mathbb{C}^4$, defined on flat spacetime (Minkowski space) $\mathbb{R}^{1,3}$. Its expression also contains gamma matrices and a parameter $m > 0$ interpreted as the mass, as well as other physical constants. Dirac first obtained his equation through a factorization of Einstein's energy–momentum–mass equivalence relation, assuming a scalar product of momentum vectors determined by the metric tensor, and quantized the resulting relation by associating momenta to their respective operators.
In terms of a field $\psi : \mathbb{R}^{1,3} \to \mathbb{C}^4$, the Dirac equation is then
$$(i\hbar\gamma^\mu\partial_\mu - mc)\,\psi(x) = 0$$
and in natural units, with Feynman slash notation,
$$(i\partial\!\!\!/ - m)\,\psi(x) = 0.$$
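The gamma matrices are a set of four $4 \times 4$ complex matrices (elements of $\text{Mat}_{4\times 4}(\mathbb{C})$) which satisfy the defining anticommutation relations:
$$\{\gamma^\mu, \gamma^\nu\} = 2\eta^{\mu\nu} I_4,$$
where $\eta^{\mu\nu}$ is the Minkowski metric with signature $(+,-,-,-)$ and the indices $\mu, \nu$ run over 0, 1, 2, 3.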
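The defining relation $\{\gamma^\mu, \gamma^\nu\} = 2\eta^{\mu\nu} I_4$ can be checked numerically for a concrete choice of matrices. The following is a minimal sketch, assuming the Dirac (standard) representation and the signature $(+,-,-,-)$; the helper names (`block`, `anticommutator`, `clifford_holds`) are illustrative and do not come from the text.

```python
# Pauli matrices and 2x2 building blocks
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def neg(m):
    """Entrywise negation of a matrix."""
    return [[-x for x in row] for row in m]

def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def anticommutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(4)] for i in range(4)]

# Assumed Dirac representation: gamma^0 = diag(I, -I), gamma^k = [[0, s_k], [-s_k, 0]]
GAMMA = [block(I2, Z2, Z2, neg(I2))] + [block(Z2, s, neg(s), Z2) for s in SIGMA]
ETA = [1, -1, -1, -1]  # diagonal of the Minkowski metric, signature (+,-,-,-)

def clifford_holds():
    """True if {gamma^mu, gamma^nu} = 2 eta^{mu nu} I_4 for all mu, nu."""
    for mu in range(4):
        for nu in range(4):
            scale = 2 * ETA[mu] if mu == nu else 0
            acomm = anticommutator(GAMMA[mu], GAMMA[nu])
            for i in range(4):
                for j in range(4):
                    if acomm[i][j] != (scale if i == j else 0):
                        return False
    return True

print(clifford_holds())
```

All entries are small integers or Gaussian integers, so the comparison is exact and needs no floating-point tolerance.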
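The slash notation is a compact notation for
$$A\!\!\!/ := \gamma^\mu A_\mu,$$
where $A$ is a four-vector (often the four-vector differential operator $\partial_\mu$) and summation over the repeated index is implied.
The Dirac adjoint of the spinor field $\psi(x)$ is defined as
$$\bar\psi(x) = \psi(x)^\dagger\gamma^0.$$
Taking the Hermitian conjugate of the Dirac equation, multiplying on the right by $\gamma^0$ and using $(\gamma^\mu)^\dagger = \gamma^0\gamma^\mu\gamma^0$ gives the adjoint Dirac equation, which in natural units reads $i\,\partial_\mu\bar\psi\,\gamma^\mu + m\bar\psi = 0$.
Applying $i\partial\!\!\!/ + m$ to the Dirac equation gives
$$(\partial^\mu\partial_\mu + m^2)\,\psi(x) = 0,$$
so each component of the Dirac spinor field satisfies the Klein–Gordon equation.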
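A conserved current of the theory is
$$J^\mu = \bar\psi\gamma^\mu\psi.$$
Adding the Dirac equation, multiplied on the left by $\bar\psi$, and the adjoint Dirac equation, multiplied on the right by $\psi$, gives
$$i\left((\partial_\mu\bar\psi)\gamma^\mu\psi + \bar\psi\gamma^\mu\partial_\mu\psi\right) = 0,$$
so by the Leibniz rule $\partial_\mu J^\mu = 0$.
Another approach to derive this expression is by variational methods, applying Noether's theorem for the global $U(1)$ symmetry to derive the conserved current $J^\mu$.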
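Recall the Lagrangian is
$$\mathcal{L} = \bar\psi(i\gamma^\mu\partial_\mu - m)\psi.$$
Under the $U(1)$ transformation $\psi \mapsto e^{i\alpha}\psi$, $\bar\psi \mapsto e^{-i\alpha}\bar\psi$, the Lagrangian is invariant.
Now considering the variation parameter $\alpha$ to be infinitesimal, we work at first order in $\alpha$ and ignore $\mathcal{O}(\alpha^2)$ terms. From the previous discussion we immediately see that the explicit variation in the Lagrangian due to $\alpha$ vanishes, that is, under the variation,
$$\mathcal{L} \mapsto \mathcal{L} + \delta\mathcal{L}, \qquad \delta\mathcal{L} = 0.$$
As part of Noether's theorem, we find the implicit variation in the Lagrangian due to the variation of the fields. If the equations of motion for $\psi, \bar\psi$ are satisfied, then
$$\delta\mathcal{L} = \partial_\mu\left(\frac{\partial\mathcal{L}}{\partial(\partial_\mu\psi)}\,\delta\psi + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\bar\psi)}\,\delta\bar\psi\right). \qquad (*)$$
This immediately simplifies, as there are no partial derivatives of $\bar\psi$ in the Lagrangian. Here $\delta\psi$ is the infinitesimal variation
$$\delta\psi(x) = i\alpha\,\psi(x),$$
and $\partial\mathcal{L}/\partial(\partial_\mu\psi) = i\bar\psi\gamma^\mu$, so equation (*) becomes
$$0 = -\alpha\,\partial_\mu(\bar\psi\gamma^\mu\psi),$$
which is the conservation of the current $\bar\psi\gamma^\mu\psi$.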
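Since the Dirac operator acts on 4-tuples of square-integrable functions, its solutions should be members of the same Hilbert space. The fact that the energies of the solutions do not have a lower bound is unexpected.
Plane-wave solutions are those arising from an ansatz
$$\psi(x) = u(\mathbf{p})\,e^{-ip\cdot x},$$
which models a particle with definite 4-momentum $p = (E_\mathbf{p}, \mathbf{p})$, where $E_\mathbf{p} = \sqrt{m^2 + |\mathbf{p}|^2}$.
For this ansatz, the Dirac equation becomes an equation for $u(\mathbf{p})$:
$$(\gamma^\mu p_\mu - m)\,u(\mathbf{p}) = 0.$$
For example, in the chiral representation for $\gamma^\mu$, the solution space is parametrised by a $\mathbb{C}^2$ vector $\xi$, with
$$u(\mathbf{p}) = \begin{pmatrix}\sqrt{\sigma^\mu p_\mu}\,\xi\\ \sqrt{\bar\sigma^\mu p_\mu}\,\xi\end{pmatrix},$$
where $\sigma^\mu = (I_2, \sigma^i)$, $\bar\sigma^\mu = (I_2, -\sigma^i)$ and $\sqrt{\cdot}$ denotes the Hermitian matrix square root.
These plane-wave solutions provide a starting point for canonical quantization.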
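As a rough numerical illustration, the chiral-representation solution $u(\mathbf{p}) = (\sqrt{\sigma^\mu p_\mu}\,\xi, \sqrt{\bar\sigma^\mu p_\mu}\,\xi)$ can be substituted back into $(\gamma^\mu p_\mu - m)\,u = 0$. The sketch below assumes natural units, a chiral representation in which $\gamma^\mu p_\mu$ has off-diagonal blocks $\sigma^\mu p_\mu$ (upper right) and $\bar\sigma^\mu p_\mu$ (lower left), and momentum along the $z$ axis so that both $2\times 2$ blocks are diagonal and their square roots are elementary. The function name `plane_wave_residual` is hypothetical.

```python
import math

def plane_wave_residual(m, pz, xi):
    """Components of (gamma^mu p_mu - m) u(p) for momentum (0, 0, pz).

    Assumes m > 0 and the chiral representation, where
    p.sigma = E - pz*sigma_3 and p.sigmabar = E + pz*sigma_3 are diagonal.
    """
    energy = math.sqrt(m * m + pz * pz)
    p_sigma = [energy - pz, energy + pz]      # diagonal of sigma^mu p_mu
    p_sigma_bar = [energy + pz, energy - pz]  # diagonal of sigmabar^mu p_mu

    # u(p) = (sqrt(p.sigma) xi, sqrt(p.sigmabar) xi)
    upper = [math.sqrt(p_sigma[a]) * xi[a] for a in range(2)]
    lower = [math.sqrt(p_sigma_bar[a]) * xi[a] for a in range(2)]

    # gamma^mu p_mu maps (upper, lower) to (p.sigma lower, p.sigmabar upper)
    res_upper = [p_sigma[a] * lower[a] - m * upper[a] for a in range(2)]
    res_lower = [p_sigma_bar[a] * upper[a] - m * lower[a] for a in range(2)]
    return res_upper + res_lower

residual = plane_wave_residual(1.0, 2.0, [1.0, 0.5j])
print(max(abs(r) for r in residual))  # expected to vanish up to rounding error
```

The residual vanishes because $(\sigma^\mu p_\mu)(\bar\sigma^\nu p_\nu) = p^2 = m^2$, so each block of the equation reduces to $\sqrt{m^2} - m = 0$.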
Both the Dirac equation and the adjoint Dirac equation can be obtained by varying the action with the Lagrangian density
$$\mathcal{L} = i\hbar c\,\bar\psi\gamma^\mu\partial_\mu\psi - mc^2\,\bar\psi\psi.$$
If one varies this with respect to $\psi$ one gets the adjoint Dirac equation. Meanwhile, if one varies this with respect to $\bar\psi$ one gets the Dirac equation.
In natural units and with the slash notation, the action is then
$$S = \int d^4x\,\bar\psi\,(i\partial\!\!\!/ - m)\,\psi.$$
For this action, the conserved current $J^\mu$ above arises as the conserved current corresponding to the global $U(1)$ symmetry through Noether's theorem for field theory. Gauging this field theory by changing the symmetry to a local, spacetime-point-dependent one gives gauge symmetry (really, gauge redundancy). The resultant theory is quantum electrodynamics or QED. See below for a more detailed discussion.
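The Dirac equation is invariant under Lorentz transformations, that is, under the action of the Lorentz group $\text{SO}(1,3)$ or strictly $\text{SO}(1,3)^+$, the component connected to the identity.
For a Dirac spinor viewed concretely as taking values in $\mathbb{C}^4$, the transformation under a Lorentz transformation $\Lambda$ is given by a $4\times 4$ complex matrix $S[\Lambda]$. There are some subtleties in defining the corresponding $S[\Lambda]$, as well as a standard abuse of notation.
Most treatments occur at the Lie algebra level. The Lorentz group of $4\times 4$ real matrices acting on $\mathbb{R}^{1,3}$ is generated by a set of six matrices $\{M^{\mu\nu}\}$ with components
$$(M^{\mu\nu})^\rho{}_\sigma = \eta^{\mu\rho}\delta^\nu{}_\sigma - \eta^{\nu\rho}\delta^\mu{}_\sigma.$$
These satisfy the Lorentz algebra commutation relations
$$[M^{\mu\nu}, M^{\rho\sigma}] = M^{\mu\sigma}\eta^{\nu\rho} - M^{\nu\sigma}\eta^{\mu\rho} + M^{\nu\rho}\eta^{\mu\sigma} - M^{\mu\rho}\eta^{\nu\sigma}.$$
The spin generators $S^{\mu\nu} = \tfrac14[\gamma^\mu,\gamma^\nu]$ satisfy the same commutation relations.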
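The Lorentz algebra can be verified by brute force over all index combinations. This is a minimal sketch, assuming the generators $(M^{\mu\nu})^\rho{}_\sigma = \eta^{\mu\rho}\delta^\nu{}_\sigma - \eta^{\nu\rho}\delta^\mu{}_\sigma$ and the commutator $[M^{\mu\nu}, M^{\rho\sigma}] = M^{\mu\sigma}\eta^{\nu\rho} - M^{\nu\sigma}\eta^{\mu\rho} + M^{\nu\rho}\eta^{\mu\sigma} - M^{\mu\rho}\eta^{\nu\sigma}$ with signature $(+,-,-,-)$; the function names are illustrative only.

```python
ETA = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def generator(mu, nu):
    """Matrix M^{mu nu}, stored as rows rho (upper index), columns sigma (lower)."""
    return [[ETA[mu][rho] * (nu == sigma) - ETA[nu][rho] * (mu == sigma)
             for sigma in range(4)] for rho in range(4)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(4)] for i in range(4)]

def algebra_rhs(mu, nu, rho, sigma):
    """Right-hand side of the assumed commutation relation."""
    terms = [
        (ETA[nu][rho], generator(mu, sigma)),
        (-ETA[mu][rho], generator(nu, sigma)),
        (ETA[mu][sigma], generator(nu, rho)),
        (-ETA[nu][sigma], generator(mu, rho)),
    ]
    return [[sum(c * g[i][j] for c, g in terms) for j in range(4)]
            for i in range(4)]

def lorentz_algebra_holds():
    idx = range(4)
    return all(
        commutator(generator(m, n), generator(r, s)) == algebra_rhs(m, n, r, s)
        for m in idx for n in idx for r in idx for s in idx
    )

print(lorentz_algebra_holds())
```

Because the generators have integer entries, the check is exact. Antisymmetry $M^{\mu\nu} = -M^{\nu\mu}$ leaves six independent generators: three rotations and three boosts.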
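A Lorentz transformation $\Lambda$ can be written as
$$\Lambda = \exp\left(\tfrac12\omega_{\mu\nu}M^{\mu\nu}\right),$$
where the components $\omega_{\mu\nu}$ are antisymmetric in $\mu, \nu$.
The corresponding transformation on spin space is
$$S[\Lambda] = \exp\left(\tfrac12\omega_{\mu\nu}S^{\mu\nu}\right), \qquad S^{\mu\nu} = \tfrac14[\gamma^\mu,\gamma^\nu].$$
Under a Lorentz transformation the field transforms as $\psi(x) \mapsto S[\Lambda]\,\psi(\Lambda^{-1}x)$, and the Dirac equation
$$i\gamma^\mu\partial_\mu\psi(x) - m\psi(x) = 0$$
becomes
$$i\gamma^\mu(\Lambda^{-1})^\nu{}_\mu\,S[\Lambda]\,(\partial_\nu\psi)(\Lambda^{-1}x) - m\,S[\Lambda]\,\psi(\Lambda^{-1}x) = 0.$$
Multiplying both sides from the left by $S[\Lambda]^{-1}$ and returning the dummy variable to $x$ gives
$$i\,S[\Lambda]^{-1}\gamma^\mu S[\Lambda]\,(\Lambda^{-1})^\nu{}_\mu\,\partial_\nu\psi(x) - m\psi(x) = 0,$$
which reduces to the original equation provided $S[\Lambda]^{-1}\gamma^\mu S[\Lambda] = \Lambda^\mu{}_\nu\gamma^\nu$.
Associated to Lorentz invariance is a conserved Noether current, or rather a tensor of conserved Noether currents $(\mathcal{J}^{\rho\sigma})^\mu$. Similarly, since the equation is invariant under translations, there is a tensor of conserved Noether currents $T^{\mu\nu}$, which can be identified as the stress–energy tensor of the theory. The Lorentz current can be written in terms of the stress–energy tensor in addition to a tensor representing internal angular momentum.
The Dirac equation was also used (historically) to define a quantum-mechanical theory where $\psi(x)$ is instead interpreted as a wave function.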
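The Dirac equation in the form originally proposed by Dirac is:^{[5]}
$$\left(\beta mc^2 + c\sum_{n=1}^{3}\alpha_n p_n\right)\psi(x,t) = i\hbar\,\frac{\partial\psi(x,t)}{\partial t},$$
where $\psi(x,t)$ is the wave function for an electron of rest mass $m$ with spacetime coordinates $x, t$; $p_1, p_2, p_3$ are the components of the momentum, understood as the momentum operator of the Schrödinger theory; $c$ is the speed of light and $\hbar$ is the reduced Planck constant.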
Dirac's purpose in casting this equation was to explain the behavior of the relativistically moving electron, and so to allow the atom to be treated in a manner consistent with relativity. His rather modest hope was that the corrections introduced this way might have a bearing on the problem of atomic spectra.
Up until that time, attempts to make the old quantum theory of the atom compatible with the theory of relativity, which were based on discretizing the angular momentum stored in the electron's possibly non-circular orbit of the atomic nucleus, had failed, and the new quantum mechanics of Heisenberg, Pauli, Jordan, Schrödinger, and Dirac himself had not developed sufficiently to treat this problem. Although Dirac's original intentions were satisfied, his equation had far deeper implications for the structure of matter and introduced new mathematical classes of objects that are now essential elements of fundamental physics.
The new elements in this equation are the four 4 × 4 matrices α_{1}, α_{2}, α_{3} and β, and the four-component wave function ψ. There are four components in ψ because the evaluation of it at any given point in configuration space is a bispinor. It is interpreted as a superposition of a spin-up electron, a spin-down electron, a spin-up positron, and a spin-down positron.
The 4 × 4 matrices α_{k} and β are all Hermitian and are involutory:
$$\alpha_k^2 = \beta^2 = I_4,$$
and they all mutually anticommute:
$$\alpha_i\alpha_j + \alpha_j\alpha_i = 0 \quad (i \neq j), \qquad \alpha_i\beta + \beta\alpha_i = 0.$$
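These algebraic properties are easy to confirm for an explicit set of matrices. The sketch below assumes the standard (Dirac) representation, in which each $\alpha_k$ has the Pauli matrix $\sigma_k$ in both off-diagonal blocks and $\beta = \mathrm{diag}(I_2, -I_2)$; the helper names are hypothetical.

```python
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def dagger(m):
    """Conjugate transpose."""
    return [[complex(m[j][i]).conjugate() for j in range(4)] for i in range(4)]

IDENTITY = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
ZERO = [[0] * 4 for _ in range(4)]

# Assumed standard representation of Dirac's matrices
ALPHA = [block(Z2, s, s, Z2) for s in SIGMA]
BETA = block(I2, Z2, Z2, [[-1, 0], [0, -1]])

def check_alpha_beta():
    """Return (hermitian, involutory, mutually_anticommuting) as booleans."""
    mats = ALPHA + [BETA]
    hermitian = all(dagger(m) == m for m in mats)
    involutory = all(matmul(m, m) == IDENTITY for m in mats)
    anticommuting = all(
        [[x + y for x, y in zip(r1, r2)]
         for r1, r2 in zip(matmul(a, b), matmul(b, a))] == ZERO
        for i, a in enumerate(mats) for b in mats[i + 1:]
    )
    return hermitian, involutory, anticommuting

print(check_alpha_beta())
```

Any other representation related to this one by a unitary similarity transformation would pass the same three checks.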
These matrices and the form of the wave function have a deep mathematical significance. The algebraic structure represented by the gamma matrices had been created some 50 years earlier by the English mathematician W. K. Clifford. In turn, Clifford's ideas had emerged from the mid-19th-century work of the German mathematician Hermann Grassmann in his Lineare Ausdehnungslehre (Theory of Linear Expansion). The latter had been regarded as almost incomprehensible by most of his contemporaries. The appearance of something so seemingly abstract, at such a late date, and in such a direct physical manner, is one of the most remarkable chapters in the history of physics. It is also a validation of the exquisite insight displayed by the mathematicians Grassmann and Clifford.
The single symbolic equation thus unravels into four coupled linear first-order partial differential equations for the four quantities that make up the wave function. The equation can be written more explicitly in Planck units as:^{[6]}
$$\begin{aligned}
i\partial_t\psi_1 &= -i(\partial_x - i\partial_y)\psi_4 - i\partial_z\psi_3 + m\psi_1,\\
i\partial_t\psi_2 &= -i(\partial_x + i\partial_y)\psi_3 + i\partial_z\psi_4 + m\psi_2,\\
i\partial_t\psi_3 &= -i(\partial_x - i\partial_y)\psi_2 - i\partial_z\psi_1 - m\psi_3,\\
i\partial_t\psi_4 &= -i(\partial_x + i\partial_y)\psi_1 + i\partial_z\psi_2 - m\psi_4,
\end{aligned}$$
where the component form shown assumes $\hbar = c = 1$ and the standard (Dirac) representation of α_{k} and β.
The Dirac equation is superficially similar to the Schrödinger equation for a massive free particle:
$$-\frac{\hbar^2}{2m}\nabla^2\psi = i\hbar\,\frac{\partial\psi}{\partial t}.$$
The left side represents the square of the momentum operator divided by twice the mass, which is the non-relativistic kinetic energy. Because relativity treats space and time as a whole, a relativistic generalization of this equation requires that space and time derivatives enter symmetrically, as they do in the Maxwell equations that govern the behavior of light; the equations must be differentially of the same order in space and time. In relativity, the momentum and the energy are the space and time parts of a spacetime vector, the four-momentum, and they are related by the relativistically invariant relation
$$E^2 = m^2c^4 + p^2c^2,$$
which says that the length of this four-vector is proportional to the rest mass m. Substituting the operator equivalents of the energy and momentum from the Schrödinger theory produces the Klein–Gordon equation describing the propagation of waves, constructed from relativistically invariant objects,
$$\left(-\frac{1}{c^2}\frac{\partial^2}{\partial t^2} + \nabla^2\right)\psi = \frac{m^2c^2}{\hbar^2}\,\psi,$$
with the wave function ψ being a relativistic scalar.
In the Schrödinger theory, the probability density $\rho = \psi^*\psi$ is positive definite and is convected by the probability current $\mathbf{J} = -\frac{i\hbar}{2m}(\psi^*\nabla\psi - \psi\nabla\psi^*)$ according to the continuity equation $\nabla\cdot\mathbf{J} + \partial\rho/\partial t = 0$. The fact that the density is positive definite and convected according to this continuity equation implies that one may integrate the density over a certain domain and set the total to 1, and this condition will be maintained by the conservation law. A proper relativistic theory with a probability density current must also share this feature. To maintain the notion of a convected density, one must generalize the Schrödinger expression of the density and current so that space and time derivatives again enter symmetrically in relation to the scalar wave function. The Schrödinger expression can be kept for the current, but the probability density must be replaced by the symmetrically formed expression
$$\rho = \frac{i\hbar}{2mc^2}\left(\psi^*\partial_t\psi - \psi\,\partial_t\psi^*\right).$$
The continuity equation is as before. Everything is compatible with relativity now, but the expression for the density is no longer positive definite; the initial values of both ψ and ∂_{t}ψ may be freely chosen, and the density may thus become negative, something that is impossible for a legitimate probability density. Thus, one cannot get a simple generalization of the Schrödinger equation under the naive assumption that the wave function is a relativistic scalar, and the equation it satisfies, second order in time.
Although it is not a successful relativistic generalization of the Schrödinger equation, this equation is resurrected in the context of quantum field theory, where it is known as the Klein–Gordon equation, and describes a spinless particle field (e.g. pi meson or Higgs boson). Historically, Schrödinger himself arrived at this equation before the one that bears his name but soon discarded it. In the context of quantum field theory, the indefinite density is understood to correspond to the charge density, which can be positive or negative, and not the probability density.
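Dirac thus thought to try an equation that was first order in both space and time. He postulated an equation of the form
$$E\psi = (\boldsymbol\alpha\cdot\mathbf{p} + \beta m)\,\psi,$$
where the operators $(\boldsymbol\alpha, \beta)$ must be independent of $(\mathbf{p}, t)$ for linearity and independent of $(\mathbf{x}, t)$ for space-time homogeneity.
One could, for example, formally (i.e. by abuse of notation) take the relativistic expression for the energy
$$E = c\sqrt{p^2 + m^2c^2},$$
replace $p$ by its operator equivalent, expand the square root in an infinite series of derivative operators, set up an eigenvalue problem, and then solve the equation formally by iteration. Most physicists had little faith in such a process, even if it were technically possible.
As the story goes, Dirac was staring into the fireplace at Cambridge, pondering this problem, when he hit upon the idea of taking the square root of the wave operator (see also half derivative) thus:
$$\nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2} = \left(A\partial_x + B\partial_y + C\partial_z + \frac{i}{c}D\partial_t\right)\left(A\partial_x + B\partial_y + C\partial_z + \frac{i}{c}D\partial_t\right).$$
On multiplying out the right side it is apparent that, in order to get all the cross-terms such as ∂_{x}∂_{y} to vanish, one must assume
$$AB + BA = 0, \;\ldots$$
for every pair of distinct coefficients, with
$$A^2 = B^2 = C^2 = D^2 = 1.$$
Dirac, who had just then been intensely involved with working out the foundations of Heisenberg's matrix mechanics, immediately understood that these conditions could be met if A, B, C and D are matrices, with the implication that the wave function has multiple components. This immediately explained the appearance of two-component wave functions in Pauli's phenomenological theory of spin, something that up until then had been regarded as mysterious, even to Pauli himself. However, one needs at least 4 × 4 matrices to set up a system with the properties required, so the wave function had four components, not two, as in the Pauli theory, or one, as in the bare Schrödinger theory. The four-component wave function represents a new class of mathematical object in physical theories that makes its first appearance here.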
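Given the factorization in terms of these matrices, one can now write down immediately an equation
$$\left(A\partial_x + B\partial_y + C\partial_z + \frac{i}{c}D\partial_t\right)\psi = \kappa\psi,$$
with $\kappa$ to be determined. Applying the matrix operator again on both sides yields
$$\left(\nabla^2 - \frac{1}{c^2}\partial_t^2\right)\psi = \kappa^2\psi.$$
Taking $\kappa = mc/\hbar$ shows that all the components of the wave function individually satisfy the relativistic energy–momentum relation. Thus the sought-for equation that is first-order in both space and time is
$$\left(A\partial_x + B\partial_y + C\partial_z + \frac{i}{c}D\partial_t - \frac{mc}{\hbar}\right)\psi = 0.$$
Setting
$$A = i\beta\alpha_1, \quad B = i\beta\alpha_2, \quad C = i\beta\alpha_3, \quad D = \beta,$$
and using $D^2 = \beta^2 = I_4$, the Dirac equation is produced as written above.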
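The "square root" idea can be made concrete: the first-order Hamiltonian $H = \boldsymbol\alpha\cdot\mathbf{p} + \beta m$ should square to $(|\mathbf{p}|^2 + m^2)\,I_4$, which is the relativistic energy–momentum relation in natural units. The following is a minimal sketch under the assumption of the standard representation of $\alpha_k$ and $\beta$, with momentum treated as a plain vector of numbers rather than an operator; `dirac_hamiltonian` and `squares_to_energy_relation` are illustrative names.

```python
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

ALPHA = [block(Z2, s, s, Z2) for s in SIGMA]
BETA = block(I2, Z2, Z2, [[-1, 0], [0, -1]])

def dirac_hamiltonian(p, m):
    """H = alpha . p + beta * m in natural units (hbar = c = 1)."""
    h = [[m * BETA[i][j] for j in range(4)] for i in range(4)]
    for k in range(3):
        for i in range(4):
            for j in range(4):
                h[i][j] += p[k] * ALPHA[k][i][j]
    return h

def squares_to_energy_relation(p, m):
    """True if H^2 equals (|p|^2 + m^2) times the identity."""
    h = dirac_hamiltonian(p, m)
    h2 = matmul(h, h)
    e2 = sum(pk * pk for pk in p) + m * m
    return all(h2[i][j] == (e2 if i == j else 0)
               for i in range(4) for j in range(4))

# Integer inputs keep the arithmetic exact: |p|^2 + m^2 = 9 + 16 = 25
print(squares_to_energy_relation((1, 2, 2), 4))
```

Since $H^2 = E^2 I_4$ with $E = 5$ in this example, the eigenvalues of $H$ can only be $+5$ or $-5$, which is the origin of the negative-energy solutions discussed later in the article.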
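To demonstrate the relativistic invariance of the equation, it is advantageous to cast it into a form in which the space and time derivatives appear on an equal footing. New matrices are introduced as follows:
$$D = \gamma^0, \quad A = i\gamma^1, \quad B = i\gamma^2, \quad C = i\gamma^3,$$
and the equation takes the form (remembering that $\partial_0 = \frac{1}{c}\partial_t$)
$$(i\hbar\gamma^\mu\partial_\mu - mc)\,\psi = 0,$$
where there is an implied summation over the values of the twice-repeated index μ = 0, 1, 2, 3, and ∂_{μ} is the 4-gradient. In practice one often writes the gamma matrices in terms of 2 × 2 sub-matrices taken from the Pauli matrices and the 2 × 2 identity matrix. Explicitly the standard representation is
$$\gamma^0 = \begin{pmatrix} I_2 & 0\\ 0 & -I_2\end{pmatrix}, \qquad \gamma^k = \begin{pmatrix}0 & \sigma^k\\ -\sigma^k & 0\end{pmatrix}, \quad k = 1, 2, 3.$$
The complete system is summarized using the Minkowski metric on spacetime in the form
$$\{\gamma^\mu,\gamma^\nu\} = 2\eta^{\mu\nu}I_4,$$
where $\{a, b\} = ab + ba$ denotes the anticommutator.
The Dirac equation may now be interpreted as an eigenvalue equation, where the rest mass is proportional to an eigenvalue of the 4-momentum operator, the proportionality constant being the speed of light:
$$\hat{P}\psi = mc\,\psi, \qquad \hat{P} = i\hbar\gamma^\mu\partial_\mu.$$
Using $\partial\!\!\!/ := \gamma^\mu\partial_\mu$ ($\partial\!\!\!/$ is pronounced "d-slash"),^{[8]} according to Feynman slash notation, the Dirac equation becomes:
$$i\hbar\,\partial\!\!\!/\psi - mc\,\psi = 0.$$
In practice, physicists often use units of measure such that ħ = c = 1, known as natural units. The equation then takes the simple form
$$(i\partial\!\!\!/ - m)\,\psi = 0.$$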
A fundamental theorem states that if two distinct sets of matrices are given that both satisfy the Clifford relations, then they are connected to each other by a similarity transformation:
$$\gamma^{\mu\prime} = S^{-1}\gamma^\mu S.$$
If in addition the matrices are all unitary, as are the Dirac set, then S itself is unitary:
$$\gamma^{\mu\prime} = U^\dagger\gamma^\mu U.$$
The transformation U is unique up to a multiplicative factor of absolute value 1. Let us now imagine a Lorentz transformation to have been performed on the space and time coordinates, and on the derivative operators, which form a covariant vector. For the operator γ^{μ}∂_{μ} to remain invariant, the gammas must transform among themselves as a contravariant vector with respect to their spacetime index. These new gammas will themselves satisfy the Clifford relations, because of the orthogonality of the Lorentz transformation. By the fundamental theorem, one may replace the new set by the old set subject to a unitary transformation. In the new frame, remembering that the rest mass is a relativistic scalar, the Dirac equation will then take the form
$$(iU^\dagger\gamma^\mu U\,\partial_\mu' - m)\,\psi(x', t') = 0, \qquad\text{that is,}\qquad U^\dagger(i\gamma^\mu\partial_\mu' - m)\,U\psi(x', t') = 0.$$
If the transformed spinor is defined as
$$\psi' = U\psi,$$
then the transformed Dirac equation is produced in a way that demonstrates manifest relativistic invariance:
$$(i\gamma^\mu\partial_\mu' - m)\,\psi'(x', t') = 0.$$
Thus, settling on any unitary representation of the gammas is final, provided the spinor is transformed according to the unitary transformation that corresponds to the given Lorentz transformation.
The various representations of the Dirac matrices employed will bring into focus particular aspects of the physical content in the Dirac wave function. The representation shown here is known as the standard representation; in it, the wave function's upper two components go over into Pauli's 2-spinor wave function in the limit of low energies and small velocities in comparison to light.
The considerations above reveal the origin of the gammas in geometry, hearkening back to Grassmann's original motivation; they represent a fixed basis of unit vectors in spacetime. Similarly, products of the gammas such as γ_{μ}γ_{ν} represent oriented surface elements, and so on. With this in mind, one can find the form of the unit volume element on spacetime in terms of the gammas as follows. By definition, it is
$$V = \frac{1}{4!}\,\epsilon_{\mu\nu\alpha\beta}\,\gamma^\mu\gamma^\nu\gamma^\alpha\gamma^\beta.$$
For this to be an invariant, the epsilon symbol must be a tensor, and so must contain a factor of √g, where g is the determinant of the metric tensor. Since this is negative, that factor is imaginary. Thus
$$V = i\gamma^0\gamma^1\gamma^2\gamma^3.$$
This matrix is given the special symbol γ^{5}, owing to its importance when one is considering improper transformations of spacetime, that is, those that change the orientation of the basis vectors. In the standard representation, it is
$$\gamma^5 = \begin{pmatrix}0 & I_2\\ I_2 & 0\end{pmatrix}.$$
This matrix will also be found to anticommute with the other four Dirac matrices:
$$\gamma^5\gamma^\mu + \gamma^\mu\gamma^5 = 0.$$
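Both statements about γ^{5} can be checked directly by matrix multiplication. The sketch below assumes the standard representation ($\gamma^0 = \mathrm{diag}(I_2, -I_2)$, $\gamma^k$ with blocks $\sigma^k$ and $-\sigma^k$ off the diagonal) and the definition $\gamma^5 = i\gamma^0\gamma^1\gamma^2\gamma^3$; the function names are illustrative.

```python
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def neg(m):
    return [[-x for x in row] for row in m]

def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

GAMMA = [block(I2, Z2, Z2, neg(I2))] + [block(Z2, s, neg(s), Z2) for s in SIGMA]

def gamma5():
    """gamma^5 = i * gamma^0 gamma^1 gamma^2 gamma^3."""
    product = matmul(matmul(GAMMA[0], GAMMA[1]), matmul(GAMMA[2], GAMMA[3]))
    return [[1j * x for x in row] for row in product]

def anticommutes_with_all(g5):
    """True if gamma^5 gamma^mu + gamma^mu gamma^5 = 0 for mu = 0..3."""
    for g in GAMMA:
        left, right = matmul(g5, g), matmul(g, g5)
        if any(left[i][j] + right[i][j] != 0
               for i in range(4) for j in range(4)):
            return False
    return True

G5 = gamma5()
EXPECTED = block(Z2, I2, I2, Z2)  # off-diagonal identity blocks
print(G5 == EXPECTED, anticommutes_with_all(G5))
```

In a different representation the explicit form of γ^{5} changes (in the chiral representation it is diagonal), but the anticommutation property is representation independent.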
It takes a leading role when questions of parity arise because the volume element as a directed magnitude changes sign under a spacetime reflection. Taking the positive square root above thus amounts to choosing a handedness convention on spacetime.
The necessity of introducing half-integer spin goes back experimentally to the results of the Stern–Gerlach experiment. A beam of atoms is run through a strong inhomogeneous magnetic field, and the beam then splits into N parts depending on the intrinsic angular momentum of the atoms. It was found that for silver atoms the beam was split in two; the angular momentum of the ground state therefore could not be an integer, because even if the intrinsic angular momentum of the atoms were as small as possible, 1, the beam would be split into three parts, corresponding to atoms with L_{z} = −1, 0, +1. The conclusion is that silver atoms have net intrinsic angular momentum of 1/2. Pauli set up a theory which explained this splitting by introducing a two-component wave function and a corresponding correction term in the Hamiltonian, representing a semi-classical coupling of this wave function to an applied magnetic field, as follows in SI units (bold-faced characters denote Euclidean vectors in 3 dimensions, whereas the Minkowski four-vector A_{μ} can be defined as $A_\mu = (\phi/c, -\mathbf{A})$):
$$H = \frac{1}{2m}\left(\boldsymbol\sigma\cdot(\mathbf{p} - q\mathbf{A})\right)^2 + q\phi.$$
Here $\mathbf{A}$ and $\phi$ represent the components of the electromagnetic four-potential in their standard SI units, and the three sigmas are the Pauli matrices. On squaring out the first term, a residual interaction with the magnetic field is found, along with the usual classical Hamiltonian of a charged particle interacting with an applied field in SI units:
$$H = \frac{1}{2m}(\mathbf{p} - q\mathbf{A})^2 + q\phi - \frac{q\hbar}{2m}\,\boldsymbol\sigma\cdot\mathbf{B}.$$
This Hamiltonian is now a 2 × 2 matrix, so the Schrödinger equation based on it must use a two-component wave function. On introducing the external electromagnetic 4-vector potential into the Dirac equation in a similar way, known as minimal coupling, it takes the form:
$$\left(\gamma^\mu(i\hbar\partial_\mu - qA_\mu) - mc\right)\psi = 0.$$
A second application of the Dirac operator will now reproduce the Pauli term exactly as before, because the spatial Dirac matrices, multiplied by i, have the same squaring and commutation properties as the Pauli matrices. What is more, the value of the gyromagnetic ratio of the electron, standing in front of Pauli's new term, is explained from first principles. This was a major achievement of the Dirac equation and gave physicists great faith in its overall correctness. There is more, however. The Pauli theory may be seen as the low-energy limit of the Dirac theory in the following manner. First the equation is written in the form of coupled equations for 2-spinors with the SI units restored:
$$\begin{pmatrix} mc^2 - E + q\phi & c\,\boldsymbol\sigma\cdot(\mathbf{p} - q\mathbf{A})\\ -c\,\boldsymbol\sigma\cdot(\mathbf{p} - q\mathbf{A}) & mc^2 + E - q\phi\end{pmatrix}\begin{pmatrix}\psi_+\\ \psi_-\end{pmatrix} = \begin{pmatrix}0\\ 0\end{pmatrix},$$
so that
$$(E - q\phi)\,\psi_+ - c\,\boldsymbol\sigma\cdot(\mathbf{p} - q\mathbf{A})\,\psi_- = mc^2\,\psi_+,$$
$$-(E - q\phi)\,\psi_- + c\,\boldsymbol\sigma\cdot(\mathbf{p} - q\mathbf{A})\,\psi_+ = mc^2\,\psi_-.$$
Assuming the field is weak and the motion of the electron non-relativistic, the total energy of the electron is approximately equal to its rest energy, and the momentum goes over to the classical value,
$$E - q\phi \approx mc^2, \qquad \mathbf{p} \approx m\mathbf{v},$$
and so the second equation may be written
$$\psi_- \approx \frac{1}{2mc}\,\boldsymbol\sigma\cdot(\mathbf{p} - q\mathbf{A})\,\psi_+,$$
which is of order v/c. Thus at typical energies and velocities, the bottom components of the Dirac spinor in the standard representation are much suppressed in comparison to the top components. Substituting this expression into the first equation gives after some rearrangement
$$(E - mc^2)\,\psi_+ = \frac{1}{2m}\left[\boldsymbol\sigma\cdot(\mathbf{p} - q\mathbf{A})\right]^2\psi_+ + q\phi\,\psi_+.$$
The operator on the left represents the particle energy reduced by its rest energy, which is just the classical energy, so one can recover Pauli's theory upon identifying his 2-spinor with the top components of the Dirac spinor in the non-relativistic approximation. A further approximation gives the Schrödinger equation as the limit of the Pauli theory. Thus, the Schrödinger equation may be seen as the far non-relativistic approximation of the Dirac equation when one may neglect spin and work only at low energies and velocities. This also was a great triumph for the new equation, as it traced the mysterious i that appears in it, and the necessity of a complex wave function, back to the geometry of spacetime through the Dirac algebra. It also highlights why the Schrödinger equation, although superficially in the form of a diffusion equation, actually represents the propagation of waves.
It should be strongly emphasized that this separation of the Dirac spinor into large and small components depends explicitly on a low-energy approximation. The entire Dirac spinor represents an irreducible whole, and the components just neglected here to arrive at the Pauli theory will bring in new phenomena in the relativistic regime: antimatter and the idea of creation and annihilation of particles.
In the massless case $m = 0$, the Dirac equation reduces to the Weyl equation, which describes relativistic massless spin-1/2 particles.^{[9]}
The theory acquires a second $U(1)$ symmetry: see below.
The critical physical question in a quantum theory is this: what are the physically observable quantities defined by the theory? According to the postulates of quantum mechanics, such quantities are defined by Hermitian operators that act on the Hilbert space of possible states of a system. The eigenvalues of these operators are then the possible results of measuring the corresponding physical quantity. In the Schrödinger theory, the simplest such object is the overall Hamiltonian, which represents the total energy of the system. To maintain this interpretation on passing to the Dirac theory, the Hamiltonian must be taken to be
$$H = \gamma^0\left[mc^2 + c\,\boldsymbol\gamma\cdot(\mathbf{p} - q\mathbf{A})\right] + q\phi,$$
where $\boldsymbol\gamma = (\gamma^1, \gamma^2, \gamma^3)$. By contrast, in classical electrodynamics the energy of a charge moving in an applied potential is
$$H = c\sqrt{(\mathbf{p} - q\mathbf{A})^2 + m^2c^2} + q\phi.$$
Thus, the Dirac Hamiltonian is fundamentally distinguished from its classical counterpart, and one must take great care to correctly identify what is observable in this theory. Much of the apparently paradoxical behavior implied by the Dirac equation amounts to a misidentification of these observables.
The negative E solutions to the equation are problematic, for it was assumed that the particle has a positive energy. Mathematically speaking, however, there seems to be no reason for us to reject the negative-energy solutions. Since they exist, they cannot simply be ignored, for once the interaction between the electron and the electromagnetic field is included, any electron placed in a positive-energy eigenstate would decay into negative-energy eigenstates of successively lower energy. Real electrons obviously do not behave in this way, or they would disappear by emitting energy in the form of photons.
To cope with this problem, Dirac introduced the hypothesis, known as hole theory, that the vacuum is the many-body quantum state in which all the negative-energy electron eigenstates are occupied. This description of the vacuum as a "sea" of electrons is called the Dirac sea. Since the Pauli exclusion principle forbids electrons from occupying the same state, any additional electron would be forced to occupy a positive-energy eigenstate, and positive-energy electrons would be forbidden from decaying into negative-energy eigenstates.
Dirac further reasoned that if the negative-energy eigenstates are incompletely filled, each unoccupied eigenstate, called a hole, would behave like a positively charged particle. The hole possesses a positive energy because energy is required to create a particle–hole pair from the vacuum. As noted above, Dirac initially thought that the hole might be the proton, but Hermann Weyl pointed out that the hole should behave as if it had the same mass as an electron, whereas the proton is over 1800 times heavier. The hole was eventually identified as the positron, experimentally discovered by Carl Anderson in 1932.^{[10]}
It is not entirely satisfactory to describe the "vacuum" using an infinite sea of negative-energy electrons. The infinitely negative contributions from the sea of negative-energy electrons have to be canceled by an infinite positive "bare" energy, and the contribution to the charge density and current coming from the sea of negative-energy electrons is exactly canceled by an infinite positive "jellium" background so that the net electric charge density of the vacuum is zero. In quantum field theory, a Bogoliubov transformation on the creation and annihilation operators (turning an occupied negative-energy electron state into an unoccupied positive-energy positron state and an unoccupied negative-energy electron state into an occupied positive-energy positron state) allows us to bypass the Dirac sea formalism even though, formally, it is equivalent to it.
In certain applications of condensed matter physics, however, the underlying concepts of "hole theory" are valid. The sea of conduction electrons in an electrical conductor, called a Fermi sea, contains electrons with energies up to the chemical potential of the system. An unfilled state in the Fermi sea behaves like a positively charged electron, and although it too is referred to as an "electron hole", it is distinct from a positron. The negative charge of the Fermi sea is balanced by the positively charged ionic lattice of the material.
In quantum field theories such as quantum electrodynamics, the Dirac field is subject to a process of second quantization, which resolves some of the paradoxical features of the equation.
The Dirac equation is Lorentz covariant. Articulating this helps illuminate not only the Dirac equation, but also the Majorana spinor and Elko spinor, which although closely related, have subtle and important differences.
Understanding Lorentz covariance is simplified by keeping in mind the geometric character of the process.^{[11]} Let $a$ be a single, fixed point in the spacetime manifold. Its location can be expressed in multiple coordinate systems. In the physics literature, these are written as $x$ and $x'$, with the understanding that both $x$ and $x'$ describe the same point $a$, but in different local frames of reference (a frame of reference over a small extended patch of spacetime). One can imagine $a$ as having a fiber of different coordinate frames above it. In geometric terms, one says that spacetime can be characterized as a fiber bundle, and specifically, the frame bundle. The difference between two points $x$ and $x'$ in the same fiber is a combination of rotations and Lorentz boosts. A choice of coordinate frame is a (local) section through that bundle.
Coupled to the frame bundle is a second bundle, the spinor bundle. A section through the spinor bundle is just the particle field (the Dirac spinor, in the present case). Different points in the spinor fiber correspond to the same physical object (the fermion) but expressed in different Lorentz frames. Clearly, the frame bundle and the spinor bundle must be tied together in a consistent fashion to get consistent results; formally, one says that the spinor bundle is the associated bundle; it is associated to a principal bundle, which in the present case is the frame bundle. Differences between points on the fiber correspond to the symmetries of the system. The spinor bundle has two distinct generators of its symmetries: the total angular momentum and the intrinsic angular momentum. Both correspond to Lorentz transformations, but in different ways.
The presentation here follows that of Itzykson and Zuber.^{[12]} It is very nearly identical to that of Bjorken and Drell.^{[13]} A similar derivation in a general relativistic setting can be found in Weinberg.^{[14]} Here we fix our spacetime to be flat, that is, our spacetime is Minkowski space.
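Under a Lorentz transformation $x \mapsto x'$, the Dirac spinor transforms as
$$\psi'(x') = S\,\psi(x).$$
It can be shown that an explicit expression for $S$ is given by
$$S = \exp\left(-\frac{i}{4}\,\omega^{\mu\nu}\sigma_{\mu\nu}\right),$$
where $\omega^{\mu\nu}$ parameterizes the Lorentz transformation and $\sigma_{\mu\nu}$ are the six 4 × 4 matrices satisfying
$$\sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu].$$
This matrix can be interpreted as the intrinsic angular momentum of the Dirac field. That it deserves this interpretation arises by contrasting it to the generator $J_{\mu\nu}$ of Lorentz transformations, having the form
$$J_{\mu\nu} = \frac12\sigma_{\mu\nu} + i(x_\mu\partial_\nu - x_\nu\partial_\mu),$$
which can be interpreted as the total angular momentum.
The geometrical interpretation of the above is that the frame field is affine, having no preferred origin. The generator $J_{\mu\nu}$ generates the symmetries of this space: it provides a relabelling of a fixed point $x$. The generator $\sigma_{\mu\nu}$ generates a movement from one point in the fiber to another: a movement from $x$ to $x'$, with both $x$ and $x'$ still corresponding to the same spacetime point $a$. These perhaps obtuse remarks can be elucidated with explicit algebra.
Let $x' = \Lambda x$ be a Lorentz transformation. The Dirac equation is
$$i\gamma^\mu\frac{\partial}{\partial x^\mu}\psi(x) - m\psi(x) = 0.$$
If the Dirac equation is to be covariant, it should have exactly the same form in all Lorentz frames:
$$i\gamma^\mu\frac{\partial}{\partial x'^\mu}\psi'(x') - m\psi'(x') = 0.$$
After expanding the transformed spinor to first order in the transformation parameters and properly antisymmetrizing, one obtains the generator of symmetries $J_{\mu\nu}$ given earlier. Thus, both $J_{\mu\nu}$ and $\sigma_{\mu\nu}$ can be said to be the "generators of Lorentz transformations", but with a subtle distinction: the first corresponds to a relabelling of points on the affine frame bundle, which forces a translation along the fiber of the spinor on the spin bundle, while the second corresponds to translations along the fiber of the spin bundle (taken as a movement along the frame bundle, as well as a movement along the fiber of the spin bundle). Weinberg provides additional arguments for the physical interpretation of these as total and intrinsic angular momentum.^{[15]}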
The Dirac equation can be formulated in a number of other ways.
This article has developed the Dirac equation in flat spacetime according to special relativity. It is possible to formulate the Dirac equation in curved spacetime.
This article developed the Dirac equation using four-vectors and Schrödinger operators. The Dirac equation in the algebra of physical space uses a Clifford algebra over the real numbers, a type of geometric algebra.
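As mentioned above, the massless Dirac equation immediately reduces to the homogeneous Weyl equation. By using the chiral representation of the gamma matrices, the nonzero-mass equation can also be decomposed into a pair of coupled inhomogeneous Weyl equations acting on the first and last pairs of indices of the original four-component spinor, i.e. $\psi = \begin{pmatrix}\psi_L\\ \psi_R\end{pmatrix}$, where $\psi_L$ and $\psi_R$ are each two-component Weyl spinors. This is because the skew block form of the chiral gamma matrices means that they swap the $\psi_L$ and $\psi_R$ and apply the two-by-two Pauli matrices to each:
$$\gamma^\mu\begin{pmatrix}\psi_L\\ \psi_R\end{pmatrix} = \begin{pmatrix}\sigma^\mu\psi_R\\ \bar\sigma^\mu\psi_L\end{pmatrix}, \qquad \sigma^\mu = (I_2, \sigma^i), \quad \bar\sigma^\mu = (I_2, -\sigma^i).$$
So the Dirac equation
$$(i\gamma^\mu\partial_\mu - m)\begin{pmatrix}\psi_L\\ \psi_R\end{pmatrix} = 0$$
becomes
$$i\begin{pmatrix}\sigma^\mu\partial_\mu\psi_R\\ \bar\sigma^\mu\partial_\mu\psi_L\end{pmatrix} = m\begin{pmatrix}\psi_L\\ \psi_R\end{pmatrix},$$
which in turn is equivalent to a pair of inhomogeneous Weyl equations for massless left- and right-helicity spinors, where the coupling strength is proportional to the mass:
$$i\sigma^\mu\partial_\mu\psi_R = m\psi_L, \qquad i\bar\sigma^\mu\partial_\mu\psi_L = m\psi_R.$$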
This has been proposed as an intuitive explanation of Zitterbewegung, as these massless components would propagate at the speed of light and move in opposite directions, since the helicity is the projection of the spin onto the direction of motion.^{[16]} Here the role of the "mass" is not to make the velocity less than the speed of light, but instead controls the average rate at which these reversals occur; specifically, the reversals can be modeled as a Poisson process.^{[17]}
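Natural units are used in this section. The coupling constant is labelled by convention with $e$: this parameter can also be viewed as modelling the electron charge.
The Dirac equation and action admit a $U(1)$ symmetry where the fields $\psi, \bar\psi$ transform as
$$\psi(x) \mapsto e^{i\alpha}\psi(x), \qquad \bar\psi(x) \mapsto e^{-i\alpha}\bar\psi(x).$$
If we 'promote' the global symmetry, parametrised by the constant $\alpha$, to a local symmetry, parametrised by a function $\alpha : \mathbb{R}^{1,3} \to \mathbb{R}$, or equivalently $e^{i\alpha} : \mathbb{R}^{1,3} \to U(1)$, the Dirac equation is no longer invariant: there is a residual derivative of $\alpha(x)$.
The fix proceeds as in scalar electrodynamics: the partial derivative is promoted to a covariant derivative
$$D_\mu\psi = \partial_\mu\psi + ieA_\mu\psi, \qquad D_\mu\bar\psi = \partial_\mu\bar\psi - ieA_\mu\bar\psi,$$
where $A_\mu$ is the $U(1)$ gauge field, identified with the electromagnetic four-potential.
The transformation law under gauge transformations for $A_\mu$ is then the usual
$$A_\mu(x) \mapsto A_\mu(x) - \frac{1}{e}\,\partial_\mu\alpha(x),$$
chosen so that $D_\mu\psi \mapsto e^{i\alpha(x)}D_\mu\psi$. Replacing the partial derivative in the Dirac action with the covariant one and adding the Maxwell term, with $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$, gives the QED action
$$S_{\text{QED}} = \int d^4x\left[-\frac14F^{\mu\nu}F_{\mu\nu} + \bar\psi\,(iD\!\!\!\!/ - m)\,\psi\right].$$
Expanding out the covariant derivative allows the action to be written in a second useful form:
$$S_{\text{QED}} = \int d^4x\left[-\frac14F^{\mu\nu}F_{\mu\nu} + \bar\psi\,(i\partial\!\!\!/ - m)\,\psi - eJ^\mu A_\mu\right], \qquad J^\mu = \bar\psi\gamma^\mu\psi.$$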
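Massless Dirac fermions, that is, fields $\psi(x)$ satisfying the Dirac equation with $m = 0$, admit a second, inequivalent $U(1)$ symmetry.
This is seen most easily by writing the four-component Dirac fermion $\psi(x)$ as a pair of two-component fields,
$$\psi(x) = \begin{pmatrix}\psi_1(x)\\ \psi_2(x)\end{pmatrix},$$
and adopting the chiral representation for the gamma matrices, in which $\gamma^\mu$ has off-diagonal blocks $\sigma^\mu$ (upper right) and $\bar\sigma^\mu$ (lower left).
The Dirac action then takes the form
$$S = \int d^4x\left[\psi_1^\dagger\,(i\bar\sigma^\mu\partial_\mu)\,\psi_1 + \psi_2^\dagger\,(i\sigma^\mu\partial_\mu)\,\psi_2\right],$$
that is, it decouples into a theory of two Weyl spinors.
The earlier vector symmetry is still present, where $\psi_1$ and $\psi_2$ rotate identically. This form of the action makes the second inequivalent $U(1)$ symmetry manifest:
$$\psi_1(x) \mapsto e^{i\beta}\psi_1(x), \qquad \psi_2(x) \mapsto e^{-i\beta}\psi_2(x).$$
This is called the axial symmetry.
This isn't the only $U(1)$ symmetry possible, but it is conventional. Any 'linear combination' of the vector and axial symmetries is also a $U(1)$ symmetry.
Classically, the axial symmetry admits a well-formulated gauge theory. But at the quantum level, there is an anomaly, that is, an obstruction to gauging.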
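We can extend this discussion from an abelian $U(1)$ symmetry to a general non-abelian symmetry under a gauge group $G$, the group of color symmetries for a theory.
For concreteness, we fix $G = \text{SU}(N)$, the special unitary group of $N \times N$ matrices acting on $\mathbb{C}^N$.
Before this section, $\psi(x)$ could be viewed as a spinor field on Minkowski space, in other words a function $\psi : \mathbb{R}^{1,3} \to \mathbb{C}^4$, and its components in $\mathbb{C}^4$ are labelled by spin indices, conventionally Greek indices taken from the start of the alphabet $\alpha, \beta, \gamma, \ldots$.
Promoting the theory to a gauge theory, informally $\psi$ acquires a part transforming like $\mathbb{C}^N$, and these are labelled by color indices, conventionally Latin indices $i, j, k, \ldots$. In total, $\psi(x)$ has $4N$ components, given in indices by $\psi^{i,\alpha}(x)$. The 'spinor' labels only how the field transforms under spacetime transformations.
Formally, $\psi(x)$ is valued in a tensor product, that is, it is a function
$$\psi : \mathbb{R}^{1,3} \to \mathbb{C}^4 \otimes \mathbb{C}^N.$$
Gauging proceeds similarly to the abelian $U(1)$ case, with a few differences. Under a gauge transformation $U : \mathbb{R}^{1,3} \to \text{SU}(N)$, the spinor fields transform as
$$\psi(x) \mapsto U(x)\,\psi(x), \qquad \bar\psi(x) \mapsto \bar\psi(x)\,U^\dagger(x).$$
With the covariant derivative $D_\mu\psi = \partial_\mu\psi + igA_\mu\psi$, the matrix-valued gauge field transforms as $A_\mu \mapsto UA_\mu U^{-1} + \frac{i}{g}(\partial_\mu U)U^{-1}$, so that $D_\mu\psi \mapsto U D_\mu\psi$.
Writing down a gauge-invariant action proceeds exactly as with the $U(1)$ case, replacing the Maxwell Lagrangian with the Yang–Mills Lagrangian $\mathcal{L}_{\text{YM}}$, which is proportional to $\text{Tr}(F^{\mu\nu}F_{\mu\nu})$ with a normalisation that depends on convention, where $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu + ig[A_\mu, A_\nu]$.
The action is then
$$S = \int d^4x\left[\mathcal{L}_{\text{YM}} + \bar\psi\,(iD\!\!\!\!/ - m)\,\psi\right].$$
For physical applications, the case $N = 3$ describes the quark sector of the Standard Model, which models strong interactions. Quarks are modelled as Dirac spinors; the gauge field is the gluon field. The case $N = 2$ describes part of the electroweak sector of the Standard Model. Leptons such as electrons and neutrinos are the Dirac spinors; the gauge field is the $W$ gauge boson.
This expression can be generalised to an arbitrary Lie group $G$ with connection $A_\mu$ and a representation $(\rho, G, V)$, where the colour part of $\psi$ is valued in $V$. Formally, the Dirac field is a function
$$\psi : \mathbb{R}^{1,3} \to \mathbb{C}^4 \otimes V.$$
Then $\psi$ transforms under a gauge transformation $g : \mathbb{R}^{1,3} \to G$ as
$$\psi(x) \mapsto \rho(g(x))\,\psi(x).$$
This theory can be generalised to curved spacetime, but there are subtleties which arise in gauge theory on a general spacetime (or more generally still, a manifold) which, on flat spacetime, can be ignored. This is ultimately due to the contractibility of flat spacetime, which allows us to view a gauge field and gauge transformations as defined globally on $\mathbb{R}^{1,3}$.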
