Domenico Fiorenza Dirac equation

The Dirac equation

Derivation

In this section, we will produce a covariant equation for a particle with spin non zero. This was first achieved by Dirac in 1928 for the electron, a particle with mass m=0,5MeVm= 0,5 MeV and spin 1/21/2. He moved his construction from two facts:

  1. An equation for a wave function of a particle must be of order one in time coordinates, according to the fundamental principles of quantum mechanics
  2. The square of the operator must be the K-G operator.

Forcing these two conditions, Dirac obtained an equation for a vectorial field Ψ\Psi :

(im)Ψ=0 ( i \slash{\partial} - m ) \Psi = 0

where = μγ μ\slash{\partial} = \partial^{\mu} \gamma_{\mu}. The matrices γ 0,γ 1,γ 2,γ 3\gamma_0,\gamma_1,\gamma_2,\gamma_3 satisfy:

{γ μ,γ ν}=g μν, \{ \gamma^{\mu},\gamma^{\nu} \} = g^{\mu \nu},
(γ 0) 2=1,(γ i) 2=1. (\gamma^0 )^2 = 1, \quad (\gamma^i)^2 = -1.

This is called the Feynman form for the Dirac equation. It results by an easy calculation that the lowest allowed dimension for the vectorial field Ψ\Psi is four. This is the most surprising fact: we expected to find that a 1/21/2-spin particle, such the electron is, could be represented by a two dimensional field, instead the dimension two is not even allowed. This was explained by introducting for each particle, the respective antiparticle.

Dictionary.

Antiparticle. We introduce three fundamental transformation over the Hilbert space \mathcal{H} of the L 2L^2 functions over the Minkowski space:

C: P: Ψ Ψ * \begin{matrix} C : & \mathcal{H} & \to & \mathcal{H} \\ \phantom{P : } & \Psi & \to & \Psi^{*} \end{matrix}
P: P: Ψ(x 0,x 1,x 2,x 3) Ψ(x 0,x 1,x 2,x 3) \begin{matrix} P : & \mathcal{H} & \to & \mathcal{H} \\ \phantom{P : } & \Psi(x_0,x_1,x_2,x_3) & \to & \Psi(x_0,-x_1,-x_2,-x_3) \end{matrix}
T: P: Ψ(x 0,x 1,x 2,x 3) Ψ(x 0,x 1,x 2,x 3) \begin{matrix} T : & \mathcal{H} & \to & \mathcal{H} \\ \phantom{P : } & \Psi(x_0,x_1,x_2,x_3) & \to & \Psi(-x_0,x_1,x_2,x_3) \end{matrix}

These three transformations are, respectevely, the charge coniugation, the parity and the time reversal. The antiparticle for the particle a+a+, represented by a wave function Ψ\Psi, is a particle aa- with wave function CPTΨCPT \Psi. In concrete the antiparticle has the same mass, the same spin, but the opposite charge respect to the associated particle.

Once the antiparticles come into play, we can state that a solution for the Dirac equation is a field, describing the evolution for both the particle and the antiparticle, i.e. the electron and the positron.

The iter followed by Dirac, while it was the only suitable way to write such an equation in an early stage of the quatum field theory, it is quite brutal. Moreover we should verify a posteriori the covariance of the equation.

What we want to do now is to derive the same expression through the representation theory. We know that a particle, say aa, with spin ss and mass m>0m \gt 0 is an irreducible representation inducted by an irrep of the little group SU(2)SU(2) of dimension 2s+12s+1. Then a couple particle-antiparticle representation must be found in an irrep for SU(2)SU(2) of dimension (2s+1)(2s+1)(2s+1)(2s+1). To fix the ideas, we study the case s=1/2s=1/2. It’s crucial that we don’t want two indipendent particles. The physics suggests the following correlation in a rest frame:

Ψ a+=Ψ a, \Psi_{a+} = \Psi_{a-},

where Ψ a+\Psi_{a+} is a 2 component field for the particle, and Ψ a\Psi_{a-} is the analogous for the antiparticle. Giving such a condition “in a rest frame”, it’s the same than solving the equation over a single point p=(m,0,0,0)p=(m,0,0,0) over the mass hyperboloid. The same condition could also be written in a more fitting way:

(1)[(0 I I 0)(I 0 0 I)](Ψ a Ψ a+) \left[ \left( \begin{matrix} 0 & I \\ I & 0 \end{matrix} \right) - \left( \begin{matrix} I & 0 \\ 0 & I \end{matrix} \right) \right] \left( \begin{matrix} \Psi_{a-} \\ \Psi_{a+} \end{matrix} \right)

where II is the identity matrix 2×22\times2

Now to produce the Dirac equation it is sufficient to act with a transformation of the rest frame over Ψ\Psi. Which is this transformation? Recall that by calculation (eq:act2) the action of the Poincaré group, is essentially the action of the little group: any other element contributes to move the mass eigenvalue over the hyperboloid p 2=m 2p^2=m^2. We have to compute the action of SU(2)SU(2) over a four dimensional wave function. Remark that the infinitesimal generators for SU(2)SU(2) are the Pauli’s matrices σ i\sigma_i, then the action of SU(2)SU(2) is written through the exponential e ϕ iσ ie^{ \phi_{i}\sigma^{i}}, the sum is over the three Pauli’s matrices indeces. The action is the following

(2)U=exp(1/2(ϕ iσ i 0 0 ϕ iσ i)). U = exp \left(-1/2 \left( \begin{matrix} - \phi_i \sigma^i & 0 \\ 0 & \phi_i \sigma^i \end{matrix} \right) \right).

The dependancy from ΛL\Lambda \in L is omissis.

Essentially we want to act over Ψ a±\Psi_{a \pm} with SU(2)SU(2) simultaneously, and give an equation of UΨU \Psi. Since U 1UΨ=ΨU^{-1}U \Psi = \Psi the following equations hold:

(γ 01)U 1UΨ=0 (\gamma_0 - 1 )U^{-1} U \Psi =0
(γ 0U 1U 1)UΨ=0 (\gamma_0 U^{-1} - U^{-1} ) U\Psi = 0

finally multiplying for UU we have:

(3)(Uγ 0U 11)UΨ, (U \gamma_0 U^{-1} -1 ) U \Psi,

We proceed to expand equation (3) and show that is exactly the Dirac equation. Remark that by construction the projection to the rest frame is equation (1).

First it results:

Uγ 0U 1=exp(1/2(iϕ iσ i 0 0 iϕ iσ i))γ 0exp(1/2(iϕ iσ i 0 0 iϕ iσ i)). U \gamma_0 U^{-1} = exp \left(-1/2 \left( \begin{matrix} -i \phi_i \sigma^i & 0 \\ 0 & i \phi_i \sigma^i \end{matrix} \right) \right) \gamma_0 exp \left(1/2 \left( \begin{matrix} -i \phi_i \sigma^i & 0 \\ 0 & i \phi_i \sigma^i \end{matrix} \right) \right).

Recall the Hadamard lemma, valid for two square matrix X,YX,Y:

e XYe X=Y+[X,Y]+12![X,[X,Y]]+13![X,[X,[X,Y]]]+.. e^X Y e^{-X}= Y + [X,Y] + \frac{1}{2!} [X,[X,Y]] + \frac{1}{3!}[X,[X,[X,Y]]]+..

For X=UX = U and Y=γ 0Y=\gamma_0 it results:

[X,Y]=(0 ϕ iσ i ϕ iσ i 0) [X,Y]= \left( \begin{matrix} 0 & \phi_i \sigma^i \\ - \phi_i \sigma^i & 0 \end{matrix} \right)

and

Uγ 0U 1=(0 e ϕ iσ i e ϕ iσ i 0). U \gamma_0 U^{-1} = \left( \begin{matrix} 0 & e^{ \phi_i \sigma^i} \\ e^{- \phi_i \sigma^i} & 0 \end{matrix} \right).

Next we have to compute the exponential of a Pauli’s matrix. By utilizing the commutation rules we have:

e ϕ iσ i= n(ϕ 1σ 1+ϕ 2σ 2+ϕ 3σ 3) nn!=(p 0/m+σ ip i/m). e^{\phi_i \sigma^i} = \sum_n \frac{(\phi_1 \sigma_1 +\phi_2 \sigma_2 + \phi_3 \sigma_3)^n}{n!} = (p_0/m + \sigma_i p_i/m).

this is achieved by utilizing the following relations between ϕ\phi and pp:

cosh(ϕ)=1(1p 1 2p 2 2p 3 2) 1/2 cosh(\phi)=\frac{1}{(1-p_1^2 - p^2_2 - p_3^2)^{1/2}}
senh(ϕ)=p 1 2+p 2 2+p 3 2(1p 1 2p 2 2p 3 2) 1/2. senh(\phi)=\frac{p_1^2 + p^2_2 + p_3^2 }{(1-p_1^2 - p^2_2 - p_3^2)^{1/2}}.

Finally UU results:

U=(0 (p 0/m+σ ip i/m) (p 0/mσ ip i/m) 0)=1mγ μp μ. U = \left( \begin{matrix} 0 & (p_0/m + \sigma_i p_i/m) \\ (p_0/m - \sigma_i p_i/m) & 0 \end{matrix} \right) = \frac{1}{m} \gamma^{\mu}p_{\mu} .

Call UΨ=Ψ(p)U \Psi = \Psi(p) then we have obtained the explicit form of the equation (3):

(γ μp μm)Ψ(p)=0 ( \gamma_{\mu}p^{\mu} - m) \Psi(p) = 0

the saturation of the indeces with the γ\gamma-matrices is also written as p\slash{p}.

Identifying the representation

At this point we want to come back to the Dirac’s construction. Once the equation was given, we had no information on the representation the solutions should belong to. It is worth to come up with a strategy to identify the representation which satisfies a certain covariant equation. Again we proceed with the Dirac equation for s=1/2s=1/2. The first step is finding a good space where the solutions can be written. The expression in Fourier coordinates for the Dirac operator is:

(im)^=(pm). \widehat{ ( i \slash{\partial} - m )} = (\slash{p} - m).

Remark that (p 2m 2)Id=(pm)(p+m)(p^2 - m^2)Id = (\slash{p} - m)(\slash{p} + m). Then the equation

(4)(pm)(p+m)Ψ^, (\slash{p} - m)(\slash{p} + m) \hat{\Psi},

has non trivial solutions when p 2=m 2p^2 = m^2. We can conclude that a state Ψ\Psi, solution for Dirac, solves K-G equation as well. Then a natural space where the solutions live is the set of the distributions D mD_m, taking values in 4\mathbb{C}^4, supported over the mass hyperboloid p 2=m 2p^2 = m^2. This identifies the first index for an irrep of the Poincaré group: the mass mm. Before proceeding furthermore, we introduce some more representation theory tool. What we want to do is understanding how factoring the K-G operator in Fourier can allow us to study representation associated to the factors. First we want to specify what we intend for “factoring” the K-G operator. Given two representation of a group GG, VV and WW, one can introduce the symmetric power of VV, Sym n(V)Sym^n(V) and then define

ϕ:Sym n(V)W \phi : Sym^n(V) \to W

Remark that such an homomorphism between representation is multilinear in the entries of Sym n(V)Sym^n(V), moreover the symmetric power has a concrete realization as the polynomials in the vectors vVv \in V. It is interesting to give the factorization notion for such an application.

Definition. Given a ϕ:Sym n(V)W\phi : Sym^n(V) \to W homomorphism between representation for a group GG, we say that ϕ\phi is factorizable if and only if exist W 1,W 2,W_1,W_2, representations and two maps pp and qq

p:Sym h(V)W 1 p : Sym^h(V) \to W_1
q:Sym k(V)W 2, q : Sym^k(V) \to W_2,

such that pp and qq are a complete system of projectors over WW, WW is isomorphic to W 1W 2W_1 \otimes W_2, h+k=nh+k=n and the following diagram commutes:

gendiagram

Then pq=ϕ.p \otimes q = \phi. is a factorization for ϕ\phi.

We are going to explain why equation (4) is a factorization in this new sense. Consider W= 4W=\mathbb{C}^4, V= 2V=\mathbb{C}^2 and the application:

ϕ=(p 2m 2)Id:Sym 2( 2) 4. \phi = (p^2 - m^2)Id : Sym^2(\mathbb{C}^2) \to \mathbb{C}^4.

Recall that a Symmetric power of a vector space can be indetified with polynomial in the vectors. Consider p=(p+m)p=(\slash{p}+m) and q=(pm)q=(\slash{p}-m): by an easy calculation it results that they are a complete system of projectors over 4\mathbb{C}^4, each projecting over 2\mathbb{C}^2. Then they can be identified as applications between VV and 2\mathbb{C}^2. Hence we can define:

pq:VV 2 2. p \otimes q : V \otimes V \to \mathbb{C}^2 \otimes \mathbb{C}^2.

Next, let ϕ\phi be ϕ=pq \phi = pq and π\pi be the natural projection from the tensor product to the symmetric power. We have the following diagram:

diracdiagram

We want to verify that is commutative, i.e. f(pq)=ϕπ f (p \otimes q) = \phi \pi. Remark that since pp and qq are a complete set of projectors and ϕ\phi is a multiple of the identity, it results Dim(Ran(pq))=Dim(Ran(ϕ))Dim(Ran(p \otimes q) ) = Dim(Ran(\phi)). Then it is possibile to give an isomorphism ff such that f(pq)=ϕπ f (p \otimes q) = \phi \pi, by defining f(vw)=ϕπ(vw)f(v \otimes w) = \phi \pi (v \otimes w). Such an ff is bi-linear in the entries, since so it is π\pi, and due to the fact that Ran(f)=WRan(f)= W, it results Ker(f)={0}Ker(f)= \{0\}, then ff is an isomorphism between vector spaces. We left behind the calculation of the dimension of the image of the projectors (p±m)(\slash{p} \pm m ). For convenience it can be calculated in a rest frame p=(m,0,0,0)p=(m,0,0,0). We have:

(γ 0mm)=(mI mI mI mI). (\gamma_0m - m) = \left( \begin{matrix} -mI & mI \\ mI & -mI \end{matrix} \right).

The operator is a 4×4 4 \times 4 matrix with two identical rows, then Dim(Ker(γ 0Id))=2Dim(Ker(\gamma_0 - Id)) = 2. Next we want an explicit action of SU(2)SU(2) over the solutions: this is done by forcing the covariance condition. Let Ψ\Psi be a Dirac solution in a frame OO, the solution in a new frame OO' is Ψ =U(Λ)Ψ\Psi^{'} = U(\Lambda)\Psi , where Λ\Lambda is the lorentz transformation that connects OO and OO'. For Ψ \Psi^{'} we write the following equation:

0=(p μγ μm)Ψ=(p μγ μm)U 1(Λ)Ψ = 0 = \left( p_{\mu} \gamma^{\mu} - m \right) \Psi = \left( p_{\mu} \gamma^{\mu} - m \right) U^{-1}(\Lambda) \Psi^{'} =
=(γ μΛ μ νp ν m)U 1(Λ)Ψ = = \left( \gamma^{\mu} \Lambda^{\nu}_{\mu} p^{'}_{\nu} - m \right) U^{-1}(\Lambda) \Psi^{'}=

Where p p^{'} is the momentum in OO'. Multiplying for U(Λ)U(\Lambda) we have:

=(γ μU(Λ)Λ μ νp ν U 1(Λ)m)Ψ . = \left( \gamma^{\mu} U(\Lambda) \Lambda^{\nu}_{\mu} p^{'}_{\nu} U^{-1}(\Lambda) - m \right) \Psi^{'}.

This is the Dirac equation if the following condition is satisfied:

Λ ν μU(Λ)γ νU 1(Λ)=γ μ, \Lambda^{\mu}_{\nu} U(\Lambda) \gamma^{\nu} U^{-1}(\Lambda) = \gamma^{\mu},

or the equivalent:

U(Λ)γ μU 1(Λ)=(Λ 1) ν μγ ν. U(\Lambda) \gamma^{\mu} U^{-1}(\Lambda) = (\Lambda^{-1})^{\mu}_{\nu} \gamma^{\nu}.

The expression for UU given in (2) solves the relation above. This is easily checked:

U(Λ)γ μU 1(Λ)=γ 0sinh(ϕ)+γ μcosh(ϕ)=(Λ 1) ν μγ ν. U(\Lambda) \gamma^{\mu} U^{-1}(\Lambda) = - \gamma^0 sinh(\phi) + \gamma^{\mu} cosh(\phi) = (\Lambda^{-1})^{\mu}_{\nu} \gamma^{\nu}.

todo: typical Lorentz transformation

todo irreducibility

Created on November 26, 2010 at 11:05:31 by giuseppe_malavolta