Lecture 3
Lecture 3
NORMALIZATION OF 4-VELOCITY
Last lecture we defined the proper time τ of a massive particle, s.t. dτ 2 = −gµν dxµ dxν , and the 4-velocity u, with
components uµ = dxµ /dτ in an arbitrary coordinate system. This implies the following normalization, valid in an
arbitrary coordinate system:
4-VECTORS
For now we define a 4-vector as a geometric object V whose components transform like those of a
4-velocity, i.e.
0
µ0 ∂xµ µ
V = V [general coordinate change].
∂xµ
In particular, under a Lorentz transformation between two ICS,
0 0
V µ = Λµ µ V µ [Lorentz transformation between two ICS].
Given a 4-vector with components V µ , we can define the object V with components
Vµ ≡ gµν V ν .
We will see later on that this object is a dual vector. Under a change of coordinates, we have
0
0 ∂xµ ∂xν ∂xν λ ∂xµ ∂xµ ∂xµ
Vµ0 = gµ0 ν 0 V ν = 0 0 gµν V = δλν µ0 gµν V λ = 0 gµλ V
λ
= Vµ ,
µ
∂x ∂x ν ∂xλ ∂x ∂xµ ∂xµ0
where I used Eq. (1) to simplify the terms in red. In general, we define a dual vector as a geometric object V
whose components transform as above. We will formalize these definitions in the upcoming lectures.
Although this is somewhat dated language, 4-vectors are sometimes referred to as contravariant vectors and
dual vectors as covariant vectors.
2
Now suppose we know the components V µ = (V 0 , V ~ ) of V in an ICS {xµ }, and want to compute them in the
rest-frame of some observer, moving with 3-velocity ~v with respect to that ICS. Let us denote these components
µ
V(obs) 0
= (V(obs) ~(obs) ). They are related to V µ by a Lorentz boost. In particular, the 0 component is given by
,V
0 1
~ .
V(obs) =√ V 0 − ~v · V
1 − v2
Now recall that, in the initial ICS, the observer’s 4-velocity has components uµ = √ 1 (1, ~v ). Thus we have
1−v 2
0
V(obs) = u0 V 0 − δij ui V j = −ηµν uµ V ν = −gµν uµ V ν = −uν V ν ,
where we used the fact that gµν = ηµν in an ICS. Now, the quantity gµν uµ V ν is coordinate-independent, as you
can easily explicitly check for yourself. Thus we found a coordinate-independent way to write the zero-th component
of V measured by an observer.
In the homework, you’ll be asked to write a similar geometric expression for the spatial components of V in the
observer’s rest frame.
METRIC INVERSE
Given a metric gµν in a general coordinate system, we define the inverse metric with the same symbol, g, but
with the indices up, g µν . Specifically, we define it through the relation
(
αµ α 1 if α = β
g gµβ = δβ = .
0 if α 6= β
If we put the components of the metric and inverse metric in square matrices, the above equation just states that the
product of the two matrices is the identity matrix. It should start being clear now that we only sum over repeated
indices if one of them is up and the other is down (though we make exceptions for purely spatial indices, in a cartesian
coordinate system). We will formalize the mathematical meaning of up and down indices in a couple of lectures.
Let’s look at some examples. First, in an ICS, gµν = ηµν = diag(−1, 1, 1, 1). It should be clear the the inverse of
this matrix has the same numerical components, so
η µν = diag(−1, 1, 1, 1).
Let us now consider spherical polar coordinates (x0 , x1 , x2 , x3 ) = (t, r, θ, ϕ), with line element
The components of the inverse metric are just the inverse of the matrix above, i.e.
00 01
g g
−1 0 0 0
g 10 g 11
0 1 0 0
(g µν ) ≡
. 0 0 1/r2 .
. . 0
0 0 0 1/(r2 sin2 θ)
3
Change of coordinates
Let us now write the definition of the inverse metric in primed coordinates:
0 0 0
g α µ gµ0 β 0 = δβα0 .
We already know how the metric components change. Inserting this, we have
0 0 ∂xµ ∂xβ α0
gα µ µ0 β 0 gµβ = δβ 0 .
∂x ∂x
Now multiply both sides and simplify – make sure to check at every step that there are the same free indices on
each side:
0 0
∂xµ ∂xβ ∂xβ
0 0 0 ∂x
β
µ
gα µ
0 β 0 α
gµβ = δβα0
∂x ∂x ∂x ∂xα
0
µ
0 0 ∂x ∂xα
gα µ δ β
g µβ =
∂xµ0 α ∂xα
0
µ
0 0 ∂x ∂xα
gα µ 0 gµα = .
∂xµ ∂xα
Let us now multiply by the inverse (unprimed) metric:
0
0 0 ∂xµ ∂xα αβ
gα µ 0 gµα g αβ
= g
∂xµ ∂xα
0
µ
0 0 ∂x ∂xα αβ
gα µ 0 δ β
µ = g
∂xµ ∂xα
0
β
0 0 ∂x ∂xα αβ
gα µ µ 0 = g .
∂x ∂xα
0
The last step is to multiply by ∂xβ /∂xβ . After simplification, this gives the transformation of the inverse metric
under changes of coordinates:
0 0
α0 β 0 ∂xα ∂xβ αβ
g = g .
∂xα ∂xβ
Again, you see that the priming-the-indices convention makes such transformation rules very easy to remember.
In the homework, you’ll be asked to check this relation explicitly for spherical polar coordinates.
We know that in an ICS, particles not subject to any forces move along straight lines at constant velocity,
i.e. such that v i = dxi /dt = constant. This mplies v = constant, hence dt/dτ = constant, thus, dxµ /dτ = constant.
Let us show that these trajectories correspond to paths of maximum proper time.
We denote events along the trajectory by P (λ), where λ ∈ [0, 1] is a parameter (so the set of events {P (λ); λ ∈ [0, 1]}
is the worldline). Consider trajectories starting at a given event A = P (λ = 0) and ending at B = P (λ = 1), with a
timelike separation fron A. The total proper time along such a trajectory is, in an ICS,
Z 1
dxµ
Z
p
dλ −ηµν ẋµ ẋν , ẋµ ≡
p
τ= µ
−ηµν dx dx =ν .
0 dλ
Rewrite this as
Z 1
dλ L(xµ , ẋµ ),
p
τ= L≡ −ηµν ẋµ ẋν .
0
Let us now search for the trajectories extremizing τ . They are solutions of the Euler-Lagrange equations:
d ∂L ∂L
= . (2)
dλ ∂ ẋµ ∂xµ
4
x0
<latexit sha1_base64="(null)">(null)</latexit>
B
<latexit sha1_base64="(null)">(null)</latexit>
<latexit
P( )
<latexit sha1_base64="(null)">(null)</latexit>
x1
<latexit sha1_base64="(null)">(null)</latexit>
A
<latexit sha1_base64="(null)">(null)</latexit>
<latexit
FIG. 1. Worlines between two events A and B, separated by a timelike separation. In an ICS, a particle not subject to any
force moves on a straight line at constant velocity (in blue). We showed that this trajectory extremizes proper time. To show
that it maximizes it, consider the purple paths, which is light-like (i.e. null) everywhere, hence as a vanishing proper time.
We recognize the denominator on the left-hand-side as dτ /dλ. The Euler-Lagrange equations thus imply
dxν
d
ηµν = 0.
dλ dτ
ν µ
This implies that ηµν dx µ dx
dτ is constant, hence that u = dτ is constant. Thus we have shown that the trajectories of
extremal proper time are straight lines of constant velocity in ICS. It is easy to show that this is in fact the maximum
proper time, as it is always possible to construct trajectories with null segments, hence with a zero proper time.
The conclusion we reached above is: freely-moving particles move along paths of maximum proper time.
Even though we used knowledge about free-trajectories in ICS to arrive at this conclusion, it is clearly meaningful
regardless of coordinates, given that proper time is coordinate independent. Let us now see what this translates to in
a general coordinate system {xµ }. The proper time is given by
Z 1
dλ L(xµ , ẋµ ), L ≡ −gµν ẋµ ẋν .
p
τ=
0
To compute the equation of motion in a general coordinate system, we look for extrema of τ , again using the Euler-
Lagrange equations (2). This time, the right-hand-side is no longer zero, as gµν (xα ) is a priori a function of the
non-inertial coordinates. We thus get, for any µ ∈ {0, 1, 2, 3},
!
gµν ẋν 1 ∂µ gρσ ẋρ ẋσ
d ∂X
− =− , notation: ∂µ X ≡ .
∂xµ
p p
dλ −gαβ ẋα ẋβ 2 −gαβ ẋα ẋβ
Note that there isn’t a 1/2 on teh right-hand-side because we need to take the derivative with respect to each of the
two ẋµ factors. There are lots of indices here; note that all of them are repeated (hence summed over) except for µ. It
5
Multiply both sides by dλ/dτ = 1/(dτ /dλ), and use the chain rule to simplify to
d2 xν 1 dxρ dxσ
gµν + (∂ρ gµσ + ∂σ gµρ − ∂ µ gρσ ) = 0.
dτ 2 2 dτ dτ
The last step is to multiply by g αµ , the inverse metric. After simplifying, this gives
d2 xα dxρ dxσ
2
+ Γα
ρσ =0, (4)
dτ dτ dτ
where Γα
ρσ is the Chirstoffel symbol:
1 αµ 1
Γα
ρσ ≡ g (∂ρ gµσ + ∂σ gµρ − ∂µ gρσ ) = g αµ (gµσ,ρ + gµρ,σ − gρσ,µ ) [notation: X...,ν ≡ ∂ν X... ]
2 2
There aren’t too many formulae that you should know “by heart” in physics, but this is one of them. Note that it
is symmetric in the two lower indices. We have not yet defined formally what a tensor is, but one thing you should
remember is that the Christoffel symbol is not a tensor.
Equation (4) is called the geodesic equation. This represents the equation of motion for a freely-moving particle
in an arbitrary, non-inertial coordinate system. What it fundamentally means is that, in an ICS, a free-moving particle
moves along a straight line with constant velocity, which also happens to be the path of maximum proper time. In a
non-ICS, this path is described by a more complicated equation, even though it remains “fundamentally” (i.e. in an
ICS) just a straight line traveled at constant velocity. In the homework, you’ll explicitly derive the geodesic equation
for spherical polar coordinates.
• Let us first remark that we can arrive at the geodesic equation by extremizing the following integral (which does
not have a particular physical meaning):
dxµ
Z
I ≡ dτ gµν ẋµ ẋν , ẋµ ≡ .
dτ
Indeed, using the Euler-Lagrange equation with L = gµν ẋµ ẋν , we get precisely Eq. (3), from which the same steps
follow. This is a useful trick to derive the geodesic equation in an arbitrary coordinate system, without
6
first computing all the Christoffel symbols: just find the extrema of I using the Euler-Lagrange equation.
• Also note that we can derive this equation starting from the geodesic equation in a ICS, and then
changing coordinates. In an ICS {xµ }, the geodesic equation is just
d2 xµ
= 0.
dτ 2
Now, using
0
the chain rule, and the coordinate-independence of τ , we may rewrite this in terms of the general coordi-
nates xµ :
0
! 0 0 0
d ∂xµ dxµ ∂xµ d2 xµ ∂ 2 xµ dxµ dxν
0= = + .
dτ ∂xµ0 dτ ∂xµ0 dτ 2 ∂xµ0 ∂xν 0 dτ dτ
0
0
Recalling that the inverse of the matrix ∂xµ /∂xµ is the matrix ∂xµ /∂xµ , we get
0 0 0 0
d2 xα ∂xα ∂ 2 xµ dxµ dxν
+ = 0.
dτ 2 ∂xµ ∂xµ0 ∂xν 0 dτ dτ
This already has the form of the geodesic equation with
0
0 ∂xα ∂ 2 xµ
Γα
µ0 ν 0 ≡ .
∂xµ ∂xµ0 ∂xν 0
In the homework, you will show that this is indeed equal to the Christoffel symbol derived above.
• Once we introduce differential geometry and covariant derivatives, we will see that this equation can be written
in a geometric way as
D α dxα
u ≡ uβ ∇β uα = 0, uα ≡ .
Dτ dτ
The only difference between “full-on” GR and what we have been doing so far is that in general, ICS are only
defined locally. The Christoffel symbol is proportional to derivatives of the metric components. Thus, the minimum
requirements to locally recover the straight-line-at-constant-velocity free motion are not only gµν = ηµν , but also
∂α gµν = 0 – both of which are satisfied when we implicitly assume a global ICS, such that gµν = ηµν everywhere.
This gives us a definition of a locally inertial coordinate system: it is a coordinate system defined around a
“central” event, around which the metric components deviate from Minkowski only at second order in the
coordinate separation. In such a coordinate system, freely-moving particles travel locally (i.e. around the central
event) on straight lines at constant velocity.