0% found this document useful (0 votes)
11 views

Lecture 3

Uploaded by

enlightenedep
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views

Lecture 3

Uploaded by

enlightenedep
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

General Relativity Fall 2019

Lecture 3: the geodesic equation


Yacine Ali-Haı̈moud
September 10th 2019

NORMALIZATION OF 4-VELOCITY

Last lecture we defined the proper time τ of a massive particle, s.t. dτ 2 = −gµν dxµ dxν , and the 4-velocity u, with
components uµ = dxµ /dτ in an arbitrary coordinate system. This implies the following normalization, valid in an
arbitrary coordinate system:

gµν uµ uν ≡ uµ uµ = −1 , [notation: uµ ≡ gµν uν ]

INVERSE OF COORDINATE TRANSFORMATION

Consider a general coordinate system {xµ }. It should be clear that


(
∂xµ 1 if µ = ν
= δνµ = .
∂xν 0 otherwise
0
Now, given another general coordinate system {xµ }, we can use the chain rule to compute the partial derivative:
0
∂xµ ∂xµ ∂xα
= = δνµ . (1)
∂xν ∂xα0 ∂xν
0 0
Thus the inverse of the matrix whose elements are ∂xα /∂xα is the matrix whose elements are ∂xα /∂xα .

4-VECTORS

For now we define a 4-vector as a geometric object V whose components transform like those of a
4-velocity, i.e.
0
µ0 ∂xµ µ
V = V [general coordinate change].
∂xµ
In particular, under a Lorentz transformation between two ICS,
0 0
V µ = Λµ µ V µ [Lorentz transformation between two ICS].

4-vectors (and their generalization, tensors), represent observable, physical quantities.

Given a 4-vector with components V µ , we can define the object V with components

Vµ ≡ gµν V ν .

We will see later on that this object is a dual vector. Under a change of coordinates, we have
0
0 ∂xµ ∂xν ∂xν λ ∂xµ ∂xµ ∂xµ
Vµ0 = gµ0 ν 0 V ν = 0 0 gµν V = δλν µ0 gµν V λ = 0 gµλ V
λ
= Vµ ,
µ
∂x ∂x ν ∂xλ ∂x ∂xµ ∂xµ0
where I used Eq. (1) to simplify the terms in red. In general, we define a dual vector as a geometric object V
whose components transform as above. We will formalize these definitions in the upcoming lectures.
Although this is somewhat dated language, 4-vectors are sometimes referred to as contravariant vectors and
dual vectors as covariant vectors.
2

MEASUREMENTS IN OBSERVER’S REST-FRAME

Now suppose we know the components V µ = (V 0 , V ~ ) of V in an ICS {xµ }, and want to compute them in the
rest-frame of some observer, moving with 3-velocity ~v with respect to that ICS. Let us denote these components
µ
V(obs) 0
= (V(obs) ~(obs) ). They are related to V µ by a Lorentz boost. In particular, the 0 component is given by
,V

0 1  
~ .
V(obs) =√ V 0 − ~v · V
1 − v2

Now recall that, in the initial ICS, the observer’s 4-velocity has components uµ = √ 1 (1, ~v ). Thus we have
1−v 2

0
V(obs) = u0 V 0 − δij ui V j = −ηµν uµ V ν = −gµν uµ V ν = −uν V ν ,

where we used the fact that gµν = ηµν in an ICS. Now, the quantity gµν uµ V ν is coordinate-independent, as you
can easily explicitly check for yourself. Thus we found a coordinate-independent way to write the zero-th component
of V measured by an observer.

In the homework, you’ll be asked to write a similar geometric expression for the spatial components of V in the
observer’s rest frame.

METRIC INVERSE

Given a metric gµν in a general coordinate system, we define the inverse metric with the same symbol, g, but
with the indices up, g µν . Specifically, we define it through the relation
(
αµ α 1 if α = β
g gµβ = δβ = .
0 if α 6= β

If we put the components of the metric and inverse metric in square matrices, the above equation just states that the
product of the two matrices is the identity matrix. It should start being clear now that we only sum over repeated
indices if one of them is up and the other is down (though we make exceptions for purely spatial indices, in a cartesian
coordinate system). We will formalize the mathematical meaning of up and down indices in a couple of lectures.
Let’s look at some examples. First, in an ICS, gµν = ηµν = diag(−1, 1, 1, 1). It should be clear the the inverse of
this matrix has the same numerical components, so

η µν = diag(−1, 1, 1, 1).

Let us now consider spherical polar coordinates (x0 , x1 , x2 , x3 ) = (t, r, θ, ϕ), with line element

ds2 = −dt2 + dr2 + r2 dθ2 + r2 sin2 θdϕ2 .

In these coordinates the metric has components


 
g00 g01
 
−1 0 0 0
g10 g11
 0 1 0 0 
 
(gµν ) ≡  = 0 .

.. 0 r2 0 
 . 
0 0 0 r2 sin2 θ

The components of the inverse metric are just the inverse of the matrix above, i.e.
 00 01 
g g

−1 0 0 0
g 10 g 11
 0 1 0 0

(g µν ) ≡ 


.   0 0 1/r2 .
 . .  0 
0 0 0 1/(r2 sin2 θ)
3

Change of coordinates

Let us now write the definition of the inverse metric in primed coordinates:
0 0 0
g α µ gµ0 β 0 = δβα0 .
We already know how the metric components change. Inserting this, we have
0 0 ∂xµ ∂xβ α0
gα µ µ0 β 0 gµβ = δβ 0 .
∂x ∂x
Now multiply both sides and simplify – make sure to check at every step that there are the same free indices on
each side:
0 0
∂xµ ∂xβ ∂xβ
0 0 0 ∂x
β

µ
gα µ
0 β 0 α
gµβ = δβα0
∂x ∂x ∂x ∂xα
0
µ
0 0 ∂x ∂xα
gα µ δ β
g µβ =
∂xµ0 α ∂xα
0
µ
0 0 ∂x ∂xα
gα µ 0 gµα = .
∂xµ ∂xα
Let us now multiply by the inverse (unprimed) metric:
0
0 0 ∂xµ ∂xα αβ
gα µ 0 gµα g αβ
= g
∂xµ ∂xα
0
µ
0 0 ∂x ∂xα αβ
gα µ 0 δ β
µ = g
∂xµ ∂xα
0
β
0 0 ∂x ∂xα αβ
gα µ µ 0 = g .
∂x ∂xα
0
The last step is to multiply by ∂xβ /∂xβ . After simplification, this gives the transformation of the inverse metric
under changes of coordinates:
0 0
α0 β 0 ∂xα ∂xβ αβ
g = g .
∂xα ∂xβ
Again, you see that the priming-the-indices convention makes such transformation rules very easy to remember.
In the homework, you’ll be asked to check this relation explicitly for spherical polar coordinates.

FREE TRAJECTORIES AS PATHS OF MAXIMUM PROPER TIME

We know that in an ICS, particles not subject to any forces move along straight lines at constant velocity,
i.e. such that v i = dxi /dt = constant. This mplies v = constant, hence dt/dτ = constant, thus, dxµ /dτ = constant.
Let us show that these trajectories correspond to paths of maximum proper time.
We denote events along the trajectory by P (λ), where λ ∈ [0, 1] is a parameter (so the set of events {P (λ); λ ∈ [0, 1]}
is the worldline). Consider trajectories starting at a given event A = P (λ = 0) and ending at B = P (λ = 1), with a
timelike separation fron A. The total proper time along such a trajectory is, in an ICS,
Z 1
dxµ
Z
p
dλ −ηµν ẋµ ẋν , ẋµ ≡
p
τ= µ
−ηµν dx dx =ν .
0 dλ
Rewrite this as
Z 1
dλ L(xµ , ẋµ ),
p
τ= L≡ −ηµν ẋµ ẋν .
0

Let us now search for the trajectories extremizing τ . They are solutions of the Euler-Lagrange equations:

d ∂L ∂L
= . (2)
dλ ∂ ẋµ ∂xµ
4

x0
<latexit sha1_base64="(null)">(null)</latexit>

B
<latexit sha1_base64="(null)">(null)</latexit>
<latexit

P( )
<latexit sha1_base64="(null)">(null)</latexit>

x1
<latexit sha1_base64="(null)">(null)</latexit>

A
<latexit sha1_base64="(null)">(null)</latexit>
<latexit

FIG. 1. Worlines between two events A and B, separated by a timelike separation. In an ICS, a particle not subject to any
force moves on a straight line at constant velocity (in blue). We showed that this trajectory extremizes proper time. To show
that it maximizes it, consider the purple paths, which is light-like (i.e. null) everywhere, hence as a vanishing proper time.

Computing each side, we get


!
d ηµν ẋν
−p = 0.
dλ −ηαβ ẋα ẋβ

We recognize the denominator on the left-hand-side as dτ /dλ. The Euler-Lagrange equations thus imply

dxν
 
d
ηµν = 0.
dλ dτ
ν µ
This implies that ηµν dx µ dx
dτ is constant, hence that u = dτ is constant. Thus we have shown that the trajectories of
extremal proper time are straight lines of constant velocity in ICS. It is easy to show that this is in fact the maximum
proper time, as it is always possible to construct trajectories with null segments, hence with a zero proper time.

GEODESICS IN NON-INERTIAL COORDINATES

The conclusion we reached above is: freely-moving particles move along paths of maximum proper time.
Even though we used knowledge about free-trajectories in ICS to arrive at this conclusion, it is clearly meaningful
regardless of coordinates, given that proper time is coordinate independent. Let us now see what this translates to in
a general coordinate system {xµ }. The proper time is given by
Z 1
dλ L(xµ , ẋµ ), L ≡ −gµν ẋµ ẋν .
p
τ=
0

To compute the equation of motion in a general coordinate system, we look for extrema of τ , again using the Euler-
Lagrange equations (2). This time, the right-hand-side is no longer zero, as gµν (xα ) is a priori a function of the
non-inertial coordinates. We thus get, for any µ ∈ {0, 1, 2, 3},
!
gµν ẋν 1 ∂µ gρσ ẋρ ẋσ
 
d ∂X
− =− , notation: ∂µ X ≡ .
∂xµ
p p
dλ −gαβ ẋα ẋβ 2 −gαβ ẋα ẋβ

Note that there isn’t a 1/2 on teh right-hand-side because we need to take the derivative with respect to each of the
two ẋµ factors. There are lots of indices here; note that all of them are repeated (hence summed over) except for µ. It
5

is good practise to never have more than two indentical


p indices on each side to have absolutely no possible confusion
on what is summed over. Let’s now recall that −gαβ ẋα ẋβ = dτ /dλ, and rewrite the Euler-Lagrange equations as
ν ρ
dxσ
!
d gµν dxdλ ∂µ gρσ dx
dλ dλ

= dτ
.
dλ dλ dλ

Multiply both sides by dλ/dτ = 1/(dτ /dλ), and use the chain rule to simplify to

dxν dxρ dxσ


 
d 1
gµν = ∂µ gρσ . (3)
dτ dτ 2 dτ dτ
Let us now expand the left-hand-side:

dxν dxν d2 xν dxα dxν d2 xν


 
d d
gµν = (gµν ) × + gµν 2
= ∂α gµν + gµν ,
dτ dτ dτ dτ dτ dτ dτ dτ 2
where again we have used the chain rule to compute the derivative of the metric coefficients along the trajectory. We
thus arrive at
d2 xν dxρ dxσ dxα dxν
  ρ σ
1 1 dx dx
gµν 2
= ∂µ gρσ − ∂α gµν = ∂ µ gρσ − ∂ρ gµσ ,
dτ 2 dτ dτ dτ dτ 2 dτ dτ
where I used the fact that summation indices are dummy, in order to factorize the expression. Using the fact that
the last factor is symmetric in ρ, σ, we can moreover replace ∂ρ gµσ → 12 (∂ρ gµσ + ∂σ gµρ ), and arrive at

d2 xν 1 dxρ dxσ
gµν + (∂ρ gµσ + ∂σ gµρ − ∂ µ gρσ ) = 0.
dτ 2 2 dτ dτ
The last step is to multiply by g αµ , the inverse metric. After simplifying, this gives

d2 xα dxρ dxσ
2
+ Γα
ρσ =0, (4)
dτ dτ dτ

where Γα
ρσ is the Chirstoffel symbol:

1 αµ 1
Γα
ρσ ≡ g (∂ρ gµσ + ∂σ gµρ − ∂µ gρσ ) = g αµ (gµσ,ρ + gµρ,σ − gρσ,µ ) [notation: X...,ν ≡ ∂ν X... ]
2 2

There aren’t too many formulae that you should know “by heart” in physics, but this is one of them. Note that it
is symmetric in the two lower indices. We have not yet defined formally what a tensor is, but one thing you should
remember is that the Christoffel symbol is not a tensor.

Equation (4) is called the geodesic equation. This represents the equation of motion for a freely-moving particle
in an arbitrary, non-inertial coordinate system. What it fundamentally means is that, in an ICS, a free-moving particle
moves along a straight line with constant velocity, which also happens to be the path of maximum proper time. In a
non-ICS, this path is described by a more complicated equation, even though it remains “fundamentally” (i.e. in an
ICS) just a straight line traveled at constant velocity. In the homework, you’ll explicitly derive the geodesic equation
for spherical polar coordinates.

Other derivations and comments

• Let us first remark that we can arrive at the geodesic equation by extremizing the following integral (which does
not have a particular physical meaning):
dxµ
Z
I ≡ dτ gµν ẋµ ẋν , ẋµ ≡ .

Indeed, using the Euler-Lagrange equation with L = gµν ẋµ ẋν , we get precisely Eq. (3), from which the same steps
follow. This is a useful trick to derive the geodesic equation in an arbitrary coordinate system, without
6

first computing all the Christoffel symbols: just find the extrema of I using the Euler-Lagrange equation.

• Also note that we can derive this equation starting from the geodesic equation in a ICS, and then
changing coordinates. In an ICS {xµ }, the geodesic equation is just

d2 xµ
= 0.
dτ 2
Now, using
0
the chain rule, and the coordinate-independence of τ , we may rewrite this in terms of the general coordi-
nates xµ :
0
! 0 0 0
d ∂xµ dxµ ∂xµ d2 xµ ∂ 2 xµ dxµ dxν
0= = + .
dτ ∂xµ0 dτ ∂xµ0 dτ 2 ∂xµ0 ∂xν 0 dτ dτ
 0
  0

Recalling that the inverse of the matrix ∂xµ /∂xµ is the matrix ∂xµ /∂xµ , we get

0 0 0 0
d2 xα ∂xα ∂ 2 xµ dxµ dxν
+ = 0.
dτ 2 ∂xµ ∂xµ0 ∂xν 0 dτ dτ
This already has the form of the geodesic equation with
0
0 ∂xα ∂ 2 xµ
Γα
µ0 ν 0 ≡ .
∂xµ ∂xµ0 ∂xν 0
In the homework, you will show that this is indeed equal to the Christoffel symbol derived above.

• Once we introduce differential geometry and covariant derivatives, we will see that this equation can be written
in a geometric way as
D α dxα
u ≡ uβ ∇β uα = 0, uα ≡ .
Dτ dτ
The only difference between “full-on” GR and what we have been doing so far is that in general, ICS are only
defined locally. The Christoffel symbol is proportional to derivatives of the metric components. Thus, the minimum
requirements to locally recover the straight-line-at-constant-velocity free motion are not only gµν = ηµν , but also
∂α gµν = 0 – both of which are satisfied when we implicitly assume a global ICS, such that gµν = ηµν everywhere.

This gives us a definition of a locally inertial coordinate system: it is a coordinate system defined around a
“central” event, around which the metric components deviate from Minkowski only at second order in the
coordinate separation. In such a coordinate system, freely-moving particles travel locally (i.e. around the central
event) on straight lines at constant velocity.

You might also like