CM 08 ActionEndPts
CM 08 ActionEndPts
Introduction
Following Landau, we'll first find how the action integral responds to incremental changes in the
endpoint coordinates and times, then use the result to write the action integral itself in a new, more
intuitive way. This new formulation shows very directly the link to quantum mechanics, and variation of
the action in this form gives Hamilton's equations immediately.
S δ q + ∫
δ= − δ qdt
∂q t1 t1 ∂q dt ∂q
(Recall that first term comes from the calculus of variations when we allow the end point to vary -- it’s exactly the
same point we previously discussed in the brachistochrone problem of fastest time for a given horizontal distance,
allowing the vertical position of the endpoint to be a free parameter.)
With the incremental variation, we’ve gone from the physical path P (followed by the system in
configuration space from q ( ) , t1 to q ( ) , t2 ) to a second path P′ beginning at the same place and time,
1 2
Both paths P, P′ are fully determined by their initial and final positions and times, so P, P′ must
correspond to slightly different initial velocities. The important point is that since both paths describe
the natural dynamical development of the system from the initial conditions, the system obeys the
equations of motion at all times along both paths, and therefore the integral term in the above equation
is identically zero.
2
δ q ( t2 ) δ=
Writing = q, p ( ) p the action, regarded as a function of the final position variable, with the
2
t2
∂L
δS =( (2)
q , t2 ∂= )
δ q
q t1
(2)
p= δ q( 2) pδ q.
For the multidimensional case, the incremental change in the action on varying the final position
variable is given by (dropping the superscript)
∂S / ∂qi =pi ,
( )
t2
(2)
∫
Since S = Ldt , the total time derivative dS / dt2 = L q , t2 , the value of the Lagrangian at the
t1
endpoint. Remember we are defining the action at a point as that from integrating along the true path
up to that point.
Landau denotes t2 by just t , so he writes dS / dt = L , and we’ll be doing this, but it’s crucial to keep in
mind that the endpoint position and time are the variables here!
If we now allow an incremental time increase, t2 → t2 + dt , with the final coordinate position as a free
parameter, the dynamical path will now continue on, to an incrementally different finishing point.
This will give (with t understood from now on to mean t2 , and qi means qi( ) )
2
dS ( qi , t ) ∂S ∂S ∂S
= +∑ qi = + ∑ pi qi .
dt ∂t i ∂qi ∂t i
Putting this together with dS / dt = L gives immediately the partial time derivative
∂S / ∂t =L − ∑ pi qi =− H
i
and therefore, combining this with the result ∂S / ∂qi =pi from the previous section,
( qi , t )
dS= ∑ p dq − Hdt.
i
i i
This, then, is the total differential of the action as a function of the spatial and time coordinates of the
end of the path.
3
The initial and final coordinates and times specify the action and the time development of the system
uniquely.
(Note: We’ll find this equation again in the section on canonical transformations -- the action will be seen there to
be the generating function of the time-development canonical transformation, this will become clear when we get
to it.)
q( ) , t2
2
( (2)
S qi , t2 , qi , t1 = (1)
) ∫ Ldt .
q( ) , t1
1
However, the expression derived in the last section for the increment of action generated by an
incremental change in the path endpoint is clearly equally valid for the contribution to the action from
some interior increment of the path, say from ( q, t ) to ( q + dq, t + dt ) , so we can write the total action
integral as the sum of these increments:
S ( q=
i,t) dS ∫ ∑ p dq − Hdt .
∫= i i
i
In this integral, of course, the dqi add up to cover the whole path.
(In writing ( qi , t ) we’re following Landau’s default practice of taking the action as a function of the final
endpoint coordinates and time, assuming the beginning point to be fixed. This is almost always fine—
we’ll make clear when it isn’t.)
ψ ( x, t ) A=
= ( x, t ) ei(kx −ωt ) A ( x, t ) e(i / )( px − Et )
where the amplitude function A ( x, t ) only varies over distances much greater than the wavelength, and
times far longer than the oscillation period. This expression is valid in almost all the classically accessible
4
regions, invalid in the neighborhood of turning points, but the size of those neighborhoods goes to zero
in the classical limit.
As we’ve discussed earlier, in the Dirac-Feynman formulation of quantum mechanics, to find the
probability amplitude of a particle propagating from one point to another, we add contributions from all
possible paths between the two points, each path contributing a term with phase equal to i / times
the action integral along the path.
From the semi-classical Schrodinger wave function above, it’s clear that the change in phase from a
small change in the endpoint is ( i / )( pdx − Edt ) , coinciding exactly with the incremental
contribution to the action in=
S dS ∫ ∑ p dq − Hdt .
∫= i i
i
So again we see, here very directly, how the action along a classical path is a multiple of the quantum
mechanical phase change along the path.
We’ll prove this for a one dimensional system, it’s trivial to go to many variables, but it clutters up the
equations.
δS
= ∫ δ p {dq − ( ∂H / ∂p ) dt} + [ pδ q] − ∫ δ q {dp + ( ∂H / ∂q )=
dt} 0.
The path variations δ p, δ q are independent and arbitrary, so must have identically zero coefficients—
Hamilton’s equations follow immediately, q =
∂H / ∂p, p = −∂H / ∂q.
Again, it’s worth emphasizing the close parallel with quantum mechanics: Hamilton’s equations written
using Poisson brackets are:
=q [=
H , q ] , p [ H , p ] .
In quantum mechanics, the corresponding Heisenberg equations of motion for position and momentum
operators in terms of commutators are
5
=q / i ) [ H , q ] , p (1 / i ) [ H , p ] .
(1=
How Can p, q Really Be Independent Variables?
It may seem a little odd at first that varying p, q as independent variables leads to the same equations as
the Lagrangian minimization, where we only varied q, and that variation “locked in” the variation of q.
And, isn’t p defined in terms of q, q by p =∂L / ∂q , which is some function of q, q ? So wouldn’t
varying q automatically determine the variation of p ?
The answer is, no, p is not defined as p =∂L / ∂q from the start in Hamilton’s formulation. In this
Hamiltonian approach, p, q really are taken as independent variables, then varying them to find the
minimum path gives the equations of motion, including the relation between p and q, q .
This comes about as follows: Along the minimum action path, we just established that
dH ( p,=
q ) qdp
− pdq
.
dL ( q=
, q ) pdq + pdq
,