[2] On the theory of dynamic programming
[2] On the theory of dynamic programming
the 1 occurring in the kth place. Each p and Sip is a probability vector,
N
Pk . 0, k -O, pt = 1, and f(p) is a scalar function of p.
If for each I it is true tha
N N
k=
E 1 Pkc
<1CE
k 1
P, =
O < cl < 1, (2.7)
there is a unique bounded positive solution to (2.5).
The proof in all three cases employs the method of successive approxi-
mations. The equation in (2.5) occurs in connection with problems similar
to problem 1 above.
3. Solutions of Some Particular Functional Equations.-In this section
we indicate the solution of some simple cases of the general equations dis-
cussed above.
THEOREM 4. The solution of
f(x, y) = max. + f(x,s2Y)]' x, Y . 0 (3.1)
where 0 < PI, P2, su, s2 < 1, r1, r2 > 0, is given by
f(x, y) = Pi [rix + f(sux, y) I for >pi p2r2y
1 PI1-P2
= p2[r2y + f(x, s2y)] for 1rp
1-PI
< p2_ p
P2
(3.2)
If s5m = 52i, f(x, y) is piecewise linear.
This result may be extended in many ways.
THEOREM 5. The solution of
f(x) =
o
max.
%y .x
[g(y) + h(x - y) + f(ay + b(x - y)], (3.3)
where 0 < a, b < 1, may be reduced to that of
f(x) = max. [g(x) +f(ax), h(x) +f(bx)I, (3.4)
in 0 < x < xo, if g and h are monotonically increasing functions such that
g(O) = h(O) = 0, g", h' > 0 in [0, xo].
If g', h' < 0 the situation is much more complicated, and no such simple
result such as (3.4) holds in general. The solution of (3.4) may be obtained
explicitly and is similar in structure to that of (3.1) above. This func-
tional equation arises from problem 2.
THEOREM 6. The solution of
f(P P2, ...PN) = min. [ k +
k qk
f(P1, P2p ..
PN) = 1 + (1-P&) X
(-Pi I
of' ..
I PN) (3 7)