[2] On the theory of dynamic programming

Richard Bellman's paper discusses the theory of dynamic programming, focusing on mathematical problems involving sequences of operations aimed at optimizing outcomes. It presents existence and uniqueness theorems for functional equations related to maximizing yields or minimizing costs, along with specific examples and solutions. The work is connected to sequential analysis and references contributions from other researchers in the field.

Uploaded by

Mohammadreza Hadipour

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views4 pages

[2] On the theory of dynamic programming

Uploaded by

Mohammadreza Hadipour

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

716 MATHEMATICS: RICHARD BELLMAN PROC. N. A S.

ON THE THEORY OF DYNAMIC PROGRAMMING

By RICHARD BELLMAN
THE RAND CORPORATION, SANTA MONICA, CALIFORNIA
Cohnmunicated by J. von Neumann, June 5, 1952
1. Introduction.-We are interested in a class of mathematical problems
which arise in connection with situations which require that a bounded or
unbounded sequence of operations be performed for the purpose of achiev-
ing a desired result. Particularly important are the cases where each oper-
ation gives rise to a stochastic event, the result of which is applied to the
determination of subsequent operations.
Two fundamental problems encountered in situations of this type, in
some sense duals of each other, are those of maximizing the yield obtained
in a given time, or of minimizing the time or cost required to accomplish a
certain task.
In many cases, the problem of determining an optimal sequence of oper-
ations may be reduced to that of determining an optimal first operation.
The general class of functional equations generated by problems of this
nature has the form
(min.)
f(p) = {max.}(Tk(f)), (1.1)
kJ
where Tk is an operator. In many cases of interest, the operator has the
form
Tk(f) = gk(P) + hk(P)f(SkP), (1.2)
where Sk is a point transformation.
We shall first presetit some existence and uniqueness theorems pertaining
to the solutions of (1.1), and then present explicit solutions of some simple
functional equations of the form of (1.1).
As simple examples of problems which give rise to functional equations
of this form, we mention the following:
1. We are given the fact that one of N boxes contains a ball, with prob-
ability Pk that the ball is in the kth box. Let qk be the probability that on
examining the kth box we are unable to examine its contents, and tk be the
time consumed in one examination. What procedure minimizes the ex-
pected time required to locate the box containing the ball, and what pro-
cedure minimizes the expected time required to obtain the ball?
2. We are given a quantity x > 0 which may be divided into two parts,
y and x - y. From y we obtain a return of g(y) and from x - y a return
of h(x - y). In so doing we are left with a new quantity ay + b(x -y),
0 < a, b < 1, with which to continue the process. How does one pro-
VOL. 38, 1952 MA THEMA TICS: RICHARD BELLMAN 717

ceed in order to maximize the total return obtained in a finite, or un-

bounded, number of stages?
The theory of dynamic programming is intimately related to the theory
of sequential analysis due to Wald.3 Two papers by Arrow, Blackwell and
Cirshick,' and Arrow, Harris and Marschak2 also treat problems of similar
type.
2. Existence and Uniqueness Theorems.
THEOREM 1. Consider the equationt
f(p) = max.
1<k <n
(gk(P) + hk(P)f(SkP)), p e R, (2.1)
w I:ere we assume that
(a) If p e R, a region of n-dimensional space, then Skp e R. (2.2)
(b) gk(P)I < ci for p e R,
(c) hk(p)I < c2 < 1 for p e R,
(d) gk(P), hk(P) 2 0 for p e R.
Under these assumptions there is a unique bounded solution to (2.1).
THEOREM 2. Consider the equation
f(x) = max. [a(x1,X2, ...,XN) + f(b(x1, x2, X,XN)) ], (2.3)
R
N
where R = R(x) is defined by Xk > 0, Xk = x.
k 1
If
(a) a(x1, x2, ..., XN) is continuous over R(x) for 0 < x < xo, non-
negative, qnd a- (0, 0, ..., 0) = 0, (2.4.)
(b) b(x1, x2, ..., XN) is continuous and non-negative over R,
(c) b(x1, x2, ...,XN) < cx, < c < 1, in R(x),
(d) , h(clxo) < c, where h(x) = max. a(x1, x2, ... , XN),
I = O R
there is a unique continuous solution to (2.3) for which f(O) = 0 for 0 < x <
XO.
THEOREM 3. Consider the equation

f(p) min. + E Pkf(xk), p

=~ 1 + f(S1p)k } $X, (25
f(xo) = 0,
where l= 1, 2, ...,M, and
(Po \ /Po\ 0
P Pi , SIP PV t 2 Xk =(i.I (2.6)
N PN
718 MATHEMATICS: RICHARD BELLMAN PRoC. N. A. S.

the 1 occurring in the kth place. Each p and Sip is a probability vector,
N
Pk . 0, k -O, pt = 1, and f(p) is a scalar function of p.
If for each I it is true tha
N N

k=
E 1 Pkc
<1CE
k 1
P, =
O < cl < 1, (2.7)
there is a unique bounded positive solution to (2.5).
The proof in all three cases employs the method of successive approxi-
mations. The equation in (2.5) occurs in connection with problems similar
to problem 1 above.
3. Solutions of Some Particular Functional Equations.-In this section
we indicate the solution of some simple cases of the general equations dis-
cussed above.
THEOREM 4. The solution of
f(x, y) = max. + f(x,s2Y)]' x, Y . 0 (3.1)
where 0 < PI, P2, su, s2 < 1, r1, r2 > 0, is given by
f(x, y) = Pi [rix + f(sux, y) I for >pi p2r2y
1 PI1-P2
= p2[r2y + f(x, s2y)] for 1rp
1-PI
< p2_ p
P2
(3.2)
If s5m = 52i, f(x, y) is piecewise linear.
This result may be extended in many ways.
THEOREM 5. The solution of
f(x) =
o
max.
%y .x
[g(y) + h(x - y) + f(ay + b(x - y)], (3.3)
where 0 < a, b < 1, may be reduced to that of
f(x) = max. [g(x) +f(ax), h(x) +f(bx)I, (3.4)
in 0 < x < xo, if g and h are monotonically increasing functions such that
g(O) = h(O) = 0, g", h' > 0 in [0, xo].
If g', h' < 0 the situation is much more complicated, and no such simple
result such as (3.4) holds in general. The solution of (3.4) may be obtained
explicitly and is similar in structure to that of (3.1) above. This func-
tional equation arises from problem 2.
THEOREM 6. The solution of
f(P P2, ...PN) = min. [ k +
k qk

(l Pk)f o,Ps***s°s'*'w1 p) (3.5)

VOL. 38, 1952 MATHEMATICS: RICHARD BELLMAN 719

the zero occurring in the kth place, where

f(O.* ... °.O Pk, ...*P) (3.6)

for pk > O,k = 1,2, ...,N, is given by

f(P1, P2p ..
PN) = 1 + (1-P&) X

(-Pi I
of' ..
I PN) (3 7)

if k is the index for which pj(l - ql)/tl is a maximum.

* This is the solution to problem 1 above in the case where we wish to ob-
tain the ball. If we want merely to locate the ball the solution is more com-
plicated. In this case we either examine the box for which PA(1 -qk)/tk
is a maximum first, or we never examine it.
THEOREM 7. The solution oft

f(x) = 1+ min.{(f(ax)} > x > O, O < a < 1, (3.8)

f(O) = O,
is
f(x)= 1 + xf(1), x <XQ
- 1 + f(ax), x > xo,
where xO = (1e-)a)/(k + 1)(1 - a), and k is the integer at which (y + 1)!
(1-aV) is a minimum for y = 1, 2,.
Detailed proofs and further results will appear in another publication.
t Results on the existence of solutions of (2.1) were obtained by S. Karlin and H. N.
Shapiro in an unpublished paper.
t The solution of (3.1) was obtained in conjunction with M. Shiffman, while that of
(3.8) was obtained in conjunction with D. Blackwell.
'Arrow, K. J., Blackwell, D., and Girshick, M. A., "Bayes and Minimax Solutions of
Sequential Decision Problems," Econometrica, 17, 214-244 (1949).
2 Arrow, K. J., Harris, T. E., and Marschak, J., Optimal Inventory Policy, Cowles
Commission Papers, New Series, No. 44, 1951.
3 Wald, A., Statistical Decision Functions, John Wiley & Sons, New York, 1950.

Dns PDF
100% (1)
Dns PDF
22 pages
Understanding Applied Linguistics
No ratings yet
Understanding Applied Linguistics
38 pages
Open Focus Dr. Lester G. Fehmi Ph.D. & George Fritz Ed.D.
100% (1)
Open Focus Dr. Lester G. Fehmi Ph.D. & George Fritz Ed.D.
7 pages
PSYC 1205 Discussion Forum Unit 5
No ratings yet
PSYC 1205 Discussion Forum Unit 5
2 pages
Explicit Analysis RADIOSS Ebook PDF
100% (2)
Explicit Analysis RADIOSS Ebook PDF
438 pages
Phoenix Xt150 Ps
100% (1)
Phoenix Xt150 Ps
2 pages
Dynamic Programming Problem
No ratings yet
Dynamic Programming Problem
12 pages
ODD PERFECT NUMBERS, DIOPHANTINE EQUATIONS, AND UPPER
No ratings yet
ODD PERFECT NUMBERS, DIOPHANTINE EQUATIONS, AND UPPER
17 pages
P550
No ratings yet
P550
27 pages
Bellman Routingproblem 1958
No ratings yet
Bellman Routingproblem 1958
5 pages
knapsack
No ratings yet
knapsack
2 pages
Ineq Lagrange PDF
100% (1)
Ineq Lagrange PDF
7 pages
Solutions To The 83rd William Lowell Putnam Mathematical Competition Saturday, December 3, 2022
No ratings yet
Solutions To The 83rd William Lowell Putnam Mathematical Competition Saturday, December 3, 2022
7 pages
2 Growth Neoclassical Growth
No ratings yet
2 Growth Neoclassical Growth
71 pages
Take-Home Test 1 Solutions: 6.243J (Fall 2003) : Dynamics of Nonlinear Systems by A. Megretski
No ratings yet
Take-Home Test 1 Solutions: 6.243J (Fall 2003) : Dynamics of Nonlinear Systems by A. Megretski
5 pages
2014 RI (Yr6) H3 Math Prelim (Student Solutions)
No ratings yet
2014 RI (Yr6) H3 Math Prelim (Student Solutions)
9 pages
SLchapt 3
No ratings yet
SLchapt 3
10 pages
Existence and Uniqueness of A Positive Steady State Solution For A Logistic System of Differential Difference Equations
No ratings yet
Existence and Uniqueness of A Positive Steady State Solution For A Logistic System of Differential Difference Equations
12 pages
An Algorithm For Minimax Solution of Overdetennined Systems of Non-Linear Equations
No ratings yet
An Algorithm For Minimax Solution of Overdetennined Systems of Non-Linear Equations
8 pages
Solutions of Second Order Ordinary Differential Equations
No ratings yet
Solutions of Second Order Ordinary Differential Equations
9 pages
Solutions
No ratings yet
Solutions
77 pages
dynamic programming
No ratings yet
dynamic programming
9 pages
Partial Exam 23 Nov 2011
No ratings yet
Partial Exam 23 Nov 2011
7 pages
applied mathematics
No ratings yet
applied mathematics
11 pages
Optimization
No ratings yet
Optimization
47 pages
1. PYQ Solutions
No ratings yet
1. PYQ Solutions
174 pages
The Methods of Solution For Constrained Nonlinear Programming
No ratings yet
The Methods of Solution For Constrained Nonlinear Programming
6 pages
12 International Mathematics Competition For University Students
No ratings yet
12 International Mathematics Competition For University Students
4 pages
Dynamic Programming: Quantitative Macroeconomics (Econ 5725)
No ratings yet
Dynamic Programming: Quantitative Macroeconomics (Econ 5725)
55 pages
Bellman's Equations: T t+1 T T 0 T T t+1 T T, T
No ratings yet
Bellman's Equations: T t+1 T T 0 T T t+1 T T, T
7 pages
Sydsaeter Odd Answers
No ratings yet
Sydsaeter Odd Answers
92 pages
Cbse Sample Paper Class 12 2024 Maths
No ratings yet
Cbse Sample Paper Class 12 2024 Maths
27 pages
Bellman Equation in Dynamic Programming
No ratings yet
Bellman Equation in Dynamic Programming
3 pages
EC744 Lecture Note 3 Dynamic Programming Under Certainty: Prof. Jianjun Miao
No ratings yet
EC744 Lecture Note 3 Dynamic Programming Under Certainty: Prof. Jianjun Miao
17 pages
Introduction To Stochastic Optimization-2
No ratings yet
Introduction To Stochastic Optimization-2
15 pages
Imc2000 2
No ratings yet
Imc2000 2
4 pages
2000 2 PDF
No ratings yet
2000 2 PDF
4 pages
DP_Bellman_1741339134 2025-03-07 09_19_05
No ratings yet
DP_Bellman_1741339134 2025-03-07 09_19_05
13 pages
Spring 2005 Solutions
No ratings yet
Spring 2005 Solutions
11 pages
Dynamic Programming Value Iteration
100% (1)
Dynamic Programming Value Iteration
36 pages
1995
No ratings yet
1995
11 pages
MME Odd Solutions
No ratings yet
MME Odd Solutions
76 pages
International Competition in Mathematics For Universtiy Students in Plovdiv, Bulgaria 1995
No ratings yet
International Competition in Mathematics For Universtiy Students in Plovdiv, Bulgaria 1995
11 pages
MIT Exercises
No ratings yet
MIT Exercises
11 pages
MATH1009_2021pp
No ratings yet
MATH1009_2021pp
10 pages
Isi Msqe 2008
No ratings yet
Isi Msqe 2008
13 pages
Solutions For Problems in The 9 International Mathematics Competition For University Students
No ratings yet
Solutions For Problems in The 9 International Mathematics Competition For University Students
5 pages
Solutions Manual
No ratings yet
Solutions Manual
94 pages
ugb2_solutions
No ratings yet
ugb2_solutions
4 pages
Min/Max Problems and Inequalities
No ratings yet
Min/Max Problems and Inequalities
3 pages
Solutions Sundaram
No ratings yet
Solutions Sundaram
22 pages
Dyn Part I
No ratings yet
Dyn Part I
38 pages
MR 4 2023 Functional Equations
No ratings yet
MR 4 2023 Functional Equations
9 pages
EE364a Homework 3 Solutions: 0 N 0 1 N N 1 1 N N 0 0
No ratings yet
EE364a Homework 3 Solutions: 0 N 0 1 N N 1 1 N N 0 0
19 pages
A_penalty_method_for_deriving_neces
No ratings yet
A_penalty_method_for_deriving_neces
21 pages
Isi 2016
No ratings yet
Isi 2016
16 pages
On The Quantitative Subspace Theorem: Journal of Mathematical Sciences August 2010
No ratings yet
On The Quantitative Subspace Theorem: Journal of Mathematical Sciences August 2010
27 pages
[9783110426045 - An Introduction to Nonlinear Optimization Theory] 3 the Study of Smooth Optimization Problems
No ratings yet
[9783110426045 - An Introduction to Nonlinear Optimization Theory] 3 the Study of Smooth Optimization Problems
39 pages
Optimal Recovery of Operator Sequences: V. F. Babenko, N. V. Parfinovych, D. S. Skorokhodov October 19, 2021
No ratings yet
Optimal Recovery of Operator Sequences: V. F. Babenko, N. V. Parfinovych, D. S. Skorokhodov October 19, 2021
21 pages
imo-2015-sl
No ratings yet
imo-2015-sl
12 pages
Computer Science
No ratings yet
Computer Science
38 pages
CS30053 Foundations of Computing, Spring 2004
No ratings yet
CS30053 Foundations of Computing, Spring 2004
1 page
Differential Forms
From Everand
Differential Forms
Henri Cartan
5/5 (2)
Algebraic Equations
From Everand
Algebraic Equations
Demetrios P. Kanoussis
No ratings yet
A Short Course in Discrete Mathematics
From Everand
A Short Course in Discrete Mathematics
Edward A. Bender
3/5 (1)
Transformation of Axes (Geometry) Mathematics Question Bank
From Everand
Transformation of Axes (Geometry) Mathematics Question Bank
Mohmmad Khaja Shareef
3/5 (1)
[13] Robust MPC with recursive model update
No ratings yet
[13] Robust MPC with recursive model update
11 pages
[16] Robust controller design for systems with probabilistic uncertain parameters using multi-objective genetic programming
No ratings yet
[16] Robust controller design for systems with probabilistic uncertain parameters using multi-objective genetic programming
17 pages
Meta-device: advanced manufacturing
No ratings yet
Meta-device: advanced manufacturing
16 pages
SKF FFT
100% (1)
SKF FFT
32 pages
Hermman 1967
No ratings yet
Hermman 1967
6 pages
Corner Sort For Pareto-Based Many-Objective Optimization: Handing Wang, Xin Yao
No ratings yet
Corner Sort For Pareto-Based Many-Objective Optimization: Handing Wang, Xin Yao
11 pages
Accepted Manuscript
No ratings yet
Accepted Manuscript
27 pages
Optimizacion de Consignas PDF
No ratings yet
Optimizacion de Consignas PDF
15 pages
Approximate ENS
No ratings yet
Approximate ENS
43 pages
Mbabaei@znu - Ac.ir: Nsga-Ii
No ratings yet
Mbabaei@znu - Ac.ir: Nsga-Ii
7 pages
Assy User Manual SAP
No ratings yet
Assy User Manual SAP
38 pages
MATH 4 - QUARTER 3 - LESSON 2 - Draw Parallel, Intersecting and Perpendicular Lines
No ratings yet
MATH 4 - QUARTER 3 - LESSON 2 - Draw Parallel, Intersecting and Perpendicular Lines
26 pages
ISEE Practice Test
No ratings yet
ISEE Practice Test
49 pages
7BEd ET2017 Urdu 20200528 192716 20200530 080120
No ratings yet
7BEd ET2017 Urdu 20200528 192716 20200530 080120
119 pages
Basic Electric Circuit Analysis-D E Johnson J L Hi
No ratings yet
Basic Electric Circuit Analysis-D E Johnson J L Hi
3 pages
Probability and Statistics Book Solutions
50% (4)
Probability and Statistics Book Solutions
100 pages
Stirling Engine
100% (7)
Stirling Engine
4 pages
The Big Idea Canvas
No ratings yet
The Big Idea Canvas
2 pages
Full Plaintext Recovery Attack On Broadcast RC4
No ratings yet
Full Plaintext Recovery Attack On Broadcast RC4
24 pages
ETHICS - Lesson 2
No ratings yet
ETHICS - Lesson 2
40 pages
Cartesian Coordinate System
No ratings yet
Cartesian Coordinate System
4 pages
Sustainable Design at HOK - Colombia
No ratings yet
Sustainable Design at HOK - Colombia
63 pages
Design of Bore Well Rescue System Using Morphological Chart: December 2016
No ratings yet
Design of Bore Well Rescue System Using Morphological Chart: December 2016
6 pages
Amir Peer Review
No ratings yet
Amir Peer Review
2 pages
Diesel Storage Tank
100% (2)
Diesel Storage Tank
13 pages
03-Advances in Design With Hollow Structural Steel Members 2019
No ratings yet
03-Advances in Design With Hollow Structural Steel Members 2019
12 pages
(Van Dijk) Discourse Analysis Its Development and Application
100% (2)
(Van Dijk) Discourse Analysis Its Development and Application
23 pages
AX Robot Controller
100% (1)
AX Robot Controller
2 pages
I3TD
No ratings yet
I3TD
43 pages
Geriatric Nursing Week2
No ratings yet
Geriatric Nursing Week2
45 pages
ViewStudentResult
No ratings yet
ViewStudentResult
1 page
Pue Karnataka I Puc Basic Maths Model Question Paper 2022
No ratings yet
Pue Karnataka I Puc Basic Maths Model Question Paper 2022
5 pages
CHAPTER 2 Human Acts Act of Man
No ratings yet
CHAPTER 2 Human Acts Act of Man
3 pages
Materials (Week 4 - 5 and 6)
No ratings yet
Materials (Week 4 - 5 and 6)
18 pages

[2] On the theory of dynamic programming

Uploaded by

[2] On the theory of dynamic programming

Uploaded by

716 MATHEMATICS: RICHARD BELLMAN PROC. N. A S.

ON THE THEORY OF DYNAMIC PROGRAMMING

ceed in order to maximize the total return obtained in a finite, or un-

f(p) min. + E Pkf(xk), p

(l Pk)f o,Ps***s°s'*'w1 p) (3.5)

the zero occurring in the kth place, where

f(O.* ... °.O Pk, ...*P) (3.6)

for pk > O,k = 1,2, ...,N, is given by

if k is the index for which pj(l - ql)/tl is a maximum.

f(x) = 1+ min.{(f(ax)} > x > O, O < a < 1, (3.8)

You might also like