optimal control
optimal control
• (1.1.2)
• Note that
• 1. the input u(t) to the plant is determined by the error e(t) and the
compensator, and
• 2. all the variables are not readily available for feedback. In most cases
only one output variable is available for feedback.
• The modern control theory concerned with multiple inputs and multiple
outputs (MIMO) is based on state variable representation in terms of a set of
first order differential (or difference) equations. Here, the system (plant) is
characterized by state variables, say, in linear, time invariant form as
• Referring to Figure 1.2, we are interested in finding the optimal control u*(t)
(* indicates optimal condition) that will drive the plant P from initial state to
final state with some constraints on controls and states and at the same time
extremizing the given performance index J.
The formulation of optimal control problem requires :
• or in general,
• here, R is a positive definite matrix and prime (') denotes transpose here
and throughout this book (see Appendix A for more details on definite
matrices).
• Similarly, we can think of minimization of the integral of the squared
error of a tracking system.
• We then have,
• where, Xd(t) is the desired value, xa(t) is the actual value, and
x(t) = xa(t) - Xd(t), is the error.
• Here, Q is a weighting matrix, which can be positive semi-definite.
4.Performance Index for Terminal Control System:
• In a terminal target problem, we are interested in minimizing the error
between the desired target position Xd (tf) and the actual target position Xa
(tf) at the end of the maneuver or at the final time tf.
• The terminal (final) error is x ( tf) = Xa ( tf) - Xd ( tf ). Taking care of
positive and negative values of error and weighting factors, we structure the
cost function as
• Which is also called the terminal cost function. Here, F is a positive semi-
definite matrix.
6.Performance Index for General Optimal Control System:
• where, +, and - indicate the maximum and minimum values the variables
can attain.
• We are interested in minimizing the error of the system; therefore, when
the desired state vector is represented as x(J = 0, we are able to consider
the error as identically equal to the value of the state vector.
• That is, we intend the system to be at equilibrium, x = x^ = 0, and any
deviation from equilibrium is considered an error.
• Therefore, in this section, we will consider the design of optimal control
systems using state variable feedback and error-squared performance
indices.
• The control system we will consider is shown in Figure 11.14 and can be
represented by the vector differential equation x = Ax + Bu.
• (11.27)
• We will select a feedback controller so that u is some function of the
measured state variables x and therefore u = -k(x).
• The choice of the control signals is somewhat arbitrary and depends partially
on the actual desired performance and the complexity of the feedback
structure allowable.
• Often, we are limited in the number of state variables available for feedback,
since
we are only able to use measurable state variables.
• In our case, we limit the feedback function to a linear function so that u = —
Kx, where K is an m X n matrix.
• Therefore, in expanded form, we have
where H is the n X n matrix resulting from the addition of the elements of A and - BK.
Now, returning to the error-squared performance index, we recall from Section 5.7 that
the index for a single state variable, Xn, is written as
Since we wish to define the performance index in terms of an integral of the sum of the
state variables squared, we will use the matrix operation
• Then the specific form of the performance index, in terms of the state
vector, is
• The general form of the performance index (Equation 11.26)
incorporates a term with u that we have not included at this point, but
we will do so later in this section.
• Again considering above Equation, we will let the final time of
interest be tf = oo.
• To obtain the minimum value of J, we postulate the existence of an
exact differential so that
• Where P is to be determined. A symmetric P matrix will be used to
simplify the algebra without any loss of generality.
• Then, for a symmetric P matrix, Pij=Pji
• Completing the differentiation indicated on the left-hand side of above
Equation , we have
• In the evaluation of the limit at t - oo, we have assumed that the system is
stable, and hence x(oo) = 0, as desired.
• Therefore, to minimize the performance index J, we consider the two equations