Cong 2009
Cong 2009
Abstract—A mix locally recurrent neural network was used output (SISO) plants, satisfactory control can be achieved by
to create a proportional-integral-derivative (PID)-like neural net- using established tuning rules. These rules can be applied to
work nonlinear adaptive controller for uncertain multivariable multivariable plants with SISO characteristics as well. Many
single-input/multi-output system. It is composed of a neural net-
work with no more than three neural nodes in hidden layer, and multivariable plants, particularly the single-input/multi-output
there are included an activation feedback and an output feedback, (SIMO) plants, however, show significant internal interaction.
respectively, in a hidden layer. Such a special structure makes the Since the tuning rules are developed for SISO system, their
exterior feature of the neural network controller able to become a applications to such “true” multivariable plants are ineffective.
P, PI, PD, or PID controller as needed. The closed-loop error be- Furthermore, in the case of multivariable plants, the number of
tween directly measured output and expected value of the system
is chosen to be the input of the controller. Only a group of initial PID parameters becomes quite a lot. Trial and error techniques
weights values, which can run the controlled closed-loop system are thus inadequate to derive a good compromise between
stably, are required to be determined. The proposed controller can controller performance and robustness. Second, assume that the
update weights of the neural network online according to errors controller has a PID structure, and find the PID parameters by
caused by uncertain factors of system such as modeling error and using some well-known optimization methods, e.g., H∞ [5],
external disturbance, based on stable learning rate. The resilient
back-propagation algorithm with sign instead of the gradient is mixed H2 /H∞ [6], and semidefinite programming approaches
used to update the network weights. The basic ideas, techniques, [7]. These methods can be used to obtain the PID controller
and system stability proof were presented in detail. Finally, actual parameters such that the controllers have good time-domain
experiments both of single and double inverted pendulums were performance and frequency-domain robustness. The main prob-
implemented, and the comparison of effectiveness between the lem with this approach is that the resulting controllers are state-
proposed controller and the linear optimal regulator were given.
space controllers of high-order rather than low-order controllers
Index Terms—Neural network, nonlinear adaptive control, with a fixed structure. Although one can reduce or approximate
proportional-integral-derivative (PID), single-input/multi-output it with a PID-like structure, it is not so far the reduced-order
(SIMO), uncertain multivariable system.
controller.
With advantages like the abilities of adaptive, parallel, and
I. I NTRODUCTION
fault tolerance, artificial neural networks plays an important
which commonly appear in real control applications to get = w1j (k)ej (k) + a1 (k − 1) (1)
the better control performance by putting extra sensors. Some j=1
⎛ ⎞
results on performance limitations of a single-input two-output
s
s
system, special form of the SIMO system, can be found in [17]. a2 (k) = f ⎝ w2j (k)ej (k)⎠ = w2j (k)ej (k) (2)
Early in 2005, Cong et al. [18], [19] proposed a novel multivari- j=1 j=1
able PID-like neural network control scheme for SIMO system, ⎛ ⎞
and then discussed its design method in 2006, but the sufficient
s
s
stability analysis of the close-loop system is still lacking. a3 (k) = f ⎝ w3j (k)ej (k) − w3j (k − 1)ej (k − 1)⎠
The main contribution of this paper is that based on a mix j=1 j=1
On the one hand, the basic idea of PID is that the control
action u should be proportional to the error, the integral of the
error over time, and the temporal derivative of the error. From
(4)–(6), one can see that when the neural controller adjusts
input/output relation of the system, it shows the characteristic
of proportional (the second node a2 ) + differential (the third
node a3 ) + integral (the first node a1 ), and it becomes actually a
PID-like controller. The conventional PID controller is usually
only suitable for the system with SISO system and without
consideration of environmental and disturbing factors, while Fig. 2. Structure of the control system with network controller proposed.
the PID-like neural network controller proposed in this paper is
hoped to be suitable for SIMO system and can online regulate The output error of closed-loop system is defined as
controller’s parameters adaptively under the effect of uncertain
and unmodeled factors. ej (k) = rj (k) − yj (k), j = 1, 2, . . . , s. (8)
On the other hand, conventional neural networks can have
It is assumed that yj (k) (j = 1, 2, . . . , s) are output variables
several layers. There are usually two main types of multilayer
of system that are measurable directly by sensors, where j is
networks: feedforward and recurrent ones, in which, recurrent
the number of measurable output variables, and rj (k), (j =
networks have at least one feedback loop. This means an output
1, 2, . . . , s) are given command inputs. Then, the cost function
of a layer feeds back to any proceeding layer. This gives the
to output yj (k) at k is defined as
network partial memory. This makes recurrent networks pow-
erful in approximating function depending on time. The neural 1 1
network controller proposed has two layers, in which the hidden Jj (k) = (rj (k) − yj (k))2 = (ej (k))2 . (9)
2 2
layer play does a multi-input PID function so it has the explicit
signification of PID, while the output layer plays the rule of The total cost function J(k) at k is
sum, so in the controller proposed the weights of output layer
s
are set as J(k) = Jj (k). (10)
j=1
wi = 1, (i = 1, 2, 3). (7)
Equation (10) is the final cost function used in every sam-
pling period.
Although two layers of the network controller here use both Without loss of generality, the rules of updating weights are
the linear activation functions, the mix local recurrent network obtained by means of minimizing the cost function J(k). Here,
used in hidden layer makes the PIDNNC have the properties we adopt the method of gradient descent to obtain updating
there do not exist in the general linear networks or conventional rules. The rule of updating weights in the hidden layer is
local recurrent networks.
⎛ ⎞
Just because it is composed of a neural network with no more s
∂J(k) ∂Jj (k) ∂yj (k) ⎠ ∂u(k)
than three neural nodes in hidden layer, and in the same hidden =⎝
layer there are an activation feedback and an output feedback, ∂wij (k) j=1
∂yj (k) ∂u(k) ∂wij (k)
respectively, such a special structure of the proposed controller ⎛ ⎞
makes it a P, PI, PD, or PID controller as needed. In such a
s
∂y (k)
= −⎝ ⎠ ej (k).
j
way, the proposed controller has both the property of simplicity ej (k) (11)
j=1
∂u(k)
of PID control and, at the same time, the neural networks’
powerful capability of self-learning.
Equation (11) is obtained as follows: From (4), one can see that
u is independent
of j, so ∂u(k)/∂wij (k) can be placed the out
side of the sj=1 (•), and for a fixed i, then according to (1)–(3):
III. U PDATING W EIGHTS R ULES OF P IDNNC AND I TS
One has (∂u(k)/∂uij (k)) = (∂ai (k)/∂wij (k)) − ej (k).
I NITIAL W EIGHTS D ESIGN
Considering the complexity and uncertainty of functional
The structure of closed-loop control system with the neural relation between the output yj (k) and the input u(k), the
controller proposed is shown in Fig. 2. Like general controllers, partial derivative ∂yj (k)/∂u(k) is approximated by using the
the goal of the control system with the PIDNNC is to minimize concept of difference, that is as follows: (∂yj (k)/∂u(k)) ≈
the error between the given command input and the output of (Δyj (k)/Δu(k)) = ((yj (k) − yj (k −1))/(u(k) − u(k − 1))).
closed-loop system. Because physical units used for each con- Moreover, in order not to consider the actual result of
trolled variable in practical system are different (for example calculation the same concept using in the resilient back-
the meter for the distance and meter per second for the veloc- propagation algorithm with sign instead of the gradient value is
ity), the cost function Jj (k) to different controlled variables used to calculate the Δyj (k)/Δu(k). That is to say, the result
yj (k) (j = 1, 2, . . . , s) at k should be defined, respectively, of Δyj (k)/Δu(k) is set to be ±1 according to the positive-
and the total cost function J(k) is the sum of all defined cost negative characteristic of yj (k) − yj (k − 1)/u(k) − u(k − 1).
functions. In such a way, we not only avoid the problem of stopping
CONG AND LIANG: PID-LIKE NEURAL NETWORK NONLINEAR ADAPTIVE CONTROL FOR MOTION CONTROL 3875
updating weights owing to the gradient decent being too small, where
but also greatly simplify the calculation and satisfy the request ⎛ ⎞⎛ ⎞
s s
of online control. Therefore, (11) can be rewritten as Δy (k)
q(k) = ⎝ ⎠⎝ e2j (k)⎠ .
j
ej (k)sgn (16)
⎛ ⎞ Δu(k)
s j=1 j=1
∂J(k) Δy (k)
≈ −⎝ ⎠ ej (k).
j
ej (k)sgn (12)
∂wij (k) j=1
Δu(k) Proof: In order to stabilize the closed-loop control system
with the PIDNNC, the Lyapunov stability theorem is used here.
According to (12), the final rule of updating weights in The Lyapunov function is defined as
hidden layer is
1 2
s
s
3 Δej (k) = el (k) η(k)
ej (k)
i=1
wij (k)
wij (k) l=1
j=1 i=1 ⎛ ⎞
−2 2
≤ η(k) ≤ 0 (15) 3 s
s
3
1 =
1 ⎝p(k) e2j (k) ⎠ η(k). (24)
wij (k) q(k) w (k)
j=1 i=1 i=1 ij j=1
3876 IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 56, NO. 10, OCTOBER 2009
e1 = −x e2 = −θ1 e3 = −θ2 .
condition, two controllers have almost the same control results [12] D. Wang and J. Huang, “Neural network-based adaptive dynamic surface
of angles. This is reasonable because angles in stable control control for a class of uncertain nonlinear systems in strict-feedback form,”
IEEE Trans. Neural Netw., vol. 16, no. 1, pp. 195–202, Jan. 2005.
region usually do not change greatly. [13] H. Su and H. T. Chong, “Induction machine condition monitoring using
neural network modeling,” IEEE Trans. Ind. Electron., vol. 54, no. 1,
pp. 241–249, Feb. 2007.
VI. C ONCLUSION [14] S. Madhusudan and S. Smriti, “Identification and control of a nonlinear
system using neural networks by extracting the system dynamics,” IETE
An adaptive PID-like neural network controller suitable for J. Res., vol. 53, no. 1, pp. 43–50, 2007.
a large class of SIMO system is proposed in this paper. Rules [15] C. J. Lin and C. H. Chen, “A compensation-based recurrent fuzzy neural
network for dynamic system identification,” Eur. J. Oper. Res., vol. 172,
of updating weights for the controller are derived. The effect no. 2, pp. 696–715, 2006.
of learning rates to stability of closed-loop system is analyzed. [16] T. J. Ren and T. C. Chen, “Motion control for a two-wheeled vehicle
The actual experiments on single and double inverted pendulum using a self-tuning PID controller,” Control Eng. Pract., vol. 16, no. 3,
pp. 365–375, Mar. 2008.
are implemented, and the results are compared with that of [17] T. Bakhtiara and S. Hara, “Regulation performance limitations for SIMO
the LQR, which demonstrates the advantages of controller linear time-invariant feedback control systems,” Automatica, vol. 44,
proposed in simple structure, easy-design, and higher accuracy no. 3, pp. 659–670, Mar. 2008.
[18] S. Cong, G. Li, and B. Ji, “A novel PID-like neural network controller,” in
response performance for multivariable SIMO systems with Proc. 16th IFAC World Congr., Prague, Czech Republic, 2005, pp. 1–6.
disturbance and uncertainty. [19] S. Cong, Y. Liang, and G. Li, “The design of multivariable adaptive
PID-like neural network controller and its design method,” Inf. Control,
vol. 35, no. 5, pp. 568–573, 2006.
R EFERENCES [20] Y. Liang, “Nonlinear adaptive control of time-varying uncertain electro-
[1] J. G. Ziegler and N. B. Nichols, “Optimum settings for automatic mechanical motion system,” Ph.D. dissertation, Univ. Sci. Technol. China,
controllers,” Trans. ASME, vol. 64, pp. 759–768, 1942. Hefei, China, 2008.
[2] M. Morari and E. Zafiriou, Robust Process Control. Englewood Cliffs,
NJ: Prentice-Hall, 1989.
[3] M. Zhuang and D. P. Atherton, “Automatic tuning of optimum PID con-
trollers,” Proc. Inst. Elect. Eng.—Control Theory Appl., vol. 140, no. 3,
S. Cong received the B.S. degree in automatic con-
pp. 216–224, May 1993.
trol from Beijing University of Aeronautics and As-
[4] W. K. Ho, C. C. Hang, and L. S. Cao, “Tuning of PID controllers based
tronautics, Beijing, China, in 1982 and the Ph.D.
on gain and phase margin specifications,” Automatica, vol. 31, no. 3,
degree in system engineering from the University of
pp. 497–502, Mar. 1995.
Rome “La Sapienza,” Rome, Italy, in 1995.
[5] M. T. Ho, “Synthesis of H-infinity PID controllers: A parametric
She is currently a Professor with the Department
approach,” Automatica, vol. 39, no. 6, pp. 1069–1075, 2003.
of Automation, University of Science and Technol-
[6] H. J. Uang and C. C. Lien, “Mixed H2 /H∞ PID tracking control design
ogy of China, Hefei, China. Her research interests
for uncertain spacecraft systems using a cerebellar model articulation
include advanced control strategies for motion con-
controller,” Proc. Inst. Elect. Eng.—Control Theory Appl., vol. 153, no. 1,
trol, fuzzy logic control, neural networks design and
pp. 1–13, Jan. 2006.
applications, robotic coordination control, and quan-
[7] G. Cheng and K. Peng, “Robust composite nonlinear feedback con-
tum system control.
trol with application to a servo positioning system,” IEEE Trans. Ind.
Electron., vol. 54, no. 2, pp. 1132–1140, Apr. 2007.
[8] D. Psaltis, A. Sideris, and A. A. Yamamura, “A multilayered neural
network controller,” IEEE Control Syst. Mag., vol. 8, no. 2, pp. 17–21,
Apr. 1988. Y. Liang received the B.S. degree in automation
[9] T. Pajchrowski and K. Zawirski, “Application of artificial neural network from Southwest University of Science and Technol-
to robust speed control of servodrive,” IEEE Trans. Ind. Electron., vol. 54, ogy, Mianyang City, China, in 2002 and the Ph.D.
no. 1, pp. 200–207, Feb. 2007. degree in control theory and control engineering
[10] S. S. Ge and F. Hong, “Adaptive neural network control of nonlinear from the University of Science and Technology of
systems with unknown time delays,” IEEE Trans. Autom. Control, vol. 48, China, Hefei, China, in 2008.
no. 11, pp. 2004–2010, Nov. 2003. He is currently with the Department of Automa-
[11] C.-H. Lu and C.-C. Tsai, “Adaptive predictive control with recurrent tion, University of Science and Technology of China.
neural network for industrial processes: An application to temperature His research interests include the neural network
control of a variable-frequency oil-cooling machine,” IEEE Trans. Ind. controller design and implementation, motion con-
Electron., vol. 55, no. 3, pp. 1366–1375, Mar. 2008. trol, and robust control for nonlinear system.