Professional Documents
Culture Documents
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICSPART B: CYBERNETICS, VOL. 35, NO. 1, FEBRUARY 2005
AbstractFault tolerant control of dynamic processes is investigated in this paper using an auto-tuning PID controller. A fault
tolerant control scheme is proposed composing an auto-tuning
PID controller based on an adaptive neural network model. The
model is trained online using the extended Kalman filter (EKF)
algorithm to learn system post-fault dynamics. Based on this model,
the PID controller adjusts its parameters to compensate the effects
of the faults, so that the control performance is recovered from
degradation. The auto-tuning algorithm for the PID controller is
derived with the Lyapunov method and therefore, the model predicted tracking error is guaranteed to converge asymptotically. The
method is applied to a simulated two-input two-output continuous
stirred tank reactor (CSTR) with various faults, which demonstrate
the applicability of the developed scheme to industrial processes.
Index TermsAdaptive NN models, auto-tuning PID, continuous stirred tank reactor (CSTR), extended Kalman filter (EKF),
fault tolerant control.
I. INTRODUCTION
ITH rapid increase in complexity of modern control systems, the importance of the fault tolerant control (FTC)
concept and technology has been realized and accepted by industry. Control system stability and reliability are not only critical for some projects where strict safety conditions apply, e.g.,
nuclear power stations and passenger airplanes, but also essential for large-scale productions, since most of modern industrial
plants are complex and often include a number of subsystems
which may compensate for the effects of sensor faults and component malfunction. This requires solutions that are very costly
in both hardware and development effort. Therefore, FTC is
very important from the viewpoint of safety, as well as reduced
production costs. FTC offers the capability to avoid inadvertent
process shut downs from simple faults, e.g. in instrumentation
and control loops that could develop into production stoppages
or plant failures.
Currently, FTC in most real industrial systems are realized by
hardware redundancy. For example, the majority-voting scheme
is used with redundant sensors to cope with sensor faults [1].
However, due to two main limitations of the hardware redundancy, high cost, and taking more space, solutions using analytical redundancy [1] have been investigated over the last two
Manuscript received February 7, 2004; revised June 3, 2004. This paper was
recommended by Associate Editor S. Phoha.
D.-L. Yu is with the Control Systems Research Group, School of Engineering,
Liverpool John Moores University, Liverpool, L3 3AF, U.K.
T. K. Chang is with the Control Systems Research Group, School of Engineering, Liverpool John Moores University, Liverpool, L3 3AF, U.K.
D.-W. Yu is with the Department of Automation, Northeast University, Qinhuangdao, China (e-mail: D.Yu@livjm.ac.uk).
Digital Object Identifier 10.1109/TSMCB.2004.839247
decades. There are generally two different approaches using analytical redundancy: 1) passive approaches, and 2) active approaches. Passive approaches use robust control techniques to
design closed-loop systems so that it is insensitive to certain
faults, e.g., [2]. Recently, an elegant design method of passive
approach was proposed by Chen et al. [3], in which the linear
matrix inequality (LMI) method was used to synthesis the reliable controller. Different faults were formulated as constraints
in the method and were considered in the optimal design using
LMI. While the design example showed the stability and maintenance of satisfactory system performance, a limitation is that
the method is based on an accurate linear state space model and
therefore, is not capable of controlling nonlinear processes for
which an accurate analytical model is usually unavailable. In addition, because the passive approaches consider fault tolerance
in only the stage of controller design without taking adaptation
when faults occur, the amplitude of the faults that can be tolerable is usually limited.
Active approaches use online fault accommodation information and reconfigurable controllers. When a fault is detected
using analytical or hardware redundancy, the controller is
reconfigured to guarantee the post-fault stability and maintain
acceptable performance. Active FTC has been investigated
using different methods including the feedback linearization
[4], control law rescheduling [5], and model following control
[6]. Reconfigurable control against plant component faults
has been studied using state feedback, where the feedback
gain matrix was designed using linear quadratic regulation
method [7], the pseudo inverse method [8] and eigenstructure
assignment method [9]. However, these studies were also based
on linear models so that they are not suitable for nonlinear
processes. Model predictive control (MPC) has been employed
in FTC [10], [11], where an adjustable objective function was
optimized based on a simple linear model. The model was
expected to learn the post-fault dynamics if the amplitude of
the fault is not too big. The research utilizing this method is
active and the models used are extended to nonlinear models.
More recently, it was reported that neural networks (NN) had
been employed to tackle FTC problems for nonlinear systems.
A pseudo-linear NN was used in a FTC system to form an additional feedback loop, which was used to compensate for the
degradation of system performance caused by component faults
[12]. A neural network was used in this method as a compensator that was adapted with a modified gradient-descent algorithm. The method guarantees the overall system stability for
linear systems, while it is still an open problem to ensure global
system stability for nonlinear systems. Besides, slow convergence of gradient-descent algorithms may result in instability
33
method by strm and Hagglund [20]. For the early multivariable PID control, Koivo and Tanttu [32] gave a survey for its
tuning techniques. These techniques mainly aim at decoupling
the plant at certain frequencies.
Methods based on online parameter estimation have also
been proposed for the automatic tuning of PID regulators.
Some authors proposed auto-tuning regulators based on minimum variance, pole placement, or linear quadratic Gaussian
(LQG) design methods. Gawthrop, Radke, [33], and Isermann
[34] proposed auto-tuning PID using adaptive parameter estimation methods. Hang and his co-workers have proposed
auto-tuning PID regulators using different methods, including
a knowledge-based PID auto-tuner [35]. Based on the method
given in Nishikawa, et al. [36], Ruano, et al. [37] proposed a
connectionist approach to PID auto-tuning, which used integral
measures of the step response as the input to neural networks to
determine the required PID parameter values. However, most
of these methods are for SISO systems only.
In this paper, a new FTC approach for unknown nonlinear
dynamic systems is proposed. The approach combines an
auto-tuning PID control with an adaptive neural network
model. The neural network model is adapted to learn post-fault
dynamics when faults occur, while the PID controller tune its
parameters online based on the adapted model to compensate
the effects of the faults on the system stability and performance.
The multilayer perceptron (MLP) network is adopted because
this kind of network generally does not need structure tuning
as that for the Gaussian RBF networks. The EKF algorithm is
used to update the network weights online as this algorithm
can learn the process dynamics much faster than the steepest
descent method. The auto-tuning algorithm compute an optimal
control variable based on the adaptive model to minimize the
sum-squared error function of the predicted system tracking
error. The Lyapunuv method is used to derive the optimal
control so that the convergence of the predicted tracking error
is guaranteed. Consequently, the stability and the tracking
performance of the entire system are maintained after occurring
of the faults.
The proposed FTC scheme is evaluated by a simulated MIMO
CSTR process as an unknown system for its performance and
stability under various simulated actuator and component faults.
Both abrupt faults and incipient faults are tested. The rest of the
paper is arranged as follows. Section II presents the configuration of the FTC scheme. The adaptive NN model and its online
learning algorithm is described in Section III. The auto-tuning
PID controller is described in Section IV. Section V presents the
evaluation of the adaptive model, while Section VI describes the
application of the proposed method to the CSTR and shows the
simulation results. Finally, some conclusions are drawn in Section VII.
II. CONFIGURATION OF FTC SCHEME
The main objective of the FTC is to obtain a control variable to drive the process to track the desired trajectory when
the process is not subject to any fault, while to maintain the
system stability and to recover from the performance degradation as much and as fast as possible when fault occurs in the
34
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICSPART B: CYBERNETICS, VOL. 35, NO. 1, FEBRUARY 2005
Fig. 1.
where
is the weight matrix connecting hidden
is the hidden layer output
layer and output layer,
vector in which each entry is transformed from the correby a nonlinear activation function
.
sponding entry in
The sigmoid activation function is chosen in this study
(4)
(5)
where
is the weight matrix connecting the
with
input layer and hidden layer,
is the network input vector with an extra unity input
augmented to introduce bias, and is defined by
(6)
Therefore,
is expressed as
A. NN Model
(7)
with
..
.
..
.
(8)
(1)
and
are the sampled process input and
where
output vectors,
and
are the input order and output order
respectively, denotes input transmission delay, is measuredenotes a vector valued nonlinear function.
ment noise and
The MLP network model used in this work is represented by the
following:
where
is the
row vector in
and
is the
row
. In applying the EKF algorithm for model paramvector in
eter estimation, the following equations are formulated to describe the model
(9)
(10)
where
(2)
(11)
35
where
(12)
(13)
(14)
where
(21)
The second term,
, can be expressed as
(15)
is the model output with the weights estimated at the last sample
time, and
(16)
..
.
(17)
(18)
model output
..
..
.
(22)
(19)
The first term in the matrix in right-hand side of (19),
, can be expressed as
..
.
..
..
.
..
..
.
..
.
..
.
..
(23)
..
.
where
(20)
(24)
36
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICSPART B: CYBERNETICS, VOL. 35, NO. 1, FEBRUARY 2005
with
being the desired trajectory. is the sampling
,
and
are PID
time,
controller parameter matrices, and is the number of input. In
order to estimate the optimum PID parameters,
,
and
, a parameter vector,
is formulated
as
..
.
is positive def-
(31)
in (30). In discrete time operation,
for the calculation of
can be approximated by the derivative of
with
respect to . Thus
(25)
(32)
is ex-
(26)
is the gain matrix,
is the NN model
where
tracking error that is different from the process tracking error,
(33)
(27)
(28)
We design the gain matrix in (26) as
is the desired output at sample time and
is
the NN model output in iterative step within the sample peis a function of the predicted optimal control variable,
riod.
(34)
(29)
Then,
in (34) becomes
(35)
with
(36)
..
.
..
.
..
..
..
.
37
for
..
.
..
.
..
Fig. 2.
..
.
CSTR process.
..
..
.
(37)
where
..
.
..
..
.
(38)
To evaluate the learning performance of the developed adaptive network, the network is used to model a multivariable, nonlinear CSTR process.
The CSTR process used in this research is a typical nonlinear
dynamic process used in chemical and biochemical industry.
The process will also be used to evaluate the developed FTC
scheme. A second order endothermic chemical reaction 2A
B takes place in the tank. The schematic diagram of the process
is shown in Fig. 2.
The process works in the following way. The reactant A with
flows
constant concentration and variable temperature
into the tank with the flow rate
. An endothermic chemical
reaction takes place in the tank, which is based on the temperature, and the reaction absorbs heat energy. As a result, the reaction influences the temperature and concentration of outflow
liquid. The two inputs and two outputs are chosen as follows:
with
(39)
(40)
Equation (39) is a mass balance equation for chemicals while
(40) is a heat energy balance equation. The operating ranges of
the input variables for the process are
Kelvin
(41)
38
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICSPART B: CYBERNETICS, VOL. 35, NO. 1, FEBRUARY 2005
can be seen that the process is MIMO, nonlinear and with strong
coupling among variables. The CSTR process is simulated as
an unknown nonlinear dynamic process in the research to evaluate the performance of the proposed adaptive NN model. Although the model of the CSTR process used in the simulation
is known, it is supposed that the process model is unknown and
these known parameters have not been used in the modeling and
control.
In practice, measurement noise is inevitable in process
control. Therefore, the white noise of zero mean and unity
variance with appropriate amplitude is superimposed on the
process output. The simulated CSTR process was used to
generate two input-output data sets when two different random
amplitude binary sequences were used as excitation signals.
One set containing 5000 samples is for network learning and
the other containing 1200 samples for network validation. The
,
sample period was chosen as 30 s. In this simulation,
,
and 10 hidden layer nodes are found to be most
appropriate for the MLP model. Hence, the MLP model has the
following structure:
where the subscript (8:10:2) denotes that the network has eight
input nodes, ten hidden layer nodes, and two output nodes. The
MLP model is trained in the online mode with the EKF algorithm using the training data set in the same way as that in online updating displayed in Fig. 3.
The initial value of vector is assigned to small random
and
values. The initial values of the parameters of EKF,
are assigned an identity matrix and a zero matrix respectively. All the initial values of the Jacobian matrix in the
learning calculation are assigned to be zero. After training the
model is evaluated using the test data set for one-step-ahead prediction, during which the online learning of the model is still
conducted after prediction. The validation results are shown in
Fig. 4.
As the error is very small in Fig. 4 so that the process output
and the model output are almost virtually identical. To numerically evaluate the modeling performance, an error index of the
normalized mean absolute modeling error (NMAE) as shown
below is used:
NMAE
(42)
,
].
The NMAE of Fig. 4 is [
To demonstrate the online learning ability of the developed
adaptive NN model, another data set is generated when a mal-
Fig. 4.
Fig. 6.
39
Fig. 9.
40
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICSPART B: CYBERNETICS, VOL. 35, NO. 1, FEBRUARY 2005
Fig. 10.
q ).
Fig. 11.
1q ).
for normal operation without faults are also generated and displayed in Fig. 11 for comparison.
We can see in Fig. 11 that the control variables change quickly
with a big amplitude to respond to the fault occurrence. In responding, the controller output changes and has a steady state
difference from that for no fault condition. This is because of the
step change of of 40%.
Fig. 12 shows the process tracking response to a 30% ramp
. Because the ramp fault is a fault with its amactuator fault
plitude increased gradually, the process tracking response has a
very small tracking error. The associated controller outputs to
Fig. 12 are displayed in Fig. 13.
is simulated on the CSTR process
The ramp fault
starting from sample instant
, therefore the predicted
control signal for 30% change of
actuator fault is gradually diverging away from the control signal under no fault
condition. The process tracking response show that the control
performance and stability are maintained after 40% change of
and 30% change of . The adaptive MLP model is able to
learn the post-fault dynamics quickly with the EKF algorithm.
The proposed FTC with auto-tuning PID controller drives the
CSTR process output back to the desired process trajectory
quickly when a fault occurs. The tracking error index in Fig. 12
,
].
is [
C. Case 3: With Component Faults
, and
, are simulated on the
Two component faults,
process, where
is simulated for the outflow pipe restriction
Fig. 12.
T ).
Fig. 13.
1T ).
41
Fig. 14. Process response under auto-tuning PID control (with abrupt fault,
R ).
Fig. 17.
1c ).
TABLE I
PROCESS TRACKING ERROR INDICES IN DIFFERENT FAULT CONDITIONS
Fig. 15.
1R
).
Fig. 16.
c ).
42
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICSPART B: CYBERNETICS, VOL. 35, NO. 1, FEBRUARY 2005
[21] C. C. Hang, K. J. Astrom, and W. K. Ho, Relay auto-tuning in the presence of static load disturbance, Automatica, vol. 29, no. 2, pp. 563564,
1993.
[22] J. H. Park, S. W. Sung, and I. B. Lee, Improved relay auto-tuning with
static load disturbance, Automatica, vol. 33, no. 4, pp. 711715, 1997.
[23] W. K. Ho, Y. Hong, A. Hansson, H. Hjalmarsson, and J. W. Deng, Relay
auto-tuning of PID controllers using iterative feedback tuning, Automatica, vol. 39, no. 1, pp. 149157, 2003.
[24] C. C. Hang, K. J. Astrom, and Q. G. Wang, Relay feedback auto-tuning
of process controllersa tutorial review, J. Process Control, vol. 12, no.
1, pp. 143162, 2002.
[25] W. K. Ho, C. C. Hang, and L. S. Cao, Tuning of PID controllers based
on gain and phase margin specifications, Automatica, vol. 31, no. 3, pp.
497502, 1995.
[26] W. D. Chang, R. C. Hwang, and J. G. Hsieh, An auto-tuning PID control for a class of nonlinear systems based on Lyapunov approach, J.
Process Control, vol. 12, pp. 233242, 2002.
[27] E. J. Davidson, Multivariable tuning regulators: the feedforward and robust control of general servomechanism problem, IEEE Trans. Autom.
Control, vol. 21, no. 1, pp. 3547, Feb. 1976.
[28] J. Pentinnen and H. N. Koivo, Multivariable tuning regulators for unknown systems, Automatica, vol. 16, pp. 393398, 1980.
[29] P. Zgorzelski, H. Unbehauen, and A. Niederlinski, A new simple decentralized adaptive multivariable regulator and its application to multivariable plants, in Proc. 11th IFAC World Congr., Tallin, Estonia, 1990,
pp. 226231.
[30] A. P. Loh, C. C. Hang, C. K. Quek, and V. U. Vasnani, Auto-tuning of
multi-loop proportional-integral controllers using relay feedback, Ind.
Emg. Chem. Proc. Des. Dev., vol. 25, pp. 654660, 1993.
[31] M. Zhuang and D. P. Atherton, PID controller design for a TITO
system, Inst. Elect. Eng.Proc. Pt. D: Control Theory Applications,
vol. 141, pp. 111120, 1994.
[32] H. N. Koivo and J. T. Tanttu, Tuning of PID controllersurvey of SISO
and MIMO, in Proc. Preprints IFAC Int. Symp. Intelligent Tuning Adaptive Control, Singapore, 1991.
[33] P. J. Gawthrop, Self-tuning PID controller: algorithms and implementation, IEEE Trans. Automatic Control, vol. 31, no. 3, pp. 201209, Mar.
1986.
[34] F. Radke and R. Isermann, A parameter adaptive PID controller with
stepwise parameter optimization, Automatica, vol. 23, no. 4, pp.
449457, 1987.
[35] T. H. Lee, C. C. Hang, W. K. Ho, and P. K. Yue, Implementation of
a knowledge-based PID auto-tuner, Automatica, vol. 29, no. 4, pp.
11071113, 1993.
[36] Y. Nishikawa, N. Sannomya, T. Ohta, and H. Tanaka, A method for
auto-tuning of PID parameters, Automatica, vol. 20, no. 3, pp. 321332,
1984.
[37] A. E. B. Ruano, P. J. Fleming, and D. I. Jones, Connectionist approach
to PID auto-tuning, Instit. Elect. Eng.Proc. Part D: Control Theory
Applications, vol. 139, no. 3, pp. 279285, 1992.
[38] Y. Iiguni, H. Sakai, and H. Tokumaru, A real-time algorithm for a
multi-layered neural network based on the extended Kalman filter,
IEEE Trans. Signal Process., vol. 40, no. 4, pp. 959966, Apr. 1992.
[39] L. Ljung and T. Soderstrom, Theory and Practice of Recursive Identification. Cambridge, MA: MIT Press, 1983.
43
Ding-Wen Yu received the B.Eng. and M.Sc. degrees from Beijing Chemical
Engineering Institute (BCEI), China, in 1982 and 1987, respectively, and the
Ph.D. degree from Nottingham Trent University, Nottingham, U.K., in 2000, all
in control engineering.
He was a Lecturer at BCEI from 1987 to 1994, before he came to University
of Salford as a Visiting Researcher in 1994. He then worked at Liverpool John
Moores University, Liverpool, U.K., as a Postdoctoral Research Fellow between
2001 and 2003. Then, he joined the Northeast University, Qinhuangdao, China,
in 2004 as a Professor. His current research interests include modeling and dynamic simulations of chemical processes, model predictive control of industrial
systems, and neural network applications in control and signal processing.