for Systems and Robotics (ISR), Instituto Superior Técnico (IST), Lisbon, Portugal
SUMMARY We overview recent progress in the field of robust adaptive control, with special emphasis on methodologies that use multiple-model architectures. We argue that the selection of the number of models, estimators and compensators in such architectures must be based on a precise definition of the robust performance requirements. We illustrate some of the concepts and outstanding issues by presenting a new methodology that blends robust nonadaptive mixed μ-synthesis designs and stochastic hypothesis-testing concepts, leading to the so-called Robust Multiple Model Adaptive Control (RMMAC) architecture. A numerical example is used to illustrate the RMMAC design methodology, as well as its strengths and potential shortcomings. The latter motivated us to develop a variant architecture, denoted RMMAC/XI, that can be effectively used in highly uncertain exogenous plant disturbance environments. Copyright © 2006 John Wiley & Sons, Ltd.
key words: Multiple model adaptive control; adaptive control; robust control; robust adaptive control; robust μ-synthesis
1. INTRODUCTION
The solution to the so-called adaptive control problem is akin to the elusive search for the Holy Grail in the context of feedback control system design. In spite of forty years of research, several books and hundreds of articles, we still lack, in our view, a universally accepted design methodology for adaptive control that is based on sound theoretical foundations and is suitable for engineering implementations in real-life control systems. In this paper we overview some recent progress in adaptive designs that employ multiple models. Currently available results seem to
An earlier version of this paper was based on a plenary talk by M. Athans and published in the Proceedings of the 2005 IFAC World Congress, Prague, Czech Republic, 2005 [1]. This work is based upon S. Fekri's doctoral thesis [2] and was supported in part by the Portuguese FCT POSI programme under framework QCA III and by project MAYA-Sub of the AdI.
be very promising, but still require a great deal of theoretical and pragmatic research to arrive at the Holy Grail of adaptive control. Thus, in our view, the solution to the adaptive control problem is still not available.
We first discuss a general philosophy for designing robust adaptive multivariable feedback control systems for linear time-invariant (LTI) plants that include both unmodeled dynamics and uncertain real parameters in the plant state-space description. The adjective adaptive refers to the fact that the real-parameter uncertainty and the performance requirements call for a feedback architecture with better performance, and greater complexity, than that of the best possible fixed non-adaptive controller. The word robust refers to the desire that the adaptive control system remains stable and also meets the posed performance specifications for all possible legal parameter values and unmodeled dynamics.
In order to place our remarks in a proper perspective and motivate the development of our new RMMAC method, we pose a set of basic engineering questions that naturally arise when we deal with adaptive control:
(1). What do we gain by using adaptive control?
(2). How do we fairly predict and compare performance improvements (if any) of a proposed adaptive design vis-à-vis the best nonadaptive one?
(3). How do we design adaptive controllers with guaranteed robust-stability and robust-performance in the presence of unmodeled dynamics and unmeasurable plant disturbances and sensor noises?
(4). Is the increased complexity of an adaptive controller justified by the performance improvement? What should be the level of complexity for designer-specified adaptive system performance guarantees?
Even though there is no precise, universally accepted definition of adaptive control, the above questions are (or should be) at the heart of adaptive control research; they have motivated the perspective adopted in this paper.
It is important to stress at this point that the vast majority of approaches to adaptive control deal with the case of constant uncertain real parameters. Furthermore, they invariably focus upon stability without explicit quantitative specifications for the desired adaptive system performance. However, from an engineering perspective, the true value of an adaptive system can only be judged by its performance when the uncertain real parameters change slowly with time, within their predefined limits. Thus, one designs and tests an adaptive system for constant real parameters, using whatever theoretical approaches have been developed, but it should also be tested, and its performance evaluated, for time-varying parameters.
The currently accepted concept of a robust (nonadaptive) feedback control system for linear time-invariant (LTI) plants is that the designed compensator must guarantee (if possible) closed-loop stability and also meet the posed performance specifications, most often reflecting superior disturbance-rejection. This attribute is often referred to as stability- and performance-robustness. The physical plant is assumed to belong to a legal family of possible plants, where the nominal plant, together with frequency-dependent upper bounds on unmodeled dynamics and upper and lower bounds on key uncertain real parameters, defines this legal family of plants. The performance specifications are explicitly stated in the frequency domain; they typically require superior disturbance-rejection in the lower frequency region while safeguarding against excessive control action at higher frequencies. Such robust nonadaptive compensators can be designed, more or less in a routine manner, using the
so-called mixed-μ synthesis methodology and associated Matlab software. At present, robust synthesis cannot deal with slowly-varying parameters from a theoretical perspective, although some recent research in Linear Parameter Varying (LPV) systems shows some promise.
It is highly desirable that the above attributes of nonadaptive robust feedback systems also be reflected in the design of robust adaptive controllers. Thus, in our view, an adaptive control design must explicitly yield stability- and performance-robustness guarantees, not just stability (which has been the central focus of almost all adaptive control methodologies).
1.1. Brief Historical Perspective
Early approaches to adaptive control, such as the model-reference adaptive control (MRAC) method and its variants, were concerned with real-time parameter identification and simultaneous adjustment of the loop-gain. Representative references are [3, 4, 5, 6, 7]. In the classical MRAC method the emphasis was on proving global convergence to the uncertain real parameter, while using deterministic Lyapunov (hyperstability) arguments for inferring closed-loop stability. However, the assumptions required for stability and convergence (such as relative degree, positive-realness, knowledge of the high-frequency gain) did not include the presence of unmodeled dynamics, unmeasurable disturbances and sensor noise. Moreover, no explicit and quantifiable performance requirement was posed for the adaptive system; rather, the goodness of the MRAC design was judged by the nature of the command-following error based upon simulations. It turned out that classical MRAC systems can become unstable in the presence of plant disturbances, sensor noise and high-frequency unmodeled dynamics [8]. In [9] averaging analysis was used to partially overcome the problems identified in [8].
Moreover, the MRAC methodology was limited to single-input single-output (SISO) plants; attempts to extend the MRAC methodology to the multi-input multi-output (MIMO) case were extremely cumbersome. Because of these shortcomings, we shall not further address the MRAC methodology in the sequel.
Later, and more recent, approaches to the adaptive control problem involved multiple-model techniques which, in principle, are also applicable to the MIMO case. The (large) real-parameter uncertainty set was subdivided into smaller parameter subsets; each parameter subset gives rise to a different plant model set with reduced real-parameter uncertainty. One then designed a set of control gains or dynamic compensators for each model set so that, if indeed the true parameter was close to a specific model, then a satisfactory performance was obtained. In most of the multiple-model approaches, the identification of the most likely model is carried out by a supervisor which switches different controllers into the feedback loop, based primarily on deterministic concepts [10, 11, 12, 13, 14, 15, 16, 17, 18]. These proofs and results were presented for the case of SISO systems. The second approach relied upon stochastic designs that generated on-line posterior probabilities reflecting which of the models is more likely. In the latter approach the controllers could be designed either by classical LQG methods [19, 20, 21, 22, 23, 24, 25, 26] or by more sophisticated RMMAC methods [27, 28, 29, 2] which can deal with MIMO designs. A philosophically different, more direct approach, called unfalsified control, also merits attention [30, 31, 32, 33, 34, 35] and we shall briefly discuss it in the sequel. Finally, we do not discuss numerous approaches to adaptive control utilizing intelligent methods, such as neuro-fuzzy designs, since they are devoid of any analytical insights.
In all the proposed multiple-model adaptive methods the complexity of the adaptive feedback system directly depends on the number of models employed, N. By decreasing the size of the parametric subsets one obtains more models. Thus, all multiple-model approaches must address the following: (a) how to divide the initial large parameter uncertainty set into N smaller parameter subsets, (b) how to determine the size or boundary of each parameter subset, and (c) how large should N be? Presumably the larger the N, the better the performance of the adaptive system should be.
Although not commonly considered as an adaptive design, the widely used engineering designs utilizing gain-scheduling can be viewed as multiple-model methods. Gain scheduling is actually used to control nonlinear systems, such as aircraft and jet engines, where an exogenous measured variable, such as dynamic pressure, defines a family of LTI models over the operating envelope [36, 37, 38, 39, 40, 41]. A set of control gains is defined for each LTI model. The measured dynamic pressure (or equivalent) is used to interpolate between the control gains. The basic difference between the adaptive multiple-model approaches discussed above and the gain-scheduled methods is the fact that in adaptive multiple-model approaches there is no externally measured variable to accomplish the equivalent of gain-scheduling. Rather, the information of which model is most likely (and what controller to use) must be obtained from organic plant measurements.
1.2. The RMMAC Design Philosophy
In this paper we shall stress the use of robust performance requirements on the adaptive system implemented by one of the available multiple-model methods. We follow our recent research on Robust Multiple-Model Adaptive Control (RMMAC), [1, 42, 27, 28, 29, 2], which will be discussed and evaluated in much more detail in the sequel.
If we turn our attention to the non-adaptive literature, there exists a well-documented design methodology, and associated Matlab design software, for linear time-invariant multivariable plants (both SISO and MIMO) that addresses simultaneously both robust-stability and robust-performance in the presence of unmodeled dynamics and parametric uncertainty as well as unmeasurable plant disturbances and sensor noise. This methodology, pioneered by J.C. Doyle and his colleagues, is often called the mixed-μ design method [43, 44, 45, 46, 47, 48, 49, 50]. We assume that the reader is familiar with this robust design methodology and associated software. The mixed-μ design method incorporates the state-of-the-art in non-adaptive multivariable robust control synthesis and exploits the proper use of frequency-domain weights to quantify desired performance. Typically, using the mixed-μ design method, one finds that as the size of the parametric uncertainty is reduced, the guaranteed desired performance, say superior disturbance-rejection, increases.
Unfortunately, very little has been done in integrating the non-adaptive mixed-μ design methodology with that of robust adaptive control studies, even though it should be apparent that the mixed-μ design method should provide us guidance on the selection and number, N, of the models to be used in any multiple-model adaptive control scheme. Notable exceptions are [32, 51, 16, 27, 28, 29, 2].
We now summarize our design philosophy regarding adaptive control designs that employ multiple models. We assume that:
(1). Independent of the size of uncertainty for the plant real parameter(s), the plant always contains unmodeled dynamics whose size must be bounded a priori only in the frequency
domain. Therefore, the adaptive design must explicitly reflect these frequency-domain bounds on the unmodeled dynamics. The presence of unmodeled dynamics immediately brings into sharp focus the fact that we must use the state-of-the-art in nonadaptive robust control synthesis, i.e. mixed-μ synthesis [43, 44, 45, 46, 47, 50], and associated Matlab software [48, 49].
(2). The plant is subject to unmeasurable plant disturbances whose impact upon the chosen performance variables (error signals) must be minimized, i.e. we must have superior disturbance-rejection. The modern trend is to use frequency-dependent weights to emphasize and define superior disturbance-rejection performance. This design objective can also be accommodated by the mixed-μ design methodology.
(3). The plant measurements are not perfect; thus sensor measurements are corrupted by unmeasurable sensor noise. The performance variables must be insensitive, to the degree possible, to such sensor noise.
(4). Performance requirements must be explicitly defined, up to constants whose values can be optimized for superior performance. In the mixed-μ design methodology these disturbance-rejection performance requirements are explicitly quantified by frequency-domain weights typically involving the selected error signals and the control variables.
(5). Given the information in (1) to (4), we can design what we call the best global nonadaptive robust compensator (GNARC) for the entire (large) uncertain real-parameter set, taking into account both the unmodeled dynamics and the performance requirements. This non-adaptive feedback design must be optimized so as to yield the best possible performance, i.e. superior disturbance-rejection with reasonable control effort. The GNARC design then provides a yardstick (lower-bound) for performance, so that any performance improvements by more complex adaptive designs can be quantified. The GNARC is designed using the mixed-μ synthesis methodology.
(6).
An upper-bound for adaptive performance can be obtained by optimizing the performance under the assumption that the real-parameter values are known exactly, but still reflecting the presence of complex-valued unmodeled dynamics and frequency-dependent performance requirements. This implies that we compute, for a large number of grid points in the original parameter uncertainty set, what we call a fixed non-adaptive robust compensator (FNARC) which defines the best possible performance for each parameter value. The FNARC design is carried out using the complex-μ synthesis methodology, since we still must take into account the unmodeled dynamics and frequency-dependent performance specifications. We use the same quantitative performance requirements as in part (5) above. The set of the FNARCs corresponds to having an infinite number of models in the multiple-model implementation.
The difference between the lower-bound on performance defined by the GNARC in part (5) and the FNARC upper-bound from part (6) provides a valuable quantitative decision aid to the designer on what performance improvements are possible by some multiple-model adaptive control method. The designer must then make a quantifiable choice on the degree of performance improvement that he/she desires from the adaptive system. As we shall show in the sequel, this approach will then define the number of models (parameter subsets) required, N, and their numerical specification (boundary of parameter subsets) in a natural manner.
One possible approach is that the designer demands that the adaptive performance equals or exceeds a certain percentage, say 75%, of the (best possible) FNARC performance for each parameter value. Another possible approach is to demand that the adaptive system yield a performance that equals a certain multiple, say 10, of the (lower-bound) GNARC performance,
if possible (there may be inherent limitations due to non-minimum phase zeros and/or unstable poles) and consistent with the FNARC upper-bound. Yet another approach is to fix the number N of the models, i.e. specify the complexity of a multiple-model scheme, and maximize the performance for each model. Many other approaches are also possible which use both the GNARC lower-bound and the FNARC upper-bound for performance.
By following the above performance-driven methodology one directly arrives at the number, N, of required models in the adaptive multiple-model system, as well as the quantification of each model. The robust synthesis method requires that each model be represented by a parameter subset which must be a hyperparallelepiped. In general, the more stringent the performance requirements on any adaptive implementation (consistent, of course, with the FNARC upper-bound), the larger the number of models and the greater the complexity of the multiple-model adaptive system. We stress that such a systematic definition of the required models and their numerical specification would not be possible if we did not explicitly pose the performance specifications and optimize performance to the extent possible.
The procedure summarized above can be used with any of the adaptive multiple-model methods. We shall illustrate its detailed design and properties by using the multiple-model method in the context of dynamic hypothesis-testing, which involves generating the posterior probability for each model, the so-called Robust Multiple-Model Adaptive Control (RMMAC) architecture.
However, it can also be used in conjunction with the switching supervisory controllers developed by Morse, Anderson, Hespanha and others, as well as with the unfalsified-control approaches (although in the unfalsified-control methods one does not assume any models for the plant to be controlled, no lower and upper bounds on the real uncertain parameters, no frequency-domain bounds on unmodeled dynamics, nor explicit performance requirements in the sense of (4)-(6) above). The important point to remember is that all multiple-model adaptive schemes require the definition of the minimum number of models required to achieve both robust-stability and robust-performance, and these can only be defined after we pose realistic performance requirements for the adaptive system as discussed above.
In Section 2 we present an overview of four different multiple-model architectures for adaptive estimation, identification and control. In Section 3 we discuss the designs of the robust dynamic compensators, such as the GNARC and FNARC discussed above, and how to determine the number N of models in the multiple-model architectures, as well as the collection of the compensators. In Section 4 we focus upon the identification aspect of the RMMAC. In Section 5 we define a variant of the RMMAC architecture, denoted by RMMAC/XI. The RMMAC/XI was developed because of possible performance degradation of the standard RMMAC when the plant disturbances had very wide variability [2]. We shall demonstrate that such performance degradations are eliminated by the RMMAC/XI architecture. In Section 6 we present numerous results and simulations using the RMMAC and RMMAC/XI architectures. These results provide a concrete illustration of our design philosophy as well as several performance evaluations and comparisons of the RMMAC design, even when we violate the theoretical assumptions to a significant degree.
Section 7 discusses the commonalities and differences of the supervisory switching multiple-model adaptive control (SMMAC) and RMMAC architectures, and we summarize our conclusions in Section 8.
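As a rough illustration of the performance-driven choice of N outlined in Section 1.2, the following sketch partitions a scalar parameter interval into subsets such that the guaranteed performance on each subset reaches a chosen fraction (75% here) of the FNARC upper bound. The functions `fnarc_perf` and `subset_perf` are hypothetical stand-ins for the complex-μ and mixed-μ syntheses, which are not reproduced here; their analytic forms, the parameter range and the grid size are invented purely so that the example runs.

```python
def fnarc_perf(p):
    """Hypothetical FNARC upper bound: best performance if p were known exactly.
    Assumed (for this sketch only) to decrease monotonically with p."""
    return 100.0 / (1.0 + p)


def subset_perf(lo, hi):
    """Hypothetical guaranteed performance of one robust design over [lo, hi].
    Stand-in for a mixed-mu design: performance drops as the subset widens."""
    return fnarc_perf(hi) / (1.0 + 2.0 * (hi - lo))


def meets(lo, hi, fraction):
    """Requirement: subset performance >= fraction of the FNARC bound at every
    p in [lo, hi]; the bound is largest at lo because fnarc_perf is decreasing."""
    return subset_perf(lo, hi) >= fraction * fnarc_perf(lo)


def partition(p_min, p_max, fraction=0.75, n_grid=400):
    """Greedy sweep from p_max down to p_min on a uniform grid.
    Returns a list of (lo, hi) parameter subsets; its length plays the role of N."""
    h = (p_max - p_min) / n_grid
    grid = [p_min + i * h for i in range(n_grid + 1)]
    subsets = []
    top = n_grid
    while top > 0:
        bottom = top - 1  # every subset spans at least one grid cell
        # Widen the subset downwards while the requirement still holds.
        while bottom > 0 and meets(grid[bottom - 1], grid[top], fraction):
            bottom -= 1
        subsets.append((grid[bottom], grid[top]))
        top = bottom
    subsets.reverse()
    return subsets


models = partition(0.0, 4.0, fraction=0.75)
print("number of models N =", len(models))
```

With these invented performance functions, tightening the requirement (for example `fraction=0.9`) yields more, narrower subsets, which mirrors the qualitative trade-off between performance demands and complexity described above.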
2. MULTIPLE-MODEL ARCHITECTURES IN ADAPTIVE CONTROL
In this section we shall discuss the architectures that utilize multiple models. We follow a historical development that traces the concepts over the past 40 years or so. First we shall discuss the architecture associated with multiple-model adaptive estimation (MMAE). Second, we discuss the early extension of the MMAE concepts to classical multiple-model adaptive control (CMMAC). Third, we present architectures associated with SMMAC. Next, we discuss the architecture associated with robust multiple-model adaptive control (RMMAC). Finally, we comment briefly on the unfalsified control concept.
2.1. Multiple-Model Adaptive Estimation (MMAE)
One of the earliest uses of multiple models was motivated by the need for accurate stochastic state-estimation for dynamic systems subject to significant parameter uncertainty. In many such applications the estimation accuracy provided by standard Kalman filters was not adequate. For some early references regarding MMAE see [52, 53, 54, 55, 56]. We remark that the MMAE algorithms were also referred to as partitioned estimation, especially in the research by Lainiotis. We also remark that a similar adaptive estimation architecture, the so-called Sum of Gaussians method, utilizing a bank of extended Kalman filters, was used for nonlinear filtering problems [57, 58, 59].
Fig. 1 shows the architecture of the MMAE system. It is assumed that a discrete-time linear time-invariant plant is driven by white process noise, as well as a known deterministic input signal, and generates measurements that are corrupted by white measurement noise. If there is no parameter uncertainty in the plant, then the Kalman filter (KF) is the optimal state-estimation algorithm in a well-defined sense; see, for example, [60, 57]. Moreover, under the usual linear-gaussian assumptions, the KF state-estimate is the true conditional mean of the state, given the past controls and observations.
If the plant has an uncertain real-parameter vector, $p$, one can imagine that it is close to one of the elements of a finite discrete parameter set, $P_D = \{p_1, p_2, \ldots, p_N\}$. One can then design a bank of standard Kalman filters (KFs) where each KF uses one of the discrete parameters $p_k$ in its implementation, $k = 1, 2, \ldots, N$. It turns out that, if indeed the true plant parameter is identical to one of its discrete values, and this is modeled by the hypothesis $H = H_k$, then the conditional probability density of the state is the sum of gaussian densities. Then, the MMAE of Fig. 1 will indeed generate the true conditional mean of the state and one can calculate the true conditional covariance matrix; see, for example, [54]; [57, Chap. 10]. Appendix I summarizes the notation and formulas associated with the MMAE algorithm for the discrete-time case.
From a technical point of view the MMAE system of Fig. 1 blends optimal estimation concepts (i.e. Kalman filtering) and dynamic hypothesis-testing concepts that lead to a system identification algorithm. As explained in Appendix I, each KF generates a local state estimate, $\hat{x}_k(t|t)$, and a residual (or innovations) signal, $r_k(t)$, which is the difference between the actual measurement and the predicted measurement (the residual is precisely the prediction error common to all adaptive systems). Furthermore, the (steady-state) residual covariance matrix, $S_k$, $k = 1, 2, \ldots, N$, associated with each KF can be computed off-line. The key to the MMAE algorithm is the so-called posterior probability evaluator (PPE) which calculates, in real time, the posterior conditional probability that each model generates the data, i.e. $P_k(t) = \mathrm{Prob}\{H = H_k \mid Y(t)\}$, $k = 1, 2, \ldots, N$; see eq. (2.4). Thus, the PPE represents an identification subsystem. The global state-estimate is then obtained by the probabilistic weighting of the local state-estimates as shown in Fig. 1; this global state estimate is precisely the true conditional mean of the state given the set $Y(t)$ of past measurements and controls.
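The role of the PPE can be sketched in a few lines of code. Appendix I and eq. (2.4) are not reproduced in this section, so the recursion below is simply the standard Bayesian update for Gaussian residuals and is only assumed to agree with the authors' formula. For brevity the sketch takes a scalar measurement, so each residual covariance is a scalar; the function names and all numerical values are illustrative assumptions.

```python
import math


def ppe_update(priors, residuals, covariances):
    """One Bayesian update of the model probabilities P_k(t) (scalar residuals).

    priors      : P_k(t) for k = 1..N
    residuals   : r_k(t+1), one per Kalman filter
    covariances : S_k, precomputed steady-state residual variances
    """
    likelihoods = [
        math.exp(-0.5 * r * r / s) / math.sqrt(2.0 * math.pi * s)
        for r, s in zip(residuals, covariances)
    ]
    weighted = [lk * p for lk, p in zip(likelihoods, priors)]
    total = sum(weighted)
    return [w / total for w in weighted]


def global_estimate(posteriors, local_estimates):
    """Probabilistically weighted combination of the local (scalar) estimates."""
    return sum(p * x for p, x in zip(posteriors, local_estimates))


# Illustrative data (assumed): three models, the second one fits the data best.
posteriors = [1.0 / 3.0] * 3
covariances = [1.0, 1.0, 1.0]
for _ in range(20):
    residuals = [2.0, 0.1, -1.5]  # fixed residuals, just to show the trend
    posteriors = ppe_update(posteriors, residuals, covariances)

x_global = global_estimate(posteriors, [0.0, 1.0, 3.0])
print(posteriors, x_global)
```

For vector measurements the scalar terms would be replaced by the quadratic form of the residual with the inverse of $S_k$ and by the determinant of $S_k$ in the normalizing factor.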
The true conditional covariance can also be calculated on-line; see Appendix I for mathematical details.
The key property of the MMAE algorithm is that, under suitable assumptions, one of the posterior probabilities, say $P_j$, converges to unity, i.e. $P_j(t) \to 1$ as $t \to \infty$, where $j$ indexes the model that is closest to the correct hypothesis $H = H_j$, even though the actual plant parameter is different from $p_j$. These asymptotic convergence results hinge upon information-theoretic arguments and involve non-trivial stationarity and ergodicity assumptions. The detailed convergence proofs involve either the so-called Baram Proximity Measure (BPM) [61, 62, 63], as discussed in Appendix II, or the Kullback information metric [57, pp. 267-279]. The detailed proofs are beyond the scope of this paper. These (asymptotic) convergence results to the nearest probabilistic neighbor using the BPM represent the key system-identification algorithms associated with both the CMMAC and the RMMAC algorithms discussed in the sequel.
It should be noted that the MMAE architecture is essentially identical to that of the sum of Gaussians estimators used extensively in nonlinear filtering [57, 58, 59] which utilize banks of extended Kalman filters. Furthermore, it is important to stress that the blend of dynamic hypothesis-testing concepts and optimal estimation theory is the workhorse of all modern defense and civilian surveillance and fusion algorithms that deal with several sensors and several targets (crossing, manoeuvring, disappearing, re-appearing, etc.).
2.2. Classical Multiple-Model Adaptive Control (CMMAC)
The intriguing convergence properties of the MMAE algorithm, coupled with the robustness shortcomings of MRAC systems to disturbances and sensor noise, gave rise to what we call the classical MMAC (CMMAC) algorithms, which simply integrated design concepts from Linear-Quadratic-Gaussian (LQG) control system design [64, 65] with the MMAE architecture of
Copyright © 2006 John Wiley & Sons, Ltd. Prepared using acsauth.cls Int. J. Adapt. Control Signal Process. 2006; 00:00
Figure 2. The CMMAC algorithm where the local KF state estimates are multiplied by local LQ gains to form local controls which, in turn, generate a probabilistically weighted global control.
Section 2.1 in a purely ad-hoc manner. The classical MMAC architecture (CMMAC) is shown in Fig. 2. Each local state-estimate of the (steady-state) KF, $\hat{x}_k(t|t)$, $k = 1, 2, \ldots, N$, is multiplied by the associated linear-quadratic (LQ) optimal control-gain matrix, $G_k$, to generate a local control signal $u_k(t)$ by
$$u_k(t) = G_k \hat{x}_k(t|t), \quad k = 1, 2, \ldots, N \qquad (2.1)$$
Using precisely the same MMAE calculation by the PPE of the posterior probabilities $P_k(t)$, eq. (2.4), the global control applied to the plant and to the bank of KFs is obtained by probabilistic weighting of the local controls, i.e.
$$u(t) = \sum_{k=1}^{N} P_k(t)\, u_k(t)$$
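A minimal numerical sketch of this probabilistic blending is given below. It follows eq. (2.1) exactly as printed, so any sign convention of the feedback law is assumed to be absorbed into the gains $G_k$. The function names, gains, estimates and probabilities are illustrative assumptions, not values from the paper.

```python
def local_control(gain, state_estimate):
    """u_k = G_k x_k for one model; `gain` is a matrix stored as a list of rows."""
    return [sum(g * x for g, x in zip(row, state_estimate)) for row in gain]


def global_control(posteriors, gains, state_estimates):
    """Probabilistically weighted global control u = sum_k P_k u_k."""
    controls = [local_control(G, x) for G, x in zip(gains, state_estimates)]
    n_inputs = len(controls[0])
    return [
        sum(p * u[i] for p, u in zip(posteriors, controls))
        for i in range(n_inputs)
    ]


# Illustrative data (assumed): N = 2 models, two states, one control input.
posteriors = [0.25, 0.75]                    # P_1(t), P_2(t)
gains = [[[1.0, 2.0]], [[0.5, 0.0]]]         # G_1, G_2 (each 1 x 2)
state_estimates = [[1.0, 1.0], [2.0, 4.0]]   # local KF estimates

u = global_control(posteriors, gains, state_estimates)
print(u)
```

Here the local controls are 3.0 and 1.0, so the blended control is 0.25 × 3.0 + 0.75 × 1.0 = 1.5.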
The CMMAC algorithm of Fig. 2 represented a purely ad-hoc approach to adaptive control. It is not optimal in a stochastic LQG sense [19]. However, several adaptations, extensions and simulations have been reported [20, 22, 21, 23, 24, 25, 26]. In the context of this paper it is important to stress that no robustness to unmodeled dynamics was considered in early CMMAC designs (such robustness issues were unknown in the 1970s) and that performance specifications were generated by LQG tricks and not with the frequency-weight concepts and $H_2$ and $H_\infty$ designs widely adopted at present.
From a high-level philosophical perspective, the CMMAC represents, in our view, a form of probabilistic gain-scheduling. As we remarked earlier, in classical gain-scheduling designs [10, 36, 37, 38, 39, 40, 41] an exogenous measured signal is used to schedule or interpolate from a finite set of gains. In the CMMAC no such exogenous signal is available; rather, it is the posterior probabilities in Fig. 2 that are used to accomplish this probabilistic gain-scheduling. As we shall see, the same situation happens in the RMMAC architecture(s) discussed in the sequel.
It is also noteworthy that a switching version of the CMMAC can be readily (and naturally) implemented in which the local control with the largest posterior probability
is used as the global control; such switching versions of the CMMAC are much more sensitive to the stochastic signals. Finally, in retrospect, the fact that both the local KF state estimates and their residuals, via the posterior probabilities, were combined in the generation of the global adaptive control has certain shortcomings, because one mixes state-estimation and feedback control generation. Thus, errors in the local state-estimates will directly impact the local controls. In the CMMAC architecture no clear-cut separation of an identification subsystem and a control subsystem is made.
2.3. Supervisory Switching Multiple-Model Adaptive Control (SMMAC)
During the past decade a different approach to adaptive control using multiple models, with a strong deterministic flavour, has been initiated [11, 12, 13, 14, 16, 17, 18] and active research along these lines is still in progress. We refer to these methodologies as Supervisory Switching Multiple-Model Adaptive Control (SMMAC). The basic architecture of the different SMMAC algorithms is shown in Fig. 3 (which is an adaptation of Fig. 1 in [18], so that comparisons with the CMMAC and RMMAC become easier). We remark that in all versions of the SMMAC architectures there exists a separation between identification and control (unlike the CMMAC). In other words, the state-estimates inside each multi-estimator do not directly influence the associated local control signal.
We briefly discuss the SMMAC architecture to point out some similarities and differences to the CMMAC of Fig. 2 and the RMMAC architecture to be discussed below (see Fig. 4). The approach is deterministic and the goal is to prove local bounded-input bounded-output stability of the SMMAC system under certain assumptions. The plant-disturbance and sensor-noise signals are assumed bounded rather than being characterized as stochastic processes as in the CMMAC and RMMAC.
The presence of unmodeled dynamics is also considered, although the bound on the unmodeled dynamics is simply an $H_\infty$ bound. The theoretical results to date are restricted to single-input single-output (SISO) systems, although research is underway to extend them to the MIMO case [10]. In reference to Fig. 3, the SMMAC employs a finite number of stable deterministic estimators
(Luenberger observers), called multi-estimators and denoted by $E_k(s)$, $k = 1, 2, \ldots, N$, designed for a grid of distinct known parameter values $p_k \in P$, where $P$ is the compact real-parameter set. The output of each estimator, $y_k(t)$, is compared with the measured (true plant) output $y(t)$ to form the estimation (prediction) errors $e_k(t) = y_k(t) - y(t)$, $k = 1, 2, \ldots, N$. We remark that the errors $e_k(t)$ in Fig. 3 are completely analogous to the residuals $r_k(t)$ in the MMAE (Fig. 1), the CMMAC (Fig. 2) and the RMMAC (Fig. 4). The monitoring signal generator, $M(s)$, is a dynamical system that generates monitoring signals $\mu_k(t)$; these are suitably-defined integral norms of the estimation errors $e_k(t)$. The size of these monitoring signals indicates which of the multi-estimators is closer to the true plant.
In addition to the bank of multi-estimators, it is assumed that a family of multi-controllers, $C_k(s)$, $k = 1, 2, \ldots, N$, has been designed so that each provides satisfactory stable feedback performance for at least one discrete parameter $p_k \in P_D$. However, no quantitative design performance specifications are defined for the SMMAC system. The basic idea is to use the monitoring signals $\mu_k(t)$ to switch in the suitable controller. This is accomplished (see Fig. 3) by a switching logic $S$ which generates a signal $\sigma(t) \in P_D$ that can be used to switch in the appropriate controller. A key property of the switching logic $S$ is that it keeps its output $\sigma(t)$ constant over some suitably long dwell time; this avoids rapid switching of the controllers and allows most transients to die out between controller switchings. The details of the switching logic algorithms differ in the cited SMMAC references.
The basic structural difference between the CMMAC and the SMMAC (compare Figs. 2 and 3) is that in the SMMAC the identification process is completely separated from the control process.
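The interplay between monitoring signals and a dwell-time switching logic can be sketched as follows. Since the details differ across the cited SMMAC references, this is only a generic discrete-time illustration under stated assumptions: the monitoring signal is taken to be an exponentially forgetting sum of squared estimation errors, the logic selects the index of the smallest monitoring signal, and the class name, forgetting factor and dwell length are hypothetical.

```python
class DwellTimeSupervisor:
    """Generic dwell-time switching logic driven by monitoring signals."""

    def __init__(self, n_models, dwell_steps, forgetting=0.95):
        self.mu = [0.0] * n_models      # monitoring signals, one per estimator
        self.dwell_steps = dwell_steps  # minimum number of steps between switches
        self.forgetting = forgetting    # assumed exponential forgetting factor
        self.sigma = 0                  # index of the controller now in the loop
        self.steps_since_switch = 0

    def step(self, errors):
        """Update the monitoring signals with the latest estimation errors and
        return the index of the controller to keep in the feedback loop."""
        self.mu = [
            self.forgetting * m + e * e for m, e in zip(self.mu, errors)
        ]
        self.steps_since_switch += 1
        if self.steps_since_switch >= self.dwell_steps:
            best = min(range(len(self.mu)), key=self.mu.__getitem__)
            if best != self.sigma:
                self.sigma = best
                self.steps_since_switch = 0
        return self.sigma


# Illustrative run (assumed data): the second estimator has the smallest error.
supervisor = DwellTimeSupervisor(n_models=3, dwell_steps=5)
history = [supervisor.step([1.0, 0.1, 0.5]) for _ in range(10)]
print(history)
```

In this run the supervisor holds its initial controller for the first four steps, even though the second estimator is already the best match, and switches only once the dwell time has elapsed.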
Even in the case of the switching CMMAC, where the largest posterior probability switches the corresponding LQ control gain, the identification and control get mixed up. The separation of the identification and control processes in the SMMAC seems to have an advantage, coupled with the idea of infrequent controller-switching. Otherwise, the KFs in the CMMAC serve the same objective as the multi-estimators of the SMMAC. Moreover, the multi-controllers in the SMMAC can be more complex than the simple LQ-gains in the CMMAC. Some potential shortcomings of the SMMAC methodology will be discussed after we present the RMMAC approach below. Unfortunately, SMMAC numerical simulations have been reported for only a couple of (very) academic SISO plants.

2.4. Robust Multiple-Model Adaptive Control (RMMAC)

We now overview the newest multiple-model architecture, which we call RMMAC (Fig. 4), to emphasize the fact that both stability-robustness and performance-robustness are addressed from the start. Our preliminary results on RMMAC can be found in [1, 42, 28, 29] and a more complete treatment is available in Fekri's Ph.D. thesis [2]. We note that the RMMAC architecture has a separation between identification and control, like the SMMAC and unlike the CMMAC.

As in the CMMAC, the RMMAC uses a bank of (steady-state) Kalman Filters (KFs) and relies on stochastic processes for the disturbance signals and the sensor noise. The stochastic plant disturbances provide the sufficient excitation needed for identification [66]. However, unlike the CMMAC, the local KF state-estimates (the $\hat{x}_k(t|t)$ in Fig. 2) are not used in generating the control signals. Only the KF residuals, $r_k(t)$, $k = 1, 2, \ldots, N$, generated on-line, and their pre-computed residual covariance matrices, $S_k$, are utilized by the posterior probability evaluator (PPE) to generate the posterior probability signals $P_k(t)$. The calculation
of the posterior probabilities is identical to that in the MMAE or CMMAC (see Appendix I); see also eq. (2.4) below. A crucial difference is that the nominal KF design is based upon explicit use of the Baram Proximity Measure (BPM) to ensure asymptotic convergence of the posterior probabilities.

Another key difference lies in the construction of the bank of the local robust compensators, $K_j(s)$, $j = 1, 2, \ldots, N$, in Fig. 4, which are designed using the state-of-the-art in robust mixed-$\mu$ synthesis [43, 45, 44, 46, 49, 48]. These compensators, referred to as the local non-adaptive robust compensators (LNARC), are designed so as to guarantee local stability- and performance-robustness, as will be explained in detail in Section 3. Each local compensator $K_j(s)$ generates a local control signal $u_j(t)$. The global control is then generated, as in the CMMAC case, by the probabilistic weighting of the local controls $u_j(t)$ by the posterior probabilities $P_j(t)$:

$$u(t) = \sum_{j=1}^{N} P_j(t)\, u_j(t)$$
A switching version of RMMAC can also be implemented by finding the largest probability $P_j(t)$, at each instant of time, and using the corresponding local $u_j(t)$ as the global control $u(t)$. Moreover, a dwell-time could be incorporated to avoid frequent switching.

For the RMMAC to perform well, one relies upon the convergence of the posterior probabilities to the nearest probabilistic neighbor, in the sense of the Baram proximity measure (BPM) [61, 62, 63], which ensures correct asymptotic identification (see Appendix II). This requires careful design of the KFs, perhaps robustified through the use of fake-white-plant-noise, a time-honoured design trick in linear and nonlinear estimation practice [67].

One of the key algorithms in the RMMAC architecture is the Posterior Probability Evaluator (PPE) in Fig. 4. Since we are concerned with both robust-stability and robust-performance, and the LNARCs $K_1(s), \ldots, K_N(s)$ are designed with this objective in mind, it is imperative that one of the posterior probabilities converges to the right model. We
repeat from Appendix I (eqs. (A.20) to (A.22)) the recursive formula by which the posterior probabilities $P_k(t)$, $k = 1, 2, \ldots, N$, are generated:

$$P_k(t+1) = \left[ \frac{\beta_k \, e^{-\frac{1}{2} r_k'(t+1) S_k^{-1} r_k(t+1)}}{\sum_{j=1}^{N} \beta_j \, e^{-\frac{1}{2} r_j'(t+1) S_j^{-1} r_j(t+1)} \, P_j(t)} \right] P_k(t) \qquad (2.4)$$

where:
$r_j(t)$, $j = 1, 2, \ldots, N$, is the residual of the $j$-th Kalman filter;
$S_j$, $j = 1, 2, \ldots, N$, is the steady-state constant residual covariance matrix of $r_j(t)$;
$\beta_j = \dfrac{1}{(2\pi)^{m/2} \sqrt{\det S_j}}$ is a constant scaling factor, with $m$ the dimension of the residual vector.
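To make the recursion (2.4) and the probability-weighted global control concrete, the following is a minimal Python sketch. It assumes the inverse covariances $S_k^{-1}$ and determinants $\det S_k$ have been precomputed off-line (consistent with the steady-state KFs used here); the function names `quad_form`, `beta`, `ppe_update` and `blend_controls` are hypothetical, and scalar local controls are used purely to keep the example short.

```python
import math


def quad_form(r, S_inv):
    """Return r' S^{-1} r for a residual vector r and a precomputed inverse covariance."""
    n = len(r)
    return sum(r[i] * S_inv[i][j] * r[j] for i in range(n) for j in range(n))


def beta(det_S, m):
    """Constant scaling factor 1 / ((2*pi)^(m/2) * sqrt(det S)) of the Gaussian density."""
    return 1.0 / ((2.0 * math.pi) ** (m / 2.0) * math.sqrt(det_S))


def ppe_update(P, residuals, S_invs, det_Ss):
    """One step of eq. (2.4): map P_k(t) and r_k(t+1) to P_k(t+1) for all k.

    Note: if every likelihood underflows to zero the denominator vanishes;
    this sketch makes no attempt to guard against that case.
    """
    m = len(residuals[0])
    likelihoods = [
        beta(det_S, m) * math.exp(-0.5 * quad_form(r, S_inv))
        for r, S_inv, det_S in zip(residuals, S_invs, det_Ss)
    ]
    denom = sum(like * p for like, p in zip(likelihoods, P))
    return [like * p / denom for like, p in zip(likelihoods, P)]


def blend_controls(P, local_controls):
    """Global control u(t) = sum_j P_j(t) u_j(t) (scalar controls assumed here)."""
    return sum(p * u for p, u in zip(P, local_controls))


if __name__ == "__main__":
    # Two hypotheses, scalar residuals with unit covariance (illustrative numbers only).
    P = ppe_update([0.5, 0.5], [[0.0], [2.0]], [[[1.0]], [[1.0]]], [1.0, 1.0])
    print(P, blend_controls(P, [1.0, -1.0]))
```

The model whose residual is small relative to its own covariance gains probability at each step, which is why the consistency between actual residuals and the precomputed $S_k$ matters so much for correct identification.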
Notice that the propagation of the posterior probabilities involves the quadratic forms $r_j'(t) S_j^{-1} r_j(t)$. If the residuals are, in a sense, regular, i.e. their size and correlation are more-or-less consistent with those predicted by their associated covariance matrix (say within 3 to 4 sigma levels), then the posterior probabilities will behave in a smooth manner and will result in correct identification (convergence to the nearest probabilistic neighbour in the BPM sense). This is accomplished by the very careful design of the KFs, as explained in Section 4.

If the theoretical assumptions of Appendix II are valid or mildly violated, the RMMAC yields excellent performance, as evidenced by thousands of simulations [2]. If, however, the theoretical assumptions are severely violated, then the residuals can become very large compared with the size predicted by their covariances $S_j$, the posterior probabilities can get confused, and the inaccurate identification can cause the RMMAC to yield degraded performance and, in rare circumstances, break into instability. These issues will be discussed further in the sequel; it is important to state explicitly the potential shortcomings of any adaptive method (a rarity in the adaptive literature).

2.5. Unfalsified Control

As we have remarked, the recent unfalsified control method developed by M.G. Safonov and colleagues [30, 31, 32, 33, 34, 35] is a promising approach to the control of uncertain plants. We stress that in that methodology there are no assumptions about the plant to be controlled; one applies control(s) and makes real-life measurements. It is assumed that there exists a collection of $N$ precomputed compensators and that at least one of them will stabilize the actual plant. Thus, the emphasis is on stability (which is certainly robust since it involves the actual plant). It can be argued that the availability of $N$ compensators makes unfalsified control a multiple-model scheme.
When one of the available compensators is connected to the plant and does not stabilize it, a safe algorithm should recognize this instability rapidly, in real time, and discard that compensator. Next, another compensator is tried, and so on. By assumption, one will eventually find one of the stabilizing compensators. Since there are absolutely no assumptions on the plant, we cannot make any direct comparisons with the architectures discussed above. It is hard to say how one can design, for this method, the required family of compensators to achieve not only robust-stability but also robust-performance based upon explicit closed-loop performance specifications. For these reasons, we will not comment any further on this potentially useful methodology.
2.6. Discussion

The three adaptive multiple-model algorithms (CMMAC, SMMAC, and RMMAC) presented above have the following common characteristics.

(a). One must design a set of $N$ multi-controllers or dynamic compensators.
(b). One must design a set of $N$ Kalman filters or multi-estimators (observers).
(c). One must implement an identification process by which the actual (global) adaptive control is generated.

Clearly, the complexity of the adaptive system will depend on the number, $N$, of models that must be implemented. Ideally, $N$ should be as small as possible. However, it should be intuitively obvious that if $N$ is too small the performance of the adaptive system may not be very good. On the other hand, if $N$ is very large, one may reach the point of diminishing returns as far as adaptive performance improvement is concerned. It follows that we need a more-or-less systematic procedure by which, starting from a compact parameter set, $p \in P$, one defines a finite set of $N$ discrete values (models), $P_D = \{p_1, p_2, \ldots, p_N\} \subset P$, that are used subsequently in designing the KFs in the CMMAC and RMMAC, the multi-estimators in SMMAC, as well as the compensators. The following section summarizes our suggested methodology, which hinges upon the recent developments in robust feedback control synthesis using the so-called mixed-$\mu$ methodology and associated software.

Even though all MMAC architectures are made by piecing together LTI systems, the probabilistic weighting in the RMMAC as well as the supervisory switching logic of the SMMAC result in a highly nonlinear and time-varying closed-loop MIMO feedback system. Hence, it is naive to expect foolproof global asymptotic stability results in the very near future, because there does not, as yet, exist a solid mathematical theory for global (stochastic) nonlinear time-varying stability which can be readily adapted to the multiple-model architectures discussed in this paper.
Even in the simpler CMMAC, involving LQG controllers, attempts to prove global stability were not successful [20, 22, 23]. Thus, in the opinion of the authors, what is needed in the short run is additional pragmatic understanding of the different multiple-model approaches, their similarities and differences, and consistent fair comparisons of their performance improvement over non-adaptive designs. There are also numerous opportunities for future theoretical research to investigate such global stability-robustness and performance-robustness issues, especially in the MIMO case.
3. DESIGNING ROBUST COMPENSATORS IN THE RMMAC ARCHITECTURE

3.1. Introduction

During the past several years a very sophisticated and complete non-adaptive design methodology, accompanied by Matlab design software, has been developed for the robust feedback control of MIMO linear time-invariant (LTI) uncertain dynamic systems with simultaneous dynamic and parametric errors. This design methodology is often called the mixed-$\mu$ synthesis method, which involves the so-called D,G-K iteration, and it requires the design of different $H_\infty$ compensators at each iteration. The outcome of the mixed-$\mu$ synthesis process is the definition of a non-adaptive LTI MIMO dynamic compensator with fixed parameters, which guarantees that the closed-loop feedback system enjoys stability-robustness and performance-robustness, i.e. it meets the posed performance specifications in the frequency
domain (if such a compensator exists). The mixed-$\mu$ synthesis, loosely speaking, de-tunes an optimal $H_\infty$ nominal design, which meets more stringent performance specifications, to reflect the presence of inevitable dynamic and parameter errors. In particular, if the bounds on key parameter errors are large, then the mixed-$\mu$ synthesis yields a robust LTI design, albeit with inferior performance guarantees as compared to the $H_\infty$ nominal design. So the price for stability-robustness is poorer performance. Experience has shown that if the bounds on the parameter errors decrease, then the mixed-$\mu$ synthesis yields a design with better guaranteed performance. Two recent applications [68, 69] illustrate this uncertainty/performance tradeoff in a clear manner. We cannot provide details about this elegant methodology in this paper.

Whether we are dealing with non-adaptive or adaptive feedback designs we must take into account the following engineering issues:

(a). Complex-valued plant unmodeled dynamics (e.g. unmodeled time-delays, plant-order reduction, parasitic high-frequency poles and zeros, high-frequency bending and torsional modes, etc).
(b). Errors in key real-valued plant parameters in its state-space realization.
(c). Explicit definition of performance requirements, typically in the frequency domain (rather than just the shape of step responses, location of dominant closed-loop poles, etc); these reflect the common objective to have small tracking errors in the low-frequency region and small control signals in the high-frequency region.
(d). Unmeasurable plant disturbances, perhaps with information on their power spectral densities.
(e). Unmeasurable sensor noises, perhaps with information on their power spectral densities.

From now on, we focus our attention on the problem of disturbance-rejection in the presence of noisy sensor measurements so as to simplify the exposition. Adding command-following to the specifications is straightforward but complicates the exposition.
What we want to stress relates to our philosophy that we cannot design adaptive control systems without explicit quantification of desired performance. Assume that we have a state-space description of the plant (excluding unmodeled dynamics) of the form

$$\begin{aligned} \frac{d}{dt} x(t) &= A(p)x(t) + B(p)u(t) + L(p)d(t) \\ y(t) &= C_1(p)x(t) + D(p)n(t) \\ z(t) &= C(p)x(t) \end{aligned} \qquad (3.1)$$

where $x(t)$ is the state vector, $u(t)$ the control vector, $d(t)$ the plant-disturbance vector, $y(t)$ the (noisy) measurement vector, $n(t)$ the sensor noise and $z(t)$ the performance (output or error) vector, i.e. the vector for which we wish to minimize the effects of the disturbance $d(t)$ and noise $n(t)$ so as to have superior disturbance-rejection. The system matrices depend upon a real-valued parameter vector $p$, where $p$ is constrained to be in a (hyper) parallelepiped, $p \in P$; this is a required $\mu$-synthesis constraint. Thus, we must have a lower- and an upper-bound (real-valued) for each independent uncertain parameter.

The disturbances, $d(t)$, can be either deterministic time-functions or stochastic processes. In either case, the robust $\mu$-synthesis requires a disturbance frequency weight for superior performance. If $d(t)$ is a stochastic process, its power-spectral density naturally defines this weight.

From a performance point of view, in order to achieve superior disturbance-rejection, the designer specifies a frequency weight on $z(t)$. Typically, to achieve superior disturbance-rejection in the low-frequency region, the designer specifies, in the $\mu$-synthesis methodology, a frequency weight, say of the form

$$\tilde{z}(s) = A_p \, \frac{\alpha}{s + \alpha} \, I \, z(s) \qquad (3.2)$$
which implies that superior disturbance-rejection is most important in the frequency range $0 \le \omega \le \alpha$. The larger the performance-gain parameter $A_p$, the better the desired disturbance-rejection.

To complete the robust design synthesis the designer must provide frequency-dependent bounds for all (structured or unstructured) unmodeled dynamics, and frequency weights for the control, disturbance and sensor noise vectors. The control and sensor-noise weights, together with the bound(s) on the unmodeled dynamics, safeguard against very high-bandwidth feedback designs.

3.2. The GNARC Design

Before a wise designer can make a decision on whether or not to implement an adaptive system, he/she must have a solid knowledge of what the best robust non-adaptive design is. We call the best global non-adaptive robust compensator the GNARC. The GNARC is computed via mixed-$\mu$ synthesis and it takes into account the frequency-domain bounds on unmodeled dynamics, the various frequency weights that quantify disturbance-rejection requirements, control effort and, perhaps, power spectral densities for the plant disturbances and sensor noises. We certainly do not intend to provide here a tutorial exposition of the mixed-$\mu$ synthesis method.

To compute the GNARC, one fixes the performance gain $A_p$ in eq. (3.2) to some initial value and exercises the Matlab software which, after a sequence of the so-called D,G-K iterations, determines a compensator $K(s)$ and generates an upper-bound $\mu_{ub}(\omega)$. If

$$\mu_{ub}(\omega) < 1 \qquad (3.3)$$
then the resulting feedback design is guaranteed to be stable for all legal unmodeled dynamics and the entire parameter uncertainty $p \in P$. Moreover, in addition to stability-robustness, we are guaranteed that we meet or exceed the posed performance requirements. Therefore, in order to find the best GNARC we must maximize the performance parameter $A_p$ in eq. (3.2) until the $\mu$-upper bound is just below unity, say $0.995 \le \mu_{ub}(\omega) < 1$.

The GNARC is a single dynamic (SISO or MIMO) compensator that can be used by the designer to fully understand what is the best possible robust performance in the absence of adaptation. Since the feedback system is LTI, a whole variety of performance evaluations are possible, using representative values of the uncertain real-parameter vector $p \in P$ for the plant, as summarized in Table I. We believe that such a thorough understanding of the non-adaptive GNARC is essential prior to making a design decision on using some sort of multiple-model adaptive control.

It has been our experience that such GNARC analyses point out which parameter uncertainty is most critical; this can be used to eliminate non-critical parameters from further consideration. Moreover, the performance of the GNARC, say for a SISO system, can change drastically if one adds more measurements and/or controls. Thus, an unacceptable performance for a SISO GNARC non-adaptive system may, indeed, become acceptable if more controls and/or sensors
Table I. List of Performance Evaluation Tools

(1) Magnitude (or singular-value) Bode plot of the closed-loop transfer function from the plant disturbance, $d$, to output, $z$, which measures the quality of disturbance-rejection vs frequency. Assuming that the plant has no integrators, the magnitude of this transfer function, in the low-frequency region, will be approximately $1/A_p$. This is why we must maximize the performance parameter $A_p$ for superior disturbance-rejection.
(2) Magnitude (or singular-value) Bode plot of the closed-loop transfer function from the sensor noise, $n$, to output, $z$, which measures the quality of insensitivity to sensor noise vs frequency.
(3) Magnitude (or singular-value) Bode plot of the closed-loop transfer function from the plant disturbance, $d$, to control, $u$, which measures the impact of the plant disturbance on the control vs frequency.
(4) Magnitude (or singular-value) Bode plot of the closed-loop transfer function from the sensor noise, $n$, to control, $u$, which measures the impact of the sensor noise on the control vs frequency.
(5) Root-mean-square (RMS) tables assuming that the plant disturbance, $d$, and the sensor noise, $n$, are stationary stochastic processes. Such RMS values are readily evaluated via the solution of Lyapunov equations. Individual or combined RMS tables for the output, $z$, and the control, $u$, as a function of the plant disturbance, $d$, and the sensor noise, $n$, can be computed.
(6) Time-domain responses, e.g. step- or sinusoidal-disturbance, stochastic signals, etc., for different values of the uncertain constant parameters and for slowly time-varying parameters (within their predefined ranges).
are introduced [2] and a MIMO GNARC analyzed, thereby eliminating the need for complex adaptive control.

3.3. The FNARC Design

If the GNARC analyses discussed above indicate the need for adaptive control, they provide a lower-bound upon robust performance. The fixed non-adaptive robust compensators (FNARC) provide the means for quantifying an upper-bound on robust performance.

Ideally, the FNARC analysis assumes an infinite number of models, $N \to \infty$, in any multiple-model adaptive scheme. Thus, we understand what is the best possible performance if we knew the real parameter(s) exactly. In practice, to determine the FNARC one uses a dense grid of parameters $p_j$, indexed by $j$, in $P$ and determines the associated robust compensator for each $p_j$ using exactly the same bounds on unmodeled dynamics and frequency weights employed in the GNARC design. Thus, we can make fair and meaningful comparisons. For each $p_j$ we use the complex-$\mu$ design methodology and Matlab software [49], because both the performance weights and bounds on unmodeled dynamics are complex-valued and there are no real parameter uncertainties. For each $p_j$, we again maximize the performance parameter $A_p$ in (3.2) until the complex-$\mu$ upper-bound, $\mu^{c}_{ub}(\omega)$, is just below unity for all frequencies, say $\mu^{c}_{ub}(\omega) \approx 0.995$, to be consistent with the GNARC lower-bound. One can then again analyze each FNARC design using the six techniques outlined in Table I.
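The maximization of the performance gain described for both the GNARC and the FNARC designs amounts to a one-dimensional search: raise $A_p$ until the peak of the $\mu$ upper bound sits just below unity. A minimal bisection sketch is given below. It is not the authors' Matlab procedure; the callable `mu_peak` is a hypothetical stand-in for "run the synthesis for this $A_p$ and return the peak over frequency of the resulting upper bound", and the sketch assumes that this peak does not decrease as $A_p$ grows.

```python
def maximize_performance_gain(mu_peak, ap_lo, ap_hi, target=0.995, max_iter=60):
    """Bisection for the largest Ap whose mu upper-bound peak lies in [target, 1).

    mu_peak(Ap) is a hypothetical callable wrapping the robust synthesis step.
    Assumptions: mu_peak is non-decreasing in Ap and mu_peak(ap_lo) < 1.
    """
    if mu_peak(ap_lo) >= 1.0:
        raise ValueError("no robust design exists even at the smallest gain")
    if mu_peak(ap_hi) < 1.0:
        return ap_hi  # the bracket never violates the test; widen ap_hi to go further
    lo, hi = ap_lo, ap_hi
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        mu = mu_peak(mid)
        if mu >= 1.0:
            hi = mid          # robustness test fails: back off the gain
        else:
            lo = mid          # robustness test passes: remember this gain
            if mu >= target:  # close enough to unity, stop
                break
    return lo


if __name__ == "__main__":
    # Toy stand-in for the synthesis: the peak grows linearly with Ap (illustrative only).
    toy_mu_peak = lambda ap: ap / 50.0
    best = maximize_performance_gain(toy_mu_peak, ap_lo=1.0, ap_hi=100.0)
    print(best, toy_mu_peak(best))
```

Because each evaluation of `mu_peak` would in practice hide a full synthesis run, the number of bisection steps is what dominates the design time, and the same routine would be called once for the GNARC and once per grid point for the FNARC.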
Our experience indicates that detailed analyses, especially at the corners and faces of the hyperparallelepiped $P$, provide useful insights regarding the impact of the subset of the uncertain real parameters upon performance; this determines the need for sophisticated adaptive control.

3.4. The Potential Benefit of Adaptive Control

Recall that we had posed the following question: do we need adaptive control? The GNARC and FNARC results provide the designer with the means to answer this question.

In Fig. 5 we visualize a hypothetical plot of the outcome of the GNARC and FNARC designs by plotting the (maximized) performance parameter $A_p$, see (3.2), as a function of a scalar uncertain real parameter, $p$, $p_L \le p \le p_U$. We denote the (constant) value associated with the GNARC design by $A_p^G$. We denote the parameter-dependent value associated with the FNARC by $A_p^F(p)$, $p_L \le p \le p_U$. The difference, $A_p^F(p) - A_p^G \ge 0$, quantifies the impact of the uncertain parameter $p \in [p_L, p_U]$ upon performance. The FNARC process indicates that if the parameter $p$ were known exactly, then the (low-frequency) disturbance-rejection is approximately $1/A_p^F(p)$. The GNARC process indicates that if the parameter $p$ were unknown, then the (low-frequency) disturbance-rejection is approximately $1/A_p^G$. Note that to obtain the FNARC benefits we must implement a multiple-model architecture with an infinite number of models.

In this manner we have quantified the potential performance benefits of adaptive control; the non-adaptive (single-model) GNARC provides the lower-bound upon expected performance, while the adaptive (infinite-model) FNARC provides the performance upper-bound. This information is critical in deciding whether to implement an adaptive control system or not. The shapes of the curves in Fig. 5 provide additional valuable information. In the hypothetical case of Fig. 5, we should expect the benefit of using adaptive control to be greatest if the true parameter were near its upper-bound, i.e. $p \approx p_U$.
The benefits decrease if the unknown parameter is closer to its lower-bound, i.e. $p \approx p_L$. Indeed it may well happen that $A_p^F(p_L) = A_p^G$. This can occur, for example, if the parameter $p$, in rad/sec, represents the value of a non-minimum phase zero which places inherent restrictions upon disturbance-rejection (see, e.g. [70]). Such a non-minimum phase system has been analyzed, using the RMMAC, in [27].
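The comparison between the GNARC and FNARC performance gains can be tabulated with a few lines of code. The sketch below is only an illustration of the bookkeeping: the function name `adaptive_benefit` is hypothetical, and the numbers in the usage example are made up and do not come from the paper or from Fig. 5.

```python
def adaptive_benefit(ap_gnarc, ap_fnarc_by_p):
    """Tabulate the GNARC (lower-bound) vs FNARC (upper-bound) comparison per parameter value.

    ap_gnarc      : maximized performance gain of the single global design
    ap_fnarc_by_p : dict mapping each grid value p to its maximized FNARC gain
    """
    rows = []
    for p, ap_f in sorted(ap_fnarc_by_p.items()):
        rows.append({
            "p": p,
            "gain_gap": ap_f - ap_gnarc,           # expected to be >= 0
            "gnarc_rejection": 1.0 / ap_gnarc,     # approx. low-frequency |d -> z|
            "fnarc_rejection": 1.0 / ap_f,
            "improvement_factor": ap_f / ap_gnarc,
        })
    return rows


if __name__ == "__main__":
    # Hypothetical gains on a three-point grid of the uncertain parameter.
    for row in adaptive_benefit(10.0, {0.25: 10.0, 1.0: 40.0, 1.75: 100.0}):
        print(row)
```

A flat `improvement_factor` close to one across the grid would suggest that adaptation offers little over the GNARC, whereas a strongly varying factor indicates where in the parameter range additional models are likely to pay off.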
4. DESIGNING THE BANK OF KALMAN FILTERS FOR THE RMMAC ARCHITECTURE

4.1. Introduction

In this section we discuss the critical issues related to the design of the Kalman filters (KFs) in the RMMAC architecture. We remark that the design of the bank of KFs is much more systematic (and complex) than that of the ad-hoc multi-estimators employed in SMMAC architectures.

The proper design of each KF in the RMMAC architecture of Fig. 4 is crucial in order to satisfy the theoretical assumptions [61, 62, 63] which will imply that the PPE will yield the correct model identification. The KF design explicitly utilizes the Baram proximity measure (BPM). Appendix II summarizes the concepts leading to the on-line generation of the posterior probabilities and contains the key equations for calculating the BPM.

The design of the KFs is done after the number of models and their boundaries have been established using the procedures in Section 3.5. Recall that the original parameter set, $p \in P$ (a
5. THE RMMAC/XI ARCHITECTURE

The simulations in Section 6 will demonstrate that the RMMAC works quite well even if we violate the theoretical assumptions for MMAE convergence in a mild manner. This fact is supported by hundreds of different simulations in Fekri's Ph.D. thesis [2]. Nonetheless, we must always remember that: theories have limitations, stupidity does not!

Obviously, the superior performance of the RMMAC hinges upon the rapid convergence of the posterior probabilities to the correct model (see eq. (2.4) and Appendices I and II). As we remarked in Section 2.5, such a convergence critically depends upon the regularity of the residuals generated by the bank of Kalman filters (KFs) in Fig. 4. Each KF is designed for a particular covariance matrix $\mathrm{cov}[\xi(t); \xi(\tau)] = E\{\xi(t)\xi'(\tau)\} = \Xi\,\delta(t - \tau)$ of the continuous-time zero-mean plant white noise $\xi(t)$, where $\Xi$ is the plant-noise intensity matrix. The numerical value of the noise intensity $\Xi$ determines the size of the KF gains and of the residual covariance matrices $S_k$. It also determines the size of the colored stochastic disturbances applied to the plant.

Under normal operating conditions these exogenous plant disturbances will fall within a specific range. It may very well happen that once in a while the actual plant disturbances become very much larger than the normal ones. During these abnormal time-intervals, the
load-alleviation automatic control systems are often designed for different levels of turbulence and the pilot can switch-in the appropriate control system. However, in case of extreme turbulence (not anticipated
actual KF residuals will be much larger than those predicted by their associated normal residual covariances, $S_k$. Our experience has been that during these abnormal time intervals [2], the posterior probabilities generated by eq. (2.4) can undergo rapid oscillations and frequent switching among the models and, as a consequence, the RMMAC performance deteriorates; often it is worse than that of the non-adaptive GNARC design. A numerical example in Section 6 will provide concrete evidence of this phenomenon.

In short, the RMMAC can behave poorly when the true stochastic plant disturbances, as quantified by the intensity matrix $\Xi_{act}$ of the actual plant white noise, are very different from those associated with the intensity $\Xi_d$ used to design the linear KFs. If $\Xi_{act} \gg \Xi_d$, then the actual residuals are much higher than, say, the 3-sigma values expected from the residual covariance matrices $S_k$ (see eq. (2.4)), and this can lead to rapid switching of the posterior probabilities, confused identification and poor RMMAC performance. If, on the other hand, $\Xi_{act} \ll \Xi_d$, then the actual residuals are much lower than, say, the 3-sigma values expected from the residual covariance matrices $S_k$, and this can also lead to rapid switching of the posterior probabilities, or slow convergence, confused identification and poor RMMAC performance. In this case, the smaller actual plant disturbance may not provide the sufficient persistent excitation, required by the identifiability conditions, to generate signals that are not masked by the measurement noise.

These pragmatic considerations led us to develop the so-called RMMAC/XI architecture illustrated in Fig. 10. By increasing the complexity of the RMMAC we can mitigate the RMMAC performance deterioration problems. The basic idea in RMMAC/XI is to introduce additional models, i.e. increase the number of hypotheses, to reflect different ranges of the plant noise intensity matrix $\Xi$.
Let us assume that the number, $N$, of models and their size has been determined by the required adaptive performance specifications and that we have already calculated the $N$ LNARCs denoted by $K_k(s)$, $k = 1, 2, \ldots, N$. Let us further suppose that the plant noise intensity is bounded, $\Xi \in [\Xi_L, \Xi_H]$, and that, for the sake of simplicity, we decide to select two intermediate values, $\Xi_1$ and $\Xi_2$, such that $\Xi_L < \Xi_1 < \Xi_2 < \Xi_H$. Then we design two sets of linear KFs, one set indexed by $k = 1, 2, \ldots, N$ using the noise intensity $\Xi_1$ and the second set indexed by $k = N+1, N+2, \ldots, 2N$ using the noise intensity $\Xi_2$. It is important to stress that the nominal points for designing each KF, as determined by the BPM method of Section 4, will be different (and must be recomputed) for each value of $\Xi_1$ and $\Xi_2$. Essentially, we have doubled the number of hypotheses in the associated MMAE; but we can still apply the MMAE-based identification methodology. This allows us to define the RMMAC/XI architecture shown in Fig. 10.

Note that each of the $2N$ KFs will now generate a different residual, $r_k(t)$, $k = 1, 2, \ldots, 2N$. Also, we pre-compute the $2N$ residual covariance matrices $S_k$. These are introduced to the PPE of Fig. 10, which will generate $2N$ posterior probabilities, $P_k(t)$, $k = 1, 2, \ldots, 2N$, one for each of the $2N$ hypotheses. Note, however, that the bank, and number, of the $N$ LNARCs is unchanged, since their design does not depend on $\Xi$; the LNARCs are only influenced by the
in the design phase) the pilot typically disconnects the control system and flies the aircraft himself.
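The RMMAC/XI bookkeeping of Section 5 (a PPE with $2N$ hypotheses feeding a bank of only $N$ LNARCs) can be illustrated with a short sketch. The excerpt above breaks off before stating how the $2N$ posterior probabilities are combined, so the rule used here is an assumption made for illustration: the two hypotheses that share a parameter subinterval (one per noise-intensity level) have their probabilities added to weight the common local control. The function name `rmmac_xi_control` and the scalar controls are likewise hypothetical.

```python
def rmmac_xi_control(P, local_controls):
    """Blend N local controls using 2N posterior probabilities.

    Assumption (not stated in the excerpt): hypotheses k and k + N correspond to the
    same parameter subinterval at two noise-intensity levels, so their probabilities
    are summed to weight the k-th local control.
    """
    n = len(local_controls)
    if len(P) != 2 * n:
        raise ValueError("expected two posterior probabilities per local compensator")
    weights = [P[k] + P[k + n] for k in range(n)]
    u = sum(w * u_k for w, u_k in zip(weights, local_controls))
    return u, weights


if __name__ == "__main__":
    # N = 2 compensators, 2N = 4 hypotheses (illustrative probabilities only).
    u, weights = rmmac_xi_control([0.1, 0.0, 0.6, 0.3], [1.0, -1.0])
    print(u, weights)
```

Under this assumed rule the identification of the noise-intensity level affects only which KF residuals are trusted, while the control law still depends solely on the identified parameter subinterval.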
6. RMMAC SIMULATIONS

We described above how the RMMAC architecture combines the state-of-the-art in mixed-$\mu$ robust synthesis and multiple model adaptive estimation (MMAE) system identification. Furthermore, we described the step-by-step design process required to implement an RMMAC design. In this section we test and evaluate the disturbance-rejection performance of the RMMAC
The following parameters in (6.2) are fixed and known:

$$m_1 = m_2 = 1, \quad k_2 = 0.15, \quad b_1 = b_2 = 0.1, \quad \alpha = 0.1 \qquad (6.3)$$

The upper and lower-bounds for the uncertain spring constant, $k_1$, are:

$$\Omega = \{k_1 : 0.25 \le k_1 \le 1.75\} \qquad (6.4)$$

The performance variable (output) $z(t)$ is the position of mass $m_2$,

$$z(t) \equiv x_2(t) \qquad (6.5)$$
controller complexity, as measured by the number of models in the RMMAC, should be the natural by-product of the performance requirements. We have just demonstrated that if we demand that the performance equals or exceeds X=70% of the FNARC, we require the four models summarized in Table II. If we are willing to accept a somewhat inferior performance and select, say, X=50%, then the procedure outlined yields only two models. If we wish to have much better performance and select, say, X=90%, then the outlined procedure yields nine models. Therefore, as we demand better and better performance we must increase the RMMAC complexity, and this agrees with engineering intuition. If explicit performance requirements were used in the design of the SMMAC, the same philosophy could be applied to those architectures as well.

6.4. Predicting Potential RMMAC Performance Benefits

Testing the RMMAC requires significant computation using multiple Monte Carlo (MC) runs under different scenarios. It is highly desirable, as explained in Section 3, to use the LTI feedback designs, using the GNARC and LNARCs, to quantify the potential benefits of using adaptive control in general, and the RMMAC in particular. From a pragmatic engineering perspective we must have tradeoffs that contrast the performance improvements (if any) of the very sophisticated RMMAC vis-a-vis the much simpler non-adaptive GNARC design. To the best of our knowledge, such performance tradeoffs have not been explicitly quantified in other adaptive control studies.

Referring to Fig. 4, the RMMAC requires the on-line computation of its four Kalman filters (KFs) as well as of its four dynamic LNARCs, $K_1(s), \ldots, K_4(s)$, in addition to the calculation of the four posterior probabilities, $P_1(t), \ldots, P_4(t)$, by the posterior probability evaluator (PPE): a lot of computations!
In order to understand how one can easily predict the potential RMMAC performance characteristics, assume that one of the posterior probabilities converges to its nearest probabilistic neighbor (which it does, as we demonstrate in the sequel); it follows that, after a transient time, a specific LNARC is used. After the probability convergence, the RMMAC essentially operates as an LTI stochastic feedback system! In the spirit of Table I, this allows us to calculate two key transfer functions for the disturbance-rejection and control-signal characteristics:
Disturbance-rejection: $M_z(s)$
Control-signal: $M_u(s)$
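Once a single LNARC is in the loop, the closed loop is an LTI system driven by white noise, so stationary RMS values can be predicted without Monte Carlo runs. The following is only a minimal sketch of that idea, assuming a continuous-time closed-loop state-space realization `(A, L, C)` is already available; the function name `stationary_rms` and the first-order example are hypothetical and are not taken from the designs in this paper.

```python
import numpy as np

def stationary_rms(A, L, C, Xi):
    """RMS of z = C x for dx/dt = A x + L xi, xi white noise of intensity Xi.

    Solves the Lyapunov equation A S + S A^T + L Xi L^T = 0 by vectorization.
    A is assumed Hurwitz (a stable closed loop); this is not checked here.
    """
    n = A.shape[0]
    I = np.eye(n)
    # Row-major vec: vec(A S) = (A kron I) vec(S), vec(S A^T) = (I kron A) vec(S)
    lhs = np.kron(A, I) + np.kron(I, A)
    rhs = -(L @ Xi @ L.T).reshape(-1)
    S = np.linalg.solve(lhs, rhs).reshape(n, n)   # stationary state covariance
    return np.sqrt(np.diag(C @ S @ C.T))          # RMS of each output channel

# Hypothetical first-order closed loop: dx/dt = -2 x + xi, z = x
rms_z = stationary_rms(np.array([[-2.0]]), np.array([[1.0]]),
                       np.array([[1.0]]), np.array([[1.0]]))
```

Applying the same routine to the closed loop formed with the GNARC and with each LNARC would give the kind of RMS comparison discussed in this section, under the stated convergence assumption.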
6.6. RMMAC Stochastic Simulations and Performance Evaluations

Unless stated otherwise: (a) all simulations use a stochastic plant disturbance and white measurement noise generated according to (6.1) and (6.6). The true system includes an actual (but unmodeled) legal time-delay of 0.01 secs in the control channel. (b) All model probabilities are initialized to Pk(0) = 0.25 (k = 1, ..., 4) at t = 0 secs. (c) We present numerical averages over 5 MC simulations. In the sequel some representative stochastic simulations are shown using the complete RMMAC closed-loop system. Due to space limitations, we only show typical plots; however, our conclusions are based on thousands of other MC runs not explicitly shown in this paper [2].

6.6.1. Easy Identification, I: The dynamic evolution of the four posterior probabilities when the true k1 = 1.65, well inside the Model #1 subinterval, and the corresponding outputs for the RMMAC and the GNARC systems are shown in Fig. 19. The correct model (Model #1) is identified quickly, in about 2 secs. The significant improvement in disturbance-rejection by the RMMAC vis-à-vis that of the GNARC is evident in Fig. 19(b).

6.6.2. Easy Identification, II: The dynamic evolution of the four posterior probabilities when the true k1 = 0.3, well inside the Model #4 subinterval, and the corresponding outputs for the RMMAC and the GNARC systems are shown in Fig. 20(a). The correct model (Model #4) is identified in about 10 secs. The improvement in disturbance-rejection by the RMMAC is again evident in Fig. 20(b).

6.6.3. Harder Identification: When the actual spring constant is near the boundary between two models, it takes longer (more data) to resolve the true hypothesis. In this example, k1 = 0.405 is selected, which belongs to Model #3 but is also very close to Model #4, see
Copyright © 2006 John Wiley & Sons, Ltd. Prepared using acsauth.cls Int. J. Adapt. Control Signal Process. 2006; 00:00
6.7.3. Slow Parameter Variation: As mentioned in Section 1, a driving engineering motivation for using adaptive control is the need to deal with slow changes in the uncertain plant parameters. In all the numerical simulations presented up to now, we constrained the uncertain parameter to remain constant for all time. Of course, the presence of a time-varying spring stiffness violates the plant LTI assumption, and hence all stationarity and ergodicity assumptions (required to prove the posterior probability convergence results) do not hold. Nevertheless, it is important to understand, for any adaptive system, its behavior and performance in the presence of slow parameter variations. In the following numerical MC simulations, the uncertain spring stiffness is assumed to be
Finally, we stress that all mixed-μ computations to determine the GNARC and LNARCs were carried out with the (as yet not commercially available) software provided to us by Prof. G. Balas [48]. If we had designed these compensators with the commercial version of Matlab [49], which uses only the D-K scales in the iterative design of the H∞ compensators, they would be much more conservative and the performance improvements would be inferior [69].
7. DISCUSSION AND FUTURE RESEARCH DIRECTIONS

We have presented a brief overview of different approaches to adaptive control that utilize multiple-model architectures. Recent research in the area of the CMMAC and RMMAC architectures seems most promising, but we still have no widely accepted solution to the robust adaptive control problem. As a consequence, there is still a great deal of work that needs to be done, both theoretical and applied, to fully understand the similarities and differences between the two architectures and the variants within each one, and suitable extensions.

Any future theoretical research must adopt a design methodology. We strongly believe that the adaptive problem formulation should always include unmodeled dynamics, unknown plant disturbances and sensor noise, in addition to the (slowly-varying) uncertain real parameters. Furthermore, the design methodology should contain explicit performance specifications, and performance-robustness should be an integral part of the adaptive problem formulation. Thus, closed-loop stability arguments are not enough. This implies that the local compensators, the LNARCs, must be designed to exhibit both local robust-stability and robust-performance, following the ideas presented in Section 3 and as utilized in our RMMAC architectures.

In the unfalsified control methodologies, if robust-performance is to be considered, the available results must be significantly extended; most probably, the family of robust compensators available for unfalsified and safe control will also have to be designed using the methodology suggested in Section 3. Recent results in unfalsified control [30, 31] claim stability, but performance is not addressed (after all, an open-loop stable highly-uncertain plant remains robustly-stable with no feedback, albeit with lousy performance).

The structure of the multi-controllers used in the SMMAC architecture does not explicitly reflect a robust-performance requirement. Indeed, for the most part, the SMMAC multi-controllers have a common $A_c$-matrix (so all controllers have the same poles). This was done for the purpose of computational simplicity and bumpless transfer to avoid control transients. However, such controller structures are not appropriate if performance-robustness is desired. A recent exception [75] uses multi-controllers that have an LQG structure, but they use pole-placement ideas, which are also notorious for their lack-of-robustness properties. On the other hand, the RMMAC architecture deals directly with such robust-performance issues, and the dynamic characteristics of the RMMAC compensators are quite different.

We believe that, as stressed in Section 3, one cannot arrive at a systematic procedure for defining the number of models required in any multiple-model scheme without explicit performance specifications. Specification of complexity requires fixing the number, N, of models; however, one still needs to properly calculate their boundaries. Specification of the required performance, such as in the % FNARC method, would naturally lead to the required number of models and their boundaries. In either case, one needs to utilize the mixed-μ methodology to maximize performance and derive the LNARCs.
We note that for many industrial applications, where tuned PID controllers are judged to yield satisfactory performance, designing the LNARCs may be overkill. Based upon operator experience, one can define the N models in an ad-hoc manner as opposed to the systematic procedures of Section 3. In such cases, one can still design the KFs, using the BPM methodology of Section 4, and obtain and implement the identification subsystem of the resulting RMMAC.

We now consider the system identification part of the SMMAC and RMMAC architectures. In the RMMAC architecture the approach is stochastic and relies on the residuals generated by the bank of KFs. These must be designed very carefully so as to meet the probability convergence results, as explained in Section 4 and Appendices I and II. In the SMMAC architectures the approach is deterministic and relies upon the prediction errors generated by a bank of multi-estimators, which are a special class of Luenberger observers. In all cited SMMAC references, with the exception of [75], each estimator uses the same $A_E$ matrix, i.e. all observers have identical stable poles. This was done to minimize on-line computation. An integral norm of the prediction errors is used for the system identification function. Research should be done to remove the common $A_E$ matrix assumption and use more general observers. Also, the assumption of scalar prediction errors must be removed to make the SMMAC architectures extendable to the MIMO case. In [75] this was done, but the problem of designing suitable deterministic MIMO Luenberger observers, say by eigenstructure assignment, becomes quite complex.

Our experience has shown that the RMMAC can effectively deal with slowly time-varying parameters. It may be interesting to examine whether one can integrate the RMMAC philosophy and methodology with the evolving theory of Linear Parameter Varying (LPV) systems [36, 37, 38, 39, 40, 41, 71, 72, 73, 74, 76].
A weakness of the MMAE algorithm is that the Baram Proximity Measure (BPM), see Appendix II, cannot be defined or calculated for open-loop unstable plants. Thus, if in an RMMAC application the plant is unstable, one would not be able to use the ideas of Section 4, which are based on the BPM, to design the KFs. One can still design KFs (and RMMACs) for unstable plants, but one would most likely have to use trial-and-error methods in designing the KFs and ensuring convergence of the posterior probabilities. This defines a key and very important area for future research.

With respect to the RMMAC and RMMAC/XI architectures, what happens if one or more of the LNARCs is unstable (but leads to a stable feedback loop) remains an open question. All the LNARCs that we have designed [2] were stable. However, if the open-loop plant is both unstable and nonminimum phase, then some of the LNARCs may be unstable. Since we cannot define the BPM for unstable open-loop plants, as noted above, this remains a very important future research area for RMMAC architectures.

Finally, we need to stress that in all available SMMAC and RMMAC results we cannot as yet prove global robust-stability, because a suitable theory is lacking for handling the nonlinear time-varying (stochastic) closed-loop dynamics arising from these architectures. Needless to say, any advances in stability theory along these directions would be welcomed. At present, the promise of the SMMAC and RMMAC architectures is in their local and asymptotic results related to stability and performance. With respect to the RMMAC, one can design real-time monitoring systems for the regularity of the residuals. If abnormal situations are observed (see the discussion in Sections 2 and 6), which may lead to long-term instability, one should be able to design a fail-safe logic that disconnects the LNARCs and uses the GNARC until the residuals return to a regular behavior and the LNARCs are reconnected.
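The fail-safe logic mentioned above is not specified in the paper. Purely as an illustration, the sketch below monitors a windowed average of the normalized squared residual $r^T S^{-1} r / m$ of one KF, which averages about 1 when that KF matches the data, and switches to the GNARC with hysteresis. The class name `ResidualMonitor`, the window length and the two thresholds are hypothetical choices, not values from this work.

```python
import numpy as np
from collections import deque

class ResidualMonitor:
    """Hypothetical residual-regularity monitor with hysteresis."""

    def __init__(self, m, window=50, trip=3.0, reset=1.5):
        self.m = m                        # number of measurements
        self.buf = deque(maxlen=window)   # sliding window of normalized residuals
        self.trip = trip                  # switch to GNARC above this average
        self.reset = reset                # reconnect LNARCs below this average
        self.use_gnarc = False

    def step(self, r, S):
        """Feed one residual r (shape (m,)) with covariance S; return True if GNARC is active."""
        w = float(r @ np.linalg.solve(S, r))   # r^T S^{-1} r
        self.buf.append(w / self.m)
        avg = sum(self.buf) / len(self.buf)
        if not self.use_gnarc and avg > self.trip:
            self.use_gnarc = True              # residuals look abnormal
        elif self.use_gnarc and avg < self.reset:
            self.use_gnarc = False             # residuals regular again
        return self.use_gnarc
```

The gap between the two thresholds is there only to avoid chattering between the GNARC and the LNARCs; how such thresholds should be chosen in practice is left open here.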
Equally important to the theoretical research investigations would be a fair comparison of the different adaptive methods using a variety of common test-bed examples, with identical performance requirements.
8. CONCLUDING REMARKS

We have witnessed the development and evolution of a variety of novel multiple-model architectures for robust adaptive control during the past decade. In order to compare them in a fair manner we have suggested that all should utilize the available results and software associated with mixed-μ synthesis. If we explicitly state the specifications for required stability-robustness and performance-robustness for the adaptive closed-loop system, then we can evolve the present architectures so that they can be confidently used in real applications.

We view our RMMAC architecture and design methodology as a potential contribution to adaptive feedback engineering system design. As we have remarked, we cannot prove global stability results and we only have numerous simulation studies that demonstrate its excellent performance. Thus, a reader may ask an important question: why should one pay attention to the RMMAC procedure in the absence of solid theoretical guarantees?

The best way to view our perspective is by an analogy. Thousands of real engineering systems are operating using methods that also lack a solid theoretical foundation, but are based upon common-sense integration of rigorous results. We list just three, all of them involving nonlinear systems: (a) gain-scheduling feedback control system designs (with no global stability guarantees), (b) surveillance, tracking and other estimation systems that utilize the extended Kalman filter (which may diverge), and (c) surveillance, tracking and other estimation systems that are based on the sum-of-Gaussians multiple-model estimation method, when the extended Kalman filter performance is not good enough (but they still may not always work well). None of the above practical design methods is backed by a complete body of solid theory. Their extensive engineering utility is based upon exhaustive simulations, engineering insight and the know-how needed to tune certain parameters in the algorithms. However, all utilize reasonable extensions of theoretical results valid for linear systems, e.g. the Kalman filter. The gain-scheduling method also pieces together linear feedback control results in a common-sense manner. Since almost all real applications involve nonlinear systems, the engineering utility of (a), (b) and (c) is extensive. Our perspective is that the highly nonlinear proposed RMMAC method is akin to the above. Maybe we are nearing the end of the search for the adaptive Holy Grail.
We thank Profs. B.D.O. Anderson and J. Hespanha for their frequent suggestions, discussions and criticisms throughout the course of our research. We are also grateful to Prof. G.J. Balas for providing us with the latest non-commercial version of the mixed-μ software [48] and to Prof. Y. Baram for providing us with a copy of his Ph.D. thesis [61]. We are also indebted to the four anonymous reviewers whose comments and criticisms greatly improved the final version of this paper.
where $\{u(n)\}$ is the deterministic (control) input and $\{w(n)\}$ and $\{v(n)\}$ are zero-mean white Gaussian sequences, mutually independent, with
$$ E\{w(n)w(n)^T\} = Q, \qquad E\{v(n)v(n)^T\} = R \tag{A.2} $$
Let us assume that N discrete-time stochastic LTI models are given that include disturbance dynamics (if any). These models form a finite set of families of models
$$ \mathcal{M} = \{ M : M_k = (F_k, J_k, G_k, H_k, Q_k, R_k);\ k = 1, 2, \ldots, N \} \tag{A.3} $$
The k-th model is then described by the state-space dynamics
$$ x_k(n+1) = F_k x_k(n) + J_k u(n) + G_k w(n), \qquad y_k(n) = H_k x_k(n) + v(n) \tag{A.4} $$
where the time index is $n = 0, 1, 2, \ldots$ and the model index is $k = 1, 2, \ldots, N$. It is assumed that the true system that generates the data is one of the models in (A.4). The set of past controls, $u(0), u(1), u(2), \ldots, u(n-1)$, and the set of past noisy measurements, including the one at the present time $n$, $y(1), y(2), \ldots, y(n-1), y(n)$, are also known at time $n$. We want to determine the true steady-state conditional mean of the present state vector, $x(n)$, i.e. as $n \to \infty$,
$$ \hat{x}(n|n) = E\{\, x(n) \mid \underbrace{u(0), u(1), \ldots, u(n-1);\ y(1), \ldots, y(n-1), y(n)}_{Y(n)} \,\} $$
and the true steady-state conditional covariance matrix of $x(n)$, i.e.
$$ \Sigma(n|n) = E\{\, [x(n) - \hat{x}(n|n)][x(n) - \hat{x}(n|n)]^T \mid Y(n) \,\} \tag{A.5} $$
The MMAE filter shown in Fig. 1 is driven by the sequence of past controls and noisy sensor measurements, and it generates both a state-estimate vector and a corresponding error-covariance matrix. Let us first establish some notation for the discrete-time steady-state Kalman filter (KF).
Predict-cycle:
$$ \hat{x}_k(n+1|n) = F_k \hat{x}_k(n|n) + J_k u(n), \qquad \hat{y}_k(n+1|n) = H_k \hat{x}_k(n+1|n) \tag{A.6} $$
Update-cycle:
$$ \hat{x}_k(n+1|n+1) = \hat{x}_k(n+1|n) + K_k r_k(n+1) \tag{A.7} $$
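The residual $r_k(\cdot)$, residual covariance matrix $S_k$, and the constant steady-state KF gain matrix, $K_k$, are respectively defined as follows.
Residual:
$$ r_k(n+1) = y(n+1) - \hat{y}_k(n+1|n) \tag{A.8} $$
Residual covariance:
$$ S_k = H_k \Sigma_k^p H_k^T + R_k \tag{A.9} $$
KF gain:
$$ K_k = \Sigma_k^p H_k^T S_k^{-1} \tag{A.10} $$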
The constant steady-state covariance equations are
Predict-cycle covariance $\Sigma_k^p$:
$$ \Sigma_k^p = F_k \Sigma_k F_k^T + G_k Q_k G_k^T \tag{A.11} $$
Update-cycle covariance $\Sigma_k$:
$$ \Sigma_k = \Sigma_k^p - \Sigma_k^p H_k^T S_k^{-1} H_k \Sigma_k^p \tag{A.12} $$
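The steady-state quantities of one KF in the bank can be obtained numerically by iterating the predict and update covariance recursions to a fixed point. The sketch below assumes the standard forms $S_k = H_k \Sigma_k^p H_k^T + R_k$ and $K_k = \Sigma_k^p H_k^T S_k^{-1}$ for the residual covariance and gain; the function names `steady_state_kf` and `kf_step` and the scalar model are hypothetical, and fixed-point iteration is only one of several ways to solve the underlying Riccati equation.

```python
import numpy as np

def steady_state_kf(F, G, H, Q, R, iters=5000, tol=1e-12):
    """Iterate the covariance recursions until the update covariance stops changing.

    Returns (Sigma_p, Sigma, S, K): predict covariance, update covariance,
    residual covariance and KF gain for one model.
    """
    Sigma = np.eye(F.shape[0])                     # arbitrary positive-definite start
    for _ in range(iters):
        Sigma_p = F @ Sigma @ F.T + G @ Q @ G.T    # predict-cycle covariance
        S = H @ Sigma_p @ H.T + R                  # residual covariance (assumed form)
        K = Sigma_p @ H.T @ np.linalg.inv(S)       # KF gain (assumed form)
        Sigma_next = Sigma_p - K @ H @ Sigma_p     # update-cycle covariance
        converged = np.max(np.abs(Sigma_next - Sigma)) < tol
        Sigma = Sigma_next
        if converged:
            break
    return Sigma_p, Sigma, S, K

def kf_step(x_filt, u, y_next, F, J, H, K):
    """One predict/update cycle with constant gain K; returns (new estimate, residual)."""
    x_pred = F @ x_filt + J @ u                    # predict cycle
    r = y_next - H @ x_pred                        # residual
    return x_pred + K @ r, r                       # update cycle

# Hypothetical scalar model, used only to exercise the functions
F = np.array([[0.9]]); J = np.array([[1.0]]); G = np.array([[1.0]])
H = np.array([[1.0]]); Q = np.array([[1.0]]); R = np.array([[1.0]])
Sigma_p, Sigma, S, K = steady_state_kf(F, G, H, Q, R)
x_next, r = kf_step(np.array([0.0]), np.array([1.0]), np.array([1.5]), F, J, H, K)
```

In an MMAE bank this computation would be repeated off-line for each of the N models, and only `kf_step` would run on-line.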
We stress that both the MMAE state-estimate and state-covariance matrix (A.5) represent the true conditional state estimate and its conditional covariance at steady-state [54, 57]. This is because one can explicitly calculate the conditional probability density function $p(x(n)|Y(n))$, which turns out to be a weighted sum of Gaussian densities, where the weights are found from the posterior probability evaluator (PPE); see Fig. 1. The problem reduces to a combination of a hypothesis-testing problem and a state-estimation problem. The fact that one of the N models is the true one is modeled by a hypothesis random variable that must belong to a discrete set of hypotheses $\{H_1, H_2, \ldots, H_N\}$. It turns out that on-line generation of the posterior conditional probabilities determines which hypothesis is true. Let us suppose that $H$ indicates the (scalar) hypothesis random variable, which can attain only one of N possible values,
$$ H \in \{H_1, H_2, \ldots, H_N\} \tag{A.13} $$
The event $H = H_k$ means that the k-th system is the true one, i.e. the one that is generating the data. The prior probabilities at the initial time $n = 0$, $P_k(0) \equiv \mathrm{Prob}(H = H_k)$, are assumed known (typically $P_k(0) = 1/N$), and
$$ P_k(0) \ge 0, \qquad \sum_{k=1}^{N} P_k(0) = 1 \tag{A.14} $$
The posterior probabilities, $P_k(n) = \mathrm{Prob}(H = H_k | Y(n))$, must also satisfy
$$ P_k(n) \ge 0, \qquad \sum_{k=1}^{N} P_k(n) = 1 $$
and can be calculated on-line by the PPE in the MMAE system. It turns out that the conditional PDF, $p(y(n+1)|u(n), H_k, Y(n))$, is Gaussian with mean
$$ E\{y(n+1)|u(n), H_k, Y(n)\} = H_k \hat{x}_k(n+1|n) $$
and steady-state covariance
$$ \mathrm{cov}[y(n+1); y(n+1)|u(n), H_k, Y(n)] = H_k \Sigma_k^p H_k^T + R_k $$
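Furthermore,
$$ p(y(n+1)|u(n), H_k, Y(n)) = \frac{1}{(2\pi)^{m/2}\sqrt{\det S_k}} \exp\!\left(-\tfrac{1}{2}\, r_k^T(n+1)\, S_k^{-1}\, r_k(n+1)\right) $$
By using Bayes' rule we deduce that
$$ P_k(n+1) = \frac{p(y(n+1)|H_k, u(n), Y(n))}{\sum_{j=1}^{N} p(y(n+1)|H_j, u(n), Y(n))\, P_j(n)}\, P_k(n) $$
Define
$$ \beta_j \equiv \frac{1}{(2\pi)^{m/2}\sqrt{\det S_j}} \tag{A.20} $$
$$ w_j(n+1) \equiv r_j^T(n+1)\, S_j^{-1}\, r_j(n+1) \tag{A.21} $$
where $m$ is the number of measurements. Then, from (A.19)–(A.22) the posterior probabilities can be computed on-line by the PPE using the recursive formula
$$ P_k(n+1) = \frac{\beta_k\, e^{-\frac{1}{2} w_k(n+1)}}{\sum_{j=1}^{N} \beta_j\, e^{-\frac{1}{2} w_j(n+1)}\, P_j(n)}\, P_k(n) $$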
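where $P_k(0)$ are the prior model probabilities as in (A.14). Thus, at steady-state the MMAE generates the state estimate (exact conditional mean) by
$$ \hat{x}(n|n) = \sum_{k=1}^{N} P_k(n)\, \hat{x}_k(n|n) $$
and the exact conditional covariance matrix
$$ \Sigma(n|n) = \sum_{k=1}^{N} P_k(n) \left[ \Sigma_k + \big(\hat{x}_k(n|n) - \hat{x}(n|n)\big)\big(\hat{x}_k(n|n) - \hat{x}(n|n)\big)^T \right] $$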
We stress that the above results hold when the true (unknown) plant is assumed to belong to the model set of (A.4).
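As an illustration of how the PPE recursion and the probability-weighted estimate might be coded, the sketch below performs one Bayes update from the KF residuals and then blends the local estimates. It assumes the Gaussian residual likelihood with weight $1/((2\pi)^{m/2}\sqrt{\det S_k})$ and the usual mixture form of the covariance; the function names `ppe_update` and `mmae_combine` are hypothetical, and evaluating the likelihoods in the log domain is a numerical choice made here, not something prescribed by the paper.

```python
import numpy as np

def ppe_update(P, residuals, S_list):
    """One step of the posterior-probability recursion.

    P         : (N,) current probabilities P_k(n), all strictly positive
    residuals : list of N arrays r_k(n+1), each of shape (m,)
    S_list    : list of N residual covariance matrices S_k, each (m, m)
    """
    P = np.asarray(P, dtype=float)
    m = len(residuals[0])
    log_terms = np.empty(len(P))
    for k, (r, S) in enumerate(zip(residuals, S_list)):
        w = r @ np.linalg.solve(S, r)                        # r^T S^{-1} r
        _, logdet = np.linalg.slogdet(S)
        log_beta = -0.5 * (m * np.log(2.0 * np.pi) + logdet) # log of Gaussian weight
        log_terms[k] = log_beta - 0.5 * w + np.log(P[k])
    log_terms -= log_terms.max()                             # guard against underflow
    weights = np.exp(log_terms)
    return weights / weights.sum()                           # normalized posteriors

def mmae_combine(P, x_hats, Sigmas):
    """Probability-weighted state estimate and mixture covariance."""
    x_hat = sum(p * x for p, x in zip(P, x_hats))
    Sigma = sum(p * (S + np.outer(x - x_hat, x - x_hat))
                for p, x, S in zip(P, x_hats, Sigmas))
    return x_hat, Sigma

# Hypothetical two-model example with scalar measurements and states
P0 = np.full(2, 0.5)
res = [np.array([0.2]), np.array([2.0])]
S_list = [np.array([[1.0]]), np.array([[1.0]])]
P1 = ppe_update(P0, res, S_list)
x_hat, Sigma = mmae_combine(P1, [np.array([0.0]), np.array([2.0])],
                            [np.eye(1), np.eye(1)])
```

Because the model with the smaller normalized residual gains probability at every step, repeated calls to `ppe_update` reproduce the qualitative convergence behaviour described above when one model is clearly closest to the data.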
N 1 X 1 uM (t)uM (t) < I N 1
