WO1993000618A1 - Stable adaptive neural network controller - Google Patents

Stable adaptive neural network controller Download PDF

Info

Publication number
WO1993000618A1
WO1993000618A1 PCT/US1992/005266 US9205266W WO9300618A1 WO 1993000618 A1 WO1993000618 A1 WO 1993000618A1 US 9205266 W US9205266 W US 9205266W WO 9300618 A1 WO9300618 A1 WO 9300618A1
Authority
WO
WIPO (PCT)
Prior art keywords
outputs
plant
control
operating range
control system
Prior art date
Application number
PCT/US1992/005266
Other languages
French (fr)
Original Assignee
Massachusetts Institute Of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Massachusetts Institute Of Technology filed Critical Massachusetts Institute Of Technology
Publication of WO1993000618A1 publication Critical patent/WO1993000618A1/en

Links

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/047Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators the criterion being a time optimal performance criterion
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only

Definitions

  • This invention relates to adaptive control systems and, in particular, to an adaptive control systems which use neural networks to estimate parameters of the controlled plant.
  • controller performs poorly or is unstable because nonlinearities in the system are not properly
  • controllers which can compensate for the nonlinear forces involved in many conditions to produce accurate operation over a large operating range. Consequently, productivity of such a system can be substantially increased by the use of such a nonlinear controller.
  • a robot arm may have to manipulate loads of various sizes, weights and mass distributions.
  • the robot motion may become very inaccurate or unstable.
  • the controlling parameters may be well understood at the beginning of control operation, However, unpredictable parameter variations may occur as the control operation proceeds. Unless the controller is continually changed during actual operation to
  • an adaptive control system can be used to control a robot arm to manipulate various loads by changing the motion parameters of the arm according to a sensed load condition.
  • automatic navigation systems such as ship steering systems in which the characteristics of the ship depend on many uncertain and unpredictable parameters such as water, depth, ship loading and wind and wave conditions.
  • aircraft control is another field in which adaptive control systems have been profitably used in which the characteristics of the airplane depend on its altitude, wind speed, aircraft speed, and ground configuration.
  • Adaptive systems have also been used in process control in which batches of materials must be processed.
  • the adaptive systems can compensate for changes in the parameters which characterize the system from batch to batch.
  • an adaptive control system is similar to a conventional control system in that the output of the system under control (called the "plant") is fed back to the controller which then generates a control output for the plant.
  • an adaptive controller differs from an ordinary controller in that the
  • parameters of the controller itself are variable and an "adaptation mechanism" is provided to adjust the controller parameters based on signals which are measured during on-line operation of the plant.
  • MRAC Model Reference Adaptive Control
  • the plant which contains unknown or unpredictable parameters
  • a reference model which responds to a reference input to produce desired outputs which are theoretically the ideal response of the plant to the reference input
  • a parameterized controller which generates the control inputs to the plant and which is characterized by adjustable parameters
  • an adaption mechanism which changes the adjustable parameters.
  • the reference model output is subtracted from the plant output to produce a "tracking error".
  • the tracking error is provided to the adaption mechanism which then adjusts the controller parameters to minimize the tracking error.
  • One type of mechanism estimates the plant parameters and then computes the controller parameters from the estimated plant parameters and is called an "indirect” system.
  • the other type of adaption mechanism computes the controller parameters directly from the tracking error and plant output and is called a "direct" system.
  • the second type of adaptive control structure commonly used with linear systems is called a self-tuning controller (STC).
  • STC self-tuning controller
  • This structure consists of the plant, a parameterized controller which generates the control inputs to the plant and which is characterized by
  • estimator circuits conventional types of estimator circuits: an "indirect” circuit and a “direct” circuit.
  • the neural network is first "trained” by providing it with a selected set of pre-recorded data about plant inputs and corresponding outputs. After the network is trained, it is used to generate a model of the plant. In theory, the trained network should provide an accurate estimate of the plant output to an input not previously provided to it and the network could then be used to design the required controllers.
  • the problem with this latter arrangement was that the prior art methods that were used to "train” the network were back-propagation methods which were
  • an adaptive control system uses a neural network to provide adaptive control when the plant is operating within a normal operating range, but shifts to other types of control as the plant operating conditions move outside of the normal operating range.
  • the controller uses a structure which allows the neural network parameters to be determined from minimal information about plant structure and the neural network is trained on-line during normal plant operation. The resulting system can be proven to be stable over all possible conditions. Further, with the inventive techniques, the tracking accuracy can be controlled by appropriate network design.
  • Figure 1 is a block schematic diagram of a prior art MRAC adaptive control system.
  • FIG. 2 is a block schematic diagram of an MRAC adaptive control system which has been modified for use with the present invention.
  • Figure 3 is a block schematic diagram of an
  • Figure 4 is an expanded block schematic diagram of the tracking error processing circuit 224 as shown in Figure 2.
  • FIG. 5 is an expanded block schematic diagram of the modulator circuit 324 as shown in Figure 3.
  • Figure 6 is an expanded block schematic diagram of the proportional/derivative controller 306 as shown in Figure 3.
  • Figure 7 is an expanded block schematic diagram of the sliding controller 334 as shown in Figure 3.
  • Figure 8 is an expanded block schematic diagram of the adaptation mechanism 208 as shown in Figure 2.
  • Figure 9 is an expanded block schematic diagram of the neural network controller 310 as shown in Figure 3.
  • Figure 10 is a phase space portrait of desired position and velocity of a first simulation.
  • Figure 11 is a graph of tracking error and total control signal provided to the plant versus time for the first five seconds of the first simulation.
  • Figure 12 shows the tracking error using the inventive adaptive controller versus time during a fifty second period of the first simulation.
  • Figure 13 is a graph showing both the position tracking error and control signals during the first five seconds for a second simulation.
  • Figure 14 is a graph showing the position tracking error during the first fifty seconds for a second
  • Figure 9 is an expanded block schematic diagram of the neural network controller 310 as shown in Figure 3.
  • Figure 10A is a diagram of desired trajectories of variables x 1 and x 2 versus time for a first
  • Figure 10B is a phase space portrait of desired position and velocity of a first simulation.
  • Figure 11A is a graph of tracking errors of the adaptive controller and the proportional-differential controller versus time for the first five seconds of the first simulation.
  • Figure 11B is a graph of the total control signal provided to the plant by the adaptive controller and the proportional-differential controller versus time for the first five seconds of the first simulation.
  • Figure 12 shows the tracking error using the
  • inventive adaptive controller versus time during a fifty second period of the first simulation.
  • Figure 13A is a graph showing the position tracking error during the first five seconds for a second
  • Figure 13B is a graph sho.wing the control signals during the first five seconds for a second simulation.
  • Figure 14 is a graph showing the position tracking error during the first fifty seconds for a second
  • a plant output is represented by x(t)
  • x (n) (t) is the nth derivative of x(t) with respect to t
  • u(t) is a control input which drives that output
  • b is an arbitrary gain.
  • the function, f(x(t), , ..., x (n-1) (t)), in equation (1) will be an unknown linear or nonlinear function.
  • the objective of the control system is to drive the plant state vector which can be expressed as , ..., x (n-1) ] T to follow a prespecified desired
  • equation (1) Specifically, it can be shown that, if the function f can be smoothly truncated outside a selected compact set in such a way that the spatial Fourier transform of this truncation is absolutely integrable, then sampling theory indicates that the function itself can be uniformly approximated to a chosen degree of accuracy on that set with a linear superposition of a finite number of interpolating functions (that is, only a finite number of samples are required).
  • ⁇ f a gaussian neural network approximation of f(x) over a chosen compact set, A, consisting of a truncated series expansion containing a finite number of terms resulting from a finite number of samples, and ⁇ f is a
  • any interpolating function can be used for the approximation, provided that its spectrum is real-valued, strictly positive, and integrable on R n , the plant state space.
  • Gaussian function is chosen to be the interpolating function. It is known that any continuous function, not necessarily infinitely smooth, can be uniformly
  • Gaussian radial interpolation functions are particularly attractive because they are bounded, strictly positive and absolutely integrable on R n , and further, they are their own Fourier transforms (modulo a scale factor).
  • the radial gaussian can hence be used as the reconstructing function in the expansion f A (x).
  • is the standard deviation of the gaussian function, g ⁇ .
  • the number of terms in the expansion is given by I o which is determined by the size of the region over which f must be approximated, the sample spacing ⁇ , and the approximate error which can be tolerated. More particularly, the approximation error results from three sources: (1) an error which arises because f is not band-limited as originally assumed; (2) an error which arises because the gaussian reconstruction functions are not exactly band-limited and (3) an error which arises because the infinite series of gaussian functions which would exactly reconstruct f must be truncated to allow reconstruction with a finite number of elements. As shown in the attached paper, the error from each of these sources can be shown to be less than ⁇ f /3 so that the overall error is less than s f . More
  • the magnitudes of errors (1) and (2) above are related to the choice of ⁇ and ⁇ and the magnitude of error (3) is related to the number of samples I o .
  • the gaussian is a separable nonlinearity, so that instead of computing a single non-linear transform of the entire state vector, individual subspaces of R n may be transformed separately, then multiplied together to form the final approximation.
  • the radial gaussian function conveniently allows the expansion (3) to map directly to a class of three-layer neural networks employing a single layer of "neurons" (or nodes) with radial gaussian input-output characteristics.
  • n-dimensional point, x at which the approximation f A (x) is required, forms the input to the network.
  • the number of nodes in the network is determined as previously discussed, and the weights connecting the network inputs to each gaussian node encode the points ⁇ I in the sampling grid.
  • Each node computes a quadratic "activation energy" from its inputs and feed-in weights, given by:
  • the network output is formed by
  • sampling density and variance should be chosen such that the Gaussian's spectrum is strictly positive on the support of the bandlimited version of f, yet approaches zero quickly enough that the high frequency components contribute negligibly in the inverse
  • any additional information known about f can be used in sharpening the above bounds or reducing the size of the required representation. If information about the shape and orientation of the spectrum is known, the distance between the sampling points can be adjusted accordingly. For example, if f is known to have a spectral support which is much smaller in one direction in R n the sample spacing for the corresponding
  • expansion equation (3) provides a network structure which can approximate (sufficiently smooth) unknown plant nonlinearities with a prespecified accuracy, uniformly on a particular compact subset, the adaptive component of the controller can be taken as the output of this network.
  • the small approximation errors thus introduced could then be viewed as a uniformly bounded disturbance driving the plant dynamics, and conventional deadzone adaptive control theory could then be used to derive stable adaptive control laws.
  • this problem can be overcome by including in the control law an additional component which takes over from the adaptive component as its approximation ability begins to degrade, and forces the plant state back into the normal operating range.
  • this component takes the form of a sliding controller with a boundary layer, and its design is straightforward given upper bounds on the magnitudes of the functions being
  • the complete control law thus has a dual character, acting as either a sliding or as an adaptive controller depending upon the instantaneous location of the plant state vector.
  • discontinuously switching between adaptive and sliding components creates the possibility that the controller might chatter along the boundary between the two types of operation.
  • pure sliding operation is restricted to the exterior of a slightly larger set A, containing the normal operating range, A d
  • pure adaptive operation is restricted to the interior of the set A d .
  • the two modes are effectively blended using a continuous modulation function, which controls the degree to which each component contributes to the complete control law.
  • the resulting controller smoothly transitions between adaptive and nonadaptive control strategies and is capable of globally stabilizing the systems under consideration.
  • u(t) u pd (t) + (1-m(t))u ad (t) + m(t)u sl (t) (8)
  • u pd (t) is a negative feedback term associated with a proportional/derivative controller and consists of a weighted combination of both the measured tracking error states and a tracking metric s(t), to be defined below.
  • u sl (t) represents the sliding
  • a d the normal operating range
  • a d ⁇ x
  • p,w ⁇ 1 ⁇ and A ⁇ x
  • is a positive constant representing the width of the transition region, x o fixes the absolute
  • the modulation function can be taken as,
  • nonlinearity f(x) which is realized as the output of a radial gaussian neural network whose inputs are the measured values of the plant states.
  • the variance and mesh size of this network are designed considering assumed upper bounds on the spectral properties of the plant nonlinearity f(x).
  • the input weights, ⁇ I which encode the sampling mesh, are fixed in this architecture, while the output weights, ⁇ I (t), are to be adjusted to attempt to match their values, c I when the system has been trained.
  • the set of terms is chosen so that the bound on the uniform approximation error, ⁇ f produced by the
  • the selection of the desired function approximation specifies the subset I o of the uniform, square sampling mesh which must be used to reconstruct the function f, and, thus, the number of nodes required and the precise values of the
  • k sl (x) is the gain of the controller
  • is the width of a boundary layer surrounding the sliding surface in accordance with conventional sliding control theory.
  • the deadzones required in the algorithm for updating the output weights of the gaussian network reflect the fact that no useful information can be gained about the quality of the current approximation when the tracking errors are less than a threshold determined by the bounds on the approximation errors, e.g., ⁇ f .
  • a threshold determined by the bounds on the approximation errors
  • deadzone adaptation requires discontinuously starting and stopping the parametric adjustment mechanism according to the magnitude of the error signal. These discontinuities can be eliminated using the metric s(t) and by
  • variable s has several properties useful in the design of adaptive laws : if
  • control gain is bounded away from zero and of known sign for all x. Without loss of generality, this sign can be taken as positive, so that
  • equation (27) can be
  • Equation (30) suggests a control law of the form:
  • k sl (x,x d ) (M o (x) + ⁇ h ) + (M 2 (x)+ ⁇ b )
  • the input weights encoding the sampling mesh are fixed in this architecture, and the output weights and are adjusted to match their trained
  • the adaptive controller can compensate for dynamic effects attributable only to the states it
  • the parameter ⁇ which defines the size of the transition region A-A d between adaptive and sliding operation, in a sense also represents the width of a boundary layer, serving to smooth the transition between the two controller modes. Since, with a zero width transition region, there is a possibility the controller might chatter along the boundary of the set A d , the same comments which apply to ⁇ above also apply to ⁇ : the transition region should be "thick" enough that there is no possibility that the mode switching might excite unmodeled dynamics.
  • FIG. 10A and 10B show desired trajectories of variables x 1 and x 2 versus time and a phase space
  • the omitted nodes closest to the set A contribute terms only of order 10 -6 to the reconstruction of the values assumed by f on this set.
  • a conservative estimate taking into account the number of closest omitted nodes and an assumed uniform upper bound on f of 20, suggests that ⁇ f ⁇ 10 -3 on A if this value of p is used.
  • Figure 11A and compares the position tracking error, x(t), and Figure 11B compares the total control signal issued to the plant during the first 5 seconds for both of these simulations.
  • Use of the adaptive controller clearly results in superior tracking performance; worst case tracking error is reduced by a factor of ten compared with that obtained using PD and sliding components alone during this time period, and this ratio continually improves as the adaptive controller learns more about the plant dynamics.
  • Figure 12 shows the tracking error using the adaptive controller during a fifty second period which does indeed demonstrate the predicted asymptotic convergence to the deadzone.
  • the constant function 2 is not Fourier transformable in the classical sense
  • Figures 13A and 13B show both the position tracking error and control signals, respectively, input to the plant during the first 5 seconds for both these simulations.
  • Figure 14 shows the tracking error for the adaptive controller over the first 50 seconds of operation
  • Figure 1 shows a schematic block diagram of a conventional model reference adaptive control system.
  • the box 106 represents the plant of the system to be controlled.
  • the dynamic equations governing the plant are subject to a known linear parameterization, although the exact
  • the plant generates an output vector x on bus 114.
  • the reference model 104 is used to specify the ideal response of the adaptive control system to an external reference command are provided on lead 100 through the system. Essentially, the reference model generates an ideal plant response to the reference input r. The adaptive mechanism represented by box 108 then seeks to adjust the parameters of controller 102 so that the plant reproduces this ideal response.
  • the reference model receives the input from lead 100 via lead 103 and generates an output vector x d on bus 110. The output of the plant on bus 114 and the output of the reference model on lead 110 are provided to the subtraction circuit 112 which generates the tracking error x-x d vector on bus 116.
  • Controller 102 is constructed with a number of adjustable parameters which change how the control output u generated on bus 120 is related to the input R on lead 100 and feedback from bus 118.
  • the controller is adjusted by an adaptive mechanism 108 which is
  • controller 102 is generally provided with the output of the plant 106 by means of bus 118.
  • the conventional MRAC controller is modified slightly to both simplify it and restructure the network to operate with the theory presented above.
  • plant 106 in Figure 1 is labeled plant 206 in Figure 2.
  • controller 102 in Figure 1 is labeled
  • the parameters of the output x d can be calculated ahead of time either in explicit reference model or by calculations provided to the summing network.
  • tracking error vector x-x d on bus 216 is provided to an error processing circuit 224 which generates a set of error "metric" variables represented by e.
  • error variables which include the metrics s, s ⁇ and a r , are provided to the adaptive
  • Figure 3 shows a block schematic diagram of
  • the controller receives the error signals e on lead 300 from the error processing circuit 224 shown in Figure 2.
  • the output vector x of the plant is received, via bus 302.
  • the error signals e are provided via leads 304 to a
  • proportional/differential controller 306. This is a conventional controller which provides damping to the system.
  • the output of the proportional controller 308 is provided to a summing circuit 318.
  • the error signals on leads 300 are also provided by leads 342 to a sliding controller 334. Sliding
  • controller 334 also receives the plant output on bus 302 via bus 340.
  • Sliding controller 334 is a conventional circuit of which the exact construction is described below. The general principles of construction of sliding controllers are set forth in the book entitled "Applied Nonlinear Control” by Jean-Jacques Slotine and Weiping Li, Prentice Hall, Englewood Cliffs, New Jersey, 07632, 1991, chapter 7.
  • the output 336 of sliding controller 334 is provided to a multiplier circuit 330 which is in turn controlled by means of signals on lead 328 by modulator 324.
  • Modulator 324 also provides output 326 which controls multipliers 314 and 332. These in turn control the outputs and adaptation mechanisms of a neural network 310.
  • Neural network 310 receives the error signals e on bus 300 by means of bus 346 and the output of the plant on bus 302 by means of bus 348.
  • the construction and operation of the neural network will be described hereinafter in detail below.
  • the neural network generates an adaptive portion of the control signal which is effective in the normal operating range of the plant.
  • the output of the network on bus 312 is provided to multiplier 314.
  • Modulator 324 controls multipliers 330, 314, and 332 as previously stated. In order to switch the control of the network smoothly between the neural network 310 and the output of the sliding controller 334. Depending on whether the control system is operating in its normal operating range or outside of its normal operating range the modulator 324 is arranged so that control is
  • the parameter adaptation signals on 344 provided by the adaptive mechanism 208 are also controlled by multiplier 332 which provides a modulated output via bus 333 to control the learning rate of neural network 310.
  • This smooth modulation technique prevents chattering in the network when the plant is operating near the edge of its normal operating range.
  • Modulated outputs of neural network 310 on bus 316 and of sliding controller 334 on bus 322 are also provided to summing network 318.
  • Network 318 sums the output of PD controller 306, neural network 310 and sliding controller 334 to provide controller output on bus 320.
  • Figure 4 of the drawing is an expanded block diagram of the circuitry which generates the error metrics s ⁇ , s and a r .
  • the tracking error is an expanded block diagram of the circuitry which generates the error metrics s ⁇ , s and a r .
  • the tracking error signals are provided to a set of multipliers illustrated schematically by multiplier 404.
  • multiplier 404 By convention in the drawings which follow if two inputs entering a multiply junction are underlined, they represent a collection of signals (a vector) and the multiplier should be
  • multiplier 404 forms the "dot" product of the tracking vector with a constant vector ⁇ .
  • These constants are entered by the user by means of an entry point shown schematically as triangle 408.
  • the triangles indicate inputs supplied by the user and are selected in accordance with the aforementioned theory. The resulting products are summed to generate the "dot" product.
  • the product generated by multipliers 404 are provided, via bus 412, to a summer 420 which forms differences between the feedforward signal (the nth derivative of the desired state x d (n) ) provided at entry point 422 and the product developed by multipliers 404.
  • the output on lead 426 is the error metric a r .
  • the error metrics s ⁇ and s are developed by the upper portion of the circuitry.
  • the tracking error vector is provided to a set of
  • multipliers, 402 which multiply the tracking errors by constants, ⁇ , provided from the user entry point 406 in order to generate the error metric s on lead 410.
  • the error metric s on lead 410 is provided by a lead 414 to a multiplier 416 where it is multiplied by a constant 1/ ⁇ provided via user entry point 418.
  • the product on lead 424 is provided to a saturation circuit 428.
  • ⁇ 1, and sat (y) sign (y) otherwise
  • the saturated value appears on lead 430 and is applied to a multiplier 434 where it is multiplied by the reciprocal of the value supplied in 418 ( ⁇ ) provided via user entry point 432.
  • the resulting product on lead 436 is applied to a summer 438 which forms a difference between the s error metric provided on lead 442 and the product of multiplier 434.
  • the result on lead 440 is the error metric s ⁇ .
  • the error metrics generated by the error processing circuitry shown in Figure 4 are used in the remainder of the circuitry to generate various quantities as will hereinafter be described.
  • FIG. 5 is a detailed block schematic diagram of modulator circuit 324 shown in Figure 3.
  • Modulator operates to generate the modulation signals m and 1-m which are used to smoothly switch from the adaptive controller to the sliding controller as previously described.
  • the plant output vector x is provided, via bus 500, to a plurality of multiplier circuits of which only circuits 502 and 504 are shown.
  • each component of the plant output (such as components x l on lead 501 and x n on lead 503) is applied to a
  • Multiplier 502 is provided with a coefficient 1/ ⁇ l applied via user entry point 506.
  • multiplier 504 receives a coefficient 1/ ⁇ n applied via user entry point 508.
  • the weighted inputs are forwarded to absolute value circuits 510-512 by means of a plurality of leads including leads 514 and 516.
  • the absolute value of the weighted inputs generated by circuits 510-512 is then provided to a maximum circuit 518 which selects the signal with the maximum magnitude and applies the selected signal (r) to lead 520.
  • the maximum signal r is applied to a summer 522 which subtracts 1 from the result.
  • the resulting difference is applied, via lead 524, to a multiplier 526 where it is multiplied by the user-supplied constant 1/ ⁇
  • the resulting product on lead 528 is provided to a saturation circuit 530.
  • the function of the saturation circuit is as previously discussed.
  • the saturated output is provided to another maximum circuit 532 which selects the maximum of the saturated output of circuit 530 and the signal value "zero" provided on lead 534 to generate the modulation output m on lead 536.
  • the modulation output is provided, via lead 538, to the negative input of an adder 540 and subtracted from 1 to generate the signal 1-m on lead 542.
  • the modulation signal m is applied, via lead 328, to multiplier 330 to modulate the output of the sliding controller 334.
  • the modulation value 1-m is provided, via lead 326, to multipliers 314 and 332 in order to control the output of neural network 310 and the adaptation control signals to neural network 310.
  • Figure 6 illustrates the construction of an
  • the illustrative proportional/differential controller which generates the output control signal u pd from the input error metrics s and s ⁇ .
  • the error metric s is provided on lead 600 to multiplier 604 where it is multiplied by constant -K d (provided via user input 606) in order to generate a product on lead 612. The resulting product is applied to one input of an adder 616.
  • the error metric s is provided on lead 600 to multiplier 604 where it is multiplied by constant -K d (provided via user input 606) in order to generate a product on lead 612.
  • the resulting product is applied to one input of an adder 616.
  • lead 602 is forwarded to multiplier 608 where it is multiplied by constant -K provided at user input 610.
  • the resulting product on lead 614 is also applied to adder 616 which adds together the two signals to generate the output control signal u pd on lead 618.
  • Figure 7 shows the construction of the sliding controller 334 in Figure 3. It uses the error metric s provided on lead 700 as an input to multiplier 702.
  • Multiplier 702 multiplies s by the constant 1/ ⁇
  • the resulting product is applied to a saturation circuit 708, which operates as previously described.
  • the saturated output on lead 710 is applied to multiplier 714 where it is multiplied by user supplied constant -K sl provided via input
  • the resulting product on lead 716 is the sliding controller output u sl .
  • Figure 8 shows the construction of the adaptive mechanism 208 shown in Figure 2. It utilizes the error metrics s ⁇ and a r to generate the control signals
  • lead 800 is applied to a multiplier 806 where it is multiplied by the user supplied constant -K al provided via input point 808.
  • the resulting product on lead 812 is the adaptation control signal c L .
  • the error metric s ⁇ is also provided, via lead
  • multiplier 810 to one input of multiplier 810.
  • the other input of multiplier 810 is provided with the error metric a on lead 802.
  • the resulting product on lead 814 is provided to multiplier 816 where it is multiplied by the user supplied constant K a2 provided via input point 818.
  • the resulting product on lead 820 is the control signal d L .
  • FIG 9 shows the construction of the neural network 310 shown in Figure 3.
  • the neural network receives the plant output vector x on bus 900.
  • the bus is split into a plurality of parallel busses shown schematically as leads 902, 904, 906 and 908.
  • the number of busses is equal to the number of nodes in the network as discussed above.
  • Each separate bus carries the vector x and is processed in a similar manner and therefore only two busses are shown in detail.
  • each plant output vector is processed by appropriately weighting the variables in a plurality of processing circuits of which circuits 910 and 912 are shown. The processing performed by
  • processing circuits 910-912 is described by equation (3) which specifies the formation of the radial gaussian function. During this formation an input weight ⁇ is subtracted from each plant output variable and the absolute value of the resulting difference is squared and divided by variance ⁇ 2 of the gaussian.
  • nodes 910-912 are provided to a plurality of nodes of which nodes 914-916 are shown. These nodes form the radial gaussian of their inputs and the processed signals are provided to processing circuits 918-920 which multiply resulting outputs via adjustable output weights.
  • the adjustable output weights are controlled by the modulated adaptation signals (1-m)c L and (1-m)d L
  • the adaptation mechanism controls the magnitude of the output constants c I in equation (4).
  • the actual values which are used to adjust the output weights are obtained by multiplying the output of gaussian circuits 914-916 by the adaptation signal (1-m)c L . The resulting
  • the weighted outputs of the adjustable output circuits 918-920 are provided, via leads 922-924, to an output summing circuit 926 which receives all of the outputs from the adjustable output circuits and provides the sum on output 928 which is the adaptive control output u ad .

Abstract

An adaptive control system uses a neural network to provide adaptive control when the plant is operating within a normal operating range, but shifts to other types of control as the plant operating conditions move outside of the normal operating range. The controller uses a structure which allows the neural network parameters to be determined from minimal information about plant structure and the neural network is trained on-line during normal plant operation. The resulting system can be proven to be stable over all possible conditions. Further, with the inventive techniques, the tracking accuracy can be controlled by appropriate network design.

Description

STABLE ADAPTIVE NEURAL NETWORK CONTROLLER
FIELD OF THE INVENTION
This invention relates to adaptive control systems and, in particular, to an adaptive control systems which use neural networks to estimate parameters of the controlled plant.
BACKGROUND OF THE INVENTION
Automatic control techniques are highly useful in a variety of applications including industrial
manufacturing applications, numerically controlled machining systems and automated aircraft and ship navigation systems. A large amount of early control theory and design work focussed on linear systems. In such control systems, a key assumption must be made that the system operates over a small, closely controlled range in which system output is linearly related to the control parameters. In many cases a linear controller operates satisfactorily and extensive work has been done concerning the theory and design of such controllers.
However, in many other cases, especially when the range of required operation is large, the linear
controller performs poorly or is unstable because nonlinearities in the system are not properly
compensated. As an example, when a linear controller is used to control the operation of a robot arm in a manufacturing operation, the linear theoretic model used to design the control system for the robot, neglects the inevitably nonlinear forces associated with the
mechanical motions of the arm. Consequently, the controller's accuracy quickly degrades as the movement speed of the robot arm increases. Consequently, in order to achieve a prespecified accuracy in even simple robot tasks such as machine tool replacement or object movement, the use of a linear controller requires that the speed of motion and, accordingly, the overall productivity, be restricted to a low range.
It is possible to design simple nonlinear
controllers which can compensate for the nonlinear forces involved in many conditions to produce accurate operation over a large operating range. Consequently, productivity of such a system can be substantially increased by the use of such a nonlinear controller.
However, in some control systems, such as the
aforementioned robot arm, some of the parameters which drive the system are not known or cannot be easily predicted at the beginning of the control operation.
For example, a robot arm may have to manipulate loads of various sizes, weights and mass distributions.
Consequently, the initial and dynamic forces associated with the varying loads are not well known before the robot system operates. Even if a nonlinear controller is used, if the controller has constant gains
incorporated into the control mechanism, the robot motion may become very inaccurate or unstable. In other systems, for example, power systems and auto-navigation systems, the controlling parameters may be well understood at the beginning of control operation, However, unpredictable parameter variations may occur as the control operation proceeds. Unless the controller is continually changed during actual operation to
accommodate these parameter variations, a controller design which is initially accurate and stable may not be able to control the system properly to compensate for all changes in the parameters. Consequently, a class of control systems known as adaptive control systems has been designed for these latter situations.
An adaptive control system that continuously
"redesigns" the controller during operation can
compensate for unexpected or unpredictable changes in the system parameters. For example, an adaptive control system can be used to control a robot arm to manipulate various loads by changing the motion parameters of the arm according to a sensed load condition.
Adaptive control systems are also useful for
automatic navigation systems such as ship steering systems in which the characteristics of the ship depend on many uncertain and unpredictable parameters such as water, depth, ship loading and wind and wave conditions. Another field in which adaptive control systems have been profitably used is aircraft control in which the characteristics of the airplane depend on its altitude, wind speed, aircraft speed, and ground configuration.
Adaptive systems have also been used in process control in which batches of materials must be processed. The adaptive systems can compensate for changes in the parameters which characterize the system from batch to batch.
In general, an adaptive control system is similar to a conventional control system in that the output of the system under control (called the "plant") is fed back to the controller which then generates a control output for the plant. However, an adaptive controller differs from an ordinary controller in that the
parameters of the controller itself are variable and an "adaptation mechanism" is provided to adjust the controller parameters based on signals which are measured during on-line operation of the plant.
For linear systems, there are two conventional design structures for an adaptive controller. The first of these structures is called the "Model Reference Adaptive Control" (MRAC) structure and is composed of four main parts: the plant (which contains unknown or unpredictable parameters), a reference model which responds to a reference input to produce desired outputs which are theoretically the ideal response of the plant to the reference input, a parameterized controller which generates the control inputs to the plant and which is characterized by adjustable parameters, and an adaption mechanism which changes the adjustable parameters.
During system operation, the reference model output is subtracted from the plant output to produce a "tracking error". The tracking error is provided to the adaption mechanism which then adjusts the controller parameters to minimize the tracking error. There are two conventional types of adaption
mechanisms. One type of mechanism estimates the plant parameters and then computes the controller parameters from the estimated plant parameters and is called an "indirect" system. The other type of adaption mechanism computes the controller parameters directly from the tracking error and plant output and is called a "direct" system.
The second type of adaptive control structure commonly used with linear systems is called a self-tuning controller (STC). This structure consists of the plant, a parameterized controller which generates the control inputs to the plant and which is characterized by
adjustable parameters, and an estimator circuit which computes the controller parameters from estimated plant parameters. Both the control input to the plant and the output from the plant are provided to the estimator circuit which generates a set of controller parameters that are appropriate to the plant input and output data. As with the MRAC system, there are two
conventional types of estimator circuits: an "indirect" circuit and a "direct" circuit.
It even proves possible to extend slightly the MRAC and STC systems for certain classes of non-linear
systems. All of these designs have depended crucially upon the existence of a known linear parameterization of the uncertainty in the plant dynamics. This implies that the exact form of the dynamic equations which govern plant operation are known, but some of the parameters of the equations may not be known. If the plant dynamics are known, it is possible, in many cases, with
conventional linear adaptive design theory to design a control system which will be both stable and accurate.
The above methods cannot be used if the exact parameterization is unknown or if the uncertainty cannot be parameterized linearly. Some prior art attempts were made to design adaptive controllers in the absence of linear-parameterizations based on "intuitive" design techniques. A well-known example of this type of analysis involves the use of the so-called "gradient" techniques which attempted to control a plant by
minimizing a pre-defined "cost" function. These
techniques were not based on an underlying mathematical theory which could be used to prove that the systems would be stable under all conditions. Consequently, these attempts appeared to work during simulations or in the laboratory, but failed in actual control situations in which other variables which were not modeled
contributed significantly to the overall system
performance.
Recently, some prior art suggestions have been made to use a neural network to overcome the need for a linear-parameterization model. In many schemes, the neural network is first "trained" by providing it with a selected set of pre-recorded data about plant inputs and corresponding outputs. After the network is trained, it is used to generate a model of the plant. In theory, the trained network should provide an accurate estimate of the plant output to an input not previously provided to it and the network could then be used to design the required controllers. The problem with this latter arrangement was that the prior art methods that were used to "train" the network were back-propagation methods which were
essentially similar to the previously-used "gradient" methods. Consequently, these latter techniques suffered from the same problems as the prior gradient
techniques. In particular, since there was no
underlying proof of stability, systems which performed well in the laboratory might become unstable in
real-world operations. Further, these design techniques never considered the effect of operation outside a carefully-defined operating range, so that the resulting system would be unable to handle disturbances which cause it to stray outside the defined range.
Other attempts were made to train the neural network by connecting the network to the actual plant inputs and outputs as the plant operated in actual use. Various schemes were then devised to use the trained network "on-line" in a feed-forward control system. Such systems were still prone to inaccuracies or instabilities due to unexpected situations and, due to the lack of an
underlying theory, there was no known way to correctly specify the network parameters such as the number of layers necessary or the number of nodes per layer which would be necessary to guarantee stability.
Accordingly, it is an object of the present
invention to provide an adaptive control system which can be proven to be stable even though the linear
parameterizations of the plant dynamics are not known. It is another object of the present invention to provide an adaptive control system which can be designed using only basic information regarding the "smoothness" of the plant dynamics, which smoothness can be
ascertained using conventional Fourier techniques.
It is still another object of the present invention to provide an adaptive control system which can specify the number of layers in a neural network and the number of nodes necessary to guarantee stability based on the amount of tracking error that can be tolerated.
It is yet another object of the present invention to provide an adaptive control system which uses
adaptive control techniques within a normal plant operating range, but which is stable even though plant operating characteristics occur outside of the normal operating range.
It is a further object of the present invention to provide an adaptive control system which smoothly switches from an adaptive control strategy to an
alternate control strategy when the adaptive control strategy fails.
SUMMARY OF THE INVENTION
The foregoing objects are achieved and the foregoing problems are solved in one illustrative embodiment of the invention in which an adaptive control system uses a neural network to provide adaptive control when the plant is operating within a normal operating range, but shifts to other types of control as the plant operating conditions move outside of the normal operating range. The controller uses a structure which allows the neural network parameters to be determined from minimal information about plant structure and the neural network is trained on-line during normal plant operation. The resulting system can be proven to be stable over all possible conditions. Further, with the inventive techniques, the tracking accuracy can be controlled by appropriate network design.
BRIEF DESCRIPTION OF THE DRAWING
Figure 1 is a block schematic diagram of a prior art MRAC adaptive control system.
Figure 2 is a block schematic diagram of an MRAC adaptive control system which has been modified for use with the present invention.
Figure 3 is a block schematic diagram of an
illustrative controller design constructed in accordance with the principles of the present invention.
Figure 4 is an expanded block schematic diagram of the tracking error processing circuit 224 as shown in Figure 2.
Figure 5 is an expanded block schematic diagram of the modulator circuit 324 as shown in Figure 3.
Figure 6 is an expanded block schematic diagram of the proportional/derivative controller 306 as shown in Figure 3.
Figure 7 is an expanded block schematic diagram of the sliding controller 334 as shown in Figure 3.
Figure 8 is an expanded block schematic diagram of the adaptation mechanism 208 as shown in Figure 2. Figure 9 is an expanded block schematic diagram of the neural network controller 310 as shown in Figure 3.
Figure 10 is a phase space portrait of desired position and velocity of a first simulation.
Figure 11 is a graph of tracking error and total control signal provided to the plant versus time for the first five seconds of the first simulation.
Figure 12 shows the tracking error using the inventive adaptive controller versus time during a fifty second period of the first simulation.
Figure 13 is a graph showing both the position tracking error and control signals during the first five seconds for a second simulation.
Figure 14 is a graph showing the position tracking error during the first fifty seconds for a second
simulation using the inventive adaptive controller.
DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT
In the ensuing discussion, the mathematical theory underlying the present invention will first be discussed. Subsequently, an illustrative control system design using the underlying theory will be discussed in detail. The following discussion is a siαmmary of a rigorous
mathematical theory presented in detail in the paper attached as Appendix A hereto. As such, the detailed analysis presented in the paper should be understood as the proper mathematical expression of the underlying theory and some inaccuracies may be introduced in this summary. Figure 9 is an expanded block schematic diagram of the neural network controller 310 as shown in Figure 3.
Figure 10A is a diagram of desired trajectories of variables x1 and x2 versus time for a first
simulation in which x2=d/dtx1.
Figure 10B is a phase space portrait of desired position and velocity of a first simulation.
Figure 11A is a graph of tracking errors of the adaptive controller and the proportional-differential controller versus time for the first five seconds of the first simulation.
Figure 11B is a graph of the total control signal provided to the plant by the adaptive controller and the proportional-differential controller versus time for the first five seconds of the first simulation.
Figure 12 shows the tracking error using the
inventive adaptive controller versus time during a fifty second period of the first simulation.
Figure 13A is a graph showing the position tracking error during the first five seconds for a second
simulation.
Figure 13B is a graph sho.wing the control signals during the first five seconds for a second simulation.
Figure 14 is a graph showing the position tracking error during the first fifty seconds for a second
simulation using the inventive adaptive controller.
DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT
In the ensuing discussion, the mathematical theory underlying the present invention will first be discussed. Subsequently, an illustrative control system design using the underlying theory will be discussed in detail. The following discussion is a summary of a rigorous
mathematical theory presented in detail in the paper attached as Appendix A hereto. As such, the detailed analysis presented in the paper should be understood as the proper mathematical expression of the underlying theory and some inaccuracies may be introduced in this summary.
Control System Theory
In the discussion following it is assumed that the plant has equations of motion which can be represented by the general form:
Figure imgf000015_0001
where a plant output is represented by x(t), x(n)(t) is the nth derivative of x(t) with respect to t, u(t) is a control input which drives that output and b is an arbitrary gain. In general, the function, f(x(t),
Figure imgf000015_0007
, ..., x(n-1)(t)), in equation (1) will be an unknown linear or nonlinear function.
The objective of the control system is to drive the plant state vector which can be expressed as
Figure imgf000015_0006
, ..., x(n-1)]T to follow a prespecified desired
trajectory · A
Figure imgf000015_0002
"tracking error" vector 3c(t) is defined as
Figure imgf000015_0005
= x(t) - xd(t). With these definitions, the problem becomes the specification of a control law which insures that
Figure imgf000015_0003
goes to zero as time goes to infinity.
In order to design such a control law it is
necessary to make some assumptions about the structure of the function
Figure imgf000015_0004
, in
equation (1). Specifically, it can be shown that, if the function f can be smoothly truncated outside a selected compact set in such a way that the spatial Fourier transform of this truncation is absolutely integrable, then sampling theory indicates that the function itself can be uniformly approximated to a chosen degree of accuracy on that set with a linear superposition of a finite number of interpolating functions (that is, only a finite number of samples are required).
Mathematically, if the function f can be smoothly truncated outside a selected compact set in such a way that the spatial Fourier transform of this truncation is absolutely integrable, then the condition:
|f(x) - fA(x)| ≤ εf (2) can always be satisfied, where fA(x) is a radial
gaussian neural network approximation of f(x) over a chosen compact set, A, consisting of a truncated series expansion containing a finite number of terms resulting from a finite number of samples, and εf is a
prespecified accuracy which can be made vanishingly small.
Any interpolating function can be used for the approximation, provided that its spectrum is real-valued, strictly positive, and integrable on Rn, the plant state space. In one preferred embodiment a radial
Gaussian function is chosen to be the interpolating function. It is known that any continuous function, not necessarily infinitely smooth, can be uniformly
approximated by linear combination of Gaussian functions. Gaussian radial interpolation functions are particularly attractive because they are bounded, strictly positive and absolutely integrable on Rn, and further, they are their own Fourier transforms (modulo a scale factor). Thus, the radial gaussian can hence be used as the reconstructing function in the expansion fA(x).
The representation fA takes the form: fA(x) = Σ cI gσ(x,ξI) (3) where the points ξI encode a uniform lattice: ξI = i1Δ + i2Δ + ... + inΔ (4) where I = {i1, ..., in} is an n-tuple of integers
and cI = c(ξI) are constants associated with each
sample point and σ is the standard deviation of the gaussian function, gσ.
The number of terms in the expansion is given by Io which is determined by the size of the region over which f must be approximated, the sample spacing Δ, and the approximate error which can be tolerated. More particularly, the approximation error results from three sources: (1) an error which arises because f is not band-limited as originally assumed; (2) an error which arises because the gaussian reconstruction functions are not exactly band-limited and (3) an error which arises because the infinite series of gaussian functions which would exactly reconstruct f must be truncated to allow reconstruction with a finite number of elements. As shown in the attached paper, the error from each of these sources can be shown to be less than εf/3 so that the overall error is less than sf. More
specifically, the magnitudes of errors (1) and (2) above are related to the choice of Δ and σ and the magnitude of error (3) is related to the number of samples Io.
Unlike other reconstructing functions which might also be used in expansion equation (3), for example, n-fold products of sinc or Bessel functions, whose structure changes depending upon the dimension of the underlying space, the radial gaussian and its spectrum have exactly the same representation regardless of dimension. Hence, this design can apply to dynamic systems of any order. The standard form the radial gaussian function is given by:
Figure imgf000018_0001
Further, the gaussian is a separable nonlinearity, so that instead of computing a single non-linear transform of the entire state vector, individual subspaces of Rn may be transformed separately, then multiplied together to form the final approximation.
The radial gaussian function conveniently allows the expansion (3) to map directly to a class of three-layer neural networks employing a single layer of "neurons" (or nodes) with radial gaussian input-output characteristics.
More particularly, a particular n-dimensional point, x, at which the approximation fA(x) is required, forms the input to the network. The number of nodes in the network is determined as previously discussed, and the weights connecting the network inputs to each gaussian node encode the points ξI in the sampling grid. Each node computes a quadratic "activation energy" from its inputs and feed-in weights, given by:
Figure imgf000019_0001
and outputs the gaussian of this activation,
exp(-r2 I2). The network output is formed by
weighting each nodal output by the corresponding cI, and summing together all the weighted outputs. The result represents the approximation fA(x).
The sampling density and variance should be chosen such that the Gaussian's spectrum is strictly positive on the support of the bandlimited version of f, yet approaches zero quickly enough that the high frequency components contribute negligibly in the inverse
transform. The following combination has been found to work well: σ2 = ß2π and Δ2 = 1/8σ2 (7)
Therefore, if the set A constitutes the region of "normal operation" of the control system, a controller constructed of a finite number of elements can be used to control the system within this range.
Any additional information known about f can be used in sharpening the above bounds or reducing the size of the required representation. If information about the shape and orientation of the spectrum is known, the distance between the sampling points can be adjusted accordingly. For example, if f is known to have a spectral support which is much smaller in one direction in Rn the sample spacing for the corresponding
direction in the sampling grid on Rn can be taken proportionally larger, reducing the number of nodes required.
For an arbitrary function, it may be difficult to establish precisely the tightest possible bounds εf on the approximation errors. However, if the assumed value of ß accurately reflects the frequency content of fF, it can be expected that the bound εf will be
dominated by the error introduced in truncating the infinite expansion series which exactly represents fF. By estimating the magnitude of the contributions of the omitted nodes to points in the set A, a conservative upper bound on this truncation error can be established. In accordance with the inventive adaptive control algorithms discussed below, it is only necessary that the crude bound thus developed exceed the true uniform bound on the approximation errors.
However, the approximation error outside of the set A, or outside of the normal operating range, is of great importance and has been completely overlooked by prior art designs. This function reflects the fact that the accuracy of the estimate implemented by the
aforementioned approximations may degrade rapidly outside the normal operating range. It is easy to see that the worst case bound on this degradation, α(x) = |f(x)|+|fA(x)I, can be achieved, especially in
the case of gaussian interpolating functions. Since the approximation implemented by a gaussian network decays smoothly to zero as x moves radially away from the set A, a function, f, which changes sign quickly just outside of A would result in the above equality for α(x). The output of the gaussian network approximation, fA, in this situation would be incorrect both in magnitude and in sign. This rapid degradation of the neural
approximation outside of the normal operating range for which the network was designed must be taken into
consideration.
If the desired plant state trajectories are
contained in a normal operating range which is a compact subset, Ad, of the plant state space, in principle, the tracking problem associated with the plant dynamics described in equation (1) could be solved by an adaptive component in the control law capable of reconstructing the unknown function f everywhere on Ad. Since
expansion equation (3) provides a network structure which can approximate (sufficiently smooth) unknown plant nonlinearities with a prespecified accuracy, uniformly on a particular compact subset, the adaptive component of the controller can be taken as the output of this network. The small approximation errors thus introduced could then be viewed as a uniformly bounded disturbance driving the plant dynamics, and conventional deadzone adaptive control theory could then be used to derive stable adaptive control laws.
However, it is possible with such a scheme that the tracking errors would become sufficiently large during the initial stages of adaptation that the plant state could leave the normal operating range, Ad, Further, impulsive, unmodeled disturbances might also force the plant state outside this range. The ability of a neural adaptive element to approximate f degrades rapidly as the state moves outside of the region for which the network was designed. As discussed previously, if the plant moves outside of the normal operating range, it is possible for the approximation implemented by such an adaptive system to be incorrect not only in relative magnitude, but also in sign, indicating that the
contribution of the adaptive system in this case would force the plant dynamics in precisely the wrong
direction. Simple deadzone adaptation will not suffice to ensure stability of the system under these conditions.
However, in accordance with another aspect of the invention, this problem can be overcome by including in the control law an additional component which takes over from the adaptive component as its approximation ability begins to degrade, and forces the plant state back into the normal operating range. Preferably, this component takes the form of a sliding controller with a boundary layer, and its design is straightforward given upper bounds on the magnitudes of the functions being
approximated. The action of this sliding term will be shown to be sufficient to reduce the tracking error during the times when the plant state lies outside of the normal operating range, while the boundary layer prevents control chattering in the regions at the edges of the normal operating range. To prevent unnecessary contention between the sliding component and the (typically erroneous) output of the adaptive subsystem when the plant state lies outside of the normal operating range, the entire adaptive portion of the controller - both the adaptive output and the adaptive adjustment mechanism - must be turned off when the sliding controller is active. Similarly, it is beneficial to turn off the sliding controller when the approximation implemented by the adaptive controller is accurate, since the sliding controller relies on crude upper bounds of the plant nonlinearities to reduce tracking errors and is hence likely to require large amounts of control authority when active.
In accordance with another aspect of the invention, the complete control law thus has a dual character, acting as either a sliding or as an adaptive controller depending upon the instantaneous location of the plant state vector. However, discontinuously switching between adaptive and sliding components creates the possibility that the controller might chatter along the boundary between the two types of operation. To prevent this, pure sliding operation is restricted to the exterior of a slightly larger set A, containing the normal operating range, Ad, while pure adaptive operation is restricted to the interior of the set Ad. In accordance with another aspect of the invention, in between, in the region A - Ad, the two modes are effectively blended using a continuous modulation function, which controls the degree to which each component contributes to the complete control law. The resulting controller smoothly transitions between adaptive and nonadaptive control strategies and is capable of globally stabilizing the systems under consideration.
The structure of the inventive control law has the following general form: u(t) = upd(t) + (1-m(t))uad(t) + m(t)usl(t) (8) where upd(t) is a negative feedback term associated with a proportional/derivative controller and consists of a weighted combination of both the measured tracking error states and a tracking metric s(t), to be defined below. The term usl(t), represents the sliding
component of the control law, and, similarly, the
adaptive component is represented by uad(t). The
function m(t) = m(x(t)) is a continuous, state dependent modulation term which allows the controller to smoothly transition between sliding and adaptive modes of
operation, chosen so that m(x) = 0 in the normal
operating range, Ad; m(x) = 1 outside of the expanded range, A; and 0<m(x)<1 on A - Ad.
To aid in the construction of this modulation, the normal operating range, Ad, should be chosen so as to correspond to a unit ball with respect to an appropriate weighted norm function. This certainly poses no problem since, as long as Ad contains all the trajectories the plant will be required to follow, and A is chosen to completely contain Ad, the actual dimensions of these sets are arbitrary. Thus, Ad and A can be defined by
Ad = {x| ||x-xo||p,w≤1} and A = {x|||x-xo||p,w≤l+Ψ}. (9) Here Ψ is a positive constant representing the width of the transition region, xo fixes the absolute
location of the sets in the state space of the plant, and ||x||p , w is a weighted p-norm of the form
Figure imgf000025_0001
or, in the limiting case p = ∞,
Figure imgf000025_0002
for a set of strictly positive weights
. In R2, for example, with p = 2
Figure imgf000025_0004
the sets Ad and A are ellipses, and with p = ∞ these sets are rectangles. With these definitions, the modulation function can be taken as,
Figure imgf000025_0003
where r(t) = ||x(t)-xo||p,w, and sat is the
saturation function (sat(y) = y if |y|<1, and sat
(y) = sign (y) otherwise). When r(t)≤1, meaning that x is in Ad, the output of the saturation function is negative, hence the maximum (which defines m(t)) is zero, as desired. When r(t)≥1+Ψ, corresponding
to an x outside the expanded range A, the saturation function is unity, as desired. In between, for x in the range A - Ad, it is easy to check that 0<m(x)<1. To illustrate the structure of the adaptive controller, consider for example a unity control gain system represented in equation (1) with b = 1. The adaptive control component, uad(t), for this system would implement an approximation, , to the plant
Figure imgf000026_0001
nonlinearity f(x), which is realized as the output of a radial gaussian neural network whose inputs are the measured values of the plant states. The variance and mesh size of this network are designed considering assumed upper bounds on the spectral properties of the plant nonlinearity f(x). Hence the adaptive portion of the controller can be represented as:
Figure imgf000026_0002
The input weights, ξI, which encode the sampling mesh, are fixed in this architecture, while the output weights, ĉI(t), are to be adjusted to attempt to match their values, cI when the system has been trained.
The set of terms is chosen so that the bound on the uniform approximation error, εf produced by the
trained network over the normal operating range is small enough to allow the desired tracking accuracy, which is related to the function approximation capability of the network through equation (37) below.
As discussed above, the selection of the desired function approximation specifies the subset Io of the uniform, square sampling mesh which must be used to reconstruct the function f, and, thus, the number of nodes required and the precise values of the
corresponding ξI are also determined. It is helpful to process the tracking error
variables x - xd, in order to simplify the resulting controller designs. A useful tracking error metric for both the sliding and adaptive control subsystems is defined by
Figure imgf000027_0001
T
which can be rewritten as s(t) =
Figure imgf000027_0002
with λ =
[Xn-1,(n-1)λn-2, ... , 1]. The equation s(t) = 0
defines a time-varying hyperplane in Rn on which the tracking error vector decays exponentially to zero, so that perfect tracking can be asymptotically obtained by maintaining this condition. Further, if the magnitude of s can be shown to be bounded by a constant Θ, the actual tracking errors can be shown to be asymptotically bounded by:
Figure imgf000027_0003
Using the metric s(t) and the saturation function defined above, the sliding control component can be represented as: usl(t) = -ksl(x(t)) sat(s(t)/Φ) (16) where ksl(x) is the gain of the controller, and Φ is the width of a boundary layer surrounding the sliding surface in accordance with conventional sliding control theory. The exact values chosen for each of these parameters are determined by the previously-mentioned assumptions concerning f and the accuracy of the
reconstruction desired from the network. Examples are given below.
The deadzones required in the algorithm for updating the output weights of the gaussian network reflect the fact that no useful information can be gained about the quality of the current approximation when the tracking errors are less than a threshold determined by the bounds on the approximation errors, e.g., εf. However,
deadzone adaptation requires discontinuously starting and stopping the parametric adjustment mechanism according to the magnitude of the error signal. These discontinuities can be eliminated using the metric s(t) and by
introducing the continuous function sΔ, defined as sΔ = s(t) - Φ sat(s(t)/Φ) (17)
The variable s. has several properties useful in the design of adaptive laws : if |s|<Φ then sΔ
= sΔ = 0, while if |s| > Φ, then sΔ = s,
and [sΔ| = |s| - Φ.
Applying these principles to a system in the form of equation (1) with b=1 and assuming that a prior upper bound Mo(x) is known on the magnitude of f for points outside of the normal operating range, Ad, the error metric s(t) will have a time derivative given by: s(t) = ar(t) - f(x) + u(t) (18) where ar(t) = λT ʋx(t) - xd (n) with λT ʋ
= [ 0 , λn-1 , (n-1 )λn-2 , . . . λ ] , and xd (n )
is the nth derivative of the desired trajectory. Having chosen a set A containing Ad as outlined above, let fA(x) be a radial gaussian approximation to f(x)
designed such that |f - fA| ≤ εf uniformly on
the set A. In terms of this approximation, equation
(18) can be rewritten as: s(t) = ar(t) - fA(x) + d(x) + u(t) (19) where d(x) = fA(x) - f(x) satisfies |d| ≤ εf
in the normal operating range. This expression, and the considerations of the previous section, suggests use of the control law: u(t) = -kds(t) -ar(t) + (1-m(t))fA(x(t)) + m(t)usl(t) (20) where kd is a constant feedback gain. The sliding
component of this control law is given by equation (16) with state dependent gains ksl(x(t)) = Mo(x(t)) +
εf, and the adaptive component fA(x(t)) is realized as the gaussian network described by equation (13) above, whose tuned output implements the approximation fA.
The modulation function which blends the two controller modes is specified by equation (12).
If the parameters in fA(x(t)) are updated as:
Figure imgf000029_0001
where ka is a positive constant determining the
adaption rate, and if Φ is chosen so that Φ≥
εf/kd in both the sliding controller and the
calculation in equation (17) of sΔ, then all states in the adaptive system will remain bounded, and moreover the tracking errors will asymptotically converge to a neighborhood of zero given by:
Figure imgf000030_0001
The previous design procedure can be extended to plants with non-unity control gains which have dynamic equations of the form: x(n)(t) + f(x(t)) = b(xT(t)) u(t) (23) where , and it is assumed
Figure imgf000030_0003
that the control gain is bounded away from zero and of known sign for all x. Without loss of generality, this sign can be taken as positive, so that
0 < M1(xT) ≤ b-1(xT) ≤ M2(xT) (24)
It is further assumed that a prior bound is known for the norm of the gradient of this inverse, so that:
Figure imgf000030_0002
As above, it is also assumed that an upper bound is known for f; to facilitate the development below, it is most convenient to combine this knowledge with the bound on b-1 to produce
|b-1(x)f(x)| ≤ M0(x) (26)
As before, define s(t) = , so that
Figure imgf000031_0001
b-1s = b-1ar - b_1f + u (27) with ar is defined as previously stated. Define the unknown nonlinear function h = b-1f, and let hA and bA -1 be a radial gaussian network approximation to
the functions h and b-1 respectively with the
properties, valid on the chosen set A:
|h(x) -hA(x)|≤∊h and |b-1(xτ) - bA -1(x)|≤∊b, (28) where εh and εb are as small as desired. There
is no conceptual problem approximating b-1(xT) by a function of the entire state vector instead of just xT; this fact will also allow a considerable simplification in the design of the adaptive subsystem, as will be discussed below.
With these definitions, equation (27) can be
rewritten as : b-1s = bA -1ar - hA + d + u (29) where the disturbance term d is here given by: d(x,xd) = (b-1(xT) - bA -1(x))ar - (h(x) - hA(x)) (30) and thus satisfies the inequality:
|d(x,xd)| ≤ εh + εb | ar | (31) when x is in A. Equation (30) suggests a control law of the form:
Figure imgf000032_0001
for some positive constant Kd. The sliding term,
usl(t), is again given by (16) with state-dependent gain
ksl(x,xd) = (Mo(x) +∊h) + (M2(x)+∊b)|aτ|. (33)
Note particularly that the adaptive component of this control law estimates b-1 itself, as opposed to
estimating b and inverting the estimate.
The adaptive components of this control law,
Figure imgf000032_0003
and , can each be realized as the output of a
Figure imgf000032_0002
network of radial gaussian nodes similar to the one described for the unity gain system. Each network is designed considering assumed upper bounds on the spectral supports of h and b-1 respectively, as well as the set of desired trajectories the system is required to track, and approximation accuracy desired. It is possible to simplify this design somewhat by employing a single network, with the n state variables as inputs, and the two adaptive approximations as the outputs. The number of gaussian nodes and the corresponding input weights for this network are chosen to correspond to a sampling grid with the smaller of the two mesh sizes which would be used in the separate networks. This network then has two sets of output weights for each node,
Figure imgf000033_0002
and
, creating two outputs which are the
Figure imgf000033_0001
approximations and
Figure imgf000033_0003
Figure imgf000033_0004
respectively:
Figure imgf000033_0005
As before, the input weights encoding the sampling mesh are fixed in this architecture, and the output weights and are adjusted to match their trained
Figure imgf000033_0006
Figure imgf000033_0007
values, cI and dI, in the expansions hA and
bA -1 respectively.
Choose the adaptation law for these parameters to be:
Figure imgf000033_0008
where ka1 and ka2 are strictly positive constants
determining the adaptation rate. Define
∊τ = ∊h + ∊b(2||λ||τa + |Id (n )|max), ( 36 ) with ra being the radius of the largest n-ball
containing the set A, and |xd (n)|max an upper
bound on the magnitude of the nth derivative of the reference trajectory. If Φ is chosen such that Φ ≥ εr/kd in both the sliding controller and the
calculation in equation (17) of sΔ, then all states in the adaptive system will remain bounded, and moreover the tracking errors will asymptotically converge to a neighborhood of zero given by:
Figure imgf000034_0001
Note that this inequality, together with the definitions of Φ and εr, implies that a good approximation to b-1 ls much more critical to reduction of the
asymptotic tracking errors than is approximation to h; this consideration should be reflected in the network design.
Since the desired trajectories are contained
completely within the normal operating range, Ad, and since the tracking errors converge to a neighborhood of zero, eventually all the plant trajectories will converge to a set which is either within a small neighborhood of the set Ad, or else is completely contained inside it. In particular, the sliding subsystem will asymptotically be used less and less, and, since the negative feedback terms decay to zero as the tracking error does, the control input will eventually be dominated by the
outputs of the neural network, regardless of the initial conditions on the plant or network parameters. While the use of the proposed neural controllers allows the design of adaptive algorithms which can stably control plants with limited knowledge of the functions driving their dynamics, one still cannot ignore robust stability issues. The adaptive controller can compensate for dynamic effects attributable only to the states it
"sees", i.e. those which are measured and which form the inputs to the gaussian networks. There is no way for the adaptive architecture described to compensate for dynamic effects due to (temporal ) high frequency modes neglected in the above models. It is thus necessary to ensure that the control signal issued by the network is free from any (temporal) high frequency components which might excite these modes. Inspection of the adaptation laws suggests that the magnitude of the adaptation signals will be much larger for the than for the
Figure imgf000035_0001
. To prevent excessively rapid adaptation from
adding high frequency components to the control signal, it is hence desirable to choose the adaptation gain ka2 to be much smaller than that of ka1.
Choosing the same parameter to control both the adaptation deadzone and the sliding controller boundary layer allows one to guarantee convergence of the tracking errors to the same neighborhood of zero regardless of location of the state vector. However, if extreme tracking accuracy is required, and the network is
designed to produce the required reconstruction accuracy, the resulting choice of Φ might produce a sliding
controller with a boundary layer so thin it risks
exciting the high frequency dynamic modes when this component is active. This suggests that the value of Φ used, and hence the resulting tracking accuracy, must arise as a tradeoff between the frequency range of the unmodeled dynamics and trajectory following requirements.
The parameter ψ, which defines the size of the transition region A-Ad between adaptive and sliding operation, in a sense also represents the width of a boundary layer, serving to smooth the transition between the two controller modes. Since, with a zero width transition region, there is a possibility the controller might chatter along the boundary of the set Ad, the same comments which apply to Φ above also apply to ψ: the transition region should be "thick" enough that there is no possibility that the mode switching might excite unmodeled dynamics.
Simulation Examples
In the simulation examples which follow, a
two-dimensional plant is required to track a set of desired trajectories which are specified as the output of a third order, critically-damped system with a bandwidth of 10 rad/sec driven by a unity amplitude, 0.4 Hz. square wave. Figures 10A and 10B show desired trajectories of variables x1 and x2 versus time and a phase space
portrait of the desired position and velocity,
respectively. Based upon this portrait, the set A, was chosen to be the unit ball with respect to the weighted infinity norm
(38) centered at x0 = [0.5,0]T. This corresponds to the rectangular subset of the state space, Ad = [-0.5, 1.5] x [-3.0,3.0]. A thin transition region between adaptive and sliding operation was chosen for these simulations, corresponding to a value of ψ = 0.025. in equation (12), so that
(39)
For the first simulation, a two dimensional plant with unity control gain, i.e. equation (1) with n = 2 and b = 1. The plant nonlinearity f is assumed to have a spectral support of ß = 2, and equation (7) suggests gaussian nodes with variance σ2 = 4π whose input
weights encode a sampling grid with sample spacing Δ = (32π)-1/2 ~0.1. A truncation value of p=0.5 is
used in the truncation of the function expansion and, for simplicity, the resulting sample set, Γ, is taken as the smallest rectangle which contains all the points in this truncated sampling grid, Γ = [-1.0, 2.0] ×
[-3.5, 3.5]. This set actually encloses a few mesh points of distance further than 0.5 to points in A, but the value 0.5 will be used in developing conservative bounds on the truncation error. This definition of Γ results in a network with 2201 gaussian nodes whose output weights are adjusted according to equation (21) with ka = 500.
Using the truncation radius ρ = 0.5, the omitted nodes closest to the set A contribute terms only of order 10-6 to the reconstruction of the values assumed by f on this set. A conservative estimate, taking into account the number of closest omitted nodes and an assumed uniform upper bound on f of 20, suggests that εf ≤ 10-3 on A if this value of p is used.
Using this bound and a value kd = 5 for the feedback gain, results in deadzone Φ = 2 × 10-4 for use in
the adaption law and sliding controller.
The complete control law is given by equations (13), (16) and (20). Since the nonlinearity f is assumed to be uniformly upper bounded over the range outside of the expanded operating range by 20, the gain ksl = 20 +
10-3 is used m the sliding controller. The bandwidth of the sliding surface is taken as λ = 10 in the
definition of s(t).
The true plant is modeled by equation (1) with n=2 and:
(40)
Which is indeed spatially bandlimited, its spectrum resembling a "roof" of length 2 and width 1. The plant and controller dynamics were simulated in C on DECStation
3100 RISC workstations, using a forth order, fixed stepsize Runge-Kutta algorithm with Δt = 10-4sec.
All initial conditions were taken as 0, reflecting the fact that no prior knowledge is assumed about the plant save for its order and upper bounds on both the absolute magnitude of the function f and on the spectral radius of its spatial Fourier transform. Two simulations were conducted, one in which the complete control law (20), including the adaptive
gaussian network, was employed, and one in which only the PD and sliding components were used. Figure 11A and compares the position tracking error, x(t), and Figure 11B compares the total control signal issued to the plant during the first 5 seconds for both of these simulations. Use of the adaptive controller clearly results in superior tracking performance; worst case tracking error is reduced by a factor of ten compared with that obtained using PD and sliding components alone during this time period, and this ratio continually improves as the adaptive controller learns more about the plant dynamics. Figure 12 shows the tracking error using the adaptive controller during a fifty second period which does indeed demonstrate the predicted asymptotic convergence to the deadzone.
An additional simulation was performed again using a two-dimensional plant with the same f nonlinearity used in the unity gain simulation, but with b in
equation (1) given by: b(x) = 2 + sin(3π(x - 0.5)) (41)
Note that in this case, the constant function 2 is not Fourier transformable in the classical sense
(although it is in the distributional sense), but it can easily be approximated by a function which is 2 on A and decays smoothly to 0 on Ac. Furthermore, b-1 is
uniformly lower bounded by M1 = 1/3 and upper bounded by M2 = 1, and b-1 can be uniformly bounded using
M3 = 3π, so that the control gains satisfy the
constraints previously discussed.
The desired trajectories described above with respect to the unity gain simulation are used, and the same network designed in that section is used to generate both hA and b-1 by adding a new set of output
weights, dI, which are adjusted as specified in
equation (35) with ka2 = 1.0. The maximum desired acceleration is |x(n)|max = 25 for these
trajectories, so that |ar| ≤ 100 when x is in A.
Under the assumption that truncation errors dominate the uniform bounds on the approximation errors, εb is
considered to be the same order as the εf computed in the previous section, and hence εr ≤ 0.1 which
suggests a choice of Φ = 0.02 as the adaptation
deadzone and sliding controller boundary layer.
The complete control law is given by equations
(16), (32), and (34). Using the above bounds on b and f, and again the value Kd = 5, the sliding controller gains are taken as Ksl = 20 + |ar|.
The same simulation parameters and initial
conditions as above are chosen. Once again, two
simulations were conducted, one with and the other without the contribution of the adaptive network.
Figures 13A and 13B show both the position tracking error and control signals, respectively, input to the plant during the first 5 seconds for both these simulations. Once again, it is evidence that the use of the adaptive controller provides superior performance,; initial tracking errors are about three times better than those of the PD and sliding controller alone, with this ratio improving dramatically after the first few seconds.
Notice the larger control inputs in places for the simulation without the adaptive component; these occur as the plant state drifts outside the set A prompting full use of the sliding component of the controller.
Figure 14 shows the tracking error for the adaptive controller over the first 50 seconds of operation;
again, it is evident that the errors have converged into the deadzone.
Figure 1 shows a schematic block diagram of a conventional model reference adaptive control system. In Figure 1 the box 106 represents the plant of the system to be controlled. In general, in order to use this model with prior art theory, it is assumed that the dynamic equations governing the plant are subject to a known linear parameterization, although the exact
operating parameters of the plant are generally unknown or unpredictable. However, in accordance with the invention, it has been shown that it is not necessary to know an explicit linear parameterization of the dynamics and that these can be approximated by means of an
adaptive neural network. The plant generates an output vector x on bus 114.
The reference model 104 is used to specify the ideal response of the adaptive control system to an external reference command are provided on lead 100 through the system. Essentially, the reference model generates an ideal plant response to the reference input r. The adaptive mechanism represented by box 108 then seeks to adjust the parameters of controller 102 so that the plant reproduces this ideal response. The reference model receives the input from lead 100 via lead 103 and generates an output vector xd on bus 110. The output of the plant on bus 114 and the output of the reference model on lead 110 are provided to the subtraction circuit 112 which generates the tracking error x-xd vector on bus 116.
Controller 102 is constructed with a number of adjustable parameters which change how the control output u generated on bus 120 is related to the input R on lead 100 and feedback from bus 118. The controller is adjusted by an adaptive mechanism 108 which is
responsive to the tracking error on lead 116 to generate adjustment signals cL and dL on leads 122. These
adjustment signals change the controller parameters so that the tracking error is reduced to zero over a period of time. In addition, the controller 102 is generally provided with the output of the plant 106 by means of bus 118.
In accordance with the invention, the conventional MRAC controller is modified slightly to both simplify it and restructure the network to operate with the theory presented above. A comparison of Figures 1 and 2
indicates that the corresponding elements are
corresponding numeral designations. For example, plant 106 in Figure 1 is labeled plant 206 in Figure 2.
Similarly, controller 102 in Figure 1 is labeled
controller 202 in Figure 2. Figure 2 has been simplified by eliminating the reference model 104 in Figure 1. Instead, the desired plant output vector, xd, on bus 210 is provided
directly to the summing network 212. The parameters of the output xd can be calculated ahead of time either in explicit reference model or by calculations provided to the summing network.
In addition, tracking error vector x-xd on bus 216 is provided to an error processing circuit 224 which generates a set of error "metric" variables represented by e. These error variables, which include the metrics s, sΔ and ar, are provided to the adaptive
mechanism 208 and also directly by means of lead 220 to the controller 202.
Figure 3 shows a block schematic diagram of
controller 202 in Figure 2. The controller receives the error signals e on lead 300 from the error processing circuit 224 shown in Figure 2. In addition, the output vector x of the plant is received, via bus 302. The error signals e are provided via leads 304 to a
proportional/differential controller 306. This is a conventional controller which provides damping to the system. The output of the proportional controller 308 is provided to a summing circuit 318.
The error signals on leads 300 are also provided by leads 342 to a sliding controller 334. Sliding
controller 334 also receives the plant output on bus 302 via bus 340. Sliding controller 334 is a conventional circuit of which the exact construction is described below. The general principles of construction of sliding controllers are set forth in the book entitled "Applied Nonlinear Control" by Jean-Jacques Slotine and Weiping Li, Prentice Hall, Englewood Cliffs, New Jersey, 07632, 1991, chapter 7. The output 336 of sliding controller 334 is provided to a multiplier circuit 330 which is in turn controlled by means of signals on lead 328 by modulator 324. Modulator 324 also provides output 326 which controls multipliers 314 and 332. These in turn control the outputs and adaptation mechanisms of a neural network 310.
Neural network 310 receives the error signals e on bus 300 by means of bus 346 and the output of the plant on bus 302 by means of bus 348. The construction and operation of the neural network will be described hereinafter in detail below. However, the neural network generates an adaptive portion of the control signal which is effective in the normal operating range of the plant. The output of the network on bus 312 is provided to multiplier 314.
Modulator 324 controls multipliers 330, 314, and 332 as previously stated. In order to switch the control of the network smoothly between the neural network 310 and the output of the sliding controller 334. Depending on whether the control system is operating in its normal operating range or outside of its normal operating range the modulator 324 is arranged so that control is
smoothly switched from the neural network 310 to sliding controller 334 as the control system approaches the edges of the normal operating range. In addition, the parameter adaptation signals on 344 provided by the adaptive mechanism 208 (Figure 2) are also controlled by multiplier 332 which provides a modulated output via bus 333 to control the learning rate of neural network 310.
As the system moves outside the normal operating range, the adaptation control parameters and the output of the neural network are reduced smoothly to zero.
This smooth modulation technique prevents chattering in the network when the plant is operating near the edge of its normal operating range.
Modulated outputs of neural network 310 on bus 316 and of sliding controller 334 on bus 322 are also provided to summing network 318. Network 318 sums the output of PD controller 306, neural network 310 and sliding controller 334 to provide controller output on bus 320.
Figure 4 of the drawing is an expanded block diagram of the circuitry which generates the error metrics sΔ, s and ar. In particular, the tracking error
vector, x-xd, is provided on input bus 400. In order to compute the error metric ar, the tracking error signals are provided to a set of multipliers illustrated schematically by multiplier 404. By convention in the drawings which follow if two inputs entering a multiply junction are underlined, they represent a collection of signals (a vector) and the multiplier should be
understood to implement the vector dot product: xTλ = ∑xiλi For example, multiplier 404 forms the "dot" product of the tracking vector with a constant vector λ. These constants are entered by the user by means of an entry point shown schematically as triangle 408. In Figure 4 and the following figures, the triangles indicate inputs supplied by the user and are selected in accordance with the aforementioned theory. The resulting products are summed to generate the "dot" product.
The product generated by multipliers 404 are provided, via bus 412, to a summer 420 which forms differences between the feedforward signal (the nth derivative of the desired state xd (n)) provided at entry point 422 and the product developed by multipliers 404. The output on lead 426 is the error metric ar.
The error metrics sΔ and s are developed by the upper portion of the circuitry. In particular, the tracking error vector is provided to a set of
multipliers, 402, which multiply the tracking errors by constants, λ, provided from the user entry point 406 in order to generate the error metric s on lead 410.
The error metric s on lead 410 is provided by a lead 414 to a multiplier 416 where it is multiplied by a constant 1/Φ provided via user entry point 418. The product on lead 424 is provided to a saturation circuit 428.
Circuit 428 processes the signal in accordance with the following equation. sat(y) = y if |y|<1, and sat (y) = sign (y) otherwise The saturated value appears on lead 430 and is applied to a multiplier 434 where it is multiplied by the reciprocal of the value supplied in 418 (Φ) provided via user entry point 432. The resulting product on lead 436 is applied to a summer 438 which forms a difference between the s error metric provided on lead 442 and the product of multiplier 434. The result on lead 440 is the error metric sΔ. The error metrics generated by the error processing circuitry shown in Figure 4 are used in the remainder of the circuitry to generate various quantities as will hereinafter be described.
Figure 5 is a detailed block schematic diagram of modulator circuit 324 shown in Figure 3. Modulator operates to generate the modulation signals m and 1-m which are used to smoothly switch from the adaptive controller to the sliding controller as previously described. The plant output vector x is provided, via bus 500, to a plurality of multiplier circuits of which only circuits 502 and 504 are shown. In particular, each component of the plant output (such as components xl on lead 501 and xn on lead 503) is applied to a
separate multiplier. Multiplier 502 is provided with a coefficient 1/ωl applied via user entry point 506.
Similarly multiplier 504 receives a coefficient 1/ωn applied via user entry point 508. The weighted inputs are forwarded to absolute value circuits 510-512 by means of a plurality of leads including leads 514 and 516. The absolute value of the weighted inputs generated by circuits 510-512 is then provided to a maximum circuit 518 which selects the signal with the maximum magnitude and applies the selected signal (r) to lead 520. The maximum signal r is applied to a summer 522 which subtracts 1 from the result. The resulting difference is applied, via lead 524, to a multiplier 526 where it is multiplied by the user-supplied constant 1/Ψ
provided via user entry point 527.
The resulting product on lead 528 is provided to a saturation circuit 530. The function of the saturation circuit is as previously discussed. The saturated output is provided to another maximum circuit 532 which selects the maximum of the saturated output of circuit 530 and the signal value "zero" provided on lead 534 to generate the modulation output m on lead 536. The modulation output is provided, via lead 538, to the negative input of an adder 540 and subtracted from 1 to generate the signal 1-m on lead 542.
Referring to Figure 3, the modulation signal m is applied, via lead 328, to multiplier 330 to modulate the output of the sliding controller 334. In addition, the modulation value 1-m is provided, via lead 326, to multipliers 314 and 332 in order to control the output of neural network 310 and the adaptation control signals to neural network 310.
Figure 6 illustrates the construction of an
illustrative proportional/differential controller which generates the output control signal upd from the input error metrics s and sΔ. In particular, the error metric s is provided on lead 600 to multiplier 604 where it is multiplied by constant -Kd (provided via user input 606) in order to generate a product on lead 612. The resulting product is applied to one input of an adder 616. In a similar manner, the error metric s.
provided on lead 602 is forwarded to multiplier 608 where it is multiplied by constant -K provided at user input 610. The resulting product on lead 614 is also applied to adder 616 which adds together the two signals to generate the output control signal upd on lead 618.
Figure 7 shows the construction of the sliding controller 334 in Figure 3. It uses the error metric s provided on lead 700 as an input to multiplier 702.
Multiplier 702 multiplies s by the constant 1/Φ
provided by the user input 704. The resulting product is applied to a saturation circuit 708, which operates as previously described. The saturated output on lead 710 is applied to multiplier 714 where it is multiplied by user supplied constant -Ksl provided via input
point 712. The resulting product on lead 716 is the sliding controller output usl.
Figure 8 shows the construction of the adaptive mechanism 208 shown in Figure 2. It utilizes the error metrics sΔ and ar to generate the control signals
cL and dL. In particular, the error metric sΔ
provided on lead 800 is applied to a multiplier 806 where it is multiplied by the user supplied constant -Kal provided via input point 808. The resulting product on lead 812 is the adaptation control signal cL.
The error metric sΔ is also provided, via lead
804, to one input of multiplier 810. The other input of multiplier 810 is provided with the error metric a on lead 802. The resulting product on lead 814 is provided to multiplier 816 where it is multiplied by the user supplied constant Ka2 provided via input point 818.
The resulting product on lead 820 is the control signal dL.
Figure 9 shows the construction of the neural network 310 shown in Figure 3. The neural network receives the plant output vector x on bus 900. The bus is split into a plurality of parallel busses shown schematically as leads 902, 904, 906 and 908. The number of busses is equal to the number of nodes in the network as discussed above. Each separate bus carries the vector x and is processed in a similar manner and therefore only two busses are shown in detail.
More specifically, each plant output vector is processed by appropriately weighting the variables in a plurality of processing circuits of which circuits 910 and 912 are shown. The processing performed by
processing circuits 910-912 is described by equation (3) which specifies the formation of the radial gaussian function. During this formation an input weight ξ is subtracted from each plant output variable and the absolute value of the resulting difference is squared and divided by variance σ2 of the gaussian.
The outputs of the input processing circuits
910-912 are provided to a plurality of nodes of which nodes 914-916 are shown. These nodes form the radial gaussian of their inputs and the processed signals are provided to processing circuits 918-920 which multiply resulting outputs via adjustable output weights. The adjustable output weights are controlled by the modulated adaptation signals (1-m)cL and (1-m)dL
provided by the output of multiplier 332 shown in Figure 3. In fact, the adaptation mechanism controls the magnitude of the output constants cI in equation (4). In the case of a unity gain system, the actual values which are used to adjust the output weights are obtained by multiplying the output of gaussian circuits 914-916 by the adaptation signal (1-m)cL. The resulting
product is then integrated to produce the output weights cI which multiply the outputs of the gaussian circuits 914-916 in accordance with equation (23). In the case of a non-unity gain system, operation is similar except that the outputs of the gaussian circuits 914-916 are multiplied separately by the adaptation signals (1-m)cL and (1-m)dL to generate two separate products which are then integrated to produce two weighting signals that are each multiplied by the outputs of circuits 914-916. The results of the d, multiplication are also
multiplied by the error metric a and the two resulting products are added in accordance with equation (37).
The weighted outputs of the adjustable output circuits 918-920 are provided, via leads 922-924, to an output summing circuit 926 which receives all of the outputs from the adjustable output circuits and provides the sum on output 928 which is the adaptive control output uad.
Although only a few embodiments of the present invention are disclosed above, other embodiments and modifications will be immediately apparent to those skilled in the art. For example, the techniques discussed above can be used to extend the above ideas to much more general classes of systems, including multiple input and output plants. If perfect tracking can be accomplished through reconstruction of the values assumed by unknown nonlinear functions on a compact set, and a measure of the tracking error can be found which is related to the function reconstruction errors by a passive operator, then gaussian networks of the type described above can be used to adaptively compensate for the nonlinearities, and reduce tracking errors to a neighborhood of the zero.
What is claimed is:

Claims

1. A control system for controlling a non-linear plant having a plurality of outputs so that each of said plurality of plant outputs approaches a
corresponding desired plant output, said plant outputs having a predetermined plurality of values constituting a normal operating range and a
plurality of values outside of said normal
operating range, said control system having means responsive to said plurality of plant outputs and to said plurality of desired plant outputs for generating a plurality of tracking error variables, said control system comprising:
a first controller having a plurality of variable adaption parameters and being responsive to said plurality of tracking error variables, to said plurality of plant outputs and to said
plurality of desired plant outputs for generating first control outputs for controlling said plant; an adaptive mechanism responsive to said plurality of tracking error variables for adjusting said variable adaptation parameters; and
a second controller having a plurality of fixed parameters and being responsive to said plurality of tracking error variables, to said plurality of plant outputs and to said plurality of desired plant outputs for generating second control outputs for controlling said plant
means responsive to said plurality of plant outputs for applying said first control outputs to said plant when said plurality of plant outputs are within said normal operating range and for applying said second control outputs to said plant when said plurality of plant outputs are outside of said normal operating range.
2. A control system according to Claim 1 wherein said first controller comprises a neural network.
3. A control system according to Claim 2 wherein said neural network comprises a plurality of nodes, each of said plurality of nodes comprising an
interpolator circuit, said neural network further comprising means for generating a weighted set of tracking error variables and means for applying said weighted set of tracking error variables to each of said nodes to produce interpolated outputs.
4. A control system according to Claim 3 wherein said means for generating a weighted set of tracking error variables comprises means for multiplying each of said plurality of tracking error variables by a predetermined, fixed weight value.
5. A control system according to Claim 3 wherein said neural network further comprises means for
generating a weighted set of interpolated outputs and means for summing said weighted set of
interpolated outputs to generate said first control outputs.
6. A control system according to Claim 5 wherein said means for generating a weighted set of interpolated outputs comprises means for multiplying each of said plurality of interpolated outputs by said variable adaptation parameters.
7. A control system according to Claim 3 wherein each of said interpolator circuits multiplies one of said weighted set of tracking error variables by a predetermined interpolation function.
8. A control system according to Claim 7 wherein said interpolation function is a radial Gaussian
interpolation function.
9. A control system according to Claim 1 wherein said second controller comprises a sliding controller.
10. A control system according to Claim 1 wherein said applying means comprises a first modulator circuit responsive to said plurality of plant outputs for multiplying said first control outputs by a first modulation variable having a magnitude which decreases as said plant outputs move from said normal operating range to outside of said normal operating range to generate first modulated control outputs.
11. A control system according to Claim 10 wherein said applying means comprises a second modulator circuit responsive to said plurality of plant outputs for multiplying said second control outputs by a second modulation variable having a magnitude which increases as said plant outputs move from said normal operating range to outside of said normal operating range to generate second modulated control outputs.
12. A control system according to Claim 11 wherein said applying means further comprises means for summing said first modulated control outputs and said second modulated control outputs to generate a plant control signal which smoothly changes from said first control outputs to said second control outputs as said plant outputs move from said normal operating range to outside of said normal operating range.
13. A control system according to Claim 1 further
comprising a proportional/differential controller responsive to said plurality of tracking error variables for generating third control outputs and means for summing said first control outputs, said second control outputs and said third control outputs to generate a control sum and means for applying said control sum to said plant.
14. A control system for controlling a non-linear plant having a plurality of outputs so that each of said plurality of plant outputs approaches a
corresponding desired plant output, said plant outputs having a predetermined plurality of values constituting a normal operating range, a plurality of values constituting an abnormal operating range outside of said normal operating range and a plurality of values in a transition operating range between said normal operating range and said abnormal operating range, said control system comprising:
means responsive to said plurality of plant outputs and to said plurality of desired plant outputs for generating a plurality of tracking error variables;
a neural network having a plurality of variable weighting factors and being responsive to said plurality of tracking error variables, to said plurality of plant outputs and to said plurality of desired plant outputs for generating adaptive control outputs for controlling said plant;
an adaptive mechanism responsive to said plurality of tracking error variables for adjusting said variable weighting factors; and
a sliding controller having a plurality of fixed parameters and being responsive to said plurality of tracking error variables, to said plurality of plant outputs and to said plurality of desired plant outputs for generating sliding control outputs for controlling said plant
a modulator responsive to said plurality of plant outputs for applying said adaptive control outputs to said plant when said plurality of plant outputs are within said normal operating range, for applying said sliding control outputs to said plant when said plurality of plant outputs are in said abnormal operating range and for applying a linear combination of said adaptive control outputs and said sliding control outputs to said plant when said plurality of plant outputs are in said
transition operating range.
15. A control system according to Claim 14 wherein said neural network comprises a plurality of nodes, each of said plurality of nodes comprising an
interpolator circuit, said neural network further comprising means for generating a weighted set of tracking error variables and means for applying said weighted set of tracking error variables to each of said nodes to produce interpolated outputs .
16. A control system according to Claim 15 wherein said means for generating a weighted set of tracking error variables comprises means for multiplying each of said plurality of tracking error variables by a predetermined, fixed weight value.
17. A control system according to Claim 16 wherein said neural network further comprises means for
generating a weighted set of interpolated outputs and means for summing said weighted set of
interpolated outputs to generate said adaptive control outputs.
18. A control system according to Claim 17 wherein said means for generating a weighted set of interpolated outputs comprises means for multiplying each of said plurality of interpolated outputs by said variable weighting factors.
19. A control system according to Claim 18 wherein each of said interpolator circuits multiplies one of said weighted set of tracking error variables by a predetermined interpolation function.
20. A control system according to Claim 19 wherein said interpolation function is a radial Gaussian
interpolation function.
21. A control system according to Claim 20 wherein said radial Gaussian interpolation function has a standard deviation and a means which depend on the frequency characteristics of said plant outputs.
22. A control system according to Claim 21 wherein said neural network has a predetermined number of nodes which predetermined number depends on the frequency characteristics of said plant outputs.
23. A control system according to Claim 14 wherein said modulator comprises a first modulator circuit responsive to said plurality of plant outputs for multiplying said adaptive control outputs by a first modulation variable to generate modulated adaptive control outputs, said first modulation variable having a magnitude which is "1" when said plant outputs are inside said normal operating range, which is between "0" and "1" when said plant outputs are in said transition operating range and which is "0" when said plant outputs are in said abnormal operating range.
24. A control system according to Claim 23 wherein said modulator comprises a second modulator circuit responsive to said plurality of plant outputs for multiplying said sliding control outputs by a second modulation variable to generate modulated sliding control outputs, said second modulation variable having a magnitude which is "0" when said plant outputs are inside said normal operating range, which is between "0" and "1" when said plant outputs are in said transition operating range and which is "1" when said plant outputs are in said abnormal operating range.
25. A control system according to Claim 24 wherein said first modulation variable and said second modulation variable sum to "1" for all plant outputs.
26. A control system according to Claim 25 wherein said modulator further comprises means for summing said modulated adaptive control outputs and said
modulated sliding control outputs to generate a plant control signal which smoothly changes from said adaptive control outputs to said sliding control outputs as said plant outputs move from said normal operating range through said transition operating range to said abnormal operating range.
27. A control system according to Claim 14 further
comprising a proportional/differential controller responsive to said plurality of tracking error variables for generating proportional/differential control outputs and means for summing said adaptive control outputs, said sliding control outputs and said proportional/differential control outputs to generate a control sum and means for applying said control sum to said plant.
28. In a control system for controlling a non-linear
plant having a plurality of outputs so that each of said plurality of plant outputs approaches a
corresponding desired plant output, said plant outputs having a predetermined plurality of values constituting a normal operating range and a plurality of values outside of said normal operating range, said control system having means responsive to said plurality of plant outputs and to said plurality of desired plant outputs for generating a plurality of tracking error variables, a first controller having a plurality of variable adaption parameters and being responsive to said plurality of tracking error variables, to said plurality of plant outputs and to said plurality of desired plant outputs for generating control outputs for controlling said plant, and an adaptive mechanism responsive to said plurality of tracking error variables for adjusting said variable adaptation parameters, the improvement wherein said first controller comprises:
a neural network having a plurality of nodes, each of said plurality of nodes comprising a radial Gaussian interpolator circuit;
means for generating a weighted set of tracking error variables;
means for applying said weighted set of
tracking error variables to each of said nodes to produce interpolated outputs;
means responsive to said variable adaptation parameters for generating a weighted set of
interpolated outputs; and
means for summing said weighted set of
interpolated outputs to generate said control outputs.
29. In a control system, the improvement according to Claim 28 wherein said means for generating a weighted set of interpolated outputs comprises means for multiplying each of said plurality of interpolated outputs by said variable adaptation parameters.
30. In a control system, the improvement according to Claim 28 wherein said means for generating a weighted set of tracking error variables comprises means for multiplying each of said plurality of tracking error variables by a predetermined, fixed weight value.
31. In a control system, the improvement according to Claim 28 wherein said means for generating a weighted set of interpolated outputs comprises means for multiplying each of said plurality of interpolated outputs by said variable adaptation parameters.
32. In a control system, the improvement according to Claim 28 further comprising a second controller having a plurality of fixed parameters and being responsive to said plurality of tracking error variables, to said plurality of plant outputs and to said plurality of desired plant outputs for generating second control outputs for controlling said plant; and means responsive to said plurality of plant outputs for applying said control outputs to said plant when said plurality of plant outputs are within said normal operating range and for applying said second control outputs to said plant when said plurality of plant outputs are outside of said normal operating range.
33. In a control system, the improvement according to Claim 32 wherein said second controller comprises a sliding controller.
34. In a control system, the improvement according to Claim 32 wherein said applying means comprises a first modulator circuit responsive to said plurality of plant outputs for multiplying said control outputs by a first modulation variable having a magnitude which decreases as said plant outputs move from said normal operating range to outside of said normal operating range to generate first modulated control outputs.
35. In a control system, the improvement according to Claim 34 wherein said applying means comprises a second modulator circuit responsive to said
plurality of plant outputs for multiplying said second control outputs by a second modulation variable having a magnitude which increases as said plant outputs move from said normal operating range to outside of said normal operating range to
generate second modulated control outputs.
36. In a control system, the improvement according to Claim 35 wherein said applying means further comprises means for summing said first modulated control outputs and said second modulated control outputs to generate a plant control signal which smoothly changes from said first control outputs to said second control outputs as said plant outputs move from said normal operating range to outside of said normal operating range.
37. In a control system, the improvement according to Claim 32 further comprising a
proportional/differential controller responsive to said plurality of tracking error variables for generating third control outputs and means for summing said control outputs, said second control outputs and said third control outputs to generate a control sum and means for applying said control sum to said plant.
PCT/US1992/005266 1991-06-24 1992-06-23 Stable adaptive neural network controller WO1993000618A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US720,201 1991-06-24
US07/720,201 US5268834A (en) 1991-06-24 1991-06-24 Stable adaptive neural network controller

Publications (1)

Publication Number Publication Date
WO1993000618A1 true WO1993000618A1 (en) 1993-01-07

Family

ID=24893069

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1992/005266 WO1993000618A1 (en) 1991-06-24 1992-06-23 Stable adaptive neural network controller

Country Status (2)

Country Link
US (1) US5268834A (en)
WO (1) WO1993000618A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1996007126A1 (en) * 1994-08-29 1996-03-07 Siemens Aktiengesellschaft Process for quickly adapting model-backed controls and arrangement therefor
CN105785762A (en) * 2016-03-17 2016-07-20 北京航空航天大学 Bi-axis inertially-stabilized platform high-precision control method based on self-adaptive backstepping sliding mode
EP2983053A3 (en) * 2014-08-04 2016-08-03 Canon Kabushiki Kaisha Conveying controlling apparatus, conveying controlling method and program
CN109062208A (en) * 2018-08-03 2018-12-21 合肥工业大学 A kind of adaptive Trajectory Tracking Control circuit of uncertain wheeled mobile robot
EP3885848A1 (en) * 2020-03-24 2021-09-29 Robert Bosch GmbH Controller with neural network and improved stability

Families Citing this family (107)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5471381A (en) * 1990-09-20 1995-11-28 National Semiconductor Corporation Intelligent servomechanism controller
WO1993003443A1 (en) * 1991-08-05 1993-02-18 Kawasaki Steel Corporation Signal processor and learning method thereof
JPH07501643A (en) * 1991-12-09 1995-02-16 シーメンス アクチエンゲゼルシヤフト Control parameter optimization method for systems with actual value characteristics that depend on control parameters
US5396415A (en) * 1992-01-31 1995-03-07 Honeywell Inc. Neruo-pid controller
AU658066B2 (en) * 1992-09-10 1995-03-30 Deere & Company Neural network based control system
US5477444A (en) * 1992-09-14 1995-12-19 Bhat; Naveen V. Control system using an adaptive neural network for target and path optimization for a multivariable, nonlinear process
JPH06131009A (en) * 1992-10-20 1994-05-13 Fujitsu Ltd Feedback controller
JP2750648B2 (en) * 1992-11-16 1998-05-13 本田技研工業株式会社 Adaptive controller with recursive form of parameter adjustment rule.
US5486996A (en) * 1993-01-22 1996-01-23 Honeywell Inc. Parameterized neurocontrollers
CA2118885C (en) * 1993-04-29 2005-05-24 Conrad K. Teran Process control system
DE4416317B4 (en) * 1993-05-17 2004-10-21 Siemens Ag Method and control device for controlling a material processing process
US5668717A (en) * 1993-06-04 1997-09-16 The Johns Hopkins University Method and apparatus for model-free optimal signal timing for system-wide traffic control
US5513098A (en) * 1993-06-04 1996-04-30 The Johns Hopkins University Method for model-free control of general discrete-time systems
US5408406A (en) * 1993-10-07 1995-04-18 Honeywell Inc. Neural net based disturbance predictor for model predictive control
DE4338615B4 (en) * 1993-11-11 2005-10-13 Siemens Ag Method and device for managing a process in a controlled system
US5586221A (en) * 1994-07-01 1996-12-17 Syracuse University Predictive control of rolling mills using neural network gauge estimation
US5682309A (en) * 1995-04-28 1997-10-28 Exxon Chemical Patents Inc. Feedback method for controlling non-linear processes
US5768121A (en) * 1995-05-24 1998-06-16 Johnson Service Company Adaptive flow controller for use with a flow control system
US5875109A (en) * 1995-05-24 1999-02-23 Johnson Service Company Adaptive flow controller for use with a flow control system
US5812992A (en) * 1995-05-24 1998-09-22 David Sarnoff Research Center Inc. Method and system for training a neural network with adaptive weight updating and adaptive pruning in principal component space
EP0751447B1 (en) * 1995-06-26 2001-12-19 Siemens Aktiengesellschaft Numerical control method
US6092919A (en) * 1995-08-01 2000-07-25 Guided Systems Technologies, Inc. System and method for adaptive control of uncertain nonlinear processes
US5748847A (en) * 1995-12-21 1998-05-05 Maryland Technology Corporation Nonadaptively trained adaptive neural systems
US5796922A (en) * 1996-03-29 1998-08-18 Weber State University Trainable, state-sampled, network controller
US7149590B2 (en) 1996-05-06 2006-12-12 Pavilion Technologies, Inc. Kiln control and upset recovery using a model predictive control in series with forward chaining
US7418301B2 (en) * 1996-05-06 2008-08-26 Pavilion Technologies, Inc. Method and apparatus for approximating gains in dynamic and steady-state processes for prediction, control, and optimization
US6438430B1 (en) 1996-05-06 2002-08-20 Pavilion Technologies, Inc. Kiln thermal and combustion control
US7058617B1 (en) * 1996-05-06 2006-06-06 Pavilion Technologies, Inc. Method and apparatus for training a system model with gain constraints
US8311673B2 (en) * 1996-05-06 2012-11-13 Rockwell Automation Technologies, Inc. Method and apparatus for minimizing error in dynamic and steady-state processes for prediction, control, and optimization
US6493596B1 (en) * 1996-05-06 2002-12-10 Pavilion Technologies, Inc. Method and apparatus for controlling a non-linear mill
US5933345A (en) 1996-05-06 1999-08-03 Pavilion Technologies, Inc. Method and apparatus for dynamic and steady state modeling over a desired path between two end points
US7610108B2 (en) * 1996-05-06 2009-10-27 Rockwell Automation Technologies, Inc. Method and apparatus for attenuating error in dynamic and steady-state processes for prediction, control, and optimization
US5940789A (en) * 1996-05-17 1999-08-17 Nikon Corporation Stage control method and apparatus with varying stage controller parameter
JPH1097514A (en) * 1996-09-24 1998-04-14 Masahiko Shizawa Polyvalent mapping learning method
DE59908924D1 (en) * 1998-01-14 2004-04-29 Heidolph Elektro Gmbh & Co Kg Method and device for regulating a drive system of a machine and / or a system
US6167391A (en) * 1998-03-19 2000-12-26 Lawrence Technologies, Llc Architecture for corob based computing system
JP3559209B2 (en) 1998-12-24 2004-08-25 富士通株式会社 Storage device
US7085688B1 (en) * 1999-10-22 2006-08-01 Shizuo Sumida Non-linear characteristic reproducing apparatus and non-linear characteristic reproducing program storage medium
NL1014075C2 (en) * 2000-01-13 2001-07-16 Koninkl Kpn Nv Method and device for determining the quality of a signal.
US6618631B1 (en) * 2000-04-25 2003-09-09 Georgia Tech Research Corporation Adaptive control system having hedge unit and related apparatus and methods
WO2001092974A2 (en) * 2000-05-27 2001-12-06 Georgia Tech Research Corporation Adaptive control system having direct output feedback and related apparatuses and methods
JP4265874B2 (en) 2000-07-10 2009-05-20 富士通株式会社 Disk apparatus and track following control method
US6629089B1 (en) * 2000-09-29 2003-09-30 Cirrus Logic, Inc. Artificial neural network voice coil motor controller
EP1293851B1 (en) * 2001-04-20 2014-04-09 Honda Giken Kogyo Kabushiki Kaisha Plant control apparatus
US6947913B1 (en) 2001-08-23 2005-09-20 Lawrence Technologies, Llc Systems and methods for generating string correlithm objects
US20030065409A1 (en) * 2001-09-28 2003-04-03 Raeth Peter G. Adaptively detecting an event of interest
US7031969B2 (en) * 2002-02-20 2006-04-18 Lawrence Technologies, Llc System and method for identifying relationships between database records
US9983559B2 (en) * 2002-10-22 2018-05-29 Fisher-Rosemount Systems, Inc. Updating and utilizing dynamic process simulation in an operating process environment
US7146231B2 (en) * 2002-10-22 2006-12-05 Fisher-Rosemount Systems, Inc.. Smart process modules and objects in process plants
DE10348563B4 (en) 2002-10-22 2014-01-09 Fisher-Rosemount Systems, Inc. Integration of graphic display elements, process modules and control modules in process plants
US7277764B2 (en) 2002-12-09 2007-10-02 Georgia Tech Research Corporation Adaptive output feedback apparatuses and methods capable of controlling a non-minimum phase system
US7194320B2 (en) * 2003-06-05 2007-03-20 Neuco, Inc. Method for implementing indirect controller
US7447664B2 (en) * 2003-08-28 2008-11-04 Boeing Co Neural network predictive control cost function designer
US7046794B2 (en) 2003-12-12 2006-05-16 Motorola, Inc. Double talk activity detector and method for an echo canceler circuit
US7680265B2 (en) * 2003-12-12 2010-03-16 Continental Automotive Systems, Inc. Echo canceler circuit and method
US7599483B2 (en) * 2003-12-12 2009-10-06 Temic Automotive Of North America, Inc. Echo canceler circuit and method
US7099458B2 (en) * 2003-12-12 2006-08-29 Motorola, Inc. Downlink activity and double talk probability detector and method for an echo canceler circuit
TWI231481B (en) * 2004-03-11 2005-04-21 Quanta Comp Inc Electronic apparatus
JP2007536634A (en) 2004-05-04 2007-12-13 フィッシャー−ローズマウント・システムズ・インコーポレーテッド Service-oriented architecture for process control systems
US7729789B2 (en) 2004-05-04 2010-06-01 Fisher-Rosemount Systems, Inc. Process plant monitoring based on multivariate statistical analysis and on-line process simulation
US7523035B2 (en) * 2004-07-26 2009-04-21 Motorola, Inc. Hands-free circuit and method for communicating with a wireless device
JP4376202B2 (en) * 2005-04-07 2009-12-02 本田技研工業株式会社 Control device
US20070041730A1 (en) * 2005-08-09 2007-02-22 Alcatel RF extender co-located with DSL remote
US7546190B2 (en) * 2005-08-31 2009-06-09 Northrop Grumman Systems Corp. Adaptive dead-zone compensation for steering systems
US8719327B2 (en) * 2005-10-25 2014-05-06 Fisher-Rosemount Systems, Inc. Wireless communication of process measurements
US7620460B2 (en) * 2005-10-25 2009-11-17 Fisher-Rosemount Systems, Inc. Process control with unreliable communications
US7587252B2 (en) * 2005-10-25 2009-09-08 Fisher-Rosemount Systems, Inc. Non-periodic control communications in wireless and other process control systems
US8055358B2 (en) 2005-12-05 2011-11-08 Fisher-Rosemount Systems, Inc. Multi-objective predictive process optimization with concurrent process simulation
US8527252B2 (en) * 2006-07-28 2013-09-03 Emerson Process Management Power & Water Solutions, Inc. Real-time synchronized control and simulation within a process plant
US7496414B2 (en) * 2006-09-13 2009-02-24 Rockwell Automation Technologies, Inc. Dynamic controller utilizing a hybrid model
FR2908542B1 (en) * 2006-11-15 2009-01-23 Eurocopter France METHOD AND SYSTEM FOR DETECTING AND LOCATING A DRAFT OR FAULT OF A ROTOR OF GIRAVION
DE102007005070B4 (en) * 2007-02-01 2010-05-27 Klippel, Wolfgang, Dr. Arrangement and method for the optimal estimation of the linear parameters and the non-linear parameters of a model describing a transducer
US8185290B2 (en) * 2008-03-07 2012-05-22 Caterpillar Inc. Data acquisition system indexed by cycle segmentation
US8156048B2 (en) * 2008-03-07 2012-04-10 Caterpillar Inc. Adaptive payload monitoring system
US8024095B2 (en) 2008-03-07 2011-09-20 Caterpillar Inc. Adaptive work cycle control system
US20090287320A1 (en) * 2008-05-13 2009-11-19 Macgregor John System and Method for the Model Predictive Control of Batch Processes using Latent Variable Dynamic Models
US8219437B2 (en) * 2008-07-10 2012-07-10 Palo Alto Research Center Incorporated Methods and systems for constructing production plans
US8165705B2 (en) * 2008-07-10 2012-04-24 Palo Alto Research Center Incorporated Methods and systems for continuously estimating persistent and intermittent failure probabilities for production resources
US8145334B2 (en) * 2008-07-10 2012-03-27 Palo Alto Research Center Incorporated Methods and systems for active diagnosis through logic-based planning
US8266092B2 (en) * 2008-07-10 2012-09-11 Palo Alto Research Center Incorporated Methods and systems for target value path identification
US8868221B1 (en) * 2008-08-22 2014-10-21 Marvell International Ltd. Adaptive neural net feed forward system and method for adaptive control of mechanical systems
US8881039B2 (en) 2009-03-13 2014-11-04 Fisher-Rosemount Systems, Inc. Scaling composite shapes for a graphical human-machine interface
US8359110B2 (en) * 2009-03-23 2013-01-22 Kuhn Lukas D Methods and systems for fault diagnosis in observation rich systems
US8825183B2 (en) 2010-03-22 2014-09-02 Fisher-Rosemount Systems, Inc. Methods for a data driven interface based on relationships between process control tags
US20110257800A1 (en) * 2010-04-14 2011-10-20 Zakariya Al-Hamouz Particle swarm optimizing sliding mode controller
US20110276150A1 (en) * 2010-05-10 2011-11-10 Al-Duwaish Hussain N Neural network optimizing sliding mode controller
EP2732343B8 (en) * 2011-07-13 2020-12-16 Siemens Energy Global GmbH & Co. KG Automated adaptation of a power plant control
US10423127B2 (en) 2012-01-17 2019-09-24 Fisher-Rosemount Systems, Inc. Velocity based control in a non-periodically updated controller
US11199824B2 (en) 2012-01-17 2021-12-14 Fisher-Rosemount Systems, Inc. Reducing controller updates in a control loop
US9298176B2 (en) 2012-01-17 2016-03-29 Fisher-Rosemount Systems, Inc. Compensating for setpoint changes in a non-periodically updated controller
CN103676649A (en) * 2013-10-09 2014-03-26 江苏师范大学 Local self-adaptive WNN (Wavelet Neural Network) training system, device and method
US10481565B2 (en) 2014-11-11 2019-11-19 Applied Brain Research Inc. Methods and systems for nonlinear adaptive control and filtering
CN104598552A (en) * 2014-12-31 2015-05-06 大连钜正科技有限公司 Method for learning incremental update-supported big data features
CN105045097B (en) * 2015-05-26 2017-10-27 河海大学常州校区 A kind of gyroscope inverting global sliding mode fuzzy control method based on neutral net
CN105204341B (en) * 2015-09-25 2017-09-05 西安石油大学 A kind of network control system Robust Tracking Control theoretical based on switching control
CN105629738B (en) * 2016-03-24 2018-06-29 内蒙古瑞特优化科技股份有限公司 SCR flue gas denitrification systems control method and equipment
CN105911863B (en) * 2016-05-19 2018-12-14 杭州新松机器人自动化有限公司 Multi-robot Cooperation grasping system neural network Trajectory Tracking Control method
US10878140B2 (en) 2016-07-27 2020-12-29 Emerson Process Management Power & Water Solutions, Inc. Plant builder system with integrated simulation and control system configuration
US11604440B2 (en) * 2017-03-29 2023-03-14 Hitachi, Ltd. Control switching device for abnormality prevention in multiple terminals
US11507032B2 (en) * 2017-11-06 2022-11-22 Hitachi, Ltd. Control device using artificial intelligence
US11188832B2 (en) 2018-05-16 2021-11-30 Distech Controls Inc. Method and environment controller for validating a predictive model of a neural network through interactions with the environment controller
US11041644B2 (en) * 2018-05-16 2021-06-22 Distech Controls Inc. Method and environment controller using a neural network for bypassing a legacy environment control software module
CN109946973B (en) * 2019-04-12 2021-05-04 东北大学 Combined sliding mode control method combining rapid terminal sliding mode and linear sliding mode
US11604459B2 (en) 2019-07-12 2023-03-14 Emerson Process Management Power & Water Solutions, Inc. Real-time control using directed predictive simulation within a control system of a process plant
US11418969B2 (en) 2021-01-15 2022-08-16 Fisher-Rosemount Systems, Inc. Suggestive device connectivity planning
CN112947056B (en) * 2021-03-04 2022-10-14 北京交通大学 Magnetic-levitation train displacement speed tracking control method based on IGWO-BP-PID
CN114200837B (en) * 2021-12-08 2023-10-10 北京理工大学 Layering sliding mode control method for interference unknown spherical robot

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS57199004A (en) * 1981-06-01 1982-12-06 Toshiba Corp Sample value adaptive process controller
JPH0387901A (en) * 1989-08-31 1991-04-12 Omron Corp Fuzzy controller capable of change of rule and its working method
US5159660A (en) * 1990-08-09 1992-10-27 Western Thunder Universal process control using artificial neural networks

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ICCON '89 PROCEEDINGS April 1989, JERUSALEM pages 1 - 5 G.W. NEAT, H. KAUFMAN, R.J. ROY 'A HYBRID ADAPTATIVE APPROACH FOR DRUG DELIVERY SYSTEMS' *
PROCEEDINGS OF THE 25TH IEEE CONFERENCE ON DECISION & CONTROL December 1986, ATHENS GR pages 409 - 414 G.L. LUO, G.N. SARIDIS, C.Z. WANG 'A DUAL-MODE CONTROL FOR ROBOTIC MANIPULATORS' *
PROCEEDINGS OF THE 29TH IEEE CONFERENCE ON DECISION AND CONTROL vol. 3, December 1990, HONOLULU HAWAII pages 1757 - 1762 S.K. BISWAS 'A CONJUGATE HOPFIELD NEURAL NETWORK FOR OPTIMUM SYSTEMS CONTROL' *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1996007126A1 (en) * 1994-08-29 1996-03-07 Siemens Aktiengesellschaft Process for quickly adapting model-backed controls and arrangement therefor
EP2983053A3 (en) * 2014-08-04 2016-08-03 Canon Kabushiki Kaisha Conveying controlling apparatus, conveying controlling method and program
US9440265B2 (en) 2014-08-04 2016-09-13 Canon Kabushiki Kaisha Conveying controlling apparatus, conveying controlling method and program
CN105785762A (en) * 2016-03-17 2016-07-20 北京航空航天大学 Bi-axis inertially-stabilized platform high-precision control method based on self-adaptive backstepping sliding mode
CN109062208A (en) * 2018-08-03 2018-12-21 合肥工业大学 A kind of adaptive Trajectory Tracking Control circuit of uncertain wheeled mobile robot
EP3885848A1 (en) * 2020-03-24 2021-09-29 Robert Bosch GmbH Controller with neural network and improved stability

Also Published As

Publication number Publication date
US5268834A (en) 1993-12-07

Similar Documents

Publication Publication Date Title
US5268834A (en) Stable adaptive neural network controller
Zervos et al. Deterministic adaptive control based on Laguerre series representation
US5038269A (en) Industrial control systems
Ge et al. Adaptive predictive control using neural network for a class of pure-feedback systems in discrete time
Sun et al. Stable neural-network-based adaptive control for sampled-data nonlinear systems
Smith et al. An algorithm for automated fuzzy logic controller tuning
Tutsoy et al. A novel exploration-exploitation-based adaptive law for intelligent model-free control approaches
Ghavidel et al. Robust control for MIMO hybrid dynamical system of underwater vehicles by composite adaptive fuzzy estimation of uncertainties
Lakhekar et al. Novel adaptive fuzzy sliding mode controller for depth control of an underwater vehicles
Lakhekar et al. Dynamic fuzzy sliding mode control of underwater vehicles
Kroll et al. Fuzzy network model-based fuzzy state controller design
US6029095A (en) Method and device for dynamic system control
Yahya et al. Model predictive control for uncertain PWA systems based on robust observer
Wang et al. Filtering controller in sliding mode: From the estimation to control
Abdullah et al. Fuzzy logic based switching and tuning supervisor for a multi-variable multiple controller
Lauritsen et al. Generalized predictive control in the delta-domain
Li et al. An RBFNN-based direct inverse controller for PMSM with disturbances
Tatjewski et al. Nonlinear Predictive Control
Ahmed Developing and Testing of an MPC strategy for a four tanks multivariable process using Emerson Delta V system.
Dogruer An integral-model predictive controller with finite memory for trajectory tracking
Herman IQV Position and Velocity Tracking Control with Adaptive Term in Earth-Fixed Frame
Grimble et al. Linear and nonlinear predictive optimal control
Grimble et al. The F-iteration approach to H∞ control
Samad et al. Neural networks as optimal nonlinear controllers-Concepts and flight control applications
Davison The stability of an nth-order nonlinear time-varying differential system

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): CA JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IT LU MC NL SE

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: CA