ebook img

DTIC ADA439986: Stabilization of LTI Systems with Communication Constraints PDF

10 Pages·0.36 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview DTIC ADA439986: Stabilization of LTI Systems with Communication Constraints

T R R ECHNICAL ESEARCH EPORT Stabilization of LTI Systems with Communication Constraints by Dimitrios Hristu CDCSS T.R. 99-3 (ISR T.R. 99-52) + CENTER FOR DYNAMICS C D - AND CONTROL OF S SMART STRUCTURES The Center for Dynamics and Control of Smart Structures (CDCSS) is a joint Harvard University, Boston University, University of Maryland center, supported by the Army Research Office under the ODDR&E MURI97 Program Grant No. DAAG55-97-1-0114 (through Harvard University). This document is a technical report in the CDCSS series originating at the University of Maryland. Web site http://www.isr.umd.edu/CDCSS/cdcss.html Report Documentation Page Form Approved OMB No. 0704-0188 Public reporting burden for the collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden, to Washington Headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington VA 22202-4302. Respondents should be aware that notwithstanding any other provision of law, no person shall be subject to a penalty for failing to comply with a collection of information if it does not display a currently valid OMB control number. 1. REPORT DATE 3. DATES COVERED 1999 2. REPORT TYPE - 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Stabilization of LTI Systems with Communication Constraints 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S) 5d. PROJECT NUMBER 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) 8. PERFORMING ORGANIZATION Army Research Office,PO Box 12211,Research Triangle Park,NC,27709 REPORT NUMBER 9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES) 10. SPONSOR/MONITOR’S ACRONYM(S) 11. SPONSOR/MONITOR’S REPORT NUMBER(S) 12. DISTRIBUTION/AVAILABILITY STATEMENT Approved for public release; distribution unlimited 13. SUPPLEMENTARY NOTES 14. ABSTRACT see report 15. SUBJECT TERMS 16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF 18. NUMBER 19a. NAME OF ABSTRACT OF PAGES RESPONSIBLE PERSON a. REPORT b. ABSTRACT c. THIS PAGE 9 unclassified unclassified unclassified Standard Form 298 (Rev. 8-98) Prescribed by ANSI Std Z39-18 subm. to ACC 2C00 Stabilization of LTI Systems with Communication Constraints1 Dimitris Hristu2 University of Maryland, Institute for Systems Research, College Park, MD 20742 e-mail: [email protected] tational complexity and are appealing because they Abstract quantify the amount of \attention" that the controller paystoeachcomponentoftheunderlyingsystem. Pre- This workis directed towardsexploring interactionsof vious work has addressed problems involving stabiliz- communication and control in systems with communi- ing anLTIsystemwhenonlysomeofits output(s)can cation constraints. Examples of such systems include be measured at one time [10]. That work explored the groupsofautonomousvehicles,MEMSarraysandsys- e(cid:11)ects of communicating sequentially with each of a temswhosesensorsandactuatorsaredistributedacross set of linear systems and described a method for sta- anetwork. Weextendsomerecentresultsinvolvingsta- bilizing all systems in the set. This paper extends and bilizationofLTIsystemsunderlimitedcommunication completes those results, taking into account communi- and address a class of feed-forward control problems cation constraints that are present both when making for the systems of interest. measurementsaswellaswhentransmitting controlac- tions to the LTI system. 1 Introduction We present a new \extensi(cid:12)cation algorithm" that With communication networks proliferating into in- transforms the stabilization problem to an equivalent creasinglymanyaspectsofmodern-daytechnology,en- problem involving matrix search (as is the case with gineers are faced with new opportunities involving the the work in [10]). Our goal is to combine our algo- control of systems whose components are distributed rithm together with recent results on tracking for net- acrossa networkandwhicharesupposedto operatein worked control systems ([8]) so that closed-loop track- a coordinated manner. Examples of such systems in- ing can be achieved for the systems of interest. For clude groupsof vehicles, satellite clusters,mobile com- related problems in hybrid system theory see [3], [5] munications,MEMSarraysandothers. Thechallenges [4], and [12]. Work discussing systems with communi- that these systems presentgo beyondproblems associ- cation constraints can be found in [2], [11] and others. ated with increased dimensionality. Frequently, their performance is limited not because of lack of compu- 2 A Prototype Computer-Controlled System tational power but because of lack of time on a shared network of sensors and actuators. This has lead to Consider an n-dimensional LTI system G(s) with recente(cid:11)ortstowardsbringingtogetheraspectsofcon- input u 2 m, and output y 2 p. The system trol and communication under a framework that will R R is driven by a computer or other digital controller lead to a better understanding of controlsystems with (Fig. 1) that may be remotely located and which does communication constraints. not have simultaneous access to all inputs/outputs of One way to jointly formulate control and communica- the control system. In particular: tion problems is to employ the idea of a \communica- (cid:15) The controller sends commands to and receives tion sequence" [6] which allows multiple (sub)systems inputs from the system every (cid:1) units of time, via a to share the attention of a centralized controller [8]. zero-order-holdstage. Communicationsequencesbringforthsomeofthecon- (cid:15) Inputs/outputs are communicated through a bus nections between control, communication and compu- which has limited capacity. Speci(cid:12)cally, the bus can \carry"at mostb>0 signals, with b<m+p, b2 (cid:3). 1ThisworkwasfundedbyNSFGrantno. EEC94-02384and N AROGrantno.DAAG55-97-1-0144. Typically, the capacity of the communication bus 2This work was done while the author was with the Divi- will be divided between input and output signals, sionofEngineeringandAppliedSciencesatHarvardUniversity, Cambridge,MA. with br channels devoted to sampling the output of and with the transmission of those samples through the communication bus. De(cid:12)nition 2 Consider a computer-controlled system G(z) with b < b (b < b) being the dimension of the w r input (output) communication bus. A pair of commu- nication sequences σ 2 m(cid:2)N, σ 2 p(cid:2)N is admis- w r E E sible if: Figure 1: A closed-loop computer-controlled system (cid:15) Pkσw(i)k2 (cid:20)bw,Pkσr(i)k2 (cid:20)br 8i=0,...,N −1 (cid:15) m σ (i)+ p σ (i)(cid:20)b 8i=0,...,N −1 j=1 w j=1 r (cid:15) Spanfσ (0),...,σ (N −1)g= m and the underlying LTI system and bw channels used for Spanfσw(0),...,σ w(N −1)g= Rp transmitting control inputs. We will refer to these r r R Theaboveconditionsrequirethatnomorethanb (b ) two groups of channels as the \input" and \output w r ofthesysteminputs(outputs)areupdated(measured) bus." Of course, b and b may change at any time as r w by the controller at every step and that the pair long as b +b = b. This represents a rather general r w (σ ,σ ) allows communication with all inputs (out- setting for controller/plant communication, allowing w r puts) of the linear system at least once every period. for dynamic recon(cid:12)guration of the available channels. We will use the terms \narrow" (b < m or b < p) w r 3 Stabilization with Limited Communication and \wide" (b (cid:21) m and b (cid:21) p) to describe the w r communication bus. We now focus on the problem of stabilizing systems Inthefollowingitwillbeconvenienttousethediscrete- like the one discussed in the previous section, using time equivalent of the linear system G(s): static output feedback (see Fig. 1). We will take the numberofinputandoutputchannels(b andb )tobe w r x(k+1) = Ax(k)+Bu(k) constant. y(k) = Cx(k) (1) Problem Statement 1 Given: a computer- Each element of u retains its value (by virtue of the controlled system G(z) with b ,b 2 N(cid:3),b +b = b ZOH stage) until that element is updated by the con- r w r w denoting the size of the input and output commu- troller. At the same time, the controller may only re- nication busses and a pair of admissible N-periodic ceivepartialinformationabouttheoutputy(k). When communication sequences σ ,σ 2 p(cid:2)N, m(cid:2)N, (cid:12)nd the communication bus is narrow, one possibility is to a constant feedback gain Γ 2r wp(cid:2)mEtpheart sEtapbeirlizes the choose a sequence of operations for the switches (see R system under output feedback. Fig. 1) that select which inputs/outputs are to be up- dated/sampled at a particular time. This leads us to the notion of a \communication sequence" (originally In [10], weshowedthat a versionofthis problem(with introducedin[6])bywhichthecontrollerchooseswhich noconstraintsontransmittingcontrolsamplesu(k))is of the input signals (measurements) to update (read) equivalent to the NP-hard problem: at every step. Problem Statement 2 Given a collection of matri- ces A 2 q(cid:2)q,0(cid:20)i(cid:20)i , and scalars γ ,...,γ 2 De(cid:12)nition 1 An N-periodic communication se- i R max 1 imax , (cid:12)nd a stable element of the a(cid:14)ne subspace quence is an element of R Epme(cid:2)rN =f(σ(0),σ(1),...,σ(N −1), A=A0+iXmaxγiAi (3) σ(0),...,σ(N −1),...):σ(i)2f0,1gmg (2) i=1 whereq =(2N2−N)nandi =mp. Inthenextsec- max for some m>0. tion we present a new extensi(cid:12)cation algorithm. This We will consider the controller’s communication with algorithm is complete in the sense that takes into ac- the system to follow a periodic pattern speci(cid:12)ed by count the both input (b <m) and measurement con- w a pair of N-periodic sequences: a \control" sequence straints (b < p) and arrives at a similar construction σw 2 pme(cid:2)rN will be used to transmit inputs and a for the martrices that span the a(cid:14)ne subspace of inter- \measuErement" sequence σ 2 p(cid:2)N will provide a est. r per E patternforsamplingthesystemoutput. Theentriesof σ (i)(σ (i))indicatewhichelements ofu(k)(y(k))are 4 Extensive Form of a Discrete LTI System w r to be updated (measured) at the kth time step. We willignorequantizationerrorsassociatedwiththerep- Consider the system of Fig. 1, to which a static out- resentation of signal samples in the digital controller put feedback controller is attached, subject to limited p. 2 communicationasdescribedinSec.2. Iftheinput and 4.2 Constrained Control output busses were both \wide" (b = p, b = m) We now follow the procedure of Sec. 4.1, applied this r w thenthestaticoutputfeedbackcontrolu(k)=ΓCx(k) time to communication over the input bus. The in- wouldbepossibletoimplementandΓwouldhavetobe put vectors u(k) arriveat the LTI system accordingto chosen to make the closed-loop dynamics (A+BΓC) the communication sequence σ 2 m(cid:2)N. At the kth w per stable. We proceed to modify this simple situation to step, only the inputs speci(cid:12)ed by thEe non-zero entries reflect the existence of communication constraints. of σ (k) are updated, with all other inputs remaining w at their previous values: 4.1 Constrained Measurements Assume for now that bw = m so that the controller u(k)=diag(σw(k))Γyl(k)+(I−diag(σw(k)))u(k−1) cantransmittheentireinputvectoratoncetotheLTI (9) system. Because measurements are to be obtained ac- By iterating backwardsfor a full period (N steps) and cording to the communication sequence σr 2 ppe(cid:2)rN, assuming that the communication sequence σw is ad- only some of the elements of y(k) are received bEy the missible (i.e. all m elements of the input are updated controller at each step k. The elements for which the at least once every N steps) we obtain: controllerdoesnotreceiveupdatesareassumedtohold NX−1 theirlast-knownvalues. Moreprecisely,wehaveacon- u(k)= D (k,i)Γy (k−i) (10) W l trol law of the form i=0 u(k)=Γyl(k) (4) where (cid:26) whereyl(k) is the output vector composedof the most 4 diag(σw(k)) Q i=0 up to date information available to the controller at DW(k,i)= diag(σ (k−i)) i−1M (k,j) i>0 the kth step. Notice that in general yl(k) 6= y(k) be- w j=0 W (11) causenotallelementsofy(k)arecommunicatedtothe 4 arediagonalm(cid:2)mbinarymatricesandM (k,j)=I− controller at step k. We can write W diag(σ (k−j)). ThejthdiagonalelementofD (k,i) w W y (k)=diag(σ (k))y(k)+(I −diag(σ (k)))y (k−1) is 1 if the jth input was last updated at the (k−i)th l r r l (5) step, 0 otherwise. where for a vector x2 n, diag(x) is an n(cid:2)n matrix with the elements of xRalong its diagonal and all o(cid:11)- 4.3 Combining Communication Constraints diagonal entries being zero. By iteratively applying If communication with the system is to proceed ac- Eq.5foranumberofstepsequaltothecommunication cording to the pair (σr,σw) of admissible N-periodic period N, we obtain sequences, we can combine the results of Sec. 4.1 and Sec. 4.2. By substituting Eq. 7 into Eq. 10 we obtain: y (k) = diag(σ (k))y(k)+ l r 0 1 2XN−2 NX−1diag(σ (k−i))@iY−1M (k,j)Ay(k−i) (6) Bu(k) = Fkix(k−i) (12) r R i=0 i=1 j=0 where 4 where M (k,j) = I − diag(σ (k−j)). We observe R r bic(Xi−N−1) that if the communication sequence σr is admissible F =4 B N D (k,j)ΓD (k−j,i−j)C (13) then each of the p elements of the output y(k) will be ki W R read at least once every N steps so the summation in j=min(i,N−1) Eq.6terminatesafteratmostN steps: systemoutputs We can now write the closed-loop dynamics of the older than N steps areessentially\overwritten"in the computer-controlled system as: controller. We can rewrite Eq. 6 as 2XN−2 NX−1 x(k+1)=Ax(k)+ F x(k−i) (14) y (k)= D (k,i)Cx(k−i) (7) ki l R i=0 i=0 Letus de(cid:12)ne Comp(p)to bethecompanionfoPrmasso- where ciated with an nth-degree polynomial p(s)= np si (cid:26) 0 i 4 diag(σr(k)) Q i=0 2 3 DR(k,i)= diag(σ (k−i)) i−1M (k,j) i>0 0 1 0 (cid:1)(cid:1)(cid:1) 0 r j=0 R 66 0 0 1 (cid:1)(cid:1)(cid:1) 0 77 are diagonal p(cid:2)p matrices with binary entries. T(h8e) Comp(p)=4 666 ... ... ... ... 0 777 (15) jth diagonal element of DR(k,i) is 1 if the jth output 4 0 (cid:1)(cid:1)(cid:1) (cid:1)(cid:1)(cid:1) 0 1 5 was last read at the (k−i)th step and is 0 otherwise. pn pn−1 (cid:1)(cid:1)(cid:1) p1 p0 p. 3 If we now use the F (Eq. 13) to de(cid:12)ne the matrix of the state. If all elements of u(k) could be commu- kj polynomial nicated simultaneously (wide input bus), it would be su(cid:14)cient to \extensify" the state by considering only 2XN−2 its N-past values at times k,k−1,...,k−N −1 (see f (s)=A+ F si (16) k kj [10]). However, because the inputs are also subject to i=0 delays in arriving at the LTI system, a particular el- then the closed-loop dynamics of Eq. 14 can be ex- ement of u(k) could persist for (at most) the next N pressed in (cid:12)rst-order form: steps k,k+1,...,k+N −1. This implies that the en- tries of u(k) as seen by the LTI system could depend χ(k+1)=Comp(f )χ(k) (17) on state values that are (at most) 2N −1 steps old. k where χ=[xT (cid:1)(cid:1)(cid:1) xT xT ]T 2 (2N−1)n. In summary, we have given a procedure for converting (k−2N+1) (k) (k+1) R anoutputfeedbackstabilizationprobleminvolvingLTI The system of Eq. 17 is linear, time-varying and de- systems under limited communication, into a search scribes the state evolution of the computer-controlled problem involving a (cid:12)nite collection of (2N2 − N)n- system under output feedback and periodic commu- dimensional matrices. These matrices are obtained nication. We have essentially \extensi(cid:12)ed" the state from the parameters of the control system together vector to include past values up to two communica- with a pair of admissible communication sequences. tion periods. We note that the matrix Comp(f ) is k N-periodic in k, so Eq. 17 represents a periodic linear Equations 18 and 19 show that after a full communi- systemof dimension (2N−1)n. Although oflargerdi- cation period, the stability properties of the extensive mension,this periodicsystemis equivalentto the orig- form can be captured in a lower-dimensional space by inal system in the sense that it exactly describes the considering closed loop dynamics under the communication policy z(k+1)=A^z(k) (21) that was imposed. whereA^=AN−1AN−2...A0anddim(z)=(2N−1)n. In this lower-dimensionalspace,thenumber ofgainsis It is a fact that every discrete-time periodic system the same as in the extensive form, however the gains can be expressed as a time-invariant system of higher dimension(see[7]). Inourcase,this yields a systemof enter A^nonlinearly. Ifthenumber ofgainsisnγ =4 mp order (2N2−N)n which we call the \extensive form" then the number of matrix coe(cid:14)cients for the exten- of the original system in Problem 1: siveform is nγ+1. In the lower-dimensPional(cid:0)spaceth(cid:1)e numberofmatrixcoe(cid:14)cientswouldbe N nγ+i−1 . Xe(k+1)=AXe(k) (18) Thus, even though lower-dimensional mati=ri0ces arie of- ten convenient from a computational viewpoint, the where Xe(k)2 (2N2−N)n and potential savings in algorithm run times are o(cid:11)set by R 2 3 the memory demands due to the large number of coef- 6 Com0p(f1) (cid:1)0(cid:1)(cid:1) (cid:1)0(cid:1)(cid:1) 00 Com0p(f0) 7 (cid:12)cient matrices that Eq. 21 requires. A=4 0 Comp(f2) 0 (cid:1)(cid:1)(cid:1) 0 5 ... ... ... (cid:1)(cid:1)(cid:1) 0 5 An Extensi(cid:12)cation Example 0 (cid:1)(cid:1)(cid:1) 0 Comp(fN−1) 0 (19) Consider the scalar system By construction, stability of the extensi(cid:12)ed system (Eq. 18) is equivalent to the stability of the original x(k+1) = ax(k)+u(k) (22) system. Moreover, each of the matrices Comp(f ) are a(cid:14)neintheentriesofΓ.PBychoosingabasisfor km(cid:2)p, y(k) = x(k) (23) wecanexpressΓasΓ= mp γ E whereE isanRm(cid:2)p i=0 i i i We assume that the controller communicates with the matrixwhose(bic+1,imodp+1)th entryis\1",with p abovesystemoverabusofwidthb=1accordingtothe allotherentriesbeingzero. InthebasisofthefEigwe following pair of 2-periodic communication sequences: can express A as an element of the a(cid:14)ne subspace σ =(1,0,1,0,...), σ =(0,1,0,1,...) (24) Xmp r w A=A + γ A (20) 0 i i Noticethattheonlycommunicationchannelisusedfor i=0 measurementsoftheoutputandfortransmissionofthe where each of the A are obtained by substituting E control actions in an alternating fashion. Clearly, the i i for Γ in Eq. 13. above sequences are admissible. We want to stabilize the system using a control law of the form We note that at the kth step, the latest available out- puttothecontrollery (k)dependsonthepastN values u(k)=γy(k−d(σ ,σ ,k)) (25) l r w p. 4 where d(σ ,σ ,k) is a delay that depends on the com- 7 Simulation Results r w munication sequences and the current step k. Consider the fourth-order LTI system: The scalar system we began with, combined with the (cid:20) (cid:21) (cid:20) (cid:21) 1 3/4 1/2 0 0 0 period-2 pair of communication sequences σr,σw gives x(k+1) = 1/4 3 1/3 −1/3 x(k)+ 1 0 u 1/6 0 −1/2 −3/7 0 0 rise to a 3-dimensional periodic system χ(k + 1) = (cid:2) 0 −1 2/(cid:3)5 0 0 1 Comp(f )χ(k) with y = 1 0 0 1 x (31) k (cid:26) 0 1 0 0 a+γs2 k even whose open-loop system eigenvalues are shown in f (s) = (26) k a+γs k odd Fig. 2. We want to stabilize this LTI system using The corresponding companion forms are: 3 8 2 3 >>>>>>< 4 00 10 01 5 k even 2 1 2 γ 0 a 3 Comp(f ) = (27) k >>>>>>: 4 00 10 01 5 k odd Im 0 0 γ a −1 −2 and the extensive form is given by the 6-dimensional LTI system: −3 2 3 −2 0 Re 2 4 0 1 0 6 7 6 0 0 0 1 7 Figure2:Eigenvaluesofopen-loopsystem,kλ k=3.2. 6 7 max Xe(k+1)=666 0 1 0 γ 0 a 777Xe(k) (28) static output feedback, given that the communication 4 5 0 0 1 0 bus can carry two signals to/from any of the inputs or 0 γ a outputs (i.e. b = 2). Of the two available channels, one is to be used for transmitting control values, the Choosing the gain γ in order to stabilize the above otherforobtainingmeasurements. Inthefollowing,we system is equivalent to (cid:12)nding a stable element of the investigate the performance of the extensi(cid:12)cation and formgivenby Eq.20 whereA andA canbe reado(cid:11) 0 1 simulated annealingalgorithms(Sec. 4,6 ) for two dif- from Eq. 28. ferent communication sequences. 6 Finding a Set of Stabilizing Gains 7.1 Control with Uniform Attention Initially thecontrolleris to divideits attentionequally Afteracomputer-controlledsystemhasbeenputinthe between each of the input/output pairs (u ,y ) and 1 1 extensiveformasdescribedinSec.4,theoutputstabi- (u ,y ). For that purpose, we chose 2 2 lizationproblembecomesequivalentto(cid:12)ndingastable (cid:18)(cid:20) (cid:21) (cid:20) (cid:21) (cid:20) (cid:21) (cid:20) (cid:21) (cid:19) element of the a(cid:14)ne subspace de(cid:12)ned by Eq. 20. This σ =σ = 1 , 0 , 1 , 0 ,(cid:1)(cid:1)(cid:1) (32) isanNP-hardproblem[1]. Onepossibilityis toPchoose w r 0 1 0 1 thegainsγ sothattheeigenvaluesofA=A + γ A i 0 i i i to be thecommunicationsequencesto be used,so that are enclosed in a circle with the smallest possible ra- the controller reads and transmits in an alternating dius. This suggests minimizing the spectral radius of pattern. the closed-loop system The matrices composing the extensive form were com- η =jjλ (A)jj (29) max puted and simulated annealing was performed on the To negotiate the large number of local minima that four elements ofthe feedbackmatrix Γ. Thesimulated are expected, we use simulated annealing on the gains annealingalgorithmwasstoppedafter5000steps. The γ . Our algorithm numerically computes the gradient coolingschedulewasa 1 decayfollowedbyalinear i log(n) ∂η/∂γi and then lets the gains γi flow along that gra- decaytozerowhenthegaing(t)reachedapre-speci(cid:12)ed dient, adding a white-noise term dw with a gain g(t) level of 0.1. A plot of the cooling schedule is shown that decays to zero: in Fig. 3. Choosing \good" cooling schedules for the stabilization problem considered here remains an open ∂η dγi = dt+g(t)dw. (30) problem. Inthiscase,thespectralradiusdidnotreach ∂γ i valuesbelowunity. Theevolutionofthespectralradius The\coolingschedule"g(t)shouldgotozeroast!1, isshowninFig.4. Theresulting(cid:12)naleigenvaluesofthe butitshoulddosoataslowenoughrateforthespectral extensi(cid:12)ed system are shown in Fig. 5, corresponding radius to approach the global minimum. to a closed-loop system that was unstable. p. 5 7.2 Towards Optimal Communication Next, we investigated a period-four pair of communi- cation sequences, Cooling Schedule 50 (cid:18)(cid:20) (cid:21) (cid:20) (cid:21) (cid:20) (cid:21) (cid:20) (cid:21) (cid:19) 1 1 1 0 40 σw =σr = 0 , 0 , 0 , 1 ,(cid:1)(cid:1)(cid:1) (33) 30 that devote three cycles to the pair (u ,y ) for every 1 1 20 one cycle allocated to (u ,y ). The above sequences 2 2 were chosen after some experimentation and by notic- 10 ing that the upper-left 2(cid:2)2 block of the dynamics for thestateevolution(Eq.31)hasalargerspectralradius 0 0 1000 2000 3000 4000 5000 than thelower-rightblock (whenthe couplingbetween the two blocks is removed). As a result, communicat- Figure 3: Cooling schedule. ingmoreoftenwiththe(u ,y )pairmayleadtobetter 1 1 performance. Thecoolingschedulewasthesameasintheuniformat- tention case. This time, simulatedannealingstabilized the closed-loopsystem, reducing the spectralradius to 0.795. The (cid:12)nal closed-loop eigenvalues are shown in Spectral Radius Fig. 6. with the evolution of the spectral radius of the 6 5 1 4 0.5 3 2 m 0 I 1 00 1000 2000 3000 4000 5000 −0.5 Figure 4: Evolution of spectral radius (uniform atten- −1 tion). −1 −0.5 0 0.5 1 Re Figure 6: Closed-loop eigenvalues with non-uniform at- tention, λ =0.795. max system shown in Fig. 7. 3 Spectral Radius 2 4 1 3.5 m 0 3 I 2.5 −1 2 −2 1.5 1 −3 −4 −2 0 2 4 0.5 Re 0 0 1000 2000 3000 4000 5000 Figure5:Closed-loopeigenvalueswithuniformattention, kλmaxk=2.7. Figure 7: Evolution of spectral radius (non-uniform at- tention). p. 6 8 Combining Stabilization with Feed-forward where Control 2 3 2 3 ψ(1) v(0) 6 7 6 7 We outline a method for combining stabilization and ψ~=4 66 ψ(.2) 77 v~=66 v(.1) 77, (40) 4 . 5 4 . 5 feed-forward control. Consider an LTI system subject . . to communication constraints as described in Sec. 2 ψ(µ) v(µ−1) and assume that a stabilizing feedback matrix Γ has been computed. Let fydgkkm=1, (yd(k) 2 Rp,km > 0 be µ=km/(2N −1) and a desired output sequence of (cid:12)nite duration, which we 4 want to track with the output of the system of Fig. 1. (cid:3)=[(cid:3) ] (41) ij That is, we allow the controller to send signals of the with each (2N −1)p(cid:2)(2N −1)m block (cid:3) given by form ij u(k)=Γy(k−d(σ ,σ ,k))+u0(k) (34) 8 (cid:16)Q (cid:17) w r >< C (i) j A (q) B (j) i>j where d() is a delay due to the communication con- c q=i c c straints. The signal u0(k) is a feed-forward term (cid:3)ij =>: Cc(i)Bc(j) i=j (42) 0 j >i whichissubjecttothesamecommunicationconstraints (σr,σ(w)) as the feedback signal so that it may be su- If p (cid:21) m, (cid:3) has more rows than columns. If (cid:3) has perimposed on the stabilizing inputs transmitted by maximal column-rank, then we can use its generalized the controller as Eq. 34 shows. We want to select the inverse to compute the feed-forward control sequence sequence u0(k),k =0,...,k so that the error: m that will minimize the tracking error (Eq. 35) by: Xkm v(cid:3) =((cid:3)T(cid:3))−1(cid:3)T(ψ −ψ ) (43) jjy (k)−y(k)jj (35) d ic d i=0 whereψ isthee(cid:11)ectofinitialconditionsfortheplant ic state x(0). The vector ψ will be comprised of the is minimized. In the following, we assume that the ic samples of the sequence duration (k ) of the desired signal is a multiple of m (2N −1). y (k)=CAkx(0) (44) ic If the desired output y (k) is known a-priori, then d according to Eq. 37, 40. we can make use of operator-based approaches [8] for this \preview tracking" problem, suitably modi(cid:12)ed for Therearetwonecessaryconditionsfor(cid:3)tobeinjective. discrete-time systems. In short, consider the LTI sys- First, the original system (Eq. 1) have normal rank: tem discussed in Sec. 2, in its periodic form of Eq. 17 limsupjzj=1rank(C(zI −A)−1B)=m (45) χ(k+1)=A (k)χ(k) (36) c Second, the communication sequences σ ,σ must be r w with Ac(k) = Comp(fk) N-periodic, de(cid:12)ned by admissible,topreventfromconsistently\ignoring"any Eq. 15, 16. The dynamics of the stabilized system oftheinputsoroutputs. Althoughtheseareonlyneces- (Eq. 36) must be modi(cid:12)ed to reflect the presence of saryconditions,isispossibletomodifymatterssothat the feed-forward signal u0(k). Let they are also su(cid:14)cient, by rede(cid:12)ning v~(k) in Eq. 39 to be: 2 3 v(k) =4 [uT (cid:1)(cid:1)(cid:1) uT uT ]T v(−2N −1) (k−2N+2) (k−1) (k) 6 . 7 ψ(k) =4 [yT (cid:1)(cid:1)(cid:1) yT yT ]T (37) 66 .. 77 (k−2N+2) (k−1) (k) 66 v(−1) 77 6 7 then v~=6 v(0) 7 (46) 6 7 6 v(1) 7 6 7 χ(k+1) = Ac(k)χ(k)+Bc(k)v(k) 4 .. 5 . ψ(k) = Cc(k)χ(k) (38) v(µ−1) where B 2 (2N−1)n(cid:2)m and C 2 (2N−1)p(cid:2)(2N−1)n and (cid:3) to be the new map that takes v~ to y~. This c c will be determRined by the choice of cRommunication se- corresponds to considering the \past" 2N −1 samples quence andthe parametersof the underlying liner sys- of v in order to ensure injectivity of the new map (cid:3). tem. From a least-squarespoint of view, if the costfunction penalizes the deviation from a given output y(0), then Now,feed-forwardinputsequencesaremappedto out- the controller must be given the chance to a(cid:11)ect y(0). puts by Thiscanonlybeguaranteedifweareallowedtochoose ψ~=(cid:3)v~ (39) the 2N −1 inputs preceding that output. This is not p. 7 surprising if we recall the n-step controllability results [2] V.S.BorkarandS.K.Mitter. LQGcontrolwith for discrete-time LTI systems. If the above conditions communicationconstraints. TechnicalReportLIDS-P- aresatis(cid:12)edthennon-zeroinputssentbythecontroller 2326, MIT, 1995. to the LTI system will produce non-zero outputs and [3] M. S. Branicky,V. S. Borkar,, and S.K. Mitter. (cid:3) will be kernel-free. We will not give a formal proof Auni(cid:12)edframeworkforhybridcontrol: modelandop- of this here; a moredetailed argumentcan be found in timalcontroltheory. IEEE Transactions on Automatic [9]. Control, 43(1):31{45,January 1998. We expect to include results demonstrating combined [4] R. W. Brockett. On the computer control of tracking and stabilization tasks in the (cid:12)nal version of movement. In Proceedings of the 1988 IEEE Confer- this manuscript. enceonRoboticsandAutomation,pages534{540,April 1988. [5] R. W. Brockett. Hybrid models for motion con- trol systems. In H. Trentelman and J.C. Willems, edi- 9 Conclusions and Future Work tors, Perspectives in Control, pages 29{54.Birkh¨auser, Boston, 1993. In this paper we have addressed the stabilization of LTI systems which are operating under limited com- [6] R.W.Brockett. Stabilizationofmotornetworks. munication. Our approachis based on the use of peri- In Proc. of the 34th IEEE Conference on Decision and odiccommunicationsequences whichdirectthe flowof Control, pages 1484{8,December 1995. controlandmeasurementsignalsfromthecontrollerto [7] Jon H. Davis. Stability conditions derived from theplantacrossanetworkorcommunicationbus. The spectral theory: discrete systems with periodic feed- work presented here builds on previous versions of the back. SIAM Journal of Control, 10(1):1{13, February extensi(cid:12)cationalgorithminordertohandleconstraints 1972. a(cid:11)ecting both control and measurements. There are [8] D. Hristu. Generalizedinversesfor (cid:12)nite-horizon several issues that present opportunities for further tracking.ToappearinProc.ofthe38IEEEConference workrelatedtotheextensi(cid:12)cationalgorithm,including on Decision and Control, December 1999. methods for (cid:12)nding \good" communication sequences and cooling schedules. In addition, it would be desir- [9] D. Hristu. Optimal Control with Limited Com- abletoinvestigateboundsforthespectralradiusofthe munication. PhD thesis, Harvard University, Div. of extensive form so that stopping criteria for our simu- Engineering and Applied Sciences, 1999. lated annealing algorithm can be constructed. [10] D. Hristu and K.Morgansen. Limited communi- cationcontrol.SystemsandControlLetters,37(4):193{ We have outlined a method for combining basic feed- 205, July 1999. forward control with the extensive form. Closed- [11] P. Voulgaris. Control of asynchronous sampled loop tracking is a logical next step that remains to data systems. IEEE Transactions on Automatic Con- be addressed. Finally, although our model for lim- trol, 39(7):1451{1455,July 1994. ited communication assumes timely arrival of each in- put/outputsamplefrom/tothecontroller,thismaynot [12] W.S.WongandR.W.Brockett.Systemswith(cid:12)- bethecaseinmayrealisticsituations(i.e. controlover nitebandwidthconstraints-partII:Stabilizationwith the Internet). Current e(cid:11)orts are focused on develop- limited information feedback. IEEE Transactions on ing models that can handle variability in the arrivals Automatic Control, 42(5):1049{1052,May 1999. ofcontrolsamplesaswellas\lost"samples thatfailto arrive at their destination. 10 Acknowledgment The author would like to thank Prof. R. W. Brockett for his helpful discussions on the subject. References [1] V. Blondell and J. Tsitsiklis. NP hardness of somelinearcontroldesignproblems. SIAM Journal on Control and Optimization, 35(6):2118{2127,November 1997. p. 8

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.