It is a well-known result that, if the pair (A,Q1/2) is stabilizable and the pair (A,C) is detectable, the solution to the discrete-time Riccati recursion
P(t+1)=AP(t)AT−AP(t)CT(CP(t)CT+R)−1CP(t)AT+Q
converges to the unique stabilizing solution of the Algebraic Riccati Equation
P=APAT−APCT(CPCT+R)−1CPAT+Q
from any initial positive semidefinite matrix P(0)=P0.
Do you know any result on the behavior of P(t) during the transient, or more generally
for all tgeq0? More precisely, I need some result that ensures that P(t) is
stabilizing for all tgeq0, under fair conditions on A, C, Q, R, and for arbitrary P0.
Thank you very much in advance,
f.
EDIT. Some background: We consider a discrete-time, linear, time invariant system of the form
x(t+1)=Ax(t)+v(t)
y(t)=Cx(t)+w(t)
where x(t)inRn, AinRntimesn, y(t)inRp, CinRptimesn, and v and w are zero-mean, uncorrelated white noises (say, Gaussian) with appropriate dimensions, with variances Q and R respectively. The predictor hatx(t+1|t), that is, the best linear estimator of x(t+1) given y(0),cdots,y(t), is given by the Kalman filter. It can be expressed recursively substituting the equations of the Kalman filter into each other. The variance P(t+1) of the corresponding prediction error tildex(t+1|t)=hatx(t+1|t)−x(t+1) is then given (recursively) by the Riccati equation.
The "dual" of the linear estimation problem above is the "linear quadratic regulator" problem of optimal control, and the Riccati equation is fundamental also in this context.
Given matrices AinRntimesn and BinRntimesm with mleqn, to say that the pair (A,B) is reachable means that the matrix [BABcdotsAn−1B] has full rank (=n). The pair (A,B) is stabilizable if with a suitable "change of base" (A,B)mapsto(T−1AT,T−1B) it can be put in the form
left(left[begin{matrix} A_{11} & A_{12} \\ 0 & A_{22} \\ end{matrix}right], left[begin{matrix} B_1 \\ 0 \\ end{matrix}right]right)
where (A11,B1) is reachable and A22 is Hurwitz (all eigenvalues in the interior of the unit disk).
Dually, given matrices AinRntimesn and CinRptimesn with pleqn, to say that the pair (A,C) is observable means that the matrix
left[beginmatrixCCAvdotsCAn−1endmatrixright]
has full rank. The pair (A,C) is detectable if with a suitable "change of base" (A,C)mapsto(T−1AT,CT) it can be put in the form
left(left[begin{matrix} A_{11} & 0 \\ A_{21} & A_{22} \\ end{matrix}right], left[begin{matrix} C_1 & 0 end{matrix}right]right)
where (A11,C1) is observable and A22 is Hurwitz.
Finally, the matrix P=PTgeq0 is stabilizing if the "closed loop" matrix A−KC is Hurwitz, where K=APCT(CPCT+R)−1.
For more details, see for example the Wikipedia pages on Kalman filter, controllability, observability, and Kalman decomposition. For a full reference, see e.g. A. H. Jazwinski, Stochastic Processes and Filtering Theory.
No comments:
Post a Comment