- Research
- Open access
- Published:
Infinite horizon linear quadratic optimal control for stochastic difference time-delay systems
Advances in Difference Equations volume 2015, Article number: 14 (2015)
Abstract
The aim of this paper is to investigate the infinite horizon linear quadratic (LQ) optimal control for stochastic time-delay difference systems with both state and control dependent noise. To do this, the notion of exact observability of a stochastic time-delay deference system is introduced and its PBH criterion is presented by the spectrum of an operator related with stochastic time-delay deference systems. Under the assumptions of stabilization and exact observability, it is shown that the optimal control law and optimal value exist, and also the properties of the associated general algebraic Ricatti equation (GARE) are discussed.
1 Introduction
As is well known, the optimal linear quadratic regulation (LQR) problem was initiated by Kalman in [1], which is one of the most important optimal control problems. In [2, 3], the authors further investigated the LQR problem of the deterministic case. In [4], Wonham first studied stochastic linear quadratic (LQ) control for Itô systems. In [5], the authors investigated LQ optimal control when the state and control weighting matrices Q and R are indefinite, and they proved the stochastic LQ optimal control may be still well posed. The discrete-time stochastic LQ problem involving state and control dependent noises has been introduced in [6]. Virtually most of the studies on optimal control in time-delay systems consider only delays in the state. By exploiting the dynamic programming approach, the authors presented a solution to the stochastic LQR problem for systems with input delay and stochastic parameter uncertainties in [7]. This paper will discuss the infinite horizon linear quadratic regulation problem for discrete-time stochastic systems with input delay and state delay. In order to guarantee the well posedness of the quadratic performance and the existence of the feedback stabilizing control law, we shall introduce some concepts such as stabilizability and exact observability, as regards which similar definitions have been well defined in [8] for stochastic Itô systems. By exact observability, we are able to discuss the infinite horizon stochastic LQ problem as well as the properties of the related generalized algebraic Ricatti equation (GARE). It is worth pointing out that, similar to the continuous context [9], stabilizability and exact observability will play an important role in discussing other problems, such as stochastic time-delay difference \(H_{2}/H_{\infty}\) control.
For stochastic time-delay difference systems, we concentrate our attention upon infinite horizon linear quadratic optimal control. This paper is organized as follows. In Section 2, by the Lyapunov equation and the H-presentation, we give the equivalent condition for the stabilizability of stochastic time-delay systems. We introduce the definition of exact observability of time-delay systems, and under the exact observability, we give an equivalent condition for the stabilizability of stochastic time-delay systems. In Section 3, under assumptions of stabilization and exact observability, we prove that the optimal control law and optimal value exist of stochastic time-delay difference systems.
To avoid confusion, we fix the following traditional notation. \(R^{n\times n}\): the set of all real matrices; \(S^{n}\): the set of all symmetric matrices; \(N=\{0,1,2,\ldots\}\); \(A'(\operatorname{Ker}(A))\): the transpose (kernel space) of a matrix A; \(A\geq0\) (\(A>0\)) is a positive semidefinite (positive definite) symmetric matrix A; I: identity matrix; \(\sigma(L)\): spectral set of the operator or matrix L; \(D(0,\alpha)=\{\lambda\mid\| \lambda\|<\alpha\}\); \(\|\cdot\|\) is the \(l_{2}\)-norm ⋅ ; \(L_{\mathcal{F}_{t}}^{2}(R^{+},R^{n_{v}})\): space of nonanticipative stochastic processes \(x(t)\in R^{n_{v}}\) with respect to an increasing σ-algebra \(\{\mathcal{F}_{t}\}_{t\geq0}\) satisfying \(E\| x(t)\|^{2}<\infty\). Finally, we make the assumption throughout this paper that all systems have real coefficients.
2 Stabilizability and exact observability
In this section, we introduce a general Lyapunov operator and the notion of exact observability of stochastic time-delay deference systems. By the spectrum of the general Lyapunov operator, we present the PBH criterion of exact observability of stochastic time-delay systems. By the Lyapunov functional approach and the H-representation in [10], some sufficient and necessary conditions of the asymptotical mean square stabilization of stochastic time-delay systems are given.
Consider the initial-value problem for the following linear difference time-delay system:
here \(x\in R^{n}\) is a column vector, \(F_{j}, G_{j},M_{j},N_{j}\in R^{n\times n}\), \(j=0,1,\ldots,m\), are constant coefficient matrices, \(u(t)\in R^{n}\) is the control input, \(\{w(t) \in R, t \in N\}\) is a sequence of real random variables defined on a complete probability space \(\{\Omega, \mathcal{F},\mathcal{F}_{t},\mu\} \) which is in a wide sense a stationary, second-order process with \(E(w(t))=0\) and \(E(w(t)w(s)) =\delta_{s,t} \), where \(\delta_{s,t}\) is the Kronecker delta with \(\mathcal{F}_{t}=\sigma\{w(s): 0\leq s \leq t\}\). \(\varphi (\theta)\in R^{n}\), \(\theta=0,-1,\ldots,-m\) are deterministic column vectors.
Definition 2.1
The trivial stationary solution \(x=0\) of system (2.1) is called mean square stabilization if there exists an input feedback K such that for any arbitrarily small number \(\varepsilon>0\), one can find a number \(\delta >0\), when \(\sup_{\theta\in[-m,0]}\|\varphi(\theta)\|<\delta\), such that
for a solution \(x(t)=x(t,x(\theta))\) satisfying the initial values \(x(\theta)=\varphi(\theta)\in R^{n}\), \(\theta=0,-1,\ldots,-m\).
Definition 2.2
The trivial stationary solution \(x=0\) of system (2.1) is called asymptotical mean square stabilization in the mean square if it is stable in the sense of Definition 2.1 and
For a state feedback control law \(u(t)=Kx(t)\), we introduce a linear operator \(\mathcal{L}_{\overline {K}}\) associated with the closed-loop system
Let \(\overline{x}(t)=[x'(t),x'(t-1),\ldots,x'(t-m)]'\) and \(\overline {u}(t)=[u'(t),u'(t-1),\ldots,u'(t-m)]'\). System (2.1) can now be written in the form of an equivalent stochastic system of dimension \(n(m + 1)\), namely,
where
Take a control input \(\overline{u}(t)=\overline{K}\overline{x}(t)\) with
and let \(X(t)=E\overline{x}(t)\overline{x}'(t)\), system (2.2) can now be written in the following form:
Now we introduce an operator
With the Kronecker matrix product, (2.4) can be rewritten in the following form:
where \(\overrightarrow{X}(t)\) denotes the \(n(m+1)\)-dimensional column vector
and \(\widehat{A}\in R^{n^{2}(m + 1)^{2}\times n^{2}(m+1)^{2}}\) has the form \(\widehat{A}=(\overline{F}+\overline{M}\overline{K})\otimes (\overline{F}+\overline{M}\overline{K})+(\overline{G}+\overline {N}\overline{K})\otimes(\overline{G}+\overline{N}K)\).
Lemma 2.1
Let \(H_{n,m}\) be a \(n^{2}(m+1)^{2}\times\frac {n(m+1)[n(m+1)+1]}{2}\) matrix and
Then \(H'_{n,m}H_{n,m}\) is invertible.
Theorem 2.1
The trivial solution \(\overrightarrow{X}(t)=0\) of system (2.5) has asymptotical stabilization if and only if, for any \(Q>0\), there exists a positive-definite matrix \(P\in S^{\frac{n(m+1)[n(m+1)+1]}{2}}\) such that P is a solution of the following Lyapunov equation:
where \(\theta(H_{n,m})=[H'_{n,m}H_{n,m}]^{-1}H'_{n,m}[(\overline {F}+\overline{M}\overline{K})\otimes(\overline{F}+\overline {M}\overline{K})+(\overline{G}+\overline{N}\overline{K})\otimes (\overline{G}+\overline{N}\overline{K})]H_{n}\).
Proof
If we set \(X(t)=E\overline{x}(t)\overline{x}'(t)\), \(X(t)\) satisfies
Since the matrix \(X(\cdot)\) is real symmetric, (2.6) is a linear matrix equation with \(\frac{n(m+1)[n(m+1)+1]}{2}\) different variables, i.e., it is in fact a \(\frac{n(m+1)[n(m+1)+1]}{2}\)th-order linear system. Assume we define a map \(\widetilde{\mathcal{L}}\) from \(S^{n(m+1)}\) to \(C^{\frac{n(m+1)[n(m+1)+1]}{2}}\) as follows:
-
for any \(Y=(Y_{ij})_{n(m+1)\times n(m+1)} \in S^{n(m+1)}\), set
$$\widetilde{Y} =\widetilde{\mathcal{L}}(\widetilde{Y})=(Y_{11},\ldots ,Y_{1n(m+1)},\ldots, Y_{n(m+1)-1,n(m+1)-1}, Y_{n(m+1)-1,n(m+1)}; Y_{n(m+1)n(m+1)})'. $$
Then there exists a unique matrix \(\Theta_{1}(H_{n,m})\in R^{\frac {n(m+1)[n(m+1)+1]}{2}\times\frac{n(m+1)[n(m+1)+1]}{2}}\), by Lemma 2.1 and the H-representation of [10], such that (2.6) is equivalent to
where \(\Theta_{1}(H_{n,m})=[H'_{n,m}H_{n,m}]^{-1}H'_{n,m}[(\overline {F}+\overline{M}\overline{K})\otimes(\overline{F}+\overline {M}\overline{K})+(\overline{G}+\overline{N}\overline{K})\otimes (\overline{G}+\overline{N}\overline{K})]H_{n,m}\), \(\widetilde{X}(t)\in R^{\frac{n(m+1)[n(m+1)+1]}{2}}\) due to \(X(t)=E\overline{x}(t)\overline {x}'(t)\) being real positive semidefinite. It is obvious, since system (2.7) is deterministic. The statement of the theorem can be established in a way that is standard for the method of Lyapunov functions for deterministic difference equations, namely, considering an \(\frac{n(m+1)[n(m+1)+1]}{2}\)-parameter Lyapunov function as a quadratic form
The role of the parameters is played by \(\frac {n(m+1)[n(m+1)+1][n(m+1)[n(m+1)+1]+2]}{8}\) elements of the positive-definite matrix P, which should be determined. □
From the proofs of Theorem 2.1, we easily get the following result.
Corollary 2.1
The trivial solution \(x(t)=0\) of system (2.1) being asymptotically mean square stabilizable is equivalent to one of the following results:
-
(1)
The trivial solution \(\overline{x}(t)=0\) of system (2.3) is asymptotically mean square stabilizable.
-
(2)
The trivial solution \(\overrightarrow{X}(t)=0\) of system (2.5) is asymptotically stabilizable.
Similar to Definition 5 of [8], we define ‘exact observability’ for stochastic time-delay difference systems as follows, which will be used in Section 3.
Definition 2.3
Consider the following linear difference system:
We call (2.9) or \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\) exactly observable, if \(y(t)=0\), a.s., \(t\in N \Rightarrow\overline{x}(0)=0\).
The following lemma extends Theorem 6 of [8] to a time-delay version by the H-representation approach in [10].
Lemma 2.2
(PBH Criterion)
\((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\) is exactly observable if and only if there does not exist \(0\neq Z\in S^{n(m+1)}\) such that
Proof
If we set \(X(t)=E\overline{x}(t)\overline{x}'(t)\), \(X(t)\) satisfies the following difference equation:
Since \(X(\cdot)\) is real symmetric, (2.11) is a linear matrix equation with \(\frac{n(m+1)[n(m+1)+1]}{2}\) different variables, i.e., it is in fact an \(\frac{n(m+1)[n(m+1)+1]}{2}\)th-order linear system. On the other hand, from Definition 2.3, \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\) is exactly observable if and only if for any arbitrary \(X(0)\neq0\), there exists a \(k_{0}\in N\) such that
In addition, since \(X(k)\geq0\) for any \(k\in N\), (2.12) is equivalent to
which is equivalent to
So (2.10) is exactly observable if and only if the deterministic system
is completely observable, where
and
By the PBH criterion for complete observability, (2.14) is completely observable if and only if there does not exist an eigenvector \(\xi\neq0\) in \(\frac{n(m+1)[n(m+1)+1]}{2}\) dimensions such that
Obviously, (2.15) is equivalent to the nonexistence of \(0\neq Z\in S^{\frac{n(m +1)[n(m +1)+1]}{2}}\) satisfying (2.10). □
Theorem 2.2
A nonsingular transformation does not change the exact detectability of the original systems.
Proof
Assume \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\) is exactly detectable; arbitrarily choose a nonsingular matrix T, let \(x(t)=T\xi(t)\); a transformed system takes the following form of \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\):
Here
If we let
then system (2.16) becomes
We shall show that (2.17) is also exactly detectable. Otherwise, by Lemma 2.2, there does not exist \(0\neq Z\in S^{n(m+1)}\) such that
Pre- and post-multiplying (2.10) by T and \(T'\), respectively, yields
If we set \(X(t)=\overline{T}Z\overline{T}'\), then from Lemma 2.2, we know that (2.19) contradicts the exact detectability of \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\). □
Lemma 2.3
If \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\) is exactly observable, then \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j})\) is asymptotically mean square stable if and only if the Lyapunov-type equation
has a solution \(P>0\).
Proof
Necessity part. If \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j})\) is asymptotically mean square stable, from the method of Lyapunov functions for linear stochastic difference equations, (2.20) has a unique solution \(P\geq0\). Now we show \(P>0\). Otherwise, there exists \(\overline{x}_{0}\neq0\) such that \(P\overline{x}_{0}=0\). We obtain, for \(T\in N\),
from which follows \(y(t)=C(x'(t),x'(t-1),\ldots ,x'(t-m))'=C\overline{x}(t)=0\), a.s., \(t\in N_{T}\). Together with the exact observability of \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\), we obtain \(\overline{x}_{0}=0\), which contradicts \(\overline{x}_{0}\neq 0\). So \(P>0\).
Sufficiency part. Assume \(P>0\) is a solution to (2.20). Let \(V(\overline{x}(t))= E\overline{x}'(t)P\overline{x}(t)\), then we have
which indicates that \(V(\overline{x}(t))\) is monotonically decreasing and bounded from below with respect to t, so \(\lim_{t\rightarrow+\infty} V(\overline{x}(t))\) exists. The rest of the proof proceeds along the lines of Theorem 6 of [8] and is omitted. □
Corollary 2.2
If \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\) is exactly observable, then the Lyapunov-type equation (2.20) has at most one positive definite solution.
Proof
If (2.20) has a positive semidefinite solution \(P_{1}\geq0\), from the proof of Lemma 2.3, we find that under the condition of exact observability, \(P_{1}>0\). So \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid C)\) is asymptotically mean square stable. By Lemma 2 in [11], (2.20) admits a unique positive semidefinite solution \(P_{1}\). □
3 Stochastic time-delay LQ control
In this section, under the assumptions of stabilization and exact observability, we investigate the problem of the existence of the optimal control law and optimal value of stochastic time-delay difference systems.
Considering the following linear stochastic system with time-delays:
For the linear stochastic time-delay controlled system (3.1), we define the admissible control input set
with the associated cost
where \(Q\geq0\), \(R>0\). The LQ optimal control problem is to find a control \(u^{*}\in u_{ad}\) called the optimal control such that
We call \(x(t)\) corresponding to \(\overline{u}^{*}(t)\) the optimal trajectory, and \(V(\overline{x}_{0})\) is the optimal cost value.
Theorem 3.1
Assume that \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}M_{j},\sum_{j=0}^{m}G_{j},\sum_{j=0}^{m}N_{j})\) is asymptotically mean square stabilizable, and \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid Q^{\frac{1}{2}})\) is exactly observable. Then the GARE
has a solution \(P>0\), which is the unique nonnegative definite solution of (3.3).
Proof
Let \(\overline{x}(t)=[x'(t),x'(t-1),\ldots ,x'(t-m)]'\), \(\overline{u}(t)=[u'(t),u'(t-1),\ldots,u'(t-m)]'\). \((\sum_{j=0}^{m}F_{j}, \sum_{j=0}^{m}M_{j},\sum_{j=0}^{m}G_{j},\sum_{j=0}^{m}N_{j})\) can be written in the form of an equivalent stochastic system of dimension \(n(m+1)\), namely,
where
For a control input \(\overline{u}(t)=\overline{K}\overline{x}(t)\), \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}M_{j},\sum_{j=0}^{m}G_{j},\sum_{j=0}^{m}N_{j})\) becomes
where
Since \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}M_{j},\sum_{j=0}^{m}G_{j},\sum_{j=0}^{m}N_{j})\) is asymptotically mean square stabilizable, (3.3) has a stabilizable solution \(P\geq0\). Since \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid Q^{\frac{1}{2}})\) is exactly observable, by Lemma 2.3, \(P>0\). By the uniqueness of stabilizable solution of (3.3), we know that (3.3) has only one positive-definite solution. □
Corollary 3.1
Assume that \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid Q^{\frac{1}{2}})\) is exactly observable. Then system \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j})\) is asymptotically mean square stable if and only if the Lyapunov equation
has a solution \(P>0\).
Lemma 3.1
In system \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j})\), \(t \in N\), \(P \in S^{n(m+1)}\), and \(\overline{x}(0) \in R^{n(m+1)}\), we have
where
Proof
It can easily be derived by the following identity:
and the fact that \(\overline{F}\overline{x}(t)+\overline {M}\overline{u}(t)\) and \(\overline{G}\overline{x}(t)+\overline{N}\overline{u}(t)\) are independent for \(w(t)\). □
Theorem 3.2
Assume that \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}M_{j},\sum_{j=0}^{m}G_{j},\sum_{j=0}^{m}N_{j})\) is asymptotically mean square stabilizable, and \((\sum_{j=0}^{m}F_{j},\sum_{j=0}^{m}G_{j}\mid Q^{\frac{1}{2}})\) is exactly observable. Then the optimal cost value is given by \(V(\overline{x}_{0})=\overline{x}'_{0}P_{1}\overline{x}_{0}\), where \(P_{1}>0\) is the unique feedback and stabilizing solution of (3.3), and the optimal control is uniquely determined by \(\overline{u}(t)=\overline{K}\overline{x}(t)\) where \(\overline {K}=-(R+\overline{M}'P_{1}\overline{M}+\overline{N}'P_{1}\overline {N})^{-1}(\overline{F}'P_{1}\overline{G}+\overline{M}'P_{1}\overline{N})'\).
Proof
Note that GARE (3.3) can be written as
From Lemma 2.3, we know (3.5) has a stabilizing solution \(P_{1}>0\), so \(\lim_{t\rightarrow+\infty}E\|\overline{x}(t)\|=0\) when \(\overline{K}=-(R+\overline{M}'P_{1}\overline{M}+\overline {N}'P_{1}\overline{N})^{-1}(\overline{F}'P_{1}\overline{G}+\overline {M}'P_{1}\overline{N})'\overline{x}(t)\).
From Lemma 3.1,
where
with
Hence we have
and the optimal control is uniquely determined by \(\overline{u}(t)=\overline{K}\overline{x}(t)\) where \(\overline {K}=-(R+\overline{M}'P_{1}\overline{M}+\overline{N}'P_{1}\overline {N})^{-1}(\overline{F}'P_{1}\overline{G}+\overline{M}'P_{1}\overline{N})'\). □
Example 3.1
Consider the following stochastic time-delay system:
Let \(\overline{x}(t)=(x(t), x(t-1))'\) and \(\overline{u}(t)=(u(t), u(t-1))'\). The above system can be written in the form of an equivalent stochastic system
Here
Now we solve the following general algebraic Ricatti equation:
Using Matlab, solving the stabilizing solution of the above GARE, i.e., solving the optimal solution of the following SDP problem:
we can get the following optimal solution (actually, from Theorem 10 in [6], we know the optimal solution is the stabilizing solution P of the above GARE), with the optimal control:
Specially put \(a_{0}=0\), \(a_{1}=1\), \(m_{0}=0\), \(m_{1}=g=n=1\), \(R=I\), and \(C=\bigl ( {\scriptsize\begin{matrix} 1&0\cr 0&1 \end{matrix}} \bigr )\), we get the stabilizing solution of the above GARE
and the optimal control
References
Kalman, RE: Contributions to the theory of optimal control. Bol. Soc. Mat. Mexicana 5, 102-119 (1960)
Anderson, BDO, Moore, JB: Optimal Control Linear Quadratic Methods. Prentice-Hall, New York (1989)
Lewis, FL: Optimal Control. Wiley, New York (1986)
Wonham, WM: On a matrix Riccati equation of stochastic control. SIAM J. Control 6, 312-326 (1968)
Chen, S, Li, X, Zhou, XY: Stochastic linear quadratic regulators with indefinite control weight costs. SIAM J. Control Optim. 36, 1685-1702 (1998)
Ait Rami, M, Chen, X, Zhou, XY: Discrete time indefinite LQ control with state and control dependent noises. J. Glob. Optim. 23, 245-265 (2002)
Song, XM, Zhang, HS, Xie, LH: Stochastic linear quadratic regulation for discrete-time linear systems with input delay. Automatica 45, 2067-2073 (2009)
Zhang, W, Chen, BS: On stabilizability and exact observability of stochastic systems with their applications. Automatica 40, 87-94 (2004)
Chen, BS, Zhang, W: Stochastic \(H_{2}/H_{\infty}\) control with state-dependent noise. IEEE Trans. Autom. Control 49, 45-57 (2004)
Zhang, W, Chen, BS: H-Representation and applications to generalized Lyapunov equations and linear stochastic systems. IEEE Trans. Autom. Control 57, 3009-3022 (2012)
Huang, Y, Zhang, WH, Zhang, HS: The infinite horizon linear quadratic optimal control for discrete-time stochastic systems. Asian J. Control 10, 608-615 (2008)
Acknowledgements
This work was supported by the National Natural Science Foundation of China under Grants 61174078 and 61201430.
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
GL performed all the steps of proof in this research and also wrote the paper. MC suggested many good ideas that made this paper possible and helped to draft the first manuscript. All authors read and approved the final manuscript.
Rights and permissions
Open Access This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
About this article
Cite this article
Li, G., Chen, M. Infinite horizon linear quadratic optimal control for stochastic difference time-delay systems. Adv Differ Equ 2015, 14 (2015). https://doi.org/10.1186/s13662-014-0342-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13662-014-0342-1