In control theory, a separation principle, more formally known as a principle of separation of estimation and control, states that under some assumptions the problem of designing an optimal feedback controller for a stochastic system can be solved by designing an optimal observer for the state of the system, which feeds into an optimal deterministic controller for the system. Thus the problem can be broken into two separate parts, which facilitates the design.
The first instance of such a principle is in the setting of deterministic linear systems, namely that if a stable observer and a stable state feedback are designed for a linear time-invariant system (LTI system hereafter), then the combined observer and feedback is stable. The separation principle does not hold in general for nonlinear systems.
Another instance of the separation principle arises in the setting of linear stochastic systems, namely that state estimation (possibly nonlinear) together with an optimal state feedback controller designed to minimize a quadratic cost, is optimal for the stochastic control problem with output measurements. When process and observation noise are Gaussian, the optimal solution separates into a Kalman filter and a linear-quadratic regulator. This is known as linear-quadratic-Gaussian control. More generally, under suitable conditions and when the noise is a martingale (with possible jumps), again a separation principle applies and is known as the separation principle in stochastic control.[1] [2] [3] [4] [5] [6]
The separation principle also holds for high gain observers used for state estimation of a class of nonlinear systems[7] and control of quantum systems.
Consider a deterministic LTI system:
\begin{align} x |
(t)&=Ax(t)+Bu(t)\\ y(t)&=Cx(t) \end{align}
where
u(t)
y(t)
x(t)
We can design an observer of the form
\hat{x |
and state feedback
u(t)=-K\hat{x}.
Define the error e:
e=x-\hat{x}.
Then
e |
=(A-LC)e
u(t)=-K(x-e).
Now we can write the closed-loop dynamics as
\begin{bmatrix} x | \\ |
e |
\\ \end{bmatrix}=\begin{bmatrix} A-BK&BK\\ 0&A-LC\\ \end{bmatrix} \begin{bmatrix} x\\ e\\ \end{bmatrix}.
Since this is a triangular matrix, the eigenvalues are just those of A - BK together with those of A - LC.[8] Thus the stability of the observer and feedback are independent.