Controllability Explained

Controllability is an important property of a control system and plays a crucial role in many control problems, such as stabilization of unstable systems by feedback, or optimal control.

Controllability and observability are dual aspects of the same problem.

Roughly, the concept of controllability denotes the ability to move a system around in its entire configuration space using only certain admissible manipulations. The exact definition varies slightly within the framework or the type of models applied.

The following are examples of variations of controllability notions which have been introduced in the systems and control literature:

State controllability

The state of a deterministic system, which is the set of values of all the system's state variables (those variables characterized by dynamic equations), completely describes the system at any given time. In particular, no information on the past of a system is needed to help in predicting the future, if the states at the present time are known and all current and future values of the control variables (those whose values can be chosen) are known.

Complete state controllability (or simply controllability if no other context is given) describes the ability of an external input (the vector of control variables) to move the internal state of a system from any initial state to any final state in a finite time interval.[1]

That is, we can informally define controllability as follows:If for any initial state

x0
and any final state
xf
there exists an input sequence to transfer the system state from
x0
to
xf
in a finite time interval, then the system modeled by the state-space representation is controllable. For the simplest example of a continuous, LTI system, the row dimension of the state space expression
x

=Ax(t)+Bu(t)

determines the interval; each row contributes a vector in the state space of the system. If there are not enough such vectors to span the state space of

x

, then the system cannot achieve controllability. It may be necessary to modify

A

and

B

to better approximate the underlying differential relationships it estimates to achieve controllability.

Controllability does not mean that a reached state can be maintained, merely that any state can be reached.

Controllability does not mean that arbitrary paths can be made through state space, only that there exists a path within the prescribed finite time interval.

Continuous linear systems

Consider the continuous linear system [2]

x

(t)=A(t)x(t)+B(t)u(t)

y(t)=C(t)x(t)+D(t)u(t).

There exists a control

u

from state

x0

at time

t0

to state

x1

at time

t1>t0

if and only if

x1-\phi(t0,t1)x0

is in the column space of

W(t0,t1)=

t1
\int
t0
T
\phi(t
0,t)B(t)B(t)
T
\phi(t
0,t)

dt

where

\phi

is the state-transition matrix, and

W(t0,t1)

is the Controllability Gramian.

In fact, if

η0

is a solution to

W(t0,t1)η=x1-\phi(t0,t1)x0

then a control given by

u(t)=-B(t)T

T
\phi(t
0,t)

η0

would make the desired transfer.

Note that the matrix

W

defined as above has the following properties:

W(t0,t1)

is symmetric

W(t0,t1)

is positive semidefinite for

t1\geqt0

W(t0,t1)

satisfies the linear matrix differential equation
d
dt

W(t,t1)=A(t)W(t,t1)+W(t,t

T
1)A(t)

-B(t)B(t)T,W(t1,t1)=0

W(t0,t1)

satisfies the equation

W(t0,t1)=W(t0,t)+\phi(t0,t)W(t,t1)\phi(t

T
0,t)
[3]

Rank condition for controllability

The Controllability Gramian involves integration of the state-transition matrix of a system. A simpler condition for controllability is a rank condition analogous to the Kalman rank condition for time-invariant systems.

Consider a continuous-time linear system

\Sigma

smoothly varying in an interval

[t0,t]

of

R

:
x

(t)=A(t)x(t)+B(t)u(t)

y(t)=C(t)x(t)+D(t)u(t).

The state-transition matrix

\phi

is also smooth. Introduce the n x m matrix-valued function

M0(t)=\phi(t0,t)B(t)

and define

Mk(t)

=
dkM0
dtk

(t),k\geqslant1

. Consider the matrix of matrix-valued functions obtained by listing all the columns of the

Mi

,

i=0,1,\ldots,k

:

M(k)(t):=\left[M0(t),\ldots,Mk(t)\right]

.

If there exists a

\bar{t}\in[t0,t]

and a nonnegative integer k such that

\operatorname{rank}M(k)(\bar{t})=n

, then

\Sigma

is controllable.[4]

If

\Sigma

is also analytically varying in an interval

[t0,t]

, then

\Sigma

is controllable on every nontrivial subinterval of

[t0,t]

if and only if there exists a

\bar{t}\in[t0,t]

and a nonnegative integer k such that

\operatorname{rank}M(k)(ti)=n

.

The above methods can still be complex to check, since it involves the computation of the state-transition matrix

\phi

. Another equivalent condition is defined as follow. Let

B0(t)=B(t)

, and for each

i\geq0

, define

Bi+1(t)

=

A(t)Bi(t)-

d
dt

Bi(t).

In this case, each

Bi

is obtained directly from the data

(A(t),B(t)).

The system is controllable if there exists a

\bar{t}\in[t0,t]

and a nonnegative integer

k

such that

rm{rank}(\left[B0(\bar{t}),B1(\bar{t}),\ldots,Bk(\bar{t})\right])=n

.

Example

Consider a system varying analytically in

(-infty,infty)

and matrices

A(t)=\begin{bmatrix} t&1&0\ 0&t3&0\ 0&0&t2\end{bmatrix}

,

B(t)=\begin{bmatrix} 0\ 1\ 1\end{bmatrix}.

Then

[B0(0),B1(0),B2(0),B3(0)]=\begin{bmatrix} 0&1&0&-1\ 1&0&0&0\ 1&0&0&2 \end{bmatrix}

and since this matrix has rank 3, the system is controllable on every nontrivial interval of

R

.

Continuous linear time-invariant (LTI) systems

Consider the continuous linear time-invariant system

x

(t)=Ax(t)+Bu(t)

y(t)=Cx(t)+Du(t)

where

x

is the

n x 1

"state vector",

y

is the

m x 1

"output vector",

u

is the

r x 1

"input (or control) vector",

A

is the

n x n

"state matrix",

B

is the

n x r

"input matrix",

C

is the

m x n

"output matrix",

D

is the

m x r

"feedthrough (or feedforward) matrix".

The

n x nr

controllability matrix is given by

R=\begin{bmatrix}B&AB&A2B&...&An-1B\end{bmatrix}

The system is controllable if the controllability matrix has full row rank (i.e.

\operatorname{rank}(R)=n

).

Discrete linear time-invariant (LTI) systems

For a discrete-time linear state-space system (i.e. time variable

k\inZ

) the state equation is

bf{x}(k+1)=Abf{x}(k)+Bbf{u}(k)

where

A

is an

n x n

matrix and

B

is a

n x r

matrix (i.e.

u

is

r

inputs collected in a

r x 1

vector). The test for controllability is that the

n x nr

matrix

l{C}=\begin{bmatrix}B&AB&A2B&&An-1B\end{bmatrix}

has full row rank (i.e.,

\operatorname{rank}(lC)=n

). That is, if the system is controllable,

lC

will have

n

columns that are linearly independent; if

n

columns of

lC

are linearly independent, each of the

n

states is reachable by giving the system proper inputs through the variable

u(k)

.

Derivation

Given the state

bf{x}(0)

at an initial time, arbitrarily denoted as k=0, the state equation gives

bf{x}(1)=Abf{x}(0)+Bbf{u}(0),

then

bf{x}(2)=Abf{x}(1)+Bbf{u}(1)=A2bf{x}(0)+ABbf{u}(0)+Bbf{u}(1),

and so on with repeated back-substitutions of the state variable, eventually yielding

bf{x}(n)=Bbf{u}(n-1)+ABbf{u}(n-2)++An-1Bbf{u}(0)+Anbf{x}(0)

or equivalently

bf{x}(n)-Anbf{x}(0)=[BABAn-1B][bf{u}T(n-1)bf{u}T(n-2)bf{u}T(0)]T.

Imposing any desired value of the state vector

bf{x}(n)

on the left side, this can always be solved for the stacked vector of control vectors if and only if the matrix of matrices at the beginning of the right side has full row rank.

Example

For example, consider the case when

n=2

and

r=1

(i.e. only one control input). Thus,

B

and

AB

are

2 x 1

vectors. If

\begin{bmatrix}B&AB\end{bmatrix}

has rank 2 (full rank), and so

B

and

AB

are linearly independent and span the entire plane. If the rank is 1, then

B

and

AB

are collinear and do not span the plane.

Assume that the initial state is zero.

At time

k=0

:

x(1)=Abf{x}(0)+Bbf{u}(0)=Bbf{u}(0)

At time

k=1

:

x(2)=Abf{x}(1)+Bbf{u}(1)=ABbf{u}(0)+Bbf{u}(1)

At time

k=0

all of the reachable states are on the line formed by the vector

B

.At time

k=1

all of the reachable states are linear combinations of

AB

and

B

.If the system is controllable then these two vectors can span the entire plane and can be done so for time

k=2

.The assumption made that the initial state is zero is merely for convenience.Clearly if all states can be reached from the origin then any state can be reached from another state (merely a shift in coordinates).

This example holds for all positive

n

, but the case of

n=2

is easier to visualize.

Analogy for example of n = 2

Consider an analogy to the previous example system.You are sitting in your car on an infinite, flat plane and facing north.The goal is to reach any point in the plane by driving a distance in a straight line, come to a full stop, turn, and driving another distance, again, in a straight line.If your car has no steering then you can only drive straight, which means you can only drive on a line (in this case the north-south line since you started facing north).The lack of steering case would be analogous to when the rank of

C

is 1 (the two distances you drove are on the same line).

Now, if your car did have steering then you could easily drive to any point in the plane and this would be the analogous case to when the rank of

C

is 2.

If you change this example to

n=3

then the analogy would be flying in space to reach any position in 3D space (ignoring the orientation of the aircraft).You are allowed to:

Although the 3-dimensional case is harder to visualize, the concept of controllability is still analogous.

Nonlinear systems

Nonlinear systems in the control-affine form

x

=f(x)+

m
\sum
i=1

gi(x)ui

are locally accessible about

x0

if the accessibility distribution

R

spans

n

space, when

n

equals the rank of

x

and R is given by:[5]

R=\begin{bmatrix}g1&&gm&

k
[ad
gi
gj]

&&

k
[ad
f
gi]

\end{bmatrix}.

Here,

k
[ad
f

g]

is the repeated Lie bracket operation defined by
k
[ad
f

g]=\begin{bmatrix}f&&j&&[f,g]\end{bmatrix}.

The controllability matrix for linear systems in the previous section can in fact be derived from this equation.

Null Controllability

If a discrete control system is null-controllable, it means that there exists a controllable

u(k)

so that

x(k0)=0

for some initial state

x(0)=x0

. In other words, it is equivalent to the condition that there exists a matrix

F

such that

A+BF

is nilpotent.

This can be easily shown by controllable-uncontrollable decomposition.

Output controllability

Output controllability is the related notion for the output of the system (denoted y in the previous equations); the output controllability describes the ability of an external input to move the output from any initial condition to any final condition in a finite time interval. It is not necessary that there is any relationship between state controllability and output controllability. In particular:

For a linear continuous-time system, like the example above, described by matrices

A

,

B

,

C

, and

D

, the

m x (n+1)r

output controllability matrix

\begin{bmatrix}CB&CAB&CA2B&&CAn-1B&D\end{bmatrix}

has full row rank (i.e. rank

m

) if and only if the system is output controllable.

Controllability under input constraints

In systems with limited control authority, it is often no longer possible to move any initial state to any final state inside the controllable subspace. This phenomenon is caused by constraints on the input that could be inherent to the system (e.g. due to saturating actuator) or imposed on the system for other reasons (e.g. due to safety-related concerns). The controllability of systems with input and state constraints is studied in the context of reachability[6] and viability theory.[7]

Controllability in the behavioral framework

In the so-called behavioral system theoretic approach due to Willems (see people in systems and control), models considered do not directly define an input - output structure. In this framework systems are described by admissible trajectories of a collection of variables, some of which might be interpreted as inputs or outputs.

A system is then defined to be controllable in this setting, if any past part of a behavior (trajectory of the external variables) can be concatenated with any future trajectory of the behavior in such a way that the concatenation is contained in the behavior, i.e. is part of the admissible system behavior.[8]

Stabilizability

A slightly weaker notion than controllability is that of stabilizability. A system is said to be stabilizable when all uncontrollable state variables can be made to have stable dynamics. Thus, even though some of the state variables cannot be controlled (as determined by the controllability test above) all the state variables will still remain bounded during the system's behavior.[9]

Reachable set

Let T ∈ Т and x ∈ X (where X is the set of all possible states and Т is an interval of time). The reachable set from x in time T is defined as:

RT{(x)}=\left\{z\inX:x\overset{T}{}z\right\}

, where xz denotes that there exists a state transition from x to z in time T.

For autonomous systems the reachable set is given by :

Im(R)=Im(B)+Im(AB)+....+Im(An-1B)

,where R is the controllability matrix.

In terms of the reachable set, the system is controllable if and only if

Im(R)=Rn

.

Proof We have the following equalities:

R=[BAB....An-1B]

Im(R)=Im([BAB....An-1B])

dim(Im(R))=rank(R)

Considering that the system is controllable, the columns of R should be linearly independent. So:

dim(Im(R))=n

rank(R)=n

Im(R)=\Rn\blacksquare

A related set to the reachable set is the controllable set, defined by:

CT{(x)}=\left\{z\inX:z\overset{T}{}x\right\}

.The relation between reachability and controllability is presented by Sontag:

(a) An n-dimensional discrete linear system is controllable if and only if:

R(0)=Rk{(0)=X}

(Where X is the set of all possible values or states of x and k is the time step).(b) A continuous-time linear system is controllable if and only if:

R(0)=Re{(0)=X}

for all e>0.if and only if

C(0)=Ce{(0)=X}

for all e>0.

ExampleLet the system be an n dimensional discrete-time-invariant system from the formula:

Φ(n,0,0,w)=

n
\sum\limits
i=1

Ai-1Bw(n-1)

(Where Φ(final time, initial time, state variable, restrictions) is defined is the transition matrix of a state variable x from an initial time 0 to a final time n with some restrictions w).It follows that the future state is in

Rk{(0)}

⇔ it is in the image of the linear map:

Im(R)=R(A,B)≜ Im(

[BAB....An-1B]

),which maps,

un

→XWhen

u=Km

and

X=Kn

we identify R(A,B) with a n by nm matrix whose columns are the columns of

B,AB,....,An-1B

in that order. If the system is controllable the rank of

[BAB....An-1B]

is n. If this is truth, the image of the linear map R is all of X. Based on that, we have:

R(0)=Rk{(0)=X}

with XЄ

\Rn

.

See also

External links

Notes and References

  1. Book: Katsuhiko Ogata. Modern Control Engineering. 3rd. 1997. Prentice-Hall. Upper Saddle River, NJ. 978-0-13-227307-7.
  2. A linear time-invariant system behaves the same but with the coefficients being constant in time.
  3. Book: Brockett, Roger W.. Finite Dimensional Linear Systems. John Wiley & Sons. 1970. 978-0-471-10585-5.
  4. Eduardo D. Sontag, Mathematical Control Theory: Deterministic Finite Dimensional Systems.
  5. Isidori, Alberto (1989). Nonlinear Control Systems, p. 92–3. Springer-Verlag, London. .
  6. Claire J. Tomlin . Ian Mitchell . Alexandre M. Bayen . Meeko Oishi. Meeko Oishi . Computational Techniques for the Verification of Hybrid Systems . Proceedings of the IEEE . 91 . 7 . 986–1001 . 10.1109/jproc.2003.814621 . 2003 . 2012-03-04. 10.1.1.70.4296 .
  7. Book: Jean-Pierre Aubin . Viability Theory . 1991 . Birkhauser . 978-0-8176-3571-8 .
  8. Book: Jan Polderman . Jan Willems . Introduction to Mathematical Systems Theory: A Behavioral Approach. 1st. 1998. Springer Verlag. New York. 978-0-387-98266-3.
  9. Book: Brian D.O. Anderson. John B. Moore. Optimal Control: Linear Quadratic Methods. 1990. Prentice Hall. Englewood Cliffs, NJ. 978-0-13-638560-8.