Estimation of signal parameters via rotational invariance techniques explained

Estimation of signal parameters via rotational invariant techniques (ESPRIT), is a technique to determine the parameters of a mixture of sinusoids in background noise. This technique was first proposed for frequency estimation. However, with the introduction of phased-array systems in everyday technology, it is also used for angle of arrival estimations.[1]

One-dimensional ESPRIT

\omegak

. This frequency only depends on the index of the system's input, i.e., k. The goal of ESPRIT is to estimate

\omegak

's, given the outputs y_m[t] and the number of input signals, K. Since the radial frequencies are the actual objectives, a_ is denoted as a_m(\omega_k).

Collating the weights a_m(\omega_k) as

a(\omegak)=[1

-j\omegak
e
-j2\omegak
e

...

-j(M-1)\omegak
e

]\top

and the M output signals at instance t as

y[t]=[y1[t]y2[t]...

\top
y
M[t]]
, \mathbf y[t] = \sum_^K \mathbf a(\omega_k) x_k[t] + \mathbf[t],where

n[t]=[n1[t]n2[t]...

\top
n
M[t]]
. Further, when the weight vectors

a(\omega1),a(\omega2),...,a(\omegaK)

are put into a Vandermonde matrix

A=[a(\omega1)a(\omega2)...a(\omegaK)]

, and the

K

inputs at instance t into a vector

x[t]=[x1[t]...

\top
x
k[t]]
, we can write \mathbf y[t] = \mathbf A\, \mathbf x[t]+\mathbf n[t].With several measurements at instances t = 1, 2, \dots, T and the notations \mathbf = [\,\mathbf{y}[1] \,\ \mathbf[2] \,\ \dots \,\ \mathbf[T ]\,], \mathbf = [\,\mathbf{x}[1] \,\ \mathbf[2] \,\ \dots \,\ \mathbf[T ]\,] and \mathbf = [\, \mathbf{n}[1] \,\ \mathbf[2] \,\ \dots \,\ \mathbf[T ]\,], the model equation becomes\mathbf = \mathbf \mathbf + \mathbf.

Dividing into virtual sub-arrays

The weight vector \mathbf a(\omega_k) has the property that adjacent entries are related.[\mathbf{a}(\omega_k)]_ = e^ [\mathbf{a}(\omega_k)]_For the whole vector

a(\omegak)

, the equation introduces two selection matrices

J1

and

J2

:

J1=[IM-1

and

J2=[0IM-1]

. Here,

IM-1

is an identity matrix of size (M-1) and

0

is a vector of zero.

The vectors

J1a(\omegak)

[J2a(\omegak)]

contains all elements of

a(\omegak)

except the last [first] one. Thus,

J2a(\omegak)=

-j\omegak
e

J1a(\omegak)

and\mathbf J_2 \mathbf A = \mathbf J_1 \mathbf A \mathbf H,\quad\text\quad := \begin e^ & \\ & e^ \\ & & \ddots \\ & & & e^ \end.The above relation is the first major observation required for ESPRIT. The second major observation concerns the signal subspace that can be computed from the output signals.

Signal subspace

The singular value decomposition (SVD) of \mathbf Y is given as\mathbf = \mathbf U \mathbf \Sigma \mathbf V^\daggerwhere \mathbf U \in\mathbb C^ and \mathbf V \in \mathbb C^ are unitary matrices and \mathbf \Sigma is a diagonal matrix of size M \times T, that holds the singular values from the largest (top left) in descending order. The operator \dagger denotes the complex-conjugate transpose (Hermitian transpose).

Let us assume that T \geq M. Notice that we have K input signals. If there was no noise, there would only be K non-zero singular values. We assume that the K largest singular values stem from these input signals and other singular values are presumed to stem from noise. The matrices in SVD of \mathbf Y can be partitioned into submatrices, where some submatrices correspond to the signal subspace and some correspond to the noise subspace. \begin\mathbf = \begin\mathbf_\mathrm &\mathbf_\mathrm\end,& & \mathbf = \begin\mathbf_\mathrm & \mathbf & \mathbf \\\mathbf & \mathbf_\mathrm & \mathbf\end,& & \mathbf = \begin\mathbf_\mathrm &\mathbf_\mathrm &\mathbf_\mathrm\end\end,where \mathbf_\mathrm \in \mathbb^ and \mathbf_\mathrm \in \mathbb^ contain the first K columns of \mathbf U and \mathbf V, respectively and \mathbf_\mathrm \in \mathbb^is a diagonal matrix comprising the K largest singular values.

Thus, The SVD can be written as \mathbf = \mathbf U_\mathrm \mathbf \Sigma_\mathrm \mathbf V_\mathrm^\dagger + \mathbf U_\mathrm \mathbf \Sigma_\mathrm \mathbf V_\mathrm^\dagger,where \mathbf_\mathrm, ⁣\mathbf_\mathrm, and \mathbf_\mathrm represent the contribution of the input signal x_k[t] to \mathbf Y. We term \mathbf_\mathrm the signal subspace. In contrast, \mathbf_\mathrm, \mathbf_\mathrm, and \mathbf_\mathrm represent the contribution of noise n_m[t ] to \mathbf Y.

Hence, from the system model, we can write \mathbf \mathbf = \mathbf U_\mathrm \mathbf \Sigma_\mathrm \mathbf V_\mathrm^\dagger and \mathbf = \mathbf U_\mathrm \mathbf\Sigma_\mathrm \mathbf V_\mathrm^\dagger. Also, from the former, we can write \mathbf U_\mathrm= \mathbf \mathbf,where

{F}=XVS

-1
\Sigma
S

. In the sequel, it is only important that there exist such an invertible matrix \mathbf F and its actual content will not be important.

Note: The signal subspace can also be extracted from the spectral decomposition of the auto-correlation matrix of the measurements, which is estimated as\mathbf_\mathrm = \frac\sum_^T \mathbf[t] \mathbf[t] ^\dagger=\frac\mathbf \mathbf^\dagger= \frac\mathbf U \mathbf U^\dagger= \frac\mathbf U_\mathrm \mathbf \Sigma_\mathrm^2 \mathbf U_\mathrm^\dagger+\frac\mathbf U_\mathrm \mathbf \Sigma_\mathrm^2 \mathbf U_\mathrm^\dagger.

Estimation of radial frequencies

We have established two expressions so far: \mathbf J_2 \mathbf A = \mathbf J_1 \mathbf A \mathbf H and \mathbf U_\mathrm = \mathbf \mathbf. Now, \begin\mathbf J_2 \mathbf A = \mathbf J_1 \mathbf A \mathbf H\implies\mathbf J_2 (\mathbf U_\mathrm \mathbf^)= \mathbf J_1 (\mathbf U_\mathrm \mathbf^) \mathbf H\implies\mathbf S_2 = \mathbf S_1 \mathbf,\endwhere

S1=J1US

and

S2=J2US

denote the truncated signal sub spaces, and \mathbf = \mathbf^ \mathbf H \mathbf.The above equation has the form of an eigenvalue decomposition, and the phases of eigenvalues in the diagonal matrix

H

are used to estimate the radial frequencies.

Thus, after solving for

P

in the relation

S2=S1P

, we would find the eigenvalues \lambda_1,\ \lambda_2,\ \ldots,\ \lambda_K of

P

, where

λk=\alphak

j\omegak
e
, and the radial frequencies

\omega1,\omega2,\ldots,\omegaK

are estimated as the phases (argument) of the eigenvalues.

Remark: In general,

S1

is not invertible. One can use the least squares estimate

{P}=

\dagger
(S
1
-1
{S
1})
\dagger
S
1

{S2}

. An alternative would be the total least squares estimate.

Algorithm summary

Input: Measurements \mathbf := [\,\mathbf{y}[1] \,\ \mathbf[2] \,\ \dots \,\ \mathbf[T ]\,], the number of input signals K (estimate if not already known).

  1. Compute the singular value decomposition (SVD) of \mathbf = \mathbf U \mathbf \Sigma \mathbf V^\dagger and extract the signal subspace \mathbf_\mathrm \in \mathbb^

as the first K columns of \mathbf U .

  1. Compute \mathbf_1 = \mathbf J_1 \mathbf U_\mathrm and \mathbf_2 = \mathbf J_2 \mathbf U_\mathrm

, where

J1=[IM-10]

and

J2=[0IM-1]

.
  1. Solve for \mathbf

in \mathbf S_2 = \mathbf S_1 \mathbf (see the remark above).

  1. Compute the eigenvalues \lambda_1, \lambda_2, \ldots, \lambda_K

of \mathbf.

  1. The phases of the eigenvalues \lambda_k = \alpha_k \mathrm^

provide the radial frequencies \omega_k, i.e., \omega_k = \arg \lambda_k

Notes

Choice of selection matrices

In the derivation above, the selection matrices

J1=[IM-10]

and

J2=[0IM-1]

were used. However, any appropriate matrices \mathbf J_1 and

J2

may be used as long as the rotational invariance (i.e.,

J2a(\omegak)=

-j\omegak
e

J1a(\omegak)

) , or some generalization of it (see below) holds; accordingly, the matrices \mathbf J_1 \mathbf A and \mathbf J_2 \mathbf A may contain any rows of \mathbf A .

Generalized rotational invariance

The rotational invariance used in the derivation may be generalized. So far, the matrix

H

has been defined to be a diagonal matrix that stores the sought-after complex exponentials on its main diagonal. However,

H

may also exhibit some other structure.[2] For instance, it may be an upper triangular matrix. In this case, \mathbf = \mathbf^ \mathbf H \mathbfconstitutes a triangularization of

P

.

See also

Further reading

Notes and References

  1. Volodymyr Vasylyshyn. The direction of arrival estimation using ESPRIT with sparse arrays.// Proc. 2009 European Radar Conference (EuRAD). – 30 Sept.-2 Oct. 2009. - Pp. 246 - 249. - https://ieeexplore.ieee.org/abstract/document/5307000
  2. Hu . Anzhong . Lv . Tiejun . Gao . Hui . Zhang . Zhang . Yang . Shaoshi . 2014 . An ESPRIT-Based Approach for 2-D Localization of Incoherently Distributed Sources in Massive MIMO Systems . IEEE Journal of Selected Topics in Signal Processing . 8 . 5 . 996–1011 . 10.1109/JSTSP.2014.2313409 . 1403.5352 . 2014ISTSP...8..996H . 11664051 . 1932-4553.