In mathematics, in linear algebra, a Weyr canonical form (or, Weyr form or Weyr matrix) is a square matrix which (in some sense) induces "nice" properties with matrices it commutes with. It also has a particularly simple structure and the conditions for possessing a Weyr form are fairly weak, making it a suitable tool for studying classes of commuting matrices. A square matrix is said to be in the Weyr canonical form if the matrix has the structure defining the Weyr canonical form. The Weyr form was discovered by the Czech mathematician Eduard Weyr in 1885.[1] [2] [3] The Weyr form did not become popular among mathematicians and it was overshadowed by the closely related, but distinct, canonical form known by the name Jordan canonical form.[3] The Weyr form has been rediscovered several times since Weyr’s original discovery in 1885.[4] This form has been variously called as modified Jordan form, reordered Jordan form, second Jordan form, and H-form.[4] The current terminology is credited to Shapiro who introduced it in a paper published in the American Mathematical Monthly in 1999.[4] [5]
Recently several applications have been found for the Weyr matrix. Of particular interest is an application of the Weyr matrix in the study of phylogenetic invariants in biomathematics.
n1+n2+ … +nr=n
n
n1\gen2\ge … \genr\ge1
W
r x r
(Wij)
(i,j)
Wij
ni x nj
Wii
ni x ni
λI
i=1,\ldots,r
Wi,i+1
ni x ni+1
i=1,\ldots,r-1
Wij=0
j\nei,i+1
In this case, we say that
W
(n1,n2,\ldots,nr)
The following is an example of a basic Weyr matrix.
W=
= \begin{bmatrix} W11&W12&&\\ &W22&W23&\\ &&W33&W34\\ &&&W44\\ \end{bmatrix}
In this matrix,
n=9
n1=4,n2=2,n3=2,n4=1
W
(4,2,2,1)
W11= \begin{bmatrix} λ&0&0&0\\ 0&λ&0&0\\ 0&0&λ&0\\ 0&0&0&λ\\ \end{bmatrix}=λI4, W22= \begin{bmatrix} λ&0\\ 0&λ&\\ \end{bmatrix}=λI2, W33= \begin{bmatrix} λ&0\\ 0&λ&\\ \end{bmatrix}=λI2, W44= \begin{bmatrix} λ\\ \end{bmatrix}=λI1
and
W12= \begin{bmatrix} 1&0\\ 0&1\\ 0&0\\ 0&0\\ \end{bmatrix}, W23= \begin{bmatrix} 1&0\\ 0&1\\ \end{bmatrix}, W34= \begin{bmatrix} 1\\ 0\\ \end{bmatrix}.
Let
W
λ1,\ldots,λk
W
W
W
W= \begin{bmatrix} W1&&&\\ &W2&&\\ &&\ddots&\\ &&&Wk\\ \end{bmatrix}
where
Wi
λi
i=1,\ldots,k
The following image shows an example of a general Weyr matrix consisting of three basic Weyr matrix blocks. The basic Weyr matrix in the top-left corner has the structure (4,2,1) with eigenvalue 4, the middle block has structure (2,2,1,1) with eigenvalue -3 and the one in the lower-right corner has the structure (3, 2) with eigenvalue 0.
The Weyr canonical form
W=P-1JP
J
P
That the Weyr form is a canonical form of a matrix is a consequence of the following result:[3] Each square matrix
A
W
W
A
Let
A
n
A
λ1,λ2,\ldots,λk
A
A= \begin{bmatrix} λ1I+N1&&&\\ &λ2I+N2&&\\ &&\ddots&\\ &&&λkI+Nk\\ \end{bmatrix} = \begin{bmatrix} λ1I&&&\\ &λ2I&&\\ &&\ddots&\\ &&&λkI\\ \end{bmatrix} + \begin{bmatrix} N1&&&\\ &N2&&\\ &&\ddots&\\ &&&Nk\\ \end{bmatrix} = D+N
where
D
N
[D,N]=0
N
Ni
A
Ni
Given a nilpotent square matrix
A
n
F
C
W
W=C-1AC
Step 1
Let
A1=A
Step 2
A1
A1
n
Fn
P1
-1 | |
P | |
1 |
A1P1=\begin{bmatrix}0&B2\ 0&A2\end{bmatrix}
A2
n
(A1)
Step 3
If
A2
A2
A2
A2
n
(A1)
P2
-1 | |
P | |
2 |
A2P2=\begin{bmatrix}0&B3\ 0&A3\end{bmatrix}
A2
n
(A1)
(A2)
Step 4
Continue the processes of Steps 1 and 2 to obtain increasingly smaller square matrices
A1,A2,A3,\ldots
P1,P2,P3,\ldots
Ar
Step 5
The Weyr structure of
A
(n1,n2,\ldots,nr)
ni
(Ai)
Step 6
P=P1\begin{bmatrix}I&0\ 0&P2\end{bmatrix}\begin{bmatrix}I&0\ 0&P3\end{bmatrix} … \begin{bmatrix}I&0\ 0&Pr\end{bmatrix}
I
X=P-1AP
X
X=\begin{bmatrix}0&X12&X13& … &X1,r-1&X1r\ &0&X23& … &X2,r-1&X2r\ &&&\ddots&\ &&& … &0&Xr-1,r\ &&&&&0\end{bmatrix}
Step 7
Use elementary row operations to find an invertible matrix
Yr-1
Yr-1Xr,r-1
Ir,r-1=\begin{bmatrix}I\ O\end{bmatrix}
Step 8
Set
Q1=
(I,I,\ldots,
-1 | |
Y | |
r-1 |
,I)
-1 | |
Q | |
1 |
XQ1
(r,r-1)
Ir,r-1
Step 9
Find a matrix
R1
-1 | |
R | |
1 |
-1 | |
Q | |
1 |
XQ1R1
Ir,r-1
0
Step 10
Repeat Steps 8 and 9 on column
r-1
(r-1,r-2)
Ir-1,r-2
Q2
R2
Step 11
Repeat these processes on
r-2,r-3,\ldots,3,2
Q3,R3,\ldots,Qr-2,Rr-2,Qr-1
W
Step 12
Let
C=P1diag(I,P2) … diag(I,Pr-1)Q1R1Q2 … Rr-2Qr-1
W=C-1AC
Some well-known applications of the Weyr form are listed below:[3]
n x n
n