In mathematics, especially in linear algebra and matrix theory, the vectorization of a matrix is a linear transformation which converts the matrix into a vector. Specifically, the vectorization of a matrix A, denoted vec(A), is the column vector obtained by stacking the columns of the matrix A on top of one another:Here,
ai,j
{}T
Rm:=Rm ⊗ Rn\congRmn
For example, for the 2×2 matrix
A=\begin{bmatrix}a&b\ c&d\end{bmatrix}
\operatorname{vec}(A)=\begin{bmatrix}a\ c\ b\ d\end{bmatrix}
The connection between the vectorization of A and the vectorization of its transpose is given by the commutation matrix.
The vectorization is frequently used together with the Kronecker product to express matrix multiplication as a linear transformation on matrices. In particular,for matrices A, B, and C of dimensions k×l, l×m, and m×n.[1] For example, if
\operatorname{ad}A(X)=AX-XA
\operatorname{vec}(\operatorname{ad}A(X))=(In ⊗ A-AT ⊗ In)vec(X)
In
There are two other useful formulations:
More generally, it has been shown that vectorization is a self-adjunction in the monoidal closed structure of any category of matrices.[2]
Vectorization is an algebra homomorphism from the space of matrices with the Hadamard (entrywise) product to Cn2 with its Hadamard product:
Vectorization is a unitary transformation from the space of n×n matrices with the Frobenius (or Hilbert–Schmidt) inner product to Cn2:where the superscript † denotes the conjugate transpose.
The matrix vectorization operation can be written in terms of a linear sum. Let X be an matrix that we want to vectorize, and let ei be the i-th canonical basis vector for the n-dimensional space, that is . Let Bi be a block matrix defined as follows:
Bi consists of n block matrices of size, stacked column-wise, and all these matrices are all-zero except for the i-th one, which is a identity matrix Im.
Then the vectorized version of X can be expressed as follows:
Multiplication of X by ei extracts the i-th column, while multiplication by Bi puts it into the desired position in the final vector.
Alternatively, the linear sum can be expressed using the Kronecker product:
For a symmetric matrix A, the vector vec(A) contains more information than is strictly necessary, since the matrix is completely determined by the symmetry together with the lower triangular portion, that is, the entries on and below the main diagonal. For such matrices, the half-vectorization is sometimes more useful than the vectorization. The half-vectorization, vech(A), of a symmetric matrix A is the column vector obtained by vectorizing only the lower triangular part of A:
For example, for the 2×2 matrix
A=\begin{bmatrix}a&b\ b&d\end{bmatrix}
\operatorname{vech}(A)=\begin{bmatrix}a\ b\ d\end{bmatrix}
There exist unique matrices transforming the half-vectorization of a matrix to its vectorization and vice versa called, respectively, the duplication matrix and the elimination matrix.
Programming languages that implement matrices may have easy means for vectorization.In Matlab/GNU Octave a matrix A
can be vectorized by A(:)
.GNU Octave also allows vectorization and half-vectorization with vec(A)
and vech(A)
respectively. Julia has the vec(A)
function as well.In Python NumPy arrays implement the flatten
method,[1] while in R the desired effect can be achieved via the c
or as.vector
functions. In R, function vec
of package 'ks' allows vectorization and function vech
implemented in both packages 'ks' and 'sn' allows half-vectorization.[3] [4] [5]
Vectorization is used in matrix calculus and its applications in establishing e.g., moments of random vectors and matrices, asymptotics, as well as Jacobian and Hessian matrices.[6] It is also used in local sensitivity and statistical diagnostics.[7]
\operatorname{vec}(ABC)=(A ⊗ CT)\operatorname{vec}(B)