Numerical linear algebra explained

Numerical linear algebra, sometimes called applied linear algebra, is the study of how matrix operations can be used to create computer algorithms which efficiently and accurately provide approximate answers to questions in continuous mathematics. It is a subfield of numerical analysis, and a type of linear algebra. Computers use floating-point arithmetic and cannot exactly represent irrational data, so when a computer algorithm is applied to a matrix of data, it can sometimes increase the difference between a number stored in the computer and the true number that it is an approximation of. Numerical linear algebra uses properties of vectors and matrices to develop computer algorithms that minimize the error introduced by the computer, and is also concerned with ensuring that the algorithm is as efficient as possible.

Numerical linear algebra aims to solve problems of continuous mathematics using finite precision computers, so its applications to the natural and social sciences are as vast as the applications of continuous mathematics. It is often a fundamental part of engineering and computational science problems, such as image and signal processing, telecommunication, computational finance, materials science simulations, structural biology, data mining, bioinformatics, and fluid dynamics. Matrix methods are particularly used in finite difference methods, finite element methods, and the modeling of differential equations. Noting the broad applications of numerical linear algebra, Lloyd N. Trefethen and David Bau, III argue that it is "as fundamental to the mathematical sciences as calculus and differential equations",[1] even though it is a comparatively small field.[2] Because many properties of matrices and vectors also apply to functions and operators, numerical linear algebra can also be viewed as a type of functional analysis which has a particular emphasis on practical algorithms.[1]

Common problems in numerical linear algebra include obtaining matrix decompositions like the singular value decomposition, the QR factorization, the LU factorization, or the eigendecomposition, which can then be used to answer common linear algebraic problems like solving linear systems of equations, locating eigenvalues, or least squares optimisation. Numerical linear algebra's central concern with developing algorithms that do not introduce errors when applied to real data on a finite precision computer is often achieved by iterative methods rather than direct ones.

History

Numerical linear algebra was developed by computer pioneers like John von Neumann, Alan Turing, James H. Wilkinson, Alston Scott Householder, George Forsythe, and Heinz Rutishauser, in order to apply the earliest computers to problems in continuous mathematics, such as ballistics problems and the solutions to systems of partial differential equations.[2] The first serious attempt to minimize computer error in the application of algorithms to real data is John von Neumann and Herman Goldstine's work in 1947.[3] The field has grown as technology has increasingly enabled researchers to solve complex problems on extremely large high-precision matrices, and some numerical algorithms have grown in prominence as technologies like parallel computing have made them practical approaches to scientific problems.[2]

Matrix decompositions

Partitioned matrices

See main article: Block matrix. For many problems in applied linear algebra, it is useful to adopt the perspective of a matrix as being a concatenation of column vectors. For example, when solving the linear system

x=A-1b

, rather than understanding x as the product of

A-1

with b, it is helpful to think of x as the vector of coefficients in the linear expansion of b in the basis formed by the columns of A.[1] Thinking of matrices as a concatenation of columns is also a practical approach for the purposes of matrix algorithms. This is because matrix algorithms frequently contain two nested loops: one over the columns of a matrix A, and another over the rows of A. For example, for matrices

Am

and vectors

xn

and

ym

, we could use the column partitioning perspective to compute y := Ax + y as

for q = 1:n for p = 1:m y(p) = A(p,q)*x(q) + y(p) endend

Singular value decomposition

See main article: singular value decomposition. The singular value decomposition of a matrix

Am

is

A=U\SigmaV\ast

where U and V are unitary, and

\Sigma

is diagonal. The diagonal entries of

\Sigma

are called the singular values of A. Because singular values are the square roots of the eigenvalues of

AA\ast

, there is a tight connection between the singular value decomposition and eigenvalue decompositions. This means that most methods for computing the singular value decomposition are similar to eigenvalue methods;[1] perhaps the most common method involves Householder procedures.[1]

QR factorization

See main article: QR decomposition. The QR factorization of a matrix

Am

is a matrix

Qm

and a matrix

Rm

so that A = QR, where Q is orthogonal and R is upper triangular.[1] [4] The two main algorithms for computing QR factorizations are the Gram–Schmidt process and the Householder transformation. The QR factorization is often used to solve linear least-squares problems, and eigenvalue problems (by way of the iterative QR algorithm).

LU factorization

See main article: LU decomposition. An LU factorization of a matrix A consists of a lower triangular matrix L and an upper triangular matrix U so that A = LU. The matrix U is found by an upper triangularization procedure which involves left-multiplying A by a series of matrices

M1,\ldots,Mn-1

to form the product

Mn-1M1A=U

, so that equivalently

L=

-1
M
1

-1
M
n-1
.[1] [4]

Eigenvalue decomposition

See main article: Eigendecomposition of a matrix. The eigenvalue decomposition of a matrix

Am

is

A=XΛX-1

, where the columns of X are the eigenvectors of A, and

Λ

is a diagonal matrix the diagonal entries of which are the corresponding eigenvalues of A.[1] There is no direct method for finding the eigenvalue decomposition of an arbitrary matrix. Because it is not possible to write a program that finds the exact roots of an arbitrary polynomial in finite time, any general eigenvalue solver must necessarily be iterative.[1]

Algorithms

Gaussian elimination

See main article: Gaussian elimination. From the numerical linear algebra perspective, Gaussian elimination is a procedure for factoring a matrix A into its LU factorization, which Gaussian elimination accomplishes by left-multiplying A by a succession of matrices

Lm-1L2L1A=U

until U is upper triangular and L is lower triangular, where

L\equiv

-1
L
1
-1
L
2

-1
L
m-1
.[1] Naive programs for Gaussian elimination are notoriously highly unstable, and produce huge errors when applied to matrices with many significant digits.[2] The simplest solution is to introduce pivoting, which produces a modified Gaussian elimination algorithm that is stable.[1]

Solutions of linear systems

Numerical linear algebra characteristically approaches matrices as a concatenation of columns vectors. In order to solve the linear system

x=A-1b

, the traditional algebraic approach is to understand x as the product of

A-1

with b. Numerical linear algebra instead interprets x as the vector of coefficients of the linear expansion of b in the basis formed by the columns of A.[1]

Many different decompositions can be used to solve the linear problem, depending on the characteristics of the matrix A and the vectors x and b, which may make one factorization much easier to obtain than others. If A = QR is a QR factorization of A, then equivalently

Rx=Q\astb

. This is as easy to compute as a matrix factorization.[1] If

A=XΛX-1

is an eigendecomposition A, and we seek to find b so that b = Ax, with

b'=X-1b

and

x'=X-1x

, then we have

b'=Λx'

.[1] This is closely related to the solution to the linear system using the singular value decomposition, because singular values of a matrix are the absolute values of its eigenvalues, which are also equivalent to the square roots of the absolute values of the eigenvalues of the Gram matrix

X*X

. And if A = LU is an LU factorization of A, then Ax = b can be solved using the triangular matrices Ly = b and Ux = y.[1] [4]

Least squares optimisation

See main article: Numerical methods for linear least squares. Matrix decompositions suggest a number of ways to solve the linear system r = bAx where we seek to minimize r, as in the regression problem. The QR algorithm solves this problem by computing the reduced QR factorization of A and rearranging to obtain

\widehat{R}x=\widehat{Q}\astb

. This upper triangular system can then be solved for x. The SVD also suggests an algorithm for obtaining linear least squares. By computing the reduced SVD decomposition

A=\widehat{U}\widehat{\Sigma}V\ast

and then computing the vector

\widehat{U}\astb

, we reduce the least squares problem to a simple diagonal system.[1] The fact that least squares solutions can be produced by the QR and SVD factorizations means that, in addition to the classical normal equations method for solving least squares problems, these problems can also be solved by methods that include the Gram-Schmidt algorithm and Householder methods.

Conditioning and stability

Allow that a problem is a function

f:X\toY

, where X is a normed vector space of data and Y is a normed vector space of solutions. For some data point

x\inX

, the problem is said to be ill-conditioned if a small perturbation in x produces a large change in the value of f(x). We can quantify this by defining a condition number which represents how well-conditioned a problem is, defined as\widehat = \lim_ \sup_ \frac.

Instability is the tendency of computer algorithms, which depend on floating-point arithmetic, to produce results that differ dramatically from the exact mathematical solution to a problem. When a matrix contains real data with many significant digits, many algorithms for solving problems like linear systems of equation or least squares optimisation may produce highly inaccurate results. Creating stable algorithms for ill-conditioned problems is a central concern in numerical linear algebra. One example is that the stability of householder triangularization makes it a particularly robust solution method for linear systems, whereas the instability of the normal equations method for solving least squares problems is a reason to favour matrix decomposition methods like using the singular value decomposition. Some matrix decomposition methods may be unstable, but have straightforward modifications that make them stable; one example is the unstable Gram–Schmidt, which can easily be changed to produce the stable modified Gram–Schmidt.[1] Another classical problem in numerical linear algebra is the finding that Gaussian elimination is unstable, but becomes stable with the introduction of pivoting.

Iterative methods

See main article: Iterative methods. There are two reasons that iterative algorithms are an important part of numerical linear algebra. First, many important numerical problems have no direct solution; in order to find the eigenvalues and eigenvectors of an arbitrary matrix, we can only adopt an iterative approach. Second, noniterative algorithms for an arbitrary

m x m

matrix require

O(m3)

time, which is a surprisingly high floor given that matrices contain only

m2

numbers. Iterative approaches can take advantage of several features of some matrices to reduce this time. For example, when a matrix is sparse, an iterative algorithm can skip many of the steps that a direct approach would necessarily follow, even if they are redundant steps given a highly structured matrix.

The core of many iterative methods in numerical linear algebra is the projection of a matrix onto a lower dimensional Krylov subspace, which allows features of a high-dimensional matrix to be approximated by iteratively computing the equivalent features of similar matrices starting in a low dimension space and moving to successively higher dimensions. When A is symmetric and we wish to solve the linear problem Ax = b, the classical iterative approach is the conjugate gradient method. If A is not symmetric, then examples of iterative solutions to the linear problem are the generalized minimal residual method and CGN. If A is symmetric, then to solve the eigenvalue and eigenvector problem we can use the Lanczos algorithm, and if A is non-symmetric, then we can use Arnoldi iteration.

Software

See main article: List of numerical analysis software. Several programming languages use numerical linear algebra optimisation techniques and are designed to implement numerical linear algebra algorithms. These languages include MATLAB, Analytica, Maple, and Mathematica. Other programming languages which are not explicitly designed for numerical linear algebra have libraries that provide numerical linear algebra routines and optimisation; C and Fortran have packages like Basic Linear Algebra Subprograms and LAPACK, python has the library NumPy, and Perl has the Perl Data Language. Many numerical linear algebra commands in R rely on these more fundamental libraries like LAPACK.[5] More libraries can be found on the List of numerical libraries.

Further reading

External links

Notes and References

  1. Book: Trefethen . Lloyd . Bau III . David . Philadelphia . 978-0-89871-361-9 . 1997 . Numerical Linear Algebra . SIAM . 1st.
  2. Web site: A History of Modern Numerical Linear Algebra . Golub . Gene H. . University of Chicago Statistics Department . February 17, 2019.
  3. von Neumann. John . Goldstine . Herman H. . 1947 . Numerical inverting of matrices of high order . Bulletin of the American Mathematical Society . 53 . 11 . 1021–1099 . 10.1090/s0002-9904-1947-08909-6. 16174165 . free .
  4. Book: Golub . Gene H. . Van Loan . Charles F. . Baltimore . 0-8018-5413-X . 1996 . Matrix Computations . The Johns Hopkins University Press . 3rd.
  5. Web site: R and Linear Algebra . Rickert . Joseph . R-bloggers . August 29, 2013 . February 17, 2019.