Descent direction explained
In optimization, a descent direction is a vector $\mathbf{p} \in \mathbb{R}^n$ that points towards a local minimum $\mathbf{x}^*$ of an objective function $f : \mathbb{R}^n \to \mathbb{R}$.
Computing $\mathbf{x}^*$ by an iterative method, such as line search, defines a descent direction $\mathbf{p}_k \in \mathbb{R}^n$ at the $k$th iterate to be any $\mathbf{p}_k$ such that $\langle \mathbf{p}_k, \nabla f(\mathbf{x}_k) \rangle < 0$, where $\langle \cdot , \cdot \rangle$ denotes the inner product. The motivation for such an approach is that small steps along $\mathbf{p}_k$ guarantee that $f$ is reduced, by Taylor's theorem.
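A minimal numerical check of this condition may make it concrete. The sketch below assumes a simple quadratic objective $f(x) = x_1^2 + 4x_2^2$ (an illustration chosen here, not part of the original text) and tests whether a candidate direction satisfies the inner-product condition and whether a small step along it reduces $f$:

```python
import numpy as np

# Hypothetical quadratic objective (an assumption for this example).
def f(x):
    return x[0] ** 2 + 4.0 * x[1] ** 2

def grad_f(x):
    return np.array([2.0 * x[0], 8.0 * x[1]])

def is_descent_direction(x, p):
    """Descent condition: the inner product <p, grad f(x)> must be negative."""
    return np.dot(p, grad_f(x)) < 0

x_k = np.array([1.0, 1.0])
p_k = np.array([-1.0, -2.0])            # a candidate direction

print(is_descent_direction(x_k, p_k))   # True: p_k points "downhill" at x_k

# A sufficiently small step along a descent direction reduces f (Taylor's theorem).
alpha = 1e-2
print(f(x_k + alpha * p_k) < f(x_k))    # True
```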
Using this definition, the negative of a non-zero gradient is always a descent direction, as $\langle -\nabla f(\mathbf{x}_k), \nabla f(\mathbf{x}_k) \rangle = -\langle \nabla f(\mathbf{x}_k), \nabla f(\mathbf{x}_k) \rangle = -\|\nabla f(\mathbf{x}_k)\|^2 < 0$.
Numerous methods exist to compute descent directions, all with differing merits, such as gradient descent or the conjugate gradient method.
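As one illustration of how a descent direction is used in practice, here is a minimal gradient descent loop, a sketch under the same assumed quadratic objective as above, in which the descent direction at each iterate is simply the negative gradient; the fixed step size is an arbitrary choice for the example:

```python
import numpy as np

def grad_f(x):
    # Gradient of the assumed objective f(x) = x1^2 + 4*x2^2.
    return np.array([2.0 * x[0], 8.0 * x[1]])

x = np.array([1.0, 1.0])
alpha = 0.1                      # fixed step size, chosen arbitrarily for this sketch

for _ in range(100):
    p = -grad_f(x)               # the negative gradient is always a descent direction
    x = x + alpha * p            # take a small step along it

print(x)                         # close to the minimizer (0, 0)
```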
More generally, if $P$ is a positive definite matrix, then $\mathbf{p}_k = -P \nabla f(\mathbf{x}_k)$ is a descent direction at $\mathbf{x}_k$.[1] This generality is used in preconditioned gradient descent methods.
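A short sketch of this more general construction (again an assumed example, with $P$ chosen arbitrarily as a diagonal positive definite matrix): the preconditioned direction $\mathbf{p}_k = -P \nabla f(\mathbf{x}_k)$ still satisfies the descent condition, since $\langle -P \nabla f, \nabla f \rangle = -\nabla f^\top P \nabla f < 0$ whenever $\nabla f \neq 0$.

```python
import numpy as np

def grad_f(x):
    # Gradient of the assumed objective f(x) = x1^2 + 4*x2^2.
    return np.array([2.0 * x[0], 8.0 * x[1]])

# An arbitrary positive definite matrix (here diagonal) used as a preconditioner.
P = np.diag([0.5, 0.125])

x_k = np.array([1.0, 1.0])
g = grad_f(x_k)
p_k = -P @ g                     # preconditioned descent direction

# Descent condition holds: <p_k, g> = -g^T P g < 0 for any nonzero gradient.
print(np.dot(p_k, g))            # prints a negative value
```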
Notes and References
- J. M. Ortega and W. C. Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, 1970, p. 243. doi:10.1137/1.9780898719468.