Aberth method explained

The Aberth method, or Aberth–Ehrlich method or Ehrlich–Aberth method, named after Oliver Aberth^[1] and Louis W. Ehrlich,^[2] is a root-finding algorithm developed in 1967 for simultaneous approximation of all the roots of a univariate polynomial.

This method converges cubically, an improvement over the Durand–Kerner method, another algorithm for approximating all roots at once, which converges quadratically. (However, both algorithms converge linearly at multiple zeros.)

This method is used in MPSolve, which is the reference software for approximating all roots of a polynomial to an arbitrary precision.

Description

Let

	n+p
p(x)=p
	n-1

x^n-1+ … +p_1x+p₀

be a univariate polynomial of degree n

with real or complex coefficients. Then there exist complex numbers

	*
z
	n

, the roots of p(x)

, that give the factorization:

	*
p(x)=p
	n).

Although those numbers are unknown, upper and lower bounds for their absolute values are computable from the coefficients of the polynomial. Now one can pick

distinct numbers in the complex plane—randomly or evenly distributed—such that their absolute values are within the same bounds. (Also, if the zeros are symmetrical, the starting points must not be exactly symmetrical along the same axis, as this can prevent convergence.) A set of such numbers is called an initial approximation of the set of roots of

p(x)

. This approximation can be iteratively improved using the following procedure.

Let

z_1,...,z_n\inC

be the current approximations of the zeros of p(x)

. Then offset numbers

w_1,...,w_n\inC

are computed as

	p(z_k)
	p'(z_k)

p(z_k)

⋅ \sum_j\ne

	1{z
	_k-z

p'(z_k)

where

p'(z_k)

is the polynomial derivative of p

evaluated in the point z_k

.

The next set of approximations of roots of

p(x)

is then

z_1-w_1,...,z_n-w_n

. One can measure the quality of the current approximation by the values of the polynomial or by the size of the offsets.

Conceptually, this method uses an electrostatic analogy, modeling the approximated zeros as movable negative point charges, which converge toward the true zeros, represented by fixed positive point charges. A direct application of Newton's method to each approximated zero will often cause multiple starting points to incorrectly converge to the same root. The Aberth method avoids this by also modeling the repulsive effect the movable charges have on each other. In this way, when a movable charge has converged on a zero, their charges will cancel out, so that other movable charges are no longer attracted to that location, encouraging them to converge to other "unoccupied" zeros. (Stieltjes also modeled the positions of zeros of polynomials as solutions to electrostatic problems.)

Inside the formula of the Aberth method one can find elements of Newton's method and the Durand–Kerner method. Details for an efficient implementation, esp. on the choice of good initial approximations, can be found in Bini (1996).^[3]

The updates of the roots may be executed as a simultaneous Jacobi-like iteration where first all new approximations are computed from the old approximations or as a sequential Gauss–Seidel-like iteration that uses each new approximation from the time it is computed.

A very similar method is the Newton-Maehly method. It computes the zeros one after another, but instead of an explicit deflation it divides by the already acquired linear factors on the fly. The Aberth method is like the Newton-Maehly method for computing the last root while pretending you have already found the other ones.^[4]

Derivation from Newton's method

The iteration formula is the univariate Newton iteration for the function

F(x)=

p(x)

	n(x-z
\prod
	j)

If the values

z_1,...,z_n

are already close to the roots of

p(x)

, then the rational function

F(x)

is almost linear with a dominant root close to

z_k

and poles at

z_1,...,z_k-1,z_k+1,...,z_n

that direct the Newton iteration away from the roots of p(x) that are close to them. That is, the corresponding basins of attraction get rather small, while the root close to

z_k

has a wide region of attraction.

The Newton step

\tfrac{F(x)}{F'(x)}

in the univariate case is the reciprocal value to the logarithmic derivative

\begin{align}

	F'(x)
	F(x)

	d
	dx

ln|F(x)|\\ &=

	d
	dx

	nln\|x-z
(ln\|p(x)\|-\sum
	j\|)\\

	p'(x)
	p(x)

n	1{x-z
	_{j} \end{align}}

-\sum

j=1;j\nek

Thus, the new approximation is computed as

z_k'=z

k-	F(z_k)
	F'(z_k)

1{	p'(z_k)
	p(z_k)

-\sum

n	1{z
	_k-z

j}},

which is the update formula of the Aberth - Ehrlich method.

Literature

Aberth. Oliver. 1973. Iteration methods for finding all zeros of a polynomial simultaneously. Math. Comp.. Mathematics of Computation, Vol. 27, No. 122. 27. 122. 339–344. 10.2307/2005621. 2005621. Because of the obvious analogy from electrostatics, this field may be called the field of a unit plus charge ... To avoid this, we assign a unit minus charge at each sampling point. The idea here is that when a sampling point z, is near a simple zero, the field from the minus charge at z, should counteract that from the plus charge at the zero, preventing a second sampling point from converging to this zero.. free.
Ehrlich. Louis W.. A modified Newton method for polynomials. Comm. ACM. 10 . 2 . 1967. 107–108. 10.1145/363067.363115. free.
Bini. Dario Andrea. Numerical computation of polynomial zeros by means of Aberth's method. Numerical Algorithms. 13. 1996. 179–200. 10.1007/BF02207694. 2. 1996NuAlg..13..179B. 23899456.
Bauer. F.L.. Stoer. J. . Algorithm 105: Newton Maehly . Comm. ACM. 5 . 7 . 1962 . 387–388 . 10.1145/368273.368423. free .

Aberth method explained

Description

Derivation from Newton's method

Literature

See also