Polynomial evaluation explained

In mathematics and computer science, polynomial evaluation refers to computation of the value of a polynomial when its indeterminates are substituted for some values. In other words, evaluating the polynomial

P(x_1,x₂₎=2x_1x₂+

	3
x
	1

x_1=2,x₂₌₃

consists of computing

P(2,3)=2 ⋅ 2 ⋅ 3+2^3+4=24.

Background

This problem arises frequently in practice. In computational geometry, polynomials are used to compute function approximations using Taylor polynomials. In cryptography and hash tables, polynomials are used to compute k-independent hashing.

In the former case, polynomials are evaluated using floating-point arithmetic, which is not exact. Thus different schemes for the evaluation will, in general, give slightly different answers. In the latter case, the polynomials are usually evaluated in a finite field, in which case the answers are always exact.

General methods

Horner's rule

Multivariate

If the polynomial is multivariate, Horner's rule can be applied recursively over some ordering of the variables.E.g.

P(x,y)=4+x+2xy+2x²y+x²y²

can be written as

\begin{align} P(x,y)&=4+x(1+y(2)+x(y(2+y))) or\\ P(x,y)&=4+x+y(x(2+x(2))+y(x^{2)).
\end{align}}

An efficient version of this approach was described by Carnicer and Gasca.^[1]

Estrin's scheme

Evaluation with preprocessing

Arbitrary polynomials can be evaluated with feweroperations than Horner's rule requires if we first "preprocess"the coefficients

a_n,...,a₀

An example was first given by Motzkin^[2] who noted that

P(x)=x⁴+a₃x³+a₂x²+a₁x+a₀

can be written as

y=(x+\beta_0)x+\beta_1,P(x)=(y+x+\beta_2)y+\beta_3,

where the values

\beta_0,...,\beta₃

are computed in advanced, based on

a_0,...,a₃

.Motzkin's method uses just 3 multiplications compared to Horner's 4.

The values for each

\beta_i

can be easily computed by expanding

P(x)

and equating the coefficients:

\begin{align} \beta_{0&=\tfrac12(a}_{3-1),
&z&=a}_2-\beta_0(\beta_{0+1),
&\beta}_1&=a_1-\beta₀z,\\ \beta_2&=z-2\beta_1, &\beta_3&=a_0-\beta_1(\beta_1+\beta_{2).\end{align}}

Example

\exp(x) ≈ 1+x+x^2/2+x^3/6+x^4/24

,we can upscale by a factor 24, apply the above steps, and scale back down.That gives us the three multiplication computation

y=(x+1.5)x+11.625, P(x)=(y+x-15)y/24+2.63477.

Improving over the equivalent Horner form (that is

P(x)=1+x(1+x(1/2+x(1/6+x/24)))

) by 1 multiplication.

Some general methods include the Knuth–Eve algorithm and the Rabin–Winograd algorithm.

Multipoint evaluation

Evaluate of a

-degree polynomial

P(x)

in multiple points

x_1,...,x_m

can be done with

multiplications by using Horner's method

times. Using above preprocessing approach, this can be reduced that by a factor of two, that is, to

mn/2

multiplications. However, it is possible to do better.

It is possible to reduce the time requirement to just

O((n+m)log²⁽ⁿ+m))

.^[3] The idea is to define two polynomials that are zero in respectively the first and second half of the points:

m_0(x)=(x-x_{1) … (x-x}_n/2)

and

m_1(x)=(x-x_n/2+1) … (x-x_n)

.We then compute

R₀=P\bmodm₀

and

R₁=P\bmodm₁

using the Polynomial remainder theorem, which can be done in

O(nlogn)

time using a fast Fourier transform.This means

P(x)=Q(x)m_0(x)+R_0(x)

and

P(x)=Q(x)m_1(x)+R_1(x)

by construction, where

R₀

and

R₁

are polynomials of degree at most

n/2

.Because of how

m₀

and

m₁

were defined, we have

\begin{align} R_0(x_i)&=P(x_i) fori\len/2 and\\ R_1(x_i)&=P(x_i) fori>n/2. \end{align}

Thus to compute

on all

of the

x_i

, it suffices to compute the smaller polynomials

R₀

and

R₁

on each half of the points.This gives us a divide-and-conquer algorithm with

T(n)=2T(n/2)+nlogn

, which implies

T(n)=O(n(logn)²⁾

by the master theorem.

In the case where the points in which we wish to evaluate the polynomials have some structure, simpler methods exist.For example, Knuth^[4] section 4.6.4gives a method for tabulating polynomial values of the type

P(x₀+h),P(x₀+2h),....

Dynamic evaluation

In the case where

x_1,...,x_m

are not known in advance,Kedlaya and Umans^[5] gave a data structure for evaluating polynomials over a finite field of size

F_q

in time

(logn)^O(1)(log₂q)^1+o(1)

per evaluation after some initial preprocessing.This was shown by Larsen^[6] to be essentially optimal.

The idea is to transform

P(x)

of degree

into a multivariate polynomial

f(x_1,x_2,...,x_m)

, such that

P(x)=f(x,x^d,

	d²
x

,...,

	d^m
x

)

and the individual degrees of

is at most

.Since this is over

\bmodq

, the largest value

can take (over

) is

M=d^m(q-1)^dm

.Using the Chinese remainder theorem, it suffices to evaluate

modulo different primes

p_1,...,p_\ell

with a product at least

.Each prime can be taken to be roughly

logM=O(dmlogq)

, and the number of primes needed,

\ell

, is roughly the same.Doing this process recursively, we can get the primes as small as

loglogq

.That means we can compute and store

on all the possible values in

T=(loglogq)^m

time and space.If we take

d=logq

, we get

m=\tfrac{logn}{loglogq}

, so the time/space requirement is just

	loglogq
	logloglogq

Kedlaya and Umans further show how to combine this preprocessing with fast (FFT) multipoint evaluation.This allows optimal algorithms for many important algebraic problems, such as polynomial modular composition.

Specific polynomials

While general polynomials require

\Omega(n)

operations to evaluate, some polynomials can be computed much faster.For example, the polynomial

P(x)=x^2+2x+1

can be computed using just one multiplication and one addition since

P(x)=(x+1)²

Evaluation of powers

See main article: Exponentiation by squaring and Addition-chain exponentiation.

A particularly interesting type of polynomial is powers like

xⁿ

.Such polynomials can always be computed in

O(logn)

operations.Suppose, for example, that we need to compute

x¹⁶

; we could simply start with

and multiply by

to get

x²

.We can then multiply that by itself to get

x⁴

and so on to get

x⁸

and

x¹⁶

in just four multiplications.Other powers like

x⁵

can similarly be computed efficiently by first computing

x⁴

by 2 multiplications and then multiplying by

The most efficient way to compute a given power

xⁿ

is provided by addition-chain exponentiation. However, this requires designing a specific algorithm for each exponent, and the computation needed for designing these algorithms are difficult (NP-complete^[7]), so exponentiation by squaring is generally preferred for effective computations.

Polynomial families

Often polynomials show up in a different form than the well known

a_nxⁿ+...+a₁x+a₀

.For polynomials in Chebyshev form we can use Clenshaw algorithm.For polynomials in Bézier form we can use De Casteljau's algorithm,and for B-splines there is De Boor's algorithm.

Hard polynomials

The fact that some polynomials can be computed significantly faster than "general polynomials" suggests the question: Can we give an example of a simple polynomial that cannot be computed in time much smaller than its degree?Volker Strassen has shown^[8] that the polynomial

	n
P(x)=\sum
	k=0

	kn³
2

x^k

cannot be evaluated by with less than

\tfrac12n-2

multiplications and

n-4

additions.At least this bound holds if only operations of those types are allowed, giving rise to a so-called "polynomial chain of length

<n^2/logn

The polynomial given by Strassen has very large coefficients, but by probabilistic methods, one can show there must exist even polynomials with coefficients just 0's and 1's such that the evaluation requires at least

\Omega(n/logn)

multiplications.

For other simple polynomials, the complexity is unknown.The polynomial

(x+1)(x+2) … (x+n)

is conjectured to not be computable in time

(logn)^c

for any

.This is supported by the fact, that if it can be computed fast then integer factorization can be computed in polynomial time, breaking the RSA cryptosystem.^[9]

Matrix polynomials

Sometimes the computational cost of scalar multiplications (like

) is less than the computational cost of "non scalar" multiplications (like

x²

).The typical example of this is matrices.If

is an

m x m

matrix, a scalar multiplication

takes about

m²

arithmetic operations, while computing

M²

takes about

m³

(or

m^2.3

using fast matrix multiplication).

Matrix polynomials are important for example for computing the Matrix Exponential.

Paterson and Stockmeyer ^[10] showed how to compute a degree

polynomial using only

O(\sqrtn)

non scalar multiplications and

O(n)

scalar multiplications.Thus a matrix polynomial of degree can be evaluated in

O(m^2.3\sqrt{n}+m²ⁿ⁾

time. If

m=n

this is

O(m³⁾

, as fast as one matrix multiplication with the standard algorithm.

This method works as follows: For a polynomial

P(M)=a_n-1M^n-1+...+a₁M+a₀I,

let be the least integer not smaller than

\sqrt{n}.

The powers

M,M^2,...,M^k

are computed with

matrix multiplications, and

M^2k,M^3k,...,

	k^2-k
M

are then computed by repeated multiplication by

M^k.

Now,

\begin{align}P(M)= &(a₀I+a₁M+...+a_k-1M^k-1) \\+&(a_kI+a_k+1M+...+a_2k-1M^k-1

	k \\+&... \\+&(a
)M
	n-k

I+a_n-k+1M+...+a_n-1M^k-1

	k^2-k
)M

, \end{align}

,where

a_i=0

for .This requires just

more non-scalar multiplications.

We can write this succinctly using the Kronecker product:

P(M)= \begin{bmatrix}I\\M\\\vdots\\M^k-1

	T \left(\begin{bmatrix} a
\end{bmatrix}
	0

&a₁&a₂&...\\ a_k&a_k+1&\ddots\\ a_2k&\ddots\\ \vdots\end{bmatrix} ⊗ I\right) \begin{bmatrix}I\\M^k\\M^2k\\\vdots\end{bmatrix}

The direct application of this method uses

2\sqrt{n}

non-scalar multiplications, but combining it with Evaluation with preprocessing, Paterson and Stockmeyer show you can reduce this to

\sqrt{2n}

Methods based on matrix polynomial multiplications and additions have been proposed allowing to save nonscalar matrix multiplications with respect to the Paterson-Stockmeyer method.^[11]

Notes and References

Carnicer . J. . Gasca . M. . 1990 . Evaluation of Multivariate Polynomials and Their Derivatives . . 54 . 189 . 231–243 . 10.2307/2008692 . 2008692 . free.
Motzkin. T. S.. 1955. Evaluation of polynomials and evaluation of rational functions. Bulletin of the American Mathematical Society. 61. 163. 10.
Book: Von Zur Gathen . Joachim . Modern computer algebra . Jürgen . Gerhard . . 2013 . 9781139856065 . Chapter 10 . Joachim von zur Gathen.
Book: Knuth, Donald . . . 2005 . 9780201853926 . 2: Seminumerical Algorithms . Donald Knuth.
Kedlaya. Kiran S.. Kiran Kedlaya. Umans. Christopher. Chris Umans. 2011. Fast Polynomial Factorization and Modular Composition. SIAM Journal on Computing. 40. 6. 1767–1802. 10.1137/08073408x. 1721.1/71792. 412751 . free.
Book: Larsen, K. G.. 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science . Higher Cell Probe Lower Bounds for Evaluating Polynomials . Kasper Green Larsen. 2012. IEEE. 53. 293–301. 10.1109/FOCS.2012.21. 978-0-7695-4874-6 . 7906483 .
Downey . Peter . Leong . Benton . Sethi . Ravi . Computing Sequences with Addition Chains . SIAM Journal on Computing . 1981 . 10 . 3 . 27 January 2024.
Strassen. Volker. Volker Strassen. 1974. Polynomials with Rational Coefficients Which are Hard to Compute. SIAM Journal on Computing. en. 3. 2. 128–149. 10.1137/0203010.
Chen, Xi, Neeraj Kayal, and Avi Wigderson. Partial derivatives in arithmetic complexity and beyond. Now Publishers Inc, 2011.
Paterson. Michael S.. Mike Paterson. Stockmeyer. Larry J.. Larry J. Stockmeyer. 1973. On the Number of Nonscalar Multiplications Necessary to Evaluate Polynomials. SIAM Journal on Computing. en. 2. 1. 60–66. 10.1137/0202007.
Fasi . Massimiliano . Optimality of the Paterson–Stockmeyer method for evaluating matrix polynomials and rational matrix functions . Linear Algebra and its Applications . 1 August 2019 . 574 . 185 . 10.1016/j.laa.2019.04.001 . 0024-3795. free .

	n+a
a
	n-1

Polynomial evaluation explained

Background

General methods

Horner's rule

Multivariate

Estrin's scheme

Evaluation with preprocessing

Example

Multipoint evaluation

Dynamic evaluation

Specific polynomials

Evaluation of powers

Polynomial families

Hard polynomials

Matrix polynomials

See also

Notes and References