Berlekamp's algorithm explained
In mathematics, particularly computational algebra, Berlekamp's algorithm is a well-known method for factoring polynomials over finite fields (also known as Galois fields). The algorithm consists mainly of matrix reduction and polynomial GCD computations. It was invented by Elwyn Berlekamp in 1967. It was the dominant algorithm for solving the problem until the Cantor–Zassenhaus algorithm of 1981. It is currently implemented in many well-known computer algebra systems.
Overview
(i.e. one with no repeated factors) of degree
with coefficients in a finite field
and gives as output a polynomial
with coefficients in the same field such that
divides
. The algorithm may then be applied recursively to these and subsequent divisors, until we find the decomposition of
into powers of
irreducible polynomials (recalling that the
ring of polynomials over a finite field is a
unique factorization domain).
All possible factors of
are contained within the
factor ring
The algorithm focuses on polynomials
which satisfy the congruence:
g(x)q\equivg(x)\pmod{f(x)}.
These polynomials form a
subalgebra of R (which can be considered as an
-dimensional vector space over
), called the
Berlekamp subalgebra. The Berlekamp subalgebra is of interest because the polynomials
it contains satisfy
In general, not every GCD in the above product will be a non-trivial factor of
, but some are, providing the factors we seek.
Berlekamp's algorithm finds polynomials
suitable for use with the above result by computing a basis for the Berlekamp subalgebra. This is achieved via the observation that Berlekamp subalgebra is in fact the
kernel of a certain
matrix over
, which is derived from the so-called Berlekamp matrix of the polynomial, denoted
. If
then
is the coefficient of the
-th power term in the reduction of
modulo
, i.e.:
xiq\equivqi,n-1xn-1+qi,n-2xn-2+\ldots+qi,0\pmod{f(x)}.
With a certain polynomial
, say:
g(x)=gn-1xn-1+gn-2xn-2+\ldots+g0,
we may associate the row vector:
It is relatively straightforward to see that the row vector
corresponds, in the same way, to the reduction of
modulo
. Consequently, a polynomial
is in the Berlekamp subalgebra if and only if
(where
is the
identity matrix), i.e. if and only if it is in the null space of
.
By computing the matrix
and reducing it to reduced row echelon form and then easily reading off a basis for the null space, we may find a basis for the Berlekamp subalgebra and hence construct polynomials
in it. We then need to successively compute GCDs of the form above until we find a non-trivial factor. Since the ring of polynomials over a field is a
Euclidean domain, we may compute these GCDs using the
Euclidean algorithm.
Conceptual algebraic explanation
With some abstract algebra, the idea behind Berlekamp's algorithm becomes conceptually clear. We represent a finite field , where for some prime , as . We can assume that is square free, by taking all possible pth roots and then computing the gcd with its derivative.
Now, suppose that is the factorization into irreducibles. Then we have a ring isomorphism, , given by the Chinese remainder theorem. The crucial observation is that the Frobenius automorphism commutes with , so that if we denote , then restricts to an isomorphism . By finite field theory, is always the prime subfield of that field extension. Thus, has elements if and only if is irreducible.
Moreover, we can use the fact that the Frobenius automorphism is -linear to calculate the fixed set. That is, we note that is a -subspace, and an explicit basis for it can be calculated in the polynomial ring by computing and establishing the linear equations on the coefficients of polynomials that are satisfied iff it is fixed by Frobenius. We note that at this point we have an efficiently computable irreducibility criterion, and the remaining analysis shows how to use this to find factors.
The algorithm now breaks down into two cases:
- In the case of small we can construct any , and then observe that for some there are so that and . Such a has a nontrivial factor in common with , which can be computed via the gcd. As is small, we can cycle through all possible .
- For the case of large primes, which are necessarily odd, one can exploit the fact that a random nonzero element of is a square with probability , and that the map maps the set of non-zero squares to , and the set of non-squares to . Thus, if we take a random element , then with good probability will have a non-trivial factor in common with .
For further details one can consult.[1]
Applications
One important application of Berlekamp's algorithm is in computing discrete logarithms over finite fields
, where
is prime and
. Computing discrete logarithms is an important problem in
public key cryptography and
error-control coding. For a finite field, the fastest known method is the
index calculus method, which involves the factorisation of field elements. If we represent the field
in the usual way - that is, as polynomials over the base field
, reduced modulo an irreducible polynomial of degree
- then this is simply polynomial factorisation, as provided by Berlekamp's algorithm.
Implementation in computer algebra systems
Berlekamp's algorithm may be accessed in the PARI/GP package using the factormod command, and the WolframAlpha http://www.wolframalpha.com/input/?i=factor+x^5+%2B+x+mod+17 website.
See also
References
- Elwyn R. . Berlekamp . Factoring Polynomials Over Finite Fields . Bell System Technical Journal. 46 . 1853–1859 . 1967 . 8 . 0219231. 10.1002/j.1538-7305.1967.tb03174.x. BSTJ Later republished in: Book: Berlekamp, Elwyn R. . Algebraic Coding Theory . McGraw Hill . 1968 . 0-89412-063-8 .
- Book: Knuth, Donald E. Donald E. Knuth. 4.6.2 Factorization of Polynomials. Seminumerical Algorithms. The Art of Computer Programming. 2. Third. Reading, Massachusetts. Addison-Wesley. 1997. 439–461, 678–691. 0-201-89684-2.
Notes and References
- Book: Theory of Computation - Dexter Kozen . Springer . 2020-09-19.