Block Wiedemann algorithm
The block Wiedemann algorithm for computing kernel vectors of a matrix over a finite field is a generalization by Don Coppersmith of an algorithm due to Doug Wiedemann.
Wiedemann's algorithm
Let M be an n × n square matrix over some finite field F, let x_base be a random vector of length n, and let x = M x_base. Consider the sequence of vectors

S=\left[x,Mx,M^2x,\ldots\right]

obtained by repeatedly multiplying the vector by the matrix M; let y be any other vector of length n, and consider the sequence of finite-field elements

S_y=\left[y\cdot x,y\cdot Mx,y\cdot M^2x,\ldots\right]
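For concreteness, here is a short Python sketch of building S and S_y with dense arithmetic over a small prime field (the prime p = 101, the dimension n, and the seed are illustrative choices; Wiedemann's algorithm is of interest for large sparse matrices, and here x is taken directly at random rather than as M x_base):

```python
import random

p, n = 101, 6
random.seed(1)

def mat_vec(M, v):
    """Multiply an n x n matrix by a vector over GF(p)."""
    return [sum(M[i][j] * v[j] for j in range(n)) % p for i in range(n)]

M = [[random.randrange(p) for _ in range(n)] for _ in range(n)]
x = [random.randrange(p) for _ in range(n)]
y = [random.randrange(p) for _ in range(n)]

# S[t] = M^t x and S_y[t] = y . M^t x; 2n terms of S_y are enough
# for the Berlekamp-Massey step described below.
S, S_y = [x], []
for _ in range(2 * n):
    S_y.append(sum(a * b for a, b in zip(y, S[-1])) % p)
    S.append(mat_vec(M, S[-1]))
```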
We know that the matrix M has a minimal polynomial; by the Cayley–Hamilton theorem we know that this polynomial is of degree (which we will call n_0) no more than n. Say

\sum_{r=0}^{n_0}p_rM^r=0

Then

\sum_{r=0}^{n_0}y\cdot\left(p_r\left(M^rx\right)\right)=0

so the minimal polynomial of the matrix annihilates the sequence S and hence S_y.
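As a tiny worked check of this annihilation claim, the 2×2 matrix below over GF(5) has minimal polynomial x^2 - x - 1, so every projected sequence S_y must satisfy the recurrence s[t+2] - s[t+1] - s[t] = 0 (the matrix, field, and vectors are illustrative choices):

```python
p = 5
M = [[0, 1],
     [1, 1]]                     # minimal polynomial x^2 - x - 1, so M^2 = M + I
x, y = [1, 2], [3, 1]

def mat_vec(M, v):
    return [(M[0][0]*v[0] + M[0][1]*v[1]) % p,
            (M[1][0]*v[0] + M[1][1]*v[1]) % p]

S_y, v = [], x
for _ in range(8):
    S_y.append((y[0]*v[0] + y[1]*v[1]) % p)
    v = mat_vec(M, v)

# every window of S_y satisfies the minimal-polynomial recurrence
assert all((S_y[t+2] - S_y[t+1] - S_y[t]) % p == 0 for t in range(6))
```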
But the Berlekamp–Massey algorithm allows us to calculate relatively efficiently some sequence q_0,\ldots,q_L with

\sum_{i=0}^{L}q_iS_y[i+r]=0\quad\forall r

Our hope is that this sequence, which by construction annihilates S_y, actually annihilates S; so we have

\sum_{i=0}^{L}q_iM^ix=0

We then take advantage of the initial definition of x to say

M\sum_{i=0}^{L}q_iM^ix_{base}=0

and so

\sum_{i=0}^{L}q_iM^ix_{base}

is a hopefully non-zero kernel vector of M.
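The following is a minimal end-to-end sketch of this procedure over GF(p), again with dense toy arithmetic; the deliberately singular matrix, the prime, and the retry-free control flow are illustrative simplifications (a real implementation uses a sparse matrix and retries with fresh random vectors on the rare failures):

```python
import random

p, n = 101, 6                              # illustrative prime field and dimension
random.seed(5)

def mat_vec(M, v):
    return [sum(M[i][j] * v[j] for j in range(n)) % p for i in range(n)]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v)) % p

def berlekamp_massey(s):
    """Standard Berlekamp-Massey over GF(p): returns C with C[0] = 1 and
    sum(C[j] * s[i - j] for j in range(len(C))) == 0 for all valid i."""
    C, B = [1], [1]
    L, m, b = 0, 1, 1
    for i in range(len(s)):
        d = s[i] % p                       # discrepancy
        for j in range(1, L + 1):
            d = (d + C[j] * s[i - j]) % p
        if d == 0:
            m += 1
            continue
        coef = d * pow(b, p - 2, p) % p    # d / b in GF(p)
        if len(B) + m > len(C):
            C = C + [0] * (len(B) + m - len(C))
        T = C[:]
        for j in range(len(B)):
            C[j + m] = (C[j + m] - coef * B[j]) % p
        if 2 * L <= i:
            L, B, b, m = i + 1 - L, T, d, 1
        else:
            m += 1
    while len(C) > 1 and C[-1] == 0:
        C.pop()
    return C

# a deliberately singular matrix: last row = 2*(row 0) + 3*(row 1)
M = [[random.randrange(p) for _ in range(n)] for _ in range(n)]
M[-1] = [(2 * M[0][j] + 3 * M[1][j]) % p for j in range(n)]

x_base = [random.randrange(p) for _ in range(n)]
y = [random.randrange(p) for _ in range(n)]
x = mat_vec(M, x_base)                     # x = M x_base

S_y, v = [], x
for _ in range(2 * n):                     # 2n terms suffice
    S_y.append(dot(y, v))
    v = mat_vec(M, v)

# q_k = C[L - k] annihilates S_y read forwards, as in the text
q = list(reversed(berlekamp_massey(S_y)))

# w = sum_k q_k M^k x_base, so M w = sum_k q_k M^k x = 0 (hopefully)
w, v = [0] * n, x_base[:]
for qk in q:
    w = [(wi + qk * vi) % p for wi, vi in zip(w, v)]
    v = mat_vec(M, v)

print("w nonzero:", any(w), "| M w == 0:", not any(mat_vec(M, w)))
```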
The block Wiedemann (or Coppersmith–Wiedemann) algorithm
The natural implementation of sparse matrix arithmetic on a computer makes it easy to compute the sequence S in parallel for a number of vectors equal to the width of a machine word – indeed, it will normally take no longer to compute for that many vectors than for one. If you have several processors, you can compute the sequence S for a different set of random vectors in parallel on each of them.
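This word-level parallelism can be imitated in Python with integer bitwise operations: over GF(2), vector addition is XOR, so by packing coordinate i of 64 different vectors into one 64-bit word, a single sparse matrix-vector product advances all 64 sequences at once. A sketch, assuming a 0/1 matrix stored as lists of nonzero column indices (sizes and names are illustrative):

```python
import random

random.seed(2)
n, width = 10, 64
# sparse 0/1 matrix: for each row, the list of its nonzero column indices
rows = [random.sample(range(n), 3) for _ in range(n)]

# packed[i] holds, in bit j, coordinate i of the j-th random vector x_j
packed = [random.getrandbits(width) for _ in range(n)]

def sparse_mul_packed(rows, packed):
    # over GF(2) addition is XOR, so each nonzero matrix entry costs one
    # word XOR and updates that coordinate of all 64 vectors at once
    out = []
    for cols in rows:
        acc = 0
        for c in cols:
            acc ^= packed[c]
        out.append(acc)
    return out

# the sequence S for all 64 vectors simultaneously, t = 0 .. 2n-1
S = [packed]
for _ in range(2 * n - 1):
    S.append(sparse_mul_packed(rows, S[-1]))
```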
It turns out, by a generalization of the Berlekamp–Massey algorithm to provide a sequence of small matrices, that you can take the sequence produced for a large number of vectors and generate a kernel vector of the original large matrix. You need to compute
y_i\cdot M^tx_j

for some i = 0 … i_max, j = 0 … j_max, t = 0 … t_max, where i_max, j_max and t_max need to satisfy t_max > n/i_max + n/j_max + O(1), and the y_i are a series of vectors of length n; but in practice you can take the y_i as a sequence of unit vectors and simply write out the first i_max entries in your vectors at each time t.
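A sketch of computing this block sequence over GF(p), taking the y_i to be unit vectors so that the inner products are simply the first i_max entries of each M^t x_j (all parameters are illustrative, and the matrix Berlekamp–Massey step that consumes the sequence is omitted):

```python
import random

random.seed(3)
p, n = 101, 8
i_max, j_max = 2, 2
t_max = n // i_max + n // j_max + 2        # roughly n/i_max + n/j_max + O(1)

def mat_vec(M, v):
    return [sum(M[i][j] * v[j] for j in range(n)) % p for i in range(n)]

M = [[random.randrange(p) for _ in range(n)] for _ in range(n)]
X = [[random.randrange(p) for _ in range(n)] for _ in range(j_max)]   # the x_j

# with y_i = e_i, S[t][i][j] = y_i . M^t x_j = (M^t x_j)[i]
S = []
vs = [x[:] for x in X]
for t in range(t_max + 1):
    S.append([[vs[j][i] for j in range(j_max)] for i in range(i_max)])
    vs = [mat_vec(M, v) for v in vs]
```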
Invariant factor calculation
The block Wiedemann algorithm can be used to calculate the leading invariant factors of the matrix, i.e., the largest blocks of the Frobenius normal form. Given M ∈ F^{n×n} and U, V ∈ F^{n×b}, where F is a finite field of size q, the probability p that the leading k invariant factors of M are preserved in U^T M V is
p\geq\begin{cases}1/64,&\text{if }b=k+1\text{ and }q=2\\\left(1-\tfrac{3}{2^{b-k}}\right)^2\geq 1/16,&\text{if }b\geq k+2\text{ and }q=2\\\left(1-\tfrac{2}{q^{b-k}}\right)^2\geq 1/9,&\text{if }b\geq k+1\text{ and }q>2\end{cases}

.[1]
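A sketch of forming the projection U^T M V over GF(q) whose invariant factors are compared with those of M (the field size, dimension, and block size are illustrative, and computing the Frobenius normal form of each side is left to a computer algebra system):

```python
import random

random.seed(4)
q, n, b = 2, 12, 4          # field size, matrix dimension, block size

M = [[random.randrange(q) for _ in range(n)] for _ in range(n)]
U = [[random.randrange(q) for _ in range(b)] for _ in range(n)]
V = [[random.randrange(q) for _ in range(b)] for _ in range(n)]

def mat_mul(A, B):
    # dense matrix product over GF(q)
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) % q
             for j in range(len(B[0]))] for i in range(len(A))]

UT = [list(row) for row in zip(*U)]     # b x n transpose of U
P = mat_mul(mat_mul(UT, M), V)          # the b x b projection U^T M V
```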
References
- Wiedemann, D., "Solving sparse linear equations over finite fields," IEEE Trans. Inf. Theory IT-32 (1986), pp. 54–62.
- Coppersmith, D., "Solving homogeneous linear equations over GF(2) via block Wiedemann algorithm," Math. Comp. 62 (1994), pp. 333–350.
- Villard's 1997 research report 'A study of Coppersmith's block Wiedemann algorithm using matrix polynomials' (the cover material is in French but the content is in English) is a reasonable description.
- Thomé's paper 'Subquadratic computation of vector generating polynomials and improvement of the block Wiedemann algorithm' uses a more sophisticated FFT-based algorithm for computing the vector generating polynomials, and describes a practical implementation with i_max = j_max = 4 used to compute a kernel vector of a 484603×484603 matrix of entries modulo 2^607 − 1, and hence to compute discrete logarithms in the field GF(2^607).
Notes and References
- Harrison, Gavin; Johnson, Jeremy; Saunders, B. David, "Probabilistic analysis of block Wiedemann for leading invariant factors," Journal of Symbolic Computation 108 (2022), pp. 98–116. doi:10.1016/j.jsc.2021.06.005. arXiv:1803.03864.