In mathematics, Hensel's lemma, also known as Hensel's lifting lemma, named after Kurt Hensel, is a result in modular arithmetic, stating that if a univariate polynomial has a simple root modulo a prime number, then this root can be lifted to a unique root modulo any higher power of . More generally, if a polynomial factors modulo into two coprime polynomials, this factorization can be lifted to a factorization modulo any higher power of (the case of roots corresponds to the case of degree for one of the factors).
By passing to the "limit" (in fact this is an inverse limit) when the power of tends to infinity, it follows that a root or a factorization modulo can be lifted to a root or a factorization over the -adic integers.
These results have been widely generalized, under the same name, to the case of polynomials over an arbitrary commutative ring, where is replaced by an ideal, and "coprime polynomials" means "polynomials that generate an ideal containing ".
Hensel's lemma is fundamental in -adic analysis, a branch of analytic number theory.
The proof of Hensel's lemma is constructive, and leads to an efficient algorithm for Hensel lifting, which is fundamental for factoring polynomials, and gives the most efficient known algorithm for exact linear algebra over the rational numbers.
Hensel's original lemma concerns the relation between polynomial factorization over the integers and over the integers modulo a prime number and its powers. It can be straightforwardly extended to the case where the integers are replaced by any commutative ring, and is replaced by any maximal ideal (indeed, the maximal ideals of
\Z
p\Z,
Making this precise requires a generalization of the usual modular arithmetic, and so it is useful to define accurately the terminology that is commonly used in this context.
R\toR/I.
f\inR[X]
f\bmodI,
(R/I)[X]=R[X]/IR[X]
R/I.
R[X]
f-g\inIR[X].
h\inR[X],
R[X]
The lifting process is the inverse of reduction. That is, given objects depending on elements of
R/I,
R
R/Ik
For example, given a polynomial
h\inR[X]
Ik
f',g'\inR[X]
Originally, Hensel's lemma was stated (and proved) for lifting a factorization modulo a prime number of a polynomial over the integers to a factorization modulo any power of and to a factorization over the -adic integers. This can be generalized easily, with the same proof to the case where the integers are replaced by any commutative ring, the prime number is replaced by a maximal ideal, and the -adic integers are replaced by the completion with respect to the maximal ideal. It is this generalization, which is also widely used, that is presented here.
Let
akm
n+ … | |
h=\alpha | |
0X |
+\alphan-1X+\alphan
R[X]
\alpha0
akm.
Since
akm
R/akm
(R/akm)[X]
(R/akm)[X]
(R/akm)
Hensel's lemma asserts that every factorization of modulo
akm
akmk
More precisely, with the above hypotheses, if where and are monic and coprime modulo
akm,
fk
gk
\begin{align} h&\equiv\alpha0fkgk
k},\\ f | |
\pmod{akm | |
k&\equiv |
f\pmod{akm},\\ gk&\equivg\pmod{akm}, \end{align}
fk
gk
akmk.
An important special case is when
f=X-r.
h\bmodakm.
With above hypotheses and notations, if is a simple root of
h\bmodakm,
h\bmod{akmn}
rn\inR/{akm}n
rn
h\bmodakmn.
The fact that one can lift to
R/akmn
Given a maximal ideal
akm
akm
akm
Rakm,
\lim\leftarrowR/akmn.
\widehatRakm.
akm=p\Z,
\Zp.
The definition of the completion as an inverse limit, and the above statement of Hensel's lemma imply that every factorization into pairwise coprime polynomials modulo
akm
h\inR[X]
\widehatRakm[X].
akm
\widehatRakm[X].
Hensel's lemma is generally proved incrementally by lifting a factorization over
R/akmn
R/akmn+1
R/akm2n
The main ingredient of the proof is that coprime polynomials over a field satisfy Bézout's identity. That is, if and are coprime univariate polynomials over a field (here
R/akm
\dega<\degg,
\degb<\degf,
af+bg=1.
Bézout's identity allows defining coprime polynomials and proving Hensel's lemma, even if the ideal
akm
h\inR[X]
R/I
R/I
A-B\inIR[X].
Let be an ideal of a commutative ring, and
h\inR[X]
\alpha
\alpha
R/I
R/I
Suppose that for some positive integer there is a factorization
h\equiv\alphafg\pmod{Ik},
a,b\inR[X],
\deltaf,\deltag\inIkR[X],
\deg\deltaf<\degf,
\deg\deltag<\degg,
h\equiv\alpha(f+\deltaf)(g+\deltag)\pmod{Ik+1
Under these conditions,
\deltaf
\deltag
Ik+1R[X].
Moreover,
f+\deltaf
g+\deltag
The proof that follows is written for computing
\deltaf
\deltag
R/I
Ik/Ik+1.
R=\Z
I=p\Z,
Proof: By hypothesis,
\alpha
\beta\inR
\gamma\inIR[X]
\alpha\beta=1-\gamma.
Let
\deltah\inIkR[X],
\degh,
\deltah\equivh-\alphafg\pmod{Ik+1
\deltah=h-\alphafg,
R=\Z
I=p\Z,
k\delta' | |
\delta | |
h |
\delta'h
As is monic, the Euclidean division of
a\deltah
a\deltah=qg+c,
\degc<\degg.
IkR[X].
b\deltah=q'f+d,
\degd<\degf,
q',d\inIkR[X].
One has
q+q'\inIk+1R[X].
fc+gd=af\deltah+bg\deltah-fg(q+q')\equiv\deltah-fg(q+q')\pmod{Ik+1
fg
Ik+1
fg(q+q')
\degfg
q+q'\inIk+1R[X].
Thus, considering congruences modulo
Ik+1,
\begin{align} \alpha(f+\betad)&(g+\betac)-h\\ &\equiv\alphafg-h+\alpha\beta(f(a\deltah-qg)+g(b\deltah-q'f))\\ &\equiv\deltah(-1+\alpha\beta(af+bg))-\alpha\betafg(q+q')\\ &\equiv0\pmod{Ik+1
So, the existence assertion is verified with
\deltaf=\betad, \deltag=\betac.
Let,, and
\alpha
h\equiv\alphafg{\pmodI}
\degf0+\degg0=\degh.
k=1,2,\ldots,n-1\ldots,
\deltaf
\deltag
\deg\deltaf<\degf,
\deg\deltag<\degg,
h\equiv\alpha(f+\deltaf)(g+\deltag)\pmod{In}.
The polynomials
\deltaf
\deltag
In.
(\delta'f,\delta'g)
\delta'f\equiv\deltaf\pmod{In} and \delta'g\equiv\deltag\pmod{In}.
Proof: Since a congruence modulo
In
In-1,
\deltaf-\delta'f\inIn-1R[X] and \deltag-\delta'g\inIn-1R[X].
By hypothesis, has
h\equiv\alpha(f+\deltaf)(g+\deltag)\equiv\alpha(f+\delta'f)(g+\delta'g)\pmod{In},
\begin{align} \alpha(f+\deltaf)(g+\deltag)&-\alpha(f+\delta'f)(g+\delta'g)\\ &=\alpha(f(\deltag-\delta'g)+g(\deltaf-\delta'f))+\alpha(\deltaf(\deltag-\delta'g)-\deltag(\deltaf-\delta'f))\inInR[X]. \end{align}
By induction hypothesis, the second term of the latter sum belongs to
In,
\alpha
\beta\inR
\gamma\inI
\alpha\beta=1+\gamma.
\begin{align} f(\deltag-\delta'g)&+g(\deltaf-\delta'f)\\ &=\alpha\beta(f(\deltag-\delta'g)+g(\deltaf-\delta'f))-\gamma(f(\deltag-\delta'g)+g(\deltaf-\delta'f))\inInR[X], \end{align}
The coprimality modulo implies the existence of
a,b\inR[X]
\begin{align} \deltag-\delta'g&\equiv(af+bg)(\deltag-\delta'g)\\ &\equivg(b(\deltag-\delta'g)-a(\deltaf-\delta'f))\pmod{In}. \end{align}
\degg
In
w\inInR[X],
\deltag-\delta'g\inInR[X].
\deltaf-\delta'f
InR[X],
Linear lifting allows lifting a factorization modulo
In
In+1.
I2n,
In
For lifting up to modulo
IN
N=2k,
IN
Quadratic lifting is based on the following property.
Suppose that for some positive integer there is a factorization
h\equiv\alphafg\pmod{Ik},
a,b\inR[X],
\deltaf,\deltag\inIkR[X],
\deg\deltaf<\degf,
\deg\deltag<\degg,
h\equiv\alpha(f+\deltaf)(g+\deltag)\pmod{I2k
Moreover,
f+\deltaf
g+\deltag
(a+\deltaa)(f+\deltaf)+(b+\deltab)(g+\deltag)\equiv1\pmod{I2k
Proof: The first assertion is exactly that of linear lifting applied with to the ideal
Ik
I.
Let
\alpha=af+bg-1\inIkR[X].
a(f+\deltaf)+b(g+\deltag)=1+\Delta,
\Delta=\alpha+a\deltaf+b\deltag\inIkR[X].
\deltaa=-a\Delta
\deltab=-b\Delta,
(a+\deltaa)(f+\deltaf)+(b+\deltab)(g+\delta
2\in | |
g)=1-\Delta |
I2kR[X],
Let
f(X)=X6-2\inQ[X].
Modulo 2, Hensel's lemma cannot be applied since the reduction of
f(X)
\bar{f}(X)=X6-\overline{2}=X6
X
f(X)
\Q2[X].
k=F7
\bar{f}(X)=X6-\overline{2}=X6-\overline{16}=(X3-\overline{4}) (X3+\overline{4})
4
F7
F7,
F7
X6-2
\Z7[X]
\Q7[X]
f(X)=X6-2=(X3-\alpha) (X3+\alpha),
\alpha=\ldots4504547
\Z7
F727[X]
\bar{f}(X)=X6-\overline{2}=(X-\overline{3}) (X-\overline{116}) (X-\overline{119}) (X-\overline{608}) (X-\overline{611}) (X-\overline{724})
\Z727[X]
\Q727[X]
X-\beta
\beta=\left\{\begin{array}{rrr}3 +&545 ⋅ 727 +&537 ⋅ 7272+&161 ⋅ 7273+\ldots\\116 +&48 ⋅ 727 +&130 ⋅ 7272+&498 ⋅ 7273+\ldots\\119 +&593 ⋅ 727 +&667 ⋅ 7272+&659 ⋅ 7273+\ldots\\608 +&133 ⋅ 727 +&59 ⋅ 7272+&67 ⋅ 7273+\ldots\\611 +&678 ⋅ 727 +&596 ⋅ 7272+&228 ⋅ 7273+\ldots\\724 +&181 ⋅ 727 +&189 ⋅ 7272+&565 ⋅ 7273+\ldots\end{array}\right.
Let
f(x)
f(r)\equiv0\bmodpk and f'(r)\not\equiv0\bmodp
then, for every
m>0
f(s)\equiv0\bmodpk+m and r\equivs\bmodpk.
Furthermore, this s is unique modulo pk+m, and can be computed explicitly as the integer such that
s=r-f(r) ⋅ a,
where
a
a\equiv[f'(r)]-1\bmodpm.
Note that
f(r)\equiv0\bmodpk
s\equivr\bmodpk
f'(r)\equiv0\bmodp
We use the Taylor expansion of f around r to write:
f(s)=
N | |
\sum | |
n=0 |
cn(s-r)n, cn=f(n)(r)/n!.
From
r\equivs\bmodpk,
\begin{align} f(s)&=
N | |
\sum | |
n=0 |
cn\left(tpk\right)n\\ &=f(r)+tpkf'(r)+
N | |
\sum | |
n=2 |
cntnpkn\\ &=f(r)+tpkf'(r)+p2kt2g(t)&&g(t)\in\Z[t]\\ &=zpk+tpkf'(r)+p2kt2g(t)&&f(r)\equiv0\bmodpk\\ &=(z+tf'(r))pk+p2kt2g(t) \end{align}
For
m\leqslantk,
\begin{align} f(s)\equiv0\bmodpk+m&\Longleftrightarrow(z+tf'(r))pk\equiv0\bmodpk+m\\ &\Longleftrightarrowz+tf'(r)\equiv0\bmodpm\\ &\Longleftrightarrowtf'(r)\equiv-z\bmodpm\\ &\Longleftrightarrowt\equiv-z[f'(r)]-1\bmodpm&&p\nmidf'(r) \end{align}
The assumption that
f'(r)
f'(r)
pm
pm,
pk+m.
Using the above hypotheses, if we consider an irreducible polynomial
f(x)=a0+a1x+ … +anxn\inK[X]
a0,an ≠ 0
|f|=max\{|a0|,|an|\}
f(X)=X6+10X-1
Q2[X]
\begin{align} |f(X)|&=max\{|a0|,\ldots,|an|\}\\ &=max\{0,1,0\}=1 \end{align}
max\{|a0|,|an|\}=0
Q7[X]
Note that given an
a\inFp
y\mapstoyp
xp-a
\begin{align} | d |
dx |
(xp-a)&=p ⋅ xp-1\\ &\equiv0 ⋅ xp-1\bmodp\\ &\equiv0\bmodp \end{align}
a
Zp
a=1
Zp
\mup
Although the pth roots of unity are not contained in
Fp
xp-x=x(xp-1-1)
\begin{align} | d |
dx |
(xp-x)&=pxp-1-1\\ &\equiv-1\bmodp \end{align}
Zp
ap=a,
x | |
F | |
p |
Using the lemma, one can "lift" a root r of the polynomial f modulo pk to a new root s modulo pk+1 such that (by taking ; taking larger m follows by induction). In fact, a root modulo pk+1 is also a root modulo pk, so the roots modulo pk+1 are precisely the liftings of roots modulo pk. The new root s is congruent to r modulo p, so the new root also satisfies
f'(s)\equivf'(r)\not\equiv0\bmodp.
f(x)\equiv0\bmodpk
f'(rk)\not\equiv0\bmodp
What happens to this process if r is not a simple root mod p? Suppose that
f(r)\equiv0\bmodpk and f'(r)\equiv0\bmodp.
s\equivr\bmodpk
f(s)\equivf(r)\bmodpk+1.
f(r+tpk)\equivf(r)\bmodpk+1
f(r)\not\equiv0\bmodpk+1
f(r)\equiv0\bmodpk+1
Example. To see both cases we examine two different polynomials with :
f(x)=x2+1
f(1)\equiv0\bmod2
f'(1)\equiv0\bmod2.
f(1)\not\equiv0\bmod4
g(x)=x2-17
g(1)\equiv0\bmod2
g'(1)\equiv0\bmod2.
g(1)\equiv0\bmod4,
In the -adic numbers, where we can make sense of rational numbers modulo powers of p as long as the denominator is not a multiple of p, the recursion from rk (roots mod pk) to rk+1 (roots mod pk+1) can be expressed in a much more intuitive way. Instead of choosing t to be an(y) integer which solves the congruence
tf'(rk)\equiv
k | |
-(f(r | |
k)/p |
)\bmodpm,
let t be the rational number (the pk here is not really a denominator since f(rk) is divisible by pk):
k | |
-(f(r | |
k)/p |
)/f'(rk).
Then set
rk+1=rk+tpk=rk-
f(rk) | |
f'(rk) |
.
This fraction may not be an integer, but it is a -adic integer, and the sequence of numbers rk converges in the -adic integers to a root of f(x) = 0. Moreover, the displayed recursive formula for the (new) number rk+1 in terms of rk is precisely Newton's method for finding roots to equations in the real numbers.
By working directly in the -adics and using the -adic absolute value, there is a version of Hensel's lemma which can be applied even if we start with a solution of f(a) ≡ 0 mod p such that
f'(a)\equiv0\bmodp.
f'(a)
|f(a)|p<
2, | |
|f'(a)| | |
p |
then there is a unique -adic integer b such f(b) = 0 and
|b-a|p<|f'(a)|p.
|b-a|p<|f'(a)|p
The statement of Hensel's lemma given above (taking
m=1
f'(a)\not\equiv0\bmodp
|f(a)|p<1
|f'(a)|p=1.
Suppose that p is an odd prime and a is a non-zero quadratic residue modulo p. Then Hensel's lemma implies that a has a square root in the ring of -adic integers
\Zp.
f(x)=x2-a.
f(r)=r2-a\equiv0\bmodp and f'(r)=2r\not\equiv0\bmodp,
where the second condition is dependent on the fact that p is odd. The basic version of Hensel's lemma tells us that starting from r1 = r we can recursively construct a sequence of integers
\{rk\}
rk+1\equivrk\bmodpk,
2 | |
r | |
k |
\equiva\bmodpk.
This sequence converges to some -adic integer b which satisfies b2 = a. In fact, b is the unique square root of a in
\Zp
\Zp
To make the discussion above more explicit, let us find a "square root of 2" (the solution to
x2-2=0
r1=3
r2
\begin{align} f(r1)&=32-2=7
1 | |
\\ f(r | |
1)/p |
&=7/7=1\\ f'(r1)&=2r1=6\end{align}
Based on which the expression
tf'(r1)\equiv
k)\bmod | |
-(f(r | |
1)/p |
p,
turns into:
t ⋅ 6\equiv-1\bmod7
which implies
t=1.
r2=r1+tp1=3+1 ⋅ 7=10=137.
And sure enough,
102\equiv2\bmod72.
r2=r1-f(r1)/f'(r1)=3-7/6=11/6,
11/6\equiv10\bmod72.
We can continue and find
r3=108=3+7+2 ⋅ 72=2137
\Z7
3+7+2 ⋅ 72+6 ⋅ 73+74+2 ⋅ 75+76+2 ⋅ 77+4 ⋅ 78+ … .
If we started with the initial choice
r1=4
\Z7
As an example where the original version of Hensel's lemma is not valid but the more general one is, let
f(x)=x2-17
a=1.
f(a)=-16
f'(a)=2,
|f(a)|2<
2, | |
|f'(a)| | |
2 |
which implies there is a unique 2-adic integer b satisfying
b2=17 and |b-a|2<|f'(a)|2=
1 | |
2 |
,
i.e., b ≡ 1 mod 4. There are two square roots of 17 in the 2-adic integers, differing by a sign, and although they are congruent mod 2 they are not congruent mod 4. This is consistent with the general version of Hensel's lemma only giving us a unique 2-adic square root of 17 that is congruent to 1 mod 4 rather than mod 2. If we had started with the initial approximate root a = 3 then we could apply the more general Hensel's lemma again to find a unique 2-adic square root of 17 which is congruent to 3 mod 4. This is the other 2-adic square root of 17.
In terms of lifting the roots of
x2-17
1 mod 2 → 1, 3 mod 4
1 mod 4 → 1, 5 mod 8 and 3 mod 4 → 3, 7 mod 8
1 mod 8 → 1, 9 mod 16 and 7 mod 8 → 7, 15 mod 16, while 3 mod 8 and 5 mod 8 don't lift to roots mod 16
9 mod 16 → 9, 25 mod 32 and 7 mod 16 → 7, 23 mod 16, while 1 mod 16 and 15 mod 16 don't lift to roots mod 32.
For every k at least 3, there are four roots of x2 − 17 mod 2k, but if we look at their 2-adic expansions we can see that in pairs they are converging to just two 2-adic limits. For instance, the four roots mod 32 break up into two pairs of roots which each look the same mod 16:
9 = 1 + 23 and 25 = 1 + 23 + 24.
7 = 1 + 2 + 22 and 23 = 1 + 2 + 22 + 24.
The 2-adic square roots of 17 have expansions
1+23+25+26+27+29+210+ …
1+2+22+24+28+211+ …
Another example where we can use the more general version of Hensel's lemma but not the basic version is a proof that any 3-adic integer c ≡ 1 mod 9 is a cube in
\Z3.
f(x)=x3-c
f'(r)\equiv0\bmod3
|f(1)|3
2, | |
<|f'(1)| | |
3 |
c\equiv1\bmod27.
In a similar way, after some preliminary work, Hensel's lemma can be used to show that for any odd prime number p, any -adic integer c congruent to 1 modulo p2 is a p-th power in
\Zp.
ak{m},
f(x)\inA[x].
f(a)\equiv0\bmodf'(a)2ak{m}.
If f has an approximate root then it has an exact root b ∈ A "close to" a; that is,
f(b)=0 and b\equiva\bmod{akm}.
Furthermore, if
f'(a)
This result can be generalized to several variables as follows:
Theorem. Let A be a commutative ring that is complete with respect to ideal
ak{m}\subsetA.
f1,\ldots,fn\inA[x1,\ldots,xn]
f=(f1,\ldots,fn),
Jf(x)
fi(a)\equiv0\bmod(\detJf(a))2ak{m}, 1\leqslanti\leqslantn.
Then there is some b = (b1, ..., bn) ∈ An satisfying f(b) = 0, i.e.,
fi(b)=0, 1\leqslanti\leqslantn.
Furthermore this solution is "close" to a in the sense that
bi\equivai\bmod\detJf(a)ak{m}, 1\leqslanti\leqslantn.
As a special case, if
fi(a)\equiv0\bmodak{m}
\detJf(a)
bi\equivai\bmodak{m}
When n = 1, a = a is an element of A and
Jf(a)=Jf(a)=f'(a).
Completeness of a ring is not a necessary condition for the ring to have the Henselian property: Goro Azumaya in 1950 defined a commutative local ring satisfying the Henselian property for the maximal ideal m to be a Henselian ring.
Masayoshi Nagata proved in the 1950s that for any commutative local ring A with maximal ideal m there always exists a smallest ring Ah containing A such that Ah is Henselian with respect to mAh. This Ah is called the Henselization of A. If A is noetherian, Ah will also be noetherian, and Ah is manifestly algebraic as it is constructed as a limit of étale neighbourhoods. This means that Ah is usually much smaller than the completion  while still retaining the Henselian property and remaining in the same category.