Sieve theory explained

Sieve theory is a set of general techniques in number theory, designed to count, or more realistically to estimate the size of, sifted sets of integers. The prototypical example of a sifted set is the set of prime numbers up to some prescribed limit X. Correspondingly, the prototypical example of a sieve is the sieve of Eratosthenes, or the more general Legendre sieve. The direct attack on prime numbers using these methods soon reaches apparently insuperable obstacles, in the way of the accumulation of error terms. In one of the major strands of number theory in the twentieth century, ways were found of avoiding some of the difficulties of a frontal attack with a naive idea of what sieving should be.

One successful approach is to approximate a specific sifted set of numbers (e.g. the set of prime numbers) by another, simpler set (e.g. the set of almost prime numbers), which is typically somewhat larger than the original set, and easier to analyze. More sophisticated sieves also do not work directly with sets per se, but instead count them according to carefully chosen weight functions on these sets (options for giving some elements of these sets more "weight" than others). Furthermore, in some modern applications, sieves are used not to estimate the size of a siftedset, but to produce a function that is large on the set and mostly small outside it, while being easier to analyze than the characteristic function of the set.

The term sieve was first used by the Norwegian mathematician Viggo Brun in 1915.^[1] However Brun's work was inspired by the works of the French mathematician Jean Merlin who died in the World War I and only two of his manuscripts survived.^[2]

Basic sieve theory

For information on notation see at the end.

We start with some countable sequence of non-negative numbers

l{A}=(a_n)

. In the most basic case this sequence is just the indicator function

a_n=1_A(n)

of some set

A=\{s:s\leqx\}

we want to sieve. However this abstraction allows for more general situations. Next we introduce a general set of prime numbers called the sifting range

l{P}\subseteqP

and their product up to

as a function

P(z)=\prod\limits_p\inl{P,p<z}p

The goal of sieve theory is to estimate the sifting function

S(l{A},l{P},z)=\sum\limits_n\leqa_n.

In the case of

a_n=1_A(n)

this just counts the cardinality of a subset

A_{\operatorname{sift}

}\subseteq A of numbers, that are coprime to the prime factors of

P(z)

The inclusion–exclusion principle

For

l{P}

define

A_{\operatorname{sift}

}:=\, \quad p_1,\dots,p_k\in\mathcaland for each prime

p\inl{P}

denote the subset

E_p\subseteqA

of multiples

E_{p:=\{pn:n\inN\}}

and let

|E_p|

be the cardinality.

We now introduce a way to calculate the cardinality of

A_{\operatorname{sift}

}. For this the sifting range

l{P}

will be a concrete example of primes of the form

l{P}:=\{2,3,5,7,11,13...\}

If one wants to calculate the cardinality of

A_{\operatorname{sift}

}, one can apply the inclusion–exclusion principle. This algorithm works like this: first one removes from the cardinality of

|A|

the cardinality

|E_2|

and

|E_3|

. Now since one has removed the numbers that are divisble by

and

twice, one has to add the cardinality

|E_6|

. In the next step one removes

|E_5|

and adds

|E₁₀|

and

|E₁₅|

again. Additionally one has now to remove

|E₃₀|

, i.e. the cardinality of all numbers divisible by

2,3

and

. This leads to the inclusion–exclusion principle

|A_{\operatorname{sift}

}|=|A|-|E_2|-|E_3|+|E_6|-|E_5|+|E_|+|E_|-|E_|+\cdots

Notice that one can write this as

|A_{\operatorname{sift}

}|=\sum\limits_ \mu(d)| E_d|where

\mu

is the Möbius function and

P:=\prod\limits_p\inl{P

} p the product of all primes in

l{P}

and

E_1:=A

Legendre's identity

We can rewrite the sifting function with Legendre's identity

S(l{A},l{P},z)=\sum\limits_d\mid\mu(d)A_d(x)

by using the Möbius function and some functions

A_d(x)

induced by the elements of

l{P}

A_{d(x)=\sum\limits}_n\leq

}a_n.

Example

Let

z=7

and

l{P}=P

. The Möbius function is negative for every prime, so we get

\begin{align} S(l{A},P,7)&=A_1(x)-A_2(x)-A_3(x)-A_5(x)+A_6(x)+A₁₀(x)+A₁₅(x)-A₃₀(x). \end{align}

Approximation of the congruence sum

One assumes then that

A_d(x)

can be written as

A_d(x)=g(d)X+r_d(x)

where

g(d)

is a density, meaning a multiplicative function such that

g(1)=1, 0\leqg(p)<1 p\inP

and

is an approximation of

A_1(x)

and

r_d(x)

is some remainder term. The sifting function becomes

S(l{A},l{P},z)=X\sum\limits_d\mid\mu(d)g(d)+\sum\limits_d\mid\mu(d)r_d(x)

or in short

S(l{A},l{P},z)=XG(x,z)+R(x,z).

One tries then to estimate the sifting function by finding upper and lower bounds for

respectively

and

The partial sum of the sifting function alternately over- and undercounts, so the remainder term will be huge. Brun's idea to improve this was to replace

\mu(d)

in the sifting function with a weight sequence

(λ_d)

consisting of restricted Möbius functions. Choosing two appropriate sequences

	-
(λ
	d

)

and

	+
(λ
	d

)

and denoting the sifting functions with

S^-

and

S⁺

, one can get lower and upper bounds for the original sifting functions

S^-\leqS\leqS⁺.

Since

is multiplicative, one can also work with the identity

\sum\limits_d\mid\mu(d)g(d)=\prod\limits_\begin{array{c}p|n; p\inP\end{array}}(1-g(p)), \forall n\inN.

Notation: a word of caution regarding the notation, in the literature one often identifies the set of sequences

l{A}

with the set

itself. This means one writes

l{A}=\{s:s\leqx\}

to define a sequence

l{A}=(a_n)

. Also in the literature the sum

A_d(x)

is sometimes notated as the cardinality

|A_d(x)|

of some set

A_d(x)

, while we have defined

A_d(x)

to be already the cardinality of this set. We used

to denote the set of primes and

(a,b)

for the greatest common divisor of

and

Types of sieving

Modern sieves include the Brun sieve, the Selberg sieve, the Turán sieve, the large sieve, the larger sieve and the Goldston-Pintz-Yıldırım sieve. One of the original purposes of sieve theory was to try to prove conjectures in number theory such as the twin prime conjecture. While the original broad aims of sieve theory still are largely unachieved, there have been some partial successes, especially in combination with other number theoretic tools. Highlights include:

Brun's theorem, which shows that the sum of the reciprocals of the twin primes converges (whereas the sum of the reciprocals of all primes diverges);
Chen's theorem, which shows that there are infinitely many primes p such that p + 2 is either a prime or a semiprime (the product of two primes); a closely related theorem of Chen Jingrun asserts that every sufficiently large even number is the sum of a prime and another number which is either a prime or a semiprime. These can be considered to be near-misses to the twin prime conjecture and the Goldbach conjecture respectively.
The fundamental lemma of sieve theory, which asserts that if one is sifting a set of N numbers, then one can accurately estimate the number of elements left in the sieve after

N^\varepsilon

iterations provided that

\varepsilon

is sufficiently small (fractions such as 1/10 are quite typical here). This lemma is usually too weak to sieve out primes (which generally require something like

N^1/2

iterations), but can be enough to obtain results regarding almost primes.

The Friedlander–Iwaniec theorem, which asserts that there are infinitely many primes of the form

a²+b⁴

Zhang's theorem, which shows that there are infinitely many pairs of primes within a bounded distance. The Maynard–Tao theorem generalizes Zhang's theorem to arbitrarily long sequences of primes.

Techniques of sieve theory

The techniques of sieve theory can be quite powerful, but they seem to be limited by an obstacle known as the parity problem, which roughly speaking asserts that sieve theory methods have extreme difficulty distinguishing between numbers with an odd number of prime factors and numbers with an even number of prime factors. This parity problem is still not very well understood.

Compared with other methods in number theory, sieve theory is comparatively elementary, in the sense that it does not necessarily require sophisticated concepts from either algebraic number theory or analytic number theory. Nevertheless, the more advanced sieves can still get very intricate and delicate (especially when combined with other deep techniques in number theory), and entire textbooks have been devoted to this single subfield of number theory; a classic reference is and a more modern text is .

The sieve methods discussed in this article are not closely related to the integer factorization sieve methods such as the quadratic sieve and the general number field sieve. Those factorization methods use the idea of the sieve of Eratosthenes to determine efficiently which members of a list of numbers can be completely factored into small primes.

Literature

- - - Book: Harman, Glyn . Glyn Harman . Prime-detecting sieves . London Mathematical Society Monographs . 33 . Princeton University Press . 2007 . 978-0-691-12437-7 . 1220.11118 . 2331072 . Princeton, NJ.
Book: Heini . Halberstam . Heini Halberstam . Hans-Egon . Richert . Hans-Egon Richert . Sieve Methods . . 1974 . 0-12-318250-6 . London-New York . 0424730 . London Mathematical Society Monographs . 4.
- - Small gaps between primes . Maynard . James . James Maynard (mathematician) . 2015 . . 181 . 1 . 383–413 . 3272929 . 10.4007/annals.2015.181.1.7 . 1311.4600 .
- Bounded gaps between primes . Yitang . Zhang . Yitang Zhang . . 2014 . 179 . 3 . 1121–1174 . 10.4007/annals.2014.179.3.7 . 3171761 . free .

References

Viggo . Brun . 1915 . Archiv for Math. Naturvidenskab . Über das Goldbachsche Gesetz und die Anzahl der Primzahlpaare . 34.
Book: Alina Carmen . Cojocaru . M. Ram . Murty . An Introduction to Sieve Methods and Their Applications . Cambridge University Press . 10.1017/CBO9780511615993 . 2005. 978-0-521-84816-9 .