Roth's theorem on arithmetic progressions explained

Roth's theorem on arithmetic progressions is a result in additive combinatorics concerning the existence of arithmetic progressions in subsets of the natural numbers. It was first proven by Klaus Roth in 1953.^[1] Roth's theorem is a special case of Szemerédi's theorem for the case

k=3

Statement

A subset A of the natural numbers is said to have positive upper density if

\limsup_n

	\|A\cap\{1,2,3,...c,n\
	\|}{n}

Roth's theorem on arithmetic progressions (infinite version): A subset of the natural numbers with positive upper density contains a 3-term arithmetic progression.

An alternate, more qualitative, formulation of the theorem is concerned with the maximum size of a Salem–Spencer set which is a subset of

[N]=\{1,...,N\}

. Let

r_3([N])

be the size of the largest subset of

[N]

which contains no 3-term arithmetic progression.

Roth's theorem on arithmetic progressions (finitary version):

r_3([N])=o(N)

.

Improving upper and lower bounds on

r_3([N])

is still an open research problem.

History

The first result in this direction was Van der Waerden's theorem in 1927, which states that for sufficiently large N, coloring the integers

1,...,n

with

colors will result in a

term arithmetic progression.^[2]

Later on in 1936 Erdős and Turán conjectured a much stronger result that any subset of the integers with positive density contains arbitrarily long arithmetic progressions. In 1942, Raphaël Salem and Donald C. Spencer provided a construction of a 3-AP-free set (i.e. a set with no 3-term arithmetic progressions) of size

	N
	e^O(log

,^[3] disproving an additional conjecture of Erdős and Turán that

r_3([N])=N¹

for some

\delta>0

.^[4]

In 1953, Roth partially resolved the initial conjecture by proving they must contain an arithmetic progression of length 3 using Fourier analytic methods. Eventually, in 1975, Szemerédi proved Szemerédi's theorem using combinatorial techniques, resolving the original conjecture in full.

Proof techniques

The original proof given by Roth used Fourier analytic methods. Later on another proof was given using Szemerédi's regularity lemma.

Proof sketch via Fourier analysis

In 1953, Roth used Fourier analysis to prove an upper bound of

r_3([N])=O\left(

	N
	loglogN

\right)

. Below is a sketch of this proof.

Define the Fourier transform of a function

f:Z → C

to be the function

\widehat{f}

satisfying

\widehat{f}(\theta)=\sum_xf(x)e(-x\theta)

,where

e(t)=e^2\pi

Let

be a 3-AP-free subset of

\{1,...,N\}

. The proof proceeds in 3 steps.

Show that a

admits a large Fourier coefficient.

Deduce that there exists a sub-progression of

\{1,...,N\}

such that

has a density increment when restricted to this subprogression.

Iterate Step 2 to obtain an upper bound on

|A|

Step 1

For functions,

f,g,h:Z → C,

define

Λ(f,g,h)=\sum_x,f(x)g(x+y)h(x+2y)

Counting Lemma Let

f,g:Z → C

satisfy
\sum_n|f(n)|^2,\sum_n|g(n)|²\leM

. Define
Λ_3(f)=Λ(f,f,f)

. Then
|Λ_3(f)-Λ_3(g)|\le3M\|\widehat{f-g}\|_infty

.

The counting lemma tells us that if the Fourier Transforms of

and

are "close", then the number of 3-term arithmetic progressions between the two should also be "close." Let

\alpha=|A|/N

be the density of

. Define the functions

f=1_A

(i.e the indicator function of

), and

g=\alpha ⋅ 1_[N]

. Step 1 can then be deduced by applying the Counting Lemma to

and

, which tells us that there exists some

\theta

such that

	N
\left\|\sum
	n=1

(1_A-\alpha)(n)e(\thetan)\right|\ge

	\alpha²
	10

Step 2

Given the

\theta

from step 1, we first show that it's possible to split up

[N]

into relatively large subprogressions such that the character

x\mapstoe(x\theta)

is roughly constant on each subprogression.

Lemma 1: Let

0<η<1,\theta\inR

. Assume that
N>Cη^-6

for a universal constant
C

. Then it is possible to partition
[N]

into arithmetic progressions
P_i

with length
N^1/3\le|P_i|\le2N^1/3

such that
\sup
x,y\inP_i

|e(x\theta)-e(y\theta)|<η

for all
i

.

Next, we apply Lemma 1 to obtain a partition into subprogressions. We then use the fact that

\theta

produced a large coefficient in step 1 to show that one of these subprogressions must have a density increment:

Lemma 2: Let

A

be a 3-AP-free subset of
[N]

, with
|A|=\alphaN

and
N>C\alpha^-12

. Then, there exists a sub progression
P\subset[N]

such that
|P|\geN^1/3

and
|A\capP|\ge(\alpha+\alpha^2/40)|P|

.

Step 3

We now iterate step 2. Let

a_t

be the density of

after the

th iteration. We have that

\alpha₀=\alpha,

and

\alpha_t\ge\alpha+\alpha^2/40.

First, see that

\alpha

doubles (i.e. reach

such that

\alpha_T\ge2\alpha₀

) after at most

40/\alpha+1

steps. We double

\alpha

again (i.e reach

\alpha_T\ge4\alpha₀

) after at most

20/\alpha+1

steps. Since

\alpha\le1

, this process must terminate after at most

O(1/\alpha)

steps.

Let

N_t

be the size of our current progression after

iterations. By Lemma 2, we can always continue the process whenever

N_t\ge

	-12
C\alpha
	t

and thus when the process terminates we have that

N_t\le

	-12
C\alpha
	t

\leC\alpha^-12.

Also, note that when we pass to a subprogression, the size of our set decreases by a cube root. Therefore

N\le

	3^t
N
	t

\le(C\alpha^-12

	3^O(1/\alpha)
)

	e^O(1/\alpha)
e

Therefore

\alpha=O(1/loglogN),

|A|=O\left(

	N
	loglogN

\right),

as desired.

\blacksquare

Unfortunately, this technique does not generalize directly to larger arithmetic progressions to prove Szemerédi's theorem. An extension of this proof eluded mathematicians for decades until 1998, when Timothy Gowers developed the field of higher-order Fourier analysis specifically to generalize the above proof to prove Szemerédi's theorem.^[5]

Proof sketch via graph regularity

Below is an outline of a proof using the Szemerédi regularity lemma.

Let

be a graph and

X,Y\subseteqV(G)

. We call

(X,Y)

\epsilon

-regular pair if for all

A\subsetX,B\subsetY

with

|A|\geq\epsilon|X|,|B|\geq\epsilon|Y|

, one has

|d(A,B)-d(X,Y)|\leq\epsilon

A partition

l{P}=\{V_1,\ldots,V_k\}

V(G)

is an

\epsilon

-regular partition if

\sum
	(i,j)\in[k]^2,(V_i,V_{j)not\epsilon-regular}

|V_i||V

	2

	j\|\leq\epsilon\|V(G)\|

Then the Szemerédi regularity lemma says that for every

\epsilon>0

, there exists a constant

such that every graph has an

\epsilon

-regular partition into at most

parts.

We can also prove that triangles between

\epsilon

-regular sets of vertices must come along with many other triangles. This is known as the triangle counting lemma.

Triangle Counting Lemma: Let

G

be a graph and
X,Y,Z

be subsets of the vertices of
G

such that
(X,Y),(Y,Z),(Z,X)

are all
\epsilon

-regular pairs for some
\epsilon>0

. Let
d_XY,d_XZ,d_YZ

denote the edge densities
d(X,Y),d(X,Z),d(Y,Z)

respectively. If
d_XY,d_XZ,d_YZ\ge2\epsilon

, then the number of triples
(x,y,z)\inX x Y x Z

such that
x,y,z

form a triangle in
G

is at least
(1-2\epsilon)(d_XY-\epsilon)(d_XZ-\epsilon)(d_YZ-\epsilon) ⋅ |X||Y||Z|

.

Using the triangle counting lemma and the Szemerédi regularity lemma, we can prove the triangle removal lemma, a special case of the graph removal lemma.

Triangle Removal Lemma: For all

\epsilon>0

, there exists
\delta>0

such that any graph on
n

vertices with less than or equal to
\deltan³

triangles can be made triangle-free by removing at most
\epsilonn²

edges.

This has an interesting corollary pertaining to graphs

vertices where every edge of

lies in a unique triangle. In specific, all of these graphs must have

o(N²⁾

edges.

Take a set

with no 3-term arithmetic progressions. Now, construct a tripartite graph

whose parts

X,Y,Z

are all copies of

Z/(2N+1)Z

. Connect a vertex

x\inX

to a vertex

y\inY

y-x\inA

. Similarly, connect

z\inZ

with

y\inY

z-y\inA

. Finally, connect

x\inX

with

z\inZ

(z-x)/2\inA

This construction is set up so that if

x,y,z

form a triangle, then we get elements

y-x,

	z-x
	2

,z-y

that all belong to

. These numbers form an arithmetic progression in the listed order. The assumption on

then tells us this progression must be trivial: the elements listed above are all equal. But this condition is equivalent to the assertion that

x,y,z

is an arithmetic progression in

Z/(2N+1)Z

. Consequently, every edge of

lies in exactly one triangle. The desired conclusion follows.

\blacksquare

Extensions and generalizations

Szemerédi's theorem resolved the original conjecture and generalized Roth's theorem to arithmetic progressions of arbitrary length. Since then it has been extended in multiple fashions to create new and interesting results.

Furstenberg and Katznelson^[6] used ergodic theory to prove a multidimensional version and Leibman and Bergelson^[7] extended it to polynomial progressions as well. Most recently, Green and Tao proved the Green–Tao theorem which says that the prime numbers contain arbitrarily long arithmetic progressions. Since the prime numbers are a subset of density 0, they introduced a "relative" Szemerédi theorem which applies to subsets with density 0 that satisfy certain pseudorandomness conditions. Later on Conlon, Fox, and Zhao^[8] ^[9] strengthened this theorem by weakening the necessary pseudorandomness condition. In 2020, Bloom and Sisask^[10] proved that any set

such that

\sum_n

	1
	n

diverges must contain arithmetic progressions of length 3; this is the first non-trivial case of another conjecture of Erdős postulating that any such set must in fact contain arbitrarily long arithmetic progressions.

Improving bounds

There has also been work done on improving the bound in Roth's theorem. The bound from the original proof of Roth's theorem showed that

r_3([N])\leqc ⋅

	N
	loglogN

for some constant

. Over the years this bound has been continually lowered by Szemerédi,^[11] Heath-Brown,^[12] Bourgain,^[13] ^[14] and Sanders.^[15] ^[16] The current (July 2020) best bound is due to Bloom and Sisask who have showed the existence of an absolute constant c>0 such that

r_3([N])\leq

	N
	(logN)^1+c

In February 2023 a preprint^[17] ^[18] (later published^[19]) by Kelley and Meka gave a new bound of:

r_3([N])\leq

	-\Omega((logN)^1/12)
2

⋅ N

Four days later, Bloom and Sisask published a preprint giving an exposition of the result^[20] (later published^[21]), simplifying the argument and yielding some additional applications. Several months later, Bloom and Sisask obtained a further improvement to

r_3([N])\leq\exp(-c(logN)^1/9)N

, and stated (without proof) that their techniques can be used to show

r_3([N])\leq\exp(-c(logN)^5/41)N

.^[22]

There has also been work done on the other end, constructing the largest set with no three-term arithmetic progressions. The best construction has barely been improved since 1946 when Behrend^[23] improved on the initial construction by Salem and Spencer and proved

r_3([N])\geqN\exp(-c\sqrt{logN})

Due to no improvements in over 70 years, it is conjectured that Behrend's set is asymptotically very close in size to the largest possible set with no three-term progressions. If correct, the Kelley-Meka bound will prove this conjecture.

Roth's theorem in finite fields

As a variation, we can consider the analogous problem over finite fields. Consider the finite field

	n
F
	3

, and let

r_3(F

	n)

	3

be the size of the largest subset of

	n
F
	3

which contains no 3-term arithmetic progression. This problem is actually equivalent to the cap set problem, which asks for the largest subset of

	n
F
	3

such that no 3 points lie on a line. The cap set problem can be seen as a generalization of the card game Set.

In 1982, Brown and Buhler^[24] were the first to show that

r_3(F

	n)

	3

=o(3^n).

In 1995, Roy Mesuhlam^[25] used a similar technique to the Fourier-analytic proof of Roth's theorem to show that

r_3(F

	n)

	3

=O\left(

	3ⁿ
	n

\right).

This bound was improved to

O(3^n/n¹)

in 2012 by Bateman and Katz.^[26]

In 2016, Ernie Croot, Vsevolod Lev, Péter Pál Pach, Jordan Ellenberg and Dion Gijswijt developed a new technique based on the polynomial method to prove that

r_3(F

	n)

	3

=O(2.756ⁿ⁾

.^[27] ^[28] ^[29]

The best known lower bound is

2.2202ⁿ

, discovered in December 2023 by Google DeepMind researchers using a large language model (LLM).^[30]

Roth's theorem with popular differences

Another generalization of Roth's theorem shows that for positive density subsets, there not only exists a 3-term arithmetic progression, but that there exist many 3-APs all with the same common difference.

Roth's theorem with popular differences: For all

\epsilon>0

, there exists some
n₀=n_0(\epsilon)

such that for every
n>n₀

and
A\subset

n
F
3

with
|A|=\alpha3^n,

there exists some
y ≠ 0

such that
|\{x:x,x+y,x+2y\inA\}|\ge(\alpha³-\epsilon)3^n.

is chosen randomly from

	n,
F
	3

then we would expect there to be

\alpha³³ⁿ

progressions for each value of

. The popular differences theorem thus states that for each

|A|

with positive density, there is some

such that the number of 3-APs with common difference

is close to what we would expect.

This theorem was first proven by Green in 2005,^[31] who gave a bound of

n₀=tow((1/\epsilon)^O(1)),

where

tow

is the tower function. In 2019, Fox and Pham recently improved the bound to

n₀=tow(O(log

	1
	\epsilon

)).

^[32]

A corresponding statement is also true in

for both 3-APs and 4-APs.^[33] However, the claim has been shown to be false for 5-APs.^[34]

External links

Edmonds, Chelsea; Koutsoukou-Argyraki, Angeliki; Paulson, Lawrence C. Roth's Theorem on Arithmetic Progressions (Formal proof development in Isabelle/HOL, Archive of Formal Proofs)

Notes and References

Roth . Klaus . On certain sets of integers . Journal of the London Mathematical Society . 1953 . 28 . 1 . 104–109 . 10.1112/jlms/s1-28.1.104.
van der Waerden . B. L. . Beweis einer Baudetschen Vermutung . Nieuw. Arch. Wisk. . 1927 . 15 . 212–216.
Salem . Raphaël . Spencer . Donald C. . On sets of integers which contain no three terms in arithmetical progression . . 1942 . 28 . 12 . 561–563 . 10.1073/pnas.28.12.561 . free . 16588588 . 1078539 . 0007405 . 1942PNAS...28..561S .
Erdös . Paul . Turán . Paul . On Some Sequences of Integers . Journal of the London Mathematical Society . 1936 . 4 . 4 . 261–264 . 1574918 . 10.1112/jlms/s1-11.4.261 .
Gowers . W. T.. A new proof of Szemerédi's theorem for arithmetic progressions of length four . . 1998 . 8 . 3 . 529–551 . 10.1007/s000390050065 . free.
Furstenberg . Hillel . Hillel Furstenberg . Katznelson . Yitzhak . Yitzhak Katznelson . An ergodic Szemerédi theorem for commuting transformations . . 10.1007/BF02790016 . free . 531279 . 38 . 1978 . 275–291 . 1. 123386017 .
Vitaly . Bergelson . Vitaly Bergelson . Alexander . Leibman . Polynomial extensions of van der Waerden's and Szemerédi's theorems . . 9 . 3 . 1996 . 725–753 . 1325795 . 10.1090/S0894-0347-96-00194-4. free .
David . Conlon . David Conlon . Jacob . Fox . Jacob Fox . Yufei . Zhao . A relative Szemerédi theorem . 1305.5440 . 2015 . 3361771 . 25 . 3 . 733–762 . 10.1007/s00039-015-0324-9 . free . Geometric and Functional Analysis.
Yufei . Zhao . An arithmetic transference proof of a relative Szemerédi theorem . . 156 . 2014 . 255–261 . 2 . 3177868 . 10.1017/S0305004113000662. 1307.4959 . 2014MPCPS.156..255Z . 119673319 .
Thomas F. Bloom, Olof Sisask, Breaking the logarithmic barrier in Roth's theorem on arithmetic progressions, arXiv:2007.03528, 2020
Endre. Szemerédi. Integer sets containing no arithmetic progressions. Acta Mathematica Hungarica. 1990. 56. 1–2. 10.1007/BF01903717. free. 1100788. 155–158.
D. R. Heath-Brown. Roger. Heath-Brown. Integer sets containing no arithmetic progressions. 35. Journal of the London Mathematical Society. 3. 1987. 889362. 385–394. 10.1112/jlms/s2-35.3.385.
Jean Bourgain. Jean. Bourgain. On triples in arithmetic progression. Geometric and Functional Analysis. 9. 5. 1999. 968–984. 1726234. 10.1007/s000390050105. 392820 .
Jean Bourgain. Jean. Bourgain. Roth's theorem on progressions revisited. 104. 2008. 155–192. Journal d'Analyse Mathématique. 2403433. 10.1007/s11854-008-0020-x. free. 1. 16985451 .
Tom Sanders (mathematician). Tom. Sanders. On certain other sets of integers. Annals of Mathematics. 185. 1. 2012. 53–82. 2892617. 10.1007/s11854-012-0003-9. 1007.5444. 119727492 .
Tom Sanders (mathematician). Tom. Sanders. On Roth's theorem on progressions. Annals of Mathematics. 174. 1. 2011. 619–636. 2811612. 10.4007/annals.2011.174.1.20. 1011.0104. 53331882 .
2302.05537 . math.NT . Zander . Kelley . Raghu . Meka . Strong Bounds for 3-Progressions . 2023-02-10.
Web site: Sloman . Leila . 2023-03-21 . Surprise Computer Science Proof Stuns Mathematicians . Quanta Magazine . en.
Kelley . Zander . Meka . Raghu . 2023-11-06 . Strong Bounds for 3-Progressions . 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS)-Conference Proceedings . IEEE . 933–973 . 10.1109/FOCS57990.2023.00059 . 979-8-3503-1894-4. 2302.05537 .
2302.07211 . math.NT . Thomas F. . Bloom . Olof . Sisask . The Kelley--Meka bounds for sets free of three-term arithmetic progressions . 2023-02-14.
Bloom . Thomas F. . Sisask . Olof . 2023-12-31 . The Kelley–Meka bounds for sets free of three-term arithmetic progressions . Essential Number Theory . en . 2 . 1 . 15–44 . 10.2140/ent.2023.2.15 . 2834-4634. 2302.07211 .
Bloom . Thomas F. . Sisask . Olof . 2023-09-05 . An improvement to the Kelley-Meka bounds on three-term arithmetic progressions . math.NT . 2309.02353 .
Behrend . F. A. . On sets of integers which contain no three terms in arithmetical progression . Proceedings of the National Academy of Sciences of the United States of America . 1946 . 32 . 12 . 331–332 . 10.1073/pnas.32.12.331 . free. 16578230 . 1078964 . 1946PNAS...32..331B .
Brown . T. C. . Tom Brown (mathematician) . Buhler . J. P. . A density version of a geometric Ramsey theorem . . Series A . 1982 . 32 . 1 . 20–34. 10.1016/0097-3165(82)90062-0 . free .
Mesuhlam . Roy . On subsets of finite abelian groups with no 3-term arithmetic progressions . . Series A . 71 . 1 . 168–172 . 10.1016/0097-3165(95)90024-1. 1995 . free .
Bateman . M. . Katz . N. . New bounds on cap sets . Journal of the American Mathematical Society . 2012 . 25 . 2 . 585–613. 10.1090/S0894-0347-2011-00725-X . free . 2022/19057 . free .
Ellenberg . Jordan S. . Gijswijt . Dion . On large subsets of
n
F
q

with no three-term arithmetic progression . Annals of Mathematics, Second Series . 185 . 1 . 339–343 . 10.4007/annals.2017.185.1.8 . 1605.09223 . 2016 . 119683140 .
Croot . Ernie . Lev . Vsevolod F. . Pach . Péter Pál . Progression-free sets in
n
Z
4

are exponentially small . 1605.01506 . . 2nd series . 331–337 . 185 . 2017 . 1 . 10.4007/annals.2017.185.1.7.
Web site: Klarreich . Erica . Simple Set Game Proof Stuns Mathematicians . Quanta . May 31, 2016.
Romera-Paredes . Bernardino . Barekatain . Mohammadamin . Novikov . Alexander . Balog . Matej . Kumar . M. Pawan . Dupont . Emilien . Ruiz . Francisco J. R. . Ellenberg . Jordan S. . Wang . Pengming . Fawzi . Omar . Kohli . Pushmeet . Fawzi . Alhussein . 2023-12-14 . Mathematical discoveries from program search with large language models . Nature . en . 1–3 . 10.1038/s41586-023-06924-6 . 1476-4687. free . 10794145 .
Green . Ben . A Szemerédi-type regularity lemma in abelian groups, with applications . . 2005 . 15 . 2 . 340–376 . 2153903 . 10.1007/s00039-005-0509-8 . free.
Fox . Jacob . Jacob Fox . Pham . Huy Tuan . Popular progression differences in vector spaces . International Mathematics Research Notices . 2021 . 7 . April 2021 . 5261–5289 . 10.1093/imrn/rny240 . free . 1708.08482 . 2017arXiv170808482F.
Book: Green . Ben . An Irregular Mind . Tao . Terrence . 2010 . 21 . 261–334 . Bolyai Society Mathematical Studies. 10.1007/978-3-642-14444-8_7 . 2010arXiv1002.2028G . 1002.2028 . Bolyai Society Mathematical Studies . 978-3-642-14443-1 . An Arithmetic Regularity Lemma, an Associated Counting Lemma, and Applications . 115174575 .
Bergelson . Vitaly . Host . Bernard . Kra . Bryna . Multiple recurrence and nilsequences. With an appendix by Imre Ruzsa . Inventiones Mathematicae . 2005 . 160 . 2 . 261–303. 10.1007/s00222-004-0428-6 . 1380361 .