Multiscale Analysis of The Primes: Terence Tao (UCLA)
Multiscale Analysis of The Primes: Terence Tao (UCLA)
Multiscale Analysis of The Primes: Terence Tao (UCLA)
www.math.ucla.edu/∼tao/preprints
1
Patterns in the primes
2
More specific examples of Q2:
3
Prime number theorem
4
∑∞
• Define the Riemann zeta function ζ(s) := 1
n=1 ns ,
thus ∞
∑ Λ(n)
′
−ζ (s) = ζ(s) s
.
n=1
n
5
A probabilistic model of the primes
6
be that
E(Λ(n)Λ(n+2)|n ∈ P1) ≈ E(E(Λ)(n)E(Λ)(n+2)|n ∈ P1)
and
E(Λ(n) . . . Λ(n + (k − 1)r)|n ∈ P1, r ∈ [1, N/2])
≈ E(E(Λ)(n) . . . E(Λ)(n+(k−1)r)|n ∈ P1, r ∈ [1, N/2]).
• Unfortunately, this simple model is too naive. For
instance, it would also predict that there are roughly
N/2 log2 N pairs of the form p, p + 1 in P , which is
clearly wrong!
• The problem is that we have to incorporate more
Q1 type information than the prime number theo-
rem, which only gives density bounds on P1. In this
particular case, the additional structure we need to
incorporate is
All elements of P are odd.
• To exploit this, let P2 denote those elements of P1
which are odd. Then |P2| ≈ |P1|/2, and so P has
density roughly 2/ log N inside P2. Thus we could
conjecture a new probabilistic model for the primes,
P behaves like a random subset of P2 of density 2/ log N .
7
• A fancier way of stating this model is the follow-
ing. Let B2 be the σ-algebra of P1 generated by the
residue classes modulo 2. Then the above model as-
serts that Λ can be approximated in a weak sense,
not by the expectation E(Λ), which is roughly 1, but
by the conditional expectation, E(Λ|B2), which is
roughly 2 on the odd numbers and 0 on the even
numbers.
• This model now gives the correct count for pairs
p, p+1 in P , and gives a revised prediction of N/ log2 N
for the number of pairs p, p+2 in P , and similarly for
other Q2 type problems. However, it gives incorrect
counts for other problems, such as counting triples of
the form p, p + 2, p + 4 in P (it predicts 2N/ log3 N
such triples, when in fact there are 0). This is be-
cause we have not yet incorporated further Q1 type
information about the primes, namely
All elements of P are equal to 1 or 2 mod 3.
• In fact, the elements of P are equally divided into
the two residue classes. This is a special case of
• Dirichlet-Siegel-Walfisz theorem The density
of P in the residue class {n ∈ P1 : n = a mod k}
is equal to 0 if a is not coprime to k, and equal to
8
(1+o(1))k
when a is coprime to k, if N is large enough
ϕ(k) log N
depending on k. Here ϕ(k) is the number of residue
classes mod k that are coprime to k.
• This leads to another model for the primes. Let P3
be those elements of P2 which are coprime to 3; then
P3 has density 1/3 in P1, then we could conjecture
P behaves like a random subset of P3 of density 3/ log N .
• Equivalently, we can define B3 to be the σ-algebra
formed by adding the residue classes modulo 3 to
B∈, and we then model Λ by E(Λ|B3) (i.e. the func-
tion which equals 3 when n = 1, 5 mod 6 and zero
otherwise).
9
Hardy-Littlewood prime tuple conjecture
10
stant
∏ 1
B2 := ω(p)/(1 − )2 = 1.88032 . . . ,
p
p prime
where ω(p) is the probability that a random pair
n, n + 2 in Z/pZ are both coprime to p.
• The number of progressions of length k in P would
be (1 + o(1))Ck N/8(k − 1) logk N , where
∏ 1
Ck := γk (p)/(1 − )k ,
p
p prime
12
Vinogradov’s method
13
and applying Möbius inversion to obtain the for-
mula ∑ n
Λ(n) = µ(d)Λ( )
d
d|n
14
i.e. the magnitudes of Fourier coefficients are con-
sistent with the Hardy-Littlewood conjecture. This
allows us in turn to solve Q2 type questions which
only require magnitude control on the Fourier coef-
ficients. For instance, the number of ways to write
M as the sum of three primes in P can be written
in terms of the Fourier coefficients as
∫ 2π
1
e−2πiM θ f (θ)3 dθ,
2π 0
and this can be computed using the above techniques
to prove Vinogradov’s theorem (every large odd num-
ber is the sum of three primes). It is crucial here that
the exponent is 3, which is larger than the Plancherel
exponent of 2; this means that large Fourier coef-
ficients dominate, and one can estimate the small
Fourier coefficients in absolute values. A similar ar-
gument gives van der Corput’s theorem (infinitely
many prime arithmetic progressions of length 3).
• Unfortunately, many Q2 problems are known to de-
pend on the phase of Fourier coefficients, as well as
their magnitude. A typical example is the problem
of locating progressions of length 4. Let ε be a small
number and consider two subsets A and B of P1.
A will be a randomly chosen set of cardinality εN .
15
B will be the set of all numbers n such that αn2
is within ε/2 to an integer,
√ where α is a fixed ir-
rational (e.g. α := 2). Both sets have roughly
the same size, and roughly the same Fourier coeffi-
cients. However, B has significantly more progres-
sions n, n + r, n + 2r, n + 3r of length 4 than A (it
has roughly ε3n2, while A has roughly ε4n2). The
reason is because of the quadratic identity
αn2 − 3α(n + r)2 + 3α(n + 2r)2 − α(n + 3r)2 = 0
which means that if three elements of a progression lie
in B, the fourth is very likely to also. This quadratic
correlation is not easily detected using Fourier meth-
ods (although Tim Gowers has managed to do so by
developing the foundations of a “quadratic Fourier
analysis”; we will not discuss this fascinating topic
here).
Density-based methods
16
small (e.g. a small power of N ), but as for how P
sits inside PR , we only have a conjecture, and some
Fourier-type information. This blocks us from solv-
ing a number of Q2 type problems.
• However, for certain special types of Q2 problems it is
possible to establish patterns in P without knowing
the specific way in which P lies inside PR - instead,
relying purely on the density of P inside PR . The
prototype for such a result is
• Szemerédi’s theorem Let A be any subset of P1
with density δ > 0. Then, if N is large enough de-
pending on δ and k, then A must contain arithmetic
progressions of length k (in fact, it contains at least
c(k, δ)N 2 of them).
• The intuition behind this theorem is the following.
If A is randomly distributed, then it is easy to see
why A should have lots of progressions. Conversely,
if A is very structured (e.g. periodic), then it is also
easy to see why A should have progressions (whereas
it need not have any twins, for instance). The (very)
hard part of the proof is to show that every set A
can be somehow “split” into a random part and a
structured part.
17
• While Szemerédi’s theorem is very deep, its reach
has been limited by the fact that N must be large
depending on the density δ. Despite much progress
on this issue, particularly by Gowers, the best bound
known for N for large k is
N ≥ 2 ↑ 2 ↑ 1/δ ↑ 2 ↑ 2 ↑ k + 9,
where ↑ denotes exponentiation. Since the primes
P have density 1/ log N , this bound is nowhere near
good enough to obtain long arithmetic progressions
in the primes. (There is a conjectured bound of 2ck /δ ,
essentially due to Erdös and Turan, which would give
progressions in the primes, but this conjecture is still
well out of reach.)
• Szemerédi’s theorem asserts, roughly speaking, that
dense subsets of integers contain long arithmetic pro-
gressions. The reach of this theorem has recently
been significantly extended to assert that further-
more, dense subsets of sufficiently pseudorandom
sets of integers also contain long arithmetic progres-
sions. The definition of “sufficiently pseudorandom”
is technical, but let us give a principal consequence:
• Theorem [Green,T.] Let A be any subset of PR
k
with density δ > 0, where R := N 1/(16k×2 ). Then,
18
if N is large enoguh depending on δ and k, then A
must contain arithmetic progressions of length k (in
fact, it contains at least c(k, δ)N 2/ logk N of them).
• Since P is itself a dense subset of PR (with density
roughly 1/(16k × 2k )), we obtain as a corollary that
the primes contain arbitrarily long arithmetic pro-
gressions. The key here is that while the density of
P in P1 depends on N , the density of P in PR does
not.
• A technical note: the almost primes PR are not them-
selves pseudorandom, for instance they are all odd.
However, they are pseudorandomly distributed in-
side PW for some very slowly growing W (e.g. W =
log log N ), which suffices to conclude the above the-
orem.
• The above theorem “transfers” Szemerédi’s theorem
(which it uses as a “black box”) from dense sub-
sets of integers to dense subsets of pseudorandom
sets. There are other theorems of Szemerédi type for
dense subsets of integers that are known, and it is
likely that those can now also be transferred to the
pseudorandom setting. For instance, it seems fea-
sible to establish infinitely many quadruples of the
19
form p, p + n, p + n2, p + n3 in the primes. However,
problems such as the twin prime problem still remain
well out of reach, since even quite dense subsets of
the integers need not have any twins.
20
Szemerédi’s theorem
• Szemerédi’s theorem has four quite different, quite
difficult, and quite important proofs: the original
graph theory-combinatorial proof of Szemerédi, the
ergodic theory proof of Furstenberg and later au-
thors, the Fourier-combinatorial proof of Roth (in
k = 3) and Gowers (in k = 4 and k > 4), and
the hypergraph-combinatorial proof of Frankl-Rodl
and Gowers. The connections between these proofs
are only just now being understood; the proof of the
transference theorem required ingredients (and inspi-
ration) from all four proofs.
• All proofs revolve around the dichotomy between
randomness and structure, and around some no-
tion of relative randomness and relative structure.
Schematically, all the proofs look like the following.
• Step 0. Start with no structure identified on A.
• Step 1. Is A pseudorandom relative to the existing
structure? If so, we model A by a random subset
of the existing structure, and then locate arithmetic
progressions in that structure.
• Step 2. If A is not pseudorandom, then there must
be an additional structure which correlates with A.
21
Add this to the list of identified structures and return
to Step 1.
• The various proofs of Szemerédi’s theorem then dif-
fer on the details of implementing this scheme, for
instance in defining what structure or pseudoran-
dom is (or even what “A” is), how to justify the
random model, how to find arithmetic progressions
in structures, how to show that non-randomness im-
plies correlation with structure, and how to ensure
the algorithm terminates.
• This type of proof should be contrasted with the
models of the primes built up earlier. There, we
identified in advance explicit structures (in this case,
small residue classes) to model the primes. In the
proofs of
22
Arbitrarily long progressions in P .
25
• Sketch of proof Start with B being the trivial σ-
algebra. If Λ − E(Λ|B) is already Gowers-uniform,
then we are done. (Note: this is essentially what is
claimed by the Hardy-Littlewood prime tuple conjec-
ture). If not, use dual functions to locate a σ-algebra
which correlates with Λ − E(Λ|B), and add it to B
(this will increase the L2 norm or energy of E(Λ|B)).
Repeat this process, extracting more and more struc-
ture from Λ, until what is left is Gowers-uniform; the
energy of the bounded function E(Λ|B) is bounded
from above, so this procedure has to terminate in a
bounded number of steps.
• To conclude the proof of long arithmetic progressions
in the primes, one now simply discards the Gowers
uniform error Λ−E(Λ|B), thus replacing Λ by a “ho-
mogenized” variant E(Λ|B). This variant is bounded
and now one can use the ordinary Szemerédi theorem
to obtain arbitrarily long progressions.
• In other words, the Structure theorem tells us that
while the Hardy-Littlewood prime tuple conjecture
could be false (so we don’t necessarily know that P
behaves like a random subset of PR ), we do know that
P behaves like a random subset of something reason-
ably dense - and since Szemerédi’s theorem tells us
26
that any dense set contains arithmetic progressions,
so must this random subset.
27