mit18_225_f23_lec02-05
mit18_225_f23_lec02-05
mit18_225_f23_lec02-05
Forbidding a Subgraph
Chapter Highlights
• Turán problem: determine the maximum number of edges in an 𝑛-vertex 𝐻-free graph
• Mantel’s and Turán’s theorems: 𝐾𝑟 -free
• Kővári–Sós–Turán theorem: 𝐾 𝑠,𝑡 -free
• Erdős–Stone–Simonovits theorem: 𝐻-free for general 𝐻
• Dependent random choice technique: 𝐻-free for a bounded degree bipartite 𝐻
• Lower bound constructions of 𝐻-free graphs for bipartite 𝐻
• Algebraic constructions: matching lower bounds for 𝐾2,2 , 𝐾3,3 , and 𝐾 𝑠,𝑡 for 𝑡 much larger
than 𝑠, and also for 𝐶4 , 𝐶6 , 𝐶10
• Randomized algebraic constructions
We will see the answer shortly. More generally, we can ask about what happens if we
replace “triangle” by an arbitrary subgraph. This is a foundational problem in extremal graph
theory.
The Turán problem is one of the most basic problems in extremal graph theory. It is named
after Pál Turán for his fundamental work on the subject. Research on this problem has led to
many important techniques. We will see a fairly satisfactory answer to the Turán problem for
nonbipartite graphs 𝐻. We also know the answer for a small number of bipartite graphs 𝐻.
However, for nearly all bipartite graphs 𝐻, much mystery remains.
In the first part of the chapter, we focus on techniques for upper bounding ex(𝑛, 𝐻). In
11
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
12 Forbidding a Subgraph
the last few sections, we turn our attention to lower bounding ex(𝑛, 𝐻) when 𝐻 is a bipartite
graph.
𝐾4,4 = .
The graph 𝐾 ⌊𝑛/2⌋,⌈𝑛/2⌉ has ⌊𝑛/2⌋ ⌈𝑛/2⌉ = ⌊𝑛2 /4⌋ edges (one can check this equality by
separately considering even and odd 𝑛).
Mantel (1907) proved that 𝐾 ⌊𝑛/2⌋,⌈𝑛/2⌉ has the greatest number of edges among all triangle-
free graphs.
𝑁 (𝑥) 𝑁 (𝑦)
On the other hand, note that for each vertex 𝑥, the term deg 𝑥 appears once in the preceding
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
sum for each edge incident to 𝑥, and so it appears a total of deg 𝑥 times. We then apply the
Cauchy–Schwarz inequality to get
!2
∑︁ ∑︁ 1 ∑︁ (2𝑚) 2
(deg 𝑥 + deg 𝑦) = (deg 𝑥) 2 ≥ deg 𝑥 = .
𝑥 𝑦 ∈𝐸 𝑥 ∈𝑉
𝑛 𝑥 ∈𝑉 𝑛
Comparing the two inequalities, we obtain (2𝑚) 2 /𝑛 ≤ 𝑚𝑛, and hence 𝑚 ≤ 𝑛2 /4. Since 𝑚 is
an integer, we obtain 𝑚 ≤ ⌊𝑛2 /4⌋, as claimed. □
Second proof of Mantel’s theorem. Let 𝐺 = (𝑉, 𝐸) be a triangle-free graph. Let 𝑣 be a
vertex of maximum degree in 𝐺. Since 𝐺 is triangle-free, the neighborhood 𝑁 (𝑣) of 𝑣 is an
independent set.
𝐴 = 𝑁 (𝑣) 𝐵
(independent
set)
14 Forbidding a Subgraph
Exercise 1.1.3. Let 𝑋 and 𝑌 be independent and identically distributed random vectors
in R𝑑 according to some arbitrary probability distribution. Prove that
1
P(|𝑋 + 𝑌 | ≥ 1) ≥ P(|𝑋 | ≥ 1) 2 .
2
Hint: Consider i.i.d. 𝑋1 , . . . , 𝑋𝑛 , and then let 𝑛 → ∞.
The next several exercises explore extensions of Mantel’s theorem. It is useful to revisit
the proof techniques.
Exercise 1.1.4 (Many triangles). Show that a graph with 𝑛 vertices and 𝑚 edges has at
least
4𝑚 𝑛2
𝑚− triangles.
3𝑛 4
Exercise 1.1.5. Prove that every 𝑛-vertex nonbipartite triangle-free graph has at most
(𝑛 − 1) 2 /4 + 1 edges.
Exercise 1.1.6 (Stability). Let 𝐺 be an 𝑛-vertex triangle-free graph with at least ⌊𝑛2 /4⌋ − 𝑘
edges. Prove that 𝐺 can be made bipartite by removing at most 𝑘 edges.
Exercise 1.1.7. Show that every 𝑛-vertex triangle-free graph with minimum degree
greater than 2𝑛/5 is bipartite.
Exercise 1.1.8∗. Prove that every 𝑛-vertex graph with at least ⌊𝑛2 /4⌋ + 1 edges contains
at least ⌊𝑛/2⌋ triangles.
Exercise 1.1.9∗. Let 𝐺 be an 𝑛-vertex graph with ⌊𝑛2 /4⌋ − 𝑘 edges (here 𝑘 ∈ Z) and 𝑡
triangles. Prove that 𝐺 can be made bipartite by removing at most 𝑘 + 6𝑡/𝑛 edges, and that
this constant 6 is the best possible.
Exercise 1.1.10∗. Prove that every 𝑛-vertex graph with at least ⌊𝑛2 /4⌋ + 1 edges contains
some edge in at least (1/6 − 𝑜(1))𝑛 triangles, and that this constant 1/6 is the best possible.
Even when 𝑛 is not divisible by 𝑟, the difference between 𝑒(𝑇𝑛,𝑟 ) and (1 − 1/𝑟)𝑛2 /2 is
𝑂 (𝑛𝑟). As we are generally interested in the regime when 𝑟 is fixed, this difference is a
negligible lower-order contribution. That is,
2
1 𝑛
ex(𝑛, 𝐾𝑟+1 ) = 1 − − 𝑜(1) , for fixed 𝑟 as 𝑛 → ∞.
𝑟 2
Every 𝑟-partite graph is automatically 𝐾𝑟+1 -free. Let us first consider an easy special case
of the problem.
Proof. Suppose we have an 𝑛-vertex 𝑟-partite graph with the maximum possible number of
edges. It should be a complete 𝑟-partite graph. If there were two vertex parts 𝐴 and 𝐵 with
| 𝐴| + 2 ≤ |𝐵|, then moving a vertex from 𝐵 (the larger part) to 𝐴 (the smaller part) would
increase the number of edges by (| 𝐴| + 1) (|𝐵| − 1) − | 𝐴| |𝐵| = |𝐵| − | 𝐴| − 1 > 0. Thus all
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
16 Forbidding a Subgraph
the vertex parts must have sizes within one of each other. The Turán graph 𝑇𝑛,𝑟 is the unique
such graph. □
We will see three proofs of Turán’s theorem. The first proof extends our second proof of
Mantel’s theorem.
First proof of Turán’s theorem. We prove by induction on 𝑟. The case 𝑟 = 1 is trivial, as a
𝐾2 -free graph is empty. Now assume 𝑟 > 1 and that ex(𝑛, 𝐾𝑟 ) = 𝑒(𝑇𝑛,𝑟 −1 ) for every 𝑛.
Let 𝐺 = (𝑉, 𝐸) be a 𝐾𝑟+1 -free graph. Let 𝑣 be a vertex of maximum degree in 𝐺. Since 𝐺
is 𝐾𝑟+1 -free, the neighborhood 𝐴 = 𝑁 (𝑣) of 𝑣 is 𝐾𝑟 -free. So, by the induction hypothesis,
𝑒( 𝐴) ≤ ex(| 𝐴| , 𝐾𝑟 ) = 𝑒(𝑇| 𝐴|,𝑟 −1 ).
𝐴 𝐵
(𝐾𝑟 -free)
Thus
𝑒(𝐺) = 𝑒( 𝐴) + 𝑒( 𝐴, 𝐵) + 𝑒(𝐵) ≤ 𝑒(𝑇| 𝐴|,𝑟 −1 ) + | 𝐴| |𝐵| ≤ 𝑒(𝑇𝑛,𝑟 ),
where the final step follows from the observation that 𝑒(𝑇| 𝐴|,𝑟 −1 ) + | 𝐴| |𝐵| is the number of
edges in an 𝑛-vertex 𝑟-partite graph (with part of size |𝐵| and the remaining vertices equitably
partitioned into 𝑟 − 1 parts) and Lemma 1.2.7.
To have equality in each of the preceding steps, 𝐵 must be an independent set (or else
Í
𝑦 ∈ 𝐵 deg(𝑦) < | 𝐴| |𝐵|) and 𝐴 must induce 𝑇| 𝐴|,𝑟 −1 , so that 𝐺 is 𝑟-partite. We knew from
Lemma 1.2.7 that the Turán graph 𝑇𝑛,𝑟 uniquely maximizes the number of edges among
𝑟-partite graphs. □
The second proof starts out similarly to our first proof of Mantel’s theorem. Recall that in
Mantel’s theorem, the initial observation was that in a triangle-free graph, given an edge, its
two endpoints must have no common neighbors (or else they form a triangle). Generalizing,
in a 𝐾4 -free graph, given a triangle, its three vertices have no common neighbor. The rest of
the proof proceeds somewhat differently from earlier. Instead of summing over all edges as
we did before, we remove the triangle and apply induction to the rest of the graph.
Second proof of Turán’s theorem. We fix 𝑟 and proceed by induction on 𝑛. The statement
is trivial for 𝑛 ≤ 𝑟, as the Turán graph is the complete graph 𝐾𝑛 = 𝑇𝑛,𝑟 and thus maximizes
the number of edges.
Now, assume that 𝑛 > 𝑟 and that Turán’s theorem holds for all graphs on fewer than 𝑛
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
vertices. Let 𝐺 = (𝑉, 𝐸) be an 𝑛-vertex 𝐾𝑟+1 -free graph with the maximum possible number
of edges. By the maximality assumption, 𝐺 contains 𝐾𝑟 as a subgraph, since otherwise we
could add an edge to 𝐺 and it would still be 𝐾𝑟+1 -free. Let 𝐴 be the vertex set of an 𝑟-clique
in 𝐺, and let 𝐵 B 𝑉 \ 𝐴.
𝐴
(𝑟 = 3)
We have
𝑒(𝐺) = 𝑒( 𝐴) + 𝑒( 𝐴, 𝐵) + 𝑒(𝐵)
𝑟
≤ + (𝑟 − 1) (𝑛 − 𝑟) + 𝑒(𝑇𝑛−𝑟 ,𝑟 ) = 𝑒(𝑇𝑛,𝑟 ),
2
where the inequality uses the induction hypothesis on 𝐺 [𝐵], which is 𝐾𝑟+1 -free, and the final
equality can be seen by removing a 𝐾𝑟 from 𝑇𝑛,𝑟 .
Finally, let us check when equality occurs. To have equality in each of thepreceding steps,
the subgraph induced on 𝐵 must be 𝑇𝑛−𝑟 ,𝑟 by induction. To have 𝑒( 𝐴) = 𝑟2 , 𝐴 must induce
a clique. To have 𝑒( 𝐴, 𝐵) = (𝑟 − 1) (𝑛 − 𝑟), every vertex of 𝐵 must be adjacent to all but one
vertex in 𝐴. Also, two vertices 𝑥, 𝑦 lying in distinct parts of 𝐺 [𝐵] 𝑇𝑛−𝑟 ,𝑟 cannot “miss” the
same vertex 𝑣 of 𝐴, or else 𝐴 ∪ {𝑥, 𝑦} \ {𝑣} would be an 𝐾𝑟+1 -clique. This then forces 𝐺 to
be 𝑇𝑛,𝑟 . □
The third proof uses a method known as Zykov symmetrization. The idea here is that
if a 𝐾𝑟+1 -free graph that is a not a Turán graph, then we should be able make some local
modifications (namely replacing a vertex by a clone of another vertex) to get another 𝐾𝑟+1 -free
with strictly more edges.
Third proof of Turán’s theorem. As before, let 𝐺 be an 𝑛-vertex 𝐾𝑟+1 -free graph with the
maximum possible number of edges.
We claim that if 𝑥 and 𝑦 are nonadjacent vertices, then deg 𝑥 = deg 𝑦. Indeed, suppose
deg 𝑥 > deg 𝑦. We can modify 𝐺 by removing 𝑦 and adding in a clone of 𝑥 (a new vertex 𝑥 ′
with the same neighborhood as 𝑥 but not adjacent to 𝑥), as illustrated below.
𝑥 𝑦 𝑥 𝑥′
−→
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
18 Forbidding a Subgraph
The resulting graph would still be 𝐾𝑟+1 -free (since a clique cannot contain both 𝑥 and its
clone) and has strictly more edges than 𝐺, thereby contradicting the assumption that 𝐺 has
the maximum possible number of edges.
Suppose 𝑥 is nonadjacent to both 𝑦 and 𝑧 in 𝐺. We claim that 𝑦 and 𝑧 must be nonadjacent.
We just saw that deg 𝑥 = deg 𝑦 = deg 𝑧. If 𝑦𝑧 is an edge, then by deleting 𝑦 and 𝑧 from 𝐺 and
adding two clones of 𝑥, we obtain a 𝐾𝑟+1 -free graph with one more edge than 𝐺. This would
contradict the maximality of 𝐺.
𝑥 𝑥
−→
𝑦 𝑧 𝑥′ 𝑥 ′′
Exercise 1.2.8. Let 𝐺 be a 𝐾𝑟+1 -free graph. Prove that there exists an 𝑟-partite graph 𝐻
on the same vertex set as 𝐺 such that deg𝐻 (𝑥) ≥ deg𝐺 (𝑥) for every vertex 𝑥 (here deg𝐻 (𝑥)
is the degree of 𝑥 in 𝐻, and likewise with deg𝐺 (𝑥) for 𝐺). Give another proof of Turán’s
theorem from this fact.
The following exercise is an extension of Exercise 1.1.6.
Exercise 1.2.9∗ (Stability). Let 𝐺 be an 𝑛-vertex 𝐾𝑟+1 -free graph with at least 𝑒(𝑇𝑛,𝑟 ) − 𝑘
edges, where 𝑇𝑛,𝑟 is the Turán graph. Prove that 𝐺 can be made 𝑟-partite by removing at
most 𝑘 edges.
The next exercise is a neat geometric application of Turán’s theorem.
Exercise 1.2.10. Let 𝑆 be a set of 𝑛 points in the plane, with the property that no two
points are at distance greater
√ than 1. Show that 𝑆 has at most ⌊𝑛2 /3⌋ pairs of points at
distance greater than 1/ 2. Also, show that the bound ⌊𝑛2 /3⌋ is tight (i.e., cannot be
improved).
Turán Density
In this chapter, we will define the edge density of a graph 𝐺 to be
𝑣(𝐺)
𝑒(𝐺) .
2
So the edge density of a clique is 1. Later in the book, we will consider a different normal-
ization 2𝑒(𝐺)/𝑣(𝐺) 2 for edge density, which is more convenient for other purposes. When
𝑣(𝐺) is large, there is no significant difference between the two choices.
Next, we use an averaging/sampling argument to show that ex(𝑛, 𝐻)/ 𝑛2 is nonincreasing
in 𝑛.
Proposition 1.3.1 (Monotonicity of Turán numbers)
For every graph 𝐻 and positive integer 𝑛,
ex(𝑛 + 1, 𝐻) ex(𝑛, 𝐻)
𝑛+1
≤ 𝑛 .
2 2
Proof. Let 𝐺 be an 𝐻-free graph on 𝑛 + 1 vertices. For each 𝑛-vertex subset 𝑆 of 𝑉 (𝐺), since
𝐺 [𝑆] is also 𝐻-free, we have
𝑒(𝐺 [𝑆]) ex(𝑛, 𝐻)
𝑛 ≤ 𝑛 .
2 2
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
20 Forbidding a Subgraph
Varying 𝑆 uniformly over all 𝑛-vertex subsets of 𝑉 (𝐺), the left-hand side averages to the
edge density of 𝐺 by linearity of expectations. (Check this.) It follows that
𝑒(𝐺) ex(𝑛, 𝐻)
𝑛+1
≤ 𝑛 .
2 2
The exact value of 𝜋(𝐻) is known in very few cases. It is a major open problem to determine
𝜋(𝐻) when 𝐻 is the complete 3-uniform hypergraph on four vertices (also known as a
tetrahedron), and more generally when 𝐻 is a complete hypergraph.
Supersaturation
We know from Mantel’s theorem that any 𝑛-vertex graph 𝐺 with > 𝑛2 /4 edges must contain
a triangle. What if 𝐺 has a lot more edges? It turns out that 𝐺 must have a lot of triangles.
In particular, an 𝑛-vertex graph with > (1/4 + 𝜀)𝑛2 edges must have at least 𝛿𝑛3 triangles
for some constant 𝛿 > 0 depending on 𝜀 > 0. This is indeed a lot of triangles, since there
are could only be at most 𝑂 (𝑛3 ) triangles no matter what. (Exercise 1.1.4 asks you to give a
more precise quantitative lower bound on the number of triangles. The optimal dependence
of 𝛿 on 𝜀 is a difficult problem that we will discuss in Chapter 5.)
It turns out there is a general phenomenon in combinatorics where once some density
crosses an existence threshold (e.g., the Turán density is the threshold for 𝐻-freeness), it
will be possible to find not just one copy of the desired object, but in fact lots and lots of
copies. This fundamental principle, called supersaturation, is useful for many applications,
including in our upcoming determination of 𝜋(𝐻) for general 𝐻.
Equivalently: every 𝑛-vertex graph with 𝑜(𝑛𝑣 (𝐻 ) ) copies of 𝐻 has edge density ≤ 𝜋(𝐻) +
𝑜(1) (here 𝐻 is fixed). The sampling argument in the following proof is useful in many
applications.
Proof. By the definition of the Turán density, there exists some 𝑛0 (depending on 𝐻 and
𝜀) such that every 𝑛0 -vertex graph with at least (𝜋(𝐻) + 𝜀/2) 𝑛20 edges contains 𝐻 as a
subgraph.
Let 𝑛 ≥ 𝑛0 and 𝐺 be an 𝑛-vertex graph with at least (𝜋(𝐻) + 𝜀) 𝑛2 edges. Let 𝑆 be an
𝑛0 -element subset of 𝑉 (𝐺), chosen uniformly at random. Let 𝑋 denote the edge density
of 𝐺 [𝑆]. By averaging, E𝑋 equals the edge density of 𝐺, and so E𝑋 ≥ 𝜋(𝐻) + 𝜀. Then
𝑋 ≥ 𝜋(𝐻) + 𝜀/2 with probability ≥ 𝜀/2 (or else E𝑋 could not be as large as 𝜋(𝐻) + 𝜀). So,
from the previous paragraph, we know that with probability ≥ 𝜀/2, 𝐺 [𝑆] contains a copy of
(𝐻 )
𝐻. This gives us ≥ (𝜀/2) 𝑛𝑛0 copies of 𝐻, but each copy of 𝐻 may be counted up to 𝑛𝑛−𝑣
0 −𝑣 (𝐻 )
times. Thus, the number of copies of 𝐻 in 𝐺 is
(𝜀/2) 𝑛𝑛0
≥ 𝑛−𝑣 (𝐻 ) = Ω𝐻, 𝜀 (𝑛𝑣 (𝐻 ) ). □
𝑛0 −𝑣 (𝐻 )
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
22 Forbidding a Subgraph
Exercise 1.3.6 (Density Ramsey). Prove that for every 𝑠 and 𝑟, there is some constant
𝑐 > 0 so that for every sufficiently large 𝑛, if the edges of 𝐾𝑛 are colored using 𝑟 colors,
then at least 𝑐 fraction of all copies of 𝐾𝑠 are monochromatic.
Zarankiewicz (1951) originally asked a related problem: determine the maximum number
of 1s in an 𝑚 × 𝑛 matrix without an 𝑠 × 𝑡 submatrix with all entries 1.
The main theorem of this section is the fundamental result due to Kővári, Sós, and Turán
(1954). We will refer to it as the KST theorem, which stands both for its discoverers, as well
as for the forbidden subgraph 𝐾𝑠,𝑡 .
..
𝑋 = number of copies of 𝐾𝑠,1 in 𝐺. .
(When 𝑠 = 1, we set 𝑋 = 2𝑒(𝐺).) The strategy is to count 𝑋 in two ways. First we count 𝐾𝑠,1
by first embedding the “left” 𝑠 vertices of 𝐾𝑠,1 . Then we count 𝐾𝑠,1 by first embedding the
“right” single vertex of 𝐾𝑠,1 .
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
Upper bound on 𝑋. Since 𝐺 is 𝐾𝑠,𝑡 -free, every 𝑠-vertex subset of 𝐺 has ≤ 𝑡 − 1 common
neighbors. Therefore,
𝑛
𝑋≤ (𝑡 − 1).
𝑠
Lower bound on 𝑋. For each vertex 𝑣 of 𝐺, there are exactly deg𝑠 𝑣 ways to pick 𝑠 of its
neighbors to form a 𝐾𝑠,1 as a subgraph. Therefore
∑︁ deg 𝑣
𝑋= .
𝑣 ∈𝑉 (𝐺)
𝑠
To obtain a lower bound onthis quantity in terms of the number of edges 𝑚 of 𝐺, we use a
standard trick of viewing 𝑥𝑠 as a convex function on the reals, namely, letting
(
𝑥(𝑥 − 1) · · · (𝑥 − 𝑠 + 1)/𝑠! if 𝑥 ≥ 𝑠 − 1
𝑓𝑠 (𝑥) =
0 𝑥 < 𝑠 − 1.
Then 𝑓𝑠 (𝑥) = 𝑥𝑠 for all nonnegative integers 𝑥. Furthermore 𝑓𝑠 is a convex function. Since
the average degree of 𝐺 is 2𝑚/𝑛, it follows by convexity that
∑︁
2𝑚
𝑋= 𝑓𝑠 (deg 𝑣) ≥ 𝑛 𝑓𝑠 .
𝑣 ∈𝑉 (𝐺)
𝑛
(It would be a sloppy mistake to lower bound 𝑋 by 𝑛 2𝑚/𝑛
𝑠
.)
Combining the upper bound and the lower bound. We find that
2𝑚 𝑛
𝑛 𝑓𝑠 ≤𝑋≤ (𝑡 − 1).
𝑛 𝑠
Since 𝑓𝑠 (𝑥) = (1 + 𝑜(1))𝑥 𝑠 /𝑠! for 𝑥 → ∞ and fixed 𝑠, we find that, as 𝑛 → ∞,
𝑠
𝑛 2𝑚 𝑛𝑠
≤ (1 + 𝑜(1)) (𝑡 − 1).
𝑠! 𝑛 𝑠!
Therefore,
(𝑡 − 1) 1/𝑠
𝑚≤ + 𝑜(1) 𝑛2−1/𝑠 . □
2
The final bound in the proof gives us a somewhat more precise estimate than stated in
Theorem 1.4.2. Let us record it here for future reference.
It has been long conjectured that the KST theorem is tight up to a constant factor.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
24 Forbidding a Subgraph
In the final sections of this chapter, we will produce some constructions showing that
Conjecture 1.4.4 is true for 𝐾2,𝑡 and 𝐾3,𝑡 . We also know that the conjecture is true if 𝑡 is much
larger than 𝑠. The first open case of the conjecture is 𝐾4,4 .
Corollary 1.4.5
For every bipartite graph 𝐻, there exists some constant 𝑐 > 0 so that ex(𝑛, 𝐻) =
𝑂 𝐻 (𝑛2−𝑐 ).
Proof. Suppose the two vertex parts of 𝐻 have sizes 𝑠 and 𝑡, with 𝑠 ≤ 𝑡. Then 𝐻 ⊆ 𝐾 𝑠,𝑡 . And
thus every 𝑛-vertex 𝐻-free graph is also 𝐾𝑠,𝑡 -free, and thus has 𝑂 𝑠,𝑡 (𝑛2−1/𝑠 ) edges. □
In particular, the Turán density 𝜋(𝐻) of every bipartite graph 𝐻 is zero.
The KST theorem gives a constant 𝑐 in the preceding corollary that depends on the number
of vertices on the smaller part of 𝐻. In Section 1.7, we will use the dependent random choice
technique to give a proof of the corollary showing that 𝑐 only has to depend on the maximum
degree of 𝐻.
In other words, given 𝑛 distinct points in the plane, at most how many pairs of these points
can be exactly distance 1 apart? We can draw a graph with these 𝑛 points as vertices, with
edges joining points exactly a unit distance apart.
To get a feeling for the problem, let us play with some constructions. For small values of
𝑛, it is not hard to check by hand that the following configurations are optimal.
𝑛= 3 4 5 6 7
What about for larger values of 𝑛? If we line up the 𝑛 points equally spaced on a line, we
get 𝑛 − 1 unit distances.
···
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
We can be a bit more efficient by chaining up triangles. The following construction gives us
2𝑛 − 3 unit distances.
···
The construction for 𝑛 = 6 looks like it was obtained by copying and translating a unit
triangle. We can generalize this idea to obtain a recursive construction. Let 𝑓 (𝑛) denote the
maximum number of unit distances formed by 𝑛 points in the plane. Given a configuration
𝑃 with ⌊𝑛/2⌋ points that has 𝑓 (⌊𝑛/2⌋) unit distances, we can copy 𝑃 and translate it by a
generic unit vector to get 𝑃′ . The configuration 𝑃 ∪ 𝑃′ has at least 2 𝑓 ( ⌊𝑛/2⌋) + ⌊𝑛/2⌋ unit
distances. We can solve the recursion to get 𝑓 (𝑛) ≳ 𝑛 log 𝑛.
𝑃 𝑃′
1
Now we take a different approach to obtain an even better construction. Take a square grid
√ √
with ⌊ 𝑛⌋ × ⌊ 𝑛⌋ vertices. Instead of choosing the distance between adjacent points as the
√
unit distance, we can scale the configuration so that 𝑟 becomes the “unit” distance for some
integer 𝑟. As an illustration, here is an example of a 5 × 5 grid with 𝑟 = 10.
It turns out that by choosing the optimal 𝑟 as a function of 𝑛, we can get at least
𝑛1+𝑐/log log 𝑛
unit distances, where 𝑐 > 0 is some absolute constant. The proof uses analytic number theory,
which we omit as it would take us too far afield. The basic idea is to choose 𝑟 to be a product
of many distinct primes that are congruent to 1 modulo 4, so that 𝑟 can be represented as a
sum of two squares in many different ways, and then estimate the number of such ways.
It is conjectured that the last construction just discussed is close to optimal.
The KST theorem can be used to prove the following upper bound on the number of unit
distances.
Proof. Every unit distance graph is 𝐾2,3 -free. Indeed, for every pair of distinct points, there
are at most two other points that are at a unit distance from both points.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
26 Forbidding a Subgraph
𝑝 𝑞
So, the number of edges is at most ex(𝑛, 𝐾2,3 ) = 𝑂 (𝑛3/2 ) by Theorem 1.4.2. □
There is a short proof of a better bound of 𝑂 (𝑛4/3 ) using the crossing number inequality
(see Section 8.2), and this is best known upper bound to date.
Erdős (1946) also asked the following related question.
Let 𝑔(𝑛) denote the answer. The asymptotically best construction for the minimum number
of distinct distances is also a square grid, same √︁
as earlier. It can be shown that a square grid
√ √
with ⌊ 𝑛⌋ × ⌊ 𝑛⌋ points has on the order of 𝑛/ log 𝑛 distinct distances. This is conjectured
√︁
to be optimal (i.e., 𝑔(𝑛) ≳ 𝑛/ log 𝑛).
Let 𝑓 (𝑛) denote the
maximum number of unit distances among 𝑛 points in the answer. We
have 𝑓 (𝑛)𝑔(𝑛) ≥ 𝑛2 , since each distance occurs at most 𝑓 (𝑛) times. So an upper bound on
𝑓 (𝑛) gives a lower bound on 𝑔(𝑛) (but not conversely).
A breakthrough on the distinct distances problem was obtained by Guth and Katz (2015).
In other
√︁ words, 𝑔(𝑛) ≳ 𝑛/log 𝑛, thereby matching the upper bound example up to a factor
of 𝑂 ( log 𝑛). The Guth–Katz proof is quite sophisticated. It uses tools ranging from the
polynomial method to algebraic geometry.
Exercise 1.4.11. Show that a 𝐶4 -free bipartite graph between two vertex parts of sizes 𝑎
and 𝑏 has at most 𝑎𝑏 1/2 + 𝑏 edges.
Exercise 1.4.12 (Density KST). Prove that for every pair of positive integers 𝑠 ≤ 𝑡, there
𝑛
are constants 𝐶, 𝑐 > 0 such that every 𝑛-vertex graph with 𝑝 2
edges contains at least
𝑐 𝑝 𝑠𝑡 𝑛𝑠+𝑡 copies of 𝐾𝑠,𝑡 , provided that 𝑝 ≥ 𝐶𝑛 −1/𝑠 .
The next exercise asks you to think about the quantitative dependencies in the proof of the
KST theorem.
Exercise 1.4.13. Show that for every 𝜀 > 0, there exists 𝛿 > 0 such that every graph with
𝑛 vertices and at least 𝜀𝑛2 edges contains a copy of 𝐾𝑠,𝑡 where 𝑠 ≥ 𝛿 log 𝑛 and 𝑡 ≥ 𝑛0.99 .
The next exercise illustrates a bad definition of density of a subset of Z2 (it always ends
up being either 0 or 1).
Exercise 1.4.14 (How not to define density). Let 𝑆 ⊆ Z2 . Define
|𝑆 ∩ ( 𝐴 × 𝐵)|
𝑑 𝑘 (𝑆) = max .
𝐴,𝐵⊆Z | 𝐴| |𝐵|
| 𝐴|=| 𝐵|=𝑘
Remark 1.5.2 (History). Erdős and Stone (1946) proved this result when 𝐻 is a complete
multipartite graph. Erdős and Simonovits (1966) observed that the general case follows as a
quick corollary. The proof given here is due to Erdős (1971).
Example 1.5.3. When 𝐻 = 𝐾𝑟+1 , 𝜒(𝐻) = 𝑟 + 1, and so Theorem 1.5.1 agrees with Turán’s
theorem.
Example 1.5.4. When 𝐻 is the Petersen graph (shown in what follows), which has chromatic
number 3, Theorem 1.5.1 tells us that ex(𝑛, 𝐻) = (1/4 + 𝑜(1))𝑛2 . The Turán density of the
Petersen graph is the same as that of a triangle, which may be somewhat surprising since the
Petersen graph seems more complicated than the triangle.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
28 Forbidding a Subgraph
2
1
1 1
3 2
3 2
2 3
In other words, using the notation 𝐾𝑟 [𝑠] for 𝒔-blowup of 𝐾𝑟 , obtained by replacing each
vertex of 𝐾𝑟 by 𝑠 duplicates of itself (so that 𝐾𝑟 [𝑠] = 𝐻 in the preceding theorem statement),
the Erdős–Stone theorem says that
1
𝜋(𝐾𝑟 [𝑠]) = 𝜋(𝐾𝑟 ) = 1 − ,
𝑟 −1
In Section 1.3, we saw supersaturation (Theorem 1.3.4): when the edge density is signif-
icantly above the Turán density threshold 𝜋(𝐻), one finds not just a single copy of 𝐻 but
actually many copies. The Erdős–Stone theorem can be viewed in this light: above edge
density 𝜋(𝐻), we find a large blowup of 𝐻.
The proof uses the following hypergraph extension of the KST theorem, which we will
prove later in the section.
Recall the hypergraph Turán problem (Remark 1.3.3). Given an 𝑟-uniform hypergraph 𝐻
(also known as an 𝑟-graph), we write ex(𝑛, 𝐻) to be the maximum number of edges in an
𝐻-free 𝑟-graph.
The analogue of a complete bipartite graph for an 𝑟-graph is a complete 𝑟-partite 𝑟-graph
𝑲𝒔1 ,...,𝒔𝒓 . Its vertex set consists of disjoint vertex parts 𝑉1 , . . . , 𝑉𝑟 with |𝑉𝑖 | = 𝑠𝑖 for each 𝑖.
(𝒓 )
Proof of the Erdős–Stone theorem (Theorem 1.5.5). We already saw the lower bound to
ex(𝑛, 𝐻) using a Turán graph. It remains to prove an upper bound.
Let 𝐺 be an 𝐻-free graph (where 𝐻 = 𝐾𝑠,...,𝑠 is the complete 𝑟-partite graph in the
theorem). Let 𝐺 (𝑟 ) be the 𝑟-graph with the same vertex set as 𝐺 and whose edges are the
𝑟-cliques in 𝐺. Note that 𝐺 (𝑟 ) is 𝐾𝑠,...,𝑠
(𝑟 )
-free, or else a copy of 𝐾𝑠,...,𝑠
(𝑟 )
in 𝐺 (𝑟 ) would be
supported by a copy of 𝐻 in 𝐺. Thus, by the hypergraph KST theorem (Theorem 1.5.6),
𝐺 (𝑟 ) has 𝑜(𝑛𝑟 ) edges. So 𝐺 has 𝑜(𝑛𝑟 ) copies of 𝐾𝑟 , and thus by the supersaturation theorem
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
quoted above, the edge density of 𝐺 is at most 𝜋(𝐾𝑟 ) + 𝑜(1), which equals 1 − 1/(𝑟 − 1) + 𝑜(1)
by Turán’s theorem. □
In Section 2.6, we will give another proof of the Erdős–Stone–Simonovits theorem using
the graph regularity method.
Hypergraph KST
To help keep notation simple, we first consider what happens for 3-uniform hypergraphs.
Recall that the KST theorem (Theorem 1.4.2) was proved by counting the number of
copies of 𝐾𝑠,1 in the graph in two different ways. For 3-graphs, we instead count the number
of copies of 𝐾𝑠,1,1
(3)
in two different ways, one of which uses the KST theorem for 𝐾𝑠,𝑠 -free
graphs.
Proof. Let 𝐺 be a 𝐾 𝑠,𝑠,𝑠 -free 3-graph with 𝑛 vertices and 𝑚 edges. Let 𝑋 denote the number
(3)
of copies of (3)
𝐾𝑠,1,1 in 𝐺. (When 𝑠 = 1, we count each copy three times.)
Upper bound on 𝑋. Given a set 𝑆 of 𝑠 vertices, consider the set 𝑇 of all unordered pairs
of distinct vertices that would form a 𝐾𝑠,1,1
(3)
with 𝑆 (i.e., every triple formed by combining
a pair in 𝑇 and a vertex of 𝑆 is an edge of 𝐺). Note that 𝑇 is the edge-set of a graph on the
same 𝑛 vertices. If 𝑇 contains a 𝐾𝑠,𝑠 , then together with 𝑆 we would have a 𝐾𝑠,𝑠,𝑠
(3)
. Thus 𝑇 is
2−1/𝑠
𝐾𝑠,𝑠 -free, and hence by Theorem 1.4.2, |𝑇 | = 𝑂 𝑠 (𝑛 ). Therefore,
𝑛 2−1/𝑠
𝑋 ≲𝑠 𝑛 ≲𝑠 𝑛𝑠+2−1/𝑠 .
𝑠
Lower bound on 𝑋. We write deg(𝑢, 𝑣) for the number of edges in 𝐺 containing both 𝑢
and 𝑣. Then, summing over all unordered pairs of distinct vertices 𝑢, 𝑣 in 𝐺, we have
∑︁ deg(𝑢, 𝑣)
𝑋= .
𝑢,𝑣
𝑠
30 Forbidding a Subgraph
and hence
2
𝑚 = 𝑂 𝑠 (𝑛3−1/𝑠 ). □
We can iterate further, using the same technique, to prove an analogous result for ev-
ery uniformity, thereby giving us the statement (Theorem 1.5.6) used in our proof of the
Erdős–Stone–Simonovits theorem earlier. Feel free to skip reading the next proof if you feel
comfortable with generalizing the above proof to 𝑟-graphs.
where 𝐾𝑠,...,𝑠
(𝑟 )
is the complete 𝑟-partite 𝑟-graph with 𝑠 vertices in each of the 𝑟 parts.
and hence
−𝑟+1
𝑚 = 𝑂 𝑟 ,𝑠 (𝑛𝑟 −𝑠 ). □
Exercise 1.5.10 (Forbidding a multipartite complete hypergraph with unbalanced parts).
Prove that for every sequence of positive integers 𝑠1 , . . . , 𝑠𝑟 , there exists 𝐶 so that
ex(𝑛, 𝐾𝑠(𝑟1 ,...,𝑠
)
𝑟
) ≤ 𝐶𝑛𝑟 −1/(𝑠1 ···𝑠𝑟 −1 ) .
Exercise 1.5.11 (Erdős–Stone for hypergraphs). Let 𝐻 be an 𝑟-graph. Show that 𝜋(𝐻 [𝑠]) =
𝜋(𝐻), where 𝐻 [𝑠], the 𝑠-blowup of 𝐻, is obtained by replacing every vertex of 𝐻 by 𝑠
duplicates of itself.
Odd Cycles
First let us consider forbidding odd cycles. Let 𝑘 be a positive integer. Then 𝐶2𝑘+1 has
chromatic number 3, and so the Erdős–Stone–Simonovits theorem (Theorem 1.5.1) tells us
that
𝑛2
ex(𝑛, 𝐶2𝑘+1 ) = (1 + 𝑜(1)) .
4
In fact, an even stronger statement is true. If 𝑛 is large enough (as a function of 𝑘), then the
complete bipartite graph 𝐾 ⌊𝑛/2⌋,⌈𝑛/2⌉ is always the extremal graph, just like in the triangle
case.
We will not prove this theorem. See Füredi and Gunderson (2015) for a more recent proof.
More generally, Simonovits (1974) developed a stability method for exactly determining
the Turán number of nonbipartite color-edge-critical graphs. Here we say that a graph if
color-edge-critical if the removal of any edge strictly reduces the chromatic number.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
32 Forbidding a Subgraph
Remark 1.6.4 (Tightness). We will see in Section 1.10 a matching lower bound construction
(up to constant factors) for 𝑘 = 2, 3, 5. For all other values of 𝑘, it is an open question whether
a matching lower bound construction exists.
Instead of proving the preceding theorem, we will prove a weaker result, stated in what
follows. This weaker result has a short and neat proof, which hopefully gives some intuition
as to why the above theorem should be true.
Proof. Color every vertex with red or blue independently and uniformly at random. Then
the expected number of nonmonochromatic edges is 𝑒(𝐺)/2. Hence there exists a coloring
that has at least 𝑒(𝐺)/2 nonmonochromatic edges, and these edges form the desired bipartite
subgraph. □
Lemma 1.6.7 (Large average degree implies subgraph with large minimum degree)
Let 𝑡 > 0. Every graph with average degree 2𝑡 has a subgraph with minimum degree
greater than 𝑡.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
Proof. Let 𝐺 be a graph with average degree 2𝑡. Removing a vertex of degree at most 𝑡
cannot decrease the average degree, since the total degree goes down by at most 2𝑡 and so
the post-deletion graph has average degree of at least (2𝑒(𝐺) − 2𝑡)/(𝑣(𝐺) − 1), which is
at least 2𝑒(𝐺)/𝑣(𝐺) since 2𝑒(𝐺)/𝑣(𝐺) ≥ 2𝑡. Let us repeatedly delete vertices of degree at
most 𝑡 in the remaining graph until every vertex has degree more than 𝑡. This algorithm must
terminate with a nonempty graph since we cannot ever drop below 2𝑡 vertices in this process
(as such a graph would have average degree less than 2𝑡). □
Proof of Theorem 1.6.5. The idea is to use a breath-first search. Suppose 𝐺 contains no
even cycles of length at most 2𝑘. Applying Lemma 1.6.6 followed by Lemma 1.6.7, we find
a bipartite subgraph 𝐺 ′ of 𝐺 with minimum degree > 𝑡 B 𝑒(𝐺)/(2𝑣(𝐺)). Let 𝑢 be an
arbitrary vertex of 𝐺 ′ . For each 𝑖 = 0, 1, . . . , 𝑘, let 𝐴𝑖 denote the set of vertices at distance
exactly 𝑖 from 𝑢.
···
𝑢 ···
𝐴0
···
𝐴1
𝐴2 𝐴𝑘
34 Forbidding a Subgraph
Theorem 1.7.1 (Bounded degree bipartite graph: Turán number upper bound)
Let 𝐻 be a bipartite graph with vertex bipartition 𝐴 ∪ 𝐵 such that every vertex in 𝐴 has
degree at most 𝑟. Then there exists a constant 𝐶 = 𝐶𝐻 such that for all 𝑛,
ex(𝑛, 𝐻) ≤ 𝐶𝑛2−1/𝑟 .
Remark 1.7.2 (History). The result was first proved by Füredi (1991). The proof given here
is due to Alon, Krivelevich, and Sudakov (2003a). For more applications of the dependent
random choice technique see the survey by Fox and Sudakov (2011).
Remark 1.7.3 (Tightness). The exponent 2 − 1/𝑟 is best possible as a function of 𝑟. Indeed,
we will see in the following section that for every 𝑟 there exists some 𝑠 so that ex(𝑛, 𝐾𝑟 ,𝑠 ) ≥
𝑐𝑛2−1/𝑟 for some 𝑐 = 𝑐(𝑟, 𝑠) > 0.
On the other hand, for specific graphs 𝐺, Theorem 1.7.1 may not be tight. For example,
ex(𝑛, 𝐶6 ) = Θ(𝑛4/3 ), whereas Theorem 1.7.1 only tells us that ex(𝑛, 𝐶6 ) = 𝑂 (𝑛3/2 ).
Given a graph 𝐺 with many edges, we wish to find a large subset 𝑈 of vertices such
that every 𝑟-vertex subset of 𝑈 has many common neighbors in 𝐺. (Even the case 𝑟 = 2 is
interesting.) Once such a 𝑈 is found, we can then embed the 𝐵-vertices of 𝐻 into 𝑈. It will
then be easy to embed the vertices of 𝐴. The tricky part is to find such a 𝑈.
Remark 1.7.4 (Intuition). We want to host a party so that each pair of partygoers has many
common friends. (Here 𝐺 is the friendship graph.) Whom should we invite? Inviting people
uniformly at random is not a good idea. (Why?) Perhaps we can pick some random individual
(Alice) to host a party inviting all her friends. Alice’s friends are expected to share some
common friends – at least they all know Alice.
We can take a step further, and pick a few people at random (Alice, Bob, Carol, David)
and have them host a party and invite all their common friends. This will likely be an even
more sociable crowd. At least all the party goers will know all the hosts, and likely even
more. As long as the social network is not too sparse, there should be lots of invitees.
Some invitees (e.g., Zack) might feel a bit out of place at the party – maybe they don’t
have many common friends with other partygoers. (They all know the hosts but maybe
Zack doesn’t know many others.) To prevent such awkwardness, the hosts will cancel Zack’s
invitation. There shouldn’t be too many people like Zack. The party must go on.
Here is the technical statement that we will prove. While there are many parameters, the
specific details are less important compared to the proof technique. This is quite a tricky
proof.
Remark 1.7.6 (Parameters). In the theorem statement, 𝑡 is an auxiliary parameter that does
not appear in the conclusion. While one can optimize for 𝑡, it is instructive and convenient to
leave it as is. The theorem is generally applied to graphs with at least 𝑛2−𝑐 edges, for some
small 𝑐 > 0, and we can play with the parameters to get |𝑈| and 𝑚 both large as desired.
Proof. We say that an 𝑟-element subset of 𝑉 (𝐺) is “bad” if it has at most 𝑚 common
neighbors in 𝐺.
Let 𝑢 1 , . . . , 𝑢 𝑡 be vertices chosen uniformly and independently at random from 𝑉 (𝐺).
(These vertices are chosen “with replacement,” i.e., they can repeat.) Let 𝐴 be their common
neighborhood. (Keep in mind that 𝑢 1 , . . . , 𝑢 𝑡 , 𝐴 are random. It may be a bit confusing in this
proof to see what is random and what is not.)
𝑢1 ... 𝑢𝑡
Each fixed vertex 𝑣 ∈ 𝑉 (𝐺) has probability (deg(𝑣)/𝑛) 𝑡 of being adjacent to all of 𝑢 1 , . . . , 𝑢 𝑡 ,
and so, by linearity of expectations and convexity,
!𝑡
∑︁ ∑︁ deg(𝑣) 𝑡 1 ∑︁ deg(𝑣)
E | 𝐴| = P(𝑣 ∈ 𝐴) = ≥𝑛 ≥ 𝑛𝛼𝑡 .
𝑣 ∈𝑉 (𝐺) 𝑣 ∈𝑉 (𝐺)
𝑛 𝑛 𝑣 ∈𝑉
𝑛
36 Forbidding a Subgraph
Now we are ready to show Theorem 1.7.1, which we recall says that, for a bipartite graph
𝐻 with vertex bipartition 𝐴 ∪ 𝐵 such that every vertex in 𝐴 has degree at most 𝑟, one has
ex(𝑛, 𝐻) = 𝑂 𝐻 (𝑛2−1/𝑟 ).
Proof of Theorem 1.7.1. Let 𝐺 be a graph with 𝑛 vertices and at least 𝐶𝑛2−1/𝑟 edges. By
choosing 𝐶 large enough (depending only on | 𝐴| + |𝐵|), we have
𝑟 𝑛 | 𝐴| + |𝐵| 𝑟
𝑛 2𝐶𝑛 −1/𝑟
− ≥ |𝐵| .
𝑟 𝑛
We want to show that 𝐺 contains 𝐻 as a subgraph. By dependent random choice (Theo-
rem 1.7.5 applied with 𝑡 = 𝑟), we can embed the 𝐵-vertices of 𝐻 into 𝐺 so that every 𝑟-vertex
subset of 𝐵 (now viewed as a subset of 𝑉 (𝐺)) has > | 𝐴| + |𝐵| common neighbors.
𝐴 𝐵
𝐻 𝐺
Next, we embed the vertices of 𝐴 one at a time. Suppose we need to embed 𝑣 ∈ 𝐴 (some
previous vertices of 𝐴 may have already been embedded at this point). Note that 𝑣 has ≤ 𝑟
neighbors in 𝐵, and these ≤ 𝑟 vertices in 𝐵 have > | 𝐴| + |𝐵| common neighbors in 𝐺. While
some of these common neighbors may have already been used up in earlier steps to embed
vertices of 𝐻, there are enough of them that they cannot all be used up, and thus we can
embed 𝑣 to some remaining common neighbor. This process ends with an embedding of 𝐻
into 𝐺. □
Exercise 1.7.7. Let 𝐻 be a bipartite graph with vertex bipartition 𝐴 ∪ 𝐵, such that 𝑟
vertices in 𝐴 are complete to 𝐵, and all remaining vertices in 𝐴 have degree at most 𝑟.
Prove that there is some constant 𝐶 = 𝐶𝐻 such that ex(𝑛, 𝐻) ≤ 𝐶𝑛2−1/𝑟 for all 𝑛.
Exercise 1.7.8. Let 𝜀 > 0. Show that, for sufficiently large 𝑛, every 𝐾4 -free graph with 𝑛
vertices and at least 𝜀𝑛2 edges contains an independent set of size at least 𝑛1− 𝜀 .
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
Randomized Constructions
The idea is to take a random graph at a density that gives a small number of copies of 𝐻,
and then destroy these copies of 𝐻 by removing some edges from the random graph. The
resulting graph is then 𝐻-free. This method is easy to implement and applies quite generally
to all 𝐻. For example, it will be shown that
𝑣 (𝐻) −2
ex(𝑛, 𝐻) = Ω𝐻 𝑛2− 𝑒 (𝐻) −1 .
However, bounds arising from this method are usually not tight.
Algebraic Constructions
The idea is to use algebraic geometry over a finite field to construct a graph. Its vertices
correspond to geometric objects such as points or lines. Its edges correspond to incidences
or other algebraic relations. These constructions sometimes give tight bounds. They work
for a small number of graphs 𝐻, and usually require a different ad hoc idea for each 𝐻. They
work rarely, but when they do, they can appear quite mysterious, or even magical. Many
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
38 Forbidding a Subgraph
important tight lower bounds on bipartite extremal numbers arise this way. In particular it
will be shown that
ex(𝑛, 𝐾𝑠,𝑡 ) = Ω𝑠,𝑡 𝑛2−1/𝑠 whenever 𝑡 ≥ (𝑠 − 1)! + 1,
thereby matching the KST theorem (Theorem 1.4.2) for such 𝑠, 𝑡. Also, it will be shown that
ex(𝑛, 𝐶2𝑘 ) = Ω𝑘 𝑛1+1/𝑘 whenever 𝑘 ∈ {2, 3, 5},
thereby matching Theorem 1.6.3 for these values of 𝑘.
Proof. Let 𝐺 be an instance of the Erdős–Rényi random graph G(𝑛, 𝑝), with
1 − 𝑒𝑣 (𝐻)
(𝐻) −2
𝑝= 𝑛 −1
4
(chosen with hindsight). We have E 𝑒(𝐺) = 𝑝 𝑛2 . Let 𝑋 denote the number of copies of 𝐻
in 𝐺. Then, our choice of 𝑝 ensures that
𝑒 (𝐻 ) 𝑣 (𝐻 ) 𝑝 𝑛 1
E𝑋 ≤ 𝑝 𝑛 ≤ = E 𝑒(𝐺).
2 2 2
Thus
𝑝 𝑛 𝑣 (𝐻) −2
E[𝑒(𝐺) − 𝑋] ≥ ≳ 𝑛2− 𝑒 (𝐻) −1 .
2 2
Take a graph 𝐺 such that 𝑒(𝐺) − 𝑋 is at least its expectation. Remove one edge from each
𝑣 (𝐻) −2
copy of 𝐻 in 𝐺, and we get an 𝐻-free graph with at least 𝑒(𝐺) − 𝑋 ≳ 𝑛2− 𝑒 (𝐻) −1 edges. □
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
For some graphs 𝐻, we can bootstrap Theorem 1.9.1 to give an even better lower bound.
For example, if
𝐻= ,
then 𝑣(𝐻) = 10 and 𝑒(𝐻) = 20, so applying Theorem 1.9.1 directly gives
ex(𝑛, 𝐻) ≳ 𝑛2−8/19 .
On the other hand, any 𝐾4,4 -free graph is automatically 𝐻-free. Applying Theorem 1.9.1 to
𝐾4,4 (8-vertex 16-edge) actually gives a better lower bound (2 − 6/15 > 2 − 8/19):
ex(𝑛, 𝐻) ≥ ex(𝑛, 𝐾4,4 ) ≳ 𝑛2−6/15 .
In general, given 𝐻, we should apply Theorem 1.9.1 to the subgraph of 𝐻 with the
maximum (𝑒(𝐻) − 1)/(𝑣(𝐻) − 2) ratio. This gives the following corollary, which sometimes
gives a better lower bound than directly applying Theorem 1.9.1.
When 𝑡 is large compared to 𝑠, the exponents in the preceding two bounds are close to each
other (but never equal). When 𝑡 = 𝑠, the preceding bounds specialize to
2 1
𝑛2− 𝑠+1 ≲ ex(𝑛, 𝐾𝑠,𝑠 ) ≲ 𝑛2− 𝑠 .
In particular, for 𝑠 = 2,
𝑛4/3 ≲ ex(𝑛, 𝐾2,2 ) ≲ 𝑛3/2 .
It turns out that the upper bound is tight. We will show this in the next section using an
algebraic construction.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
40 Forbidding a Subgraph
Exercise 1.9.5. Show that if 𝐻 is a bipartite graph containing a cycle of length 2𝑘, then
ex(𝑛, 𝐻) ≳𝐻 𝑛1+1/(2𝑘−1) .
1 2
Exercise 1.9.6. Find a graph 𝐻 with 𝜒(𝐻) = 3 and ex(𝑛, 𝐻) > 4
𝑛 + 𝑛1.99 for all
sufficiently large 𝑛.
𝐾2,2 -free
We begin by constructing 𝐾2,2 -free graphs with the number of edges matching the KST
theorem. The construction is due to Erdős, Rényi, and Sós (1966) and Brown (1966) inde-
pendently.
Before giving the proof of Theorem 1.10.1, let us first sketch the geometric intuition.
Given a set of points P and a set of lines L, the point-line incidence graph is the bipartite
graph with two vertex parts P and L, where 𝑝 ∈ P and ℓ ∈ L are adjacent if 𝑝 ∈ ℓ.
P 𝑝∈ℓ L
𝑝
ℓ
A point-line incidence graph is 𝐶4 -free. Indeed, a 𝐶4 would correspond to two lines both
passing through two distinct points, which is impossible.
We want to construct a set of points and a set of lines so that there are many incidences.
To do this, we take all points and all lines in a finite field plane F2𝑝 . There are 𝑝 2 points
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
and 𝑝 2 + 𝑝 lines. Since every line contains 𝑝 points, the graph has around 𝑝 3 edges, and
so ex(2𝑝 2 + 𝑝, 𝐾2,2 ) ≥ 𝑝 3 . By rounding down an integer 𝑛 to the closest number of the
form 2𝑝 2 + 𝑝 for a prime 𝑝, we already see that ex(𝑛, 𝐾2,2 ) ≳ 𝑛3/2 for all 𝑛. Here we use a
theorem from number theory regarding large gaps in primes, which we quote in what follows
without proof. (This strategy does not work if we instead take points and lines in R2 ; see the
Szemerédi–Trotter theorem in Section 8.2).
Remark 1.10.4 (Large gaps between primes). The preceding result already follows from
the prime number theorem, which says that the number of primes up to 𝑁 is (1+𝑜(1))𝑁/log 𝑁.
The best quantitative result, due to Baker, Harman, and Pintz (2001), says that there exists a
prime in [𝑁 − 𝑁 0.525 , 𝑁] for all sufficiently large 𝑁. Cramer’s conjecture, which is wide open
and based on a random model of the primes, speculates that the 𝑜(𝑁) in Theorem 1.10.3 may
be replaced by 𝑂 ((log 𝑁) 2 ). An easier claim is Bertrand’s postulate, which says that there is a
prime between 𝑁 and 2𝑁 for every 𝑁, and this already suffices for proving ex(𝑛, 𝐾2,2 ) ≳ 𝑛3/2 .
To get a better constant in the preceding construction, we optimize somewhat by using the
same vertices to represent both points and lines. This pairing of points and lines is known as
polarity in projective geometry, and this construction is known as the polarity graph. (This
usually refers to the projective plane version of the construction.)
Proof of Theorem 1.10.1. Let 𝑝 denote the largest prime such that 𝑝 2 − 1 ≤ 𝑛. Then 𝑝 =
√
(1 − 𝑜(1)) 𝑛 by Theorem 1.10.3. Let 𝐺 be a graph with vertex set 𝑉 (𝐺) = F2𝑝 \ {(0, 0)} and
an edge between (𝑥, 𝑦) and (𝑎, 𝑏) if and only if 𝑎𝑥 + 𝑏𝑦 = 1 in F 𝑝 .
For any two distinct vertices (𝑎, 𝑏) and (𝑎 ′ , 𝑏 ′ ) in 𝑉 (𝐺), they have at most one common
neighbor since there is at most one solution to the system 𝑎𝑥 + 𝑏𝑦 = 1 and 𝑎 ′ 𝑥 + 𝑏 ′ 𝑦 = 1.
Therefore, 𝐺 is 𝐾2,2 -free. (This is where we use the fact that two lines intersect in at most
one point.)
For every (𝑎, 𝑏) ∈ 𝑉 (𝐺), there are exactly 𝑝 vertices (𝑥, 𝑦) satisfying 𝑎𝑥 + 𝑏𝑦 = 1.
However, one of those vertices could be (𝑎, 𝑏) itself. So every vertex in 𝐺 has degree 𝑝 or
𝑝 − 1. Hence 𝐺 has at least ( 𝑝 2 − 1) ( 𝑝 − 1)/2 = (1/2 − 𝑜(1))𝑛3/2 edges. □
𝐾3,3 -free
Next, we construct 𝐾3,3 -free graphs with the number of edges matching the KST theorem.
This construction is due to Brown (1966).
Consider the incidences between points in R3 and unit spheres. This graph is 𝐾3,3 -free
since no three unit spheres can share three distinct common points. Again, one needs to do
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
42 Forbidding a Subgraph
this over a finite field to attain the desired bounds, but it is easier to visualize the setup in
Euclidean space, where it is clearly true.
Proof sketch. Let 𝑝 be the largest prime less than 𝑛1/3 . Fix a nonzero element 𝑑 ∈ F 𝑝 , which
we take to be a quadratic residue if 𝑝 ≡ 3 (mod 4) and a quadratic nonresidue if 𝑝 . 3
(mod 4). Construct a graph 𝐺 with vertex set 𝑉 (𝐺) = F3𝑝 , and an edge between (𝑥, 𝑦, 𝑧) and
(𝑎, 𝑏, 𝑐) ∈ 𝑉 (𝐺) if and only if
(𝑎 − 𝑥) 2 + (𝑏 − 𝑦) 2 + (𝑐 − 𝑧) 2 = 𝑑.
It turns out that each vertex has (1 − 𝑜(1)) 𝑝 2 neighbors. (The intuition here is that, for a
fixed (𝑎, 𝑏, 𝑐), if we choose 𝑥, 𝑦, 𝑧 ∈ F 𝑝 independently and uniformly at random, then the
resulting sum (𝑎 − 𝑥) 2 + (𝑏 − 𝑦) 2 + (𝑐 − 𝑧) 2 is roughly uniformly distributed, and hence
equals to 𝑑 with probability close to 1/𝑝.) It remains to show that the graph is 𝐾3,3 -free.
To see this, think about how one might prove this claim in R3 via algebraic manipulations.
We compute the radical planes between pairs of spheres as well as the intersections of these
radical planes (i.e., the radical axis). The claim boils down to the fact that no sphere has three
collinear points, which is true due to the quadratic (non)residue hypothesis on 𝑑. The details
are omitted.
Thus 𝐺 is a 𝐾3,3 -free graph on 𝑝 3 ≤ 𝑛 vertices and with at least (1/2 − 𝑜(1)) 𝑝 5 =
(1/2 − 𝑜(1))𝑛5/3 edges. □
It is unknown if the preceding ideas can be extended to construct 𝐾4,4 -free graphs with
Ω(𝑛7/4 ) edges. It is a major open problem to determine the asymptotics of ex(𝑛, 𝐾4,4 ).
𝐾𝑠,𝑡 -free
Now we present a substantial generalization of the above constructions, due to Kollár, Rónyai,
and Szabó (1996) and Alon, Rónyai, and Szabó (1999). It gives a matching lower bound (up
to a constant factor) to the KST theorem for 𝐾𝑠,𝑡 whenever 𝑡 is sufficiently large compared to
𝑠.
Proposition 1.10.10
NormGraph 𝑝,𝑠 is 𝐾 𝑠,𝑠!+1 -free for all 𝑠 ≥ 2.
We wish to upper bound the number of common neighbors to a set of 𝑠 vertices. This
amount to showing that a certain system of algebraic equations cannot have too many
solutions. We quote without proof the following key algebraic result from Kollár, Rónyai,
and Szabó (1996), which can be proved using algebraic geometry.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
44 Forbidding a Subgraph
Theorem 1.10.11
Let F be any field and 𝑎 𝑖 𝑗 , 𝑏 𝑖 ∈ F such that 𝑎 𝑖 𝑗 ≠ 𝑎 𝑖′ 𝑗 for all 𝑖 ≠ 𝑖 ′ . Then the system of
equations
(𝑥 1 − 𝑎 11 ) (𝑥2 − 𝑎 12 ) · · · (𝑥 𝑠 − 𝑎 1𝑠 ) = 𝑏 1
(𝑥 1 − 𝑎 21 ) (𝑥2 − 𝑎 22 ) · · · (𝑥 𝑠 − 𝑎 2𝑠 ) = 𝑏 2
..
.
(𝑥1 − 𝑎 𝑠1 ) (𝑥2 − 𝑎 𝑠2 ) · · · (𝑥 𝑠 − 𝑎 𝑠𝑠 ) = 𝑏 𝑠
has at most 𝑠! solutions (𝑥1 , . . . , 𝑥 𝑠 ) ∈ F𝑠 .
Remark 1.10.12 (Special case 𝑏 = 0). Consider the special case when all the 𝑏 𝑖 are 0. In
this case, since the 𝑎 𝑖 𝑗 are distinct for each fixed 𝑗, every solution to the system corresponds
to a permutation 𝜋 : [𝑠] → [𝑠], setting 𝑥𝑖 = 𝑎 𝑖 𝜋 (𝑖) . So there are exactly 𝑠! solutions in
this special case. The difficult part of the theorem says that the number of solutions cannot
increase if we move 𝑏 away from the origin.
Proof of Proposition 1.10.10. Consider distinct 𝑦 1 , 𝑦 2 , . . . , 𝑦 𝑠 ∈ F 𝑝 𝑠 . We wish to bound the
number of common neighbors 𝑥. Recall that in a field with characteristic 𝑝, we have the
identity (𝑥 + 𝑦) 𝑝 = 𝑥 𝑝 + 𝑦 𝑝 for all 𝑥, 𝑦. So,
1 = 𝑁 (𝑥 + 𝑦 𝑖 ) = (𝑥 + 𝑦 𝑖 ) (𝑥 + 𝑦 𝑖 ) 𝑝 . . . (𝑥 + 𝑦 𝑖 ) 𝑝
𝑠−1
𝑠−1 𝑠−1
= (𝑥 + 𝑦 𝑖 ) (𝑥 𝑝 + 𝑦 𝑖𝑝 ) . . . (𝑥 𝑝 + 𝑦 𝑖𝑝 )
for all 1 ≤ 𝑖 ≤ 𝑠. By Theorem 1.10.11, these 𝑠 equations (as 𝑖 ranges over [𝑠]) have at most
𝑠! solutions in 𝑥. Note the hypothesis of Theorem 1.10.11 is satisfied since 𝑦 𝑖𝑝 = 𝑦 𝑝𝑗 if and
only if 𝑦 𝑖 = 𝑦 𝑗 in F 𝑝 𝑠 . □
Now we modify the norm graph construction to forbid 𝐾𝑠, (𝑠−1)!+1 , thereby yielding Theo-
rem 1.10.7.
Construction 1.10.13 (Projective norm graph)
Let ProjNormGraph 𝒑,𝒔 be the graph with vertex set F 𝑝 𝑠−1 × F×𝑝 , where two vertices
(𝑋, 𝑥), (𝑌 , 𝑦) ∈ F 𝑝 𝑠−1 × F×𝑝 are adjacent if and only if
𝑁 (𝑋 + 𝑌 ) = 𝑥𝑦.
In ProjNormGraph 𝑝,𝑠 , every vertex (𝑋, 𝑥) has degree 𝑝 𝑠−1 − 1 since its neighbors are
(𝑌 , 𝑁 (𝑋 + 𝑌 )/𝑥) for all 𝑌 ≠ −𝑋. There are ( 𝑝 𝑠−1 − 1) 𝑝 𝑠−1 ( 𝑝 − 1)/2 edges. As earlier, it
remains to show that this graph is 𝐾𝑠, (𝑠−1)!+1 -free. Once we know this, by taking 𝑝 to be the
largest prime satisfying 𝑝 𝑠−1 ( 𝑝 − 1) ≤ 𝑛, we obtain the desired lower bound
1 𝑠−1 1
ex(𝑛, 𝐾𝑠, (𝑠−1)!+1 ) ≥ ( 𝑝 − 1) 𝑝 ( 𝑝 − 1) ≥
𝑠−1
− 𝑜(1) 𝑛2−1/𝑠 .
2 2
Proposition 1.10.14
ProjNormGraph 𝑝,𝑠 is 𝐾 𝑠, (𝑠−1)!+1 -free.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
Proof. Fix distinct (𝑌1 , 𝑦 1 ), . . . , (𝑌𝑠 , 𝑦 𝑠 ) ∈ F 𝑝 𝑠−1 × F×𝑝 . We wish to show that there are at most
(𝑠 − 1)! solutions (𝑋, 𝑥) ∈ F 𝑝 𝑠−1 × F×𝑝 to the system of equations
𝑁 (𝑋 + 𝑌𝑖 ) = 𝑥𝑦 𝑖 , 𝑖 = 1, . . . , 𝑠.
Assume this system has at least one solution. Then if 𝑌𝑖 = 𝑌 𝑗 with 𝑖 ≠ 𝑗 we must have
that 𝑦 𝑖 = 𝑦 𝑗 . Therefore all the 𝑌𝑖 are distinct. For each 𝑖 < 𝑠, dividing 𝑁 (𝑋 + 𝑌𝑖 ) = 𝑥𝑦 𝑖 by
𝑁 (𝑋 + 𝑌𝑠 ) = 𝑥𝑦 𝑠 gives
𝑋 + 𝑌𝑖 𝑦𝑖
𝑁 = , 𝑖 = 1, . . . , 𝑠 − 1.
𝑋 + 𝑌𝑠 𝑦𝑠
Dividing both sides by 𝑁 (𝑌𝑖 − 𝑌𝑠 ) gives
1 1 𝑦𝑖
𝑁 + = , 𝑖 = 1, . . . , 𝑠 − 1.
𝑋 + 𝑌𝑠 𝑌𝑖 − 𝑌𝑠 𝑁 (𝑌𝑖 − 𝑌𝑠 )𝑦 𝑠
Now apply Theorem 1.10.11 (same as in the proof of Proposition 1.10.10). We deduce
that there are at most (𝑠 − 1)! choices for 𝑋, and each such 𝑋 automatically determines
𝑥 = 𝑁 (𝑋 + 𝑌1 )/𝑦 1 . Thus, there are at most (𝑠 − 1)! solutions (𝑋, 𝑥). □
𝐶4 , 𝐶6 , 𝐶10 -free
Finally, let us turn to constructions of 𝐶2𝑘 -free graphs. We had mentioned in Section 1.6 that
ex(𝐶2𝑘 , 𝑛) = 𝑂 𝑘 (𝑛1+1/𝑘 ). We saw a matching lower bound construction for 4-cycles. Now
we give matching constructions for 6-cycles and 10-cycles. (It remains an open problem for
other cycle lengths.)
Theorem 1.10.15 (Tight lower bound for avoiding 𝐶2𝑘 for 𝑘 ∈ {2, 3, 5})
Let 𝑘 ∈ {2, 3, 5}. Then there is a constant 𝑐 > 0 such that for every 𝑛,
ex(𝑛, 𝐶2𝑘 ) ≥ 𝑐𝑛1+1/𝑘 .
Remark 1.10.16 (History). The existence of such 𝐶2𝑘 -free graphs for 𝑘 ∈ {3, 5} is due to
Benson (1966) and Singleton (1966). The construction given here is due to Wenger (1991),
with a simplified description due to Conlon (2021).
The following construction generalizes the point-line incidence graph construction earlier
for the 𝐶4 -free graph in Theorem 1.10.1. Here we consider a special set of lines in F𝑞𝑘 , whereas
previously for 𝐶4 we took all lines in F2𝑞 .
We have |L| = 𝑞 𝑘 , since to specify a line in L we can provide a point with first coordinate
equal to zero, along with a choice of 𝑡 ∈ F𝑞 giving the direction of the line. So the graph 𝐺 𝑞,𝑘
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
46 Forbidding a Subgraph
has 𝑛 = 2𝑞 𝑘 vertices. Since each line contains exactly 𝑞 points, there are exactly 𝑞 𝑘+1 ≍ 𝑛1+1/𝑘
edges in the graph. It remains to show that this graph is 𝐶2𝑘 -free whenever 𝑘 ∈ {2, 3, 5}.
Then Theorem 1.10.15 would follow after the usual trick of taking 𝑞 to be the largest prime
with 2𝑞 𝑘 < 𝑛.
Proposition 1.10.18
Let 𝑘 ∈ {2, 3, 5}. The graph 𝐺 𝑞,𝑘 from Construction 1.10.17 is 𝐶2𝑘 -free.
𝑝𝑘 𝑝3
ℓ𝑘−1 ... ℓ3
Then
𝑝 𝑖+1 − 𝑝 𝑖 = 𝑎 𝑖 (1, 𝑡𝑖 , . . . , 𝑡 𝑖𝑘−1 )
for some 𝑎 𝑖 ∈ F𝑞 \ {0}. Thus (recall that 𝑝 𝑘+1 = 𝑝 1 )
∑︁
𝑘 ∑︁
𝑘
𝑎 𝑖 (1, 𝑡𝑖 , . . . , 𝑡 𝑖𝑘−1 ) = ( 𝑝 𝑖+1 − 𝑝 𝑖 ) = 0. (1.3)
𝑖=1 𝑖=1
The vectors (1, 𝑡𝑖 , . . . , 𝑡 𝑖𝑘−1 ), 𝑖 = 1, . . . , 𝑘, after deleting duplicates, are linearly independent.
One way to see this is via the Vandermonde determinant
1 𝑥1 𝑥12 ··· 𝑥 1𝑘−1
1 𝑥2 𝑥22 ··· 𝑥 2𝑘−1 Ö
.. .. .. .. .. = (𝑥 𝑗 − 𝑥𝑖 ).
. . . . . 𝑖< 𝑗
1 𝑥𝑘 𝑥 2𝑘 ··· 𝑥 𝑘𝑘−1
For (1.3) to hold, each vector (1, 𝑡𝑖 , . . . , 𝑡 𝑖𝑘−1 ) must appear at least twice in the sum, with
their coefficients 𝑎 𝑖 adding up to zero.
Since the lines ℓ1 , . . . , ℓ𝑘 are distinct, for each 𝑖 = 1, . . . , 𝑘 (indices taken mod 𝑘), the
lines ℓ𝑖 and ℓ𝑖+1 cannot be parallel. So, 𝑡 𝑖 ≠ 𝑡𝑖+1 . When 𝑘 ∈ {2, 3, 5} it is impossible to select
𝑡1 , . . . , 𝑡 𝑘 with no equal consecutive terms (including wraparound) and so that each value is
repeated at least twice. Therefore the 2𝑘-cycle cannot exist. (Why does the argument fail for
𝐶8 -freeness?) □
The algebraic constructions in the previous section can be abstractly described as follows.
Take a graph whose vertices are points in some algebraic set (e.g., some finite field geometry),
with two vertices 𝑥 and 𝑦 being adjacent if some algebraic relationship such as 𝑓 (𝑥, 𝑦) = 0
is satisfied. Previously, this 𝑓 was carefully chosen by hand. The new idea is to take 𝑓 to be
a random polynomial.
We illustrate this technique by giving another proof of the tightness of the KST bound on
extremal numbers for 𝐾𝑠,𝑡 when 𝑡 is large compared to 𝑠.
The construction we present here has a worse dependence of 𝑡 on 𝑠 than in Theorem 1.10.7.
The main purpose of this section is to illustrate the technique of randomized algebraic
constructions. Bukh (2021) later gave a significant extension of this technique which shows
that ex(𝑛, 𝐾𝑠,𝑡 ) = Ω𝑠 (𝑛2−1/𝑠 ) for some 𝑡 close to 9𝑠 , improving on Theorem 1.10.7, which
required 𝑡 > (𝑠 − 1)!.
Proof idea. Take a random polynomial 𝑓 (𝑋1 , . . . , 𝑋𝑠 , 𝑌1 , . . . , 𝑌𝑠 ) symmetric in the 𝑋 and 𝑌
variables (i.e., 𝑓 (𝑋, 𝑌 ) = 𝑓 (𝑌 , 𝑋)), but otherwise uniformly chosen among all polynomials
with degree up to 𝑑 with coefficients in F𝑞 . Consider a graph with vertex set F𝑞𝑠 and where
𝑋 and 𝑌 are adjacent if 𝑓 (𝑋, 𝑌 ) = 0.
Given an 𝑠-vertex set 𝑈, let 𝑍𝑈 denote the set of common neighbors of 𝑈. It is an algebraic
set: the common zeros of the polynomials 𝑓 (𝑋, 𝑦), 𝑦 ∈ 𝑈. Due to the Lang–Weil bound
from algebraic geometry, 𝑍𝑈 is either bounded in size, |𝑍𝑈 | ≤ 𝐶 (the zero dimensional case),
or it must be quite large, say, |𝑍𝑈 | > 𝑞/2 (the positive dimensional case). This is unlike an
Erdős–Rényi random graph.
One can then deduce, using Markov’s inequality, that
𝑞 E[|𝑍𝑈 | 𝑘 ] 𝑂 𝑘 (1)
P(|𝑍𝑈 | > 𝐶) = P |𝑍𝑈 | > ≤ = ,
2 (𝑞/2) 𝑘 (𝑞/2) 𝑘
which is quite small (much smaller compared to an Erdős–Rényi random graph). So typically
very few sets 𝑈 have size > 𝐶. By deleting these bad 𝑈s from the vertex set of the graph, we
obtain a 𝐾𝑠,𝐶+1 -free graph with around 𝑞 𝑠 vertices and on the order of 𝑞 2𝑠−1 edges. ■
Now we begin the actual proof. Let 𝑞 be the largest prime power satisfying 𝑞 𝑠 ≤ 𝑛. Due
to prime gaps (Theorem 1.10.3), we have 𝑞 = (1 − 𝑜(1))𝑛1/𝑠 . So it suffices to construct a
𝐾𝑠,𝑡 -free graph on 𝑞 𝑠 vertices with (1/2 − 𝑜(1))𝑞 2𝑠−1 edges.
Let 𝑑 = 𝑠2 + 𝑠 (the reason for this choice will come up later). Let
𝑓 ∈ F𝑞 [𝑋1 , 𝑋2 , . . . , 𝑋𝑠 , 𝑌1 , 𝑌2 , . . . , 𝑌𝑠 ] ≤𝑑
be a polynomial chosen uniformly at random among all polynomials with degree at most
𝑑 in each of 𝑋 = (𝑋1 , 𝑋2 , . . . , 𝑋𝑠 ) and 𝑌 = (𝑌1 , 𝑌2 , . . . , 𝑌𝑠 ) and furthermore satisfying
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
48 Forbidding a Subgraph
where the coefficients 𝑎 𝑖1 ,...,𝑖𝑠 , 𝑗1 ,..., 𝑗𝑠 ∈ F𝑞 are chosen subject to 𝑎 𝑖1 ,...,𝑖𝑠 , 𝑗1 ,..., 𝑗𝑠 = 𝑎 𝑗1 ,..., 𝑗𝑠 ,𝑖1 ,...,𝑖𝑠
but otherwise independently and uniformly at random.
Let 𝐺 be the graph with vertex set F𝑞𝑠 , with distinct 𝑥, 𝑦 ∈ F𝑞𝑠 adjacent if and only if
𝑓 (𝑥, 𝑦) = 0.
Then 𝐺 is a random graph. The next two lemmas show that 𝐺 behaves in some ways like
a random graph with edges independently appearing with probability 1/𝑞. Indeed, the next
lemma shows that every pair of vertices form an edge with probability 1/𝑞.
Proof. Note that resampling the constant term of 𝑓 does not change its distribution. Thus,
𝑓 (𝑢, 𝑣) is uniformly distributed in F𝑞 for a fixed (𝑢, 𝑣). Hence 𝑓 (𝑢, 𝑣) takes each value with
probability 1/𝑞. □
More generally, we show below that the expected occurrence of small subgraphs mirrors
that of the usual random graph with independent edges. We write 𝑈2 for the set of unordered
pairs of element from 𝑈.
Proof. We first perform multivariate Lagrange interpolation to show that ( 𝑓 (𝑢, 𝑣)) {𝑢,𝑣 } can
take all possible values. For each pair 𝑢, 𝑣 ∈ 𝑊 with 𝑢 ≠ 𝑣, we can find some polynomial
ℓ𝑢,𝑣 ∈ F[𝑋1 , . . . , 𝑋𝑠 ] of degree 1 such that ℓ𝑢,𝑣 (𝑢) = 1 and ℓ𝑢,𝑣 (𝑣) = 0. For each 𝑢 ∈ 𝑊, let
Ö
𝑞 𝑢 (𝑋) = ℓ𝑢,𝑣 (𝑋) ∈ F[𝑋1 , . . . , 𝑋𝑠 ]
𝑣 ∈𝑊\{𝑢}
which has degree |𝑊 | − 1 ≤ 𝑑. It satisfies 𝑞 𝑢 (𝑢) = 1, and 𝑞 𝑢 (𝑣) = 0 for all 𝑣 ∈ 𝑊 \ {𝑢}.
Let
∑︁
𝑝(𝑋, 𝑌 ) = 𝑐 𝑢,𝑣 (𝑞 𝑢 (𝑋)𝑞 𝑣 (𝑌 ) + 𝑞 𝑣 (𝑋)𝑞 𝑢 (𝑌 ))
{𝑢,𝑣 } ∈ ( 𝑊
2 )
with 𝑐 𝑢,𝑣 ∈ F𝑞 . Note that 𝑝(𝑋, 𝑌 ) = 𝑝(𝑌 , 𝑋). Also, 𝑝(𝑢, 𝑣) = 𝑐 𝑢,𝑣 for all distinct 𝑢, 𝑣 ∈ 𝑊.
Now let each 𝑐 𝑢,𝑣 ∈ F𝑞 above be chosen independently and uniformly at random so that
𝑝(𝑋, 𝑌 ) is a random polynomial. Note that 𝑓 (𝑋, 𝑌 ) and 𝑝(𝑋, 𝑌 ) are independent random
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
polynomials both with degree at most 𝑑 in each of 𝑋 and 𝑌 . Since 𝑓 is chosen uniformly
( |𝑊 |)
at random, it has the same distribution as 𝑓 + 𝑝. Since ( 𝑝(𝑢, 𝑣))𝑢,𝑣 = (𝑐 𝑢,𝑣 )𝑢,𝑣 ∈ F𝑞 2 is
uniformly distributed, the same must be true for ( 𝑓 (𝑢, 𝑣))𝑢,𝑣 as well. □
Now fix 𝑈 ⊆ F𝑞𝑠 with |𝑈| = 𝑠. We want to show that it is rare for 𝑈 to have many common
neighbors. We will use the method of moments. Let
𝑍𝑈 = the set of common neighbors of 𝑈
= {𝑥 ∈ F𝑞𝑠 \ 𝑈 : 𝑓 (𝑥, 𝑢) = 0 for all 𝑢 ∈ 𝑈}.
Then using Lemma 1.11.3, for any 𝑘 ≤ 𝑠2 + 1,
h ∑︁ 𝑘i
E[|𝑍𝑈 | 𝑘 ] = E 1{𝑣 ∈ 𝑍𝑈 }
∑︁
𝑣 ∈F𝑞𝑠 \𝑈
= E[1{𝑣 (1) , . . . , 𝑣 (𝑘 ) ∈ 𝑍𝑈 }]
∑︁
𝑣 (1) ,...,𝑣 (𝑘) ∈F𝑞𝑠 \𝑈
𝑞 − |𝑈 | #{𝑣
(1)
,...,𝑣 (𝑘) }
= ,
𝑣 (1) ,...,𝑣 (𝑘) ∈F𝑞𝑠 \𝑈
with the final step due to Lemma 1.11.3 applied with 𝑊 = 𝑈 ∪ {𝑣 (1) , . . . , 𝑣 (𝑘 ) }, which has
cardinality ≤ |𝑈| + 𝑘 ≤ 𝑠 + 𝑠2 + 1 = 𝑑 + 1. Note that #{𝑣 (1) , . . . , 𝑣 (𝑘 ) } counts distinct elements
in the set. Thus, continuing the above calculation,
∑︁ 𝑞 𝑠 − |𝑈|
= 𝑞 −𝑟 𝑠 #{surjections [𝑘] → [𝑟]}
𝑟
∑︁
𝑟 ≤𝑘
= 𝑂 𝑘 (1).
Applying the preceding with 𝑘 = 𝑠2 + 1 and using Markov’s inequality, we get
2
𝑠 2 +1 𝑠 2 +1
E |𝑍𝑈 | 𝑠 +1 𝑂 𝑠 (1)
P(|𝑍𝑈 | ≥ 𝜆) = P(|𝑍𝑈 | ≥𝜆 )≤ 𝑠 2 +1
≤ 𝑠2 +1 . (1.4)
𝜆 𝜆
Remark 1.11.4. All the probabilistic arguments up to this point would be identical had we
used a random graph with independent edges appearing with probability 𝑝. In both settings,
|𝑍𝑈 | is a random variable with constant order expectation. However, their distributions are
extremely different, as we will soon see. For a random graph with independent edges, |𝑍𝑈 |
behaves like a Poisson random variable, and consequently, for any constant 𝑡, P(|𝑍𝑈 | ≥ 𝑡)
is bounded from below by a constant. Consequently, many 𝑠-element sets of vertices are
expected to have at least 𝑡 common neighbors, and so this method will not work. However,
this is not the case with the random algebraic construction. It is impossible for |𝑍𝑈 | to take
on certain ranges of values. If |𝑍𝑈 | is somewhat large, then it must be very large.
Note that 𝑍𝑈 is defined by 𝑠 polynomial equations. The next result tells us that the number
of points on such an algebraic variety must be either bounded or at least around 𝑞.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
50 Forbidding a Subgraph
The lemma can be deduced from the following important result from algebraic geometry
due to Lang and Weil (1954), which says that the number of points of an 𝑟-dimensional
algebraic variety in F𝑞𝑠 is roughly 𝑞 𝑟 , as long as certain irreducibility hypotheses are satisfied.
We include here the statement of the Lang–Weil bound. Here F𝑞 denotes the algebraic closure
of F𝑞 .
The two cases in Lemma 1.11.5 then correspond to the zero-dimensional case and the
positive-dimensional case, though some care is needed to deal with what happens if the
variety is reducible in the field closure. We refer the reader to Bukh (2015) for details on how
to deduce Lemma 1.11.5 from the Lang–Weil bound.
Now, continuing our proof of Theorem 1.11.1. Recall 𝑍𝑈 = {𝑥 ∈ F𝑞𝑠 \ 𝑈 : 𝑓 (𝑥, 𝑢) =
0 for all 𝑢 ∈ 𝑈}. Apply Lemma 1.11.5 to the polynomials 𝑓 (𝑋, 𝑢), 𝑢 ∈ 𝑈. Then for large
enough 𝑞 there exists a constant 𝐶 from Lemma 1.11.5 such that either |𝑍𝑈 | ≤ 𝐶 (bounded)
or |𝑍𝑈 | > 𝑞/2 (very large). Thus, by (1.4),
𝑞 𝑂 𝑠 (1)
P(|𝑍𝑈 | > 𝐶) = P |𝑍𝑈 | > ≤ .
2 (𝑞/2) 𝑠2 +1
So the expected number of 𝑠-element subset 𝑈 with |𝑍𝑈 | > 𝐶 is
𝑠
𝑞 𝑂 𝑠 (1)
≤ = 𝑂 𝑠 (1/𝑞).
𝑠 (𝑞/2) 𝑠2 +1
Remove from 𝐺 a vertex from every 𝑠-element 𝑈 with |𝑍𝑈 | > 𝐶. Then the resulting graph is
𝐾𝑠, ⌈𝐶 ⌉+1 -free. Since we remove at most 𝑞 𝑠 edges for each deleted vertex, the expected number
of remaining edges is at least
1 𝑞𝑠 1
− 𝑂 𝑠 (𝑞 𝑠−1 ) = − 𝑜(1) 𝑞 2𝑠−1 .
𝑞 2 2
Finally, given 𝑛, we can take the largest prime 𝑞 satisfying 𝑞 𝑠 ≤ 𝑛 to finish the proof of
Theorem 1.11.1.
MIT OCW: Graph Theory and Additive Combinatorics --- Yufei Zhao
Further Reading
Graph theory is a huge subject. There are many important topics that are quite far from
the main theme of this book. For a standard introduction to the subject (especially on more
classical aspects), several excellent graph theory textbooks are available: Bollobás (1998),
Bondy and Murty (2008), Diestel (2017), West (1996). The three-volume Combinatorial
Optimization by Schrijver (2003) is also an excellent reference for graph theory, with a focus
on combinatorial algorithms.
The following surveys discuss in more depth various topics encountered in this chapter:
• The History of Degenerate (Bipartite) Extremal Graph problems by Füredi and Si-
monovits (2013);
• Hypergraph Turán Problems by Keevash (2011);
• Dependent Random Choice by Fox and Sudakov (2011).
Chapter Summary
• Turán number ex(𝑛, 𝐻) = the maximum number of edges in an 𝑛-vertex 𝐻-free graph.
• Turán’s theorem. Among all 𝑛-vertex 𝐾𝑟+1 -free graphs, the Turán graph 𝑇𝑛,𝑟 (a complete
𝑟-partite graph with nearly equal sized parts) uniquely maximizes the number of edges.
• Erdős–Stone–Simonovits Theorem. For any fixed graph 𝐻,
2
1 𝑛
ex(𝑛, 𝐻) = 1 − + 𝑜(1) .
𝜒(𝐻) − 1 2
• Supersaturation (from one copy to many copies): an 𝑛-vertex graph with ≥ ex(𝑛, 𝐻)+𝜀𝑛2
edges has ≥ 𝛿𝑛 𝑣 (𝐻 ) copies of 𝐻, for some constant 𝛿 > 0 only depending on 𝜀 > 0, and
provided that 𝑛 is sufficiently large.
• Kővári–Sós–Turán theorem. For fixed 𝑠 ≤ 𝑡,
ex(𝑛, 𝐾 𝑠,𝑡 ) = 𝑂 𝑠,𝑡 (𝑛2−1/𝑠 ).
– Tight for 𝐾2,2 , 𝐾3,3 , and more generally, for 𝐾 𝑠,𝑡 with 𝑡 much larger than 𝑠 (algebraic
constructions).
– Conjectured to be tight in general.
• Even cycles. For any integer 𝑘 ≥ 2,
For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms.