CALT-TH-2022-042
arXiv:2212.06187v2 [hep-th] 5 Mar 2023
Lectures on the string landscape
and the Swampland
Nathan Benjamin Agmon1 , Alek Bedroya1 , Monica Jinwoo Kang2 , and
Cumrun Vafa1
1
2
Physics Department, Harvard University, Cambridge, MA 02138, U.S.A.
Walter Burke Institute for Theoretical Physics, California Institute of Technology
Pasadena, CA 91125, U.S.A.
nagmon@g.harvard.edu, abedroya@g.harvard.edu, monica@caltech.edu, vafa@g.harvard.edu
Abstract
We provide an overview of the string landscape and the Swampland program. Our review
of the string landscape covers the worldsheet and spacetime perspectives, including vacua
and string dualities. We then review and motivate the Swampland program from the lessons
learned from the string landscape. These lecture notes are aimed to be self-contained and
thus can serve as a starting point for researchers interested in exploring these ideas.
These notes are an expanded version of two courses The String Landscape and the Swampland
taught by C. Vafa at Harvard University in 2018 with a focus on the landscape, written by
M. J. Kang with additional material from N. B. Agmon, and in 2022 with a focus on the
Swampland, written by A. Bedroya.
Contents
Introduction
4
I
5
The string landscape
1 Bosonic string theory
1.1 Conventions . . . . . . . . . . . . . . . . . . . . . .
1.2 Freely propagating strings . . . . . . . . . . . . . .
1.3 Basics of conformal field theory . . . . . . . . . . .
1.4 Worldsheet gauge anomalies and the critical string .
1.5 BRST formalism . . . . . . . . . . . . . . . . . . .
1.6 Quantum string . . . . . . . . . . . . . . . . . . . .
1.7 String perturbation theory . . . . . . . . . . . . . .
1.8 Tree-level scattering . . . . . . . . . . . . . . . . . .
1.9 One-loop scattering . . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
5
5
6
8
11
12
14
19
23
25
.
.
.
.
31
31
38
40
46
.
.
.
.
.
.
.
.
48
48
50
53
54
55
59
64
67
4 String dualities
4.1 Supergravities in d ≥ 10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4.2 T-duality for superstring theories . . . . . . . . . . . . . . . . . . . . . . . .
4.3 Branes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
72
72
73
75
2 Bosonic string compactifications
2.1 Geometric CFTs . . . . . . . . . .
2.2 WZW models and current algebras
2.3 Orbifolds . . . . . . . . . . . . . . .
2.4 Noncritical string theory . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
3 Superstring theory
3.1 Basics of the NSR formalism . . . . . . . . .
3.2 Modular invariance and the GSO projection
3.3 Green-Schwarz superstring . . . . . . . . . .
3.4 Examples of superstring compactifications .
3.5 The type I string . . . . . . . . . . . . . . .
3.6 The heterotic string . . . . . . . . . . . . . .
3.7 Superstring compactifications . . . . . . . .
3.8 Model building . . . . . . . . . . . . . . . .
1
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
4.4
4.5
4.6
4.7
M-theory . . . . . . . . . . . . . . . . . . . . . .
Completing the web of dualities for NSU SY = 16
F-theory . . . . . . . . . . . . . . . . . . . . . .
More dualities in lower dimensions . . . . . . .
5 Complex geometry
5.1 Preliminary definitions . . . . . . . . . . . . .
5.2 Examples of Calabai–Yau manifolds . . . . . .
5.3 Calabi–Yau manifolds from complex projective
5.4 Singularities of K3 surfaces . . . . . . . . . . .
5.5 Singularities of Calabi–Yau threefolds . . . . .
5.6 Toric geometry . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
. . . .
. . . .
spaces
. . . .
. . . .
. . . .
6 Sigma models
6.1 Supersymmetric sigma models and mirror symmetry .
6.2 Supersymmetric minimal models . . . . . . . . . . . .
6.3 Mirror symmetry in minimal models . . . . . . . . .
6.4 Calabi–Yau SCFT from minimal models . . . . . . .
6.5 Minimal models and Calabi–Yau σ-models . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
80
88
92
96
.
.
.
.
.
.
99
99
101
104
106
109
111
.
.
.
.
.
116
116
118
122
122
124
7 Black Holes and holography
127
7.1 Black holes in string theory . . . . . . . . . . . . . . . . . . . . . . . . . . . 127
7.2 Holography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
II
The Swampland program
135
1 Introduction to Swampland program
1.1 Basic features of quantum field theories . . . . . . .
1.2 Quantum gravity vs quantum field theory . . . . .
1.3 Why is gravity special? . . . . . . . . . . . . . . . .
1.4 Problems of treating gravitational theories as EFTs
1.5 The Swampland program . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
135
135
136
138
141
146
2 Swampland I: No global symmetry conjecture
149
2.1 No global symmetry: black hole argument . . . . . . . . . . . . . . . . . . . 149
2.2 What is a global symmetry? . . . . . . . . . . . . . . . . . . . . . . . . . . . 151
2.3 Non-invertible symmetries . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154
2
2.4
2.5
2.6
2.7
2.8
2.9
What is a gauge symmetry? . . . . . . . . . .
Non-compact spaces and boundary symmetries
No global symmetry: holographic argument .
Symmetries in string theory . . . . . . . . . .
Cobordism conjecture . . . . . . . . . . . . . .
Baby universe hypothesis . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
3 Swampland II: Completeness of spectrum
3.1 Completeness hypothesis . . . . . . . . . . . . . .
3.2 Evidence in string theory . . . . . . . . . . . . . .
3.3 Completeness of spectrum for discrete symmetries
3.4 Completeness of spectrum: arguments . . . . . .
4 Swampland III: Weak gravity conjecture
4.1 Evidence from string theory . . . . . . .
4.2 Weak gravity conjecture: formulation . .
4.3 Motivation . . . . . . . . . . . . . . . . .
4.4 Festina lente . . . . . . . . . . . . . . . .
4.5 Applications . . . . . . . . . . . . . . . .
5 Swampland IV: Distance conjectures
5.1 Introduction . . . . . . . . . . . . . . .
5.2 Moduli space . . . . . . . . . . . . . .
5.3 Supergravities . . . . . . . . . . . . . .
5.4 Dualities and infinite distances limits .
5.5 Universal properties of infinite distance
5.6 AdS and CFT distance conjectures . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. . . .
. . . .
. . . .
. . . .
limits
. . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
156
158
159
160
162
168
.
.
.
.
170
171
171
173
175
.
.
.
.
.
177
179
181
182
184
185
.
.
.
.
.
.
186
186
186
187
189
191
199
6 Swampland V: de Sitter conjectures
202
6.1 (Non)-supersymmetry and (in)stability . . . . . . . . . . . . . . . . . . . . . 204
6.2 de Sitter and tree-level string theory . . . . . . . . . . . . . . . . . . . . . . 212
6.3 de Sitter conjectures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 214
7 Swampland VI: Finiteness and string lamppost
7.1 String lamppost and finiteness principles . . . .
7.2 Finiteness principle in AdS . . . . . . . . . . . .
7.3 String lamppost principle from brane probes . .
7.4 Finiteness of light species . . . . . . . . . . . . .
3
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
223
223
228
231
236
Introduction
After more than five decades of research, string theory has emerged as the most promising
candidate for describing the connection between the observed universe and a theory of
quantum gravity. It offers our deepest understanding of how quantum theory of gravity
works. We have learned how to construct large classes of vacua in various string theories,
including solutions which contain standard model matter content, such as non-abelian gauge
fields, chiral fermions, and multiple generations of matter fields. We have also discovered
that string dualities lead not only to connections between various string theories, but also to
the discovery of new quantum systems decoupled from gravity, in up to six dimensions.
The remarkable success of string theory may have led to the misunderstanding that any
quantum field theory that appears consistent without gravity can be coupled to quantum
gravity with string theory serving as the bridge. This expectation cannot be further from the
truth. As we currently understand, only a few special quantum field theories emerge as the
low energy limits of string theory. Essentially no generic quantum field theories can emerge,
and only very special ones do! That may sound unnatural from the viewpoint of effective
field theory but that is the lesson string theory is teaching us.
On the other hand, particle phenomenology and cosmology face a crisis of naturalness.
Parameters and choices of theories needed to explain our universe seem highly fine-tuned
and unnatural. It is not difficult to imagine that the reason they look fine-tuned is because
the consistency of quantum gravity, as string theory solutions offer, is not incorporated into
the notion of naturalness. With a correct prior, namely being able to couple the QFT to
gravity, the naturalness criteria changes dramatically enough to make QFTs that describe
our universe not as fine-tuned as they appear.
The Swampland program aims to delineate conditions on effective field theories which
distinguish the “good ones” (those that can couple to gravity consistently) from the “bad
ones”. The aim of the courses serving as the basis for these lecture notes was to introduce
this topic to students interested in doing research in this direction.
These lecture notes contain two basic parts. The first part includes several chapters
dealing with an overview of string theory, with a focus on the landscape from both the
worldsheet and spacetime perspectives. It is rather brief, but we try to be self-contained.
The topics we cover serve as useful background for ideas in the Swampland program. The
second part of these notes provides motivation for the program and explains its guiding
principles.
4
Part I
The string landscape
1
Bosonic string theory
We begin with a lightning review of perturbative bosonic string theory. String theory is a
theory of 1 + 1 dimensional relativistic fundamental objects that propagate in some target
space with an action defined on the worldsheet traced out by the strings.
1.1
Conventions
Before diving into the details, it is useful to set some of the conventions first. Our discussion
closely follows the modern textbook route [1–4] with many of the conventions of [1, 2]. We use
(σ 1 , σ 2 ) to parameterize the the string worldsheet Σ in Euclidean signature, where σ 2 plays
the conventional role of Euclidean time. We often extend the domain of physical operators on
the worldsheet by analytically extending them to complex values of σ 1 and σ 2 . A particularly
useful coordinate system for the resulting C × C space is
z = σ 1 + iσ 2 ,
z̄ = σ 1 − iσ 2 .
(I.1.1)
Note that z and z̄ are not necessarily complex conjugates1 . We can set the signature of the
theory by restricting to the appropriate 2d subspace of C × C. For example, the z ∗ = z̄
subspace corresponds to the Euclidean parametrization of the worldsheet, while the z, z̄ ∈ R
subspace corresponds to Lorentzian signature.
Since the functions are meromorphic in z and z̄, complex derivatives ∂z and ∂z̄ are well
defined. We refer to them respectively as holomorphic and anti-holomorphic derivatives.
When there is no possibility for confusion, we drop the subscripts and denote the associated
derivatives by
∂≡
∂
∂z
∂
and ∂¯ ≡
.
∂ z̄
(I.1.2)
¯ = 0. Holomorphic functions (∂f
¯ (z, z̄) = 0) are
They satisfy ∂z = ∂¯z̄ = 1 and ∂ z̄ = ∂z
written as f (z) whereas anti-holomorphic functions (∂f (z, z̄) = 0) are denotes by f (z̄).
The measure d2 z = dz ∧ dz̄ satisfies d2 z = 2dσ 1 dσ 2 , with the Jacobian factor included.
1
We denote the complex conjugate by z ∗ .
5
The integral over a closed contour in the complex plane is taken to satisfy
˛
1
= 2πi.
z
(I.1.3)
For convenience, we work in units where the string length ℓs = 1, i.e. where the string
tension is
T =
1.2
1
.
2π
(I.1.4)
Freely propagating strings
At its core, perturbative string theory is a theory of fundamental 1 + 1-dimensional objects
moving in a target space. If we view the amplitude as a function of the worldsheet of
the string, string theory is a 2d field theory where the amplitude of a given worldsheet
configuration is given by eiS where S is the action of the corresponding two dimensional field
theory.
M ∼ exp(iSworldsheet )
Figure I.1.1: A 2 → 2 scattering event of strings represented by a single string worldsheet.
The action on the worldsheet determines the scattering amplitude M.
Recall that the action for a relativistic particle of mass m is defined to be proportional
to the proper length of its worldline γ, i.e.
ˆ
p
S = −m dτ ∂τ X µ · ∂τ Xµ .
(I.1.5)
γ
The same line of reasoning suggests a natural candidate for strings, namely introducing an
action proportional to the area of string worldsheet. This action is known as the Nambu–
Goto action. For a string that traces X µ in a Minkowski background, the Nambu–Goto
6
action reads [1]
SN G = −T
ˆ
Σ
d2 σ
q
det (∂a X µ · ∂b Xµ )
(I.1.6)
where T = 1/(2π) is the string tension, i.e. mass per unit length, and Σ is the worldsheet
of the string. A priori, this action is difficult to quantize due to the presence of the square
root. By introducing a dynamical worldsheet metric gab , we can instead consider the simpler
Polyakov action [5–7],
ˆ
T
√
SP = −
(I.1.7)
d2 σ ggab ∂ a X µ · ∂ b Xµ .
2 Σ
Note that the metric gab is an auxiliary variable and we can solve for it up to an overall
scaling from the equation of motion. It is straightforward to verify that the two theories lead
to the same equations of motion for X µ and are therefore classically equivalent. The classical
theory of (I.1.7) enjoys several internal symmetries. The worldsheet theory is invariant under
global target space translations and Lorentz transformations,
X ′µ (σ) = Λµ ν X ν (σ) + v µ ,
Λ ∈ SO(d − 1, 1).
(I.1.8)
Locally, this two-dimensional field theory is also invariant under several gauge symmetries,
including diffeomorphisms (i.e. reparametrizations),
X ′µ (σ ′ ) = X µ (σ),
′
gab
(σ ′ ) =
∂σ c ∂σ d
gcd (σ),
∂σ ′a ∂σ ′b
(I.1.9)
with σ ′a = f a (σ b ), and Weyl transformations (i.e. local rescalings),
′
gab
(σ) = e2φ(σ) gab (σ).
(I.1.10)
The reason the above symmetries must be gauge symmetries and not ordinary symmetries is
to ensure that the metric g is a fictitious field and the Nambu–Gotu and Polyakov theories
are equivalent.
We perform a Wick rotation of the worldsheet theory to work in the Euclidean signature.
The Wick rotation will remove the minus sign in front of the Lorentzian action. The quantized
2d Polyakov theory is then defined in terms of a path integral over all field configurations as
ˆ
DXDg −SP [X,g]
Z=
e
,
(I.1.11)
VDiff×Weyl
where we have divided by the volume of the gauge group to render the expression finite. The
7
expression in (I.1.11) includes an implicit sum over inequivalent topologies (i.e. manifolds
which are not connected through the action of the gauge group). To make progress in the
quantum theory, we utilize the standard Fadeev-Popov gauge-fixing procedure [8], which
localizes the path integral to a single gauge slice at the cost of introducing a set of anticommuting b,c ghost fields. There is a single c ghost for every gauge parameter, and one
b ghost for every gauge-fixing condition. A particularly convenient choice of gauge-fixing
condition is the conformal gauge 2
gab = δab .
(I.1.12)
An infinitesimal gauge transformations is specified by three worldsheet functions: infinitesimal
coordinate transformations δσ 1 (σ), δσ 2 (σ) as well as the Weyl rescaling function φ(σ).
Therefore, we expect to have three ghost c fields in total. Moreover, every pair of worldsheet
indices a, b corresponds to a gauge-fixing condition gab = δab and should lead to a ghost field
bab . As the gauge-fixing conditions are symmetric in a and b, bab must also be symmetric.
It turns out the ghost field associated with Weyl transformations is easy to integrate out
since it only appears as a quadratic term in the action. Doing so imposes a tracelessness
condition on bab . Thus, we end up having two ghost fields ca corresponding to the coordinates
reparameterizations δσ a , a traceless symmetric tensor ghost field bab , and d free massless
scalars. The action for the massless scalars is
ˆ
T
SX =
d2 σ∂a X µ ∂a Xµ ,
(I.1.13)
2 Σ
while the ghosts ca , bab are governed by the action
ˆ
1
d2 σbab ∂ a cb .
Sgh =
2π Σ
The path integral of the gauge-fixed theory thus reduces to
ˆ
Z = DXDbDc e−SX −Sgh .
1.3
(I.1.14)
(I.1.15)
Basics of conformal field theory
It turns out that our choice of conformal gauge does not completely eliminate all of the gauge
redundancy. There are still residual gauge transformations corresponding to combinations
2
Although it is always possible to choose gab = δab locally in each coordinate patch of Σ, there can be
global obstructions to this gauge choice. We postpone the study of these global issues to the discussion of
moduli spaces of Riemann surfaces.
8
of diffeomorphisms and Weyl transformations that leave the metric invariant, referred to
collectively as the group of conformal transformations. In the conventional definition of
conformal theories in flat space, a diffeomorphism is accompanied by a rescaling of other
fields rather than the metric. It is straightforward to see these two definitions are equivalent
and the rescaling of metric could be absorbed in the rescaling of the fields. For example,
in the bosonic string action (I.1.13)+(I.1.14), a rescaling of the metric under the residual
conformal transformation
gab → e2ω(σ) gab
(I.1.16)
could be replaced with a transformation of the fields given by
(X, bab , ca ) → (X, bab e2ω(σ) , ca e−ω(σ) )
(I.1.17)
As a result, we can view the worldsheet theory as a theory in 2d flat space which has
conformal symmetry [9, 10]. However, it is important to remember that the conformal
symmetry is not a global symmetry, but a gauge symmetry of the theory! This is an important
distinction. There is no a priori reason why the conformal symmetry of a classical theory
must be preserved at the quantum level. However, in the case of string worldsheet, this is a
requirement for the mathematical consistency of the theory.
Since the dynamics of the strings are captured by a conformal theory, it is useful to briefly
review the basic properties of CFTs in two dimensions.3 . Every local QFT by definition
has a conserved operator known as the stress-energy tensor Tab . For theories which admit
a Lagrangian description, the stress tensor can be defined unambiguously by minimally
coupling the theory to gravity4
Tab = −4πg −1/2
δS
,
δg ab
(I.1.18)
where the overall coefficient is a matter of convention. In scale invariant theories the trace of
the energy momentum tensor vanishes on shell [11]. In conformal field theories, one can show
that the energy momentum tensor can be improved by adding a total derivative ∂c ∂d N abcd ,
such that the trace vanishes off shell as well. To ensure that T ab remains symmetric and
conserved, N needs to be symmetric under a ↔ b and c ↔ d and antisymmetric under
{a or b} ↔ {c or d}. In fact, such an improvement of the energy momentum tensor can
be achieved by adding a boundary term to the action [11]. The boundary term ensures the
action is Weyl invariant. If we work with this action, then Taa must vanish identically, which
3
Most of the essential CFT features in string theory are covered or at least mentioned in [1, 2]. For a
more complete reference, see the yellow book [10].
4
Minimal coupling refers to the gauging of the local Lorentz symmetry SO(1, d − 1).
9
in complex coordinates reads
Tzz̄ = 0.
(I.1.19)
Similarly, the conservation equation can be written as
¯ zz = ∂Tz̄z̄ = 0,
∂T
(I.1.20)
which implies that the diagonal components T ≡ Tzz (z) and T̄ = Tz̄z̄ (z̄) are holomorphic
and anti-holomorphic, respectively. It follows that both admit Laurent expansions around
the origin
X
X
T (z) =
Ln z −n−2 , T̄ (z̄) =
L̄n z̄ −n−2 ,
(I.1.21)
n
n
where the overall factor z −2 (z̄ −2 ) is chosen to agree with its scaling dimension ∆ = 2. We
know that the energy momentum tensor is the generator of coordinate transformations, and
conformal transformations are a special type of coordinate transformation in flat space. Thus,
we should expect the generators of conformal transformations can be expressed in terms of
T (z) and T̄ (z̄). In fact, Ln is the generator of (z, z̄) → (z + ǫz n , z̄) and L̄n is the generator of
(z, z̄) → (z, z̄ + ǫz̄ n ).5 These generators satisfy the following classical commutation relations,
[Lm , Ln ] = (m − n)Lm+n ,
(I.1.22)
and are known as the Witt algebra. After quantizing the theory, the algebra acquires a
central extension c. The centrally extended algebra, known as the Virasoro algebra Virc ,
takes the following form [12]
[Lm , Ln ] = (m − n)Lm+n +
c
(m3 − m)δm+n,0 .
12
(I.1.23)
Similarly, the anti-holomorphic modes L̄n generate an independent copy Virc̄ with central
charge c̄ not necessarily equal to c. The central charges c and c̄ represent the breaking of the
conformal symmetry at the quantum level.
All CFT states on the cylinder, which are dual to local operators at the origin of C via
the state-operator correspondence, can written in terms of states with definite weights (h, h̄)
under L0 , L̄0 . For unitary theories, these can be further decomposed into linear combinations
of primary and descendant states. A primary |ψi satisfies
Ln |ψi = L̄n |ψi = 0,
5
n > 0,
(I.1.24)
Conformal transformations in two-dimensions generically take the form of holomorphic functions f (z).
10
while a descendant takes the general form
|ψi′ = L−n1 · · · L−nk |ψi ,
n 1 , . . . nk > 0
(I.1.25)
Analogously, a primary operator O of weight (h, h̄) obeys the operator product expansion
(OPE)
T (z)O(0) =
1
h
O(0) + ∂O(0) + non-singular,
2
z
z
(I.1.26)
and similarly for T̄ O. In particular, higher-order singularities are absent. This implies that
under a conformal transformation, O transforms as
O(z ′ , z̄ ′ ) = (∂z ′ )−h (∂¯z¯′ )−h̄ O(z, z̄).
(I.1.27)
where z ′ = f (z) and z̄ ′ = f¯(z̄) are the associated conformal maps.
It is easy to verify this definition is equivalent to (I.1.24) via (I.1.23). The stress tensor
is not a primary, but nonetheless satisfies the OPE:
T (z)T (0) =
2
1
c
+ 2 T (0) + ∂T (0) + non-singular,
4
2z
z
z
(I.1.28)
which through (I.1.24) leads to the aforementioned Virasoro algebra (I.1.23).
1.4
Worldsheet gauge anomalies and the critical string
We now return to the gauge-fixed worldsheet theory, whose action in complex coordinates
takes the form
ˆ
1
¯ µ + b∂c
¯ + b̄∂c̄ ,
S=
d2 z ∂X µ ∂X
(I.1.29)
2π
where c = cz (z), b = bzz (z), c̄ = cz̄ (z̄), b̄ = bz̄z̄ (z̄). The energy-momentum tensor for the
matter fields X µ is6
¯ µ ∂X
¯ µ,
T̄m = −∂X
Tm = −∂X µ ∂Xµ ,
(I.1.30)
with central charge c = c̄ = d, where µ = 1, . . . , d. This agrees with the notion that the
central charge is a rough measure of the number of degrees of freedom in a CFT. The ghost
6
All operator products are assumed to be normal ordered with respect to some procedure, such as creationannihilation ordering (with creation operators on the left and annihilation operators on the right) or conformal
normal ordering.
11
theory admits an independent stress tensor of the form
Tgh = 2(∂c)b + c∂b,
¯ b̄ + c̄∂¯b̄
T̄gh = 2(∂c̄)
(I.1.31)
with central charge c = c̄ = −26. Therefore, the central charge of the combined matter+ghost
CFT is
ctot = d − 26.
(I.1.32)
Recall that before gauge-fixing, the classical worldsheet theory possessed a [Diff × Weyl]
gauge symmetry. The quantum theory, as defined by the Polyakov path integral, can suffer
from both global and gauge anomalies. Indeed, on a general curved worldsheet, bosonic
string theory suffers from a Weyl anomaly, which manifests itself through the non-vanishing
of the trace of the stress tensor7
Tzz̄ = −
c
R.
24
(I.1.33)
Here, R is the 2d Ricci scalar on the worldsheet, which completely captures the geometry of
a 2d manifold. To preserve the full [Diff × Weyl] gauge symmetry and consistently quantize
the theory, we must therefore take the number of scalar fields to be equal to 26. This in turn
implies that the dimensionality of spacetime is [13]
d = 26.
(I.1.34)
The resulting Weyl-invariant theory is known as critical bosonic string theory.
To summarize, the only dimension d where the d-dimensional Euclidean space could be
the target space of a non anomalous 2d CFT is d = 26.
1.5
BRST formalism
As we discussed in subsection 1.3, the conformal gauge choice gab = δab does not completely
eliminate the gauge redundancy. As a consequence, the naive Hilbert space of the theory
over-counts the physical states. The BRST formalism is a systematic method that allows us
to restrict to a faithful subspace of the Hilbert space where every physical state is represented
exactly once. It is based on introducing a new global fermionic symmetry known as the BRST
symmetry [14, 15].8 For the combined matter+ghost CFT, the current jBa associated with
7
Note that any CFT with c 6= c̄ necessarily has a gravitational (diffeomorphism) anomaly, and so we will
usually assume c = c̄ unless explicitly stated.
8
BRST refers to Becchi, Rouet, Stora, and Tyutin.
12
this symmetry is [16]
1
j̄B = c̄ · T̄m + c̄ · T̄gh ,
2
1
jB = c · Tm + c · Tgh ,
2
(I.1.35)
where jB ≡ jBz and j̄B ≡ jBz̄ . Its components are are (anti-)holomorphic like the stress tensor,
which implies jBa is conserved
¯ B + ∂ j̄B = 0.
∂a jBa = ∂j
(I.1.36)
The associated topological charge operator acting on a local operator O(0) is given by
˛
˛
¯ j̄B (z̄),
(I.1.37)
QB = dzjB (z) + dz
0
0
where the contour necessarily surrounds the origin. The action of the BRST symmetry can
be thought of as a gauge transformation with the gauge parameters replaced with c ghosts.
Note that the BRST symmetry acts on the b ghost as
QB · b(z) = T (z).
(I.1.38)
A key property of the BRST formalism is that QB is nilpotent, i.e.
Q2B = 0,
(I.1.39)
and so admits a cohomology of states. By gauge-fixing the worldsheet theory, we have
enlarged the original space of states for the matter CFT to include additional states generated
by free field oscillators built from the ghost fields. Naively, one might think that all such
states contribute to on-shell scattering processes in spacetime; as it turns out, many of these
states lead to equivalent spacetime physics. To account for this redundancy, we must restrict
the set of physical states to be cohomology classes of QB . That is, a physical state is the
coset of QB -closed states modulo QB -exact states:9
QB |Ψi = 0,
|Ψi ≃ |Ψi + QB |Λi ,
(I.1.40)
where |Λi is an arbitrary state. The statement that QB = 0 on physical states can be seen
as reformulation of the fact that all physical observables must be gauge-invariant.
9
To obtain a sensible unitary¸ S-matrix, we must¸ also supply an extra constraint on physical states:
b0 |Ψi = b̄0 |Ψi = 0, where b0 = dzzb(z) and b̄0 = dz̄ z̄ b̄(z̄). In the string field theory terminology, this
constraint is known as Siegel gauge.
13
1.6
Quantum string
We now turn to analyzing the spectrum of physical states of the (free) closed bosonic string.
In conformal gauge, we found that such states are in one-to-one correspondence with BRST
cohomology classes. Using (I.1.38), it is clear that all physical states satisfy L0 = L̄0 = 0.
In principle, we could use this to derive the masses of closed string excitations by choosing
a particular BRST representative for each cohomology class (for a detailed analysis, see
[1]) and extracting the oscillators and their algebras from the matter and ghost fields. A
faster approach to obtain the spectrum, which easily generalizes to more complicated string
theories, is to abandon conformal gauge and completely fix the gauge. A convenient gauge
choice that has this property is the so-called light-cone gauge.
Light-cone quantization
Light-cone (LC) gauge quantization is a particular approach to the quantization of sigma
models whose target space admits a light-like Killing vector. For string theory, the key is to
gauge-fix Diff×Weyl such that there is a map from a worldsheet light-like Killing vector to
one in the target spacetime. This removes the longitudinal degrees of freedom. As a bonus,
the associated bc ghosts completely decouple and can be neglected in the quantum theory.
The price we must pay for these simplifications is the loss of manifest Lorentz invariance at
the level of the classical worldsheet theory. Furthermore, we must also verify that it remains
a symmetry in the quantum theory.
Working with a worldsheet in Lorentzian signature, we define null spacetime and worldsheet
coordinates as,
X0 ± X1
σ0 ± σ1
√
X ± :=
, σ ± := √
.
(I.1.41)
2
2
Recall that taking the worldsheet metric flat, gab = ηab , does not completely fix the gauge
redundancy. The remaining gauge degrees of freedom can be eliminated by choosing the
light-cone gauge,
X + = x+ + α ′ p+ σ + , σ ± = σ 0 ± σ 1 ,
(I.1.42)
for some constants x+ and p+ which ties σ + to X + . We can set x+ to zero by reparametrizing
X + → X + − x+ . Note that X + is no longer a dynamical field. Naively this gauge-fixing
condition would appear to leave a free field theory with d − 1 = 25 free bosons. However,
this is not the case since the equation for the metric forces us to set
Tab = 0
(I.1.43)
as a constraint which must be imposed by hand. This set of constraints are collectively
14
referred to as Virasoro constraints. In light-cone gauge, they take the form
T++ = ∂+ X + · ∂+ X − − ∂+ X i · ∂+ X i = 0,
i = 1, · · · , 24,
(I.1.44)
with a similar expression for T−− . Together with (I.1.42) we then have
∂X − =
1
∂X i · ∂X i ,
+
p
(I.1.45)
and so the nonzero oscillatory modes of X − also decouple. Thus, by choosing light-cone
gauge we have explicitly removed the longitudinal degrees of freedom; only the 24 transverse
fields X i remain. The quantum theory of the free string in lightcone gauge can be defined by
canonical quantization. We can expand the transverse fields in terms of their Fourier modes
on the cylinder, i.e.
X i (σ) = xi +
pi 0
i X 1 i inσ−
i −inσ +
√
.
α
e
+
ᾱ
e
σ
+
n
n
p+
2 n6=0 n
The Hamiltonian H, which can now be identified with p− , is given by
!
∞
X
1
pi pi
i
i
α−n
αni + ᾱ−n
ᾱni + A + Ā ,
H= ++ +
2p
2p
n=1
(I.1.46)
(I.1.47)
where A and Ā are constants to be computed from the usual ordering ambiguity once we
quantize the theory. We can proceed by imposing equal-time commutation relations on the
X i and their conjugate momenta, or equivalently the following commutation relations on the
oscillators:
j
j
[αni , αm
] = [ᾱni , ᾱm
] = nδ ij δn,−m ,
[xi , pj ] = iδ ij .
(I.1.48)
Up to an overall normalization, the αni and ᾱni satisfy the bosonic creation-annihilation
commutation relations, with n < 0 corresponding to creation operators and n > 0 to
annihilation operators. There are still two unconstrained modes, the zero mode x− of X −
and p+ , which satisfy
[x− , p+ ] = −i.
(I.1.49)
The minus sign reflects the fact that the flat metric ηab in the conformal coordinates σ ±
takes the form η −+ = −1. Thus, the xµ describe the center-of-mass coordinates of the string,
while the oscillators describe its vibrational modes. We can decompose the Hilbert space into
representations of the Lorentz group SO(1, d − 1). In particular, the vacua of the harmonic
15
oscillators furnish an irreducible representation with states |pi labeled by a center-of-mass
momentum vector p ≡ (p+ , pi ). They satisfy
αnj |pi = ᾱnj |pi = 0,
n > 0.
(I.1.50)
The Fock space H of free string states can be constructed by repeated applications of the
creation operators to the boosted ground states above,
!
24 Y
∞
∞
24
YY j
Y
i
(ᾱ−n̄ )Njn̄
(α−n
)Nin |pi
(I.1.51)
|N, N̄; pi =
j=1 nj =1
i=1 n=1
Here, N = (Nin ) and N̄ = (Nin̄ ) are ordered collections of non-negative integer values that
determine the excitation number of each mode. Occasionally, we will also take N and N̄
P P
P24 P
to be the sums 24
i=1
n nNin and
i=1
n̄ n̄Nin̄ respectively. They are referred to as the
levels of the string state |N, N̄; pi.
Note that we have allowed ourselves to be slightly imprecise in describing the gauge-fixing
procedure. For the closed string, the light-cone gauge-fixing conditions described above leave
some residual gauge freedom – namely, translations in the σ direction. The actual space of
physical states Hphys consists of gauge-invariant states in H. These are the states uncharged
under translations in σ, i.e. those that obey the level-matching condition N = N̄ [17]. The
closed string Hilbert space is therefore
Hphys = Span{|N, N̄; pi}.
(I.1.52)
where N = N̄ . From now on we will simply refer to N as the level of the state.
To determine the masses of the closed string states in Hphys , notice that there is a
spacetime Lorentz invariant
m2 ≡ 2p− p+ − pi pi ,
(I.1.53)
which is just the usual mass squared invariant of relativistic particles. Using the fact that
p− = H, we can work systematically work out the closed string masses. Recall that the
Hamiltonian H includes two unfixed constants, A and Ā. It can be shown that these
quantities necessarily take the values
A = Ā = −1
(I.1.54)
for spacetime Lorentz invariance to be preserved. It follows that the closed string masses are
m2 ℓ2s = 4(N − 1),
16
N = 0, 1, 2, . . .
(I.1.55)
where we have temporarily restored the string length ℓs = 1.
Low-lying spectrum and NLSMs
Let us now analyze the states at each level (i.e. for each mass squared value) in detail and
discuss their spacetime interpretation. At level N = 0 we have a single state
|T i = |pi ,
m2 ℓ2s = −4,
(I.1.56)
which behaves as a spacetime scalar with negative mass squared, i.e. a tachyon. Soon, we
will introduce string interactions through a perturbative expansion. Although the free theory
is fully consistent, we will see that the presence of the tachyon introduces an incurable IR
divergence in loop diagrams, thus spoiling the theory’s consistency at the perturbative level.
This is not unlike what happens in QFT whenever the potential V (T ) of a scalar field T
develops a local maximum at T = 0 instead of a local minimum. The quantized theory
yields a scalar particle with negative mass squared that inevitably introduces divergences
in Feynman diagrams with loops. The solution in this case is to expand around a stable
vacuum (global minimum) where T = T0 and instead consider the quanta of its fluctuations
around this point. The phenomenon of tachyon condensation and whether there exists such
a vacuum remains an open problem for bosonic closed string theory.
The states at level N = 1 are given by
j
i
ζij α−1
ᾱ−1
|pi ,
m2 = 0,
(I.1.57)
where ζij is a rank-2 tensor with no constraints. According to Wigner’s classification, massless
particles in spacetime are associated with finite-dimensional irreducible representations (irreps)
of the little group SO(d − 2). We therefore decompose ζij into SO(24) irreps,
ζij = (ζ(ij) − ζδij ) + ζ[ij] + ζδij ,
(I.1.58)
which correspond to a symmetric-traceless irrep, an anti-symmetric irrep, and a (trace)
singlet, respectively. The symmetric-traceless states are simply the propagating modes of
a massless spin 2 particle, i.e. the graviton. The others corresponding to an antisymmetric
rank 2 tensor and a scalar are less familiar in Einstein gravity, but ubiquitous in theories
of gravity arising from stings. The tensor ζij represents the collective polarization of these
three states.
To each massless particle of the string we can associate a spacetime field in 26 dimensions,
Gµν ,
Bµν ,
17
φ,
(I.1.59)
which are, respectively, the spacetime metric, a 2-form gauge potential (known as the KalbRamond field or B-field, for short), and a scalar field (the dilaton). As we will explain below,
one can think of these fields as coherent states of the massless string states. Since coherent
states satisfy the classical equations of motion, we expect these fields to be governed by an
effective action that captures the low energy physics of these particles [18, 19]. The states
beyond level N = 1 have masses on the order of the string scale ℓ−1
p , and so their dynamics
−1
are irrelevant for low energies E ≪ ℓp , and can be safely neglected in the low energy effective
action.
Recall that the photon is the quantum of the electromagnetic field, and that a nonzero
classical background can be generated via coherent states of photons. Charged particles
´
couple minimally to this field via a worldline action i γ Aµ (X)dX µ . Analogously, coherent
states of the massless particles in string theory generate nonzero backgrounds for (I.1.59).
These fields couple minimally to the string worldsheet, and so it is permissible to generalize
the Polyakov formalism to include the effects of these backgrounds. The usual Polyakov
action in flat Minkowski space generalizes to
ˆ
1
SG =
Gµν (X)dX µ ⊗ dX ν .
(I.1.60)
4π Σ
The other massless fields introduce new terms that take the form
ˆ
ˆ
1
i
µ
ν
Bµν (X)dX ∧ dX , Sφ =
R(g)Φ(X),
SB =
4π Σ
4π Σ
(I.1.61)
where R(g) is the scalar curvature of the worldsheet. The first term is similar to the worldline
action of a charged particle. Indeed, we say that the string is (electrically) charged under a
one-form gauge symmetry associated with the B-field. For Φ constant, the second term is a
topological invariant of the worldsheet, and is proportional to the Euler characteristic χ. For
a compact, oriented worldsheet, it is given by
χ = 2 − 2g,
(I.1.62)
where g is its genus.
By pulling back the spacetime fields to the worldsheet, we find that the generalized
Polyakov action takes the form of a nonlinear sigma model (NLSM) on the worldsheet [18, 20],
ˆ
1
√
SN LSM =
(I.1.63)
d2 σ g Gµν (X)g ab + iBµν ǫab ∂a X µ ∂b X ν + R(g)Φ(X) .
4π Σ
Some brief comments are in order. The background fields are functions of the target space
coordinates X µ , and so (I.1.63) is generically a strongly coupled field theory, and with
18
infinitely many terms in the polynomial expansion! This makes analyzing anything but
the simplest of closed string backgrounds a daunting task. A more fundamental problem
is the lack of a guarantee that this theory is Weyl invariant in the quantum level. Weyl
invariance plays a crucial role in the consistency of BRST quantization and therefore in the
very fabric of the critical bosonic string. Using standard field theory techniques, it is possible
to calculate the beta functions of the nonlinear sigma model order by order in ℓ2s . Weyl
invariance requires that the beta functions vanish, and so give constraints on the allowed
string backgrounds. Miraculously, the lowest order constraints reproduce Einstein’s field
equations for Gµν (as well as equations of motion for B and φ). Of course, there are also
higher ordering corrections in ℓ2s that affect the high energy physics (after all, Einstein gravity
is not a UV complete theory, so this better not be the end of the story). Remarkably, the
effective action obtained by imposing Weyl symmetry can also be directly calculated from
string theory amplitudes – and the two expressions perfectly match [21].
1.7
String perturbation theory
Given the quantum closed string, we now want to introduce string interactions via splitting
and joining of strings. For strings in a general background (with an asymptotic dilaton value
Φ0 ) with action given by (I.1.63), this is described by the Polyakov path integral
Z=
∞
X
g=0
λ2g−2
0
ˆ
DXDg exp (−SG [X; g] − SB [X; g]) ,
(I.1.64)
where we have now explicitly included the sum over worldsheets of different topology, which
for 2d geometries reduces to a sum over discrete topologies labeled by the genus g ∈ N (i.e.
the number of holes). We are only considering interactions among closed strings, for which
the number of worldsheet boundaries b = 0. Note the distinction between the genus g in
the sum and the worldsheet metric gab . From the Gauss-Bonnet theorem we see that the
background dilaton exp(Φ0 ) and the bare coupling constant λ0 both contribute in the same
fashion, i.e. as exp(Φ0 )2g−2 and λ2g−2
respectively. Thus, we can combine them into a single
0
renormalized coupling constant λ and redefine the dilaton field such that its VEV reads
λ = heΦ i.
(I.1.65)
Notice that (I.1.64) takes the form of a perturbative series expansion, where λ plays
the role of the coupling “constant” of the theory. This is a slight misnomer, since one key
property of λ is that it is not a free parameter, bur rather is fixed dynamically. As we will
see, this is a universal feature of string theory backgrounds, whose parameters are generated
dynamically via the VEVs of scalar fields.
19
This is a remarkable feature of string theory! Even when we start with free string theory
(λ0 = 0), the spectrum includes coherent backgrounds where the perturbative description is
given by strings with non zero coupling λ = exp Φ0 . In other words, the free theory prescribes
the structure of the interacting theory!
Scattering states in conformal gauge
For asymptotically flat backgrounds, e.g. Gµν = ηµν , we can define an S-matrix describing
the scattering of closed string states. First, let us revisit the BRST formalism for the closed
string in conformal gauge. Recall that an on-shell physical string state in this background is
a weight (0,0) QB cohomology class that obeys the Siegel constraints. All states in the class
satisfy the level-matching conditions, and so the complete set of constraints on any state |ψi
in the class is given by
b0 |ψi = b0 |ψi = 0,
2
(I.1.66)
2
p = −m = 4(1 − N ),
N ∈ N ∪ {0},
(I.1.67)
where the mass-shell constraint now arises from BRST invariance. We can always choose their
representatives to be states of the form c1 c̄1 |V i, where c1 and c̄1 are free field oscillators of
the ghost fields c(z) and c̄(z̄), and |V i is a weight (1,1) state built purely from the fields in
the matter CFT (i.e. the oscillators of X µ ). Using the state-operator correspondence, these
are dual to vertex operators of the form
c(z)c̄(z̄)V (z, z̄),
V (z, z̄) = ∂ n1 X(z) · · · ∂¯m1 X(z̄) · · · eip·X(z,z̄) ,
(I.1.68)
where all operator products are assumed to be normal ordered. Note that BRST invariance
implies additional constraints on the tensor coefficients of these operators.
It is straightforward to determine the vertex operators (states) at each level in conformal
gauge. For instance, the tachyon is represented by the vertex operator
V (0) = eip·X ,
p2 = 4,
(I.1.69)
Similarly, the massless states at level 1 are associated with the vertex operator
¯ ν eip·X ,
V (1) = ǫµν ∂X µ ∂X
p2 = 0,
ǫµν pµ = ǫµν pν = 0,
(I.1.70)
where ǫµν is a polarization tensor. Its transversality property arises from BRST-invariance,
and ensures that the longitudinal degrees of freedom decouple – this is the expected result for
massless quanta of gauge fields in spacetime. Note that V (1) packages together the graviton,
dilaton, and B-field.
20
The string S-matrix
For a scattering process of n strings, the worldsheet takes the form of some compact manifold
Σg glued to a set of n infinitely long cylinders carrying the asymptotic closed string states.
The Diff×Weyl symmetry can be used to map this surface to the same Σg , but with n
punctures (i.e. marked points). This transformation effectively carries out the state-operator
mapping (I.1.68), with the ghosts removed. Each such operator has a natural pairing with
dz ∧ dz̄ which should be inserted in the gauge-fixed version of (I.1.64). These objects have
conformal weight (0, 0), and so preserve our choice of conformal gauge. It turns out that
while V itself is not BRST-invariant, its transformation is a total derivative on the moduli
space of inequivalent Riemann surfaces, and so its integrated version is BRST-invariant. We
therefore have that S-matrix elements take the schematic form [22]
+
* n ˆ
∞
X
Y
Vi (zi , z̄i )dzi ∧ dz̄i .
An ∼
λ2g−2
(I.1.71)
g=0
i=1
Σg
There are two primary issues with formula (I.1.71).
First, the summation over Riemann surfaces is incomplete. For example, for n = 0
and g 6= 1, it is not possible to take the metric to be flat globally. However, we can
make it flat in local patches with coordinates (σi1 , σi2 ) that cover the whole worldsheet.
Then the transition functions between two overlapping coordinates (σi1 , σi2 ) are given by
conformal transformations. If we consider the complex coordinates (zi , z̄i ) associated with
each coordinate patch (σi1 , σi2 ), the conformal transition maps between σs take the form of
holomorphic maps between the complex coordinates of each patch. Therefore, we can think
of the worldsheet as a 2 dimensional surface with a global complex structure, i.e. a Riemann
surface. However, not all Riemann surfaces of the same genus are equivalent. We can label
different Riemann surfaces by continuous parameters, called moduli, whose values do not
change under infinitesimal Diff×Weyl transformations. To account for this in (I.1.71), we
must integrate over the moduli space Mg of genus g surfaces in addition to summing over g.
The real dimension of Mg is
if g = 0,
0
dimR Mg =
2
if g = 1,
6g − 6 if g > 1.
(I.1.72)
For nonzero n, the moduli space includes the locations of the integrated vertex operators
giving 2n additional parameters – this is the moduli space of genus g Riemann surfaces with
n punctures.
21
Second, the Polyakov path integral in conformal gauge possesses some residual gauge
symmetry associated with global conformal transformations of Σg . These transformations
are generated by conformal Killing vectors (CKVs), which are global vector fields ξ a that
satisfy the conformal Killing equation
∇a ξb + ∇b ξa = gab ∇c ξ c .
(I.1.73)
CKVs correspond to conformal transformations that do not change the geometry of the
surface. We can use (and eliminate) these extra gauge redundancies to fix the position of
several vertex operators.
The way in which the moduli and CKVs enter into the path integral relates directly
to the ghost path integral. The ghost action admits a bc ghost number symmetry which
is anomalous on a general Riemann surface. The anomaly requires that all non-vanishing
correlation functions have ghost charge 6g − 6, where b has charge −1 and c has charge
+1. The result of gauge-fixing then implies that every modulus comes with a b insertion,
while every CKV comes with a c insertion. Thus, for each complex CKV we insert cc̄V (z, z̄)
´
instead of V dz ∧ dz̄. Overall, this result of gauge-fixing the string theory path integral can
be viewed as the string theory derivation of the Riemann-Roch theorem, which states that
the number of real moduli minus the number of CKVs of any Riemann surface is equal to
6g − 6.
The precise nature of the b ghost insertions traces back to the gauge-fixing procedure.
Recall that the moduli of Σg are encoded in the gauge-fixed metric gab (t). For each modulus
ti , it can be shown that this gauge-fixing procedure results in the insertion
ˆ
1
B(t) =
d2 z (bzz (µt )z̄z + bz̄z̄ (µt )zz̄ ) ,
(I.1.74)
2π
where µ is the Beltrami differential; its components can be computed directly in terms of the
metric and its derivatives,
1
∂
(µt )ab = g bc (t) gac (t),
2
∂t
(I.1.75)
and can be viewed as deforming the complex structure of the worldsheet (∂¯ → ∂¯ + µzz̄ ∂)
´
leading to an insertion of bzz µzz̄ + c.c..
22
In summary, the n-point scattering amplitude of bosonic strings is given by10
An ∝
∞
X
g=0
λ
2g−2
*dim(Mg )=6g−6 ˆ
Y
a=1
dim(CKG)
a
a
dt B(t )
n
Y
i=dim(CKG)+1
Y
cc̄Vb (zb , z̄b )
b=1
ˆ
Σg
Vj (zj , z̄j )dzi ∧ dz̄i
!+
(I.1.76)
,
where ti are the moduli of genus g surfaces and the overall coefficient can be fixed by
demanding unitarity. For genus g, there are 3g − 3 complex moduli, or 6g − 6 real moduli.
1.8
Tree-level scattering
The discussion in the previous discussion is somewhat formal, so let us consider some concrete
examples of string scattering amplitudes. The simplest string diagrams arise at tree level
(g = 0). The unique genus 0 Riemann surface is the Riemann sphere C∗ = C ∪ {∞} which
has no moduli. The CKVs can easily be found by searching for global vector fields ξ(z)∂z and
2
¯
ξ(z̄)∂
z̄ that are sufficiently regular at z = ∞. The solution ξ(z) = a + bz + cz corresponds
to 3 independent complex CKVs, which together generate the Mobius group PSL(2,C) of
fractional linear transformations
z→
az + b
,
cz + d
ad − bc = 1,
a, b, c, d ∈ C.
(I.1.77)
We must therefore fix the locations of three vertex operators. The n-point scattering amplitude
thus takes the form [9]11
A(0)
n =
*
3
Y
c(zi )c̄(z̄i )Vi (zi , z̄i )
i=1
n ˆ
Y
j=4
d2 zj Vj (zj , z̄j )
+
,
(I.1.78)
S2
where h· · · iS 2 indicates the correlation function on the Riemann sphere in the full matter
and ghost CFTs. For instance, the n = 5-point scattering amplitude is depicted in Figure
I.1.2.
10
Moduli space integrals for multi-loop amplitudes are rigorously derived in [23].
By treating the vertex operator positions as moduli, the n-point amplitude can be written in a more
symmetric form solely in terms of integrated vertex operators. See [1] for details.
11
23
cc̄V
cc̄V
´
´
V
V
cc̄V
Figure I.1.2: A five-point tree level
´ 2amplitude requires three fixed vertex operators cc̄V and
two integrated vertex operators d zV on the Riemann sphere. The value of the amplitude
is unaffected by the choice of location of the fixed vertex operators.
As a warm up exercise, we consider the scattering of three tachyons. The amplitude can
be computed using any free field theory techniques of choice. Up to an overall constant fixed
by unitarity, the result is
* 3
+
Y
(0)
AT 3 = g s
c(zi )c̄(z̄i )eipi ·X(zi ,z̄i )
≃ gs δ 26 (p1 + p2 + p3 ),
(I.1.79)
i=1
S2
Here, gs is the string coupling constant, which is proportional to the dilaton VEV λ. More
interesting is the four-tachyon amplitude,
+
* 3
ˆ
Y
(0)
AT 4 =
≃ gs2 δ 26 (p1 + p2 + p3 + p4 )A(s, t, u).
c(zi )c̄(z̄i )eipi ·X(zi ,z̄i ) d2 zeip4 ·X(z,z̄)
i=1
S2
(I.1.80)
where A is the famous Virasoro-Shapiro amplitude [24, 25]
ˆ
t
u
s
Γ
−1
−
Γ
−1
−
Γ
−1
−
4
4
4 .
A(s, t, u) = d2 z|z|−2u−4 |1 − z|−2t−4 = 2π
Γ 2 + 4s Γ 2 + 4t Γ 2 + u4
(I.1.81)
Here, s, t, and u are the usual Mandelstam invariants
s = −(p1 + p2 )2 ,
t = −(p1 + p3 )2 ,
u = −(p1 + p4 )2 ,
(I.1.82)
which satisfy s + t + u = 4m2T for the tachyon mass squared m2T = −4. The amplitude
in (I.1.81) has many interesting properties, such as duality, which we do not have time to
properly cover. Most important is that the 4-point amplitude obeys the unitarity properties
24
expected of an S-matrix element [24]. In particular, we expect the 4-point diagram to factorize
into two 3-point diagrams along with a simple pole via a unitarity cut in the s-channel (i.e.
separating p1 , p2 from p3 , p4 ). Indeed, we see that as an intermediate state goes on-shell with
s = m2N = 4(N − 1), the 4-point amplitude reduces to
s=4(N −1)
A(s, t, u) −−−−−−→
P2N (t)
,
s − 4(N − 1)
(I.1.83)
where P2N (t) is a degree 2N polynomial in t that accounts for which spins up to 2N are
exchanged, consistent with maximum spin at level N .
1.9
One-loop scattering
Now let us consider 1-loop amplitudes, which correspond to genus g = 1. Such worldsheets
have the topology of a torus T 2 . It is worthwhile to spend some time discussing the geometry
of the torus and the structure of its moduli space. We can construct a torus as a quotient
C/Λ, where Λ is a lattice generated from two basis vectors e1 and e2 . Its fundamental
domain consists of the parallelogram with two of its sides given by ei , with opposite sides
identified. As Riemann surfaces, two tori are equivalent if they are related by a conformal
transformation. For us, this means a particular torus can be described by a single complex
modulus τ , known as a Techmüller parameter, which corresponds to e1 /e2 . Without loss
of generality, such a torus can be constructed from the associated parallelogram with sides
given by e1 = 1 and e2 = τ , as shown in Figure I.1.3.
iR
τ
R
0
1
Figure I.1.3: The torus T 2 with complex structure τ as described by the quotient C/Λ,
where Λ is the lattice generated by {1, τ }. The fundamental domain is shaded in green, with
opposite sides identified.
25
iR
τ +1
τ
1
R
Figure I.1.4: Different parameterizations of the same torus are related by the action of the
modular group P SL(2.Z). A torus with modulus τ can also be described with modulus τ + 1.
−1/2
0
1/2
Figure I.1.5: A fundamental domain F of the modulus space of T 2 . The two vertical sides
are identified, as are the two circular arcs. The image of F under SL(2, Z) is the entirety of
H.
Naively, each torus can be parametrized by a modulus τ taking value in the upper-half
plane H. However, there is in fact an equivalence class of τ values which correspond to tori
that can be mapped to one another by different choice of basis for Λ. The group which relates
these different parameterizations is known as the modular group, which for T 2 is isomorphic
to P SL(2, Z), and is generated by τ → τ + 1 and τ → −1/τ . The modular group acts on a
26
given modulus as12
aτ + b
,
τ −→ τ =
cτ + d
′
a b
c d
∈ SL(2, Z).
(I.1.84)
To avoid over-counting in the path integral, we should really be integrating over values of τ
in a single fundamental domain of H/P SL(2, Z) such as
1
1
(I.1.85)
F = τ ∈ H − ≤ Re(τ ) ≤ , |τ | ≥ 1 ,
2
2
shaded in blue in Figure I.1.5.
We are now ready to talk about the genus 1 contribution to worldsheet amplitudes. Recall
that the nature of the ghost insertion is determined by the number of conformal Killing group
(CKV) and moduli. First, we find the number of c-ghosts required. The conformal Killing
group (CKG) of the torus, T 2 , is easy to describe: it consists of translations. There is thus
a single (complex) CKV, which leads to the insertion of a single pair cc̄ in all scattering
amplitudes. Intuitively, their location can be fixed using the translation symmetry. Second,
we work out the number of b-ghost insertions and their form. Earlier, we found that the
torus had a single complex modulus τ = τ1 + iτ2 . For convenience, we encode the moduli
dependence of T 2 in the metric,
ds2 = |dσ 1 + τ dσ 2 |2
(I.1.86)
such that the coordinates maintain the usual periodicity properties
(σ 1 , σ 2 ) ∼ (σ 1 , σ 2 ) + (2π, 2π).
The associated b-ghost insertions are then given by
ˆ
ˆ
1
i
i
2
Bτ Bτ̄ =
d zb(z)
d2 w̄b̄(w̄),
2
4πτ2
4πτ2
(I.1.87)
(I.1.88)
where the factor of 1/2 accounts for the fact that dτ dτ̄ = 2dτ1 dτ2 . Note that Vol(T 2 ) =
(2π)2 τ2 . From our previous discussion, the integration range of τ should be restricted to the
chosen fundamental domain F; we must also include an extra factor of 1/2 in the measure to
account for the Z2 in the CKG. Putting everything together, the 1-loop scattering amplitude
12
The correct modular group is P SL(2, Z) since a given element A ∈ SL(2, Z) and its additive inverse −A
both yield the same transformed value τ ′ .
27
takes the form
A(1)
n =
ˆ
F
d2 τ
2(4πτ2 )2
ˆ
d2 zd2 w
*
b(z)b̄(w̄)cc̄V1 (z1 , z̄1 )
n ˆ
Y
j=2
d2 zj Vj (zj , z̄j )
T2
+
,
(I.1.89)
T2
where h· · · iT 2 indicates the path integral over the fields on T 2 with implicit modulus τ .
The 1-point amplitude can be recast in a more symmetric form by recognizing that the
path integral associated to the correlation function in (I.1.89) is independent of the ghost
´
positions: this permits the substitution d2 zb(z) = 2Vol(T 2 )b(0) for both b-ghost insertions.
Since the full CKG T 2 ⋊ Z2 is finite (with volume 2Vol(T 2 )), it is equivalent to inserting
c-ghosts at arbitrary positions, using only integrated vertex operators, and then dividing by
the volume of the CKG to account for the gauge redundancy. The result is
+
ˆ 2 *
n ˆ
Y
d
τ
A(1)
d2 zj Vj (zj , z̄j )
b(0)b̄(0)c(0)c̄(0)
.
(I.1.90)
n =
4τ
2
2
F
T
j=1
T2
The torus partition function
(1)
Consider the vacuum amplitude A0 , which for the torus is well-defined and physically
meaningful. We will temporarily ignore the moduli space integration; the integrand
ZT 2 (τ ) = hb(0)b̄(0)c(0)c̄(0)iT 2
(I.1.91)
is simply the path integral over the matter and ghost fields with some extra ghost insertions
to saturate the fermionic zero modes. In the QFT language, this is often referred to as the
torus partition function. It is most readily calculated using the path integral formalism for
states in the Hilbert space. In this formalism, the path integral on a (Euclidean) cylinder of
length T simply computes the overlap hψf |e−HT |ψi i, where H is the Hamiltonian and |ψi,f i
are the final and initial states, respectively, which appear through the boundary conditions
implemented on each end of the cylinder. This line of reasoning extends to the torus, which
can be thought of as a cylinder of length 2πτ2 whose boundaries are glued after a 2πτ1 twist.
This twist is accounted for by inserting e2πiτ1 P in the overlap, where P is the generator
of translations along σ 1 , i.e. the momentum. The two operators H and P generate the
isometries of the cylinder, and can be expressed in terms of the Virasoro generators as
H = L0 + L0 ,
and
28
P = L 0 − L0 .
(I.1.92)
Finally, the operation of gluing becomes the trace over all states in the Hilbert space H with
periodic boundary condition for ghosts. The path integral can thus be rewritten as
(I.1.93)
ZT 2 (τ ) = TrH (−1)F b0 b̄0 c0 c̄0 q L0 q L0 , q = e2πiτ , q = e−2πiτ ,
where (−1)F , by definition, anti-commutes with the ghost fields and commutes with the
matter fields (this determines its action on all of the states via the state-operator mapping).
The full vacuum amplitude is given by reintroducing the moduli space integration:
ˆ 2
dτ
(1)
F
L0 L0
A0 =
.
(I.1.94)
TrH (−1) b0 b̄0 c0 c̄0 q q
F 2τ2
While we now have all the ingredients to compute the vacuum amplitude for the bosonic
string, we pause briefly to flesh out some of details of the CFT partition function; this will
come in handy later when more general matter CFTs arise, such as in string compactifications.
The full state space of the bosonic string splits into the direct sum Hm ⊕ Hgh , where the
subscripts denote the matter and ghost spaces, respectively. Consequently, the partition
function factorizes into separate ghost and matter contributions. The matter CFT contributes
L0 −c/24 L0 −c̄/24
Zm (τ ) = TrHm q
,
(I.1.95)
q
where the central charges c = c̄ = 26 now contribute due to the conformal anomaly on the
cylinder. This equation is often the more familiar one in the CFT context, and is equal to
h1iT 2 when there is no possibility for fermionic zero modes. Due to the trace, the partition
function above clearly gives a sum over the matter states, weighted by the conformal weights
assigned to each state. One might initially expect that such a sum is unconstrained, though
thanks to string theory we know this to be untrue. Given a parametrization τ of the torus,
there is a whole family of equivalent moduli generated by the action of P SL(2, Z). Since the
2
measure dτ2τ is invariant under this action, we therefore expect that Z(τ ) should generically
be modular invariant as well in any consistent string theory.13
Modular invariance has dramatic consequences at the level of the states in the CFT (and
by extension the string spectrum). Using the trace form of Z(τ ) reveals constraints on the
operator spectrum. For instance, invariance under τ → τ + 1 implies that all local operators
must have integer spins, i.e. h − h̄ ∈ Z. Invariance under τ → −1/τ constrains the high
energy spectrum in terms of the low energy data, which leads to various universal features
such as Cardy’s formula for the density of states.
We now return to the calculation of the closed string vacuum amplitude in (I.1.90),
13
Modular invariance at the level of the CFT translates to invariance of the worldsheet string theory under
large Diff×Weyl gauge transformations.
29
focusing first on the CFT partition function. The matter oscillators contribute
Tr′Hm q L0 −26/24 q̄ L̄0 −26/24 =
∞
Y
|q|−2/24
n=1
|1 − q n |−2
!26
= η −26 η̄ −26 ,
(I.1.96)
Q
where η = q 1/24 n (1 − q n ) is the Dedekind eta function. The trace over zero modes
(momentum eigenstates) gives
iV26
ˆ
d26 p
2
(q q̄)p /4 = iV26
26
(2π)
ˆ
d26 p −πτ2 p2
e
= iVd (4π 2 τ2 )−13 ,
(2π)26
(I.1.97)
where V26 is the 26-dimensional spacetime volume. The factor of i comes from Wick rotating
the X 0 field. The trace over ghost oscillators gives the additional factor
TrHgh (−1)F b0 b̄0 c0 c̄0 q L0 +26/24 q̄ L̄0 +26/24 = η 2 η̄ 2 ,
(I.1.98)
which serves to cancel the two longitudinal degrees of freedom. This is consistent with lightcone quantization, where only the transverse degrees of freedom are dynamical. In addition, it
is important to note that the cancellation between longitudinal and ghost degrees of freedom
is only possible for a single timelike direction in the target space [26]. In a unitary quantum
theory, the Hilbert space of states by definition should have a positive-definite inner product.
Consider the usual worldsheet string theory with 26 bosons and the bc ghost system, but now
suppose r of the bosons are timelike. The space of states for this theory includes negative
norm states due to the timelike and ghost oscillators that should drop out when we project
to the BRST cohomology. Therefore, in a unitary string theory the partition function should
be unaffected by the insertion of (−1)σ into the trace, where (−1)σ = +1 for spacelike
oscillators and (−1)σ = −1 for timelike. The ghost oscilators have signature (1, 1). Thus,
nonzero modes with this factor contribute
Q
2
(1 − q n )(1 + q n )
′
σ
F
L0 L̄0
= |(1 − q n )−24 |2 , (I.1.99)
= Q n
TrH (−1) (−1) b0 b̄0 c0 c̄0 q q̄
n )26−r (1 + q n )r
(1
−
q
n
where the last equality follows assuming unitarity. Therefore, we must take r = 1 and so
there is only a single timelike direction compatible with positive definite inner product on
BRST cohomology states.
Combining everything yields the vacuum amplitude [27]
(1)
A0
= iV26
ˆ
F
d2 τ
(4π 2 τ2 )−13 |η|−48 .
2τ2
(I.1.100)
Given that d2 τ /τ2 and τ2 |η|4 are themselves modular invariant, clearly the full amplitude is
30
as well. Note that the region τ2 → 0 is absent from our domain of integration. This regime
describes ultraviolet (UV), or high energy, processes. Without restricting the τ integration
to F, the integration would produce a UV divergence due to the τ2 → 0 region. Unlike in
QFT, this type of divergence is not present for the string vacuum amplitude. The modular
invariance of the torus effectively acts as a UV cutoff that renders the theory UV finite at one
loop. This behavior turns out to be true for general string amplitudes, and thus perturbation
of the closed bosonic string is expected to be UV finite. That being said, if we expand the
integrand as a power series in q, we find a term that behaves like q −1 . This is due to the
tachyon, and ultimately gives rise to an IR divergence. This is the failure of perturbation
theory and so bosonic string amplitudes are only formally defined. Our efforts have not truly
ended in failure, however, since these IR divergences will turn out to be absent for certain
superstring theories.
2
Bosonic string compactifications
Up to now we have considered closed strings propagating in flat Minkowski spacetime.
However, with the NLSM worldsheet theory introduced in Section 2.2, it is natural to consider
more general target spaces for the worldsheet fields. For example, it could be that the target
space of the worldsheet theory has a component of Rk−1,1 where the SO(1, k − 1) isometry
of Rk−1,1 extends to an exact global symmetry of the worldsheet theory. Such a theory could
potentially describe a string theory in a k dimensional Minkowski spacetime. We can go one
step further and replace Rk−1,1 with a space with non-zero curvatures associated with Gij and
Bij . Such a theory can potentially describe a theory in a curved spacetime with torsion! In
general we can think of any modular invariant conformal field theory with zero total central
charge as a string theory. Depending on the structure of the target space of the theory and
its global symmetries, the theory may or may not have a spacetime interpretation. In the
following we review some important examples of 2d CFTs and methods to construct them.
2.1
Geometric CFTs
Circling back to the Minkowski spacetime, an important class of string theory backgrounds
arise from worldsheet theories with the form of a product CFT
Rk−1,1 × Cc=26−k ,
(I.2.1)
where Rk−1,1 refers to the X CFT with k free noncompact scalars, and Cc=26−k is some
compact CFT with central charge c = 26 − k. The condition c = 26 − k ensures that after
taking the ghost theory into account, the total central charge vanishes. Often times this
31
auxiliary CFT can be expressed as a NLSM with a compact target space (though in general
the compact CFT need not have a geometric interpretation). Sometimes these backgrounds
can be thought of as k large spacetime dimensions with 26−k curled up dimensions where the
spectrum of C is discrete. In this subsection we study some examples where this geometric
interpretation is explicit in the sense that the Cc=26−k theory is a 26 − k dimensional NLSM.
These theories are often called compactifications of the 26 dimensional theory and the target
space of the NLSM is called the internal geometry. Typically, the geometric parametries such
as the size of the cycles become dynamical fields. Thus, the case where the 26 dimensions are
flat corresponds to a particular limit of the field space where the cycles have infinite radii.
The simplest string compactification that we can consider is one with a one-dimensional
internal geometry, a circle. In what follows, we will explain some aspects regarding such
theories.
Kaluza-Klein mechanism
First recall field theories coupled to classical gravity compactified on a circle of radius R.
Suppose the spacetime has d + 1 dimensions where xµ for µ ∈ {0, ..., d − 1} are non-compact
while xd is the compact coordinate with the periodicity condition xd ∼ xd + 2πR. Since the
spacetime momentum pd is generator of translation and a translation of 2πR along the circle
is identity.
ei2πR
p̂d
≡ 1.
(I.2.2)
Therefore, the eigenvalues of momentum along the circle pd = n/R are quantized with n ∈ Z.
Moreover, we can decompose a massless d+1 dimensional field φ into an infinite set of Fourier
modes along the compact dimension.
X
d
φ(x0 , ..., xd ) =
φn (x0 , ..., xd−1 )ein(x /R) .
(I.2.3)
n∈Z
The coefficients φn only depend on the non-compact coordinates and hence are d dimensional
fields. The spacetime excitation of φn has pd = n/R. And the (d + 1)-dimensional mass
formula
X
E2 =
pµ pµ
(I.2.4)
µ≤d
becomes
E 2 = n2 /R2 +
X
µ≤d−1
32
pµ pµ .
(I.2.5)
In other words, the fields develop nonzero masses via the compact momentum, i.e.
m2 = n2 /R2 .
(I.2.6)
Such modes are ubiquitous in general compactifications, where they are referred to as KaluzaKlein (KK) modes. In string theory, circle compactifications are more interesting because
the string can wind multiple times around the circle.
The compactified worldsheet theory
The worldsheet theory consists of 25 noncompact scalars X µ as well as a periodic scalar
X 25 : Σ → SR1 . For a string which winds w times around the circle we have
X 25 (σ + 2π, τ ) = X(σ, τ ) + 2πRw.
(I.2.7)
These winding modes are topologically distinct and can be thought of as solitons of the
worldsheet theory. The spacetime momentum is quantized according to
p
25
1
=
2π
ˆ
2π
dσ∂τ X 25 =
0
n
.
R
(I.2.8)
We must include both the KK and winding modes to ensure modular invariance on the
torus. The momentum and winding constraints imply that the mode expansion of X 25 on
the cylinder takes the form [28, 29]
n
i X αn25 −in(σ+τ ) α25
n in(σ−τ )
25
X (σ, τ ) = wRσ + τ + √
,
(I.2.9)
e
+
e
R
n
2 n6=0 n
where αn25 , α25
n are oscillator modes that obey the usual free oscillator algebra. The quantized
theory is very similar to that of the noncompact free boson, except for one crucial difference.
Previously, the ground state of the worldsheet theory had a continious degenerecy labeled by
26 dimensional momentum of the Tachyon. One can think of the degenrate vacua as a family
of states that transform to each other under the spacetime Lorentz transformations which are
global symmetries of the worldsheet. Now that one of the components of the momentum is
quantized, the worldsheet ground states are labeled by |pµ ; n, wi. The integer n is associated
with the quantization of momentum which reflects the symmetry of the worldsheet theory
generated by discrete translation along X 25 . But how about w? Could it be that the presence
of another integer signals additional features?
The answer is yes, and global symmetries of the worldsheet lead to gauge symmetry in
the spacetime! The pair (n, w) are in fact charges of a U (1)n × U (1)w global symmetry on the
33
worldsheet. In addition to the massless string states that give the 25-dimensional graviton,
B-field, and dilaton, we now have additional massless vectors generated by
[25
µ]
(25
ᾱ−1 α−1 |pµ ; 0, 0i ,
µ)
ᾱ−1 α−1 |pµ ; 0, 0i .
(I.2.10)
These states are the “photons” of the two U (1) factors. From the 26d point of view, the
gauge field for U (1)n arises from the metric G25µ , while that of U (1)w arise from the B-field
B25µ . Clearly the U (1)w symmetry is a purely stringy effect, since strings not particles are
25 25
charged under the B-field. Note that there is another massless state ᾱ−1
α−1 |pi that leads to
a massless scalar in spacetime known as a moduli field. It arises from G2525 , and so changes
the size of the circle, which is dynamical.
α025
Another difference in the worldsheet theory of the compact boson is that the oscillators
and α25
0 are distinct. We can see this by writing the zero modes as
1
X 25 (σ, τ ) = [pL (τ + σ) + pR (τ − σ)],
2
(I.2.11)
where we have introduced the left/right-moving momenta
α025 pL =
n
+ wR,
R
ᾱ025 = pR =
n
− wR.
R
(I.2.12)
These quantities naturally form a vector
1
l = √ (pL , pR )
2
(I.2.13)
which generates a 2d lattice p(1, 0) + q(0, 1) for integers p, q. The lattice has some special
properties in terms of the following inner product.
1 2
l1 · l2 = lL1 lL2 − lR
lR .
(I.2.14)
With this norm we can check that l · l′ ∈ Z. Such lattices are called integral. Moreover, the
norm
l · l = 2nw ∈ 2Z.
(I.2.15)
is an even integer. Lattices with this property are referred to as even. Moreover, if l′ · l ∈ Z
for all l in the lattice, then l′ is also in the lattice. This implies that the lattice is self-dual.
Such lattices are called Narain lattices [30].
The inner product (I.2.14) is preserved under SO(1, 1). Therefore, we can find a Narain
lattice by applying elements of SO(1, 1) on another one. We can generalize this to higher
34
dimensional lattices. Suppose we have a Narain lattice where pL takes values in a p dimensional
lattice and pR in a q dimensional lattice such that the overall lattice is even self-dual. We
denote such a lattice by Γp,q . Applying every element of SO(p, q) to a Narain lattice gives
another Narain lattice. However, not all of these transformations lead to distinct lattices. For
example, if we act on pL and pR separately with elements of SO(p) and SO(q) respectively,
the lattices will not change, and they simply reflect a change of coordinates. In fact, up to a
SO(p,q)
. The dimension of
discrete quotient, the moduli space of (p, q) Narain lattices is SO(p)×SO(q)
this space is pq. In the special case of p = q = 1, this space is can be parametrized by one
variable which is the radius R. The radius R serves as the SO(1, 1) boost parameter.
T-duality
The self-dual property of the lattice has important physical implications. Consider the Z2
self-dual transformation
1
R → , n ↔ w,
(I.2.16)
R
which can be recast as the map
pL → pL ,
pR → −pR .
(I.2.17)
The invariance of string theory under the duality transformation (I.2.16) is referred to as
T-duality [31].
Instead of R, we can consider a parameter λ more closely related to the moduli field,
R = eλ .
(I.2.18)
T-duality now takes the form λ → −λ. Under T-duality, the radius of the circle is no longer
an invariant notion. More surprisingly, very small radii are mapped to large radii, which
implies there is no absolute notion of distance. To resolve these issues, we must consider
what it means to have a circle of small radius. From the quantum viewpoint, it is unclear
how to resolve distances which are smaller than the size of a typical wavepacket. It is natural
then to define the physics of small radii in terms of the dual large radii theory.
In other words, as seen in Figure I.2.1, the space of inequivalent theories is the half-line
R ≥ 1. Equivalently, we could take the set of inequivalent theories to be 0 ≤ R ≤ 1, but it is
more natural to think in terms of the larger of the two equivalent radii: momenta continua
are more familiar than winding number continua. In particular, questions of locality are
clearer in the larger radius picture. For this parametrization, there is no radius smaller than
the self-dual radius:
√
Rself −dual = RSU(2)×SU(2) = α′ .
(I.2.19)
35
Clearly λ = 0 is invariant under this transformation and hence labels the self-dual point.
This allows us to simply the diagram of inequivalent theories in terms of λ as in Figure I.2.1.
SU(2)×SU(2) self dual point
λ=0
Figure I.2.1: The space of compactified bosonic string theories on S 1 . Here, λ = 0 is the
SU(2)×SU(2) self-dual point, so only theories with λ ≥ 0 are considered inequivalent.
The self-dual point is related to the emergence of new symmetries. As it turns out, at
the self-dual point R = α′ /R the duality transformation becomes a bonafide symmetry! In
particular, there is an emergent SU(2)×SU(2) gauge symmetry in spacetime, as shown in
Figure I.2.2.
R2 = 1
SU(2)×SU(2) gauge symmetry
Figure I.2.2: At the self-dual point R =
√
α′ , there is an SU(2)×SU(2) gauge symmetry.
It is fairly straightforward to derive the emergent symmetry from the worldsheet theory,
where we expect the appearance of new conserved currents. The mass-shell conditions for
the compactified theory at arbitrary radius are [1]
4
1 2 1 2
pL − pR + ′ (NL − NR ) = 0
2
2
α
1
1 2 1 2
m = pL + NL − 1 = p2R + NR − 1.
2
2
2
36
(I.2.20)
(I.2.21)
For the self-dual radius R =
√
α′ and m2 = 0, these take the simplified form
0 = n2 + w2 + 2(NL + NR − 2)
(I.2.22)
Let us focus on a single sector, say the left-moving one with NL > 0 and NR = 0. Then the
massless states are given by
(n, w) = (1, 1),
NL = 1, NR = 0 ,
(I.2.23)
(n, w) = (−1, −1),
NL = 1, NR = 0.
(I.2.24)
These two states correspond to the left-moving W ± bosons! The right-moving states are
recovered from NR = 1 and NL = 0. The Z bosons arise from states corresponding to
the aforementioned U (1) fields that emerge due to the KK compactification at any radius.
Altogether, these particles combine into two independent SU(2) triplets.
There is a nice connection between the duality group Z2 and the gauge group SU (2).
At the self-dual point, the duality group becomes a symmetry. Therefore, the Z2 must map
a representation of SU (2) to another representation of SU (2) with same dimensions. Since
every representation is specified by its weights, the symmetry group Z2 must map a weight
lattice of SU (2) to another weight lattice. In fact, the Z2 is the Weyl group of reflection
symmetries of the root lattice of SU (2). This connection extends to many other examples.
At the self-dual point, a subgroup of the duality group is the symmetry group of the root
lattice of the emergent gauge group.
Optional exercise : Consider a compactification on the torus T n (G, B). Show that the
partition function takes the form
Z=
P
1 2
1 2
q 2 pL q̄ 2 pR
,
η n η̄ n
for η = q 1/24
Y
n
(1 − q n ).
(I.2.25)
Check that it is invariant under modular transformations, i.e. τ → τ + 1 and τ → −1/τ .
Show that invariance under T-transformations implies that the lattice of momenta is even,
and invariance under S-transformations implies that the lattice is self-dual.
Exercise 1: Suppose we want to compactify d dimensions where d is the rank of an ADE
group G. Show that you can choose the background fields Bij and Gij such that the resulting
lattice of (PL , PR ) of the momenta in the compact directions is a weight lattice (w, w′ ) of G
where the difference w − w′ is in the root lattice of G. Using this, show that this lattice is
even and self dual. Now show that we can choose the size of the compact dimensions such
that the compactified theory has a G × G gauge symmetry.
37
2.2
WZW models and current algebras
More generally, we can talk about geometric string compactifications via NLSMs with an
arbitrary spacetime metric and H-flux where H = dB. For instance, we can take the target
space to be some group manifold G. The NLSM action can be rewritten in Wess-ZuminoWitten (WZW) form [32]
ˆ
2
k
Tr g −1 dg + ikSW Z ,
(I.2.26)
S=−
8π Σ=∂M 3
where the target space coordinates are embedded in the group element g = exp(iT a X a ) with
T a the generators of the Lie algebra Lie(G). The Wess-Zumino term [33]
ˆ
1
SW Z =
Tr(g −1 dg)3
(I.2.27)
24π M 3
arises from the B-field contribution, rewritten in terms of the H-flux via Stoke’s theorem.
Both terms in the WZW action are necessary to preserve conformal invariance to O(α′2 ) [34].
Optional exercise : Show that the Wess-Zumino term arises from the coupling of the B-field
to the worldsheet.
Current algebras
The WZW action is invariant under the the transformation
g → LgR−1 ,
(I.2.28)
where L and R lie in separate copies GL and GR of the group manifold. The theory thus
admits a GL × GR global symmetry with conserved currents
J i (z) = Tr ∂gg −1 T a ,
¯ −1 T a .
J¯j (z̄) = Tr ∂gg
(I.2.29)
In the quantum theory, the WZ term leads to an infinite-dimensional enhancement of this
global symmetry. The currents satisfy a particular type of OPE [35]
J a (z)J b (w) ∼
if abc J c
kδ ab
+
,
(z − w)2
z−w
(I.2.30)
which leads to an extension of the Virasoro algebra known as a current algebra. Here, f abc are
the structure constants for Lie(G) and the value k is quantized due to the Dirac quantization
38
condition for B. As it is an extension of Virasoro, the currents completely specify the stress
tensor and its associated central charge
c=
k dim G
k + ĥG
,
(I.2.31)
where ĥG is the dual Coxeter number for Lie(G). The classical limit k → ∞ corresponds to
the case where the group manifold becomes flat. In this limit, c ∼ dim G, coinciding with
the fact that the theory describes dim G free bosons.
Exercise 2: Show that rank(G) ≤ c ≤ dim(G). For simply laced the left hand side
saturates.
As a simple example, we consider a û(N ) current algebra at level k = 1 constructed from
N complex Weyl fermions ψ a . The worldsheet action on the complex plane takes the form
ˆ
¯ a,
S = d2 z ψ̄ ā ∂ψ
(I.2.32)
which have a singular OPE
Ψa (z)Ψb̄ (0) ∼
δ ab̄
.
z
(I.2.33)
Despite notation, both Ψ and Ψ̄ are holomorphic fields. Their stress tensor is given by
1
T = − δ ab̄ Ψa ∂ Ψ̄b̄
2
(I.2.34)
which fixes the central charge to be c = N , i.e. each complex fermion contributes c = 1. Note
that T̄ = 0 and so c̄ = 0 as well. The theory is naturally invariant under U (N ) rotations,
under which Ψ (Ψ̄) transforms as the fundamental representation (anti-fundamental). The
associated conserved currents take the form
J ab̄ = Ψa Ψb̄ .
(I.2.35)
It is a straightforward exercise to verify they satisfy the û(N ) current algebra with k = 1.
Coset models
Another collection of models we can consider are the so-called coset models (gauged WZW
models) [36]. Consider a WZW model with group G, stress tensor TG , and central charge cG .
39
Supposing that G contains some subgroup H ⊂ G, there is an additional set (TH , cH ) for the
WZW model for H. We can then “gauge out” H by constructing a new theory with a stress
tensor and central charge given by
TG/H = TG − TH ,
cG/H = cG − cH .
(I.2.36)
This corresponds to taking the original WZW action and gauging the subgroup H.
2.3
Orbifolds
Thus far we have studied toroidal compactifications of string theory, with the key stringy
ingredient being the emergence of winding modes, i.e. the string wrapping various onecycles. We also saw that in general it may be necessary to turn on H-flux (in addition to
the nontrivial metric) to preserve conformal invariance on the worldsheet. In this section, we
consider another example of worldsheet CFT, orbifolds, which are manifolds with singular
curvature [37–39].
Orbifold CFTs
Our first experience with orbifolds will be through orbifold CFTs, which arises from gauging
discrete worldsheet symmetries. As we will see, the singular geometry of the target space
will be emergent from this point of view.
Consider a unitary CFT invariant under some discrete symmetry group G, which we label
by CFTG . By definition, the group is represented unitarily, i.e. the theory admits a G-action
on states of the form
g : |ψi → U (g) |ψi ,
g ∈ G,
(I.2.37)
where U (g) is a unitary operator. When there is no room for ambiguity, we write g as a
shorthand for U (g). A natural attempt at gauging the symmetry is to project the Hilbert
space to a G-invariant subspace:
H −→ HG := PG H,
PG =
1 X
g.
|G| g∈G
(I.2.38)
Indeed, the map PG is a Hermitian projection operator
ln PG = PG ,
40
PG2 = PG .
(I.2.39)
The evaulation of partition function of this theory simply amounts to inserting PG in the
trace
1 X
ZG =
TrH g q L0 −c/24 q̄ L̄0 −c̄/24 .
(I.2.40)
|G| g∈G
An immediate problem is that ZG is not modular invariant, and so our naive gauging
procedure is incompatible with string perturbation theory. Consider an S-transformation
that interchanges the temporal and spatial circles. Before, we could think of g as acting
on states every time we wind around the temporal circle. Now, it must act when we wind
around the spatial circle. We can no longer think about g as acting on states but rather as
implementing different boundary conditions for local operators
O(σ + 2π, τ ) = Ug O(σ, τ )Ug† := g · O(σ, τ ).
(I.2.41)
Under the state-operator mapping, these operators correspond correspond to states in some
new space Hg known as a twisted sector. Note that the twisted sector coincides with neither
H nor PG H , where all of the (bosonic) operators are periodic.
We should therefore modify the gauged theory CFTG /G to include all G-invariant states,
including the twisted sectors
M
HCFTG /G =
PG H g .
(I.2.42)
g∈G
We are then left with a well-defined, modular invariant partition function
ZCFTG /G =
1
|G|
X
g,h∈G|[g,h]=1
TrHh g q L0 −c/24 q̄ L̄0 −c̄/24 ,
(I.2.43)
where in the h-twisted sector we have restricted to symmetries g that do not change the
sector (gh = hg).
Z=
1
|G|
P
gh=hg
g
h
Figure I.2.3: Partition function of an orbifold theory including the twisted sectors.
We can assign different phases to each sector which corresponds to turning on the B field.
However, theses phases, which are also called discrete torsion, cannot be assigned arbitrarily.
The phase assignment must not break modular invariance [40]. Of course just as in any
41
gauging, there cannot be anomalies which renders it inconsistent. In string theory context,
this amounts to the left-right level matching condition.
Geometric orbifolds
With the understanding of how to construct orbifold theories on the worldsheet, we now
investigate the consequences on the spacetime geometry.
As a simple example, we begin with toroidal orbifolds. Take a torus T d equipped with a
group action for some discrete group G. We can consider the quotient space T d /G, which
generically has a non-trivial topology. Any fixed points of G are necessarily singular. For
instance, consider the unit circle S 1 whose center coincides with the origin of the (x, y) plane.
It admits a Z2 symmetry that sends y → −y. The space S 1 /Z2 is clearly an orbifold, since
the symmetry leaves the two points at y = 0 and y = 12 fixed. We can visualize S 1 /Z2 as a
line segment stretching from y − 0 to y = 21 , with two singularities at its endpoints.
We now seek the CFT construction of geometric orbifolds. A NLSM with some target
space manifold M will generically be invariant under a discrete subgroup G of the full isometry
group of M . As we saw in the previous section, from the CFT perspective, discrete gauging
requires that orbifold states are singlets under G. Moreover, to ensure modular invariance
we must include all of the twisted sectors. For the closed string, it is easy to see that the
twisted sectors correspond to identifying spacetime points X i with the same G-orbit. Now,
an open string stretched between these two points is, in fact, a closed string. This leads to
new string states which can be precisely identified with the twisted sector states.
A particularly illuminating example is the bosonic string on the torus T 2 . The theory has
a Z2 symmetry that maps x 7→ −x for x ∈ T 2 . On the worldsheet, the Z2 -action is given by
∂X 1,2 7→ −∂X 1,2 ,
∂X µ → ∂X i ,
µ 6= 1, 2 ,
(I.2.44)
where the X µ correspond to the transverse spacetime directions (which we henceforth drop).
Note that (pL , pR ) 7→ (−pL , −pR ) and αn 7→ −αn under this map. In the untwisted sector,
the Z2 -invariant states are thus given by acting with an even/odd number of oscillators,
respectively, on
1
|±i = √ (|pL , pR i ± |−pL , −pR i) .
2
(I.2.45)
Recall that the partition function of the ungauged theory is
ZT 2
1 2
X
1 2
q 2 pL q̄ 2 pR
.
=
Q
2/24
n )2 |2
|q
(1
−
q
n
(pL ,pR )∈Λ
42
(I.2.46)
Once we insert the action of g into the trace, the ground state now contributes
hpL , pR | Tr q L0 q̄ L̄0 |−pL , −pR i ,
(I.2.47)
which is non-zero only for pL = pR = 0. We thus only keep the states with pL = pR = 0.
Given that g acts as q → −q, we easily find the partition function for the untwisted sector
1
1
1
Z = ZT 2 +
,
Q
2
2 |q 2/24 n (1 + q n )2 |2
(I.2.48)
where the factor of 1/2 arises from the 1/2 in Pg = (1 + g)/2.
The twisted sectors correspond to the choice of boundary conditions
X 1,2 (σ + 2π, τ ) = ±X 1,2 (σ, τ ).
(I.2.49)
These periodicity conditions lead to a new ground state as well as a different set of oscillators
with half-integer grading. There are four fixed points of the Z2 action on T 2 , which simply
produces a multiplicative factor in the twisted sector partition function. The ground state
energy in this sector can be calculated from the regularized sum
∞
1
1X
1
(n + η) = − + (η(1 − η)) .
2 n=0
24 4
(I.2.50)
The modes here are half-integers, so η = 1/2 and E = +1/48. This gives the trace in the
twisted sector,
4
(I.2.51)
trHtwisted q L0 −c/24 q̄ L̄0 −c̄/24 =
Q
n+1/2 )2 |2
−2/48
|q
n (1 − q
Of course, this trace only gives the partition function before gauging, i.e. we’re missing
trHtwisted (g(· · · )). This is given by
trHtwisted gq L0 −c/24 q̄ L̄0 −c̄/24 =
The complete T 2 /Z2 partition function is thus
|q
Q
−2/48
4
n+1/2 )2 |
n (1 + q
2.
(I.2.52)
2
2
1
1
1
+
+
.
ZT 2 /Z2 = ZT 2 +
Q
Q
Q
2
2
2
2 |q 2/24 n (1 + q n )2 | |q −2/48 n (1 − q n+1/2 )2 | |q −2/48 n (1 + q n+1/2 )2 |2
(I.2.53)
Exercise 3: Verify that the last term in (I.2.53) can be obtained by an appropriate modular
43
transformation on (I.2.51). Also verify (I.2.53) is invariant under τ → τ + 1/2.
Optional exercise: Consider the orbifold T 2 /Z3 . The Z3 action on T 2 can be written as
z = x1 + ix2 ,
z → ωz,
ω3 = 1 .
(I.2.54)
Construct the partition function of this theory; it is helpful to note that the twisted sectors
are complex conjugates of one another (ω and ω 2 = ω −1 ).
There are also perfectly sensible string theory vacua which are not bonafide geometries,
known as non-geometric compactifications [41–43]. These can arise when considering orbifold
theories which cannot be interpreted as target space geometries of the form M/G. As an
example, we can compactify on a torus T d to get states whose momenta (pL , pR ) lie in a 2ddimensional lattice Λ. For certain choices of metric and B-flux, it is possible to arrange things
such that the resulting theory has a larger symmetry group than that of the original torus.
This is due to the fact that the dimension of the lattice is what constrains the possible global
symmetries of the CFT (i.e. group actions). Continuing with the two-dimensional example,
such a lattice can have Z2 , Z3 , Z4 , or Z6 symmetry. However, for T 2 , the Narain lattice is
four-dimensional; it thus could potentially have a Z12 symmetry, which is not a symmetry of
T 2 . If we quotient by Z12 , the resulting CFT evidently does not possess the interpretation
of a NLσM with target space T 2 /G, so it is non-geometric. These constructions have been
studied in [42].
Monstrous moonshine
Even before phycisists had encountered orbifolds, mathematicians had been independently
studying asymmetric orbifold constructions with the aim of understanding conjectures about
the so-called monster group - the largest sporadic finite simple group (e.g. [44]). Recall that
all simple Lie groups belong to either an infinite series, SU (n), SO(n), Sp(n), or one of the
exceptional groups E6 , E7 , E8 , F4 , or G2 . This is similar to the classification of finite simple
groups, which can be split into a set of infinite families together with a finite number of
additional groups (known as the sporadic groups). The monster group happens to be the
sporadic group with the maximum order, namely
|G| ∼ 8 × 1053 .
(I.2.55)
For comparison, there are around 1080 atoms in the universe.
The monster group makes an unexpected appearance in the study of modular functions
which are complex functions with specific transformation properties under SL(2, Z) transformations.
44
An important modular function is called the j-function. The j-function is a meromorphic
invariant under SL(2, Z) with a single simple pole in the fundamental region of SL(2, Z) at
Im(τ ) → ∞.
The expansion of j reads
j(q) = q −1 + 744 + 196884q + 21493760q 2 + . . . ,
(I.2.56)
where the first term represents the simple pole at Im(τ ) → ∞. It turns out that the
expansion coefficients have a mysterious connection with the Monster group: Every coefficient
except the constant term 744, could be written as elementary sums of the dimensions
of some representations of the Monster group! For example, the smallest two irreducible
representations of the Monster group have dimensions 1 and 196883 which add up to 196884.
196884 = 196883 ⊕ 1 ,
(I.2.57)
In the 1970s, mathematicians noticed this connection [45] which motivated them to
construct the Monstrous Moonshine representation of the Monster group.
Now, let us go back to physics and see how string theory helps us to understand this
unexpected connection. Modular functions have a very natural place in string theory. Because
of modular invariance, they show up as partition functions of the worldsheet theory. If jinvariant were to be the partition function of a worldsheet theory, the Monstrous Moonshine
suggests that states of the theory fall into representations of the Monster group. In other
words, the theory probably has a Monster symmetry. Moreover, the q −1 term signals the
existance of a Tachyon, just like the Bosonic string theory. In fact, it turns out there is an
orbifold construction that reproduces the j-function as the worldsheet partition function!
This construction involves compactifying the 26d target space of the bosonic string to 2d
[46]. The resulting 24d internal space is chosen such that the left- and right-moving momenta
are elements of a particular 24d lattice, i.e.
24
|pL , pR i ∈ Γ24
L ⊗ ΓR .
(I.2.58)
The relevant conditions are that the lattice is even and self-dual, as required by modular
invariance, as well as that the shortest non-trivial vector has length p2L = 4. Recall that the
root lattices associated with the simple Lie algebras have vectors with a minimum length of
2. There is exactly one even self-dual lattice in 24 dimensions whose vectors all have length
p2L > 2, known as the Leech lattice Γ24
L [47]. Given the string compactification, we are free to
45
gauge the Z2 symmetry (the same symmetry present for T 2 ) and study the orbifold theory:
Γ24
L /Z2 ,
pL −→ −pL .
(I.2.59)
This is an asymmetric action which satisfies the level matching conditions in the twisted
sector. As before, the orbifold theory has a partition function of the form
Z=
1
2
1
+
1
g
1
In fact, it can be computed explicitly:
1
+ 1
2
Z = q −1 + A1 q 1 + · · · ,
+
g
A1 = 196884 .
g
g
. (I.2.60)
(I.2.61)
There is a single tachyon in the spectrum, hence the q −1 term with unit coefficient. The fact
that there is no O(q 0 ) term indicates that there are no massless particles in the spectrum.
The uniqueness properties of the j-function makes it easy to relate the partition function
to the j-function,
Z(q) = j(q) − 744 .
(I.2.62)
At the level of the string theory, this strongly suggest that the particle content of the
theory organizes into irreducible representations of the monster group – that is, the monster
group is the symmetry group of this system. It was later proven by Richard Borcherds that
the full vertex operator algebra (VOA) of the CFT is a generalized Kac–Moody algebra with
the Monster group acting on it [48].
2.4
Noncritical string theory
A key problem of the critical bosonic string is the existence of the tachyon, which spoils
the validity of perturbation theory. We seek a new background which admits a stable string
vacuum. Thus far, we have considered backgrounds with nontrivial curvature and B-flux,
but with a constant dilaton profile. Let’s now consider a d-dimensional flat Minkowski
background (Gµν = ηµν and Bµν = 0) but with a linear dilaton profile along a particular
spatial direction X i ,
φ(X) = QX i .
(I.2.63)
46
The nonlinear sigma model that corresponds to this choice of background is given by the
action
ˆ
1
√
(I.2.64)
S=
d2 σ g g ab ηµν ∂a X µ ∂b Xν + QR(g)X i .
4π Σ
In conformal gauge, the action reduces to that of d free bosons with the caveat that the stress
tensor is modified to
Tm = −(∂X µ ∂Xµ ) + Q∂ 2 X i .
(I.2.65)
The central charge changes accordingly:
cm = d + 6Q2 .
(I.2.66)
To cancel the ghost contribution, we set cm = 26 as usual. This fixes the value of Q to
r
26 − d
.
(I.2.67)
Q=
6
We can now use our favorite quantization method (e.g. BRST or light-cone) to determine
the string spectrum in this background. Crucially, we now find that the mass squared of the
tachyon (i.e. lowest mode) is
d−2
m2T = −
,
(I.2.68)
24
which vanishes for d = 2. We have thus found a string background where the tachyon is
stable!14 Such string theories where the initial background is not of critical dimension d = 26
are known as the noncritical string theory [7].15
In two dimensions, the moduli cannot be frozen because the boundary conditions of
scalar fields can be dynamically changed with finite energy. Thus, even though the theory
has a stable vacuum, we must now contend with a variable coupling without a convergent
boundary condition. This will create a problem since in regions where the effective coupling
λ(x) = eφ(X) grows large, we lose perturbative control of the theory. By imposing Weyl
invariance on the worldsheet of the theory we can find the tachyon profile of the background.
A profile that preserves Weyl invariance is
ˆ
1
√
i
d2 z gµeαX ,
(I.2.69)
δS =
4π
where α = Q1/2 is fixed by Weyl invariance (but µ is not). In this background, the region of
large coupling is now cutoff due to the tachyon background. The upshot is we can now do
14
More precisely, the lowest mode of the bosonic string transforms as a massless scalar, which is erroneously
but typically referred to as the tachyon.
15
An equivalent approach to the noncritical string is to sacrifice Weyl invariance and promote the Weyl
mode on the worldsheet to a dynamical field.
47
perturbation theory again, although the theory now depends on an additional parameter µ
set by the VEV of the tachyon field. Moreover, the worldsheet theory is now described by
an interacting CFT known as Liouville field theory [49, 50].
3
Superstring theory
The quantum bosonic string exhibits many interesting physical phenomena, but cannot be
probed beyond tree level due the presence of the tachyon. In this section, we will consider
a set of alternative models, collectively known as the superstring, with a supersymmetric
worldsheet theory. The superstring admits the requisite features of any quantum string
theory: a matter CFT, a ghost system, and BRST symmetry. Crucially, certain superstring
theories admit consistent truncations which render them free of anomalies and tachyons.
There are also certain models which exhibit spacetime supersymmetry.
3.1
Basics of the NSR formalism
The action of the superstring, in Polyakov form, describes N = (1, 1) supergravity on the
worldsheet [51, 52]. It is quite complicated, so we only focus on the action after gauge-fixing.
Similar to the bosonic string, we have d free scalars X µ and a pair of bc ghosts with spins
(2, −1) that arise from worldsheet diffeomorphisms and Weyl rescalings. We additionally
have d pairs of Majorana–Weyl fermions ψ µ and ψ̄ µ which transform as spacetime vectors.
There are also new commuting ghosts, namely the βγ ghosts with spins ( 32 , − 21 ), which arise
from gauge-fixing super-diffeomorphisms and super-Weyl transformations. The fields and
their data are described in Table I.3.1.
Altogether, they sit inside a free worldsheet action given by
ˆ
1 µ
1 µ¯
1
2
µ¯
¯ + b̄∂c + β ∂γ
¯ + β̄∂γ̄ .
SNSR =
d z ∂X ∂Xµ + ψ ∂ψµ + ψ̄ ∂ ψ̄µ + b∂c
2π
2
2
(I.3.1)
In total, the matter fields contribute cm = 3d/2 and the ghosts cgh = −15. To avoid the
appearance of a Weyl anomaly, we must take cm = 15 and so the superstring naturally lives
in d = 10 spacetime dimensions.
48
Field content
Conformal weight h
Central charge c
Xµ
0
1
2
(2, −1)
d
d
2
−26
( 23 , − 21 )
11
ψµ
(b, c)
(β, γ)
Table I.3.1: Field content in the gauge-fixed NSR worldsheet theory. The right-moving
counterparts have the analogous weights h̄ and central charges c̄.
The gauge-fixed worldsheet theory admits a residual symmetry known as N = (1, 1)
superconformal symmetry, which pairs every operator of weight h with another operator of
weight h+1/2 with opposite statistics. Both ψ µ and ψ̄ µ are superpartners of X µ , and similarly
for βγ and bc. The stress tensor T is no exception, and has a fermionic superpartner G of
weight 3/2 known as the super stress tensor or supercurrent. For the matter CFT, we have
1
Tm = −∂X µ ∂Xµ − ψ µ ∂ψµ ,
2
√
Gm = i 2ψ µ ∂Xµ .
(I.3.2)
From this we see that Gm admits the OPE
3
1
Tm (0) + ∂Gm (0),
2
2z
z
2
d
Gm (z)Gm (0) ∼ 3 + Tm (0).
z
z
Tm (z)Gm (0) ∼
(I.3.3)
(I.3.4)
The modes of G together with those of T , and their anti-holomorphic counterparts, then
obey the N = (1, 1) super-Virasoro algebra with central charge c = 15 [9]. Similarly, the
ghost stress tensor and supercurrent obey the same algebra, but central charge c = −15.16
Similar to the bosonic string, the gauge-fixed theory of the superstring possesses a BRST
symmetry with nilpotent charge
QB = cT + γG,
Q2B = 0,
(I.3.5)
where T, G are the matter+ghost (super) stress tensor.
16
The superconformal algebra naturally leads to the notion of a superconformal primary Φ, which is
annihilated by all of the raising modes of T and G. A superconformal primary sits in a multiplet with
operators of the form G−n Φ (with n > 0) known as superconformal descendants. A number of these operators,
including the superconformal primary, are themselves conformal primaries. For instance, ∂X µ is a descendant
of ψ µ , and so is b with β.
49
Similar to the free boson, we can expand ψ µ in modes ψrµ on the cylinder, which obey the
fermionic oscillator algebra:
{ψrµ , ψsν } = η µν δr,s .
(I.3.6)
However, unlike the free boson, the free fermion admits two types of boundary conditions
consistent with BRST symmetry: namely periodic or Ramond (R) boundary conditions and
anti-periodic or Neveu-Schwarz (NS) boundary conditions [52]. The associated modes are
different for the two choices, with r ∈ Z for R boundary conditions and r ∈ 21 Z for NS
boundary conditions, and so lead to separate Hilbert spaces. Furthermore, the choice of
boundary condition is independent for ψ µ and ψ̄ µ , and so the Hilbert space of the free
fermions decomposes into a total of four sectors, which we label by the choice of boundary
conditions: (NS,NS), (NS,R), (R,NS), and (R,R).17
As was the case for the bosonic string, it is simplest to extract the mass spectrum of the
superstring via lightcone quantization. In short, the ghosts and longitudinal modes of the
matter fields decouple, and we are left with the transverse fields (X i , ψ i , ψ̄ i ) for i = 1, . . . , 8.
The 8 scalars contribute −8/24 to the ground state energy, as before. The fermions contribute
different energies depending on the choice of boundary conditions. The NS sector ground
state, dual to the identity operator, contributes −8/48, which together with the bosonic
contribution adds up to −1/2. The mass of the NS sector ground state is thus
1
α′ 2
m =− .
4
2
(I.3.7)
We seem to have encountered the same tachyon problem as the bosonic string. Additionally,
i
the oscillators ψ−r
increase the weight by a half-integer r, and so the theory cannot be
modular invariant. As it turns out, both issues can be solved via a method known as the
GSO projection [53] which is forced on us by modular invariance.
3.2
Modular invariance and the GSO projection
While the different sectors are naively independent at the level of states, modular invariance
on the torus places additional constraints on which states are allowed. On the torus, the
fermions can be periodic (P) or antiperiodic (AP) along the temporal circle. This leads to
four choices of boundary condition, two for each circle. The different boundary conditions are
related by modular transformations. For instance, it is easy to see that a T transformation
17
The boundary conditions for ψ µ must be the same for all values of µ to preserve spacetime Lorentz
invariance as well as BRST symmetry on the worldsheet.
50
maps AA boundary conditions to PA boundary conditions:
τ →τ +1
−−−−→ P
A
.
(I.3.8)
A
A
An S-transformation always swaps the two circles, for instance:
τ →−1/τ
−−−−−→ A
P
.
(I.3.9)
P
A
Altogether, we see that AA, PA, and AP boundary conditions are interchanged under the
modular group of the torus, while PP boundary conditions are invariant.
Clearly modular invariance forces us to include all four sectors and as we will see including
everything fixes the problem. As we will now show the above sum leads to a consistent
truncation of states known as the Gliozzi-Scherk-Olive (GSO) projection [53]. We introduce
an operator G = ±1 known as the G-parity which implements changing boundary conditions
for fermions. The GSO projection amounts to keeping states with G = 1 and discarding
those with G = −1.
We define G-parity in the NS sector via
G = (−1)F +1 ,
(I.3.10)
where F is the fermion number (modulo 2). It can be expressed in terms of the fermionic
oscillators as
X
i
(I.3.11)
ψri ,
F =
ψ−r
r>0
The NS sector ground state |pi is defined to have F = 0, and so is thrown out. Excited states
i
receive F = +1 from each fermionic oscillator ψ−r
and F = 0 from each bosonic oscillator
i
i
α−n . Note that the first excited state, ψ−1/2 |pi, is preserved by the GSO projection. It has
m2 = 0 and transforms as an SO(8) vector 8v .
Next we analyze the R sector. Unlike the NS sector, the R sector possesses zero modes
which satisfy the Clifford algebra
{ψ0i , ψ0j } = δ ij ,
(I.3.12)
and so the ground state transforms as a Dirac spinor 8s ⊕ 8c under SO(8). In the R-sector,
the fermions contribute +8/24 to the ground state energy, and so the ground state is massless.
51
We define the G-parity of the R-sector through
G = Γ11 (−1)F ,
(I.3.13)
where the chirality matrix Γ11 is given by
Γ11 = 24 ψ02 ψ03 ψ04 ψ05 ψ06 ψ07 ψ08 ψ09 .
(I.3.14)
By definition, it acts as Γ11 = ±1 on states of definite chirality, and so the GSO projection
removes either 8s or 8c . Without loss of generality, we can choose G to act as +1 on
the 8s in the holomorphic sector, removing the 8c . For the antiholomorphic sector, we
then have a choice as to which chirality is removed from the R sector. This leads to two
(seemingly) inequivalent string theories, a nonchiral theory (type IIA) with N = (1, 1)
spacetime supersymmetry and a chiral theory (type IIB) with N = (2, 0). Note that for
both choices of GSO projection, all states of half-integer spin are removed, as required of
a modular invariant theory. Furthermore, spacetime supersymmetry is emergent (and is
notably before the GSO projection is enforced).
In total, there are four sectors of states in each theory depending on the choice of boundary
conditions, i.e. (NS,NS), (R,R), (NS,R), and (R,NS). The (NS,NS) and (R,R) sectors have
states with integer spins (spacetime bosons), whereas the (NS,R) and (R,NS) sectors have
states with half-integer spins (spacetime fermions). The massless states of the type IIA
theory consist of states in (8v ⊕ 8s ) ⊗ (8v ⊕ 8c ), whereas the type IIB theory has states in
(8v ⊕ 8s )2 , which can further be decomposed into the various irreducible representations of
Spin(8).
Let’s first focus on the spacetime bosons. The (NS,NS) sector leads to states of the form
8·9
8·7
+
− 1 = 1 + 28 + 35,
(I.3.15)
8v ⊗ 8v = 1 +
2
2
which are just the usual massless particles of the bosonic string, namely the dilaton, KalbRamond field, and dilaton, respectively. The (R,R) sector consists of states in the tensor
product of two 8-dimensional spinor representations. The bosonic represntations that appear
can be understood from sandwiching the SO(8) gamma matrices γ i between two spinors
ψ, χ, forming the invariants χγ i1 · · · γ ip ψ. An odd number of γ matrices leads to a type IIA
irrep, whereas an even number leads to a type IIB irrep. All of them are massless particles
corresponding to some p-form gauge field. For type IIA, we find
8s ⊗ 8c = 8 ⊕ 56
(I.3.16)
corresponding to a 1-form Cµ and a 3-form Cµνσ . Recall that the Dynkin diagram of SO(8)
52
has a Z3 symmetry which translates into a Z3 symmetry of the root system. The action of
the Z3 could be extended to the weights of any representation, mapping any representation
to another one with the same dimension. This transformation is called triality and it
permutes the three 8 dimensional representations 8v , 8s , and 8c . By applying the triality
transformation on (I.3.15), for type IIB, we find
8s ⊗ 8s = 1 ⊕ 28′ ⊕ 35′ ,
(I.3.17)
corresponding to a 0-form λ, an R–R 2-form Cµν , and a 4-form Cµνσρ . Notice that a 4-form
naively has 70 degrees of freedom. The 4-form corresponding to the 35′ in type IIB is in fact
self-dual, which removes the other half.
The torus partition function of the type IIA/B string theories can be determined after
imposing the relevant GSO projections. Note that for all cases, the NS and R sectors
contribute
Z
NS
Q
Q
1 A
1 q −1/2 n (1 + q n+1/2 )8 q −1/2 n (1 − q n+1/2 )8
Q
Q
=
−
=
(
n
n
2
2
n (1 − q )
n (1 − q )
Q
(1 + q n )8
1
1 A
n
Z =
+
0
=
16 Q
(
n
2
2
n (1 − q )
+P
R
P
).
+P
A
),
A
(I.3.18)
(I.3.19)
P
We can see that the inclusion of sectors with every boundary condition makes the overall
partition function modular invariant. The factor of 16 in Z R arises from the ground state
degeneracy, while the factors of 21 in both come from the GSO projection operator P =
1
(1 + G), which is inserted in the trace. Ultimately, we find that the partition function is
2
not just modular invariant, but it identically vanishes:
Z = Z NS − Z R = 0.
(I.3.20)
This is expected of the vacuum amplitude of a supersymmetric theory. Using similar logic,
it is possible to conclude that it vanishes for all genera.
3.3
Green-Schwarz superstring
Although the superstring enjoys spacetime supersymmetry, this is by no means obvious from
the point of view of the NSR formalism. Indeed, spacetime supersymmetry is visible in the
spectrum only after imposing the chiral GSO projection together on the matter and ghost
53
fields. The Green-Schwarz model of the superstring is an approach to formulate an equivalent
string theory on the worldsheet where spacetime supersymmetry is manifest [54].
In this formalism, the Nambu–Goto action for the bosonic string is replaced with a suitable
supersymmetric extension, where the matter field content consists of the usual bosonic fields
X µ as well as 32 anticommuting scalars SAα for α = 1, . . . , 16 and A = 1, 2. Taken together,
these fields are interpreted as superspace coordinates such that the X µ transform as an
SO(1,9) vector 8v and the SAα as two Majorana–Weyl spinors, either 16 or 16′ . Here, the
choice of string theory (type IIA or IIB) is chosen at the classical level by fixing the relative
chiralities of the SA .
The action is readily quantized in lightcone gauge, where the theory reduces to that
of 8 free scalars X i (z, z̄) and a set of free worldsheet fermions, S1a (z) and S2ȧ (z̄), where
a, ȧ = 1, . . . , 8. Unlike for the NSR string, the GS string does not require a GSO-like
projection. In fact, spacetime supersymmetry requires that the fermions be periodic on the
cylinder. Let’s first study the ground state energy. Each boson contributes −1/24 and
each fermion contributes +1/24, which cancel to give zero Casimir energy, as expected of a
supersymmetric theory. The zero modes of S1a and S2ȧ , which satisfy the Clifford algebra,
a
b
{S1,0
, S1,0
} = δ ab ,
(I.3.21)
commute with m2 , and so lead to a ground state degeneracy. For the holomorphic sector,
the ground state state consists of the vector |ii for i = 1, . . . , 8 and a spinor γai ḃ S a |ii. In
total, this corresponds to 8v ⊕ 8s . For the antichiral sector, we get 8v ⊕ 8s (type IIB) or
8v ⊕ 8c (type IIA). Thus, we see that we have reproduced the massless string spectrum as
the ground states of the worldsheet theory.
3.4
Examples of superstring compactifications
Recall that we can consider string compactifications on the circle, i.e. R1,8 × S 1 . Under
T-duality, the type IIA theory at radius R is mapped to IIB at radius 1/R. However, as
we just learned type IIB is a chiral theory whereas IIA is not. [55] At first glance, it seems
strange that the two theories could be dual under a change of the spacetime geometry. The
resolution is that, from the nine-dimensional perspective, there is no longer a well-defined
notion of chirality.
Exercise 4: Does IIA or IIB have parity symmetry, and if so which one?
Optional exercise: Try to make a simple argument why IIA on a circle of radius R is
equivalent to IIB on one of radius 1/R.
54
Up to this point we have not been so careful about the circle boundary conditions for
the spacetime fermions, but in fact we have been secretly choosing periodic conditions to
preserve supersymmetry. If we choose anti-periodic boundary conditions for fermions two
things happen; we break the supersymmetry and we get a tachyon at some radius [56]. The
tachyon would be a winding mode. Note that there would be no problem with this in field
theory because there would be no winding modes that could become tachyonic. This is an
example of how SUSY breaking has a lot more consequences in string theory than in field
theory.
Optional exercise: Consider string theory on R1,8 × S 1 . Show that if the fermions are
antiperiodic on S 1 , then for sufficiently small radius there is a tachyon in the spectrum.
In fact, is is more common than not for tachyons to arise whenever supersymmetry is
broken, although the two are not a priori related. For instance, string theory on R1,8 × S 1
with antiperiodic boundary conditions for the fermions leads to either tachyonic modes at
some unstable radius or an unstable dilaton suffering from tadpoles.
As a preview of what is to come later, we can also consider string theory compactified on
M × T 6 . This theory has N = 8 supersymmetry, which in the low energy limit becomes
N = 8 supergravity.
3,1
Optional exercise: Consider a 4d string compactification with N = 8 supersymmetry. How
many scalar are there? Verify the counting from type II theory.
3.5
The type I string
In our discussion of relativistic strings, we have thus far only mentioned closed strings whose
spacetime coordinates satisfy periodic boundary conditions
X µ (τ, σ + 2π) = X µ (τ, σ).
(I.3.22)
We can also talk about open strings, which topologically are equivalent to a line segment
with two endpoints. By convention, we take the open string worldsheet to be parameterized
by coordinates (σ, τ ) in [0, π] × R. More generally, the open string worldsheet is given by a
Riemann surface with boundary. For sake of clarity we will focus only on the bosonic fields,
though the same logic can be applied to the fermions and ghosts as well. Under a general
variation, the gauge-fixed Polyakov action now admits a boundary term
ˆ
σ=π
1
δS ⊃
dτ δX µ ∂ σ Xµ
.
(I.3.23)
2π
σ=0
55
Requiring δS = 0 as usual leads to two types of boundary conditions:
∂σ X µ (σ = 0, π, τ ) = 0 (Neumann),
X µ (σ = 0, π, τ ) = constant (Dirichlet).
(I.3.24)
Neumann (N) boundary conditions imply that there is no momentum flow across the endpoints
of the string, whereas Dirichlet (D) boundary conditions mean the string endpoints are fixed
in space and/or time. We are free to take a mixture of both types of boundary conditions
at each endpoint and for each X µ . Notice that (D) boundary conditions break Poincare
invariance by selecting a preferred point in spacetime. Although this may be somewhat
awkward, we will later discover that they arise naturally in the context of opens strings
ending on extended objects known as D-branes. For sake of brevity, we will only focus on
Neumann boundary conditions in this section.
Quantization of the open string is straightforward and closely parallels that of the closed
string. One primary difference is that now the X µ and (ψ µ , ψ̄ µ ) theories each admit only
a single set of oscillators. In light-cone gauge, we can label them as αni and ψrµ . There are
still two sectors for the fermions, an NS sector (r ∈ Z + 1/2) and an R sector (r ∈ Z).
Determining the physical states of the theory is straightforward and follows the standard
procedures (BRST, lightcone, etc). We are primarily interested in a closed+open string
theory, which is only consistent if we include a GSO projection on the open string sector [57].
In short, this follows from the closure of the the OPE of open+closed string vertex operators
(otherwise two open string vertex operators could lead to a closed string vertex operator that
gets projected out). The resulting massless spectrum is
8v ⊕ 8c ,
(I.3.25)
which parallels the left- and right-moving sectors of the type IIB string (except now there
is no tensor product). This is an N = 1 gauge multiplet, which includes a gauge boson in
the 8v and a gaugino in the 8s . One can check that this leads to a U (1) gauge theory in
spacetime.
The endpoints of the open string can be supplied with additional pointlike Chan–Paton
degrees of freedom [58]. A generic open string state |ψ; i, ji is now labeled by two indices
i, j = 1, . . . , N , representing the degrees of freedom at each endpoint. A conjugate state
should describe the same open string, and so these states are naturally captured by N × N
Hermitian matrices Hij . Of course, invariance of the inner product leads to a U (N ) symmetry
that acts as H 7→ U HU † . This is just the adjoint representation of U (N ). Although the
worldsheet CFT is unaffected by this method (the stress tensor, for instance, is unchanged),
the spacetime physics is completely different. For instance, there are now N 2 copies of each
gauge boson and gaugino, which together transform as an N = 1 vector multiplet in the
adjoint of U (N ): that is, there is a U (N ) gauge symmetry in spacetime!
56
The appearance of gauge theory for open strings with Chan–Paton factors is more than
a happy coincidence and turns out to be at the heart of non-perturbative effects in string
theory. Let us explore the physical meaning of this gauge theory. The Neumann/Dirichlet
boundary conditions that we impose on the end points of open strings in different directions
confine the endpoints to a submanifold in spacetime. This submanifold is called the D-brane
[55, 59, 60]. In perturbative string theory we view D-branes as part of the background rather
than dynamical objects. However, the ingredients of the string string theory background are
tightly constrained. For example, as we saw in bosonic string theory, the Weyl invariance
of the worldsheet CFT imposes certain equations of motion between the background fields.
Similarly, adding the D-brane imposes specific equations on the background fields and in
some way sources them.
To see why D-branes are fixed ingredients of the background, we can do a Heuristic
calculation to estimate the tension of D-branes. If the tension is very large in string units, it
is reasonable to approximate them as non-dynamical ingredients of the background.
The amplitudes in backgrounds with a D-branes involve vacuum diagrams with discs
ending on D-branes.
∼
AConnected
1
gs
Figure I.3.1: Vacuum amplitude in a background with a D-brane.
The one disc amplitude has a connected worldsheet with genus zero and one hole (g, b) =
(0, 1). Therefore, the amplitude goes like
A ∼ gs2g−2+b ∼ 1/gs .
(I.3.26)
Summing over all disconnected worldsheets to find the D-brane effective action is equivalent
57
to exponentiating the connected part [61]. We find
exp(−SD-brane ) ∼ exp(−gs−1 · VolD-brane ).
(I.3.27)
Therefore, the effective action has a prefactor that is proportional to 1/gs . In other words,
the tension of the D-brane is T ∼ 1/gs in string units. Note that for small gs ≪ 1 where the
string perturbation is expected to work, D-branes are very massive and can be approximated
to be non-dynamical.
It is also worth noting that Dirichlet boundary conditions and D-branes are required to
extend T-duality to open strings. This is due to the fact that T-duality acts on the left and
right moving components of the compact coordinate as
(XL , XR ) → (XL , −XR ).
(I.3.28)
Therefore, T-duality swaps ∂σ X and ∂τ X up to some factors. Consequently, T-duality
swaps a Dirichlet boundary condition along the compact direction with a Neumann boundary
condition and vice versa. In terms of D-branes, this means that a wrapped D-brane goes to
an unwrapped D-brane and vice versa [62].
Now let us imagine N coincident D-branes. Then each end of the open string can end on
any of the copies. In that case, we need a label in { 1, ..., n } to specify the boundary condition
of each open string state. These labels are the Chan–Paton factors! This observation teaches
us a very important lesson; Given that fields associated with open strings are confined to the
D-branes, we learn that there is a gauge theory living on the D-brane.
We now return to the type II closed string theories, which will soon be connected to the
open string story. The type II theories (I.3.1) respect a Z2 symmetry Ω that interchanges leftand right-moving fields, known as worldsheet parity. It acts on the closed string oscillators
as
Ω : αnµ ↔ ᾱnµ , ψnµ → ψ̄nµ , ψ̄nµ → −ψnµ .
(I.3.29)
Taking into account the GSO projection, only the type IIB theory continues to respect this
symmetry. We can then try to construct an unoriented string theory by trying to gauge
this symmetry. It can be shown that Ω preserves the NS–NS and R–R sector ground states,
whereas it swaps the NSR and RNS ground states. After projecting onto Ω-invariant states,
we are left with a new massless spectrum, which we can individually analyze in each of the four
spacetime sectors. In the NS–NS sector, the graviton and the dilaton survive whereas the Bfield is projected out. In the R–R sector, only the two-form gauge field remains, with the zeroform and self-dual four-form gone. Since the NSR and RNS sectors are exchanged under the
parity transformation, only a linear combination of the two survives the orbifold procedure.
The associated massless fermions that remain are a single Majorana–Weyl graviton and a
58
Majorana–Weyl fermion. Altogether, the theory therefore has N = (1, 0) supersymmetry.
This is not, however, the end of the story. This unoriented theory of closed strings is
inconsistent due to a one loop divergence. Moreover, a spacetime analysis of this chiral
theory reveals the presence of a gravitational anomaly. To cure both of these problems,
we can introduce unoriented, open strings. The open string theory of the previous section
also respects worldsheet parity, which nows acts on the oscillators as a phase rotation. The
projection to Ω-invariant states adds an extra constraint for the Chan–Paton degrees of
freedom. In particular, there are two allowed symmetry groups: SO(N ) or Sp(N ). As it
turns out, this theory of unoriented open strings is also inconsistent due to a gauge anomaly.
Miraculously, there is a method to cancel both the gravitional and gauge anomalies, known
as the Green-Schwarz anomaly-cancellation mechanism [63]. The gauge and gravitional
anomalies cancel if the gauge group is SO(32) or E8 × E8 . Thus, the open+closed unoriented
string theory with gauge group SO(32) is tachyon-free, anomaly-free, and supersymmetric!
It is commonly referred to as the Type I string. From the above construction we can see that
type I string theory is the orientifold of type IIB theory with 32 space-filling half D9-branes.
For convenience, we summarize the massless contents of its spectrum in Table I.3.2.
Massless spectrum (Type I)
sector
SO(8) × SO(32) irrep
particle content
closed (NS, NS)
closed (R,R)
closed mixed
(1, 1) ⊕ (35, 1)
(35′ , 1)
(8c , 1) ⊕ (56c , 1)
dilaton + graviton
2-form R–R field
dilatino + gravitino
open NS
open R
(8v , 496)
(8c , 496)
gauge bosons
gauginos
Table I.3.2: Massless spectrum of the SO(32) Type I string.
3.6
The heterotic string
In the previous section, we discovered that attaching Chan–Paton degrees of freedom to the
endpoints of open strings gave rise to gauge theories in spacetime. A natural question is
whether we can get gauge theories in ten dimensions from closed superstrings alone. In our
discussion on toroidal compactifications, we learned that global symmetries on the worldsheet
are associated with gauge symmetries in spacetime. Unfortunately, for the superstring, there
is no room for additional unitary degrees of freedom since the matter CFT already saturates
the central charge with cm = 15. What about the bosonic string? We can always take the
59
matter CFT to consist of 10 noncompact bosons and a unitary CFT with c = 16. However, as
was the case for toroidal compactifications, these extra degrees of freedom still usually have
the interpretation of (compact) spatial directions. To free ourselves of these constraints,
we consider a heterosis of the bosonic and superstring theories [64]. The worldsheet theory
consists of a left-moving bosonic CFT (cm = 26) and a right-moving SCFT (c̄m = 15). The
matter content consists of 10 noncompact bosons X µ , 10 right-moving fermions ψ̄ µ , and 16
chiral (left-moving) bosons. This theory is properly interpreted as living in ten spacetime
dimensions, with additional internal degrees of freedom.
Heterotic SO(32)
Although this definition of the heterotic string is perfectly fine, in these notes we will use a
more convenient formulation that replaces the chiral bosons with 32 left-moving free fermions
λA with A = 1, . . . , 32. At first glance, this appears to be an absurd thing to do: we’re
exchanging fields with different worldsheet statistics! However, this is feasible in 2d due to
a boson-fermion duality known as bosonization. The duality at least seems plausible, since
a free boson and Dirac fermion both have c = c̄ = 1. This correspondence continues to hold
at level of the chiral boson and two Weyl fermions, though the actual details of the duality
are somewhat involved. In any case, the worldsheet action is given by [64]
ˆ
1
¯ A .
¯ µ + ψ̄ µ ∂ ψ̄µ + λA ∂λ
Sm =
(I.3.30)
d2 z 2∂X µ ∂X
4π
As was the case for the ψ CFT, this is not quite enough data to fix the theory: we still need
to specify the boundary conditions of the λA . The action is invariant under a global O(32)
symmetry that acts on the fields as
U (O)λA U † (O) = OAB λB ,
O ∈ O(32).
(I.3.31)
Requiring O(32) invariance on the cylinder thus permits the boundary conditions
λA (σ + 2π, τ ) = OAB λB (σ, τ )
(I.3.32)
for any orthogonal matrix. Unlike the ψ CFT, we cannot use Lorentz invariance to restrict
the choice of this matrix. However, there are still other consistency conditions such as
the requirement of a modular invariant, tachyon-free theory. To achieve this, we need to
implement an additional chiral GSO projection on the states created by the λA . For the
heterotic string, it turns out that there are a total of 9 consistent choices of boundary
conditions and GSO projections. Three of these are tachyon free, and only two of the three
60
have N = 1 spacetime supersymmetry.18 These two theories, distinguished by their symmetry
groups, are called heterotic SO(32) (HO) and heterotic E8 × E8 (HE), respectively.19
The HO theory is singled out by choosing identical boundary conditions on all of the
fermions,
λA (σ + 2π, τ ) = ǫλA (σ, τ ), ǫ = ±.
(I.3.33)
We refer to the two sectors ǫ = ± as periodic (P) and antiperiodic (A), to distinguish them
from the R and NS sectors of the right-moving sector. This choice of BCs ensures that the
entire SO(32) symmetry is preserved.20 . There is a (−1)F fermion number symmetry,
{(−1)F , λA } = 0,
(I.3.34)
that commutes with all of the right-moving fields as well as ∂X i . For now, let’s focus on the
left-moving sector. The GSO projection we want to take is
(−1)F = +1,
(I.3.35)
where the A vacuum |0i is conventionally defined to have (−1)F = +1. We now analyze the
spectrum of the theory. In light-cone gauge, the left-moving part of the space of states is
constructed from the ∂X i oscillators αni as well as the λA oscillators λA
r , where by definition
the subscript indicates the oscillator weight. The ground state energy receives −1/24 for
each boson and −1/48 (AP) or +1/24 (P) for each fermion. The ground states energies are
thus
8
32
8
32
HO
EAP
=− −
= −1, EPHO = − +
= +1.
(I.3.36)
24 48
24 24
Naively, we then seem to conclude that there is a tachyon |0i in the theory. However, the
level-matching condition forbids this state since there is no tachyon in the GSO-projected
right-moving sector. We also see that the P sector does not contribute any massless degrees
of freedom. The relevant (massless) states coming from the AP sector are
i
α−1
|0i ,
B
λA
−1/2 λ−1/2 |0i ,
(I.3.37)
Under the SO(8) × SO(32) symmetry, these transform as (8v , 1) and (1, 496), respectively.
18
The third choice of GSO projection yields a non-supersymmetric string theory with gauge group SO(16)×
SO(16). It is chiral, anomaly-free, and tachyon-free [65, 66].
19
The emergence of both gauge groups is manifest in the bosonic formulation. The chiral boson CFT
is associated with a 16-dimensional Euclidean lattice, which is even and self-dual by modular invariance.
Miraculously, there only two such lattices: the root lattices of Spin(32)/Z2 and E8 × E8 ! The nonsupersymmetric theory can then be constructed as a Z2 orbifold of the E8 × E8 theory.
20
A more careful analysis of the representation theory shows that the global symmetry group is actually
Spin(32)/Z2 , where the Z2 differs from that of SO(32) ≃ O(32)/Z2 . However, the difference SO(32) and
Spin(32)/Z2 only appears through the allowed representations.
61
The analysis of the right-moving sector follows that of the NSR construction. The NS sector
has a single ground state |0i, and the R sector ground state transforms as an SO(8) spinor
|s̄i that transforms as
16 = 8s ⊕ 8c .
(I.3.38)
The GSO projection removes the would-be tachyon |0i as well as the antichiral spinor 8c .
The right-moving contribution to the massless spectrum is thus
8v ⊕ 8s .
(I.3.39)
The massless particle content is given by a tensor product of the two sets, which we list in
Table I.3.3. From
(8v , 1) ⊗ (8v ⊕ 8s , 1),
(I.3.40)
we recover the N = 1 SUGRA multiplet, which contains the usual massless bosonic fields
(graviton, dilaton, B-field) as well as a single gravitino. Additionally, the massless particles
include contributions from
(1, 496) ⊗ (8v ⊕ 8s , 1) = (8v , 496) ⊕ (8s , 496).
(I.3.41)
Here, we have a vector and a spinor transforming in the adjoint representation of the gauge
group, which form an N = 1 vector multiplet. We have therefore found a tachyon-free theory
of closed strings with N = 1 spacetime supersymmetry and an SO(32) gauge symmetry!
Massless spectrum (HO)
Sector
States
SO(8) × SO(32) irreps
Particle content
(−, NS)
j
i
|0; 0i
α−1
ψ̄−1/2
dilaton + B-field + graviton
j
B
λA
−1/2 λ−1/2 ψ̄−1/2 |0; 0i
(1, 1) ⊕ (28, 1) ⊕ (1, 1)
(8v , 496)
gluons
i
α−1
|0; s̄i
(8c , 1) ⊕ (56c , 1)
dilatino+gravitino
B
λA
−1/2 λ−1/2 |0; s̄i
(8s , 496)
gluinos
(−, R)
Table I.3.3: Massless spectrum of the SO(32) heterotic string.
62
Heterotic E8 × E8
Constructing the heterotic E8 × E8 theory is only slightly more involved. We now allow for
different boundary conditions between two equally partitioned sets of the fermions:
λA1 (σ + 2π, τ ) = ǫ1 λA1 (σ, τ ),
A1 = 1, . . . , 16,
(I.3.42)
λA2 (σ + 2π, τ ) = ǫ2 λA2 (σ, τ ),
A2 = 1, . . . , 16,
(I.3.43)
where ǫ1,2 = ±1. Consequently there are a total of four left-moving sectors, labeled by (ǫ1 , ǫ2 ).
This generically breaks the SO(32) symmetry to an SO(16) × SO(16) subgroup. There are
two independent fermion number symmetries on the left, (−1)F1 and (−1)F2 , which commute
with the opposite set of fermions, e.g.
[(−1)F1 , λA2 ] = 0.
(I.3.44)
On the left, we take the GSO projection
(−1)F1 = (−1)F2 = +1.
(I.3.45)
We now analyze the massless content of the theory. The tachyon of the (−, −) sector is still
projected out due to the level-matching condition and the (+, +) sector is purely massive.
However, now the (±, ∓) sectors contribute to the massless sector; indeed, their ground state
energies vanish by
8
16 16
HE
∓
= 0.
(I.3.46)
=− ±
E(±,∓)
24 48 24
In the (−, −) sector, the relevant states are
i
α−1
|0i ,
Bi
i
λA
−1/2 λ−1/2 |0i ,
(I.3.47)
where the fermionic oscillators must be taken from the same set due to the GSO projection. In
terms of SO(8) × SO(16)1 × SO(16)2 representations, the (−, −) sector therefore contributes
(8v , 1, 1) and (1, 120, 1), and (1, 1, 120), where 120 is the adjoint of SO(16). The (+, −)
ground state |σi transforms as an SO(16)1 Dirac spinor 256, which is projected to one of the
chiral spinors 128 by the GSO projection (this mirrors the R sector ground state of the rightmoving sector). Under the full symmetry group, it transforms as the (1, 128, 1). Similarly,
the (−, +) ground state transforms as the (1, 1, 128). Tensoring with the right-moving sector
gives the overall massless particle content,
(8v ⊕ 8s , 120, 1) ⊕ (8v ⊕ 8s , 1, 120) ⊕ (8v ⊕ 8s , 128, 1) ⊕ (8v ⊕ 8s , 1, 128),
63
(I.3.48)
where we have left out the usual N = 1 gravity multiplet. The key step is now identifying
the 120 ⊕ 128 irrep of SO(16) as the adjoint 248 of E8 . Therefore, these particles form an
N = 1 vector multiplet in the adjoint of E8 × E8 ! We list the full massless content of the
theory in Table I.3.4.
Massless spectrum (HE)
Sector
States
SO(8) × (E8 × E8 ) irreps
Particle content
(±, ±, NS)
j
i
|0; 0i
α−1
ψ̄−1/2
(1, 1) ⊕ (28, 1) ⊕ (35, 1)
dilaton + B-field + graviton
(±, ±, R)
i
α−1
|0; s̄i
(8c , 1) ⊕ (56c , 1)
dilatino + gravitino
j
Bi
i
λA
−1/2 λ−1/2 ψ̄−1/2
j
|σ; 0i
ψ̄−1/2
Bi
i
λA
−1/2 λ−1/2 |0; s̄i
|σ; s̄i
|0; 0i (8v , 248, 1) ⊕ (8v , 1, 248) gluons
(8s , 248, 1) ⊕ (8s , 1, 248) gluinos
Table I.3.4: Massless spectrum of the E8 × E8 heterotic string.
3.7
Superstring compactifications
Now that we’ve constructed all of the known consistent 10d superstring theories (type I and
II, heterotic), we can consider compactifications of spacetime of the form
R1,9−d × Md ,
(I.3.49)
where Md is a compact d-dimensional oriented Riemannian manifold, known as the internal
space. It can be shown that the compactified background preserves some of the supersymmetry
if there exists a covariantly constant spinor on Md ; that is, there exists some spinor ξ such
that
∇I ξ = 0 ,
(I.3.50)
where ∇I is the covariant derivative and I is an internal space vector index [67]. A convenient
way to determine if such a spinor exists is to analyze the holonomy group of Md . Recall that
the generic holonomy group for a Riemannian d-dimensional manifold is SO(d). Special
choices of the internal space will lead to “reduced holonomy,” where the holonomy group
is is some subgroup of SO(d). A covariantly constant spinor on Md exists if the minimal
spinor representation 8s of 10d contains a singlet in the decomposition under the holonomy
group. We now list some of the relevant manifolds (and their subgroups), enumerated by
their dimensionality, which preserve SUSY.
64
• d = 1: All 1d manifolds have trivial holonomy and so preserve supersymmetry. This
case has been covered previously.
• d = 2: The holonomy group is SO(2) ≃ U (1), and only the trivial subgroup gives
rise to a covariantly constant spinor. The only M2 with trivial holonomy is T 2 , and in
general the only Md with trivial holonomy is the d-torus, T d , which will always admit
covariantly constant spinors.
• d = 3: The holonomy group is SO(3) ≃ SU (2) which has no relevant non-trivial
subgroup. The only SUSY-preserving 3-fold is T 3 .
• d = 4: The holonomy group is SO(4) ≃ SU (2) × SU (2). The minimal spinor
representations are the chiral (2, 1) and antichiral (1, 2). Each transforms as a singlet
under the other’s SU (2). A 4-fold with SU(2)≃ Sp(1) holonomy is hyperKähler, and
is known as a K3 surface. One such example is the orbifold T 4 /Z2 . Thus, the two
SUSY-preserving options are T 4 and K3.
• d = 5: The holonomy group is SO(5). The analysis is identical to d = 4, where we
simply tensor with a circle (the holonomy subgroup is unchanged). The two options
are K3 × S 1 (or some twisted product) and T 5 .
• d = 6: The holonomy group is SO(6) ≃ SU (4). The obvious examples are T 6 and K3×
T 2 . Another interesting holonomy subgroup which admits a singlet in the branching
is SU (3) ⊂ SO(6). Manifolds of dimension 2N with SU (N ) holonomy are known as
Calabi–Yau N -folds CYN and admit a Ricci-flat Kähler metric (as proved by Yau). The
options are thus CY3 , T 6 and K3 × T 2 (or some twisted product).21
• d = 7: The holonomy group is SO(7). The obvious examples here are T 7 , K3 × T 3 , and
CY3 × S 1 . There is an additional manifold, the so-called G2 -manifold, whose holonomy
group is G2 ⊂ SO(7). For such groups, the spinor irrep decomposes as 8s → 7 ⊕ 1.
Manifolds M7 with such reduced holonomy are called G2 -manifolds.
• d = 8: The holonomy group is SO(8). In this case, there are two reduced holonomy
subgroups: SU (4), which corresponds to CY4 , and Spin(7), which leads to Spin(7)manifolds. Additionally, we have the obvious product manifolds such as G2 × S 1 and
CY3 × T 2 .
Exercise 5: Show that a holonomy group of Spin(7) of SO(8) for the compact manifold
preserves 1/8 of the supersymmetry.
21
For instance, we can consider the heterotic string theory compactified on CY3 , see e.g. [68].
65
Exercise 6: Show that the holonomy group of T 4 /Z2 where the Z2 acts as xi ∼ −xi is
SU (2).
Optional exercise: Determine the massless spectrum of type IIA on T 4 /Z2 . Note that
there are 16 fixed points of the Z2 action.
Optional exercise: Show that M8 = G2 × S 1 with G2 holonomy and M8 = CY4 with SU (4)
holonomy preserve the same amount of supersymmetry.
10 − d
Md
Fraction of
supercharges
10
Type IIA
Type IIB
Heterotic / Type I
N10 = (1, 1)
N10 = (2, 0)
N10 = (1, 0)
6
K3
1/2
N6 = (1, 1)
N6 = (2, 0)
N6 = (1, 0)
5
K3 × S 1
1/2
N5 = 2
N5 = 2
N5 = 1
4
K3 × T 2
1/2
N4 = 4
N4 = 4
N4 = 2
CY3
1/4
N4 = 2
N4 = 2
N4 = 1
CY3 × S 1
1/4
N3 = 4
N3 = 4
N3 = 2
G2
1/8
N3 = 2
N3 = 2
N3 = 1
3
Table I.3.5: Reduced SUSY in various superstring compactifications
By using the supersymmetric compactification in Table I.3.5 we can construct theories
with 4, 8, 16, 32 super charges in four dimensions. These are N = {1, 2, 4, 8}. What about
the other degrees of supersymmetries, N = {3, 5, 6, 7}? It turns out the algebra of an
interacting N = 7 supergravity automatically enhances to N = 8. The other supersymetries
can be constructed by asymmetric compactification on the left and right moving sector of
the worldsheet theory each giving rise to N = {1, 2, 4} supersymmetry.
N = 3 : (NL , NR ) = (1, 2)
N = 5 : (NL , NR ) = (1, 4)
N = 6 : (NL , NR ) = (3, 3) or (2, 4).
(I.3.51)
Note that the L-R subscript refers to the left-moving and right-moving sectors of the
worldsheet theory, and is not related to spacetime parity.
66
3.8
Model building
Treating the left- and right-moving modes separately, we get get chiral matter and gauge
theories in spacetime. However, we cannot construct a perturbative type II theory specifically
with chiral matter transforming in the SU (3) × SU (2) × U (1) gauge group, and so we will
not discuss such theories here.22 Instead, we can construct the relevant theories using the
E8 × E8 heterotic string. Focusing only on a single E8 , we have the decomposition
SU (3) × SU (2) × U (1) ⊂ SU (5) ⊂ SO(10) ⊂ E6 .
(I.3.52)
Note that the fundamental representation of E6 is a 27-dimensional representation whose
branching rule to an SO(10) is given by
27 = 10 + 16 + 1,
(I.3.53)
where 16 is the representation in which the standard model with neutrinos transform.23
Recall that the heterotic theory has an E8 × E8 gauge symmetry (see the Dynkin diagram
for E8 in Figure I.3.2). Using the Dynkin diagram, we can get the E7 and E6 subgroups by
contracting nodes. Casually speaking, the E8 by itself is not of phenomenological interest
but only via additional operations or such as breaking to smaller subgroups.
3
1
2
3
4
5
6
4
2
Figure I.3.2: A Dynkin diagram of an E8 .
By decomposing the adjoint representation 27 of E6 under the standard model subgroup,
one can get the corresponding representation as in (I.3.52). It is therefore rather elegant to
build a standard model from E6 , where the fundamental representation is 16. We can match
this 16 as the standard model matter with neutrinos.
Let us briefly investigate the equations of motion for the H flux, which in the low energy
22
In this context, we are considering only perturbative vacua here. The introduction of non-perturbative
methods like S-duality enables study of such theories.
23
In particular for SO(10) GUT, the matter content of the standard model comes from the 16, which is a
spinor representation of SO(10) [69].
67
effective theory satisfy
1
1
H = dB + ωcs (ωspin ) − ωcs (A),
2
2
1
(R ∧ R − F ∧ F ) ,
dH =
16π 2
(I.3.54)
where ω is the Chern-Simons terms, ωspin is the spin connection, R is the Ricci 2-form, and
F is the field strength for the Heterotic gauge group. Note that there are two terms that
compose dH: the first Pontryagin class
p1 (M ) ∼ R ∧ R
(I.3.55)
c2 (V ) ∼ F ∧ F.
(I.3.56)
p1 (M ) = c2 (V ).
(I.3.57)
and the second Chern class
If
´
dH = 0, then
In other words, string perturbation theory breaks down due to the presence of tadpole if
R ∧ R = 0 and F ∧ F 6= 0, or vice versa.
To preserve some supersymmetry, we take the internal space to have an SU (3) holonomy.
This implies that the spin connection ωµij transforms in the adjoint of SU (3), which enters
into the R ∧ R term through
R = dω + ω ∧ ω.
(I.3.58)
ω = ASU (3)
(I.3.59)
We can make the judicious choice
for the gauge connection of SU (3) ⊂ E8 , referred to as the identification of the gauge and
spin connection. With this construction, dH = 0 is automatically enforced. Note that this
is identical to the level-matching we covered in type II theories in Section 3. Hence, we can
conclude that we are in the same vacuum as in type II theories.
Recall that the adjoint representation of the E8 is 248 and so there are massless fields
transforming under SO(8) × E8 as
(8v ⊕ 8s ) ⊗ 248.
68
(I.3.60)
Note that E8 can be broken as
E8 −→ E6 ⊕ SU (3),
(I.3.61)
under which the 248 decomposes as
248 −→ (78, 1) ⊕ (27, 3) ⊕ (27, 3) ⊕ (1, 8),
(I.3.62)
where the first representation is the adjoint representations of E6 and the last representations
is the adjoint representation of SU (3). The two representations in the middle are of the
interest as they are the bifundamental representations. More precisely, the second and third
representations transform as 27-dimensional irreducible representations of E6 , where each of
them decomposes to SO(10) as
27 −→ 10 + 16 + 1.
(I.3.63)
Note that from considering the vectors on the right, we had
(8v ⊕ 8s ) ⊗ 8v ,
(I.3.64)
where the counting of the zero modes are given by the cohomologies of Calabi–Yau such that
#3 − #3 from the contribution from
(27, 3) ⊕ (27, 3).
(I.3.65)
One remark to make is that the internal Calabi–Yau only sees SU(3) part and this SU(3)
does not affect E6 .
The chiral spinors in 10d decompose in the compactification as
(10d) = (4d) ⊗ (6d) ,
(I.3.66)
where we will write γ1 , · · · , γ10 for the 10d γ-matrices. The product of the chirality of 4d
and 6d spinors is +1. 3 and 3 have opposite chirality spinors.
Here we have SO(4) ⊕ SO(6) and we count the net zero-modes of 3 and 3 from this
opposite chirality.
Recall that
/ = (D
/4 + D
/ 6 )ψ ⇒ (D
/ 4 + λ)ψλ = 0 .
Dψ
(I.3.67)
The eigenvalues λ act like effective mass parameters; the number of zero-modes is given by
the counting of the solutions when λ = 0.
The Calabi–Yau manifold is by definition Kähler. The Kähler function K(z i , z j ) leads to
69
a metric
gij = ∂i ∂ j K.
(I.3.68)
Kähler manifolds have U (d) holonomy, where d is the complex dimension of the manifold.
Calabi–Yau manifolds require SU (n) holonomy, which further requires the manifold to have
trivial first Chern class,
c1 (M ) = 0.
(I.3.69)
We have dz1 ∧ · · · ∧ dzn well-defined as this means U (1) ⊂ U (n) is trivial and hence is fixed,
i.e. dz1 ∧ · · · ∧ dzn is locally a constant, which implies the existence of a holomorphic n-form
Ω(n) . For a Calabi–Yau threefold we have a holomorphic three-form Ω(3) .
We can define complexified combinations of the γ-matrices, applicable in the case of a
complex manifold like a Calabi–Yau threefold:
γi − iγi+3 = γei† .
γi + iγi+3 = γei ,
(I.3.70)
The Clifford algebra is
{γei , γej } = 0 ,
{γei† , γej† } = 0 ,
{γei , γej† } = δeiej .
(I.3.71)
We can construct the following using the gamma matrices:
|0i ,
nCi γei† γej† |0i , · · · .
n γei† |0i ,
(I.3.72)
Also we can build
1,
dz ī ,
dz ī ∧ dz j̄ ,
dz ī ∧ dz j̄ ∧ dz k̄ .
(I.3.73)
We have (27, 3) where 3 gives triplets. Then
ψij (z) dz i ∧ dz j
(I.3.74)
gives the number of zero modes. This allows us to have the hodge number h1,1 to be given
as the number of zero modes in 3, which is identical to the number of zero modes in 27.
Recall that Calabi–Yau threefold has a well-defined three-form Ωijk . Using (3×3)Antisym =
3, we can get
ψij (z)dz i ∧ dz j ∧ dz k
70
(I.3.75)
along with the aforementioned threeform. Then we can conclude that h1,2 is given by the
number of zero modes in 27. The net number of zero modes is given by
h1,1 − h2,1 =
χ(Y )
,
2
(I.3.76)
where χ(Y ) is the Euler characteristic of the Calabi–Yau threefold. The ambiguity of the
choice for 3 and 3 interchangeably results in a mirror symmetry.
A strict Calabi–Yau threefold has two independent Hodge numbers. We have
h0,0 = h3,0 = h0,3 = h3,3 = 1
h1,0 = h2,0 = h0,2 = h0,1 = 0
h1,2 = h2,1
(I.3.77)
h2,2 = h1,1 ,
and the Euler characteristic is given by
χ = 2(h1,1 − h2,1 ) .
(I.3.78)
Optional exercise: Consider the type II string on T 6 /Z2 . Show that the theory is nonsupersymmetric and compute the massless particle content of the twisted sector. [Note: the
theory will be tachyonic. It is not known how to obtain a non-supersymmetric theory from
perturbative string theory that does not have either tachyons or rolling problems.]
Optional exercise: Consider heterotic string theory compactified on T 6 /Z3 , where the Z3
acts as zi → ωzi with ω 3 = 1. The Ramond sector gives the cohomology of the internal
manifold T 6 /Z3 . Show that the Hodge numbers are
h1,1 = 0 ,
and h2,1 = 36 ,
(I.3.79)
where the twisted sector contributes 27 to h2,1 and untwisted sector 9. The number of massless
modes of the resulting theory is determined by the cohomology of the internal Calabi–Yau
manifold.
Optional exercise: Recall an exercise that if is antiperiodic around circle, for a sufficiently
small radius, we get tachyons. Now add onto this exercise. Show that a winding mode
tachyon emerges at Hagedorn transition. Hagedorn transition is defined to be at TH = β1H
and is related to the asymptotic degeneracy
n(N ) = eα
√
CN
,
71
m2 ≃ CN,
n(m) ∼ eβH m .
(I.3.80)
The partition function
Z∼
X
n(m)e−βm
(I.3.81)
m
diverges at β = βH .
4
String dualities
As we discussed before, lowering the cutoff of a theory can lead to a set of different low-energy
theories with disconnected moduli. The reverse of this process (i.e. increasing the cut-off in
the UV completion) is expected to unify different low-energy theories. In other words, two
seemingly different low energy theories are in fact perturbative expansion of a single theory
at different corners of its moduli space. This idea is known as duality. By their nature,
dualities allow us to push past the perturbation theory and learn more about the underlying
non-perturbative theory, even if we do not know its exact formulation. String theory is filled
with dualities to the extent that whenever two lower dimensional constructions have the
same asymptotic geometry (e.g. Minkowski or AdS), gauge groups of the same rank, and the
same number of supercharges, they often turn out to be dual to each other. In the following
section, we will do a quick review of dualities in string theory and the lessons learned from
the underlying non-perturbative theory.
4.1
Supergravities in d ≥ 10
Let us start with a quick summary of supergravities in dimensions d ≥ 10. We have found
five 10d supergravities in previous sections by looking at the low-energy effective actions of
different string theories. These theories have different gauge groups, chiralities, and levels
of supersymmetries. In addition to these five 10d supergravities, there is a unique 11d
supergravity which we have not provided any string theory description of so far. These
theories and their field content are summarized in Table I.4.6.24
24
As we go to higher dimensions, the representations of supersymmetry get bigger.
Unbroken
supersymmetry in dimensions higher than eleven would require massless particles with spins greater than two
which would violate the Weinberg-Witten theorem [70, 71]. In 11d, there is only a single supermultiplet with
particles with spins ≤ 2. Thus, the field content is unique in 11d supergravity. Cremmer, Julia, and Scherk
were able to write down the action for this supermultiplet; [72] and many subsequent works have shown that
the theory is almost unique. See [73] for a review of this theory.
72
Properties
SUSY
Dimension
Bosonic content
Gauge group
IIB
N = (2, 0)
NSU SY = 32
d = 10
NS–NS: φ, Bµν , gµν
RR: φ̃, B̃µν ,
D̃µνρσ (F = F ∗ )
–
IIA
N = (1, 1)
NSU SY = 32
d = 10
NS–NS: φ, Bµν , gµν
RR: C1 µ , C3 µνρ
–
Heterotic
E8 × E8
N = (1, 0)
NSU SY = 16
d = 10
NS–NS: φ, Bµν , gµν
RR: Aµ ∈ e8 ⊕ e8
E8 × E8
Heterotic
SO(32)
N = (1, 0)
NSU SY = 16
d = 10
NS–NS: φ, Bµν , gµν
RR: Aµ ∈ so(32)
Spin(32)/Z2
Type I
N = (1, 0)
NSU SY = 16
d = 10
NS–NS: φ, gµν
RR: Bµν
NS+: Aµ ∈ so(32)
Spin(32)/Z2
11d
supergravity
N =1
NSU SY = 32
d = 11
gµν , Cµνρ
–
Theories
Table I.4.6: All tensors except gµν are antisymmetric. φ̃ is a pseudo scalar.
We will use Table I.4.6 throughout this section to check dualities between the low energy
effective field theories of different string theories.
4.2
T-duality for superstring theories
Let us start with T-duality [31]. In previous sections, we learned that compactifying the
bosonic string theory on a circle has a duality that switches momentum and winding modes
with each other. Let us try to extend the T-duality to supersymmetric string theories. From
the bosonic string theory we know how T-duality acts on the left and right moving part of
the bosonic worldsheet fields. Suppose X 9 (σ, τ ) is the compact coordinate and
X 9 = XL9 + XR9 ,
73
(I.4.1)
where XL and XR are respectively the left-moving and right-moving parts of X 9 . By looking
at the action of T-duality on the momentum and winding numbers n, w, and the radius R
we can see that T-duality acts on
XR9 = n ·
τ
− w · 2Rσ + · · ·
2R
(I.4.2)
as
XR9 ↔ −XR9 .
(I.4.3)
while keeping XL9 fixed. Given that worldsheet supersymmetry maps XR9 to ψ̃ 10 , T-duality
must act on the fermionic fields as
(ψ µ , ψ̃ µ ) → (ψ µ , −ψ̃ µ ).
(I.4.4)
Type II theories
Remember that the right-moving Ramond sector had degenerate ground states that decomposed
into two representation of the spacetime Lorentz group; 8s and 8c . Each of the two ground
state are annihilated by different operators. One by ψ̃ 8 +iψ̃ 9 and the other by ψ̃ 8 −iψ̃ 9 . Thus,
T-duality effectively switched the spacetime chirality of the ground state of the right-moving
Ramond sector. By definition, this maps IIA theory to IIB and vise versa.
Heterotic theories
Since in heterotic theories, the worldsheet does not have any right-moving fermions, T-duality
only acts on the bosonic worldsheet fields. If we look at the conjugate momenta associated
to the bosonic fields, PL is a 17 dimensional vector (16 from the 10d theory and 1 from the
extra circle) and PR is a one-dimensional vector corresponding to the compact circle. The
two vectors together must form an even self dual lattice Γ17,1 . However, since the space of
such lattices is connected, we find that the two theories share the same moduli space and are
the same. The two heterotic compactifications are dual theories expanding around different
corners of the moduli space of Γ17,1 Narain lattices.
Questions
Among the five ten dimensional supergravities in I.4.6, we were able to connect two pairs of
them by T-duality. The theories in each pair lie at different corners of the moduli space of a
74
unifying lower dimensional theory. This unification raises several natural questions. Can we
extend the web of dualities to the point that all perturbative constructions are connected?
Our study of T-duality was based on splitting the worldsheet fields into left and right
moving components. However, this cannot be done for open strings of the type I theory. In
fact the worldsheet parity acts on the boundary conditions of the open strings. In that case,
is there a way to T-dualize type I theory?
The dualities we studied so far are perturbative dualities, in the sense that the dual
theories become weakly/strongly interacting together. In other words, the moduli space of
the theories are overlapping and the duality can be proven to any order of perturbation.
The price of this control is that the dualities we discussed so far teach us little about nonpertubative ophysics. Can we potentially connect different corners with non-pertubative
(strong/weak) dualities?
We saw how moving towards the asymptotic of a lower dimensional theory can lead to
higher dimensional theory and unify them. Can we somehow apply the same idea to 10d
theories and find the 11d supergravity in the moduli space?
To answer these question we need to go back and do a closer examination of R–R (gauge)
fields in different string theories.
4.3
Branes
Let us begin with a general gauge theory in a spacetime of dimension d. From previous
experience we know that a U (1) gauge symmetry has an associated one-form gauge potential
A = Aµ dxµ .
(I.4.5)
It transforms under the gauge transformation as
A → A + dα
(I.4.6)
where dα(x) is the exterior derivative of an arbitrary function α(x) with appropriate boundary
conditions. From this we can construct the gauge-invariant, two-form field strength F = dA,
which consists of the electric and magnetic fields. It satisfies the homogeneous Maxwell
equations
dF = 0
(I.4.7)
by construction since d2 = 0.
The gauge field Aµ naturally couples to the worldline γ of a point-like (one-dimensional)
75
electric charge q through
ˆ
exp iq A .
(I.4.8)
γ
It is straightforward to check that this quantity is both Lorentz invariant and gauge invariant.
Given a gauge potential A, we can construct its magnetic dual by taking the Hodge dual
F̃ = ⋆F and solving for F̃ = dÃ. From the properties of the Hodge star, we observe that
F̃ is a (d − 2)-form and so à is a (d − 3)-form. In analogy with the point particle case, Ã
should couple to magnetic objects of charge p whose volume form matches the form of Ã, i.e.
objects with a (d − 3)-dimensional worldvolume Σ. It is easy to guess that such a coupling
should take the form
ˆ
exp ip à .
(I.4.9)
Σ
It appears that magnetically charged objects dual to electric point particles should extend
over d − 4 spatial directions. This is consistent with the well-studied case of d = 4, where
electric and magnetic charges are both point-like with zero spatial extent. For this case,
we also have that the sum of spatial dimensions of electric objects and their magnetic dual
objects is d − 4, which happens to hold for higher dimensions as well.
How does one measure the charges of objects in d dimensions? In the case of a pointlike particle, to find its total charge we surround it with a sphere S d−2 and integrate the
inhomogeneous part of Maxwell’s equations ⋆J = d ⋆ F over the ball comprising its interior.
Since F is a two-form, it is easy to see that J is necessarily a (d − 2)-form. We thus find that
the total charge of a point-like particle in d dimensions is
ˆ
⋆F.
(I.4.10)
Qe =
S d−2
This is just Gauss’s law for electric charge in d dimensions. By a similar argument, it is
straightforward to see that the magnetic dual of an electric particle is surrounded by a twosphere S 2 , and so its magnetic charge is measured by
ˆ
F.
(I.4.11)
Qm =
S2
Next let us turn our attention to extended objects with electric charge. Recall that a
p-brane is a (p + 1)-dimensional extended objects with worldvolume denoted by Σp+1 . It
naturally couples to a (p + 1)-form gauge field Ap+1 as
exp iQe
ˆ
Σp+1
76
Ap+1
!
.
(I.4.12)
Note that the gauge field transforms under p-form gauge transformations
Ap+1 (x) → Ap+1 (x) + dΛp (x).
(I.4.13)
That is, the gauge transformations are now generated by a p-form Λp (x), again with suitable
boundary conditions. In this case, we say that the p-brane is electrically charged under a
(p + 1)-form gauge symmetry. We can measure the charge of the brane by a generalized
version of Gauss’s law, namely
ˆ
Qe =
⋆Fp+2
(I.4.14)
S d−p−2
where Fp+2 = dAp+1 is the (p + 2)-dimensional field strength. From the point of view of the
directions orthogonal to the brane, the sphere S d−p−2 surrounds a point charge Qe .
The dual magnetic objects are defined in the same manner as before, coupling to a
(d − p − 1)-form magnetic potential Ãd−p−1 , with ⋆Fp+2 = dÃd−p−3 . It follows that the
objects should have dimension d − p − 3, and in particular they are (d − p − 4)-branes. (As
promised, the electric spatial dimension p and magnetic spatial dimension d − p − 4 add up
to d − 4.) To measure the magnetic charge, we surround the brane within a (p + 2)-sphere
S p+2 and integrate over the flux, i.e.
ˆ
Qm =
Fp+2 .
(I.4.15)
S p+2
Branes in string theory
The previous discussion was true of general gauge theories with charged, extended objects.
Let us now return to the appearance of branes in string theory. The massless p-form fields
in string theory give rise to gauge symmetries in spacetime. It remains to determine what
objects are charged under these fields.
All of the perturbative string theories (except type I) admit the “bosonic” cannon of
massless fields: the dilaton field, the metric, and the Kalb-Ramond NS–NS field also known
as the B-field (in type I, the B field comes from the R–R sector). The latter is a two-form
gauge field Bµν which couples directly (electrically) to the string worldsheet Σ as
ˆ
exp i B .
(I.4.16)
Σ
The B-field transforms under a spacetime gauge symmetry
B → B + dΛ
77
(I.4.17)
where Λ is a one-form gauge parameter. The gauge invariant field strength H = dB is often
referred to as the H-field or H-flux. From this point of view, the string is electrically charged
under the B-field. We can see this in more familiar language by taking the theory compactified
on a circle, say in the x9 direction, in which case the winding number of the string serves as
the conserved charge associated with a one-form Bµ9 . Note that the absence of a B-field in
NS–NS sector in type I theory implies that the winding number of the fundamental string is
not a conserved quantity in the compactified theory.
We saw that every electric object had a magnetic partner. It is natural to ask what objects
are magnetically charged under the B-field. The dual field strength is given by h̃ = ⋆H, and
so the dual gauge potential B̃ is necessarily a six-form field. Thus, the string is magnetically
dual to a five-dimensional object known as an NS five-brane. Unlike D-branes, which have a
tension proportional to 1/λ, it turns out that such objects have a tension that goes like 1/λ2
where λ is the string coupling.
Now we move onto the massless gauge fields originating in the R–R sector. On the
worldsheet they correspond to the R–R vertex operators, and so they do not naturally couple
to worldsheet. In other words, the string is not charged under these fields in the usual sense,
and whatever is electrically charged must necessarily be comprised of nonperturbative objects.
This is also true of the dual R–R fields, and hence of the associated magnetic objects as well.
The type I string is especially simple. It only has a single R–R two-form field Cµν that
couples electrically to 1d strings and magnetically to five-dimensional objects. The type IIA
theory has a one-form field Cµ (with 0d electric objects and 6d magnetic objects) and a
three-form field Cµνρ (with 2d electric objects and 4d magnetic objects). Similarly, the type
IIB string instead has even-form gauge potentials, with a two-form field Cµν (with 1d electric
objects and 5d magnetic objects), a four-form field Cµνσρ (with 3d electric and magnetic
objects), and a scalar C0 (with 7d magnetic objects and (-1)d electric objects).25 .
We can also consider the 11-dimensional supergravity theory, which turns out to have
deep connections with the 10d string theories. The 11d theory admits a single gauge
(11)
potential, namely the three-form Cµνρ . The objects charged under the three-form potential
are necessarily two-dimensional, and are generically referred to as membranes or M2 branes,
for short. From the usual dual argument, we also find that there should be five-dimensional
branes, referred to as M5 branes, which are the magnetic dual of M2 branes.
The appearance of these higher form gauge symmetries suggest the existence of some
charged branes. But how could we see these EFT branes in our microscopic perturbative
string description of supergravities?
To answer this question let us go back to T-duality and the only 10d supergravity that
25
These are called D(-1) branes which are not true states of the theory but rather a special class of
instantons which give non-perturbative corrections, beginning at order 1/λ, to the perturbative series.
78
was left un-T-dualized: the type I theory. This also happens to be the only 10d theory with
open strings. Open strings requrie boundary conditions and the reason T-duality is more
challenging for open strings is that it affects that boundary condition. T-dualizing in a given
compact direction switches the Dirichlet and Neumann boundary conditions with each other.
Therefore, suppose the T-dual of type I theory exist, it must include D(richlet)-branes of
lower dimension to account for the boundary conditions. This teaches us that D-branes are
essential to completing the picture of dualities. Luckily, these ”fundamental” objects turn
out to be the same as the macroscopic charged branes in the effective field theories. For
example, we can calculate the p-form charge of a D(p-1)-brane to see if it matches with the
fundamental EFT charge. It turns out it does [74]. Note that D-branes are part of the
background, therefore they are not perturbative objects. However, this does not mean they
are not dynamical objects either! The vanishing of the Weyl anomaly for the worldsheet
theory imposes certain equations on a background to be consistent. We can think of these
equations as equations of motions for background fields and D-branes. Equivalently, we can
study the interactions between D-branes by looking at the tree-level amplitudes involving
D-branes exchanging strings. For example, for two D-branes the leading contribution would
involve a cylinder connecting the two branes. Since the amplitude of a given worldsheet scales
like λ2g−2+b , at g = 0, b = 1 we get ∼ λ−1 . This signals that the tension of the fundamental
brane which, plays the role of the gravitational mass, in the Einstein frame is ∼ λ−1 (see [74]
for detailed calculation).
The fact that the tension of the D-brane is inversely proportional to the coupling constant
is another evidence that why they are non-perturbative objects. This is similar to the case
of gauge theory instantons which have actions proportional to 1/g 2 and therefore, are nonperturbative objects. One can estimate the action of non-perturbative objects from the
divergences of the perturbation series. Typically, in the presence of non-perturbative objects,
the perturbative expansion diverges after some point and the smallest term determines the
maximum resolution of the perturbation theory due to non-pertuabtive effects. For example,
going back to gauge theories, there are ∼ exp(O(l ln(l))) graphs with l loops and each of
them carry a factor of g 2l . Therefore, the l-loop amplitude goes like ∼ g 2l ∼ exp(O(l ln(l)))
which minimizes at a value of exp(O(−1/g 2 )). This is the amplitude of the gauge theory
instantons! Before the discovery of D-branes as non-perturbative ingredients of string thery,
Shenker did a similar calculation to show that their tension must go like 1/λ [75].
In the following, we use our knowledge of D-branes as non-perturbative objects of the
underlying theory to complete the web of dualities.
79
4.4
M-theory
Let us start with the 11d supergravity. Could it be that the 11d supergravity is just the lowenergy EFT of some corner of string moduli space? If so, it must be connected to theories
with the same number of supercharges like type II theories. It seems the answer to this
question is yes and the conjectural theory that has the 11d supergravity as its low energy
(11)
limit is called M-theory. The 11d supergravity theory consists of an 11d metric Gµν , a
(11)
three-form gauge potential Cµνρ , and a gravitino ψµα . It turns out that the EFT of the 11d
supergravity compactified on a circle yields the type IIA supergravity, i.e. the low energy
limit of the type IIA string theory [76].
M-theory and type IIA
The first item on our checklist is to match the (bosonic) field content of the two theories.
Recall that the massless content of the type IIA theory consists of a metric tensor Gµν , the
B-field Bµν , the dilaton φ, and the odd-valued gauge R–R gauge forms Cµ and Cµνρ . In
the compactified theory, the type IIA fields Bµν and Cµνρ originate from the dimensional
reduction of the 11d gauge potential. On the other hand, the dimensional reduction of the
(11)
11d metric Gµν should yield the 10d metric, a scalar, and a vector field. These correspond
precisely to Gµν , φ, and Cµ , respectively.
Next we can try and match the objects (perturbative and non-perturbative) in the two
theories. A string in 10d ought to be an M2 in 11d wrapped on a circle. Its radius R11 turns
out to be directly related to the expectation value of φ, i.e. the type IIA coupling. Reducing
the 11d gravity action on the circle (and only keeping the Einstein-Hilbert term) yields
ˆ
p
R11 d10 x e−2φM GM RM ,
(I.4.18)
where GM and RM are the 10d metric and its associated scalar curvature, as constructed
from the zero-momentum KK modes of the 11d metric. The field φM is a scalar arising
from the metric along the compact directions. Note that we have set the 11d mass scale to
Mpl = 1. The resulting theory is nonchiral, and so it is expected to be equal to the action of
type IIA supergravity
ˆ
p
1
10
−2φ
G s Rs ,
(I.4.19)
d
x
e
λ2
where λ = ehφi is the type IIA string coupling. Here we have written the type IIA action
in the string frame, where the dilaton multiplies the Einstein-Hilbert term. Weak coupling
in particular corresponds to small λ. It is clear that φM = φ. The metrics Gµν M and Gµν s
√
should be related by some field redefinition. Note that GM RM scales as G4M , and so if we
80
write Gµν M = f Gµν s then it follows that
R11 f 4 =
1
.
λ2
(I.4.20)
Thus, the two metrics are related by a rescaling of the form
Gµν M =
1
R11 λ2
14
Gµν s .
(I.4.21)
This has a consequence on energy measurements with respect to the two metrics. Let EM
be an energy measured using GM and Es an energy measured using Gs . From the above
expression we see that the two energies are related by
1
1
Es λ 4 = (R11 )− 8 EM .
(I.4.22)
Now let us return to the case of an M2 brane wrapping the circle, which results in a string
in 10d with some tension TM as measured in the M-theory frame (i.e. with respect to the
metric GM ). This compares to the tension Ts in the string frame as
1
1
Ts λ 2 = TM (R11 )− 4 .
(I.4.23)
which follows from (I.4.22) by dimensional analysis. In M-theory, there is only one scale (the
Planck scale). So, an M2 brane has a tension equal to one in Planck units. The circle has
radius R11 , and so the tension of the wrapped brane is given by TM = R11 . Note that the
string tension in the string frame is Ts = 1. We conclude that the radius of the circle and
the string coupling are related as26
3
R11
= λ2 .
(I.4.24)
We know that string perturbation theory in 10d breaks down as λ becomes large. This is
also the limit in which the eleventh dimension becomes relevant. So it makes sense that R11
increases with λ.
Recall that the KK reduction of the 11d metric yields a U (1) gauge field which is none
other than the type IIA R–R one-form field Cµ . The KK modes with nonzero (quantized)
momenta along the circle are charged under this symmetry, with the number of quanta being
the conserved charge. In the 10d nonchiral supersymmetry algebra of type IIA, this charge
appears as a central extension (it can also be obtained by dimensionally reducing the 11d
26
We have not been careful in tracking powers of 2π as well as the string length ℓs . However, one can check
that all powers of ℓs cancel out, and furthermore that equation (I.4.24) is correct including numerical factors
in Planck units.
81
supersymmetry algebra), which roughly speaking takes the form
{Q1α , Q̄2β } = −2PM
(I.4.25)
where Qiα for i = 1, 2 are the two sets 10d supercharges with chiral indices and PM is the KK
momentum. Following the supersymmetry algebra, one can derive a BPS bound relating the
charges of states in the theory to their masses. We will focus only on BPS states for which
the energy equals the KK momentum. In the M-theory frame, the energy of a KK excitation
with n units of momentum around the circle is thus
EM =
n
.
R11
(I.4.26)
The energy in the string frame is then
−1/8
Es = λ−1/4 R11
n
n
= .
R11
λ
(I.4.27)
As λ goes to zero, this blows up, so we do not see these states in string perturbation theory.
In closed string perturbation theory, the string coupling constant only appears as λ2g−2+b .
If we took the worldsheet to be a disk, we get λ−1 . In Type IIA, there are also D-branes
which can be treated as infinitely massive sources in open-string perturbation theory, where
the worldsheet has boundaries. These objects have been shown to be charged under the R–R
one-form as expected. The n in equation (I.4.27) corresponds to number of D-branes. The
states described by equation (I.4.27) are D0-branes because they couple to the R–R 1-form.
Now we consider the unwrapped M2-brane extending in the 10-dimensions of type IIA.
In the M-theory frame, its tension is
M2
= 1.
(I.4.28)
TM
Using equation (I.4.22), the tension of this object in the string frame is
TsM 2 = λ−1 .
(I.4.29)
This object is a D2-brane.
The main lesson is that as the string coupling goes to infinity, the type IIA theory becomes
eleven-dimensional. The Lorentz group SO(9,1) becomes extended to SO(10,1).
The degrees of freedom of a fluctuating brane correspond to a scalar field that lives on
the brane. The number of scalar fields living on a p-brane corresponds to the number of
transverse directions, d − (p + 1).
Suppose that we have a D-brane, which by definition is where open strings can end.
These endpoints of the open string can carry additional pointlike degrees of freedom known
82
as Chan–Paton factors. These degrees of freedom are charged under a one-form gauge-field
A which lives on the brane. Now consider the B-field, which couples to the string worldsheet.
We expect that the transformation
B → B + dΛ
(I.4.30)
for a one-form Λ should be a gauge symmetry that leaves the worldsheet action invariant.
This is true for closed strings, but fails to hold for worldsheets with boundary. The resolution
is that A must live on the boundary, i.e. on the D-branes, and transform in an opposite
manner,
A → A − Λ,
(I.4.31)
to compensate. For a stack of N coincident D-branes, we can use a naive counting argument
to show that the gauge symmetry should be at least U (1)N . It turns out that the gauge
group is actually U (N ). This follows from the fact that there are N 2 different ways for the
open string to end on the branes at the level of perturbation theory. The massless states of
the perturbative open string in the type II theories contains a single 8v , which is governed
by a 10-component field. On a Dp-brane, this field splits into a p + 1-dimensional gauge field
living on the brane and 9 − p scalars which describe the fluctuations of the brane. This gauge
field transforms under the nonabelian U (N ) gauge symmetry.
Recall that we have constructed an eleven-dimensional M-theory which when compactified
on an S 1 , gives the type IIA theory. We have identified the radius of the circle and the IIA
string coupling constant are identified as
R3 ∼ gs2 .
(I.4.32)
The three-form field Cµνρ is associated to the M2-brane, which is a 3-dimensional world
volume object. Then this M2-brane wraps around the S 1 yielding a string, which is identified
as IIA string. Similarly, we can have an M2-brane not wrapping around, which as discussed
above has a tension ∼ λ−1 and corresponds to D2 brane.
The magnetic version of the M2-brane is the M5-brane in M-theory. Let us consider
an M5-brane around this S 1 . Then we get D4-branes. When M5-branes are not wrapping
around the circle, we can consider the other 5-brane in M-theory: NS 5-brane, which is a
magnetic dual object to the string. In M-theory, there are also D6-branes and D8-branes.
First of all, D6-brane is the Kaluza-Klein monopole, which happens when the circle shrinks
at a point.
Consider the Taub-Nut geometry, which looks like R3 × S 1 asymptotically, where the
circle shrinks in the middle. Then such a point on R3 is the D6-brane, as represented in
83
Figure I.4.1. The D6-brane is a codimension-three object. On the other hand, D8-brane is a
codimension-one object, which dramatically impacts the global topology of the space and is
more difficult to describe in M-theory.
R3 × S 1
Taub-Nut
R3
D6-brane
Figure I.4.1: D6-brane is the object where the circle shrinks from the Taub-Nut space.
When we consider N number of D-branes, we get theory with a U(N) gauge symmetry.
The fact that one of them having a gauge symmetry is related to the fact that the B-field
that the strings couples to has a gauge symmetry. On the boundary of the worldsheet, we
get a gauge field and the strings are ending with s Dirichlet boundary condition. This is then
related to the twisted sectors between the branes. D-branes breaks half of the supersymmetry.
Thus we get the analog of having 16 supercharges via
(8v + 8s ) ⊗ (8v + 8s ),
(I.4.33)
with only half conserved. For example, D3-branes have a vector and six scalars:
2V + 6(0),
(I.4.34)
which adds up to have eight degrees of freedom. This gives in turn an N = 4 super Yang–Mills
with a U(N) gauge group.
M theory and type IIB
Now that we connected M-theory and IIA, we know that they live in the same moduli space.
However, we know that IIB and IIA share that property too because they are T-dual to each
other. Therefore, IIB and M-theory must also share the same moduli space. Let us try to
connect them directly to each other instead of taking the long route through IIA [77]. In
order to find the connection we follow the duality chain between M-theory on T 2 to IIB on
S 1 as represented in Figure I.4.2.
84
M-theory
M-theory
S1
IIB theory
⇒
IIA theory
S
1
9d theory
S
IIB theory
S
T2
1
1
9d theory
Figure I.4.2: M-theory and IIB theory yield the same 9d theory
From Table I.4.6 we know that type IIB theory has the following bosonic field content,
λs ,
Bµν ,
gµν ,
χ0 ,
eµν ,
B
e µνρλ ,
D
(I.4.35)
where the first three are from the NS–NS sector and the last three aref from the R–R sector
that are coupled to D-1-brane, D1-brane, and D3-brane respectively. The scalar fields can
naturally be composited and to become complexified as
τ =χ+
i
.
λs
(I.4.36)
Then we have three real parameters for a type IIB to be compactified on a circle – one
complex coupling parameter and one real parameter that is the radius
(RIIB , τ ).
(I.4.37)
On the other hand, M-theory is also parametrized by three real parameters:
(A , τe),
(I.4.38)
τe = τ.
(I.4.39)
where A is the area and τe is the Teichmüller parameter of torus and has a symmetry under
τe −→ τe + 1. This χ is the gauge degrees of freedom that is periodically valued. As we go
around the 7-brane (which has a codimension-two world volume), χ shifts by 1. Hence, τ
shifts by 1. It follows that this τ respects the symmetry of τ −→ τ + 1. Then it has the same
symmetry with τe from M-theory and hence we relate these two τ s:
Then we relate the other real variables,
1
= A,
R
85
(I.4.40)
where R is the radius from IIB theory and A is the area from M-theory.
In fact, the τe has a full SL(2, Z) symmetry, not just a shift symmetry. Suppose we take
χ = 0, then
λ −→
1
,
λ
(I.4.41)
which gives a strong-weak duality. Indeed, type IIB theory is self-dual under the strong-weak
duality.
eµν , which, respectively, coupled
Recall that Type IIB had two-form fields Bµν and a B
electrically to the fundamental string (also called the F1-brane) and the D1-brane. Under
the SL(2, Z) symmetry transformation
λs →
1
,
λs
(I.4.42)
these two fields are exchanged. This transformation swaps the tensions of the fundamental
string and the D1-brane, and thus it is necessary that the potentials coupled to these extended
e µνρλ ,
objects are also exchanged. One the other hand, there is only a single four-form field, D
and thus it must map to itself under the transformation (I.4.42). That is, the D3-brane
which couples to the four-form must be self-dual under the given transformation. As a
consequence of this it is immediate that the U (N ) N = 4 super Yang–Mills theory living on
the worldvolume of a stack of N D3-branes must be invariant under an SL(2, Z) symmetry, as
the D3-branes are simply mapped onto themselves. The worldvolume theory enjoys invariance
under the transformations
τ → τ + 1 , τ → −1/τ ,
(I.4.43)
of its complexified coupling constant, τ . This gives the S-duality of the N = 4 U (N ) SYM
in d = 4. Since the D5-brane and the NS5-brane are the magnetic duals of the fundamental
string and of the D1-brane they are also necessarily exchanged by the transformation (I.4.42).
This duality, which is called the S-duality of Type IIB string theory, is changing the
fundamental string into what one would perturbatively think of as a composite, heavy, object,
the D1-brane. We note that because the duality replaces fundamental objects with composite
objects one has to be careful with the definition of a Feynman path integral – to perform
the integral one must pick a particular duality frame, and integrate over the fundamental
degrees of freedom in that frame. The notion of what is light, or fundamental, that goes into
the definition of the path integral is not necessarily a duality invariant notion.
We now consider how these two different two-form fields of Type IIB arise by considering
M-theory compactified in a torus. That is, we will consider a compactification of Type IIB
on an S 1 , and look at the components of the two 10d two-forms that do not extend along
86
the S 1 , and thus they behave like 9d two-forms. These equally can be understood from the
perspective of the M-theory three-form, Cµνρ . To get a two-form one of the directions in
Cµνρ must lie along one of the directions inside the T 2 on which we are compactifying. A T 2
eµν arise
has two distinct one-cycles, the A and B cycles, and the two two-forms Bµν and B
from wrapping Cµνρ with one direction along each of these cycles. Realizing the SL(2, Z)
transformation (I.4.42) as a modular transformation of the complex structure of the torus, one
can see that the A and B cycles are exchanged. Generally this process involves wrapping the
M2-brane, which couples electrically to the three-form, along the cycle pA + qB of the torus.
For (p, q) = (1, 0) the resulting string is the fundamental string, and (p, q) = (0, 1) is the
D1-brane. The SL(2, Z) symmetry of the torus can be thought of, in this way, as generating
the SL(2, Z) self-duality symmetry of Type IIB from M-theory. M2 branes wrapping on
a general (p, q)-cycle (with p and q coprime) gives rise to a bound state of F-strings and
D1-branes.
Exercise 1: show that upon compactification on an S 1 and T-duality, D-brane dimensions
change up or down by one unit, depending on whether or not the brane wraps the S 1 . That is,
how do the D-branes of Type IIA transform into the D-branes of Type IIB under T-duality?
[Hint: if you consider a Dp-brane in one Type II theory and compactify on an S 1 which is
orthogonal to the Dp-brane worldvolume then the p does not change. However Type IIA
admits only supersymmetric branes with p even, and Type IIB only with p odd; thus the
T-duality should change the parity of the dimension of the brane worldvolume.]
Note: it is not true that the 11d M-theory picture is always the most useful way to study
string theory. In M-theory on T 2 with area A gets mapped to Type IIB on an S 1 with radius
R = 1/A .
(I.4.44)
This is consistent with the fact that the Kaluza–Klein modes of Type IIB, which are the
winding modes in Type IIA language, which in the M-theory uplift is then a M2-brane
wrapping also the other cycle, i.e. wrapping the entire torus. If we consider M-theory on a
T 2 and we shrink A → 0 then the theory is not 9d, as one would naively think, but because
of (I.4.44) it is, in fact, 10d Type IIB string theory. In this way M-theory would naively
miss Type IIB, and knowing about T-duality of Type IIA is an additional ingredient which
is obscured from the pure M-theory point of view.
Note that something strange happened in the type IIB/M-theory duality: they both
appear in an asymptotic corner of the moduli space of a 9d theory (ln(A) ∼ − ln(RIIB ) →
±∞). However, the dimensions of these two theories are different! In one corner we get a
10d theory while in the other corner we get an 11d theory! This shows that dimension is not
a good guiding principle to classify disconnected theories in quantum gravity.
87
4.5
Completing the web of dualities for NSU SY = 16
Thus far we have connected together all of the maximal supergravity theories (32 supercharges),
being (the massless sector of) Type IIA, Type IIB, and M-theory. We now want to include
the 10d supergravities with N = 1 supersymmetry, that is, the Type I and heterotic theories.
We have already established that the two heterotic theories, with gauge groups SO(32) and
E8 × E8 are related via S 1 compactification to 9d. Both Type I and heterotic SO(32) have an
SO(32) gauge group, which motivates us to search for a relationship amongst these theories
already in 10d.
To show this we first give a different perspective on Type I.
Type IIB construction of the type I theory
Type I is type IIB in 10d, orientifolded by the parity operator Ω, that reverses the orientation
on the worldsheet [78]. The invariant subspace of the spacetime is the “orientifold plane”,
which is 1+9 dimensional in this context, and thus is known as the O9-plane. The O9-plane
carries −32 units of D9-brane charge, and thus the theory would be inconsistent via a Gausslike law unless the charge is cancelled off by the inclusion of 32 D9-branes. This is the brane
interpretation for why Green–Schwarz found that the requirement for anomaly cancellation
is was an SO(32) gauge group – the 32 D9-branes naively generate a U (32) gauge group, but
this is quotiented to SO(32) by the parity reversal of the O9-plane.
Type I and heterotic SO(32)
Now we are ready to discuss the relation between Type I and heterotic SO(32) [76]. The
latter has an effective action from the genus zero worldsheet like
ˆ
1
Rh + Fh2 + · · · ,
(I.4.45)
2
λh
where Fh2 = g νβ g µα Fµν Fαβ comes from the field strength of the SO(32) gauge potential. In
Type I the SO(32) gauge symmetry comes from the open string sector and so the effective
action is instead of the form
ˆ
ˆ
1
1
RI +
FI2 .
(I.4.46)
λ2I
λI
The different scaling of the coefficients of the F 2 terms tells us immediately that these theories
cannot be directly the same, so let us compare the scaling relations. Since
R ∼ g4 ,
F ∼ g3 ,
88
(I.4.47)
it can be seen that the scaling gives
gh4
gI4
=
λ2h
λ2I
and
gh3
gI3
=
λ2h
λI
⇒
λI
=
λh
gI
gh
2
,
(I.4.48)
⇒
λI
=
λ2h
gI
gh
3
.
(I.4.49)
λI =
1
.
λh
Putting this altogether one finds
(I.4.50)
This tells us that if we begin with the weakly coupled heterotic SO(32) theory and we move
to strong coupling then there is a dual weakly coupled description in terms of Type I theory,
and vice versa. This is an S-duality between the two theories, which is ideal as we have a
method to understand the theory at both strong and weak coupling.
In fact, we cannot have two good description at the same limit of the moduli of the
weakly coupled theory. In other words, we always have separate understandings at the
opposite limits.
From a type I theory, consider an example where we stretch a D1-brane on the (1 + 1)d
spacetime with 32 D9-branes from the theory. Then there is a string coupling the D1 brane
and D9 branes. By looking into this sector, we can study the lightest mode to be a fermion
for each D9-brane coupled to the D1-brane. Therefore, the D9 branes give us 32 fermions
living on the D1-brane. Moreover, on the D1-brane we have (0, 8) supersymmetry, which we
can identify to the right movers of the heterotic string theory. Hence we can get a glimpse
of the duality between the type I theory and the heterotic theory from this picture as well
where the type I D1 brane is dual to the heterotic string.
So far we have connected type I theory and heterotic theory with SO(32) via duality,
but not for the heterotic theory with E8 × E8 . We can connect both heterotic theories since
when compactified on a circle, they result in the same 9d theory. However, heterotic theory
with E8 × E8 is not directly linked with type I, which requires a perturbative understanding
of the theory. This is because its string coupling constant diverges to describe its relation to
type I theory.
M-theory and heterotic
First we showed that all the theories with 32 supercharges share the same moduli space in
the sense that they correspond to different corners of a single moduli space. Next we showed
that the 10d theories with 16 supercharges share the same moduli space as well. Since type I
89
has an orientifold construction from type IIB theory, it is natural to ask if one can construct
theories with 16 supercharges directly from IIA and M-theory as well. If the dualities are
real, such a construction should follow from the chain of dualities. Hořava–Witten proposed
an M-theory construction that is placed in the same moduli space of E8 ×E8 Heterotic theory
[76, 79]. The Hořava–Witten theory is thought to describe the strong coupling limit of the
E8 × E8 Heterotic theory.
HetSO(32)
HetE8 ×E8
S1
9d theory
S1
Figure I.4.3: Heterotic string theories on a circle are identical
From M-theory on a circle, we get a type IIA theory by construction. Let us consider to
mod out the circle by Z2 and orbifold the theory. With this setup one can check that half of
the supersymmetry survives:
X 11 −→ −X 11 ,
γ 11 −→ −γ 11
(I.4.51)
in the language of spinors. Then such a γ will project out half of them, resulting in 16
supercharges from 32 supercharges before. We have two singularities as orbifold points.
some localized degrees of freedoms can live on these points. there are two (9 + 1)-dimensional
spaces. We can have each E8 to live on each wall. This is called Hořava–Witten construction.
The coupling of the heterotic strings is the radius.
S 1 /Z2
E8
E8
Figure I.4.4: An S 1 with Z2 orbifold points for Hořava–Witten construction
M-theory has a membrane, M2-brane. Heterotic theory on the other hand has no membranes.
So one might be puzzled how they can be mapped. In 11d supergravity, we have Cµνρ . What
happens to this in 10d? In order to keep the term C ∧ G4 ∧ G4 to be constant under Z2 ,
we have C → −C while preserving C11νρ . This C11νρ is Bνρ in heterotic theory. M2-branes
wrap the fundamental string that sees both E8 on the orbifold points, and hence has E8 × E8
current algebra. It follows that the wrapped M2-branes survive but not the unwrapped
M2-branes, and for M5 branes, the opposite happens.
90
Extending dualities to lower dimensions
So far we have connected M-theory, type IIA theory, type IIB theory, heterotic theories, and
type I theory. We can use string duality further down to lower dimensional theories. For
example, we can consider an M-theory compactified on K3 surfaces. This gives a 7d theory
with half amount of supersymmetries. Similarly, heterotic theory compactified on a T 3 gives
a 7d theory. Either heterotic theories result in the same theory in 7d theories as they were
already the same in 9d. We can construct to see if these 7d theories are dual to each other.
First of all, they have the same number of supersymmetry. The moduli space of the the
heterotic theory on a T 3 is given by
SO(19, 3)
× R+ ,
SO(19) × SO(3)
(I.4.52)
where the first part provides Lorentzian Narain lattice and R+ controls the strings coupling.
On the other side, M-theory on K3 surfaces, we can see the Hodge numbers for the K3
surfaces to be
h1.1 = 20 = 19 + 1,
(I.4.53)
so there are 19 complex deformations as one is trivialized by hyperkähler rotations mixing
h1,0 , h1,1 , h1,1 . Hence, we have 19 complex and 19 real deformations with 1 hyperk”ahler
rotation. Hence we have (19 × 3) + 1. This looks like we get the same moduli space
SO(19, 3)
× R+ ,
SO(19) × SO(3)
(I.4.54)
where we map both R+ of both moduli spaces. The bigger volume then means bigger coupling
for the other theory, which is reasonable. Hence we can have a map between M-theory on
K3 and heterotic theory on T 3 .
Let’s go down to six dimensions now. We can now ask about Type IIA compactified on
a K3 surface. This gives a 6d theory which we have already observed has a moduli space
SO(20, 4)
× R+ .
SO(20) × SO(4)
(I.4.55)
Similarly one observes that heterotic on T 4 has the same moduli space. These two sixdimensional theories are related by a strong-weak duality that maps between Type IIA on
K3 and heterotic on T 4 . This is particularly interesting as it relates a compactification on
the curved internal manifold, the K3, with one on a flat manifold, the T 4 ; in this way, all
of the intricate geometry of the K3 surface is captured dually in a straightforward toroidal
91
compactification of the heterotic string.
One immediate question is how the non-abelian gauge symmetry of the heterotic theory
is replicated in the compactification of the type IIA theory, which has no perturbative nonAbelian gauge symmetry, on K3. We will explain this later in this section.
4.6
F-theory
We have seen that we have dualities from heterotic theory on T 4 and type IIA theory on K3;
similarly, from heterotic theory on T 3 and M-theory on K3. One can speculate as to whether
this pattern uplifts further, and whether there is a duality between heterotic on T 2 and some
uplifted 12d theory on K3. Naively this will not be possible as there does not exist a 12d
supergravity theory that we can compactify on the K3 for the right-hand-side of the duality,
however, there is a hint that this might be possible. We take M-theory on T 2 which we said
is related to Type IIB on an S 1 . There is a limit where the raduis of the S 1 goes to infinity,
which is where the area of the T 2 goes to zero. While the T 2 is of zero volume, the data of it
is not completely absent from the Type IIB, for instance the complex structure mode of the
torus is a part of the IIB theory. In this way one can think of Type IIB has a 12d theory,
where two of the directions look like a zero-area torus.
Now we can consider a compactification of M-theory on some manifold which has a torus
fibration over some base space, B. When we take the limit where the area of the torus fiber
shrinks to zero volume we recover Type IIB compactified on B. If we now take a K3 surface
which admits a torus fibration over a 2d manifold then we can do this procedure to get an
8d theory which is the compactification of IIB on such a 2d manifold – this would be the
candidate theory for the dual to heterotic on T 2 that we speculated about above.
In fact, there exist K3 surfaces which are torus (or elliptic) fibrations over P1 . Such a K3
surface has a realization via a Weierstrass equation
y 2 = x3 + f8 (z1 , z2 )x + g12 (z1 , z2 ) ,
(I.4.56)
where f8 and g12 are degrees 8 and 12 homogeneous polynomials in the projective coordinates,
[z1 : z2 ] of the base P1 . To match with heterotic on T 2 we must count the moduli of this K3
surface. A polynomial of degree d has d+1 parameters because there are (d+1) coefficients in
the generic polynomial, which tells us that we naively have 9 + 13 = 22 parameters from the
complex structure moduli of the K3. An overall rescaling will remove one of these parameters,
and then there is an SL(2, C) action which removes three more parameters. Thus the K3
surface has 18 complex parameters. The Kähler parameters are just the volumes of the T 2
fiber and the P1 base, however, since the theory requires the fiber to shrink into zero volume,
the volume of T 2 is not a part of the theory. In turn, there is just one real Kähler parameter
92
controlling the size of the P1 .
The parameter space of the K3 is then given by
SO(18, 2)
× R+ ,
SO(18) × SO(2)
(I.4.57)
which is exactly the Narain moduli space for the compactification of heterotic on T 2 . Furthermore,
we see that
A ∼ λh ,
(I.4.58)
so increasing the area of the P1 makes the heterotic theory strongly coupled.
We have just described a funny compactification of Type IIB on a P1 which preserves only
half of the supersymmetry, as it is dual to a heterotic compactification. By writing K3 as a
torus fibration we allowed the complex structure modulus, τ , to depend on the holomorphic
coordinate on the P1 , z. Since
i
,
(I.4.59)
τ =χ+
λs
we see that χ and λs now depend also on z. We do not usually consider such compactifications
in superstring perturbation theory as there may be a point on spacetime where the string
coupling becomes large, and then perturbation theory breaks down. We can determine the
points in P1 where τ → ∞, where we have no perturbative control of the theory. It turns
out that when τ becomes infinite is exactly where the torus fibers of the elliptic fibration
degenerate. This occurs at the discriminant locus, which is when
2
∆ = 4f83 + 27g12
= 0.
(I.4.60)
Since ∆ is a degree 24 polynomial then there are 24 zeros of ∆ distributed over the P1 . We
note that τ is actually not a well-defined function of z, as it undergoes SL(2, Z) transformations
when one moves on a path around one of these zeros of ∆.
Then we have constructed a theory in 12d by identifying the τ as a shrunk torus on
top of IIB theory. This theory is called F-theory and can be viewed as a non-perturbative
compactification of type IIB [80]. This is a new type of compactification coming from string
duality. For example, at points where τ → ∞, we have monodromies given by SL(2, Z)
action. We know that a D7 produces a monodromy of τ → τ + 1 as we go around it.
The more complicated monodromies are sourced by bound states of SL(2, Z) images of D7
branes. Hence we can determine that at the zeros of ∆, there are non-perturbative 7-brane
characterized via their SL(2, Z) monodromy
! !
!
p r
1
p
SL(2, Z) action :
=
, qs = rq = 1,
(I.4.61)
q s
0
q
93
we call the non-perturbative brane that sources the above monodromy the (p, q) 7-branes.
Having 24 zeros of ∆ then yields 24 7-branes. However, we should not be allowed to have
24 7-branes as the charge does not cancel. The reason why we could have 24 D7-branes is
because we are in a non-abelian theory from SL(2, Z) action.
Note that a stack of N D7-brane has a U (N ) gauge group. Therefore, when the pinched
points are brought together, we can an enhancement of the gauge group. This can be easily
demonstrated via looking into pinched points. Each pinching point has a local description
xy = z.
(I.4.62)
xy = z(z − a1 )(z − a2 ) · · · .
(I.4.63)
Then all pinched points can be written as
When these pinched points are brought together, the local geometry becomes
xy = z n ,
(I.4.64)
and hence we get an SU(N) gauge group when we have AN −1 singularity.
In order to have all heterotic theory gauge symmetries from this construction, we need
to be able to put some 7-branes together to build an E8 . We can expect such arrangements
to be possible by using singularities of type D and E.
Let’s consider M-theory on K3 that admits singularities of ADE type of Lie algebra.
This is dual to heterotic theory on T 3 . Let us consider the case of AN −1 as an example to
demonstrate such is possible from M-theory. Geometry of type C2 /ZN gives a type AN −1
singularity. To consider the singularities of AN −1 type, we can think of having (N −1) number
of spheres, 2d cycles, touching each other as in Figure I.4.5.
Figure I.4.5: (N − 1) spheres touching each other to form an AN −1 type singularity
The size of these spheres are controlled by Kähler parameters of P1 s:
φ1 , φ2 , · · · , φN −1 .
(I.4.65)
Furthermore, M-theory has 3-form fields Cµνρ that give for every independent 2-cycle a 294
i
form. So we can write the three-form fields in the basis of 2-forms ωµν
as
Cµνρ =
X
i
Aiµ (x)ωµν
,
(I.4.66)
i
where Aiµ depends on the direction that is not compactified. Hence for every three-form
fields, we get a two-form and a one-form that is in the leftover space, which is the gauge field.
This gauge field has a U(1)N −1 symmetry automatically.
Now wrap M2-brane on one of the 2-cycles:
ei
´
M2
Cµνρ
∼ ei
¸
Aµ
.
(I.4.67)
The 3-form field Cµνρ is the one that is coupled to M2-brane, and hence on this gauge field
Aµ , there will be a U(1) charge. Thus we get a charged object corresponding to this M2brane wrapping around 2-cycle. Wrapping around just one 2-cycle, we are considering the
geometry of C2 /Z2 . Then we have two possibilites: we can have an M2-brane or an antiM2-brane, wrapping around with the opposite orientation. Hence we get two states: ±1
charges of U(1) based on the orientation of M2-brane. Via quantization of M2-branes, these
are gauge multiplets (vector multiplets). More precisely, they are charged massive vector
multiplets W ± where the mass is proportional to the area. More precisely, the mass of the
vector multiplets is proportional to
m ∼ T A,
(I.4.68)
where T is the tension of the M2-brane and A is the area. However, it is impossible to have
a charged massive vector multiplet under U(1) unless it is non-abelian U(1). In fact that is
possible as if the area shrinks to zero, we get a massless vector multiplet. In other words,
we get an U(1) with two charged objects that are opposite in charge, and we can conclude
that this is SU(2). Thus we see that when we wrap an M2-brane on one 2-cycle that has A1
singularity, we get a non-abelian gauge symmetry SU(2). Giving vacuum expectation value
to the scalar φ for this U(1), i.e. Higgsing the U(1), is equivalent to blowing up.
Now we can consider a general case of (N − 1) 2-cycles that had U(1)N −1 symmetry. This
will yield an SU(N) gauge symmetry. However, we do not have enough vectors as we have
only 2(N − 1) charged and (N − 1) neutral objects. We are required to wrap two touching
P1 s to bind and form a bound state to resolve such an issue.
Optional exercise: Reduce M-theory on K3 and convince yourself that there are sixteen
supersymmetries and that is enough amount of supersymmetries to have a scalar in the
gravity multiplet. In fact, except in 10d, the gravity multiplet with 16 supercharges in all
lower dimensions have scalars.
95
Exercise 2: We learned that we can wrap M2-branes on touching P1 s to form a bound
state. We can wrap many at once upto all (N − 1) of them. Each chain will give a charged
object upto ± sign. Check that this gives exact dimensions of SU(N). Checking along all the
chains upto the sign, show that we can get full rank of SU(N). By this way we can see the
charge and degeneracies of SU(N). [Hint: Focus more on the degeneracies than charge for
this exercise.]
Optional exercise: We can also have singularities of type D and E as well. Find what the
rules of binding the M2-branes for the D and E types in order to reproduce the rank of the
gauge groups. [Hint: For example, dim(E8 ) = 248.]
4.7
More dualities in lower dimensions
Suppose we have two theories A and B compactified on M1 and M2 respectively, yielding
the same theory on Rd . Then there are parameters corresponding to the moduli of M 1
and M 2 which give rise to scalar fields. We usually construct dualities by taking such
parameters to be constant in the resulting d-dimensional compactified theory. However, if
we can imagine these parameters vary in Rd we still expect to have the duality. In particular,
if the parameters vary gradually, we can go between the dual frames point by point in Rd .
As long as we preserve the amount of supersymmetries, in all the examples it is shown that
the duality persists even if the change of moduli breaks adiabatic principle. By considering
non-constant backgrounds and compactifying them while preserving some supersymmetry,
we can find more lower-dimensional theories that enjoy non-trivial dualities. The assumption
that higher-dimensional dualities continues to be true for non-constant backgrounds is called
the adiabatic assumption.
For example, Type IIB theory with varying the coupling constant over P1 × S 1 is dual to
M-theory on K3 when the parameters are mapped point by point [81]. On the other hand, we
take S 1 to have infinite size, which then corresponds to having elliptic fibration shrinking to
zero size in F-theory. Note that the adiabatic principle is explicitly violated from shrinking
the elliptic fibration.27 Moreover, M-theory on K3 is dual to heterotic on T 3 . By taking
heterotic on T 2 × S 1 , we have the duality between heterotic on T 2 and IIB on P1 , which was
explained earlier by building F-theory from IIB on P1 .
Heterotic on T 3
←→
Heterotic on T 2
M-theory on K3
←→
IIB on P1 × S 1
IIB on P1 (F-theory on K3)
←→
27
We are changing the topology of the internal space and letting τ → ∞. In fact, this is the case for any
duality using T-duality.
96
d
Now consider F-theory on an elliptic manifold Mell
, then the resulting theory is a (12−d)dimensional theory. When we have an elliptic manifold, we can consider F-theory for any
case that has a type IIB theory with a varying parameter via duality. Then we can construct
the following duality that results in the same (10 − d)-dimensional theory.
F-theory
d
Mell
M-theory
1
×S ×S
d
Mell
× S1
1
Type IIA theory
d
Mell
(10 − d)-dimensional theory
We can study these dualities for various dimensions. We have studied 7d theories earlier,
which was found to have a total of 16 or 32 supercharges to be preserved. In the 6d theory, we
can also get a theory with 8 supercharges. This can be achieved via heterotic theory on K3 or
F-theory on elliptically-fibered Calabi–Yau threefolds. In 5d, we also have the same amount
of supercharges. For 4d, the minimal number of supercharges is 4, which corresponds to
N = 1. Such theories can be constructed by compactifing heterotic theories on Calabi–Yau
threefolds [68], F-theory on Calabi–Yau fourfolds [82], or M-theory on G2 -manifolds [83, 84].
These are summarized in Table I.4.7.
10 − d
Number of supercharges
7
32, 16 supercharges
6
32, 16, 8 supercharges
5
32, 16, 8 supercharges
4
32, 16, 8, 4 supercharges
Table I.4.7:
Possible number of supercharges in lower-dimension theories via
compactifications. In the case of 4-dimensional theories there are more options not listed
in the table.
As an example, we can demonstrate how we can get a duality between two resulting
theories in 6d via heterotic theories on K3 and F-theory on elliptically-fibered Calabi–Yau
threefolds. Let us recall that we have a duality between heterotic theories on T 2 and F-theory
on K3 (or IIB theory on a P1 ). Now fiber both cases over a P1 . Then we have heterotic
theories on T 2 ⋉ P1 = K3 and IIB on a P1 fibered over a P1 base. Not precisely, but in
97
some sense we get a P1 × P1 . Note that P1 × P1 is the same as F0 . In general, we can have
a various way to fiber a P1 over a P1 base, which produces Hirzebruch surfaces Fn with a
variable n. When n > 12, such an Hirzebruch surface is no longer Calabi–Yau and hence it
only works for 0 ≤ n ≤ 12.
On the other side, we have to investigate heterotic theory on K3. Recall that we have
the H-flux to vanish
dH =
1
(R ∧ R − F ∧ F ),
16π 2
(I.4.69)
and for a K3, we have a nonzero R ∧R term, which is 24. Then it follows that F ∧F term will
have to include 24 instanton numbers. Note that considering E8 ×E8 , we can put for example
12 instantons numbers each, then this turned out to be corresponding to F0 from the F-theory
compactification. We can consider in generality to have (12 − n) and (12 + n) instantons on
each E8 for 0 ≤ n ≤ 12. This corresponds to Fn from F-theory compactification, which is
the exact match of the same range of n.
Optional exercise: Show that n = 12 is the maximal n for the Hirzebruch surface Fn to be
elliptically-fibered Calabi–Yau threefolds.
Additionally, M-theory on an interval S 1 /Z2 × K3 is also dual to heterotic theory on K3
(see Figure I.4.4 for the interval). Then on each end of the interval we have an orbifold point.
What does it mean to have E8 instanton numbers on each orbifold point? Having nontrivial
instanton numbers means we are turning on some excitation. Instantons shrinking into zero
size is equivalent to M5-branes approaching the boundary.28 Then in the equivalent picture
in F-theory, we should match the M5-branes, i.e. squeezing instantons on K3 to a point, to
the Hirzebruch surfaces Fn . This can be demonstrated with toric diagrams.
F0
F1
Figure I.4.6: These are toric diagrams of F0 and F1 . By blowing up a point and blowing
down another P2 on F0 , we get F1 .
We can have M-theory on an interval and both ends to have (12 − n) instantons and
(12 + n) instantons. In order to shift from n = 0 case to n = 1 case, we need to squeeze
28
This is because we are on 6d resulting theory from 11d theory and hence an instanton is equivalent to
11-6=5-dimensional brane.
98
one instanton out and move to the other end. Squeezing into zero size of the instanton
corresponds blowing-up a point geometrically, and pushing the M5-brane to the other end of
the boundary corresponds to blowing-down another point on toric diagram, as demonstrated
in Figure I.4.6.
In this manner we can see that all three theories, heterotic theory on a K3 surface, Ftheory on an elliptically-fibered Calabi–Yau threefold, and M-theory on an S 1 /Z2 , are all
dual to each other to give a six-dimensional resulting theory. Similarly, the same things
happen for all the other lower dimensions listed in Table I.4.7, which summarizes all string
dualities.
5
Complex geometry
5.1
Preliminary definitions
A complex manifold is a (topological) manifold that can be covered by patches of complex
coordinates z i = xi + iy i such that the transition functions between different patches are
holomorphic. This ensures that the notion of a holomorphic function f (z) is well-defined on
the entire manifold regardless of the choice of coordinates29 . On such manifolds there is a
natural notion of complex differential forms, with the space of (p, q) forms Ωp,q spanned by
elements of the form dz i1 ∧ · · · ∧ dz ip ∧ dz̄ j̄1 ∧ · · · ∧ dz̄ j̄q . This leads to a refined version of the
de Rham cohomology known as the Dolbeault cohomology.
Recall that the de Rham cohomology H p (M ) of a manifold M is the set of closed p-forms
modulo exact p-forms with respect to the exterior derivative d. It is isomorphic to the space
of harmonic p-forms, i.e. those that vanish under the Laplacian (d + ⋆d⋆)2 . Their dimensions
bp = H p (M ) are topological invariants known as the Betti numbers, related to the Euler
characteristic of M by
dim(M )
X
χ(M ) =
(−1)p bp .
(I.5.1)
p=0
For complex manifolds, we can consider an extended set of differential operators ∂ and ∂¯
which map (p, q) forms to (p + 1, q) and (p, q + 1) forms, respectively. They satisfy the
relations
¯ ∂ 2 = ∂¯2 = {∂, ∂}
¯ = 0.
d = ∂ + ∂,
(I.5.2)
¯
The Dolbeault cohomology H p,q , also a topological invariant, is defined as the space of ∂-closed
¯
(p, q)-forms modulo ∂-exact
(p, q)-forms. The dimensions of these spaces, known as Hodge
29
By definition, a Riemann surface is a one-dimensional complex manifold.
99
numbers, satisfy a myriad of relations such as hp,q = hq,p . We will have more to say about
this later in the context of Calabi–Yau manifolds.
A Kähler manifold is a Riemannian manifold with nonzero metric components gij̄ which
locally take the form gij̄ = ∂i ∂j̄ K(z, z̄) for a function K called the Kähler potential . The
Kähler form is a closed (1,1) form kij̄ that is essentially the antisymmetric version of gij̄ . It
¯
is related to the metric as k = ∂ ∂K,
with components kij̄ = −kj̄i = gij̄ . Kähler manifolds
of complex dimension n generically have a U (n) holonomy. Calabi–Yau manifolds CYn of
complex dimension n are special examples of Kähler manifolds that have a reduced SU(n)
holonomy. This implies that the curvature class of the U (1) piece of the spin connection
(i.e. the first Chern class) is zero. Yau proved that a Kähler manifold with vanishing first
Chern class admits a Ricci-flat Kähler metric, which has SU(n) holonomy. As mentioned
perviously, SU (n) holonomy guarantees the existence of a covariantly constant spinor, and
hence the preservation of supersymmetry.
Calabi–Yau manifolds come in families depending on their metric and complex structure.
The Kähler form is contained in the cohomology H 1,1 . Its dimension h1,1 is also the number
of ways we can deform the metric. In string theory, the addition of Bµν increases the
dimensionality of the moduli space. Deformations of Gµν and Bµν can thus be captured
by the complex combination
k + iB
(I.5.3)
which lives in the complexified (1,1)-form cohomology with real dimension 2h1,1 . In addition
to deforming the metric/B-field, we can also deform the complex structure, which is equivalent
to mixing ∂i and ∂¯j̄ . For instance, under a generic deformation this takes the form
∂¯ī −→ ∂¯ī + µīj (z, z̄)∂j .
(I.5.4)
In order to maintain ∂¯2 = 0, µ is required to satisfy
∂¯[ī µj̄]j = O(z, z̄).
(I.5.5)
To count the number of complex structure deformations, we first note that Calabi–Yau
manfiolds also have a globally defined (n, 0) form, represented in component form by the
tensor ǫi1 ···in , which can be used to covert an upper holomorphic index to n − 1 lower indices.
In particular, it can be used to write
µīj1 ···jn−1 = ǫjj1 ···jn−1 µīj .
(I.5.6)
¯ A ∂-exact
¯
(n−1, 1)-form
Since ∂¯[ī µj̄]j = 0, the (n−1, 1)-form µīj1 ···jn−1 is also annihilated by ∂.
100
corresponds to
µīj = ∂¯ī v j (z, z̄)
(I.5.7)
for some vector field v j , which is a trivial coordinate change (as opposed to a deformation
of the complex structure). Then the number complex structure deformations is given by
hn−1,1 = h1,n−1 .
5.2
Examples of Calabai–Yau manifolds
1-folds We first consider the simple example of a (non-singular) Calabi–Yau manifold, T 2 .
it is particularly nice because we known the explicit form of its metric, given by dzdz̄. Its
cohomology classes are spanned by the forms 1, dz, dz̄, dz ∧ dz̄. It immediately follows that
h0,0 = h0,1 = h1,0 = h1,1 = 1.
(I.5.8)
From this we see that there is a single Kähler deformation as well as one complex structure
deformation. Recall that the inequivalent tori are labeled by a single complex modulus τ ;
complex structure deformations correspond to changing the value of τ . The Kähler parameter
(modulus) corresponds to an overall rescaling. For string theory we shall also include the Bfield in our analysis. If we let A denote the area of the torus, we can define a complex Kähler
parameter ρ = B + iA. The parameter τ transforms under the usual SL(2,Z) symmetry.
Next we study the symmetries of ρ. There is a shift symmetry ρ → ρ + 1.30 There is also
another symmetry transformation on ρ, which is best illsutrated for the case where B = 0.
If the torus has radii R1 and R2 , then a T-duality transformation given by R1 → R11 and
R2 → R12 corresponds to ρ → − ρ1 . Combing these two symmetry transformations implies
that ρ also has an SL(2, Z) symmetry, with a fundamental domain identical to that of τ .
The T 2 manifold provides us with our first example of mirror symmetry, which exchanges
different Hodge numbers, and in particular the complex and Kähler structures. For simplicity,
consider again a torus with radii R1 and R2 and B = 0. Then
|τ | =
R2
,
R1
|ρ| = R1 R2 .
If we perform a T-duality transform on just R1 , then |τ | →
are exchanged under T-duality!
30
(I.5.9)
1
R1 R2
Recall that in the path integral, the Kalb-Ramond field appears as e2πi
units, so the theory has a shift symmetry B → B + 1.
101
and |ρ| →
´
B
R2
;
R1
so τ and ρ
with appropriately chosen
2-folds The obvious next example are Calabi–Yau 2-folds. There is of course the trivial CY
given by T 4 with trivial holonomy, but this behaves very similarly to the previous example of
T 2 . Instead we consider the orbifold T 4 /Z2 , which has a Z2 holonomy and is a singular limit
of K3. The Z2 acts on both coordinates as z i → −z i . The metric of this space is known but
singular, and so T 4 /Z2 is not a smooth manifold. To find its cohomology, we instead consider
the cohomology of T 4 , which like T 2 is given by the products of 1, dz i , dz̄ j̄ for i = 1, 2 and
j̄ = 1, 231 . To construct the cohomology of the orbifold, we simply mod out by the Z2 action
(which projects out the odd-dimensional forms) and include the twisted sectors. Recall the
exercise of type II on T 4 /Z2 . Each twisted sector contributes a single complex scalar, where
the number of twisted sectors is given by the number of fixed points of the action, which is 16
for the orbifold under consideration. The VEVs of these scalars corresponds geometrically to
deformations of the metric/B-field, and so they contribute directly to h1,1 . The cohomology
of T 4 /Z2 is thus given by h1,1 = 4 + 16 = 20. We list the cohomologies of T 4 and T 4 /Z2
below, represented as the matrices hp,q :
T4 :
1 2 1
2 4 2 ,
1 2 1
T 4 /Z2 :
1 0 1
0 20 0 .
1 0 1
(I.5.10)
We can now study the details of the T 4 /Z2 geometry. Recall that T 2 has two complex
moduli, τ and ρ. What are the analogous parameters for K3? Looking at the value of
h1,1 = 20, it would naively seem that there are 20 + 20 = 40 complex parameters, in analogy
with T 2 . This turns out to be correct, however we must be careful with their treatment.
The first caveat comes from the fact that there are two complex structure deformations
that preserve the metric. Let Ω denote the unique (2, 0)-form (which exists because h2,0 = 1)
and let Ω̄ denote the analogous (0, 2)-form. We take t ∈ C to parametrize the deformations
under consideration.. Defining a complex structure on a complex manifold of dimension n is
equivalent to choosing an (n, 0) form. We can thus introduce a real two-parameter family of
complex structures labeled by Ω̂(t), defined as
Ω̂(t) = Ω + tk + t2 Ω̄.
(I.5.11)
To show that Ω̂ defines a complex structure, we must verify that Ω̂2 = 0. This follows from
Ω2 = Ω̄2 = 0 as well as Ω ∧ Ω̄ = k ∧ k. Note that Ω ∧ Ω̄ is the volume form on the manifold.
The equation (I.5.11) then defines two deformations of the complex structure that leave the
31
Technically speaking, orbifold geometries with singular points are not Calabi–Yau manifolds. Instead,
one can construct a CY from an orbifold classically by smoothing any singularities. However, string theory is
well-defined on orbifold geometries. After correctly accounting for the twisted sectors, the orbifold theories
give results that precisely agree with the classical geometries that have been smoothed. PLEASE CHECK
102
metric (e.g. k) fixed.
Though it would now seem that we have 40-1=39 complex moduli (after dropping those
parametrized by t), the missing parameter is restored by correctly considering the B-field
deformations. The aforementioned complex parameters only cover the subspace of the moduli
space where the B-field is deformed by (1, 1)-forms.32 We must also take into account
deformations of the B-field by (2, 0) and (0, 2)-forms. This restores the number of complex
parameters to 40.
As it turns out, the moduli space of T 4 /Z2 is given by
.
SO(20, 4)
SO(20, 4, Z).
SO(20) × SO(4)
(I.5.12)
This is the analog of the moduli space of τ and ρ for T 2 , which live in (H × H/SL(2, Z)).
It is the same moduli space as heterotic theory on T 4 . If we turn off the B-field, then the
moduli space reduces to
.
SO(19, 3)
(SO(19, 3, Z) × Rradius ) .
SO(19) × SO(3)
(I.5.13)
To check that the dimensionality works out, note that there are 40 real complex structure
moduli and 20 real Kähler moduli (since B = 0). We must still subtract 2 due to the
complex structure deformations which preserve the metric, given by (I.5.11). There are thus
58 = 57 + 1 real moduli when B = 0.
To summarize, K3 surfaces are the only twofolds realizing the full SU (2) holonomy. In
turn, we deduce that all Calabi–Yau twofolds are K3 surfaces.
3-folds Now, we move on to the 3-dimensional case, which is especially interesting. First,
it is unclear whether the number of Calabi–Yau threefolds is even finite. To study these
surfaces, we can work out the Hodge numbers. Note that if there is a nontrivial (i.e. nonexact) one-form, then the fundamental group of the manifold is nontrivial. In particular, if
the fundamental group of CY3 is nontrivial, then the holonomy must be a proper subgroup
of SU(3). It follows that CY3 with SU(3) holonomy must have
h1,0 = h0,1 = 0.
32
For T 2 , Bµν has only a single component, Bzz̄ .
103
(I.5.14)
Generically, the cohomology of CY3 with SU(3) holonomy is given by
1 0
0 1
0 h2,1 h1,1 0
0 h1,1 h2,1 0 .
1 0
0 1
(I.5.15)
There are clearly only two numbers that characterize the basic topological structure, h1,1 and
h2,1 . However, there are in fact different CY3 -folds with identical Hodge numbers, so they are
not sufficient to distinguish between different surfaces. Currently, the largest known Hodge
numbers are on the order of 50033 .
As an easy but nontrivial example, we consider the orbifold T 6 /Z3 , where the Z3 acts
as (z1 , z2 , z3 ) → (ωz1 , ωz2 , ωz3 ) with ω 3 = 1. Its cohomology is most easily worked out by
again starting with that of T 6 , which is simply given by combinations of dzi and dz̄ī and their
products. The Z3 -invariant forms are dz1 ∧ dz2 ∧ dz3 , dz̄1 ∧ dz̄2 ∧ dz̄3 , and all (i, j)-forms for
i = j.
As in the previous example, the full cohomology receives contributions from the twisted
sectors of the worldsheet orbifold CFT. For Z3 , there is a single untwisted sector and two
twisted sectors. The number of ground states in each twisted sector is given by the number
of fixed points of the Z3 symmetry, which for a T 6 target space is 27. Thus, h1,1 and h2,2
are each increased by 27, yielding a total of 36 each. One interesting thing to note is that
due to the orbifold projection, h1,2 = 0, which implies that there are no complex structure
deformations. Roughly, this means that you can change the size of this manifold but not its
shape. This is an example of whats known as a rigid Calabi–Yau. For convenience, we list
the Hodge numbers for both T 6 and T 6 /Z3 below.
T6 :
5.3
1
3
3
1
3
9
9
3
3
9
9
3
1
3
,
3
1
T 6 /Z3 :
1 0 0 1
0 0 36 0
0 36 0 0 .
1 0 0 1
(I.5.16)
Calabi–Yau manifolds from complex projective spaces
The orbifolds considered in the previous sections are but a small subset of the set of Calabi–
Yau manifolds. As a non-orbifold example, consider the n-dimensional complex projective
33
It is believed that the Hodge numbers of CY3 -folds are bounded, but this remains unproven
104
space, CPn . Geometrically, the complex projective space is the set of complex lines in Cn+1
that pass through the origin. The most convenient description is in terms of the points
(z1 , . . . , zn+1 ) ∈ Cn+1 identified under
(z1 , . . . , zn+1 ) ∼ λ(z1 , . . . , zn+1 ),
λ ∈ C.
(I.5.17)
There is an additional restriction that 0 ∈ Cn+1 is not included. For instance, we can study
CP1 ≃ S 2 . There are two coordinate patches which correspond to either z 1 6= 0 or z 2 6= 0 –
these are the patches on S 2 which cover the north and south poles, respectively. Using our
knowledge of S 2 , we observe that CP1 does not admit a Ricci flat metric. More generally, CPn
admits metrics with positive curvature; in particular, they do not admit Ricci flat metrics,
which recall are an essential ingredient of any Calabi–Yau surface.
To construct the desired Ricci-flat manifold, we instead analyze hypersurfaces in CPn
defined through a homogenous degree-(n + 1) polynomial equation Pn+1 (zi ) = 0. The
homogeneity property is a consistency condition which follows from (I.5.17). For instance,
we could consider the equation
n+1
z1n+1 + · · · + zn+1
= 0.
(I.5.18)
The resulting hypersurface is a Calabi–Yau (n − 1)-fold whose first Chern class is zero the
n = 2 case yields a T 2 . As a special case, a CY2 -fold embedded in CP3 can be defined by
z14 + z24 + z34 + z44 = 0,
(I.5.19)
which is just a realization of K3. Similarly, the embedding of a CY3 -fold follows from
z15 + z25 + z35 + z45 + z55 = 0,
(I.5.20)
which is a 3-fold. This is known as a quintic 3-fold . As an aside, quintic 3-folds have Hodge
numbers
h2,1 = 101,
h1,1 = 1.
(I.5.21)
as well as an Euler characteristic given by
χ = 2 (h1,1 − h1,2 ) = −200.
(I.5.22)
More generally, we can obtain a CYn−1 -fold by choosing any degree-n homogenous polynomial.
For a 1-fold which is a T 2 , there is only a single term az1 z2 z3 up to a change of coordinates.
Here, a is a complex modulus which parametrizes the complex structure of T 2 . For CY2 ,
105
there are various terms we can add to (I.5.19). For CY3 , there are 19 possible terms; this
implies that such terms cannot completely capture the complex structure deformations of
K3.
Exercise 1: By writing down allowed terms of degree five (and removing redundancies),
show that for the quintic threefold, there are 101 independent complex deformations.
5.4
Singularities of K3 surfaces
Thus far, we have swept the issue of orbifold singularities under the rug. This was possible
for the worldsheet because string theory is well defined on such geometries, even if the target
space doesn’t take the form of a bona fide manifold. Our first encounter with singular
geometries was T 4 /Z2 . Near each of its 16 fixed points, the space locally looks like C2 /Z2 .
This follows from identifying two complex coordinates z1 and z2 under the Z2 reflection:
(z1 , z2 ) ∼ (−z1 , −z2 ).
(I.5.23)
It is instructive to define three new quantities
u := z12 ,
v := z22 ,
w := z1 z2 ,
(I.5.24)
which are clearly Z2 -invariant. There are two independent coordinates, with all three related
by
uv = w2 .
(I.5.25)
There is thus a one-to-one correspondence between C2 /Z2 and the hypersurface in C3 defined
by (I.5.25). This geometry becomes singular at the origin z1 = z2 = 0. We can resolve this
singularity by modifying (I.5.25) to
uv = (w −
√
µ)(w +
√
µ)
(I.5.26)
for some parameter µ.
Resolving the singularity leads to the emergence of an S 2 at the origin of C2 /Z2 −{0}. Note
that the left-hand side of (I.5.26) is invariant under the phase rotation v → e−iθ v, u → eiθ u.
√
This defines an S 1 which shrinks as w approaches ± µ. The emergent S 2 , with its S 1
subspace, is depicted graphically in I.5.1. Let’s return our attention to the T 4 /Z2 example.
Each of the 16 singularities is replaced by an S 2 , which each contribute +1 to h1,1 . This
is exactly what we found in Section 5.2 when considering the twisted sectors on the string
worldsheet.
106
√
− µ
√
µ
Figure I.5.1: A plot of the w plane, with an S 2 at the origin. The S 1 clearly shrinks to zero
√
at the points w = ± µ, which in effect defines the S 2 .
We can generalize the above example by instead considering a C2 /Zn orbifold singularity.
The Zn symmetry acts on z1 , z2 as
(z1 , z2 ) → (ωz1 , ω −1 z2 ),
ω n = 1.
(I.5.27)
The Zn is a discrete subgroup of SU (2), which is also true for the holonomy group. In analogy
with the Z2 case, we consider a C3 space parametrized by
u := z1n ,
v := z2n ,
w := z1 z2 .
(I.5.28)
The embedding of C2 /Zn in C3 is specified by the constraint
uv = wn .
(I.5.29)
Under the appropriate change of variables, this equation can be rewritten as
u2 + v 2 = w n ,
(I.5.30)
The singularity at w = 0 can be resolved in a similar manner at the previous case, replacing
wn with a product:
n
Y
2
2
u +v =
(w − αi ) .
(I.5.31)
i=1
If two of the αi parameters are brought close together, then equation (I.5.31) describes
the C2 /Z2 singularity in a neighborhood. So, equation (I.5.31) smooths out the C2 /Zn by
introducing (n − 1) P1 ’s, as shown in Figure I.5.2.
107
Figure I.5.2: Resolution of C2 /Zn singularity. Here, each singular point is replaced by a copy
of CP1 of nonzero size.
Optional Exercise: Show that the hypersurface defined by equation (I.5.25) is topologically
the same as the cotangent bundle of P1 , denoted by T ∗ P1 .
The (n − 1) P1 s, which we refer to as Ci for i = 1, · · · , n − 1, are a basis for the homology
of the deformed space. Their intersection numbers can be organized into the following matrix
(where empty entries are taken to be zero):
−2 1
1 −2 1
Ci · Cj =
..
. 1
1
1 −2
.
(I.5.32)
ij
If we represent each sphere by a node and each intersection between two P1 s by a line between
such nodes, then Figure I.5.2 looks like the An−1 Dynkin diagram. In fact, the matrix of
intersection numbers is just the Cartan matrix (multiplied by −1) associated to the An−1 Lie
algebra.
The resolution of the C2 /Zn singularity thus appears to be connected to the An−1 Lie
algebra. This is a particular example of a more general phenomenon where resolved singularities
are defined by discrete subgroups of SU (2). Generically, these discrete subgroups fall into
several infinite families, An and Dn , as well as a few exceptional subgroups E6 , E7 , E8 . Similar
to the An−1 example for C2 /Zn , in the general case the associated Dynkin diagrams indicate
how the deformed singularity behaves.
We may think of the deformations of equation (I.5.31) as complex structure deformations,
although the distinction between deformations of the complex structure and the Kähler
structure is ambiguous for K3 due to its hyperKähler structure.
Let us return to the example of C2 /Z2 and its connection to stringy geometry [37]. The
orbifold point geometrically corresponds to where the area of the P1 shrinks to zero. As
108
mentioned previously, the CFT with an orbifold target space is well-behaved, and so too is
this limit in the CFT moduli space. Note that we could have turned on a B field, which in
certain units is only defined modulo 2π. If the area of the P1 is A, then A + iB forms a
complexified Kähler parameter. We should then upgrade µ in equation (I.5.26) to a complex
parameter and identify
µ = A + iB .
(I.5.33)
It turns out that the orbifold point corresponds to A = 0 and B = π. In one of the previous
exercises, we learned that the Z2 orbifold CFT has a Z̃2 symmetry. This symmetry only
exists on the worldsheet when B takes the values of 0 or π. The Z̃2 symmetry then acts as
B → −B, which is compatible with the definition of the B-field. When B = 0, the path
integral of the worldsheet wrapping the P1 behaves like
X
A→0
(I.5.34)
e−nA −−−→ ∞ .
n>0
When B = π, the path integral instead behaves like
X
e−nA+iπn < ∞ ,
(I.5.35)
n>0
so the conformal theory is well-defined. The orbifold limit corresponds to B = π.
The resolution of orbifold singularities also has physical implications for the resulting
string theory. Only certain kinds of singularities can appear in the moduli space of a compact
K3 surface. This comes from the fact that the second homology of K3 is given by
h0,2 + h1,1 + h2,0 = 1 + 20 + 1 = 22 .
(I.5.36)
It follows that it is not possible to have an arbitrarily large number of P1 s which arise from
deforming singularities. Recall that we can only compactify string theory on a T 4 or a K3
surface if we want to preserve some supersymmetry. The types of physical theories which
result are thus constrained by the aforementioned bound on the number of P1 s.
5.5
Singularities of Calabi–Yau threefolds
There are two methods to deal with singularities: deformations and resolutions.34
We first consider deformations, which amounts to giving the singular S 3 a nonzero size.
34
For rigorous definitions of deformations and desingularizations, as well as more modern ways of utilizing
them for Calabi–Yau threefold compactifications, see [85–92].
109
The 3-dimensional analog of equation (I.5.26) is
z̃12 + z̃22 + z̃32 + z̃42 = µ,
(I.5.37)
which describes T ∗ S 3 , where the S 3 has size µ. The geometry is singular for the case where
the size of the S 3 vanishes. One can explicitly write down a Ricci flat Kähler metric, and
thus the total space of the cotangent bundle is a non-compact Calabi–Yau threefold. The
singularity appears at µ = 0. Note that this singularity is not an orbifold singularity, since
it does not take the local form C3 /G for some discrete isometry group G. We can deform it
by considering µ 6= 0, which in particular corresponds to a complex structure deformation.
We now consider resolutions, or giving the singular S 2 ≃ P1 a nonzero size. By a linear
change of coordinates, we can rewrite equation (I.5.37) (with µ = 0) as
!
z1 z3
= z1 z2 − z3 z4 = 0 .
(I.5.38)
det
z4 z2
Let v = (α, β)T be a nonzero vector annihilated by this matrix. be a nonzero vector that
is annihilated by the above matrix. Because its overall normalization is irrelevant, we can
think of v as an element of P1 . In a coordinate patch z where β 6= 0, we can write
z :=
α
.
β
(I.5.39)
We may parameterize the singular manifold of (I.5.38) by the variables z1 , z4 , z. The singularity
is located at z1 = z4 = 0, where the P1 parameterized by z shrinks to zero. Resolving (or
blowing-up) the singularity amounts to giving P1 a nonzero area, so this is an example of a
Kähler deformation. The geometry of the resolved singularity is a sphere P1 parameterized
by z; the directions z1 , z4 are normal to th sphere. Formally, we say that the entire geometry
is an O(−1) ⊕ O(−1) line bundle over P1 . The O(−1) notation means that the number of
zeros minus the number of poles of any holomorphic section of the line bundle is minus one.
In summary, we have shown two different ways of smoothing out the same singularity.
The deformation method gives an S 3 nonzero size, while the blow-up/resolution method
gives an S 2 nonzero size. As mentioned previously, these procedures correspond to complex
structure and Kähler deformations, respectively. To see how they are related, note that T ∗ S 3
has topology S 3 × R3 , or S 3 × S 2 × R+ . The S 2 is contractible, while the S 3 has nonzero
size. The singularity is restored by letting the S 3 shrink as R+ goes to zero. Likewise, the
line bundle considered above locally looks like S 2 × R4 , or S 2 × S 3 × R+ . In this case, the
singularity is restored by letting S 2 shrink as R+ goes to zero. The relation between these
two methods of smoothing a singular geometry is known as the conifold transition, as
illustrated in Figure I.5.3.
110
R+
R
+
S3
S3
S2
S2
Figure I.5.3: The left diagram corresponds to the (resolved) blown-up singularity, while the
right diagram corresponds to the deformed singularity.
String perturbation theory breaks down near the singular point, since the size of the
sphere becomes small relative to the string scale.
Calabi–Yau manifolds with different Hodge numbers can be related through transitions
like the conifold transition. It is conjectured that the number of Calabi–Yau threefolds is
finite. For known examples of Calabi–Yau manifolds, the largest Hodge numbers are on the
order of h1,1 + h2,1 ∼ 500. Thus, conjecturally there is a bound on the number of massless
fields that can arise from supersymmetric string compactifications on complex threefolds.
5.6
Toric geometry
Toric geometry is the study of algebraic varieties which are equipped with an embedded
algebraic torus (C∗ )p , such that the group action of the torus extends to the entire variety.
Toric spaces are a particularly tractable example of more general spaces as their topological
and geometric data can be understood through combinatorics. Many familiar spaces in
physics are in fact toric spaces, as we shall see.
A toric diagram is a representation of a toric space, in particular a Calabi–Yau space, as a
T fibration over some base, B. The cycles of the torus fiber degenerate over the boundaries
of B. One of the simplest examples of a toric space is the complex plane, C. By parametrizing
C by polar coordinates, |z|2 and θ, we see that C can be viewed as an S 1 fibered over the
positive real line, where the S 1 collapses at the origin, as depicted in Figure I.5.4. The toric
diagram describing this 1 complex dimensional toric space is then just the semi-infinite line.
p
|z|2
0
Figure I.5.4: Toric diagram of C. The circle, parametrized by the angle θ, shrinks to zero
when |z|2 → 0.
111
There is a natural symplectic form on this space. In terms of polar coordinates, it takes
the form
dz ∧ dz = d(|z|2 ) ∧ dθ .
(I.5.40)
Symplectic manifolds (i.e. smooth manifolds equipped with a symplectic form) form the
phase space of physical systems. Indeed, the 6d phase space of a particle moving in 3d is
a Calabi–Yau threefold: locally the momentum and position are given by the vertical and
horizontal coordinates on the cotangent bundle T ∗ M with M = S 3 , for example. In this
way a Calabi–Yau space can be viewed as a compact version of the phase space where the
symplectic form is the Kähler form.
Next, we consider C2 . Similarly to the case of C, switching to polar coordinates (|zi |2 , θi for
i = 1, 2) makes the fiber structure manifest. Collecting the angles and distances separately,
we observe that C2 can be realized as a T 2 fibration over the closed positive quadrant of R2 .
The A and B cycles of the torus vanish, respectively, on the x = |z1 |2 and y = |z2 |2 axes;
both degenerate to a point at the origin. Note that the space is smooth, as expected. The
toric diagram for C2 , with special degenerating points highlighted, is depicted in Figure I.5.5.
The generalization of this procedure to Cn is straightforward.
|z1 |2
(θ1 , θ2 )
0
|z2 |2
Figure I.5.5: Toric geometry of C2 .
By just drawing the base of the torus fibration, which is exactly the toric diagram, we
see that it is possible to visualize an n-dimensional complex toric space in terms of its lowerdimensional base.
Although the above examples described toric diagrams for (non-compact) Calabi–Yau
manifolds, we can also use toric diagrams to depict more generic manifolds. For example,
toric geometry says that a sphere is just an interval; more specifically, an S 2 can be realized
as an S 1 fibration over the interval, where the fiber collapses at the boundaries. The toric
diagram for S 2 is depicted in Figure I.5.6.
112
S2
0
S1
Figure I.5.6: An S 2 realized as an S 1 over the interval, where the circle shrinks at the ends
of the interval.
Next we consider two-dimensional complex projective space CP2 , parametrized by coordinates
z1 , z2 , and z3 identified by
(z1 , z2 , z3 ) ∼ λ(z1 , z2 , z3 ) ,
λ ∈ C∗ .
(I.5.41)
There are three phases, but only two are independent due to an allowed rescaling. Thus,
this geometry contains two circles, which can be represented with a 2d diagram. A general
point in the interior is associated with a smooth T 2 fiber. There is a line CP1 where each
of the three coordinates z1 , z2 , z3 individually go to zero. When any two coordinates vanish,
we are left with a single point. Any two such lines must meet at a corner, where both of the
associated coordinates vanish. The full diagram of a CP2 is given in Figure I.5.7.
In order for the geometry to be smooth everywhere, it is crucial that each corner of Figure
I.5.7 looks like C2 . That is, the two shrinking circles must form a basis for the homology
of the two-dimensional torus. Thus, they must intersect once. Suppose we had a geometry
where the two shrinking cycles are given by (1,0) and (1,2)35 . In this case, there is a corner
singularity that we can blow-up by inserting a P1 ; this yields a geometry which locally looks
like a toric diagram. This blow-up process is depicted in Figure I.5.8. For a Zn orbifold
singularity, one applies this blow-up procedure (n − 1) times to obtain (n − 1) P1 s.
35
A cycle (nA , nB ) of the torus T 2 is one which wraps the A-cycle nA times and the B-cycle nB times
113
z1 = 0
z3 = 0
T2
z2 = 0
locally C2
Figure I.5.7: Toric diagram of CP2 . Each edge corresponds to a CP1 .
cycle (1, 0)
Blow up
P1
cycle (1, 1)
cycle (2, 1)
Figure I.5.8: Toric representation of blowing up a C2 /Z2 singularity to obtain a P1 . After the
blowup, the intersection number of the cycles at each corner is 1, so there are no singularities.
This blowup resolves A1 singularity.
We can also represent S 3 via a toric action, despite the fact that S 3 is not a toric, or even
a complex, space. Given two complex coordinates z1 , z2 , S 3 is defined to be the locus
|z1 |2 + |z2 |2 = 1 .
(I.5.42)
The coordinates |z1 | and |z2 | take values on the interval [0, 1]. At each point on the interval
for |z1 |, there is an associated T 2 . At |z1 | = 0 one circle vanishes, whereas at |z1 | = 1 the
other circle vanishes. So we can think of a shrinking circle as filling in one of the cycles of
the torus. Combining the two coordinates together, we land at the fact that S 3 is essentially
two solid tori glued together.
We next turn to studying the conifold using the toric language. We can depict the
resolution and deformation of a singular conifold in Figure I.5.9. The intersection points
in the toric diagram of the singular conifold signal the location of singularities, as can be
determined from the degenerating cycles of the torus fibers. The resolution of a singular point,
as explained above, involves the creation of a P1 , which effectively replaces the singularity.
114
This is represented by the central interval on the left-hand-side of Figure I.5.9. The righthand side of the figure represents the deformed conifold, which recall involves replacing
the singularity with an S 3 of nonzero size. The deformation involves pulling the lines of
the singular diagram apart, which then has the structure of a T 2 fibered over an interval;
the degenerations at the endpoints are just an S 3 . In this way the two different methods
for smoothing the conifold singularity can be understood from the point of view of toric
diagrams.
|z1 |2
S2
singular limit
pull apart
|z4 |2
Figure I.5.9: Resolving and deforming a singular conifold.
Lastly, we discuss the C3 /Z3 singularity, where (z1 , z2 , z3 ) is identified with ω(z1 , z2 , z3 )
for ω 3 = 1. This singularity can only be smoothed with Kähler deformations, not complex
deformations. This is because h2,1 = 0 for C3 /Z3 as discussed before. The toric diagram
is three-dimensional and the singularity is located in the corner. The blown-up singularity
looks like chopping off a corner and replacing it with a triangle, which is CP2 . See Figure
I.5.10.
Blow up
P2
Figure I.5.10: Toric diagram of resolving the C3 /Z3 singularity to obtain CP2 .
115
6
6.1
Sigma models
Supersymmetric sigma models and mirror symmetry
Thus far, we have talked about the geometric structures that appear in superstring theory
from the spacetime perspective. To describe such geometries via the worldsheet, we consider
an N = (1, 1) σ-model with target space M , i.e. the supersymmetric version of (I.1.63),
whose action is given by [93, 94]36
ˆ
1
¯ ν + Gµν ψ µ ψ ν ∇z̄ ψ ν + ψ̄ µ ∇z ψ̄ ν + 1 Rµνσρ ψ µ ψ ν ψ̄ ρ ψ̄ σ .
d2 z(Gµν + Bµν )∂X µ ∂X
S=
4π
2
(I.6.1)
Here, Rµνσρ (X) is the Riemann curvature of M and ∇a is the spin covariant derivative, which
acts on the fermions as
1 µ
µ
µ
µ
∇z ψ = ∂ψ + Γνσ + Hνσ ∂X ν ψ σ .
(I.6.2)
2
As a trivial example, one may choose the target space to be Cn with zero H-flux, which
yields a theory of free bosons and free fermions. This is just the standard worldsheet action
of the type II superstring in flat Minkowski space. Its global symmetry group includes a U(1)
factor with current
J = gij̄ ψ i ψ j̄ .
(I.6.3)
Under this U(1) symmetry, ψ i has charge +1, while ψ ī has charge −1. Even if we take
the the target space be a general Calabi–Yau manifold, the worldsheet SCFT preserves this
symmetry because the U(1) holonomy of the manifold is trivial. That is, the U(1) piece of
the spin connection has no curvature, so the associated current J still exists for a generic
choice of Calabi–Yau manifold. In fact, there are independent U(1) currents associated with
left-movers and right-movers:37
J = gij̄ ψ i ψ j̄ ,
J¯ = −gij̄ ψ̄ i ψ̄ j̄ .
(I.6.4)
Recall that for a single chiral supersymmetry (which we take to be left-moving for concreteness),
there is a supercurrent G(z) with weight ( 23 , 0) given by
GL = gij̄ (ψ i ∂X j̄ + ψ j̄ ∂X i ).
36
(I.6.5)
In (I.6.1) and the equations which follow, purely anti-holomorphic operators will typically be adorned
with an overline to distinguish them from their holomorphic counterparts. In particular we reserve ∗ for
complex conjugation.
37
The minus sign is a convention. See for instance the bottom of page 385 in [2].
116
The two terms on the right-hand-side above have respective charges ±1 under J. Thus,
for the Calabi–Yau SCFT, the supersymmetry is enhanced to (2,2) because there are two
supercurrents G± on each side, where ± denotes the charge under J. To be precise, we write
G+ = gij̄ ψLi ∂X j̄ , G− = gij̄ ψLj̄ ∂X i ,
¯ j̄ , Ḡ+ = gij̄ ψ̄ j̄ ∂X
¯ i.
Ḡ− = gij̄ ψ̄ i ∂X
(I.6.6)
(I.6.7)
The full N = (2, 0) superconformal algebra is generated by T (z), J(z), G± (z) [95]. In addition
to the internal U(1) generated by J, there is a Z2 auto-automorphism which acts on the
supercurrents G1 = G+ + G− and G2 = i(G+ − G− ) as
Z 2 : G 1 → G1 ,
G2 → −G2 ⇔ G+ ↔ G− .
(I.6.8)
Having established the existence of the supercurrents G± for the left- and right-movers,
we now discuss chiral and antichiral operators. On the left side, a chiral operator φc (z)
satisfies
G+ (z)φc (0) ∼ 0.
(I.6.9)
Note that operators φc ≡ G+ φ̃ are trivially chiral, and we define chiral operators by quotienting
by such trivial operators. Likewise, an antichiral operator φa (z) satisfies
G− (z)φa (0) ∼ 0.
(I.6.10)
One can use the right-moving supercurrents to also define chiral and antichiral operators
on the right. So we have four possibilities: (c, a),(c, c),(a, c), and (a, a). Note that PCT
symmetry implies that the number of (c, a) operators equals the number of (a, c) operators,
and the number of (c, c) operators equals the number of (a, a) operators. So one can
independently specify the number of (c, a) and (c, c) operators.
For simplicity, let us specialize to the trivial target space manifold Cn . An example of a
(c, c) operator is given by
i j̄
(I.6.11)
φ++
1 1 = kij̄ ψ ψ̄ ,
,
2 2
where kij̄ is the Kähler 2-form. This operator is (c, c) because G+ ψ i ∼ 0 and Ḡ+ ψ̄ j̄ ∼ 0.
Note that by acting with the G− supercurrents, we obtain a marginal operator that can be
added to the action of the σ-model. The relevant OPE is
G− (z)Ḡ− (z̄) φ++
1 1 (0, 0) ∼
,
2 2
1
¯ j̄ (0).
k ∂X i (0)∂X
z z̄ ij̄
(I.6.12)
1 1
1
Note that φ++
1 1 has weight ( 2 , 2 ) and J-charge (1, 1), so that L0 = 2 J0 on the left and right
,
2 2
sides.
117
The relation L0 = 12 J0 holds more generally for all chiral fields (for antichiral fields, we
Ḡ−
have L0 = − 21 J0 ). Given a general (c, c) field of weight ( 21 , 12 ), we may act with G−
− 12 − 21
¯ Such operators can be added
to obtain a weight (1, 1) operator that is neutral under J, J.
to the action and correspond to deformations of the Calabi–Yau manifold. Likewise, we
can also obtain Calabi–Yau deformations from (c, a) operators of weight ( 12 , 21 ), denoted by
−
+
φ+−
1 1 , by acting with G 1 Ḡ 1 . The (c, c) deformations are related to Kähler deformations,
,
−
−
2 2
2
2
while the (c, a) operators are related to complex structure deformations. The correspondence
associates the charge of an operator under J, J¯ with raised or lowered indices. That is, a +1
¯ charge is associated with a lowered i (ī) index while a −1 J (J)
¯ charge is associated
J (J)
with a raised i (ī) index. For example, a (1, 1)-form dzi ∧ dz̄ī corresponds to an operator of
charge (1, 1), while the tensor µīj is associated with an operator of charge (1, −1).
Note that the overall sign of J¯ is a convention. If we flip its sign, then chiral operators
become antichiral operators and vice versa (note that we always define G± to have the Jcharge given by its superscript). This ambiguity means that given an abstract CFT, it is not
possible to precisely determine the manifold. The mirror symmetry conjecture asserts that,
a Calabi–Yau CFT is equivalent to another Calabi–Yau CFT on a manifold with the Hodge
numbers h1,1 and hn−1,1 swapped [96]. The same CFT can lead to two different Calabi–Yaus.
One may immediately identify T 6 /Z3 as a counterexample. See Figure I.6.1. However, this
is an exception and generally there are mirror pairs of CY. In general dimensions, mirror
symmetry relates a Calabi–Yau manifold with its mirror manifold which has swapped Hodge
numbers hp,q and hn−p,q .
1
0
0
1
0
0
36
0
0
36
0
0
1
1
0
0
←→
0
0
1
1
0
36
0
0
0
0
36
0
1
0
0
1
Figure I.6.1: A possible counterexample to the mirror symmetry conjecture. The left hand
side is the cohomology of T 6 /Z3 . Because T 6 /Z3 has h2,1 = 0, a putative mirror manifold
would have h1,1 = 0, which contradicts the existence of the Kähler form. Yet, there exists a
sense in which mirror symmetry holds.
6.2
Supersymmetric minimal models
We will now learn more about Calabi–Yau CFTs at certain points on the moduli space.
The N = (2, 2) supersymmetric minimal models are an important part of the discussion
[97]. First, recall that the unitary non-supersymmetric minimal models admit a Lagrangian
description given by the Landau-Ginzburg theory. They organize into an ADE classification.
118
The A-series models are labeled by an integer m = 2, 3, 4, . . ., with central charge
cm = 1 −
6
.
m(m + 1)
(I.6.13)
All primaries have the same L0 and L̄0 weights, and hence are scalars. The set of weights is
((m + 1)r − ms)2 − 1
: r, s ∈ N 1 ≤ r ≤ m − 1, 1 ≤ s ≤ m .
(I.6.14)
4m(m + 1)
One can then check that there are 2m − 3 relevant operators. The Landau-Ginsparg theory
has a single scalar field with Lagrangian
¯ + g(φ2 )m−1 .
L = ∂φ∂φ
(I.6.15)
The field φ has zero classical scaling dimension but it acquires a positive quantum scaling
dimension. The relevant operators of this Lagrangian are 1, φ, φ2 , . . . , φ2m−4 . The equations
¯ which is irrelevant. Thus, the number of
of motion imply that φ2m−3 is proportional to ∂ ∂φ,
relevant operators in the Landau-Ginsparg theory equals the number of relevant operators of
the corresponding minimal model, which is evidence for the proposed Lagrangian description.
Note that in these minimal models, c < 1. This is reasonable because in the absence of
any potential, the central charge would be 1. A potential effectively kills some degrees of
freedom (the field cannot go all the way to infinity).
To obtain a Lagrangian description of the (2,2) supersymmetric minimal models, we use
supersymmetric Landau-Ginsparg theory [98]. We will write down a Lagrangian and then
flow to an IR fixed point, so that the manifest supersymmetry is in fact part of a larger
superconformal symmetry.
Also note that 2d (2, 2) supersymmetry has the same number of supersymmetries as in 4d
N = 1 theory.38 We promote the scalar field φ to a chiral superfield φ. The chiral superfield
φ(z, z̄; θ± , θ̄± ) depends on z, z̄ and Grassmann numbers θ± , θ̄± in such a way that
D+ φ = D̄+ φ = 0,
where
(I.6.16)
∂
∂
+ θ− ,
(I.6.17)
+
∂θ
∂z
∂
∂
D̄+ φ = + + θ̄− .
(I.6.18)
∂ z̄
∂ θ̄
The ± indicates the charge under a global U(1) symmetry (R-charge), which we have called
D+ φ =
38
See [99] for 4d N = 1 theory.
119
J and J¯ earlier. In order for D+ and D̄+ to have J-charges (1, 0) and (0, 1) respectively, θ+
and θ̄+ must have J-charges (−1, 0) and (0, −1) respectively. Note that
(D+ )2 = (D̄+ )2 = 0.
(I.6.19)
The action is comprised of a D-term (also called the Kähler potential) and an F-term (also
called the superpotential):
ˆ
ˆ
ˆ
2
4
2
2 +
(I.6.20)
d z d θ K(φ, φ̄) + d zd θ W (φ) + d2 zd2 θ− W (φ).
As per usual, the notation W (φ) implies that W is a holomorphic function of φ. The action
(I.6.20) is manifestly supersymmetric in terms of the supercharges Q± and Q̄± . Integrating
over the Grassmann coordinates yields a functional of finitely many fields which depends
solely on the bosonic variables z, z̄. In particular, the potential for the complex scalar φ(z, z̄)
embedded in the chiral superfield φ is given by the superpotential:
∂W
V =
∂φ
2
.
(I.6.21)
The combined central charges of the complex scalar and its superpartner is 3; it is convenient
to define
c
ĉ := .
(I.6.22)
3
to match ĉ = 1 of the supersymmetric free theory with c = 1 of the standard free boson.
Because of the superpotential, the supersymmetric minimal models we describe have ĉ < 1.
The simplest choice of W is
W (φ) = φn .
(I.6.23)
A nonrenormalization theorem says that this superpotential receives no quantum corrections
[100]. The same cannot be said of the D-term, which generically receives quantum corrections.
The flowed theory of equations (I.6.20) and (I.6.23) has central charge
ĉ = 1 −
2
.
n
(I.6.24)
For the case n = 2, the theory is a free massive theory which flows to a trivial theory in the
IR, consistent with ĉ = 0. Because (D+ )2 = 0, we can use D+ to define a cohomology. The
chiral ring is defined to be the space of (D+ )-closed fields modulo (D+ )-exact fields. Due to
the equations of motion,
∂W
D+ D̄+ φ̄ ∝
∝ φn−1 .
(I.6.25)
∂φ
120
Thus, the chiral ring is generated by the fields
1, φ, φ2 , . . . , φn−2 .
(I.6.26)
These fields correspond to (c,c) operators in the SCFT39 . In order for the F-term to be
invariant under the U (1) × U (1) symmetry, φn must have J-charge (1,1) which implies that
φ has J-charge ( n1 , n1 ). Thus, the maximum charge of a (c,c) operator in this theory is
(n − 2) ·
1
.
n
(I.6.27)
It turns out that this must be equal to ĉ: this is plausible because earlier when we considered
the Calabi–Yau σ-model we saw how the chiral operators mimic the cohomology of the target
space manifold. The maximum charge thus corresponds to the dimension of the manifold,
which is ĉ.
Lastly, we note that all N = (2, 2) minimal models also fit into an ADE classification,
which can be specified by the choice of superpotential of two chiral superfields x and y. The
classification is spelled out in Table I.6.8.
Superpotential
Classification
W = xn
An−1
n≥2
W = xn + xy 2
Dn+1
n≥0
W = x3 + y 4
E6
W = x3 + xy 3
E7
W = x3 + y 5
E8
Table I.6.8: Classification of N = (2, 2) minimal models. X and Y are chiral superfields.
39
More generally, the chiral ring is generated by the (c,c) operators in any N = 2 SCFT. The OPE between
(c,c) operators is non-singular, which allows us to consider associative products of the form φi (0)φj (0) =
cijk φk (0). The coefficients cijk define the associated algebraic structure. It does not admit an inverse, leading
to its identification as a ring and not a group.
121
6.3
Mirror symmetry in minimal models
Consider the An−1 minimal model with superpotential W = φn . It has a Zn symmetry
defined by φ → ωφ where ω n = 1. This symmetry is generated by the operator
e2πi
J0 +J¯0
2
(I.6.28)
because φ has charge ( n1 , n1 ). We believe that if we orbifold the minimal model by this Zn
symmetry, we obtain the same minimal model. This is plausible because the minimal models
have been completely classified, and in many cases the central charge alone is enough to
select a model. There are no chiral fields in the untwisted sector because none of the chiral
ring elements of equation (I.6.26) are invariant under φ → ωφ. However, there are n − 1
twisted sectors, which is exactly the number of fields in equation (I.6.26). All states in the
orbifold theory must be invariant under J0 + J¯0 . Hence, for every chiral field in equation
(I.6.26) of charge (q, q), the orbifold theory has a (c,a) field of charge (q, −q). Recall that
the designation between (c,c) and (c,a) fields is ambiguous. We say that the φn theory is
“mirror” to itself.
6.4
Calabi–Yau SCFT from minimal models
We would like to describe a σ-model superconformal field theory with a Calabi–Yau target
space. We will start with a one-dimensional Calabi–Yau, i.e. the torus T 2 , which must have
ĉ = 1. To get ĉ = 1, we will take three copies of the A2 minimal model, with chiral superfields
x, y, z. The superpotential is
W = x3 + y 3 + z 3 .
(I.6.29)
This looks just like equation (I.5.19). However, the presence of chiral fields with fractional
charges is undesirable, since these charges are supposed to also specify elements of the
cohomology. We can simply eliminate fractional charges by modding out by a Z3 symmetry
generated by
e2πi
(J0 +J¯0 )
2
.
(I.6.30)
Now, this theory has a chance of being the conformal field theory with target space T 2 . In
particular, we can deform the superpotential by adding axyz, where the coefficient a is a
complex number, since xyz is a Z3 invariant. This deformation corresponds to a complex
structure deformation. However, we cannot identify a Kähler parameter ρ from such a
superpotential. It follows that if this is the σ-model with a T 2 target space, it must be the
theory at a particular fixed value of Kähler parameter. In summary, the theory is expected
122
to have a superpotential
W =
x3 + y 3 + z 3 + axyz
,
Z3
(I.6.31)
which is invariant under Z3 . As you will show in exercise, an orbifold CFT of the form
CF T /Zn itself has a Z̃n symmetry. From this we learn that (I.6.31) has a Z̃3 symmetry. This
picks out a particular T 2 , i.e. selects where ρ must lie in its moduli space (ρ = e2πi/3 ).
Z3 symmetry sits here
−1/2
0
1/2
Figure I.6.2: The moduli space of ρ. The point with Z3 symmetry is labeled.
Recall that ρ and τ are mirror to each other (that is, they can be swapped by T-duality).
Let us swap the roles of ρ and τ so that the theory we are considering is at the Z3 -symmetric
point of the τ moduli space. We may then deduce from the exercise that if we do not
quotient the superpotential to orbifold the Z3 symmetry in equation (I.6.31), we obtain the
superconformal field theory with target space T 2 /Z3 . The fractional charges of the chiral
fields correspond to the twisted sectors of the T 2 /Z3 theory. We can see the twisted sectors
in the geometry of W .
Since we are not modding Lagrangian by Z3 , we can choose our superpotential to be e.g.
W = x3 + y 3 + z 3 + αx,
(I.6.32)
wich would give a vev to fields that arise from the twisted sector. Because these fields have
conformal dimensions less than 1, they lead to tachyonic modes. To avoid tachyons, it is
important to have the Z3 quotient in equation (I.6.31).
For higher dimensions, recall that in CPn the equation
n+1
z1n+1 + · · · + zn+1
=0
defines a Calabi–Yau (n − 1)-fold.
(I.6.33)
This suggests that we can build the corresponding
123
superconformal field theory by taking (n + 1) minimal models, each with central charge
ĉ = 1 −
2
.
n+1
(I.6.34)
The total central charge is then
c = (n + 1) ĉ = n − 1,
(I.6.35)
which matches the dimensionality of the manifold. Of course, we should also quotient by a
Zn+1 symmetry. For instance, for a quintic Calabi–Yau threefold, the superpotential is given
by
x51 + · · · + x55
W =
,
(I.6.36)
Z5
which is invariant under Z5 . This superpotential should correspond to a point on the moduli
space of the quintic manifold. The Kähler parameter must be at a point with Z5 symmetry.
Similarly as in the case of T 2 , we expect that the manifold at this point should be at the size
of the string scale.
Exercise 1: Consider any CFT with a Zn symmetry. Show (at the level of the partition
function) that the orbifold theory, denoted by CF T /Zn , itself has a Zn symmetry, which we
denote by Z̃n . Furthermore, show (at the level of the partition function) that if we orbifold
CF T /Zn by Z̃n , we get the original CFT back.
6.5
Minimal models and Calabi–Yau σ-models
Until now, we have motivated the connection between minimal models and Calabi–Yau σmodels. In this section, we will actually prove this connection. Details, including the full
Lagrangian of the gauged linear σ-model, are given in [101]. Our starting point is 2d N =
(2, 2) gauge theory with gauge group U(1). We have a single chiral superfield p of charge
−(n + 1) and n + 1 chiral superfields xi with i = 1, . . . , n + 1 of charge 1. The superpotential
is chosen to be
W = p G(xi ),
(I.6.37)
where G is a homogeneous polynomial of degree (n + 1). We also impose that G satisfies
∂i G = 0 ∀i =⇒ G = 0.
(I.6.38)
That is, the gradient of G only vanishes at the origin. (I.5.18) is one example of such a
polynomial. The potential for the complex scalar fields within the chiral superfields is a sum
124
of nonzero terms, one of which is given by40
X
D 2 ∝ e2 (
|xi |2 − (n + 1)|p|2 − r),
(I.6.39)
i
where r is a real parameter coming from the Fayet-Iliopoulos (FI) term in the Lagrangian
and e is the charge. The superpotential makes an additional contribution to the potential:
n+1
X
∂W
∂xi
i=1
2
∂W
+
∂p
2
.
(I.6.40)
The corresponding CFT sits at the IR fixed point of the RG flow. At low energies, the
potential for the complex scalars should vanish. This implies that equations (I.6.39) and
(I.6.40) must equal zero, so (I.6.40) becomes
n+1
X
i=1
|p|2 |∂i G|2 + |G|2 = 0.
(I.6.41)
We observe that G = 0 at the minimum of the potential, so (I.6.38) leads to P = 0. In order
for (I.6.39) to vanish, we choose r > 0 such that
n+1
X
i=1
|xi |2 = r.
(I.6.42)
√
Thus, the magnitude of each of the xi is positive and bounded by r. Because the U (1) gauge
group is Higgsed in this vacuum, we need to quotient out the allowed space of Xi by overall
phase rotations. The space of vacua is given by {xi } ∈ Cn+1 subject to the identification
(x1 , . . . , xn+1 ) ∼ λ(x1 , . . . , xn+1 ),
λ ∈ C,
(I.6.43)
as well as the restriction
G = 0,
(I.6.44)
which is precisely the defining equation of a Calabi–Yau in CPn . The superpotential is
invariant along the RG flow, so we conclude that the IR fixed point of this gauge theory is
the supersymmetric σ-model with Calabi–Yau target space. This is the (n − 1)-dimensional
generalization of the quintic threefold.
Because r fixes the overall size of the xi , r corresponds to a Kähler parameter that sets
the size of the Calabi–Yau. If we want the σ-model to be weakly coupled (so that stringy
40
In this section, we use Xi and P to denote both the chiral superfields and their complex scalar components.
125
corrections to the geometry are small), we must take r large. Recall that for the quintic
Calabi–Yau, h1,1 = 1. We need another real parameter to combine with r to form the
complexified Kähler parameter; the θ parameter of the gauge theory fills this role. The
complex Kähler parameter ρ is then given by
ρ = θ + ir.
(I.6.45)
We now want to study the case of r being negative. If θ 6= 0, we still have nonzero ρ as r
passes through zero. If r is negative, then from equation (I.6.39) we see that p can no longer
be zero, so p acquires a vev. Because p has charge −(n + 1), the U (1) gauge symmetry is
broken down to Zn+1 , which is just the orbifold symmetry of the Landau-Ginsparg theories
we studied earlier. Since we set the superpotential to equation (I.6.37), we see that the theory
at r → −∞ becomes the Landau-Ginsparg theory studied earlier (compare with equation
(I.6.36)). We take the limit r → −∞ so that the contribution of the Xi fields to the vev of P
is negligible; it is only in this limit that we obtain the Landau-Ginsparg theory with the Z̃5
symmetry discussed earlier. Clearly, the opposite limits r → ∞ and r → −∞ probe different
regions of the ρ moduli space of the Calabi–Yau σ-model.
To summarize, the CFT on the quintic manifold at r → −∞ is the Landau-Ginsparg
theory of (I.6.36). We know from earlier that the orbifold Landau-Ginsparg theory with
superpotential
x5
W = 1
(I.6.46)
Z5
is the same as the Landau-Ginsparg theory with superpotential
W = x̃51 .
(I.6.47)
It trivially follows that the Z55 -orbifold theory with superpotential
x51
x5
x5
+ 2 + ··· + 5
Z5 Z5
Z5
(I.6.48)
W = x̃51 + x̃52 + · · · + x̃55 .
(I.6.49)
W =
is equivalent to the theory with
To get the quintic theory, we must mod out (I.6.49) by a diagonal Z5 . This is equivalent to
ungauging one of the Z5 symmetries in equation (I.6.48). If αi denotes the five different phase
rotations that make up the Z5 symmetries in equation (I.6.48) (αi5 = 1), then the quintic
126
theory corresponds to the theory of equation (I.6.48) with the additional restriction that
α1 α2 α3 α4 α5 = 1.
(I.6.50)
That is, to get the quintic theory we can start off with five copies of the W = x51 theory
and then orbifold by a Z45 symmetry defined by equation (I.6.50). We can write this Z45
J0 +J¯0
symmetry as a Z35 symmetry times one generated by e2πi 2 , which is the Z5 symmetry
that corresponds to α1 = α2 = . . . = α5 . This means that the CFT on the quintic is
isomorphic to the same theory orbifolded by the Z35 symmetry, i.e. with superpotential
W =
[x51 + x52 + · · · + x55 ] /Z35
e2πi
J0 +J¯0
2
.
(I.6.51)
The conclusion is that the quintic manifold mod Z35 is mirror to the quintic.
Exercise 2: Show that this theory admits just one complex structure deformation, given
by
αx1 x2 x3 x4 x5 .
(I.6.52)
Thus, h1,2 = 1 as it should be, since h1,1 = 1 for the quintic threefold. The 101 Kähler
deformations for the quintic mod Z35 manifold can be found by resolving its singularities.
7
Black Holes and holography
7.1
Black holes in string theory
We now want to try and use our microscopic understanding of branes in string theory to
unravel some of the mysteries of gravity, and in particular of black holes.
Black holes from wrapped branes
Black holes display features of thermodynamic systems, as studied by Bekenstein and Hawking.
In particular, in 4 spacetime dimensions, the thermodynamic entropy of a black hole is
proportional to the area of its horizon, [102–104]
S=
A
,
4
(I.7.1)
where the factor of 1/4 is universal. Recall that we are working in Planck units.
This result, which follows from a semiclassical field theory analysis, is perhaps somewhat
127
surprising since from a classical viewpoint black holes solutions have zero degrees of freedom.
Moreover, the semiclassical entropy calculation seems to suggest that black holes in fact
comprise a huge number of degrees of freedom. This leads to basic puzzle as to their origin.
While in general this is not known, string theory offers a potential solution: the degrees of
freedom of the black hole can be hidden in the extra compactified dimensions, for example
as branes wrapping cycles in some internal geometry.
Black holes in 3+1 dimensions in general are not well understood. However, there exist
supersymmetric versions of black holes, called BPS black holes, which have certain properties
protected by supersymmetry, and are therefore more straightforward to analyze [105]. Due
to supersymmetry, these properties can also obey rigid constraints. For instance, an extremal
BPS black hole in 3+1 dimensions must have a fixed mass-to-charge ratio, i.e.
M = |Q|.
(I.7.2)
As black holes are particle-like (i.e. 0+1 dimensional), we can attempt to construct them
by considering Dp branes wrapped on p-cycles. For example, one can consider a D3 brane
wrapped on C × S 1 , where C is a Riemann surface. In string theories compactified on some
five-dimensional manifold M5 with C × S 1 ⊂ M5 , such as M5 = K3 × S 1 with C ⊂ K3, the
wrapped branes behave as particles propagating in the five noncompact dimensions. From
the Kaluza-Klein reduction, the mass of the particle must be proportional to the genus of the
Riemann surface, and so for surfaces with sufficiently many handles, the particle becomes
very heavy and looks like a black hole. It turns out that the microscopic entropy of such
particles precisely matches the macroscopic entropy of black holes as given by Bekenstein
and Hawking [105].
Near-horizon limit of branes
If we zoom in very close to the particle (black hole) in the large dimensions, we observe a
horizon associated to a 5d geometry of the form
AdS2 × S 3 .
(I.7.3)
In fact, the emergence of AdS geometries in the near-horizon limit of black holes was well
known long before they were considered in the string theory context. In this case, the presence
of the heavy D3 branes backreacts with the surrounding geometry, curving it into a space
that looks like AdS near the horizon.
Let us instead start with a stack of N Dp branes in type IIA or IIB string theory. In
general, the presence of the branes backreacts with Minkowski space to give a new geometry
128
with metric [106]
1
1
ds2 = H(r)− 2 ds21,p + H(r) 2 ds20,9−p ,
(I.7.4)
where d2p,q is the flat Lorentzian metric in signature (p, q). It naturally splits into a contribution
to the metric parallel and perpendicular to the brane worldvolume. The warp factor H(r)
depends on the distance r perpendicular to the stack of branes and is given by
H(r) = 1 + c
gs N
.
r7−p
(I.7.5)
where gs is the closed string coupling and c is a constant. It also contributes to the dilaton
VEV e−2φ as
p−3
e−2φ = gs−2 H(r) 2 .
(I.7.6)
There are two main limits to consider. As r → ∞, that is, as one is sufficiently far away
from the stack of branes, the warp factor tends to H → 1. Consequently the metric becomes
flat and it is no longer possible to resolve the effects of the backreaction. On the other hand,
as one approaches the branes, in the limit where r → 0 (near-horizon limit), H → cgs N rp−7 ,
and the string coupling generically begins to run unless p = 3.
However, for the stack of D3 branes, the string coupling remains constant but the metric
is modified. In the near-horizon limit, the metric (I.7.4) reduces to
ds2 = √
p
1
r2 ds21,3 + cgs N r−2 ds20,6 .
cgs N
(I.7.7)
We can always rewrite the perpendicular part of the metric in spherical coordinates,
ds20,6 = dr2 + r2 dΩ25 ,
(I.7.8)
where dΩ25 is the metric of the unit 5-sphere. Notice that the the factor of r−2 in front of the
6d metric cancels the r2 in the spherical metric. In other words, in the near-horizon limit we
reach a point where the size of the S 5 does not shrink anymore. Overall, the metric becomes
p
p
1 2
1
2
2
√
r ds1,3 + cgs N 2 dr + cgs N dΩ25 ,
(I.7.9)
r
cgs N
which describes none other than the AdS5 × S 5 spacetime, where both the AdS and sphere
√
radii are given by cgs N .
Exercise 1: Show that we can write the AdSd+1 metric as
(ds2 )d+1 = r2 ds21,d−1 +
129
1 2
dr ,
r2
(I.7.10)
where
(ds2 )1,p+2 =
−dt2 + d~x2 + dy 2
.
y2
(I.7.11)
In addition to the AdS5 × S 5 geometry, we also need to consider the effects of the R–R
fluxes. The presence of the D3 branes indicates that there is an F5 flux, which in this case is
turned on along the S 5 as
ˆ
F5 = N .
(I.7.12)
S5
Unsurprisingly, the N D3 branes lead to N units of five-form flux. This looks like type IIB
theory compactified on an S 5 , which we did not discuss previously because S 5 does not admit
Killing spinors due to the fact that it is not a special holonomy manifold. However, in our
previous analysis we restricted our search to Minkowski vacua without fluxes. It turns out
that if you turn on fluxes and take positive curvature, the it sometimes happens that one
can get supersymmetric AdS solutions.
7.2
Holography
Holography from D3 branes
We just discovered that a stack of branes backreacts with the geometry, leading to a spacetime
that looks like AdS in the near-horizon limit. However, from the string theory perspective we
also know that the fluctuations of the brane are described by open string degrees of freedom
living on its worldvolume, which in the low energy limit admits a gauge theory description.
This led Maldacena to conjecture that these two descriptions are in fact equivalent . In
particular, the conjecture states that [107]
N = 4 super Yang–Mills
←→
Type IIB on AdS5 × S 5 ,
(I.7.13)
i.e. that the degrees of freedom living on the left and right are equivalent. This is known as
the holographic duality, or holography for short. The 4d spacetime where the gauge theory
lives is the boundary of the AdS5 and the information of the gauge theory on the boundary
and the gravitational theory in the bulk are supposed to be identical.
Strictly speaking, by boundary we mean the conformal boundary of AdS5 . This tells us
that any metric in the same conformal class should give identical results for the boundary
theory. In fact, the Euclidean AdS5 isometry group SO(1, 5) acts as the conformal group
on the boundary, and so the gauge theory is also a CFT. This had to be the case, otherwise
conformal transformations would not act faithfully on observables in the theory.
There is a rich history matching observables on both sides of the duality. The boundary
130
theory lives on S 4 . Every local CFT admits a local spin 2 (symmetric, traceless rank 2 tensor)
conserved current T µν (x), i.e. the stress tensor. Its correlation functions thus constitute a
universal set of observables in any CFT. In the dual bulk (gravitational) theory, it is natural
to consider gravitons scattering processes. While asymptotically AdS spacetimes do not
admit an S-matrix construction, scattering amplitudes are instead captured by correlation
functions of boundary operators sourcing fields in the bulk. In this case, the stress tensor
correlation functions compute amplitudes of gravitons traveling to and from the boundary
and scattering in the bulk. This logic extends to other observables on the two sides of the
duality, and there is a precise dictionary of how to relate gravitational computations in the
bulk to correlation functions in the gauge theory.
Conifold geometries
Let us now return to the brane description. In the backreacted geometry, there are two
spheres of interest, namely the S 4 that the D3 branes wrap and the S 5 where the fluxes live.
Far away from the branes, the S 5 can be shrunk to a point, whereas the S 4 corresponds
to the worldvolume of the D3 branes and is fixed. Going to the branes, we have that the
S 5 is stabilized by the five-form flux and so has finite size. Meanwhile the S 4 gets pushed
to the boundary and becomes trivial. In other words, the branes get “pushed to infinity”
and are nowhere to be found in the AdS bulk. In this sense the near-horizon limit has a
geometric interpretation as exchanging the fixed sphere with the one allowed to shrink to a
point. We can represent this transition as a cone with base S 4 × S 5 . Before the transition,
the S 4 is smoothed out by D3 branes, while after the S 5 is smoothed out by the fluxes. This
is depicted diagrammatically in Figure I.7.1 [108].
brane
flux
S5
S5
S4
S4
Figure I.7.1: Two diagrams depicting S 4 × S 5 geometries, where the branes reside on the S 4
and the flux on S 5 . The diagrams are related via holography due to the backreaction of the
branes on the geometry. Note that S 4 is the boundary of the AdS5 .
We can repeat these exercise relating spheres in different settings, such as for M2 branes
and M5 branes in 11d flat space. Recall that an M2 brane has a three-dimensional worldvolume
131
and couples electrically to the three-form gauge potential C3 . It can wrap an S 3 , whie the
eight transverse directions contain an S 7 . In the ambient space, the S 7 is trivial whereas the
S 3 is fixed, while in the near-horizon limit, the S 7 is stabilized by the seven-form flux ⋆dC3 ,
leading to an AdS4 × S 7 geometry. This transition is depicted in Figure I.7.2.
M2-brane
flux
S7
S7
S3
S3
Figure I.7.2: AdS4 × S 7 geometry the branes on AdS4 and fluxes on S 7 , where
´
S7
⋆G = N .
We can play the same game with M5 branes, where the 4 and 7 are essentially swapped.
The conifold geometry again takes the form of a cone with base S 6 × S 4 , where now the
branes wrap the S 6 and S 4 lives in the transverse directions. After taking the near-horizon
limit, we find an AdS7 × S 4 geometry with S 4 stabilized by the four-form flux dC3 . This is
represented in Figure I.7.3.
M5-brane
flux
S4
S4
S6
S6
Figure I.7.3: AdS7 × S 4 geometry the branes on AdS7 and fluxes on S 4 .
Chern–Simons
µ
S2
S2
S3
S3
Figure I.7.4: S 3 × S 2 geometry with Chern–Simons theory on S 3 yielding a conifold with
µ = N gs .
132
There are also similar effects that arise in the context of topological strings [108]. Recall
the conifold construction with base S 2 × S 3 . Adding N ≫ 1 branes to the topology theory
gives a Chern-Simons theory in target space, as represented in the left diagram of Figure
I.7.4. Here, the left diagram corresponds to the blown-up singularity, while the right diagram
corresponds to the deformed singularity. In the right diagram, we find a large N dual theory
where the original branes are replaced by a size µ = N gs .
AdS and positive curvature spaces
In all of the brane constructions considered so far we have an AdSp+1 × S q spacetime in the
near-horizon limit. One could imagine scenarios where the S q is replaced by another compact
manifold. However, it turns out that all such replacements must have positive curvature
Negative Λ means we need to have the curvature of the internal manifold to be positive.
With negative Λ, we have an anti-de Sitter space.
Exercise 2: This exercise considers the case of AdS5 × S 5 . First we compactify type IIB
theory on an S 5 (use D3-branes) with the 5-form R–R flux, which is self-dual, that has N
units around the S 5 :
ˆ
5
G5 = N,
(I.7.14)
S
where the internal size will be Rint ∼ 1/r2 , which determines the cosmological constant to
be Λ ∼ −1/r2 . Now we have to go to the Einstein frame in (4 + 1)-dimensions and consider
the potential V that depends on r such that
V (r) = −Λ(r).
(I.7.15)
Then compute the potential V (r) of the theory. The potential V (r) will then look like in
Figure I.7.5, where its minimum is at r = rs with
V (r) =
A
− Brb ,
a
r
(I.7.16)
´
´
where the first term comes from |G|2 and the second term from R both in the compact
manifold. Then more precisely, find a and b for such a potential and show that
rs ∼ (gs N )1/4 .
(I.7.17)
[Hint: The curvature of the radius is related to rs . When gs → 0, the AdS5 shrinks and only
see the boundary S 4 .]
133
V (r)
∼ r−a
rs
∼ r−b
r
´
Figure I.7.5:
the sum of two terms VG (r) ∝ |G|2
´ The potential of the theory is given by 1/4
and VR ∝ R, which has its minimum at rs ∼ (gs N ) where the radius is stabilized.
Optional exercise: Do the same computation for the M-theory compactification.
These are some examples of holographies we covered. In fact we can have more broad
versions of holography and it requires at least a relaxed version of the Einstein’s equation.
The nicest thing to have is when
Rij ∝ gij ,
(I.7.18)
such as Sasaki–Einstein manifold. In some sense these manifolds are related to Calabi–Yau.
For example in the example we covered we had S 5 . The boundary of S 5 is C3 and when
orbifolded, this can be a Calabi–Yau manifold:
C3 /Γ −→ S 5 /Γ.
(I.7.19)
Then by acting with orbifolds on the holographic side, we get new theories. Interestingly, all
the Calabi–Yau manifold we get via holography is noncompact.
134
Part II
The Swampland program
1
1.1
Introduction to Swampland program
Basic features of quantum field theories
We begin with a quick review of what constitutes a well-defined quantum field theory (QFT).
QFTs are powerful theories that allow us to compute various physical quantities with a wide
range of available perturbative and non-perturbative techniques. Whenever we talk about
a QFT, in addition to the computational machinery, we are also thinking of an underlying
mathematical structure (e.g. the space of operators and their algebra) that satisfy some
fundamental principles such as unitarity. The list of expected criteria that a good QFT
must satisfy could be long, but in low dimensions such as d ≤ 4, we have developed a good
understanding of what constitutes a ”healthy” QFT. In the following, we will review some
of the most important features of ”good” QFTs.
A quantum field theory usually comes with an action. The action S[Φ] is a classical
functional of the local fields in the theory. The quantum field theory builds upon the classical
theory by associating physical quantities with path integrals
ˆ
hO(Φ)i ∼ DΦe−S[Φ] O(Φ).
(II.1.1)
To write down the action, we usually start with a symmetry principle and find an action
that respects that symmetry (spacetime symmetries, gauge symmetries, etc.). However, the
symmetries do not always hold or even make sense at the quantum level. When that happens,
we say the symmetry is anomalous, and if the gauge symmetry is anomalous, the theory
is inconsistent. Local anomalies in four dimensions are associated with triangle Feynman
diagrams.
135
A simple dimensional analysis shows that anomalies in D dimensions correspond to
diagrams with 1 + D/2 gauge boson legs. For example, in 10 dimensions, the relevant
diagrams are hexagons with six external gauge boson legs. One can use the tetrad formalism
to view gravity as a gauge theory where the gauge group is the local Lorentz group and the
gauge bosons are the spin connections. Therefore, the six-point gravitational amplitude is
related to the breakdown of general covariance. This is called gravitational anomaly and
must vanish in a consistent theory of gravity. We will talk more about such anomalies in
supergravity backgrounds.
Another important feature of QFTs is the dependence of physical quantities on the
energy scales. This dependence is often captured by the RG flow. There is a class of
renormalizable field theories that become free theories at high energies. These theories are
called asymptotically free and we can describe their UV completion without appealing to a
cut-off or anything beyond QFT. Another group of such promising QFTs are scale invariant
theories.
For a generic QFT we are not this lucky and we usually need to define a cut-off. Depending
on whether the theory is renormalizable or not, we need finite or infinite number of parameters
to define the theory below the cut-off. This approach is called effective field theory (EFT).
The idea is that we can capture the physics at a certain energy scale by an effective field
theory and if we are lucky enough (the theory is renormalizable) we can find the corresponding
effective theory at other energy scales below the cutoff as well. We will come back to this
approach later.
1.2
Quantum gravity vs quantum field theory
So far we talked about ”good ” QFTs. Now we turn to an important question; what is the
role of gravity here? If we add it to a good field theory, could it still be a good QFT?
The idea to incorporate gravity into QFT is to view the metric as a field that interacts
with other fields and proceed with the quantization procedure. For example, the action of
136
a free massless scalar can be covariantized to the following action that includes interaction
with the gravitational field (metric gµν ).
1
√ R
d4 x g[ + ∂µ φ∂ν φg µν ],
2κ 2
M4
ˆ
S[φ] =
(II.1.2)
where κ = 8πG. If we decide to follow the path integral formulation II.1.1 for the above
action, we need to mod out by the diffeomorphisms. Schematically, we can write that as
ˆ
Dg
DΦe−S[Φ,g] .
Vol[Dif fM ]
(II.1.3)
But does this formula make computational sense? Feynman tried to use the tools of perturbative
QFT for gravity but he quickly ran into problems [109]. Let us demonstrate those problems.
Every theory has relevant and irrelevant operators depending on whether the coefficient of
the corresponding operator goes to ∞ or 0 as the energy scale decreases. If the theory
does not have irrelevant operators, we are in a good shape. For example the coupling
constants in QCD is defined by a specific relevant operator and the rest of the observables
are calculable from that coupling constant, which can be traded with an energy scale by
dimensional transmutation.
For gravity, the effective coupling for the graviton scattering goes like
E2
gef f (E) ∼ 2 .
MP
2
(II.1.4)
E
Where MP is the reduced Planck mass41 . This is different from the usual logarithmic
energy dependence 1/g(E)2 ∼ ln(E) and becomes strong for E >> MP . One might be
tempted to take the EFT approach explained earlier and input enough coupling constants
from experiment to find couplings at other energy scales. However, it turns out you have
infinitely many vertex operators with UV-divergent amplitudes that need to be kept track of.
41
Planck units conventions: lP
mp /(8π)1/(D−2) , TP = mp c2 /KB
=
(~G/c3 )1/(D−2) ,
137
tP
=
lP /c,
mP
=
~/(clP ),
MP
=
In other words, our theory needs infinitely many parameters as input and is not predictive!
So QFT+gravity, at least in the most naive sense, seems to be problematic. But where is
the problem really coming from?
An important remark before we go further:
As long as we are dealing with energies below the Planck scale E << MP , the changes in
the couplings are small and it is reasonable to believe that we have a nice classical effective
field theory. In other words, the Planck mass MP introduces a natural scale beyond which
the EFT should start to break down.
1.3
Why is gravity special?
To see why the canonical method of QFT could not have worked for gravity we need to take
a detour through the realm of black holes.
Black holes are solutions to the classical equations of motion for gravity. Large 4d black
holes of mass M have radius of ∼ M lP /MP and the spacetime outside their horizon is weakly
curved RlP2 . MP2 /M 2 for M ≫ MP . Since the curvature is very small (RlP2 ≪ 1) we can
think of black holes as IR backgrounds where Einstein’s classical equations are reliable.
rH ∼
M
l ,
MP P
2
Area ∼ rH
∼
M2 2
l ,
MP2 P
1
2
rH
∼
MP
M
T emperature ∼ κ ∼
MP
M
TP ,
Curvature ∼ R ∼
Entropy ∼ Area ∼
2
lP−2 ,
4d black hole
M2
,
MP2
Black holes in d = 4 are very simple in the sense that a classically stationary black hole
is described by only three parameters: mass, charge, and angular momentum. Bekenstein
speculated that there should be more degrees of freedom for black holes. Otherwise, by
dropping a thermal system with entropy into a black hole we can decrease universe’s entropy
and violate the second law of thermodynamics. In 1971 Hawking had shown the overall
areas of the black holes horizons always increases in collisions [110]. This result sounded very
similar to the second law of thermodynamics to Bekenstein and it motivated him to propose
138
that black holes must carry an entropy proportional to the area of their horizons [111]. If
black holes really have entropy, we can also define a temperature for them according to the
first law of thermodynamics dE = T dS. Hawking managed to find the temperature of black
κ
in
holes by showing that they emit an almost thermal radiation with temperature T = 2π
Planck units where κ is the surface gravity of black holes [104, 112]. This equation also fixed
the proportionality constant between black hole’s entropy and area and lead to the following
equation in Planck units.
S=
A
.
4
(II.1.5)
This single equation is the starting point of many of the strange aspects of quantum
gravity! This equation implies that despite the classical uniqueness of black holes, there must
be a huge number of degree of freedom represented by a black hole of mass M. In particular,
this implies the number of high energy bound states grows as exp(S(E)) ∼ exp(cE 2 ) for
some constant c.
But the fact that IR object such as large black hole have so much information about very
UV states is very strange. It implies that extremely low energy physics (i.e. large black
holes) and extremely high energy states somehow know about each other which is completely
contrary to the EFT perspective. This UV-IR dependence is a remarkable failure of UVIR decoupling used in EFT. With this knowledge, it is much easier to see why Feynman’s
approach to quantize gravity could have never worked, because the premise of EFT and
renormalization is to neglect the UV physics by renormalizing IR parameters, but a quantum
treatment of gravity even at large distances requires incorporating the high energy degrees of
freedom. Another reflection of this is that scattering of gravitons at very high energies (UV)
proceeds via large intermediate black holes (IR).
Exercise 1:
Part 1: Find the relationship between the mass of the black hole and the radius of
its horizon in any dimension d > 3. Numerical coefficients are not necessary. Just find the
correct power law dependence.
Hint: You can use find the exponent of r in gtt of higher dimensional Schwarzschild
solution using gravitational Green’s function in higher dimensions.
Part 2: Assuming that entropy S of a black hole of mass E grows like the volume of
the horizon, find a such that S(E) ∼ (E/mP )a and show that a > 1 in d > 342 . Argue
42
Many of the equations and inequalities break down in d ≤ 3 because the gravitational degrees of freedom
139
that in quantum gravity (QG), the number of single particle states (number of states with
hX i i = hP i i = 0 for an arbitrarily small but fixed IR cut-off) of mass M must grow like
a
ρ(M ) & eb(M/mP ) for some constants b > 0 and a > 1.
Part 3: Show that in any (non-gravitational) theory where the theory at any finite
energy range is described by an EFT, ρ(M ) . exp bM for some constant b > 0. Note that
we are assuming there is no energy scale where EFT approach does not work, so that the
high energy limit makes sense.
Hint: consider the thermodynamic partition function of this theory in a big box at small
temperatures.
Exercise 2:
exercise?
How do you think gravity could avoid the upper bound of the previous
From the exercises, we see that Hawking’s semi-classical calculation has dramatic implications
and that QFTs are very different from QGs. Moreover, it seems that black holes play a key
role in highlighting that difference. As we will see in the course, black holes are the star of
the show in many aspects of quantum gravity.
Luckily, we do not have to rely entirely on semi-classical celculations to study QG because
we already know some examples of QG. An existing consistent theory of quantum gravity is
string theory. The reason we think it is consistent is that we have consistent perturbative
descriptions of it in addition to non-perturbative dualities that relate those perturbative
descriptions to each other. So string theory is at least ”an” example of QG. But string
theory is not a theory of particles and this suggests another explanation for why Feynman’s
argument failed.
The fundamental classical objects in string theory are extended objects such as strings.
These strings can oscillate in different ways and the amplitudes of these oscillations can be
viewed as different degrees of freedom. This suggests, the degrees of freedom of a black
hole cannot be entirely comprised of particles, but we also need stringy oscillations. String
theory is strange in that it is unusually specific. For example super string theories must
be ten dimensional. Moreover, the set of theories is very restricted as opposed to EFTs
where we have so much freedom in choosing the gauge group, dimension, matter content,
etc. Also, supersymmetry seems to be more than a nice accessory in string theory since the
only well-understood stable examples of string theory are supersymmetric!
One might wonder how could a ten dimensional theory describe our four dimensional
universe? One answer is that the extra dimensions could be compact and small and there are
are topological.
140
limited options for how they could look. We will discuss this in more detail in the course.
Question : We do not see low-energy supersymmetry in our universe. So if supersymmetry
is a necessary feature of string theory are we not in trouble?
Luckily, supersymmetry is not required and can be broken in string theory. However,
usually the breaking of supersymmetry comes with losing some sort of controlablity. For
example, you lose stability in the sense that the value of the coupling constants may vary
over time. Also applied to our universe, which seems to be non-supersummetric, string theory
suggest our universe should decay in some sense. We will talk more about this in the course.
1.4
Problems of treating gravitational theories as EFTs
Let us review some of the basic features of the EFT approach. In the EFT approach, we
start with a symmetry and write an action that includes all the relevant terms that are
consistent with that symmetry. EFTs usually come with a cutoff based on the premise that
the low-energy physics can be described independently from the high energy physics. This
principle is called UV/IR decoupling .
We typically assume that the coupling constants are of order one in the appropriate mass
scale of the theory. This principle is called naturalness. Naturalness implies that if some
of these parameters is unusually small or large, there has to be a good explanation for it, e.g.
a missed symmetry. This is a cherished principle in particle physics.
The combination of UV/IR decoupling and naturalness is very powerful and has lead to
many successful predictions in standard model which is why particle physicist have so much
confidence in these principles.
In the following, we list some of the important questions that arise due to tensions between
experimental observations and the principles of the EFT approach.
Question 1: Why is dimension of spacetime four in our universe?
This is a question that is not commonly asked. However, from the naturalness point of
view, it is strange that we are living in such a small dimension. If every dimension from 1 to
∞ is allowed, d = 4 seems a very unnatural choice.
One potential answer could be that QFTs have special properties in low dimensions,
especially 4. For starters, we do not even know of any consistent UV complete QFT in more
than 6 dimensions. So there could be some truth to this argument.
141
Question 2: Why is the rank of the standard model gauge group so small?
Again, if every rank is allowed, why 4? An EFTheorist might view this as a question
about naturalness since we usually fix our symmetries and then proceed with finding the
right action. Nonetheless, this is an interesting question with no clear answer from the field
theory point of view.
Question 3: Suppose we have fixed the dimension to 4 and the gauge group to U (1) ×
SU (2) × SU (3). Why are the representations of the gauge group that appear in our universe
so small (i.e. fundamental and adjoint)?
One might argue that this could be related to asymptotic freedom since including arbitrarily
large representations change the sign of the beta function and destroys asymptotic freedom.
However, from the EFT perspective, why should we care about asymptotic freedom? Even if
our theory is not asymptotically free, we can always put a cutoff and proceed with calculation
as long as the theory is renormalizable. Furthermore, from string theory, we have examples
that show us asymptotic freedom for QFT is not a requirement for UV completion. So in
quantum gravity, asymptotic freedom is not a good guide and it cannot explain the smallness
of representations.
Question 4 (the hierarchy problem): Why is the vev of the Higgs field so much
smaller than MP ?
There is no principle for the mass of the Higgs field to be so small compared to the cut
off of the theory. EFTheorists have tried to find a symmetry-based explanation for this. For
example, one proposal is to explain via weakly broken supersymmetry (SUSY) since SUSY
implies non-renormalization theorems that prevents m from running to O(MP ). However,
none of the explanations so far have been quite successful in providing a natural explanation
for this puzzle while keep being compatible with incoming experimental results.
Question 5 (CC problem): From the EFT perspective, there is no apriori reason
for the cosmological constant Λ to not be of the order of Λ2EF T where ΛEF T is the cutoff.
However, in our universe, we have Λ ∼ 10−122 MP4 ! 10−122 is not order one by any means.
A popular solution to remedy this problem in the EFT picture is the Anthropic argument.
The argument roughly goes like this: If Λ were much greater than its measured value, the
expansion of the universe would be too fast for large gravitationally bound structures such
as large galaxies to exits. Without large galaxies, there would be no star formations and
heavy elements hence life as we know it could not exist. So if there were many many possible
142
theories (universes) with different values of Λ, the existence of the humankind that poses
this question, already puts an upper bound on Λ which is just a few orders of magnitude
higher than its measure value. Weinberg estimated this upper bound before Λ was measured
and the upper bound turned out to be just couple of orders of magnitude away from the
measured value [113]. This is a powerful example of scientific methodology. However, is this
argument enough or there is a more fundamental reason for the smallness of Λ? Maybe, or
maybe not...
Question 6 (another hierarchy problem): Why are the masses of some particles
orders of magnitudes different from each other? For example, the mass of neutrinos are
very small compared to the Higgs mass. Interestingly, the mass of the neutrinos is almost
mν ∼ Λ1/4 in Planck units where Λ is the cosmological constant. Is this a coincident?
Question 7 (dark energy): The common interpretation of dark energy is the value
of the scalar potential. If we are stuck in a local minimum of the potential, we must have
∇V (Φ) = 0. The existing experiments can only tell us that |∇V |/V is smaller than some
O(1) constant. So, if |∇V | is not zero, it must be extremely small and finely tuned which
would pose another naturalness problem.
V (φ)
|V ′ |
V
< O(1)
Λ
Our
universe
now
φ
Question 8 (a version of coincidence problem): The age of the universe is around
the natural timescale associated with cosmological constant O(Λ−1/2 ) in Planck units. Is
that a coincident or there is an explanation to it?
Question 9 (strong CP problem): From the EFT perspective, it is natural to add
a θF ∧ F term to the Lagrangian. This term would violate CP and have experimental
consequences. Experiment suggests |θ| < 10−10 which is unnaturally small. Peccei and
143
Quinn tried to explain this smallness by promoting θ to a dynamical variable (axion) that is
dynamically fixed. Promoting θ to a field is well-motivated by quantum gravity, but it still
does not explain why it should stabilize at such a small value.
Question 10 (homogeneity): How did the universe become so homogeneous with an
anisotropy that is so scale-invariant?
Inflation seems to be a contender to explain this observation. The premise of inflation
is that there was a long (compared to Hubble time) period of exponential expansion in the
early universe that homogenized the observable universe. However, the conventional models
to realize inflation are usually in tension with EFT. Inflation typically requires a potential
that is very flat (|V ′ | ≪ V ) over very long field ranges ∆φ ≫ MP which needs fine tuning
and is unnatural.
V (φ)
|V ′ | ≪ V
∆φ ≫ MP
φ
The question we would like to study in this course is that: Could quantum gravity (QG)
shed light on these problems by considering the impact of UV degrees of freedom?
So far, the only example of quantum gravity that we know of is string theory. It is
reasonable to use it to get as much insight about QG from it as possible. We can study
the above questions in some controlled examples in string theory to see if there are general
patterns.
String theory setups usually consist of a highly constrained higher dimensional theory
living in a 10 dimensional spacetime with some compact dimensions and some non-compact
dimensions. To model our 4d universe, we would need six compact dimensions.
144
Compactification
10d String theory
M
compact
×
R4
non-compact
4d theory
Depending on the choice of the compact manifold M , we find different 4d EFTs M →
EF T (M ). The geometric properties of M are reflected in physical properties of EF T (M ).
For example, as we will explain later, the easiest cases that we can study are supersymmetric
EFTs that typically correspond to Calabi–Yau manifolds M . It turns out that if we fix
the cutoff of the EFT, the number of known different supersymmetric theories that that
arise in QG is finite! This is very different from the typical EFT perspective where we have
continuous adjustable parameters that give us a continuous spectrum of theories.
EFTs in string
theory
Figure II.1.1: Space of ”good” EFTs.
As we will see, these examples are usually jammed in some specific corners of the theory
space. This observation motivates us to think there is an underlying fundamental reason
behind the patterns that we see. That maybe a theory of quantum gravity must always
follow specific criteria that are not obvious from EFT perspective. However, we should be
145
cautious that we might be misguided by the limited size of our supersymmetric sample set.
This is why it is important to support any observed pattern by some more reasoning that
bears on more basic physics (e.g. unitrarity or black hole physics). It is important to point
out that typically both the observation of patterns and the supporting reasoning should be
viewed as motivations and not a proof. In order to prove something robust about quantum
gravity, we first need to have a much clearer understanding of what quantum gravity is. Even
though we are not there yet, that is certainly the final goal.
1.5
The Swampland program
The discreteness of the set of consistent theories makes it difficult to discern whether a given
EFT has gravitational UV completion or not. In fact, you would need to measure the physical
parameters with infinite precision to do that. This makes it so much more difficult to say if
a theory is consistent with QG than whether it is not. The idea of the Swampland program
is to rule out the inconsistent theories rathen that pin point the consistent ones.
Swampland and Landscape: the EFTs that are consistent based on EFT reasoning (no
anomalies, etc...) but do not have a QG UV-completion are said to be in the ”Swampland ”
while the ones that do are said to belong to the ”Landscape”.
Example: As we will see in the class, the N=4 super Yang–Mills in d = 4 with a gauge
group of rank greater than 22 is in the Swampland [114].
The Swampland Program
Finding criteria that ensure a theory belongs to the Swampland using universal observations
in string theory as well as arguments based on more basic physics (unitarity, black hole
physics, etc.).
Note that, by definition our universe is in the Landscape. So finding criteria that cut
away corners of the theory space from the Landscape could lead to direct predictions about
our universe.
There are two different definitions for the Landscape. Suppose we call the previous
definition the QG-Landscape, there is also a string-landscape which corresponds to the set
of EFTs that are realized in string theory. Right now, since the only known well-defined
quantum theory of gravity is string theory, there seems to be no distinction between them.
However, generally String Landscape ⊂ QG Landscape. Recently, there has been more and
more evidence emerging in support of the equality of the two sets. The conjectural equality
of the two sets is often called the ”string lamppost principle (SLP)” (also called string
146
universality). If string theory turns out to be a tiny subset of the landscape, we are in a bad
shape because it would be very difficult to find correct Swampland conditions. However, as
we will see in the course, at least with enough supersymmetry, this does not seem to be the
case.
Some of the Swampland conditions that we will discuss are also motivated by black hole
physics but some others just have string theory backing. The former group thus have more
evidence to be true. In addition to these two sources, the overlaps and consistencies between
the different conditions is another source of assurance of their mutual validity.
If these principles are somehow all different sides of the same underlying principle, we
would like to unify them and find that core principle. Unfortunately, we are not there yet, but
we have a good web of statements that seem to be circling around a few more fundamental
statements. Even if one does not think of Swampland conditions as principles, they are still
very useful as organizing principles for the many examples we see in string theory.
Swa
mp
l an
dc
rite
ria
Landscape
Space of ”healthy” EFTs
The core of the Swampland program is the uniqueness of string theory. In string theory,
as we increase the cutoff, the landscape of theories that seemed to be disconnected, become
connected. It is believed that increasing cut-off high enough would lead to one single theory
with a single connected moduli space. Given that different Calabi–Yau manifolds lead to
different EFTs, this implies that all different Calabi–Yau manifolds must be transformable
to each other using specific geometric transitions. In fact, this statement is a well-motivated
math conjecture often known as Reid’s fantasy. We will talk more about this in the future
when we talk about dualities.
147
Scalar potential
Λ2
Λ1
EF T1
EF T2
Moduli
Figure II.1.2: In the above example, decreasing the cut off of the theory from Λ2 to Λ1 ,
breaks up the moduli space into two disconnected peices. Therefore, each local minimum is
described by a different EFT with cutoff Λ1 . However, by increasing the cutoff back to Λ2 ,
the moduli spaces of the two EFTs connect and we can describe both of them by a single EFT
with cutoff Λ1 . If we keep increasing the cutoff to infinity, we expect for all the low-energy
EFTs to become connected. Thus, all seemingly different EFTs are just different low-energy
corners of the moduli space of a more fundamental theory.
Let us summarize our introduction by going back to the differences between quantum
gravity and quantum field theory and list a few major differences between quantum gravity
from quantum field theory. We will sharpen these differences when we get to a detailed
discussion of the Swampland criteria.
• Locality: Quantum field theory is predicated on an algebra of local operators. A
local structure of observables is a starting point in any quantum field theory. However,
dualities strongly suggest that any local structure is emergent rather than fundamental
in quantum gravity. For example, in the AdS/CFT duality, the local classical theory
on the gravity side could be only trusted in the strong coupling limit on the CFT side.
In other words, a set of observables emerge and a local description in terms of these
observables becomes a good approximate description in a particular limit. Another
example is T-duality where the notion of ”local” in the compact dimension is very
different depending on the frame. A local excitation in one frame is a topological
winding state in the other frame and vice-versa.
• UV/IR decoupling: In Quantum field theory, the IR dynamics is thought to be
impacted by the UV theory in a very limited way via corrections to some terms in the
148
effective action. However, in quantum gravity, an IR calculation can be very sensitive
to UV details. The black hole entropy formula is a perfect example of this UV-IR
connection.
• Symmetries: In effective field theory, symmetries are guiding princinples, however,
this is not the case in quantum gravity. In fact, as we will see, quantum gravity avoids
global symmetries and is pretty picky about its gauge symmetries.
• Naturalness: If we consider a UV theory with O(1) couplings in the appropriate
mass unit of the theory, we can estimate quantitative and qualitative properties of the
low-energy field theory. In that sense, a UV theory sets a natural expectation for the
IR theory. However, what is natural in EFT, can be very unnatural in quantum gravity
and vice-versa. For example, exponentially light states are unnatural in field theory
while natural in quantum gravity and arbitrarily large gauge symmetries are natural in
field theory while unnatural in quantum gravity.
2
2.1
Swampland I: No global symmetry conjecture
No global symmetry: black hole argument
As we discussed in the previous section, black holes are low-energy windows into UV gravitational
physics. Many of the insights that we learn from black hole physics hinge on the fact that the
properties of black holes are universal. This includes their entropy formula or thermal features
of the Hawking radiation which does not seem to depend on the details of the effective field
theory. In the early days of black holes, these universalities raised many questions including
the information paradox. It was also pointed out that since the Hawking radiation seems to
only depend on the near horizon geometry which only depends on gauge charges, angular
momentum and mass, any other label gets lost in the black hole. This implies that if we throw
a conserved charge under a global symmetry that is not protected by a gauge symmetry, it
seems to get lost in the radiation and the conservation gets violated [115]. Hence, there can
be no global symmetries in quantum gravity.
Note that this is a separate issue from the information paradox. One might think that
the resolution of information paradox restores information of the what we throw in the black
hole, including the conserved charge. However, since the spectrum of the outgoing Hawking
radiation is blind to the global symmetry charge, this conservation cannot hold for global
symmetries. Therefore, the global symmetry is violated.
As it is clear from the statement of the no global symmetry conjecture, there must be a
difference between gauge and global symemtries. This raises an important question: What
149
is a physical definition that can separate the two symmetries from one another? Are gauge
symmetries just an artifact of our mathematical redundancy of the theory or do they have a
physical meaning to them?
One can ask similar questions about discrete symmetries. For example, what separates
discrete gauge symmetries from discrete global symmetries?
One way to think about discrete gauge symmetries is in terms of Higgsing a continuous
gauge symmetries. For example if one considers a complex scalar field with a unit U (1)
charge, the gauge symmetry can be Higgsed to Zn . However, can we always think about
discrete gauge symmetries in this way? The answer turns out to be no!
Another way to think about discrete gauge symmetries is in terms of lattice gauge theories
and taking the limit where the lattice spacing goes to zero. Let us see how this works in a
nutshell. For a continuous gauge group, we can define the theory on a principal bundle over
spacetime. The gauge field represents an infinitesimal change along the fibre as we parallel
transport along an infinitesimal line in spacetime. For discrete gauge groups, we quickly run
into a problem because the fibres can be discrete. Suppose each fibre is a a set of points.
Then, the only non-trivial information in the bundle are the holonomies. However, in a simply
connected spacetime like Rn , all loops are contractible and all holonomies must be trivial.
So it becomes unclear what the addition of gauge symmetry has to offer. However, lattice
gauge theory naturally resolves this via summing over discretized spacetimes. A lattice is
made up of holes and therefore the parallel transport can admit non-trivial holonomies as we
move along the cells. By taking the limit where the lattice spacing goes to zero and properly
regulating the physical observables, we can define a discrete gauge theory via lattice.
Although this description is very helpful, it seems more practical than fundamental and
it raises the question that whether there is a more abstract and fundamental definition for
symmetries?
Another natural question is that when we have a gauge symmetry many times it is
accompanied with a conservation from a global symmetry. For example, if one considers
a pure SU (3) gauge theory, the symmetry that acts like a gauge transformation but with
constant g(x) is a symmetry of the theory which does not fo to g = 1 as |x| → ∞. This
symmetry maps different gluons to each other and since different particles are mapped to
each other we should not think of such large gauge transformation as a gauge symmetry,
but a global symmetry. In fact this is the global symmetry responsible for conservation of
charges in gauge theories. But how is this different from a normal global symmetry? To give
a preview of the answer, it turns out such a global symmetry can only exist in non-compact
spaces. Moreover, when viewed as a symmetry on the operators rather than Hilbert space,
it cannot be defined locally. It only acts on the boundary of the non-compact space. We
will proceed with trying to come up with a definition of symmetries (gauge and global) that
150
pushes aside all the non-physical formulation-dependent aspects of symmetries and focuses
on the physical properties of the symmetries.
2.2
What is a global symmetry?
Let us start with continuous global symmetries as a case study. In that case, for every
generator τ a in the Lie algebra g of the global symmetry group, there is a conserved Noether’s
P
current jµa . To be more precise, a j a ca is the Noether’s conserved current corresponding to
P
the symmetry generated by exponentiating a ca τ a ∈ g. We can think of J = jµa dxµ as a
P
g∗ -valued one-form. This means, corresponding to every element g = a ca τa of g we assign
a one-form J(g) = jµa ca dxµ which is conserved (∂µ j µ = 0)
d ⋆ J(g) = 0,
(II.2.1)
at points with no charge present. Now using the Stokes’ theorem, we can rewrite the above
constraint as
ˆ
ˆ
d ⋆ J(g) = 0,
(II.2.2)
⋆J(g) =
Σd−1
M
where Σd−1 is a compact d − 1 dimensional orientable manifold and M is a d-dimensional
region such that its boundary is Σd−1 . Note that this is only true if the equation (II.2.1)
holds everywhere in M which means there is no charge inside Σd−1 . If we take Σd−1 to be
´
the non-compact hypersurface of constant time t = t0 , the integral Σd−1 ⋆J becomes
Q(g) =
ˆ
dxd−1 j0a ca ,
(II.2.3)
which is the Noether’s conserved charge. We can generalize the above definition to any
boundary-less hypersurface Σ (asymptotic or compact).
ˆ
⋆J(g),
(II.2.4)
Qg (Σ) =
Σ
Suppose exp(g) is an element of the symmetry group G, the conservation of the charge
operator (II.2.4) implies that the following operator is topological
Uexp(g) (Σ) = exp(Qg (Σ)).
(II.2.5)
What we mean by topological is that if we insert Ug (Σ) in the path integral, and variate the
hypersurface Σ, the result will not change until Σ hits a charged operator. We will see an
example of this in a moment.
151
The Noether’s conserved charge Q(g) is said to generate the action of the symmetry group
G. What that means is that it determines how the group G acts on the local operators and
Hilbert space. For example, consider a charged state in the Hilbert space which is prepared
by the insertion a charged local operator φi (x) in the path integral at some time before t < t0 .
By φi being charged we mean it transforms under some representation ρ under G. Now we
can think of insertion of Ug∈G (Σ) as an operator that acts on φi as
ˆ
ˆ
iS
DΦe Ug (Σ)φi (x) = DΦeiS ρ(g)ji φj (x).
(II.2.6)
This is often illustrated as in Figure II.2.1.
Ug (Σ)
φj (x)
Σ : t = t0 + ǫ
=
ρ(g)ji ×
φi (x, t0 )
Σ : t = t0 − ǫ
Figure II.2.1: moving a local charged operator past Ug (Σ) will replace that operator with
the action of the group on that operator. If there are no more operator insertions at earlier
times, we can remove Ug (Σ) from the right hand side.
The equation (II.2.6) is true for any boundary-less hypersurface Σ enclosing a local
charged operator φi (x).
Ug (Σ)
=
ρ(g)ji φj (x)
φi (x)
Figure II.2.2
This formulation of global symmetry encodes the information of a global symmetry
152
in topological operators Ug (Σ) associated with elements g in the symmetry group G that
defined on d − 1 dimensional boundary-less hypersurfaces Σ that act on 0-dimensional (local)
operators (e.g. φi (x)) that are enclosed by the hypersurface Σ.
We can brush aside all the unnecessary details of the above formulation and define a global
symmetry based on the action of the topological operators on local charged operators. This
allows us to even generalize the notion of global symmetry to higher dimensional topological
operators as follows [116].
The Swampland Program
A p-form global symmetry constitutes of a set of topological operators {Ug (Σ)|g ∈ G}
defined on boundary-less orientable submanifold Σ of dimension d − p − 1. The operator
{Ug (Σ)|g ∈ G} is topological in the sense that it corresponds to an insertion in the path
integral such that the result is independent from variation of Σ unless it hits a defect
operator Φ defined on a p-dimensional submanifold Σ′ that links with Σ. Passing Ug (Σ)
through Φ replaces it with another p-dimensional defect Φg . In this sense, the topological
operator Ug acts on p-dimensional charged defects.
Moreover, a global symmetry G must be equipped with a fusion algebra for the
topological operators Ug such that for every two elements g and g ′ in G, and homotopic
surfaces Σ1 , Σ2 , and Σ3 , there exist an element g” ∈ G such that Ug (Σ1 )Ug′ (Σ2 ) = Ug” (Σ3 )
in the absence of any charged operators between two of the Σi ∈ {1, 2, 3} (linking with
only one or two of them). See Figure II.2.3.
For the above data to be a global symmetry, we impose that there needs to be at
least one non-trivially charged defect.
Ug′ (Σ2 )
Ug” (Σ3 )
Ug (Σ1 )
=
Figure II.2.3
153
Moreover, the fusion algebra is not-necessarily Abelian. However, for p > 0 it is necessarily
Abelian. This is because any two boundary-less submanifolds with co-dimension p > 1 can
be continuously permuted without intersecting each other. See Figure II.2.4 for an example
of this.
Figure II.2.4
This explain the fact that all higher-form (p > 0) symmetries (global or gauged) are
always Abelian in String theory.
2.3
Non-invertible symmetries
Up to now we assumed that a group is behind the symmetry. However, this is not necessary
and we can have a more general fusion algebra for the topological operators. For example,
the topological operators can fuse as
X
Uα Uβ =
Cαβγ Uγ ,
(II.2.7)
γ
with some coefficients Cαβγ that do not satisfy all the properties of group multiplication such
as the existence of inverse elements. Such symmetries are called non-invertible symmetries
[117–121].
Let us investigate an example of non-invertible symmetry which naturally arises in string
theory. When we orbifold (mod out) a worldsheet theory by a discrete group G, we get a
twisted sector for every conjugacy class of G. The first important observation is that the
twisted sectors are labeled by conjugacy classes and not by the group elements.
Consider a twisted string that its initial and final endpoints are related by the action of
g ∈ G. Suppose the endpoints of the string are x and g(x). Take element h ∈ G and act
on the twisted string with h. The endpoints of the new string are h(x) and h · g(x) and are
related to each other by the action of hgh−1 . Since we have to identify the two strings under
orbifolding, we find that the twisted sectors of g and h−1 gh are indistinguishable. This is
why the twisted sectors are labeled by conjugacy classes rather than group elements.
154
∼
x
g(x)
hx
hgh−1 (hx)
Figure II.2.5: Two twisted strings in the same conjugacy class belong to the same twisted
sector.
Now let us see what happens if we fuse two particles (twisted strings) with conjugacy
classes Ci and Cj . The fusion will give us a state in tensor product of the Hilbert spaces HCi
and HCj associated with each conjugacy class. The resulting Hilbert space can be decompoed
into a linear combination of one-particle state as
HCi ⊗ HCj = ⊕k Nijk HC′ k ,
(II.2.8)
where HC′ k is a non-zero subspace of HCK and Nijk counts the multiplicity of the conjugacy
class Ck if we multiply the elements of Ci and Cj by the group product of G. One can think of
each Hilbert space on the right hand side as a scattering channel with two incoming particles
in HCi and HCj .
Ci
⊕k Nik j Ck
Cj
Figure II.2.6: A scattering vertex with two incoming particles in twisted sectors Ci and Cj
and an outgoing particle in a linear combination of 1-particle states in different conjugacy
classes.
Note that the above product is typically not invertible for non-Abelian groups. For
example, if you multiply the conjugacy class of g by the conjugacy class of g −1 , you always
get some conjugacy classes in addition to that of identity, unless g is in the center of the
group Z(G).
∀g 6∈ G : [g] · [g −1 ] 6= [1].
(II.2.9)
So far, we talked about the fusion rules for the charged operators. Now let us see if we
can define any topological operator that can define a symmetry. To define the symmetry
operators from fusion rules of charged operators, we review a general argument for diagonal
rational CFTs that applies to orbifolds. Suppose the fusion algebra takes the following form.
X
[i] × [j] =
Nijk [k].
(II.2.10)
155
Since there is no fundamental ordering for the operators, the matrices (N k )ij must be
symmetric. Moreover, due to the associativity of the fusion algebra, these matrices must
commute. Therefore, we can mutually diagonalize them by considering a new basis [i]′ of
charged operators.
(N k )′i,j = δij λki ,
(II.2.11)
For some real numbers λki . Then we can define a collection of commuting line operators Lk
such that if Lk encircles [i]′ it multiplies it by λki . For example, if we consider a Zn orbifold,
the corresponding line operators will form a Zn group. However, despite the commutativity
of these operators, they do not necessarily form a group. It is easy to see that the fusion rules
of Lk is the same as the fusion rules of (II.2.10). These line operators are called Verlinde
operators (see [117, 122]). A simple example of a non-invertible symmetry arises in the (4, 3)
minimal model which describes the critical 2d Ising model. This theory has three primary
operators {1, σ, ψ} that satisfy the following OPE.
σσ ∼ 1 + ψ
σψ ∼ σ
ψψ ∼ 1.
(II.2.12)
Since the Verlinde operators will satisfy the same fusion ring, Lσ will have no inverse, just
like σ has no inverse.
Any global symmetry on the string worldsheet can be thought of as a symemtry in the
spacetime. We can define the spacetime symmetry operator to act on the string states exactly
as the worldsheet symmetry operator acts on the operators that create those states on the
worldsheet. However, as we will later see, these spacetime symmetries always turn out to be
gauge symmetries.
2.4
What is a gauge symmetry?
In the previous subsection we gave a general definition of global symmetries. Now let us
revisit that discussion for gauge symmetries. We defined global symmetries based on their
action on local physical operators. So to define gauge symmetries, we need to know how the
gauge symmetries acts on local physical operators? But this is almost a trivial question for
gauge symmetries! They must not act on physical operators at all. In other words, local
physical operators must be gauge invariant. So if a symmetry G is gauged, a non-trivial
topological operator like Ug (Σ) does not exist for G. In this language, gauging a global
symmetry G means to start with a set of topological operators Ug and use them to restrict
156
the spectrum of physical operators by throwing out those that are not invariant under the
topological operators.
What we described above is not a satisfactory definition of gauge symmetry since it tells
us what gauge symmetry is not rather than what it is. Now we know that a gauge symmetry
is not a global symmetry, but can we define a gauge symmetry by its physical implications
or is it just a purely mathematical feature of the formulation of a theory?
To answer this question let us take a moment to think about a simple-looking but deep
question: what is the difference between a U (1) pure gauge theory and an R pure gauge
theory? The Lie-algebras of the two groups are the same. So any difference must depend
on the global structure of the group. But what kind of information is sensitive to the global
structure of the groups? The answer is the representations! The representations of U (1) are
quantized while the representation of R are labeled by a continuous parameter q
α → eiq α.
(II.2.13)
Representations of a gauge theory naturally appear with charged operators. But if we
consider a pure gauge theory (no charged operator) how is the information of the allowed
representations is encoded in the theory? In other words, what are some operators that
carry representations of the gauge group and come with a pure gauge theory? The answer
is Wilson loops! For every closed loop γ and a representation ρ of the gauge group there is
a gauge invariant operator Wρ (γ) that can be inserted in the path integral and captures the
information about the curvature of the associate gauge group. Therefore, gauge symmetries
have something more than not being a global symmetry, and that is having Wilson line
operators Wρ (γ) or their higher dimensional generalizations.
Higher-form gauge symmetries
A p-form gauge symmetry comes with defect operators Wρ (Σ) where ρ is some representation
of symmetry G and Σ is a p + 1-dimensional compact orientable manifold. A Wilson
operator can end on a p-dimensional charged operators defined on the boundaries of Σ.
Such operators must carry matching representations of symmetry G with that of the
Wilson operator.
´
Moreover, there are d − p − 1 dimensional topological operators (e.g. ⋆F ) that only
act on Wilson lines with boundary-less domains Σ by conjugation of G.
If Wilson operators can be defined on non-compact boundary-less domains (e.g. Σ
extends to asymptotic boundary) we say the gauge symmetry is long-range [123].
It was noted in [121] that in the absence of any charged operators, p-form gauge symmetry
G always comes with a p+1 form global symmetry Z(G) (center of G) generated by topological
157
operators
2.5
´
⋆F .
Non-compact spaces and boundary symmetries
In pure gauge theories, there are no local physical charged operators. Therefore, generally
there are no charged states either since we can think of charged states as a charged operator
acting on the vacuum. However, this argument has a loop hole for non-compact spaces. In
non-compact spaces it is possible to have a gauge-invariant charged operator, but the catch
is the charged operator does not have a compact support. For example, consider the SU (3)
pure gauge theory and the total number of a particular type of gluon of a given momentum.
This operator is definitely a charged operator because it changes under the SU (3) (gluons
get mapped to each other). However, it clearly does not have a compact support. We can
understand this in the language of Wilson loops too. In non-compact spaces, you can have
Wilson lines that extend all the way to infinity.
i
Asymptotic boundary
W[ρ] (γ)i
j
j
Figure II.2.7: Wilson lines can end on asymptotic boundary even in pure gauge theories.
Insertion of such Wilson operators creates net gauge charge in non-compact space.
These are gauge-invariant operators in the sense that any operator Ug (Σ) with a compact
support does not change them. However, they are charged in the sense that when acted
on the Hilbert space, they change the total gauge charge. Such Wilson operator creates
a charged particle in space and cancelling charged particle at infinity, effectively, creating
charge in the universe.
If we take Σ to a hypersurface that extends to infinity and cuts through the Wilson line,
Ug (Σ) acts on the Wilson line by changing the labels of the end points according to the
corresponding representation ρ.
158
i
Ug (Σ)
W[ρ] (γ)i
i′
W[ρ] (γ)i
j
=
′
j
ρi i ′ ·
j
j
Figure II.2.8: Gauge symmetry can induce a boundary global symmetry by its action on the
asymptotic endpoints of the Wilson lines that end on the asymptotic boundary.
However, it is important to note that not always such operators exist. Sometimes creating
a charge is infinitely costly. This happens when the gauge symmetry is confined. This is why
we made a distinction in for gauge symmetries with asymptotic Wilson lines by calling them
long-range gauge symmetries [123]. We summarize this section by highlighting the following
two important results for compact and non-compact spaces.
Net gauge charge in compact spaces
There is no net gauge charge in compact spaces.
The last statement is often stated as in compact spaces, field lines have no where to
escape, therefore the charges must cancel out.
Boundary symmetries in non-compact spaces
Long-range gauge symmetries induce global symmetries on the asymptotic boundary.
2.6
No global symmetry: holographic argument
Now that we have clear definitions for gauge and global symmetries, we can review a
holographic argument by Harlow and Ooguri that shows why there can be no global symmetries
in quantum gravities that admit holographic description [123]. Suppose we have a global
symmetry in the bulk. Consider a local operator φ(x) that is charged under the symmetry.
Since the global symmetry acts on any local operator, it acts on the boundary operators as
well. Therefore, the boundary CFT also has the same global symmetry. Suppose the action
159
of the global symmetry is given by Ug . Take a fine partition of the boundary into small
regions {Ri }. Then we can write
Ug = (Πi Ui (Ri )) ◦ Uedge .
(II.2.14)
The effect of Ug in the bulk is limited to the union of the entanglement wedges of regions Ri s
(see Figure II.2.9). For fine enough partition {Ri }, the entanglement does not contain x and
therefore the global symmetry does not act on φ(x) which contradicts out assumption.
Note that gauge symmetries avoid this argument because a charge operator cannot exist
on its own. It has to be connected to a Wilson line that extends to the boundary. Therefore,
the charge operator always has a point on the boundary which ensures that it is always acted
on in the entanglement wedge of {Ri }.
R1
R2
R3
φ(x)
.
.
.
Figure II.2.9: The action of the boundary global symmetry is restricted to the union of the
entanglement wedges shown by the grey area. Therefore, the supposedly charged operator
φ(x) is not acted upon by the global symmetry.
2.7
Symmetries in string theory
It is typically easier to verify that continuous symmetries in string theory are gauged. The
more non-trivial examples usually involve discrete symmetries. In the following, we will
review a few important examples of potential candidates for global symmetries in string
theory.
Let us start with the 11d supergravity (M theory). If there is a 0-forms symmetry, it is
likely that the 0-form symmetry will act non-trivially on a scalar in the theory. However,
M-theory has no scalar fields to be acted upon by 0-form global symmetries. There are no
160
apparent 0-form global symmetries in IIA either. The only scalar field is dilaton which does
not have any symmetry.
Now let us look at the IIB theory. IIB theory has a a complex scalar τ = τ1 + iτ2
which is a combination of the R–R and NS–NS scalars τ1 = C0 and φ̃ ∼ − ln(λ) = ln(τ2 )
respectively. The kinetic term of this scalar is proportional to ∂µ τ ∂µ τ̄ /τ22 . You might think:
Aha! There is a global symmetry τ1 → τ1 + ǫ. However, type IIB theory has BPS bound
states of p fundamental strings and q D1-strings that are charged under both B and B̃ and
have tensions [124, 125]
T =
p
(qτ2 )2 + (p + qτ1 )2 ,
(II.2.15)
in string frame. The above tension formula follows from the BPS formula. Since the discrete
spectrum of allowed tensions is not invariant under a continuous shift of τ1 , shift symmetry
is not an actual symmetry. But what about shifting τ1 by an integer and simultaneously
permuting the BPS strings by a representation of Z? Is this a global symmetry?
It turns out the answer is still no, but for a more non-trivial reason. Consider a background
¸
with a D7-brane. The D7 brane is magnetically charged under C0 = τ1 , therefore, dτ1
arouund the D7-brane must be one. That means, τ picks up a monodromy and goes to τ + 1.
Thus, the existence of D7 brane forces us to identify τ and τ + 1 rendering τ → τ + 1 to
be a gauge symmetry. All in all, thanks to of D1-branes and D7-branes, there is no global
symmetry in IIB theory.
Heterotic SO(32) and the type I theories also do not have any apparent 0-form global
symmetries. However, the situation is slightly different for the E8 ×E8 Heterotic theory. This
theory has a discrete Z2 symmetry that swaps the two e8 Lie algebras and their corresponding
representations. This Z2 seems to be a global symmetry, however, upon a closer look we can
see that it is a gauge symmetry. To see why, remember the Hořava–Witten construction
for the strong coupling limit of the Heterotic theory [76, 79]. In that construction, the Z2
symmetry at question is simply the symmetry that swaps the two endpoints of the interval
and all diffeomorphisms including this Z2 are gauged.
Generally, whenever we compactify a theory on a symmetric manifold, the isometries of
the manifold become gauge symmetries in the lower dimensional theory.
Let us finish this section with a more non-trivial example of a potential candidate for
global symmetry in string theory.
Consider compactifying IIB on K3. If there is no way of continuously deforming this
background to IIB on T 4 , we have a global symmetry! To see why, consider the following
two backgorunds: IIB on K3 × R6 and IIB on R4 × R6 . Now suppose we remove a small
disk D4 from the K3 and R4 and connect the two backgrounds on the S 3 boundary of the
161
removed D4 . This operation is called a surgery. You can see the resulting background in the
following picture.
K3
R4
S 3 = ∂D4
Figure II.2.10: A certain compactification can be viewed as a defect in a higher dimensional
theory after connecting the compact geometry to a non-compact space via surgery.
This can be viewed as a defect with 6d worldvolume. And if the number of such defects
is conserved, there is a global symmetry with these objects as its charged objects [126].
To go back to the black hole argument, we can compactify 10d IIB on T 5 such that these
defects become 0 + 1 dimensional and now the black hole argument tells us that their number
should not be conserved. So either such compactification of IIB on K3 is inconsistent (which
they are) or there is a way to continuously transform such compactifications to IIB on other
manifolds such as T 4 . We will come back to this example later.
2.8
Cobordism conjecture
As we discussed, the no global symmetry condition means there are no topological operators
that can label different backgrounds according to their global charges. Put more generally,
in quantum gravity we should not be able to tag different backgrounds. When we think of
global symmetries as lables that distinguish different backgrounds, dualities become natural
transitions between different backgrounds that ensure there are no such lables. In a sense,
dualities are realizing the no global symmetry condition by telling you that there are no exact
superselection sectors in quantum gravity. There is a formulation of this statement which
allows us to sharpen this intuition.
Suppose you compactify a theory on two different compact manifolds M and N . Then we
want the two compactifications to be transportable with a finite action process (dynamically
allowed). In other words, the two backgrounds M × Rd−k and N × Rd−k do not tag different
theories of quantum gravity. The transition between these two theories maifests itself as a
finite tension domain wall in the d − k dimensional theory.
162
M
XM,N
N
Rd−k
Domain wall
Figure II.2.11: If the compact manifolds M and N are cobordant, they can be realized
as boundaries of a higher dimensional manifold XM,N . We can use XM,N to continiously
transition between backgrounds with compact manifolds M and N as we move along a certain
non-compact dimension. As shown in the figure above, in the non-compact dimensions, this
configuration looks like a domain wall between two backgrounds with different comapact
manifolds M and N .
This statement has a close and natural connection to a mathematical concept called
cobordism. Two k-dimensional orientable manifolds M1 and M2 are called cobordant if we
can connect them via a k + 1 dimensional manifold that has M1 and M2 as its boundaries.
We can enrich this definition by imposing extra structure on M1 , M2 , and Σ (e.g. spin
structure). A class of manifolds that are cobordant are said to belong to the same cobordism
class. Cobordism classes come with a natural Abelian group structure + ,
[M1 ] + [M2 ] = [M1 ⊔ M2 ],
(II.2.16)
where ⊔ is the disjoint union. The identity element is the cobordism class of the empty
manifold. This abelian group that captures non-equivalent calsses of k dimensional manifolds
carrying a structure G is shown by ΩG
k.
According to our definition above, there is a natural appearance of the notion of cobordism
in quantum gravity, with the exception that rules of transition are determined by physics
(existence of a finite tension domain wall). We denote this cobordism group by ΩQG
k . The
QG
cobordism conjecture states that Ωk is trivial.
Therefore, whenever we have a non-trivial mathematical cobordism group ΩG
k , it means
that the mathematical constrains do not accurately capture the physical rules. Any nontrivial class is either:
163
• not allowed in quantum gravity (symmetry is gauged)
• can be transformed to the trivial class with a process that was not included in the
mathematical evaluation of the cobordism group (symmetry is broken)
Note that every non-trivial cobordism class Ωk defines a topological d − k − 1 defect
obtained from gluing M k to Rk on the boundary of a small disk Dk . To see the global
symmetry consider the group of homomorphisms from Ωk to U (1). Every element of this
group acts on the d − k − 1 defects. Therefore, non-trivial cobordism groups leads to nontrivial higher-form global symmetries.
The cobordism conjecture (which is in essence the no global symmetry conjecture) is the
ultimate duality. For example, it implies that there is a domain wall even between 10d string
theories without any dimensional reduction. In particular, any QG should be realizable in a
bubble inside any other QG.
Trivializing a cobordism class is equivalent to an end of the universe wall in the lower
dimensional theory. In the language of condensed matter physics, this means you can always
gap out any QG system by appropriate boundary conditions without any symmetry structure
preventing it. Every QG admits a domain wall!
There is a familiar challenge to have an end of the universe wall in chiral theories where
we have no parity symmetry to gap out. For example take the IIB theory. What boundary
condition do we put on gravitino which has a single chirality? Since the anomaly cancellation
relates gravitino to the four form gauge field, their boundary conditions are likely mixed. But
we do not know what that would exactly look like. There is a similar story with other string
theories. But if such walls exist, how come we have not discovered them yet?
Exercise 1: Prove that the boundary of the IIB theory in 10d (if there is one) breaks
supersymmetry completely. This explains why it is difficult to construct.
Now let us look at an example where such end of the universe walls can preserve supersymmetry.
For example in 11d where there is no chirality there is a well-known construction: the
Hořava–Witten wall.
Let us consider the cobordism classes of IIB. We do not know all the rules of quantum
gravitiy for IIB in terms of what backgrounds are allowed. But IIB does require a spin
structure and we can look at a subclass of rules. The cobordism classes of compact manifolds
with a spin-structure is as follows.
164
k
0
1
2
3
4
5
6
7
ΩSpin
k
Z
Z2
Z2
0
Z
0
0
0
Gens pt+ Sp1 Sp1 × Sp1 − K3 − − −
The k = 0 case is trivialized by an end of the universe wall which in non-supersymmetric
and unknown. But what about k = 1 generated by circle? We already know how to trivialize
it! We mod out by Z2 both the comapct circle and the base R. This is just the orientifold
plane. This construction is often called the pillowcase construction.
×
R8
Figure II.2.12: The ’pillowcase’ geometry as a IIB orientifold. The Z2 acts on R9 ×S 1 . It acts
as a reflection on the S 2 and one of the Rs. It also changes the orientation of the worldsheet.
Each one of the corners is an O7 plane. If we glue two of these geometries and add the correct
number of D7 branes, we get type IIB on T 2 /Z2 which looks like a full pillowcase.
The k = 2 cobordism classes generated by torus can be trivialized in a similar way. But
what about the k = 4 classes generated by K3? This is related to our discussion in the
previous section about gluing a K3 to R4 to get a conserved charge.
Exercise 2: Show that any end of the universe wall that could trivialize the cobordism class
of ΩSpin
(generated by K3) in IIB must be non-supersymmetric. (Hint: Show that the end
4
of the universe wall in IIB on K3 is not supersymmetric.)
Now let us think about the Heterotic case. There is more structure needed for it than
just spin. Consider the class where F ∧ F = 0. This equation requires the manifold to have
1
R ∧ R = 0 due to the equation of motion dH = 16π
2 [tr(R ∧ R) − tr(F ∧ F )]. In other words,
1
the first Pontryagin class 2 P1 (R) must be trivial. Such manifolds are called string manifolds.
String cobordism classes are shown in the following table.
165
k
0
1
2
3
4
5
6
7
ΩString
k
Z
Z2
Z2
Z24
0
0
Z2
0
Gens
3
3
3
pt+ Sp1 Sp1 × Sp1 SH
− − SH
× SH
−
We can trivialize the k = 3 case with NS5 branes. The generator is S 3 with unit H flux.
One can think of this S 3 as the 3-sphere that surrounds the NS5 brane. If we look at the
transverse dimensions to the NS5 brane, we can write it as R+ × S 3 , where the half-line is
parametrized by the distance from the NS5 brane. As the distance decreases towards 0, the
size of the 3-sphere carrying the H flux shrinks until it reaches a singular point, where the
NS5 brane is sitting.
S3
NS5
0
r
Figure II.2.13: A schematic representation of the four transverse dimensions of NS5, as a
shrinking S 3 that carries the H-flux.
We can use a similar argument to see that two intersecting NS5 branes trivialize the 6d
3
3
cobordism group which is generated by SH
× SH
. Consider two NS5 branes that have a 2d
8
intersection. We can parameterize the R transverse to that intersection as R2 ×R2+ ×(S 3 ×S 3 )
where each R+ denotes the distance from one of the NS5 branes. These distances also control
the sizes of the 3-spheres that surround the fivebranes. The boundary of R2 × R2+ which is
3
3
.
× SH
made up of the fivebranes, is a domain wall for the theory on SH
166
3
3
SH
(r1 ) × SH
(r2 )
r1
=D
i
n
st a
ce
f
r om
NS
51
R2
Dimensions
parallel
to both
fivebranes
×
r2 =Distance from NS52
Figure II.2.14: We view the 10d background with two intersecting fivebranes, as R2 × R2+ ×
(S 3 ×S 3 ). The R2 represents the two dimensions parallel to the intersection of the fivebranes,
3
while the rest is two copies of R+ × SH
= R4∗ , each of which is the four transverse dimensions
3
3
to a fivebrane. Therefore, we can think of this background as the 10 theory on SH
× SH
where the radii of the 3-spheres depend on the remaining four coordinates. The remaining
four non-compact dimensions form a R2 × R2+ which ends on a co-dimension one brane made
up of fivebranes.
We showed how to trivialize the cobodism classes for k = 3 and 6. But what about
k = 0, 1, and, 2? It is not difficult to see that the remaining cobordism require non-BPS
defects to be trivialized [126].
Now let us study a different type of cobordism classes. M-theory has parity so we can
compactify it on non-orientible manifolds which should carry a Pin structure. There are two
possible Pin structures but the formulation of M-theory is only consistent on Pin+ manifolds.
k
ΩPin
k
+
Gens
0
1
2
3
4
5
6
7
8
Z2
0
Z2
Z2
Z16
0
0
0
Z2 × Z32
pt − KB KB × Sp1 RP4 − − − HP2 , RP8
As we mentioned above, the first class can be killed by the Hořava–Witten wall. Generators
for the rest non-zero ones are respectively, KB , KB × S 1 , and RP4 . The first two cobordism
classes cannot be killed supersymmetrically, however, the last one is possible to trivialze in
a supersymmetric fashion and arises in the familiar compactification of M-theory on T5 /Z2
[127]. The Z2 acts on all coordinates of the torus by reflection, and it leaves 32 fixed points.
The geometry around each fixed point is R5 /Z2 which can be viewed as RP4 × R+ . The
defect sitting at the fixed point is the MO5 M-orientifold which is an end of the universe
wall to M-theory on RP4 and therefore, trivializes its cobordism class. Similarly, the MO1
167
[128] trivializes the cobordism class of R8 given that its transverse geometry is given by
R9∗ /Z2 = RP9∗ × R+ . As for the cobordism class generated by HP2 , we can show that it is
gauged. In other words, Compactification of M-theory on HP2 is not allowed. This follows
from the tadpole cancelation condition [128]. If we compactify M-theory on a compact 8dimensional manifold X, we have
1
NM 2 + G4 (X)2 = I8 (X),
2
where NM 2 is the number of spacefilling M2 branes and
ˆ
p2 (R) − (p1 (R)/2)2
.
I8 (X) =
48
(II.2.17)
(II.2.18)
Therefore, if we do not turn on any gauge fields (G4 = 0), or insert spacefilling M2 branes
(NM 2 = 0), the only allowed compactifications are those satisfying I8 (X) = 0. However,
I8 (HP2 ) = 1/8 6= 0, and therefore, is not an allowed compactification. Similar to the previous
cases, the remaining cobordism classes can be shown to require a non-BPS defect to be
trivialized.
Let us point out that since we do not have a complete formulation of quantum gravity,
we cannot exactly calculate the cobordism groups. This is why we resort to the approach
that we calculate the cobordism classes with approximate rules and show that in the exact
theory they indeed vanish.
The cobordism conjecture could lead to very powerful statements. For example, in
[129], the cobordism conjecture was used to argue that comapctifications of certain higher
dimensional supersymemtric theories on T 3 /Z2 must be allowed. By checking the anomaly
cancellation in the compactified theories, one can restrict the rank of the gauge group in the
original theories. In some cases these restrictions are so strong that the allowed ranks match
with the existing examples in string theory.
2.9
Baby universe hypothesis
In section, 2.5 we saw that in the presence of gauge symmetries, there is a major difference
between compact and non-compact spaces. Compact spaces cannot have states with net
gauge charge while non-compact Hilbert spaces can. We saw that this is because non-compact
spaces come with operators that extend to the boundary and can create net charge. In fact,
this enlargement of the Hilbert space is very natural from holography’s point of view. If
the degrees of freedom live on the boundary, then compact spaces must have none. In other
words, taking holography at its face value suggests the following hypothesis [130].
168
Baby universe hypothesis
The Hilbert space of quantum gravity on a compact boundary-less spaces with more than
three dimensions is trivial.
Note that boundary-less is a crucial condition for the baby universe hypothesis. For
example if you remove a disc from your compact manifolds, you would expect to have degrees
of freedom living on the boundary of the removed disc.
Note that there are counterexamples to this hypothesis in 2d quantum gravities. For
example a 2d quantum gravity could be described by an ensemble of many degrees of freedom
given by the SYK model. Two-dimensional quantum gravities are special and often avoid
Swampland conditions. For example, they can have global symmetries as well. For example,
the worldsheet theory of Heterotic strings admits the spacetime gauge symmetry as a global
symmetry global symmetry.
Some of the differences between two and higher dimensions might be related to the fact
that many of the Swampland conjectures are motivated by black holes which typically exist
in dimensions greater than two.
â
Baby
universe
â†
Figure II.2.15
The reason the hypothesis is called the baby universe hypothesis is due to the fact that
emission and absorption of compact universes is called a baby universe. Coleman studied
these processes [131] and showed that one can associate a creation and annihilation pair of
operators {ai , a†i } to any degree of freedom i of a baby universe. Moreover, they modify the
P
effective action as L = L0 (Φ, ...) + i (ai + a†i )L1 (Φ, ...). A coherent state |αi defined as
ai |αi = αi |αi corresponds to summing over processes involving emission/absorption of baby
universes with specific weights. Every |αi corresponds to a different vacuum of the theory.
169
This is very similar to the notion of θ-vacua in gauge theories. Note that α is a parameter and
not a field. The vacua labelled by different α belong to different superselection sectors which
violates the cobordism conjecture. Therefore, if one believes in no-cobordism conjecture (i.e.
no-global symmetry), the baby universe Hilbert space must be just one-dimensional.43
3
Swampland II: Completeness of spectrum
In the previous section, we talked about global symmetries in quantum gravity and we
argued that all the global symmetries must be gauged. We also saw that pure gauge theories
make sense and have physical implication in terms of the spectrum of physical operators.
We defined the gauge theories by the inclusion of Wilson loops or Wilson lines that have
asymptotic endpoints. However, if one adds charged particles, we can make new gauge
invariant operators by including those charged operators at the endpotins of the Wilson
lines. Such an operator can be thought of as an operator that creates/annihilates charged
particles at the endpoints of the Wilson line.
Asymptotic boundary
i
j
j
W[ρ] (γ)i
W[ρ] (γ)i
j
j
φi
W[ρ′ ] (γ ′ )ij
φi
(a)
φ
j
(b)
Figure II.3.1: (a) Gauge invariant Wilson operators that exist in any theory with long-range
gauge symmetry. (b) Additional gauge invariant Wilson operators that only exist in gauge
theories with local charged operators.
43
For a different perspective on the baby universe hypothesis see [132].
170
3.1
Completeness hypothesis
As we will see, in quantum gravity, whenever there is a symmetry on the boundary, not
only the symmetry is a gauge symmetry in the bulk, but also all the possible representations
of that gauge group always appear in the spectrum of the theory (see [115, 133] for earlier
arguments). This is called the completeness of spectrum conjecture. In principle, the question
of what charged operators are allowed is in principle independent from that of whether global
symmetries exist or not. However, it turns out that this conjecture is closely connected with
the no-global symmetry conjecture.
Let us emphasize two important points about what completeness of spectrum does and
does not mean.
• The charged states are not necessarily low-energy states. In other words, the states
that carry a certain representation of the gauge group might be very massive and not
part of the low-energy EFT.
• The charged states are not necessarily stable one-particle states. The charged states
could be multi-particle states or meta-stable bound states.
3.2
Evidence in string theory
Now that we know the statement of the conjecture, let us start examining this conjecture
with the simplest known examples of gauge theories in string theory; the higher form gauge
symmetries that typically come from the Ramond-Ramond sectors.
In type II theories, the charged particles corresponding to such gauge symmetries are
the D-branes which indeed are required to be in string theory by dualities. Moreover, for
M-theory, the existence of the electric and magnetic gauge symmetries associated with the
3-form Cµνρ forces us to include charged objects that are 2+1 and 5+1 dimensional. There
is indeed extensive evidence for such objects to exist in the non-perturbative describtion of
M-theory. These extended objects are the M2 and M5 branes.
Now let us move to the Heterotic string theories. Heterotic theories have conventional
0-form gauge symmetries. Moreover, the string sees the gauge symmetry. Let us make this
terminology more precise.
Typically, the gauge symmetry in spacetime is realized as a global symmetry on the
worldsheet theory. If the action of this global symmetry on the spectrum of the worldsheet
theory (string excitations) is non-trivial, we say the fundamental string sees the the gauge
symmetry. This implies that there are string states that as spacetime particles are charged
under the gauge symmetry. Therefore, non-trivial representations of G as a global symmetry
171
on the wolrdsheet theory usually corresponds to non-trivial representations of charged spacetime
states under the spacetime gauge group G.
In the case of the Heterotic theory, the symmetry of the Heterotic theory is realized as
an Affine Kac-Moody algebra on the worldsheet. This immediately implies that there are
operators on the worldsheet that transform in the adjoint of the gauge group. These are just
the spacetime gauge bosons. But how about the other representations? To check whether all
the representations appear or not, first we need to know the spectrum of all representations.
The set of all representations depends on the global structure of the group. In the E8 × E8
theory, the gauge group is E8 × E8 and the adjoint representation and its successive tensor
products gives all. However, in the SO(32) theory the gauge group is spin(32)/Z2 . Therefore,
the the full set of the representations of the gauge group consists of half of representations
of spin(32).
In these cases, the level of the Kac-Moody algebra is 1. The fundamental representations
that generate all possible representations are given by the corresponding Dynkin diagrams.
Extended
node
Ẽ8
Figure II.3.2: Affine Dynkin diagram of the central extension of e8 .
The Dynkin diagrams of Kac-Moody algebras have an extra node compared to the
ordinary Lie algebra due to the central extension. The fundamental representations are
labled by their highest weights which satisfy
X
ni di ≤ 1,
(II.3.1)
where ni is the coefficient of the i-th fundamental weight in the expansion of the highest
weight and di s are the Dynkin labels. Moreover, since the contributing labels for fundamental
representations are di = 1, they are allowed for all levels k.
In the case of the Heterotic SO(32) thanks to modular invariance, all such fundamental
representations are already included. This is because modular transformations map different
fundamental representations to each other and upon including one of them, we are forced to
include all of them.
So far our discussion was focused on examples of the completeness of spectrum for
continuous gauge groups. Suppose you have a discrete group G, do all representations appear?
172
3.3
Completeness of spectrum for discrete symmetries
Let us start our discussion with a special case where the discrete group is visible to the
fundamental string. As we explained above, this means that there are charged strong
excitations or in other words, the discrete gauge group realizes as a non-trivial global symmetry
on the worldsheet theory. If so, we argue that the string spectrum must include all of the
representations. Again, the key is the modular invariance!
Suppose you take a torus diagram and you put the a twisted boundary condition in the
time direction.
Im(τ ) = β
g
e
Figure II.3.3: The twisted sector corresponding to twisted boundary condition in the
Euclidean time direction.
Note that we are not orbifolding the theory, but just computing a certain partition
function,
T rŨg e−βH
.
T re−βH
(II.3.2)
where Ũg enforces the twisted boundary condition. We claim that the above twisted partition
function, in the limβ→0 vanishes unless g is the identity element. Let us give two different
arguments.
Argument 1: In the β → 0 limit, the worldsheet fields must have a very strong gradient in
the Euclidean time direction so to ensure the initial and final state differ by the action of g.
Therefore, the kinetic part of the worldsheet scalars in the action contribute as
e−
´
dσ
´β
0
|∇φ|2 dτ +...
=e
−O(
β
)
β2
→0
(II.3.3)
Argument 2: Modular transformation τ → 1/τ maps the partition function II.3.2 to the
amplitude over on the g-twisted ground state sector.
173
β
e
e
1
β
g
τ → 1/τ
g
Figure II.3.4: Modular transformation maps two twsited partition functions to one another.
Suppose the ground state of the twisted sector has energy Eg 6= 0, which is always the
case for non-degenerate CFT’s where the vacuum (E = 0) is unique. We find
1
Zg ∼ e−Eg β
(II.3.4)
Both arguments show that at high energies (small β), the states furnish a very special
representation such that the partition function II.3.2 vanishes for all elements other than the
identity element. For this to be true, the high energy representation of G must be very special.
It turns out there is exactly one such representation and that is the regular representation
which permutes the group elements by left or right group action which leads to isomorphic
representations. The regular representation of a finite group G, has a dimension |G| and it
acts on the basis vectors {eg }g∈G as ρ(s)eg = es◦g . It is easy to see that the character of the
regular representation χρ (s) = tr[ρ(s)] vanishes unless s is the identity. This is exactly what
we observed must happen to the states of theory at high energies.
The regular representation is also special in a second way, in that it decomposes into all
irreducible representations. Every irreducible representation Rα appears with multiplicity
dim(R) in the decomposition of the regular representation.
ρ ≡ ⊕α dim(Rα ) · Rα .
(II.3.5)
So we know that all of them appear and we also now how often they appear. Note that
since the argument 1 applies to all quantum field theories, above statement is more general
than 2d CFTs. For any QFT, if one representation of a discrete global symmetry appears,
all of them must appear. In fact, this argument works for continuous groups as well! For
174
example, take the spin group Spin(3) which is the universal cover of SO(3). Did we have
to have fermionic representations as well as bosonic representations? And if they do, how
frequent should they be compared to the bosonic representations?
Suppose the Spin group acts faithfully on the Hilbert space (i.e. there is at least one
frmionic representation), then we can use the above argument to deduce that the following
partition function must vanish at high temperatures.
T r(−1)F e−βH
,
T re−βg
(II.3.6)
where F is the number of spacetime fermions. If we rewrite this in terms of the ratio of
bosonic and fermionic degrees of freedom we find
∼(
1
nB − nf
) ∼ e− β .
nB + nf
(II.3.7)
Thus, even though high energy supersymmetry is not universal, there is some sense of
equality of fermionic and bosonic degrees of freedom at high energies that is universal.
3.4
Completeness of spectrum: arguments
So far we gave evidence for the completeness of spectrum in string theory, now we will give
a general explanation for it. Let us start with a continuous U (1). If there are no charges,
we have dF = dF̃ = 0. However, existence of the electric of magnetic sources will break this
vanishings. Cobordism conjecture tells you that these equations give you global symmetry.
So the existence of electric and magnetic states make sure that there are no global symmetries
(in this case 1-form symmetries) [129]. This argument is in fact correct for higher form gauge
symmetries as well.
However, this argument just tells us that we must have charged state, but those charges
do not have to be minimal charges. What if we only have certain multiples of the fundamental
charge? Then the spectrum would be incomplete. Let us see why this cannot happen.
Suppose the only charges that appear are multiples of ke where k > 1 is a natural
number and e is the fundamental charge. We show that such a theory has a Zk 1-form global
´
symmetry. The corresponding topological operator is Σ F̃ mod k. This operator measures
the charges of the Wilson lines that link with Σ mod k which is conserved.
175
Wq
ei
´
⋆F
Wq
Wq
=
eiq · Wq
ei
´
⋆F
= eiq ·
Figure II.3.5: In the absence of charged operators,
there is a 1-form global symemtry
´
associated with the topological operator exp(i ⋆F ).
Let us explain it slightly differently which is a useful logic. Consider a Wilson line. Define
an operator that when linked with the Wilson line that carries charge q, it multiples it by
q
e2πi k . Wilson loops with any charge q can appear but only then ones with q = nk can
end. However, changing q by a multiple of k does not change this k-th root of unity. Thus,
we find a Zk global 1-form symmetry. This shows us that no-global symmetry implies the
completeness of spectrum at least for the case of U (1).
But how about continuous non-Abelian cases? For non-Abelian symmetries there is an
even easier way of doing it. Multiples of every allowed weight in the weight lattice correspond
to the spectrum of representations of a U (1) subgroup that is generated by an element in the
Cartan subalgebra. From the Abelian argument we know that all of them must be occupied.
Therefore, all the weights in the weight lattice of any non-Abelian group must be occupied.
Representations
of a U(1)
Figure II.3.6: Any weight and its multiples can be viewed as the spectrum of representations
of a U (1) subgroup generated by an element of Cartan.
176
There is a second argument for completeness of spectrum using black holes. We will first
argue the completeness for Abelian groups. Then, one can use the trick in the last paragraph
to generalize the result to the non-Abelian groups. Consider a Reissner-Nordstrom black
hole with M ≥ Q in Planck units. If we take M and Q to be very large in Planck units,
the curvature outside the black hole would be small. Thus, the Bekenstein-Hawking formula
which is a semiclassical calculation is trustable. The formula tells us that there is a large
number of microstates with a given charge and mass. But how do we know that small charges
exist? If we consider two black holes with large charges Q and −Q + e the net charge of the
state would be a fundamental charge of the theory.
Now let us consider discrete gauge symmetries. What goes wrong if the spectrum of a
discrete gauge group is not complete? We will use an argument similar to the continuous case
to show that lack of completeness implies the existence of a 1-form global symmetry. The
idea is that we can define a non-invertible topological operator for every conjugacy class [g]
χR ([g])
that acts on a Wilson line carrying a representation R by multiplying it by size([g]) · dim(R)
,
where χR is the character of R [120, 121]. The number of such 1-form symmetries is equal
to the number of conjugacy classes of G. However, for finite groups, this number is the same
as the number of irreducible representations R. This matching is not a accidental and it can
be shown that charged particles in every irreducible representation is needed to break all of
these 1-form symmetries [120].
4
Swampland III: Weak gravity conjecture
Now we move on to a another Swampland conjecture which is closely related to the completeness
of spectrum conjecture. The completeness of spectrum conjecture deals with the existence of
charged states in a theory of quantum gravity, however, it falls short from giving any estimate
of their masses. In fact, the charged states could be so heavy that they are not included in
the low-energy theory. Therefore, to quantify the completeness of spectrum it would be
nice to have an upper bound on the mass of the supposed charged particles. The Weak
Gravity Conjecture (or WGC for short) tries to answer this question [134]. This Swampland
conjecture has been extensively studied due to its strong theoretical consequences, however,
it does not have as clean of a formulation as the previous Swampland conjectures. The
general idea is that if you have a gauge symmetry such as electromagnetic U (1), identical
charged states experience repulsive forces in addition to the universal gravitational attraction.
The Weak Gravity Conjecture proposes that there must always exist charged objects where
the repulsion is stronger than (or equal to) the gravitational attraction. In our universe
this is evident. The electrostatic repulsion between two electrons is much stronger than the
gravitational attraction between them, 43 orders of magnitude stronger to be more precise!
177
The comparison of these two forces between two identical charged particles comes down
to the comparison between the charge and mass of the particle in Planck units. So we would
like the final inequality to take the form q/m in Planck units to be greater than some constant
c at least for some charged states. As we will see later, black hole offer a natural candidate
for the constant c which is the charge-to-mass ratio of large extremal black holes.
But before discussing the precise statement or consequences of such a conjecture, let us
take a step back and ask why should there be a statement like this at all? Electric charge
represents the coupling of the gauge field. Can we not just take the coupling constant to
zero? As far as field theory is concerned, this is in fact a very desirable limit where everything
interacts weakly and perturbative calculations are more convergent44 . Why should gravity
say we cannot make a certain coupling too weak?
One argument could be that as the gauge coupling g goes to 0, the theory gets closer
and closer to having a global symmetry. At g = 0, the gauge symmetry becomes a global
symmetry. Therefore, this statement is trying to quantify the no-global symmetry conjecture,
just like it quantifies the completeness of spectrum conjecture.
Now let us look at this conjecture in String theory. String theory naturally incorporates
supersymmetry, and supersymmetry implies the BPS bound which states that mass is bigger
than or equal to the (central) charge. At first, supersymmetry and Weak Gravity Conjecture
seem to be in contradiction. BPS bounds apply to all states while Weak Gravity Conjecture
can be satisfied by only some states. This leaves a way for the two statements to be
compatible in a marginal way. This is usually achieved by the existence of BPS states
which saturate the BPS bound. Therefore, in supersymmetric theories, we can think of
Weak Gravity Conjecture as the statement that BPS states must exist.
The stability of BPS states is protected by supersymmtry. In fact, there is a natural
connection between stability and the states that satisfy WGC. Consider an unstable state
with a charge q and mass m that decays into some lighter particles with charges and masses
of (q1 , m1 ), (q2 , m2 ), ..., (qN , mN ). From conservation of charge and energy we find
q1 + q2 + . . . + qN = q,
m1 + m2 + . . . + mN ≤ m,
(II.4.1)
where the last inequality is due to positivity of the kinetic energy of the product particles.
From the two equations above we find
P
P
|qi |
|q|
| qi |
|qi |
≤ Pi
≤ max
≤P
.
(II.4.2)
i
m
mi
i mi
i mi
44
Note that by charge we mean IR coupling. In other words, we compare the forces at long distances.
There are scale dependent version of Weak Gravity Conjecture as well.
178
Therefore, in the outcome of any decay, there are always particles that satisfy the WGC
better than the initial particle. In other words, stabler particles tend to satisfy WGC better.
Even BPS states which often marginaly satisfy WGC are also stable. In fact in many string
theory examples any stable charged state seems to satisfy WGC. Let us study some examples
in string theory.
4.1
Evidence from string theory
Let us consider the type IIA theory in d = 10, there is a gauge field and a single D0 brane is
charged under the gauge field. The charge-to-mass ratio of D0 branes turns out to be equal
to that of extremal black holes. Therefore, they saturate the WGC inequality. Even N D0
branes can form a bound state which still saturate the WGC inequality.
As another example, consider the toroidal compactification of Heterotic theory on T d
down to R10−d . There is a Narain lattice Γ16+d,d . Before studying the string spectrum, note
that there is a charge lattice of a group with rank 16 + 2d labled by (PL , PR ). The BPS
formula tells us that m(PL , PR ) ≥ |PR |. The mass formula from the Heterotic string takes
the form 12 m2 = 12 PR2 + NR = 12 PL2 + NL − 1. This shows that the BPS bound m ≥ PR
is automatically satisfied and the equality only holds for NR = 0. For the BPS states that
saturate the bound we have
1 2
(P − PL2 ) = NL − 1,
2 R
(II.4.3)
which implies that 12 (PR2 −PL2 ) ≥ −1 in terms of charges. In fact, the Heterotic string occupies
all of this lattice for arbitrary PR with BPS states. However, if PL is large compared to PR ,
the inequality gets violated and the supersymmetry is broken. BPS bound gives us a WGClike inequality for PR , but what about other directions of the charge lattice? In particular,
the large PL /PR direction which also breaks the supersymmetry? In fact, Heterotic string
theory mass formula gives us a WGC-like inequality in terms of PL that is m2 ≥ PR2 − 2.
As you can see, when this inequality is saturated, it satisfies the WGC even better than the
BPS states. This is a universal observation that in directions of charge lattice where SUSY
is broken (since m < |PR | BPS states are absent), the WGC is satisfied even stronger.
Let us study the above example from a dual perspective. Heterotic theory on T 3 is dual
to M-theory on K3. In that case, the Narain lattice is the lattice of 2-cycles in K3. There
is a polarization from the metric of K3 that splits them into 19 self-dual and 3 anti-selfdual cycles. The dulaity maps the wrapping of M2 branes around 2-cycles to PL and PR
on the Heterotic side. The minimal mass (minimal area) configurations that saturates the
BPS bound are M2 branes arapped around holomorphic cycles. The charge of a 2-cycle is
179
calculated by
¯ j̄ − ∂X
¯ i ∂X j̄ ).
kij̄ (∂X i ∂X
(II.4.4)
This is because M-theory three form could be written in terms of the Kähler form as
Cµij̄ = kij̄ Aµ
(II.4.5)
However, (II.4.4) is not the area (mass) which is given by a similar expression except with a
¯ i = 0 which is
plus sign in parentheses. However, the two expressions are the same when ∂X
realized for the holomorphic cycles. Thus, holomorphic cycles have masses proportional to
their charges and saturate the BPS bound.
We can even recover the PR2 − PL2 ≥ −2 which is obvious from the mass formula on the
heterotic side. To see that, one must look at the self intersection of the holomorphic curve
which is Σ · Σ = PR2 − PL2 . However, we can also calculate the self-intersection differently.
For a Riemann surface in K3 which has a vanishing first Chern class (line bundle cotangent
to the Riemann surface) we have Σ · Σ = 2g − 2 where g ≥ 0 is the genus of the Riemann
surface. This is because Σ · Σ = χ(Σ) as the local geometry of Σ in K3 is T ∗ Σ. Thus, we get
the same inequality PR2 − PL2 = 2g − 2 ≥ −2 as the one on the Heterotic side.
All this evidence shows that BPS objects are just holomorphic curves with some choice of
complex structure on K3. We found the mass relation for BPS objects and they satisfy the
WGC just like their Heterotic counterparts. However, we do not know how to calculate the
non-holomorphic mass relations because we do not know the metric of K3, and so we cannot
check the non SUSY prediction of WGC.
Let us consider one last example before formulating weak gravity conjecture. Consider
the compactification of the type II on M × S 1 /Z2 . The Z2 acts freely on M and maps θ to
its antipodal point θ + π on the S 1 . If n is the winding number, in string units we get the
mass formula m = nR for winding states which saturates M = |Q|. You might expect after
modding out we can wrap half of S 1 so we get half masses. However, since the Z2 acts freely
on M , the two endpoints of the half-wrapped strings are separated on M . Because of this
stretching, the masses of the half-integer winding strings goes beyond the M = Q line in
Planck units. Therefore, we get an infinite lattice of odd windings that does not satisfy the
WGC. However, the ineuqality is still saturated by the infinite subblattice corresponding to
the even windings around S 1 . There are stronger versions of WGC that state there must be
infinite number of bound states that satisfy the WGC [135].
Naively you would have guessed that we can push the minimal charge that strictly satisfies
WGC even higher by modding out by Zn rather than Z2 . However, in all string theory
construction the isometry groups of compact spaces are always bounded. For example, this
180
leads to a precise (but non-trivial) mathematical proposal that the order of free symmetry
group of Calabi–Yaus of a given dimension is bounded.
Now we are ready to formulate the basic version of the Weak Gravity Conjecture.
4.2
Weak gravity conjecture: formulation
Weak Gravity Conjecture (basic version)
Consider a U (1) gauge field. The charged black holes have a mass charge formula that
prevents naked singularities and is given by Q ≤ Mext (Q). The conjecture states that
there is always a ”small” charged particle with charge q and mass m such that
m
Mext (Q)
≤ lim
.
Q→∞
q
Q
(II.4.6)
Here are some natural questions that arise about the above conjecture that lead to some
generalizations of the above formulation.
• How small is q? Note as we will see in some string theory examples, q does not
have to be the fundamental charge. The word small makes the formulation a bit
imprecise. Perhaps a more precise formulation would be that there is a universal
dimension dependent constant Nd such that there exists a charged particle with q < Nd
that satisfies the above conjecture.
• How about higher-form gauge symmteries? Usually any higher-form gauge symmetry
can lead to a 0-form gauge symmetry in a lower dimensional theory. Thus, it is natural
that a similar statement must hold for higher-form symmetries as well. The higher
dimensional generalization of the WGC replaces extremal black holes with extremal
black branes and replaces mass with tension.
• What if there are no large black holes? As we will see, there are cases that where
a modulus couples to the fields in a way that there are no large extremal black hole
solutions. For example, if we compactify IIB on a conifold singularity (zero-size 3cycle). When the size of the 3-cycle is non-zero, the black hole solutions have non-zero
mass that scales with the volume of the 3-cycle. However, in the conifold limit, the
black hole mass goes to zero. But there is still a non-trivial statement here! Naively,
WGC would tell us that there must be a stable charged particle with q/m greater than
or equal to that of extremal black holes which in this case is ∞. So, if we take the
conjecture at its face value, it predicts that there must be a stable massless charged
particle which turns out to be true. Let us take a closer look at this example.
181
Consider type IIB on CY 3 with a confiold singularity (shrinking S 3 ). If you take
the four form gauge field and write it as D = Ω ∧ A where Ωabc is a three form on
the Calabi–Yau and Aµ is a one-form in the non-compact spacetime. In the lower
dimensional theory, Aµ is a gauge field and the object that is charged under it is a
3 )|
D3-brane wrapped around the 3-cycle. There is a BPS bound m ≥ |n vol(S
where n
λ
is the charge and λ is the string coupling. However, it turns out the once wrapped
brane (n = 1), is the only bound state which also happens to saturate both the BPS
bound and satisfies the WGC formula if the radius of S 3 is fixed. However, given that
the radius of S 3 is set dynamically, and it flows to 0 for black hole solutions, both the
once wrapped D3-brane as well as would-be extremal black holes are massless. In this
example, the Weak Gravity Conjecture still has a non-trivial consequence even though
there are no black holes.
Note that the fact that n = 1 is the only bound state is not obvious. If you take two
D3 branes, you expect to get an SU (2) gauge group which lives on the S 3 . If you look
at thelow-energy configurations, they have transverse directions and it turns out there
is no normalizable states due to this.
• What about de Sitter? In de Sitter, the universe has a finite size which makes it
impossible to talk about infinitely large charged black holes. We will come back to de
Sitter spaces later.
So far we have argued why a statement like WGC is natural to have in string theory.
However, WGC is a sharp statement in the sense that all the constants in the statement
are determined by black hole physics. In the following, we review the motivation for this
conjecture.
4.3
Motivation
Take a massive charged extremal black hole with mass M and charge Q that satisfies the
extremality condition Mext (Q) = AQ, where A is some constant. Let us assume that the
black hole is not BPS. We do not think such black holes are stable, because if they are, the
black hole entropy formula counts an exponentially large number of microstates which are
stable without any more fundamental reason to protect their stability. So let us assume that
there are only finite number of stable non supersymmetric objects. In that case the black
hole should decay into another black hole with mass M − m and charge Q − q by emitting a
particle of energy m and charge q. The new black hole must satisfy the extremality inequality
M − m ≥ A(Q − q). Since the old black hole was extremal (M = AQ), we find m ≤ Aq. This
inequality means the emitted particle must satisfy the Weak Gravity Conjecture. Thus, the
black hole extremality formula motivates the Weak Gravity Conjecture [134].
182
Black holes are the natural extension of particles for large masses where gravitational self
energy is significant [136]. Therefore, it is reasonable to expect that the extremal bound M >
AQ has a a generalization that extends to particles as well. However, there are corrections
to the action that modify the extremality curve for black holes. Weak Gravity Conjecture
suggests that the curve must bend in the MExt ≤ Q direction to ensure that there can be
particles that satisfy M < AQ. This is called the mild form of Weak Gravity Conjecture.
The mild form of the WGC also implies that large extremal black holes themselves satisfy
the WGC.
In four dimensions, some of the higher derivative terms that affect the extremality bound
are (a/MP4 l )(F 2 )2 and (b/MP2 l )Fµν Fαβ W µναβ . In fact, all other such terms can be rewritten in
terms of these terms using the equations of motion. The mild WGC then leads to 4a − b ≥ 0
[137, 138].
The mild form of the WGC has been extensively tested in low-energy theories in string
theory Landscape [138]. Moreover, there have been arguments that suggest the mild form
of the WGC may be related to more fundamental principles such as unitarity and causality
[137, 139–143].
However, the extremality bound
M
M Ext = Q
M
Uncorrected
extremal
curve
M Ext = Q
Corrected
extremal
curve
Uncorrected
extremal
curve
Corrected
extremal
curve
Naked
Singularity
Naked
Singularity
Black
holes
Black
holes
(a)
Q
(b)
Q
Figure II.4.1: The dashed blue line is the usual uncorrected mass-charge relation for extremal
black holes. The solid black line is the corrected curve for black holes and the dashed black
line is its extension for particles. All the particles and black holes are expected to be above
the black curve. The red star represents a particle that satisfies the WGC (m ≤ q). If the
corrections tilts the extrmal curve upward (as in (b)), the WGC particle would be in the
prohibited region. Thus, assuming an extension of the extremality bound for particles, the
WGC suggests that corrections to extremality bound will tilt it downward, as in (a).
183
4.4
Festina lente
In de Sitter space there is a natural IR length scale H1 . Suppose, the cosmological constant
is so small that there is a very large scale separation between the UV and IR scales. In that
case, we expect the flat space WGC to hold in de Sitter as well. However, there is some new
interesting features due to the finite size of de Sitter. If we put a black hole, it cannot be
bigger than the Hubble scale. This puts an upper bound on how massive a black hole can
be. The black holes that saturate this bound are called the Nariai black holes. The region
of the allowed charged black holes in de Sitter is shown in the figure below.
Naked
Singularity
Q=M
M
Nariai black holes
Naked
Singularity
Q
Figure II.4.2: Extremal curve in de Sitter space. The extremal black holes on the upper edge
are the Nariai black holes whose horizons are of the order of Hubble horizon.
We know that the lower edge of the triangle is similar to the extremal curve in flat space
which leads to the WGC. But how about the upper edge? This edge turns out to give us an
opposite inequality! For Nariai black holes in 4d, we have M ∼ Q ∼ R ∼ H1 . The electric
field at the horizon goes like E = Q2 /R2 ∼ H. Suppose there is a particle with qH ≥ m2 ,
we will have a large Schwinger pair production which will lead to a large flow of charge from
inside the black hole to outside the horizon. However, such a process will force the black to
1 1
exit the allowed region and create a naked singularity. So we find m ≥ Λ 4 q 2 . This bound is
called Festina Lente[144]. Note that this bound is satisfied by electron in our universe.
1
Interestingly, in our universe, the Λ 4 scale is the mass scale of neutrino. Note that Festina
184
Lente does not apply to broken symmetries. However, if there is an unbroken phase, you
could use this in that phase. Therefore, an unbroken electroweak symmetry which requires
masses to be zero would have been inconsistent with a positive cosmological constant.
4.5
Applications
In four dimensions, we can apply the WGC to magnetic charges. If the electric charge is g, the
magnetic charge is a multiple of 1/g. Moreover, the masses of ’t Hooft–Polyakov monopoles
go like Λ/g 2 where the Λ is some UV energy scale where a symmetry is sponatiously broken
and the monopoles are created. Therefore we find Λ ≤ gMP which implies the UV cutoff is
not necesarily all the way at Planck scale in weakly coupled theories.
Now let us consider another application to axions. Axion couples to instantons through
θ(x)F ∧ F . If θ is not a field, we have free parameters which will violate the cobordism
conjecture. If we promote θ to a field and include other parts of its action we find
f 2 (∂θ)2 + V (θ) + iθF ∧ F.
(II.4.7)
The potential V (θ) receives non-perturbative corrections by instantons in the form of a cosine
term. The discrete shift symmetry of axion is a −1 form symmetry which is gauged due to
existence of instantons. If we think of this system as a −1 form gauge symmetry with
instantons being the charges objects, we can apply the WGC and find
Sins .
MP
,
f
(II.4.8)
where 1/f plays the role of charge. For instanton actions bigger than 1, we have f < MP .
WGC is naturally connected to an older conjecture called the cosmic censorship conjecture
which is also based on avoiding naked singularities [145]. The cosmic censorship conjecture
roughly states that naked singularities cannot arise due to natural physical dynamics. However,
recently a counteraxample was found in pure Einstein-Maxwell setup in AdS [146]. The idea
is to gradually turn on the electric field over time until a naked singularity appears. However,
the Swampland conditions immediately tell us that we must have charged states with small
masses which will screen large electric fields. Therefore, the WGC resolved the puzzle,
including the precise numerical factors.
There are more applications to cosmology and particle phenomenology that involve relationships
between the mass of the neutrino and the cosmological constant. For example, if you take
the strong version of WGC that says everything strictly satisfying WGC is unstable, the nonsupersymmetric AdS must be unstable. This is because a non-supersymmetric brane whose
near horizon geometry is that AdS (carries the same fluxes) would be unstable [147]. Now
185
we can use this for compactification of our universe on a circle which leads to an inequality
for the mass of neutrino [148].
In all the well-known non-supersymmetric constructions of AdS, there have been found
some instantons that create instabilities (e.g. [149]).
5
5.1
Swampland IV: Distance conjectures
Introduction
In the previous sections we talked about how the low-energy field theories typically have
finite field ranges. The idea was that by varying the scalar field, at some point the potential
energy might increase so much that it surpasses the cut-off scale. However, if one increase
the cut-off, previously distinct low-energy theories might unify in the sense that they are
realized in different corners of the UV theory’s field space (see Figure from section 1).
On the other hand, increasing the field range, lowers the EFT cut-off by bringing in new
light states which were not part of EFT. In this section we will try to quantify this observation
from examples in string theory. But first, we need to clarify our terminologies. Let us start
with the notion of moduli space.
5.2
Moduli space
There are different notions of moduli space. Sometimes it is taken to be the space parametrized
by scalar fields. However, for us it will represent the space of different vacua. For example,
suppose we have a background with a set of scalar fields φi and an effective action Γ. The
expectation value of the scalar fields extremizes the effective action,
δ
Γ(hφj i) = 0.
δφi
(II.5.1)
Once, we take a true quantum mechanically stable background, we can study perturbation
around that background. Suppose the effective action at low energies can be approximated
with the following local action
ˆ
1
Γlocal = dD x[ η µν gij (∂µ φi )(∂ν φj ) − Vef f (φi )] + . . .
(II.5.2)
2
where . . . represents the terms containing fields other than scalar fields. We are interested
in values of scalar fields where the potential is minimum. This might not be a single point,
186
but rather a manifold. Since the potential is constant in the directions of field space that
potential stays minimum, these are called the flat directions of the field space. We can think
of these values as boundary conditions for the scalar fields at infinity, each of which defines
a distinct vacuum. For now, we will assume that the minimum value of the potential is zero
which is to say the background is Minkowski. We will consider non-flat backgrounds later.
The Vef f = 0 subspace of the field space is called the moduli space and it represents the
space of different vacua. Suppose the moduli space is locally parametrized by scalar fields
φI . The restriction of the effective action to the moduli space takes the following form.
ˆ
1
dD x η µν gIJ (Φ)(∂µ φI )(∂ν φJ ).
(II.5.3)
2
One can show that under a reparmetrization of the scalar fields φI , gIJ transforms as a
symmetric rank 2 tensor. In other words, gIJ can be viewed as a metric on the field space.
The above action is called the non-linear sigma model and is simply the generalization of free
scalar field theory to the case where the field target space is geometrically non-trivial.
We will refer to the metric gIJ as the canonical metric on the moduli space. Using this
metric, we can
talk about geometric quantities associated with the moduli space, such as
´ q
´
√
distance dl gIJ dld φI dld φJ or volume dN φ g.
Note that the effective action generally has non-local terms, but at low energies (compared
to some UV scale) we expect a local description to be valid.
Now let us consider a special class of theories; supersymmetric theories.
5.3
Supergravities
Given that global supersymmetry is only a symmetry at V = 0, one might think that
supersymmetry always protects V = 0, however this is not true. For example, an N = 1
supergravity theory with some chiral fields has a scalar potential,
V = e−K (|DW |2 − 3|W |2 ),
(II.5.4)
where W is the superpotential and K is the Kähler potential. In string theory almost always
N = 1 theories come with non-zero superpotentials unless they are ”secretly” even more
supersymmmetric. When we have 8 or more supercahrges, which corresponds to N ≥ 2 in
four dimensions, there is no scalar potential in an ungauged supergravity. However, through
gauging we might get a scalar potential.
As we explained in the previous section, the moduli space M comes with a canonical
metric that can be read off from the kinetic term. But what do we know about the geometric
187
properties of M? Let us start with its dimension. It turns out M could be zero-dimensional.
For example, one (and possibly only) example in Minkowski background is 11 dimensional
supergravity. Other examples in AdS backgrounds are AdS7 × S 4 or AdS4 × S 7 . However
the AdS5 × S 5 IIB background has the IIB coupling as a modulus.
Exercise 1: Show there are scalars in M-theory that minimize their potential at the 11d
supergravity corner.
Now let us consider 10d supergravity theories which have more interesting moduli spaces.
All of these theories have one modulus in common, the dilaton. However, while the type
IIA, type I, and Heterotic theories all have a real dilaton, the type IIB theory has a complex
coupling constant τ . Note that in all of these examples the target space of dilaton is noncompact. We can also compute the distances and see that it goes off to infinity at the
asymptotes. This is thanks to the special form of dilaton’s kinetic term,
ˆ
∂λ
S10d ∝ ( )2 + . . .
(II.5.5)
λ
Im τ
l→0
Im τ → ∞
For type IIB theory where the coupling is complex, the metric takes a slightly different form
ds2 = dτ dτ̄ /(τ22 ). However, due to the SL(2, Z) identification the topology of the moduli
space is not like a plane. After the identifications, the moduli space has an infinite diestance
limit and two cusp points.
Identify
Cusps
Re τ
Figure II.5.1: The moduli space of type IIB theory is the H/SL(2, Z). The moduli space has
an infintie distance limit and two cusp singularities.
188
Note that as we go up in the infinite distance limit Im(τ ) → ∞, the size of the cycle that
wrap around the throat go to zero. Therefore, any geodesic can be shrunk to an arbitrarily
small length with continuous deformation. In other words, if we include the asymptotic
points, there is no non-trivial cycle. Also, the cusp points are usually when we get enhanced
gauge symmetries Z4 and Z6 . Note that the diameter of the moduli space is still infinite but
this time the volume is finite as opposed to the IIA or Heterotic case.
5.4
Dualities and infinite distances limits
There is very close connection between infinite distance limits in the moduli space and
dualities in quantum gravity. When we say two theories are dual, it means they have the
same moduli space and there is a correspondence between the physical objects in the two
theories. In perturbative dualities (usually T-dualities), the perturbative degrees of freedom
are mapped to each other. However, in non-perturbative (strong-weak) dualities, we must
include non-perturbative objects in each theory to complete the corresopndence. Some of
our most powerful insights into the non-perturbative features of quantum gravity come from
dualities. In that sense, understanding dualities is as fundamental as understanding quantum
gravity.
Moreover, usually, the dualities are between theories that have a well understood perturbative
description in some corner of their moduli space. In all the known examples, these ”corners”
are always some infinite distance limits. In other words, the dualities are between perturbative
descriptions of theories at two different infinite distance limits of the moduli space. Two most
important examples of such infinite distance limits are (1) when string coupling goes to 0,
(2) when the scalar field corresponding to the size of internal geometry goes to infinity. In
the first example, higher string excitations must be included and the pertuabtion becomes
more convergent. In the second example, higher KK modes must be included which can
be achieved by using the higher dimensional theory. In both cases, there is a rich physical
structure that enters and changes the IR description at the infinite distance limits. Moreover,
the nature of what happens is closely connected with the type of dualities that connect that
corner with other corners. Thus, the goal of understanding the infinite distance limits of the
moduli space is as deep as understanding the nature of dualities in quantum gravity. In fact,
we will see that just like dualities that fall into some universal classes, infinite distance limits
in string theory also seems to fall into some universal classes.
We only deeply understand infinite distance limits of the moduli space since we can use
perturbation theory as the coupling gets weak in some duality frame. Usually interesting
things happen at these infinite distance limits in the form of some states becoming light.
The reason these infinite distance limits are important is because these tells us more about
dualities which are some of the big mysteries about string theory. Note that from the field
189
theory perspective, it is very unnatural to expect any rich structure to appear at the infinite
distance limit in a flat direction.
With the above explanation in mind, let us now consider some more complicated examples
of the connection between dualities and infinite distance limits in string theory. Consider
type II theories on Calabi–Yau threefolds. The resulting theory is a four dimensional N = 2
theory. The moduli space of such theories is a direct product of the Coulomb branch (vector
multiplet scalars) and the Higgs branch (hypermultiplet scalars). From the string theory
perspective, the moduli of the Calabi–Yau become the moduli of the four dimensional theory.
For example, if we compactify IIB, the complex structure moduli become Coulomb branch
moduli and the Kähler moduli mix with R–R-fields to give the Higgs branch moduli.
On the other hand, if we compactify the heterotic theory on a d dimensional torus we
find that the moduli space is
SO(d + 16, d; Z)\
SO(d + 16, d)
× R+ .
SO(d + 16) × SO(d)
(II.5.6)
This space has many infinite distance limits. For example, you can take the limit where one
compactification radius goes to infinity.
SO(16 + d, d; Z) → SO(16 + d − 1, d − 1; Z) × SO(1, 1).
(II.5.7)
The leftover moduli would be the moduli space of the heterotic theory in one higher dimension
as expected.
Now let us consider another example. Consider supersymmetric theories in 6d with (1, 0)
supersymmetry. There are various ways of getting such a theory. For example we can put
Heterotic on K3, F-theory on elliptic CY threefold, or M theory on K3 × I. If you are given
a point in this moduli space, if the duality picture is correct, you should be able to go to
the inifinte distance limits of the moduli space to discover all these different corners. For
example, from the M theory perspective, one can shrink the the instantons at the end of the
interval and move them to the inside to get NS5 branes from the Heterotic perspective.
Let us take a step back. The infinite distance limits are very unnatural to have any
interesting physical characteristic from field theory perspective. However, in quantum gravity,
infinite distance limits have rich physics. The question of understanding infinite distance
limits in string landscape is as deep as the question of understanding dualities in string
theory.
Let us consider another example. Consider the type II theories (IIA and IIB) on CY
threefold. Both theories will give us a 4d N = 2 theory. Duality between IIA and IIB
suggests a pairing between Calabi–Yau threefolds such that IIA on one is the same as IIB
190
on the other. This symmetry is called the mirror symmetry and the two manifolds are called
the mirror pairs. The mirror symmetry has been very useful in studying superymmetric
theories. For exmaple, if you compactify on a threefold with ADE singularities you get ADE
gauge theories as we discussed before. If we compactify IIB on a threefold with an ADE
singularity, the D1 branes wrapped around the 2-cycle give non-perturbative corrections. In
fact, such non-pertubative corrections are so large that the formally infinite distance limit
of a shrinking CY beomes a finite distance point in the moduli space. Instead of dealing
with the instanton sum in IIB picture, it is easier to work in the S-dual IIB picture. In
the dual picture, D1 branes get mapped to the worldsheet instantons and the perturbative
calculations are trustable and automatically take the summed up worldsheet instantons into
account, which leads to the above claim.
Now we try to see what are the intrinsic way of seeing these infinite distance limits in the
lower dimensional theory without knowing the UV construction behind the theory.
5.5
Universal properties of infinite distance limits
The first observation is that at infinite distance limits we always get a tower of light states
(m ≪ Mpl ). And the second observation is that the tower is always weakly coupled. These
properties have been tested for many infinite distance limits in the known string constructions
[150–152].
Let us start with string excitations in 10d. If we use MP8 = MS8 /gs2 to go to the Einstein
1
frame, we see that mass of the string states in Planck units scales like gs4 . Therefore, in the
gs → 0 limit, the string states are light in Planck units, and by definition weakly coupled.
Therefore, the string excitations become stable light particles at the infinite distance limit.
Similarly, in the stroung coupling limit of type IIB when τ → 0, the D1 branes give
rise to a light tower of weakly coupled states. However, in type IIA the situation is slightly
different. In the strong coupling limit gs → ∞. The tower is the KK modes of M theory on
circle corresponding to D0 branes, the infinite distance limit is not longer a 10d theory. One
might wonder why IIA D0 branes are weakly coupled? This is because when we compactify
M theory, the coefficient of R is R11 and its inverse is some power of coupling D0 brane
coupling.
Now let us move to the two heterotic strings. Interestingly, the strong coupling limits of
the two theories are very different. In one (SO(32)) we get an exponential number of states
(string tower) and in the other (E8 × E8 ) we get much less number of states (KK tower).
It seems infinite distance limits correspond to infinite number of light states. However,
can we say the opposite? (i.e. infinite number of states only appear at infinite distances?)
The answer is no! Take M-theory on T 6 /Z3 . The T 6 is the product of three identical T 2 with
191
Teichmüller variable of ω and the Z3 acts by ×ω3 on each T 2 where ω = exp(2πi/3) is the
third root of unity (Fig II.5.2).
τ =ω
×ω
ωp
p
1
Figure II.5.2: Each T 2 is constructed by identifying z ∈ C with z + n + mω where n and m
are integers. The complex plane has a Z3 symmetry which acts as z → z × ω. Since this
symmetry maps the lattice {n + mω|n, m ∈ Z} to itself, it is also a symmetry of the torus.
This theory is at finite distance due to the finite distance resolution of the singularity.
However, there are infinite number of light particles. For example, if you take the P 2 from
the resolution of the singularity and wrap M2 branes around it, you get infinite number of
light particles. A degree-d curve in P2 is the Riemann surface.
p(z1d + z2d + z1d−1 z2 + ...) = 0
(II.5.8)
where P2 is the projective space (z1 , z2 ) ∼ λ×(z1 , z2 ). In this limit
with a genus g = (d−1)(d−2)
2
you get a conformal field theory with tensionless strings coming from M5 branes wrapping P2
and light particles corresponding to M2 branes wrapping surfaces that interact. In CFT with
more than four dimensions we almost always get tensionless strings. However, this tower is
not a weakly coupled string which distinguishes it from the tower of light states in the infinite
distance limits. Given this amount of supersymmetry, this is the only kind of new phases (or
critical phenomena) that you can have. In all these limits the gravity is decoupled meaning
the volume of the manifold can be taken to infinity.
Note that when we say a tower is weakly coupled, we measure the coupling in the original
192
description. For example, the KK tower is weakly coupled in the lower dimensional theory
but the theory could be strongly coupled in the higher dimensional description.
There is even more quantifiable structure to these infinite distance towers. You can
compute distance using the canonical metric. And you can ask how fast do the towers
become light? It turns out that the mass scale of the tower always goes to zero exponentially
as
mtower ∼ e−αdist.
(II.5.9)
where dist. is measured in Planck units and α is some O(1) constant. In all the known
1
examples α ≥ √d−2
[153].
Exercise 2: Show that type IIA theory has a tower of states with exponentially decreasing
masses m ∼ exp(−αϕ) in the gs → ∞ limit with α = 2√3 2 where ϕ is a scalar field with
canonical kinetic term − 21 (∂µ ϕ)2 in the Einstein frame.
Exercise 3: Similar to the previous exercise, show that in type IIB string theory, in both
limits gs → ∞ or gs → 0, there is a tower of states with exponentially decreasing masses
with a decay rate of α = 2√1 2 .
The exponential rate is the same for the two infinite distance limits in IIB but not the
same in IIA.
Another example is the SO(16) × SO(16) Heterotic theory which is non-supersymmetric
but we know how to describe it at weak coupling. In that case too, in the infinite distance
limit, we get an exponentially light tower of states.
Now we are ready to give a more precise formulation of the distance conjecture. First,
we give the basic version, then we state stronger versions with additional conditions.
Distance conjecture
Basic version: At any infinite limit in the moduli space, a light tower of light states
emerges with m ∼ e−α·dist. [154]. Additional versions include
1) The tower is weakly coupled.
2) If the moduli space is not a point, it is non-compact.
3) The first homology is always triviala .
4) There is always a dual description in the infinite distance limit. However, it is
193
difficult to make this statement precise.
This statement sounds like a consequence of the no-global symmetry conjecture given that a nontrivial homotopy leads to a topological charge. However, the conservation could be violated by moving
the closed curve in the direction of the massive scalars which are not part of the moduli space.
a
There is a refinement of the first and last conditions which states that there are only
two possibilities for the leading tower: either KK tower or a tensionless fundamental string.
This refinement is known as the emergent string conjecture [155]. This conjecture is very
non-trivial because we could have had tensionless membranes in the infinite distance limits,
but such membranes always turn out to remain relatively heavy.
The constant α is expected to be of order one, but the lack of a precise lower bound
for it in the formulation of the conjecture makes the conjecture a bit less precise than some
other conjectures, such as the WGC. However, we will see in the next section that de Sitter
conjectures suggest a precise lower bound for α that seems to be true for all stringy examples.
There are two natural ways to relate the mass of the tower to the scalar potential which are
m2 ∼ V and md ∼ V [156–158]. We will discuss these two options in the next section and
show that they lead to the following natural candidates for α.
α≥ √
2
1
or √
.
d−2
d d−2
(II.5.10)
1
This is consistent with all known cases where α ≥ √d−2
. The sharpened version of the
√
distance conjecture [153] states that 1/ d − 2 is indeed the lower bound for α. In [153],
it was shown that this proposal is invariant under dimensional reduction and is saturated
in toroidal compactifications of supergravities. Moreover, the authors in [153] made the
√
following interesting observation in string theory examples that α = 1/ d − 2 if and only
if one of the leading towers of states is a string tower and commented on its connection
with emergent string conjecture. In the following, we explain why the two conjectures are
indeed related. In particular, we show that the sharpened distance conjecture follows from
the emergent string conjecture.
According to the emergent string conjecture, every infinite distance limit in the moduli
space is either a fundamental string limit or a decompactification limit. Let us first assume
that a limit is a perturbative string limit. A limit with a fundamental string is a limit where
the scattering of all weakly coupled particles are given by a fixed worldsheet theory (in string
units) with a coupling that goes to 0. Therefore, this infinite distance limit, by definition,
corresponds to a limit where all scalars are kept fixed in the string frame with the exception
of the string coupling exp(φ), which is taken to 0.
´
By definition, φ couples to the worldsheet via Rφ in the string frame. Similarly,
´
the metric couples as g µν ∂X µ · ∂X ν in the string frame. Since we assume the string is
194
fundamental, we can apply the machinery of the perturbative string theory to the above
vertex operators to read off the tree-level amplitudes of graviton and φ. This will give us the
standard string theory result for the effective action in the string frame [1].
M d−2
S= s
2
ˆ
e−2φ (R + 4(∂φ)2 + . . .).
(II.5.11)
After going to the Einstein frame, we find
S=
ˆ
1
MPd−2
R + (φ̂)2 + . . . ,
2
2
where MPd−2 = Msd−2 exp(−2φ) and φ = φ̂ ·
√
d−2
.
2
Ms = M P e
(II.5.12)
If we combine the two, we find
1
κφ̂
− √d−2
.
(II.5.13)
Therefore, whenever the light states are described by a fundamental string, the coefficient in
√
the distance conjecture is exactly 1/ d − 2.
Now suppose the leading tower of light states is described by a KK reduction of a higher
dimensional field theory. If we take a D dimensional theory and
q compactify it down to d
D−2
ρ̂) where ρ̂ is the
dimensions, the mass of the KK tower will go like m ∼ exp(− (D−d)(d−2)
p
canonically normalized volume modulus. Note that the coefficient (D − 2)/[(D − d)(d − 2)]
√
is always greater than 1/ d − 2 and it saturates it at D → ∞. Therefore, for any KK
tower, if we move in the direction of the corresponding volume modulus, the tower satisfies
√
λ > 1/ d − 2. In the following, we show that the emergent string conjecture implies that no
√
mixing of the volume modulus with other moduli can bring this coefficient below 1/ d − 2.
We provide an algorithmic procedure that changes the infinite distance limit in a way that
strictly decreases the coefficient of the distance conjecture. Then we show that the endpoint
of the algorithm is either a string limit or a KK limit where only the volume modulus is
√
taken to infinity. Since the coefficient of the distance conjecture in both cases is ≥ 1/ d − 2,
√
We find that the coefficient of any tower is ≥ 1/ d − 2. The argument also trivially implies
that the only towers that saturate this bound are string towers.
The procedure
The limit that takes the volume modulus to infinity while keeping other moduli fixed
yields the largest coefficient of distance conjecture among theories which decompactify to
a particular theory. This is because it avoids any unnecessary change of moduli to which
the KK tower is insensitive. Let us try to lower the coefficient of distance conjecture from
195
q
(D−2)
(D−d)(d−2)
by gradually changing the direction of the limit in a region where the theory
still decompactifies to the same theory. The smallest coefficient for the KK tower (the largest
mixing of the volume modulus) must be achieved on the boundary of the region where the
higher dimensional description holds. According to the emergent string conjecture, there are
two possible scenarios for the new description at the boundary, we either decompactify to an
even larger dimension, or we get a string tower. If we find a string limit, we know that the
√
coefficient of the distance conjecture is 1/ d − 2. Therefore, we have found a strict lower
bound for the coefficient of the KK towers inside the decompactification region. Otherwise,
if we decompactify to an even larger internal geometry, we can repeat the algorithm to lower
the coefficient of the distance conjecture. There are only two ways the process stops: we
√
either find a string tower, which as we explained, provides a strict lower bound of 1/ d − 2
for the initial KK tower. Or we end up with a rigid decompactification limit, i.e. it only
occurs in one limit. In that case, that limit must correspond
to taking the volume modulus
q
(D−2)
to infinity, and the corresponding coefficient must be
for some D. Again, we
(D−d)(d−2)
√
find that the coefficient of the tower is strictly greater than 1/ d − 2.
√
The above argument shows that string towers satisfy α = 1/ d − 2, while for KK towers
√
α is strictly greater than 1/ d − 2. As we will explain in the next section, this sharpened
bound is also motivated by holography [159].
Another way to interpret the distance conjecture is that for a given cut-off, the field range
of any EFT is finite. Because if we move the vev of the moduli too much, there will be new
light states below the cut-off that were previously integrated out. Since these new states are
light enough to be excited, the EFT that assumes they are in their vacuum is no longer a
good approximation. Thus, the effective field theory breaks down.
Suppose the cutoff is Λ and the scale of the tower is m. The distance conjecture tells us
that if we traverse ∆φ ∼ | ln(m/Λ)| in the moduli space in Planck units, the effective field
theory will break down.
The finiteness of the moduli space is not surprising from the EFT perspective, however
the size of it is. In any effective field theory, we neglect some higher order irrelevant operators
in the perturbative expansion of the action. For example, operators that are proportional
to (φ/Λ)n . However, such operators will become significant at large values of φ. But the
difference between quantum gravity and non-gravitational field theory is that EFT puts a
polynomial upper bound in Λ on the field range rather than a logarithmic one.
It is always nice to connect Swampland conditions to black holes since we know some of
their universal features that must be true in any theory of quantum gravity. For distance
conjecture, there is a heuristic connection to black holes [160]. Consider the field configuration
that is φ = 0 at the origin and φ = φ0 at some fixed radius. It turns out for large enough
field ranges of O(1) in Planck units, this configuration collapses into a black hole. Therefore,
196
the EFT breaks down where we want to probe some regions of field range with light modes.
Sometimes in the infinite distance limit it appears that we get a tower of light instantons
instead of a tower of light states. In these cases it turns out the instantons correct the action
in such a way that the formally infinite distance point becomes a finite distance point at
the moduli space [161]. In general, tower of light instantons drastically modify the geometry
of the moduli space and their appearance signals that we are not working in the correct
perturbative duality frame.
For example, consider the type IIA on a Calabi–Yau threefold. If we take the radius
)2 kinetic term, we would naively think the
modulus for the CY to zero, because of the ∝ ( dr
r
field space distance goes to infinity. So both zero and infinite size are at infinite distance
limit. However, at the zero size, the worldsheet instantons that wrap aroung CY give large
corrections to the metric and make the infinite distance finite. This point is T-dual to
conifold singularity in IIB which we discussed earlier. You can even continue past the 0
size and analytically continue to negative volumes until the space eventually comes to an
end at the Landau-Ginzburg point with no massless modes and enhanced gauge symmetry.
After analytic continuation, the moduli space complexifies to a complex plane with LandauGinzburg point at the 0 and the zero size CY at 1.
Another example of instantons making a formally infinite distance limit finite is M theory
on quintic. The instantons corresponding to M5 branes wrapped around threefold make the
formally infinite distance limit of zero size quintic finite.
Particles vs instantons
Tower of light particles (0+1 d states) come with infinite distance and tower of light
instantons erase infinite distance limits.
A very natural question to ask about Swampland condition is that how do they change
under compactification? Suppose you have a tower of states at an infinite distance, would
you still get the same tower at the corresponding limit in the lower dimensional theory? The
answer is not necessarily! Because the 0 + 1 dimensional objects of the higher dimensional
tower can now wrap around the compact circle and become instantons that drastically correct
the geometry of moduli space.
We saw how the tower of light instantons erase infinite distance limits. One might wonder,
could it be that tower of weakly interacting light states create the infinte distance limits and
that is why they are associated with each other? There have been several attempts to
approach distance conjecture from this perspective. In particular, to assume that the moduli
space is always morally compact and it can only become non-compact (have an infintie
distance limit) due to corrections caused by light tower of states. In other words, the tower
197
of states cause the infinite distance rather than the other way around. We will come back to
this point and what morally compact means in the last section on finiteness conjecture.
For example, in field theory if we integrate out a fermion, all the terms involving scalars
that interact with that fermion receive corrections. Let us estimate this correction. Consider
an interaction like ψ̄m(φ)ψ.
p+q
ψ
φ
∂φ m
φ
∂φ m
q
q
ψ
p
Figure II.5.3: Correction to the propagator of φ from integrating out a missive spinor ψ
which interacts with φ via a term ψ̄m(φ)ψ.
The correction to the fermion propagator comes from the following integral
∼
ˆ
dd p
(∂φ m)2 .
2
(p/ + m)
(II.5.14)
P d−4
The correction to the metric takes the form of
mi (∂φ m)2 . Under certain assumptions,
one can show that corrections of this type can create an infinite distance limit for the scalar
field φ. For example, if we assume the tower to be uniform with jumps of ∼ ∆m(φ) (e.g. KK
2
tower), we end up with a metric of the form ds2 ∼ d(∆m)
∼ (d ln ∆m)2 which indeed has an
(∆m)2
infinite distance limit.
Another natural question to ask is how the two characteristics of the tower (coupling
strength and mass) are related? Suppose the tower is a charged light tower. In that case, the
WGC (if satisfied by the tower) tells us that m . g. Thus, it is not a coincidence that both
the coupling and mass are going to zero; one explains the other, including the exponential
dependence since g ∼ e−βφ . However, the other piece of the distance conjecture still remains
a mystery. Why weak coupling has something to do with infinite distance? Usually the
coupling is exponential with some modulus. In other words, the gauge coupling, if promoted
to a field, takes a kinetic form like ∼ (∂g/g)2 . A satisfying explanation for this behavior is
still missing. Usually in the web of the Swampland conjectures everything fits nicely together,
but there is always some missing piece that stops us from completely deriving one conjecture
from another.
In the above argument about the connection between the distance conjecture and the
198
WGC, we assumed that the tower is charged. However, not every tower is always charged.
For example, the strong coupling limit of E8 × E8 is M theory on interval where the tower is
a KK tower with no gauge charge. In this case there seems to be an approximate or Higgsed
U(1).
The distance conjecture has an important cosmological implication. In general, the
distance conjecture is in tension with conventional slow-roll inflation which you need a
5 − 10Mp field range for the inflation. In that case, the distance conjecture tells us that
that you get a tower of states that break down the EFT.
5.6
AdS and CFT distance conjectures
In the following, we use distance conjecture to motivate two other conjectures, one about
AdS and the other about CFTs. In all the well-controlled AdS constructions in string theory,
the spacetime takes the form AdSd × S p × M r where M r is some compact manifold. and the
length scale of the extra pieces (S p and M r ) scale like the AdS scale.
ΛAdS ∼ −
1
lS2 p
.
(II.5.15)
In holography there is a relation between the mass m of the particle in the bulk and and
dimension ∆ of the corresponding operator on the boundary
(mlAdS ) ∼ ∆.
(II.5.16)
Suppose we could find a case where the radii of the extra dimension decouples from the
AdS scale. In that case, we could find a huge gap in the dimesnions of the CFT operators. If
1
there were to be no gaps, we must always have masses which go like m ∼ |Λ| 2 ∼ 1/lAds . This
last equality, is observed in all the examples with well-controlled de Sitter constructions. But
this sounds very much like the distance conjecture. In fact, in string theory, the cosmological
constant is usually exponential in some modulus
Λ ∼ e−cφ ,
(II.5.17)
and the distance conjecture predicts a tower of states with masses
′
m ∼ e−c φ .
(II.5.18)
More generally, we can define distance in the space of metric configurations gµν and one can
see ∝ | ln(Λ)| is a natural notion of of distance to be used instead of φ. Another way of
199
deriving the same result is to combine the two equations (II.5.17) and (II.5.18). We find that
in the limit Λ → 0, there must be a tower of states with masses that are polynomial in Λ.
This is called the AdS distance conjecture [162]. The stronger version of the conjecture (for
SUSY case) fixes the exponent and claims that there is always a tower of states with masses
that go like
1
m ∼ |Λ| 2 .
(II.5.19)
This conjecture in particular means that there is no pure AdS quantum gravity. For
example, it rules out the holographic dual of pure AdS3 .
If we apply this to dS in our universe with small Λ, it tells us that in our universe we
should expect a tower of states m ∼ |Λ|α with α ∼ O(1). In other words, we must have a
hierarchy problem as a consequence of the cosmological constant problem. Moreover, they
have to be weakly coupled which could be a candidate for the dark sector. This has recently
lead to the dark dimension scenario with α = 41 [163, 164].
Exercise 4: For each massive particle in the standard model, assuming that it is in a tower
of states satisfying m ∝ Λα , find the value of α.
The AdS conjecture implies that we cannot have an arbitrarily large mass hierarchy in
AdS space. If we have a large AdS, the first excited states will have a mass of order Λ1/2
which in string theory compactifications correspond to the extra dimensions. In other words,
the AdS space cannot be studied in isolation and it always acompanies the extra dimensions,
because there cannot be a limit where the AdS scale goes down but the kk modes of the
extra dimensions or any analagous excitations are gapped enough to not be considered.
Let us try to come up with a counterexample for this statement. Consider an AdS
construction with a sphere as a compact manifold extra dimension. Suppose we want to
keep the AdS scale fixed while increase the masses of the KK modes. This would correspond
to keeping the curvature of the sphere fixed while decreasing its diameter. This is because
the mass of the KK modes correspond to the eigenvalues of the Laplacian which increase as
the diameter of the space increases. One might think an easy way to do that would be to
replace the sphere with some spherical orbifold where the sphere is moded out by a symmetry
subgroup. The simplest example would be Zn rotation group. However, if we mod out sphere
by this group, the diameter does not change.
200
Zn
Figure II.5.4: Quotienting out a sphere by its Zn subgroup will decrease its volume, but not
its diameter. The distance between the north and south remains unchanged.
But Zn is not the only symmetry of hypersphere. So, the AdS conjecture is making
a mathematical prediction: consider a sphere with radius 1 and mod it out by an isometry
subgroup Γ. There must be a minimum on the diamter of S/Γ. This is infact a true statement.
For example, it was show that for the case of three sphere the minimum diameter is achieved
by the icosahedral subgroup.
This has been shown to be true in an arbtirary Sasaki-Einstein manifolds coming from
CY.
Now let us apply the AdS conjecture to holographically realized CFTs. Conformal field
theories in dimensions less than five can have a moduli space which is called a conformal
manifold. If the CFT realized holographically, their moduli space matches with the moduli
space of massless scalars in the bulk.
Consider a symmetry generator in the CFT which has spin J and dimension greater than
d − 2 + J (according to unitarity). Particularly, when J = 2, you get the energy-momentum
tensor which has dimension d. But how about higher spin generators? Can we have them
and can they saturate the unitarity bound? These generators, if they exist, are called higher
spin symmetries. If they exist in a CFT, they would include J = 4 and infinitely many more.
So far, these are all claims that can be shown from CFT data.
Moreover, it can be shown that if such higher spin symmetries exist, there is a sector of
the theory that is a free CFT. This points in the conformal manifold are higher symmetry
(HS) points. We expect to find infinitely many weakly coupled currents as we get closer and
closer to an HS point. But, this sounds very similar to the statement of distance conjecture!
In fact, the only known examples of HS points are realized at the infinite distance in the
conformal manifold. For example in △ SYM, you take the τ → 0, the theory becomes free.
The CFT distance conjecture states that the higher symmetry points are always at inifnite
201
distance limits in the conformal manifold and they are free [165].
Now if we take the anomalous dimension γ4 = −(J + d − 2) how fast does it go to zero?
It scales like
[diam(M)] ∼ β lnǫ γ4
(II.5.20)
which is consistent with the distance conjecture. In the CFT context the conjecture says
that theere is a tower of higher spin currents that comes down with at least an exponent of
1/4.
Note that the CFT distance conjecture is more general than just an application of the
AdS distance conjecture because some CFTs might not have a holographic dual.
6
Swampland V: de Sitter conjectures
So far, all the string theory examples we have studied were either Minkowski or Anti-de Sitter
spacetimes. Similar to how Minkowski spacetime is the maximally symmetric spacetime with
zero cosmological constant, de Sitter (dS) and Anti-de Sitter (AdS) spaces are respectively
the maximally symmetric solutions with positive and negative cosmological constants. In
this section, we move away from Minkowski and AdS, and study dS spacetimes. The
symmetry algebra of the Minkowski, AdS, and dS are different which lead to very different
properties among the three spaces. For example, the symmetry algebra of Minkowski and
AdS can be extended to a supersymmetry algebra, however, as we will see, the same cannot
be done for de Sitter. Therefore, any potential dS construction in string theory is nonsupersymmetric. Given that there is no known stable non-supersymmetric examples in string
theory, constructing de Sitter, if at all possible, is much more challenging than Minkowski or
AdS in string theory.
Studying de Sitter spaces is particularly important, because our universe seems to be
approximately de Sitter at its current cosmological stage. This could be realized in different
ways. The cosmological constant, which is the vacuum energy density, could be the value
of the scalar field potential V (φ). The fact that this number seems to be almost constant
means that the universe is:
• stuck at a local minimum of V (φ), or
• V (φ) has a very small slope which makes it look almost constant, or
• significantly fine-tuned to be at top of the potential
202
V (φ)
V (φ)
V (φ)
′
|V | ≪ V
|V ′ | ≪ V
|V ′ | ≪ V
≃Λ
≃Λ
≃Λ
φ
φ
φ
Figure II.6.1: Three possible explanation for the slow variation of the cosmological constant.
As we will discuss later, the scalar potential is believed to not have absolute positive
minimum. Thus, if the first scenario happens, the potential will likely be lower somewhere
else in the field space. When this happens, the scalar field can tunnel through the potential
barrier to decrease the cosmological constant. Two examples of such processes are ColemanDeluccia instantons and Hawking-Moss instantons [166, 167]. Hence, no matter which
scenario happens, the value of cosmological constant is thought to eventually decrease. In
fact, in all known examples in string theory, loss of supersymmetry comes with an instability
even in Minkowski or AdS spaces. These observations suggest that the correct question to
ask is: How stable or unstable de Sitter space can be?
We can understand de Sitter space as a hypersphere embedded in a higher dimensional
flat space. Consider the solution to the following equation:
−X02 ± X11 + X22 + ... + Xd2 = ±R2 ,
(II.6.1)
{Q, Q† } = σµ P µ + ...,
(II.6.2)
P
where the metric in the ambient space is ds2 = −dX02 ± dX12 + i>1 dXi2 . The plus sign
corresponds to de Sitter space while the minus sign corresponds to Anti-de Sitter space. The
symmetries of the two spaces are respectively SO(1, d) and SO(2, d − 1). One of the main
differentiating features of dS is that it cannot support a global notion of positive conserved
energy. This is also where the tension with supersymmetry comes in. Supersymmetry algebra
tells us there must be a positive definite bosonic symmetry operator that can be written in
terms of square of supersymmetry generators. To be more precise, suppose Q is one of
the supersymmetry generators, {Q, Q† } is a positive definite bosonic symmetry generator.
Moreover, {Q, Q† } transforms as a tensor product of two spin 1/2 representations under
rotation, therefore, there must be a vector representation in the decompositition of {Q, Q† }.
203
where P µ is the density flow of a positive-semidefinite conserved quantity H.
ˆ
H=
⋆P,
(II.6.3)
Σ
where Σ is a space-like Cauchy surface. Therefore, if de Sitter geometry cannot support a
global positive deifinite energy, it cannot support supersymemtry.
If such an H exists, it would generate an isometry in de Sitter space. However, all the
∂
0 ∂
symmetries in de Sitter are generated by Killing vectors of the type X i ∂X
0 + X ∂X i or
∂
j ∂
The Noether’s conserved current has a term ∆L which
X i ∂X
j − X ∂X i , where i, j 6= 0.
captures the deformation of Lagrangian under the symmetry transformation. Due to the X i
prefactor in Killing vector fields, the term ∆L will linearly depend on the coordinate. Note
that this does not happen in flat space because the Killing vector ∂t does not have such a
~ 2 /2 + V
prefactor. Therefore, the corresponding conserved current is of the form (∂t )2 /2 + ∇
with no X i prefactor which allows it to be manifestly positive semidefinite. However, in de
Sitter, the sign of the current cannot be semidefinite and will depend on the sign of some
∂
0 ∂
combination of coordinates. For example, for the conserved current under X i ∂X
0 + X ∂X i ,
the corresponding conserved current can have a different sign depending on the sign of X i .
Exercise 1: Show that AdS avoids this problem.
In N = 1 supergravities, the scalar potential looks like
V = eK (|DW |2 − 3|W |2 ).
(II.6.4)
Supersymmetry requires DW = 0 but it does not require W = 0, which is why it can give
us non-positive cosmological constants.
6.1
(Non)-supersymmetry and (in)stability
The fact that we need to study de Sitter or quasi de Sitter backgrounds means we need to
study non-supersymmetric backgrounds. Let us go back to the Minkowski case and search
for non-supersymmetric Minkowski background. How can we break the supersymmetry?
Scherk-Schwarz mechanism
One way to break the supersymmetry completely is to impose anti-periodic boundary
condition on fermions in a circle. This boundary condition breaks all supersymmetry, however,
at the same time, for small radius, it creates a Tachyon from winding string which is the
familiar stringy Tachyon in the NS sector. This is the Scherk-Schwarz mechanism for breaking
204
the supersymmetry [56].
One might ask what is the ultimate fate of this universe? First, we can calculate the
effective potential for the scalar field corresponding to the radius. In supersymmetric theories,
the one-loop amplitudes vanish. However, now that supersymmetry is broken, the one-loop
contribution to the effective potential is no longer zero and therefore the potential gets
corrected. After going to the Einstein frame, we get a potential like V ∼ −R−nd where nd
is a positive constant that only depends on the spacetime dimension d (e.g. n4 = 6 [168]).
Hence, the quantum corrections generate a potential that shrinks the size of the circle. But
this is not all. The winding string has a scalar mode that at small enough radius R becomes
Tachyonic. Suppose we call this field φw , we see that our perturbative potential does not
have a minimum in the R, φw plane which means the theory is unstable.
Another way of seeing this instability is through non-pertubative processes. Witten
showed that in a theory with antiperiodic frmions, there is a finite action bounce solution
that creates a bubble of nothing which expands and eats the universe [169].
Smooth
geometry
Figure II.6.2: By capping the geometry in a smooth way, we allow the spacetime to end on
a domain wall. This is Witten’s bubble of nothing.
Note that if you put periodic boundary condition, the spin structure would be nontrivial
and shrinking the circle and capping the geometry would be impossible.
Thanks to the earlier perturbative analysis we knew that the solution is unstable and
thanks to the non-perturbative analysis, one can see what the fate of the instability is; the
space disappears!
Let us consider other examples of non-supersymmetric string theories.
Type 0 theories
In construction of type II string theories, we apply specific GSO projections to the
spectrum of closed superstring to find a consistent theory that is modular invariant, satisfies
level matching, and has a mutually local and closed OPE. It turns out that there are two
205
more restrictions of the full spectrum other than type IIA and type IIB that satisfy all these
consistency conditions. These theories are called 0A and 0B theories. The spectrum of these
theories is made up of NS-NS and R-R sectors (no mixed sectors). Both theories have NS-NS
states with the equal right moving and left moving worldsheet fermion numbers projection.
However, type 0A has the R-R states with (−)F = −(−)F̃ while the type 0B theory has states
with (−)F = (−)F̃ . It is easy to see that type 0B theory is chiral while type 0A theory has
a parity invariant spectrum. These theories have different spectrums than type II threories.
Since there are no NS-R or R-NS sectors, this theory has no spacetime fermions. Thus, it
does not have any supersymmetry. At the same time, it has the NS-NS tachyon. This is yet
another example of how the lack of supersymmetry is accompanied with an instability. Our
next examples are non-supersymmetric orbifolds.
Non-SUSY orbifolds
We can consider orbifolds T d /G with tori with periodic boundary conditions for fermions
(no Scherk-Schwarz mechanism) with a symmetry G that does not preserve any supersymmetry.
As an example, we will study compactifications on T 2 /Z3 . Consider a torus C/Γ such that
the lattice Γ is generated by shifting z by numbers r and ω · r where r ∈ R and ω is the third
root of unity.
The lattice Γ is invariant under multiplication by ω which makes this actions a Z3
√
√
symmetry of the torus. This action has three fixed points {0, exp(πi/6)r/ 3, ir/ 3}.
τ =ω
1
Figure II.6.3: The red points are the fixed point under the Z3 orbifold action
To study the spectrum of this theory, it is easiest to work in the Green-Schwarz formalism
where gauge is fixed to lightcone and the spacetime supersymemtry is manifest. Let us quickly
review the Green-Schwarz formalism. For bosnonic string, we can go to lightcone gauge
206
X + = 2α′ p+ τ and write the action in terms of the transverse coordinates X i (1 ≤ i ≤ d − 2)
as
ˆ
X
1
αβ
∂α X i ∂β X i ,
(II.6.5)
dσdτ
η
S=−
4πα′
i
where η is 2d Minkowski metric. The advantage of lightcone gauge is that all the remaining
oscillators are physical. Note in the lightcone gauge, the worldsheet fields furnish representations
of the little group of d dimensional massless particles which contains an SO(d − 2) subgroup
corresponding to the rotations in the transverse directions. For the bosonic worldsheet
fields, the remaining coordinates X i furnish a vector representation of SO(d − 2). We can
generalize the lightcone gauge to superstrings. In that case, we should incorporate some
spacetime fermionic ”coordinates” which will give rise to worldsheet oscilating modes. This
spacetime fermions are nothing other than the supercharges. This is why in the GreenSchwarz formalism, the spacetime supersymmetry is more manifest. For type II theories
where we have two spacetime supersymmetries, the superstring action in the lightcone gauge
is [54, 170]
ˆ
−1 X
i
S = dσdτ
∂α X i ∂ α X i +
S̄γ − ρα ∂α S,
(II.6.6)
′
4πα i
4π
where S Aα is both a worldsheet fermion and a spacetime fermion. The index A is a 2valued worldsheet index while the index α is a 32-valued spacetime index. γ µ are 32dimensional spacetime Dirac matrices and ρα are two-dimensional worldsheet Dirac matrices
(Pauli matrices). Lastly, S̄ is defined as
†
(γ 0 )αβ (ρ0 )A
S̄Bβ = SAα
B.
(II.6.7)
Note that although S 1α and S 2α are 32-dimensional, they do not live in a 32-dimensional
space. In fact, they only have 8 degrees of freedom. This is because each one is a 10d
Majorana–Weyl spinor which is a 16-dimensional real representation. Moreover, after going
to the lightcone gauge, they must satisfy the lightcone condition
(γ + )S A = 0.
(II.6.8)
These reduce the number of degrees of freedom in S 1 and S 2 to eight. In fact, we can think
of each one of them as the minimal irreducible spinor representation of the SO(8) subgroup
of the little group. The action can be then rewritten in terms of 8d spinors S Aa where
1 ≤ a ≤ 8 and S 1 and S 2 transform in the 8d spinor representation of Spin(8). Each one of
these components, just like the X i , gives rise to raising and lowering operators Sna that live
in 8s (or 8c depending on parity) of Spin(8) for any fixed n. Therefore, we have two sets of
207
raising and lowering operators that build up the spectrum:
µ
µ
living in 8v representation of Spin(8).
and α̃−n
• Bosonic modes α−n
1α
• Fermoinic modes S−n
and S 2α living in 8s or 8c representation of Spin(8).
Note that in the Green-Schwarz formalism, as opposed to the RNS formalism, worldsheet
fermions map to spacetime fermions.
Now let us study the weight lattice of each one of the two representations. Cartan of
SO(8) (or Spin(8)) is four dimensional. We can take the four generators to be rotations in
the four planes, {(X 1 , X 2 ), (X 3 , X 4 ), (X 5 , X 6 ), (X 7 , X 8 )}. The corresponding eigenvectors of
the representations under the elements of Cartan are:
8v : (±1, 0, 0, 0), (0, ±1, 0, 0, 0), (0, 0, ±1, 0), (0, 0, 0, ±1)
1
8c : {(s1 , s2 , s3 , s4 )|∀i ∈ {1, 2, 3, 4} : si = ± & Πi si = +1}
2
1
8s : {(s1 , s2 , s3 , s4 )|∀i ∈ {1, 2, 3, 4} : si = ± & Πi si = −1}.
2
(II.6.9)
Now let us go back to our T 2 /Z3 orbifold. The action of Z3 is an order-3 rotation in the
(X 1 , X 2 ) plane. Given that all the spinors have half-integer weight under this rotation, none
of them will be mapped to themselves under a θ = 4π/3 rotation. Note that the spinors S a
are the spacetime supersymmetry charges, therefore, there are no fermionic zero modes in
the untwisted sector, and no supersymmetry will be preserved under the Z3 rotation.
Now let us look at the twisted sector. The winding modes in the twisted sectors are
typically massive, however, near the fixed points, they can shrink to zero size and give us
a light spectrum. Therefore, we expect three copies of a light spectrum coming from the
degrees of freedom of the twisted sectors localizing around the fixed points. Let us start with
the (X 1 , X 2 ) = (0, 0) fixed point. We consider the perturbations around the string solution
where (X 1 (σ, τ ), X 2 (στ )) = (0, 0) and for i > 2, X i (σ, τ ) is constant. These perturbation are
subject to the twisted boundary condition ∂σ [X 1 + iX 2 ](σ, τ ) = ω[X 1 + iX 2 ]∂σ (σ + 2π, τ ).
These boundary conditions shift the frequencies n from integers by ±1/3. Similarly, the
twisted boundary conditions affect the fermionic oscillators S a with suitable (−)F action on
S to match the orbifold action order 3. However, given that all of them are affected by the
Z3 . The frequencies of all of them are shifted by ±1/3. Shifting the frequencies has a twofold impact: it changes the 2d casimir energy and changes the mass of the first excitations.
Therefore, the previosuly massless excitations, get gapped. Usually, light states are expected
to dominate the contribution to the casimir energy. In this case, there are six remaining
bosonic light states while all the other 2d single particle states are gapped. Since bosonic
degrees of freedom lower the casimir energy while fermionic degrees of freedom raise it, we
208
expect the final result to be negative which means the theory has a spacetime tachyon. Now
let us do a more precise calculation. The theory, before compactifying on an orbifold, was
supersymmetric and tachyon-free. Therefore, the casimir energy was zero. Given that two
of the bosonic modes (X 1 and X 2 ) have been gaped exactly like the fermionic modes (in
terms of the shift in the frequency), their effect cancel that of two of the fermionic modes.
Therefore, all that is left to calculate is the shift in the casimir energy from six fermionic
modes. If the modes numbers are shifted by something which is equal to 0 < η < 1 modulo
1, the casimir energy is shifted by each mode by (−1)F 41 η(1 − η) where F is the worldsheet
fermion number. Therefore, the total shift is equal to
6 · (−1) · [
11 2
1
( )] = − < 0.
43 3
3
(II.6.10)
The casimir energy of the 2d theory translates to the mass squared of the single particle state
with lowest m2 in spacetime. Therefore, this theory has tachyons in spacetime.
Exercise 2: Consider orbifolds with a singularity of the type C2 /Zn where the Zn acts by
multiplication by some powers of n-th root of unity on each C. Show that if the Zn preserves
supersymmetry, there is no Tachyon but if it does not, there is always a Tachyon.
Now let us give some good news! There is a 10d theory [65, 171] where there is no tachyon,
no supersymmetry, and even though it is chiral, it does not have gauge, gravitational, or
mixed anomalies thanks to the Green-Schwarz mechanism.
O(16) × O(16) Heterotic string
Since this theory is a modification of the Heterotic construction, let us first review
the E8 × E8 Heterotic string theory. The Heterotic string theory can be described in
two ways. One is in the Green-Schwarz formalism [54, 170] where there is 16 lef-moving
bosonic coordinates, 10 ordinary bosonic coordinates (left and right pair), and 16 fermionic
coordinates S a furnishing the 10d right-handed Majorana–Weyl representation. The S a are
nothing other than 10d supersymmetry generators. The 16 left-handed bosnonic coordinates
have a 16d Euclidean Narain lattice Γ16 . There are only two possibilities; the root lattice of
SO(32) and the root lattice of E8 × E8 . Now take the E8 × E8 theory. The compact bosons
have a fermionic description as well. To see that we should go to RNS formalism. Although
the supersymmetric Heterotic strings are easier to construct in Green-Schwarz formalism, the
non-supersymmetric versions are easier to work with in the RNS formalism. In the E8 × E8
theory in the light cone, we have three sets of 16 worldhseet Majorana–Weyl fermions. Two
sets are left movers (which equivalently describe the compact bosons) and one set is the usual
right-moving sector. Each of these fermions can have NS or R boundary condition. We allow
all 8 posiibilities. However, we do a GSO projection to only keep the states that have even
number of worlsheet fermion of each set. Moreover, we only keep the states that respect the
209
level-matching condition.
{N S+, R+}L × {N S+, R+}L × {N S+, R+}R /level-mathcing.
(II.6.11)
There is a left-mover tachyon which is thrown out by the level matching condition. We can
think of the above spectrum as a Z2 × Z2 orbifold of a theory with same boundary conditions
for all fermions. Before orbifolding, this theory is the heterotic analogue of the type 0 theory,
and similar to that is tachyonic. Let us call this theory the type 0 Heterotic theory. The
spectrum of type 0 Heterotic theory is
(N S FL1 , N S FL2 , N S Fr ) ∪ (R FL1 , R FL2 , R FR ),
(II.6.12)
where {FL1 , FL1 , FR } are the worldsheet fermion numbers and
FL1 + FL2 + FR ≡ 2
mod 2.
(II.6.13)
The E8 ×E8 Heterotic theory can be viewed as a Z2 ×Z2 orbifold of the type 0 theory. The first
Z2 multiplies the first left-moving fermion by (−1)F
L1 and the second Z2 does the same for the
second left-moving fermion. The untwisted sector of this orbifold is just (N S+, N S+, N S+)∪
(R+, R+, R+). However, in the twisted sector, the boundary conditions can mix between
NS and R. This leads to the spectrum (II.6.11) of the E8 × E8 theory. This theory is modular
invariant and consistent. However, there is a Z2 ambiguity known as discrete torsion. This
is to say, we can sum over the twisted sectors with specific weights that can be thought of as
´
exp(i B). Suppose α is the generator of the first Z2 and β is the generator of the second
Z2 . We want to change the β such that it acts non-trivially on α, however we should make
sure we can assign appropriate signs to the one-loops twisted sectors to maintain modular
invariance. Suppose ǫ(p, q) where p, q ∈ Z2 × Z2 is the sign of the one-loop twisted sector
where there is a p twist in the σ direction and a q twist in the τ direction. In the E8 × E8
theory, we take ǫ = +1 for all sectors. However, there is another modular invariant choice
which has ǫ(α, β) = −1 while ǫ(1, 1) = +1. This choice gives a new non-Tachyonic orbifold
theory [65, 171]. If one works out the ǫ(p, q) and does the level-mathcing carefully, one sees
that this theory has the following massless 10d spectrum.
Bosonic:
gµν , Bµν , Aµ , φ
Fermionic:
Ψ+ : (16, 16), Ψ− : (128, 1) ⊕ (1, 128),
(II.6.14)
where the numbers show the representation of Majorana–Weyl fermions under a O(16) ×
O(16). The subscript represent the chirality of the fermions. The massless vector fields Aµ
are in the adjoint of O(16) × O(16). Therefore, there is a O(16) × O(16) gauge symmetry
in the theory. Note that this theory is chiral because the left and right handed fermions
210
are in different representations of the gauge group. Chiral theories often have gravitational,
gauge, or mixed anomalies. However, this theory, although chiral, has the same number
of left and right handed 10d Majorana–Weyl spinors (256 of each). Therefore, there is no
gravitational anomalies. As for gauge and mixed anomalies, they are canceled via the GreenSchwarz mechanism. This theory is a non-supersymmetric and non-tachyonic theory which
is perfectly fine at tree-level. Whenever we give up supersymemtry, the magical cancelation
of one-loop vacuum amplitude might not occur. This would coorect the effective potential.
After computing the one-loop vacuum amplitude, it turns out the cosmological constant is
positive! In fact, we could expect this from the fact that there are more bosons than fermions.
Exercise 3: Count the number of massless degrees of freedom in Heterotic O(16) × O(16)
and show there are more fermionic degrees of freedom than bosonic.
Even though the vacuum energy is positive, this solution is not the usual de Sitter. The
reason for this discrepancy is that de Sitter is a solution to an effective with a positive constant
when the action is written in the Einstein frame where the coefficient of R is constant.
However, the string calculation give us the effective action in string frame where the Ricci
scalar has a dilaton dependent prefactor. We must redefine the metric to go from one frame to
another, and when we do that, we see that the previosly constant term in the action, becomes
dilaton dependent. In other words, what we have found is not a cosmological constant, but a
positive potential for dilaton. However, the potential turns out to be exponential in dilaton.
Exercise 4: Suppose the O(16) × O(16) theory has a cosmological constant in the string
frame. Go to the Einstein frame and show that the potential has exponential dependence in
dilaton V = e−αφ . Compute α.
Another way of seeing the exponential behavior of the potential is that the normal scale
for the vacuum energy in string theory is MsD which depends on the dilaton field in the
Einstein frame via
2D
4D
MsD ∼ MPD gsD−2 ∼ MPD e− D−2 Φ .
(II.6.15)
However, note that Φ is not canonically normalized in the Einstein frame yet which is why
the exponent is slightly different. The gs dependence signifies that this effective potential is
generated by quantum corrections from from string excitations.
To summarize, the O(16)×O(16) theory does not live in de Sitter space since the effective
potential has no minimum. Therefore, the effective value of cosmological constant will keep
211
descreasing.
Note that when V (φ) goes like e−αφ , |V ′ |/V ≃ α is an O(1) constant (in Planck units).
If we put this theory on a circle it turns out to be connected to Heterotic on a circle with
anti-periodic boundary condition with a suitable Wilson lines around the circle. As we know
from the Scherk-Scwarz mechanism, that theory is tachyonic. Therefore, we cannot use the
O(16) × O(16) theory to find similar lower dimensional theories.
Note that even though the O(16) × O(16) theory averted tachyonic instability, it has
a runaway instability. Other proposals for non-supersymmetric tachyon-free perturbative
string backgrounds also have runaway instabilities [172, 173]. The instability, in one form
or another, seems to be a universal feature of non-supersymmetric theories. There is no
existing proposal for a non-supersymmetric permanently stable vacuum. The question is,
how stable can a non-supersymmetric vacuum be? For example, in the Tachyonic examples
this is captured by the the imaginary mass of the tachyon. In the examples we mentioned,
the mass of the tachyon (imφ ), which is captured by second derivative of scalar potential, is
at least of the same order as the vacuum energy produced by quantum corrections. In other
words, |V ′′ /V | is at least order one in Planck units. The reason is that in all the examples,
the quantum corrected potential goes like V ∼ MsD where D is the spacetime dimension and
|V ′′ | = m2T achyon /2 ∼ Ms2 . Therefore, we have
|V ′′ |
∼ Ms−(D−2) & MP2−D ,
V
(II.6.16)
where in the last line we used D ≥ 2 and Ms < MP .
To summarize, in all the examples we reviewed, a positive cosmological constant either
comes with a runaway instability (|V ′ | & V ) or an unstable equilibrium (V ′′ . V ) where the
inequalities are written in Planck units. Both of these inequalities suggest that de Sitter are
not too stable. We will try to understand and explain these case-based observations in the
following sections.
6.2
de Sitter and tree-level string theory
In this section we review a general argument from [174] that shows the observation from the
last section were not just coincedences and they are in fact true at tree-level weakly coupling
limit of M-theory compactification. There are similar arguments that apply to the same or
other corners of the string theory landscape [175–180].
Consider M-theory and compactify it on an arbitrary manifold M with a non-vanishing
G-flux. This setup is the most generic M-theory construction. The lower dimensional theory
212
technically has a scalar potential with infinitely many scalars corresponding to infinitely
many possible deformation of the internal manifold and its fluxes. One might be tempted
to think that surely this infinite dimensional space has some critical point which is a local
minimum of the tree-level potential. However, it turns out as long as we can trust the classical
supergravity description (curvature and fluxes are sub-Planckian), there can be no critical
points. In fact, there is a stronger constraint than ∇V 6= 0. We show there is a universal
order one lower bound on |∇V |/V which prevents V from becoming too flat.
When we compactify the higher dimensional theory on M, the integral of the higher
dimensional action over M show up as an effective potential in the lower dimensional theory.
In the reduced Planck units, the potential reads
ˆ
1
√
V ≃−
(II.6.17)
g(R − |G|2 ).
2
M
Suppose we write the metric as a direct sum of a Minkowski metric with a warped internal
geometry.
ds2 = dxµ dxν ηµν + e2ρ ds2I ,
(II.6.18)
where ρ only depends on x and is a scalar field in the lower dimensional theory which controls
the overall size of the internal manifold. We will show that ρ is an unstable mode in the sense
that the potential monotonically decreases as ρ increases. The monotonicity of the potential
makes ρ continually descrease without ever stopping at a stable value.
Note that both terms in (II.6.17)
√ depend exponentially on ρ. For non-constant ρ(x),
´
the 11-dimensional action M×Rd GR gives a kinetic term for ρ which is not canonically
normalized (it is not of the form 1/2(∂µ ρ)2 ). To find the canonically normalized field, we
should first rewrite the action in the Einstein frame in which the lower dimensional Ricci scalar
has a constant coeffitient. The change of frame will change the coefficient of the kinetic term of
2
ρ. Finally, we normalize the field ρ such that the kinetic term is 1/2(∂µ ρ̂)p
. After some careful
calculation, one can see that the canonically normalized scalar is ρ̂ = ρ 9(11 − d)/(d − 2).
Exercise 5: Show that in M theory compactification to d dimensions, the potential is
proportional to VR e−λ1 ρ̂ + VG e−λ2 ρ̂ where
6
,
λ1 = p
(d − 2)(11 − d)
2(d + 1)
λ2 = p
(d − 2)(11 − d)
(II.6.19)
and ρ̂ is the canonically normalized volume modulus.
Note that VG is always positive since the contribution of the flux is always positive.
However, the contribution of VR could be negative or positive.
213
Now let us compute the slope in the ρ̂ direction in the regions where V is positive. If
both contributions to the potential are positive.
|V ′ |
λ1 e−λ1 ρ̂ VR + λ2 e−λ2 ρ̂ VG
λ1 e−λ1 ρ̂ VR + λ1 e−λ2 ρ̂ VG
=
≥
≥ λ1 ,
V
e−λ1 ρ̂ VR + e−λ2 ρ̂ VG
e−λ1 ρ̂ VR + e−λ2 ρ̂ VG
(II.6.20)
where we used λ2 > λ1 for d > 2. Now let us consider the case where VR is negative.
Assuming V is positive, we have
|V ′ |
−λ1 e−λ1 ρ̂ |VR | + λ2 e−λ2 ρ̂ VG
−λ2 e−λ1 ρ̂ |VR | + λ2 e−λ2 ρ̂ VG
=
≥
≥ λ2 .
V
−e−λ1 ρ̂ |VR | + e−λ2 ρ̂ VG
−e−λ1 ρ̂ |VR | + e−λ2 ρ̂ VG
(II.6.21)
Again, we used λ2 > λ1 . Note that in both cases, as long as V is posotive, we find a
lower bound on |V ′ |/V which is either λ1 or λ2 . Either of these numbers are universal O(1)
constants that only depend on the dimension of spacetime. Is this a proof that |V ′ |/V &
O(1)? Not quite. This proof only takes the tree-level action into account and neglects the
quantum effects. However, given that the classical piece is unable to produce a de Sitter, it
is safe to say our only hope to get a de Sitter space in string theory is via quantum effects.
Note that there might be directions in which the potential has a local minimum but this
shows that there is always a direction in which the potential is monotonically decreasing. In
other words, the direction of steepest descent is never too flat.
|∇V |
> λ1 .
V
6.3
(II.6.22)
de Sitter conjectures
In the previous sections, we considered non-supersymmetric examples in string theory and
observed that they always come with an instability which is either in the form of a local
maximum (tachyons) or a rolling potential. In both cases, the potential was never too flat
by which we mean either |V ′′ | & V or |V ′ | & V . Then we reviewed a general argument
that shows the inequality of the type |V ′ | & V holds in any well-controlled regime of Mtheory moduli space where quantum corrections are small. In this section, we formulate
these observations into concrete conjectures, and we study their consequences45 .
de Sitter conjecture
The effective scalar potential satisfies one of the following two inequalities at every point
45
For other refinements of the de sitter conjecture motivated by tachyonic de Sitter solutions [181], see
[182, 183].
214
in the field space [174, 184, 185]:
|∇V | ≥ c1 V
or
min(∇i ∇j V ) ≤ −c2 V,
i,j
(II.6.23)
where mini,j (∇i ∇j V ) is the minimum eigenvalue of the Hessian in an orthonormal basis
and c1 and c2 are O(1) constants in Planck units. This orthonormal basis, as well as the
size of the gradient vector |∇V |, are defined with respect to the canonical metric on field
space (gij ) which is determined by the kinetic terms of the scalar fields
1
LKinetic = − gij ∂µ φi ∂ µ φj .
2
(II.6.24)
A few quick remarks:
• The conjecture is trivially satisfied for V ≤ 0.
• The de Sitter conjecture forbids a positive valued, local minimum for scalar potential.
Because at such a point, we will have V > 0, |∇V | = 0, and mini,j (∇i ∇j V ) > 0. These
signs will violate both of the inequalities in the de Sitter conjecture.
• The two conditions are very similiar in spirit. They both say that any solution with a
positive cosmological constant is sufficiently unstable. If the first condition is satisfied,
the instability is given by a steep rolling direction in the field space. And if the second
condition is satisfied, we have a a steep tachyonic direction.
• Even though the motivations for the conjecture were completely unrelated to inflation,
the statement of the conjecture is almost equivalent to saying that inflation cannot
happen. For positive potentials, we can rewrite the two conditions in the following
form.
1 |∇V |2
ǫ= (
) > O(1)
2 V
or
η=
− mini,j (∇i ∇j V )
> O(1).
V
(II.6.25)
The two parameters ǫ and η are called slow-roll parameters in inflationary cosmology.
In many conventional inflationary models their value is required to be small to avoid an
initial condition fine-tuning problem. If the values of both parameters is small, there
is an attractor solution called the slow-roll trajectory which removes the fine tuning
problem. However, de Sitter conjecture seems to be directly against such inflationary
models. However, the constant numbers c1 and c2 are not explicitly known. One could
speculate that the universal values are ∼ 0.01 due to some numerical factors. However,
215
it is fair to say that conventional inflationary models are at least in tension with the
de Sitter conjecture.
• The no-go theorems for M-theory [174, 176] and type II theories [157, 177, 180] tell us
that at least there is a very strong evidence for this conjecture in the classical regime.
But the classical regimes are nothing more than the infinite distance limits in the moduli
space. In fact, in those limits, the first inequality in the dS conjecture seems to suffice
and V ′′ /V . −1 is not needed. Although the de Sitter conjecture is believed to be true
in the infinite distance limits of the moduli space, we do not have much evidence for
its validity in the interior of the moduli space, due to strong coupling.
The de Sitter conjecture is telling us that potential must decay exponentially in the
asymptotic of the field space. It is reasonable to assume de Sitter conjecture is related to the
distance conjecture which also concerns infinite distance limits. Consider an infinite distance
limit. The distance conjecture tells us that there is a tower of light states with exponentially
decaying masses m ∼ exp(−αφ). Normally, we would expect the same physics that leads
to the mass of the tower states to contribute to the vacuum energy which has a simension
of M d . Thus, it is natural to expect that V ∼ md ∼ exp(−αdφ).However, we know some
examples in flux compactifications where the contribution to the potential comes from the
flux terms |F |2 ∼ g 2 where g is the gauge coupling. We can use Weak Gravity Conjecture
to estimate g with m and conclude that the tower particles may contribute to the potential
as m2 instead of md . These arguments connect the undetermined universal constants in de
Sitter conjecture and distance conjecture to each other. If we had a sharper rationale for de
Sitter conjecture, we could not only fix the constants in the de Sitter conjecture, but even
fix the constant for the distance conjecture.
In the following we review another Swampland conjecture which can serve as a potential
rationale underlying the de Sitter conjecture.
Consider a homogeneous and isotropic expanding d-dimensional universe. Such a spacetime
has a E(d − 1) symmetry group. Such a spacetime is called an FRW solution and the metric
can be written as ds2 = −dt2 + a(t)2 dx2 where H = aȧ is called the Hubble parameter and
a(t) is called the scale factor. This solution is of significant phenomenological interest given
that our universe seems to be isotropic and homogeneous to a good degree.
The dynamics of a(t) is determined by the content of the theory and the initial conditions.
(d−1)(d−2) 2
For example, for
H ∝ Λ. The
2
√ a universe with a cosmological constant Λ, we have
constant H ∼ Λ corresponds to de Sitter space. As one can see from the metric, positive
value of H corresponds to an expanding universe and if H is constant, the expansion is
exponential (a(t) ∝ exp(Ht)). This fast expansion gives multiple physical meanings to the
natural length scale in this universe, which is 1/H and is called the Hubble horizon. If
H remains constant, any two points with spacelike seperation |∆x| > H1 will be out of each
216
time
other’s light cone. Therefore, there is a horizon at r = H1 from the perspective of the observer
who is moving at r = 0. But there is a second feature which in some ways is very unique
to de Sitter space. All the fields in a de Sitter background have some quantum fluctuations,
just as they would have in any background. However, due to the exponential expansion
in de Sitter space, these quantum fluctuations exponentially expand almost at the same
rate as the scale factor. At some point the fluctuations exist the Hubble horizon and the
expansion overcomes becomes so fast that even the peaks and troughs of the perturbation
exit each other’s lightcone. Therefore, the mode does not have enough time to react to the
expansion and it effectively ”freezes”. This means the amplitudes of the fluctuations change
but their profile does not. Modes with frequencies k ≫ H oscillate but the ones with k ≪ H
freeze. In addition to the freezing, these quantum flcuctuations classicalize upon their exit
from the Hubble horizon. This means, the Wigner distribution of the physical observables
becomes such that the quantum fluctuations can be well-estimated with a classical ensemble.
Intrestingly, these fluctuations remain classical even if at some point the expansion stops and
they re-enter the Hubble horizon.
Wavelength of fluctuations
Figure II.6.4: As trans-Planckian modes exit the Hubble horizon, they freeze out and
classicalize.
The freezing of the quantum fluctuation is a proposed mechanism to produce the fluctuations
in CMB. However, there is something bizarre about classical imprint of trans-Planckian
physics when they exit the Hubble horizon. If the expansion is sufficiently long, the quantum
fluctuations that eventually exit the Hubble horizon and classicalize can be as small as Plancksized. The Trans-Planckian Censorship Conjecture (TCC) states that this should not be
possible.
217
Trans-Planckian censorship conjecture (TCC)
In a consistent theory of quantum gravity, a long-lasting expansionary solution in which
Planck-sized fluctuations exist the Hubble horizon does not exist [156].
af
1
lp <
.
ai
Hf
(II.6.26)
Let us say a few quick remarks about TCC.
• The immediate consequence of TCC is that the expansion at a constant Hubble parameter
cannot last longer than τT CC ∼ H1 ln( MHP ). More concretely, TCC implies that regardless
of the dynamics of H, a significant change to the rate expansion must happen before
τT CC (See [158] for an overview of possibilities).
• The age of our universe is only a few orders of magnitude smaller than τT CC . Thus, our
universe marginally passes the test all thanks to the log term in τT CC . In a universe with
a cosmological constant, you can only measure the Hubble parameter using experiments
that have a Hubble size scale. For example, this could correspond to measurements of
the light coming from a Hubble time in the past. But according to TCC, as soon as
the universe is old enough to measure the Hubble parameter, it is approaching the end
of an era.
• If we have a scalar field with a monotonically decreasing potential, it cannot be too flat
over very long field ranges. The equation of motion for the scalar field gives
(d − 1)(d − 2) 2 1 2
H = φ̇ + V (φ)
d
2
′
φ̈ + (d − 1)H φ̇ + V = 0.
(II.6.27)
From this we find
1
H
>p
.
φ̇
(d − 1)(d − 2)
Now we can use TCC and change the integration variable to find
ˆ
H
dφ < − ln(Hf ).
φ̇
(II.6.28)
(II.6.29)
By combining the two equations, we find
Hf . e−∆φ/
218
√
(d−1)(d−2)
(II.6.30)
Since V . H 2 , we find
V . e−2∆φ/
√
(d−1)(d−2)
(II.6.31)
In the exponential case, we find
|
V′
2
.
|≥ p
V
(d − 1)(d − 2)
(II.6.32)
The above inequality is for trajectories that are driven by an exponential potentials in
addition to some extra positive contribution to the Hubble energy for example from
a tower of states. If we ignore the extra contribution of the tower and consider the
trajectories that are driven purely by the exponential potential, the TCC leads to
|
2
V′
|≥ p
,
V
(d − 2)
(II.6.33)
which is a stronger constraint46 . In [186], it was argued that the emergent string
conjecture implies that the mass scale of any tower of light states is always higher
than the Hubble scale. In that case, the states of the tower are too massive to get
excited and contribute to the cosmological evolution, and the TCC is equivalent to
the above condition on the asymptotic behavior of the potential [156]. This looks like
the de Sitter conjecture, but this time with an explicit coefficient. Therefore, TCC
gives a strong justification for de Sitter conjecture in the asymptotics of the moduli
space. In fact, there is no known counterexample to this inequality in the known string
theory constructions (see [156, 157, 187] for tests of TCC in string theory and [188]
for Karch-Randall setup). This fixes the coefficients of both the de Sitter conjecture
and distance conjecture through its relation to the de Sitter conjecture. Note that,
TCC is formulated based on the expansionary trajectory rather than any instantanious
configuration. Therefore, the consequences of TCC are usually constraints on the shape
of the potential over long field ranges rather than pointwise implications. This seperates
TCC from the de Sitter conjecture. For example, TCC does not imply |V ′ |/V & O(1)
at every point in the moduli space. This makes TCC a more relaxed constraint for
late time cosmology. However, TCC is very restrictive for early universe cosmology
[189]. The conventional inflationary models are either inconsistent with TCC or highly
fine-tuned too explain all the observational data.
• It is also worth mentioning that as opposed to the de Sitter conjecture, TCC allows
46
This condition was also suggested based on prohibiting eternal accelerated expansion. For cosmologies
that are driven by exponential potentials, TCC is satisfied if and only if the expansion is decelerated [156].
a
P
This follows trivially from afi < M
Hf since the right side is linear in time.
219
meta-stable de Sitter vacua. However, it requires their lifetime to be smaller than
τT CC . Consider an expansionary trajectory that is sourced by a cascade of tunnellings
between metastable vacua.
V
Domain wall
∆Λ {
Figure II.6.5: The blue curve is the effective potential which effectively captures the dynamics
of the universe as a result of a cascade of tunnellings between nearby vacua.
Suppose we can effectively describe the above expansion using an effective monotonic
potential Vef f . It turns out that applying TCC for the individual tunnelings implies
that |Vef′ f | > V 3/2 in Planck units [190]. Interestingly, this is exactly the inequality that
must be violated to get eternal inflation [191]. Thus, TCC suggests that there cannot
be an eternal inflation in the interior of the moduli space. Moreover, if the potential is
generated by a top-form gauge potential, TCC implies the higher-dimensional generalization
of Weak Gravity Conjecture for the domain wall between neighboring vacua [190].
• Let us go back to the connection to the distance conjecture. As we explained before,
we expect the potential to be either ∼ m2 or ∼ md in the asymptotic limits of field
space where m is the mass scale of the lightest tower [156–158]. If we want to be on
the conservative side, we can apply md ∼ V to (II.6.32) which leads to the following
bound on the coefficient of the distance conjecture is
2
α≥ √
.
d d−2
(II.6.34)
and on the stronger side, based on V ∼ m2 TCC suggests that
α≥ √
220
1
.
d−2
(II.6.35)
Remarkable, the above inequality coincides with the sharpened version of the distance
conjecture [153]. We can also motivate this relation via the Higuchi bound [192],
which states that particles with spin s ≥ 2 are heavier than the Hubble scale.√ If
we apply this bound to the particles of the lightest tower, we find m & H ∼ V .
√
Assuming the sharpened distance conjecture m . exp(−κ∆φ/ d − 2) this leads to
√
V . exp(−κ · 2∆φ/ d − 2) which agrees with (II.6.33). Note that this is only a
heuristic derivation because we applied the Higuchi bound to rolling backgrounds which
are not de Sitter spaces.
• There is an interesting connection between TCC and the holographic principle [159]. To
understand the holographic argument for TCC, let us review the holographic principle.
In quantum gravity, the notion of spacetime is expected to be emergent. A nice example
of this emergence is T-duality, where depending on the size of the compact dimension,
the spacetime that provides the best semiclassical description can change47 . If the
notion of spacetime is emergent in quantum gravity, true physical observables cannot
rely on a definition of spacetime. However, this raises the question that then what is the
meaning of an effective field theory in quantum gravity, given that it is fundamentally
a theory of local observables, e.g. fields. At its most basic form, the holographic
principle is the statement that physical observables in quantum gravity are defined
on the boundary of spacetime, and the right effective field theory is the one that
best produces such boundary observables. For example, in Minkowski spacetime these
boundary observables are scattering amplitudes, and in AdS space, they are boundary
correlators.
Now, one can apply the holographic principle to expanding universes with polynomial
expansion (a(t) ∼ tp ). These backgrounds are ubiquitous in string theory given that
exponential potentials lead to polynomial expansions. In [159], it was shown that an
effective field theory in such backgrounds could produce non-trivial boundary observables
if and only if p ≤ 1, which is equivalent to TCC48 . In other words, the holographic
principle can be satisfied if and only if TCC is satisfied.
The holographic principle also has non-trivial consequences for the mass of the weakly
coupled particles. If the mass of such a particle does not satisfy m . t1−2p , its
correlation functions will freeze out to a delta function at future infinity [159]. Therefore,
47
Note that, in T-dual descriptions, a local wavepacket in the compact manifold in one picture maps
to a winding state in the other picture. Therefore, there is no direct mapping between the points in two
spacetimes.
48
A similar argument is presented in [193] which makes an extra assumption about the physical observables
and draws stronger conclusions. The argument in [193] assumes that the physical observables are accessible
to a bulk observer. For example, an eternal de Sitter space can have dS/CFT boundary observables [194], but
those observables will not be fully measurable by any bulk observer. Therefore, eternal de Sitter, although
not ruled out by the argument we reviewed, does not meet the extra criterion assumed in [193].
221
such a field does not yield any non-trivial boundary data and violates the holographic
principle. The condition m . t1−2p can be expressed as
m . e−α(λ)φ ;
α(λ) =
4
λ
− ,
(d − 2)λ 2
(II.6.36)
p
where V ∼ exp(−λφ) is driving the expansion. For λ < 8/(d − 2), α(λ) is positive
and the above result implies the distance conjecture. Moreover, the strongest bound is
√
realized for λ = 2/ d − 2 which is
αmax = √
1
.
d−2
(II.6.37)
The above coefficient matches with the heuristic bound (II.6.35) which is also the
coefficient of the sharpened distance conjecture [153].
TCC provides a partial explanation for the de Sitter and distance conjectures by phrasing
them in terms of a physical process. TCC is well-supported in the asymptotic region of the
field space based on 1) non-trivial consistency in string theory, 2) connection to holography
[159], and 3) consistency with Weak Gravity Conjecture among other Swampland conjectures
[190]. However, there is less evidence for de Sitter conjecture in the interior of the moduli
space which makes them less rigorous. Among the Swampland conjectures, the ones that
are less rigorous are typically more phenomenologically powerful. But even though some
conjectures are less supported than others, all the conjectures seem to fit together nicely.
The main challenge with making de Sitter conjectures more rigorous is that there is no
supersymmetry in de Sitter and we lose analytic control. Given that the potentials that lead
to non-zero cosmological constants usually diverge (+∞) at some limit and vanish at some
other limit. Based on the minimum number of inflection points required, we can see that to
get a de Sitter vacuum, we need an interplay of at least three terms, unlike the AdS vacua,
which only need two terms49 .
49
A refinement of TCC in terms of the potential leads to similar non-trivial results as TCC for negative
potentials with AdS minima [195].
222
7
Swampland VI: Finiteness and string lamppost
The question that we study in this section is whether there could be infinitely many theories
of quantum gravities? From the EFT perspective this question sounds unnatural because
there is always a large number of inequivalent field theories. However, we will see that in
string theory the number of possibilities seems to be finite which motivates this question in
quantum gravity.
7.1
String lamppost and finiteness principles
Before proceeding further, let us distinguish two related, but separated question.
• Finiteness: is the number of possible low energy EFTs in quantum gravity finite?
• String Lamppost Principle: Are all consistent quantm gravity theories low-energy
limit of some string theory compacfitication?
These are both very important questions. We draw a lot of our intuition about quantum
gravity from string theory. However, if there are other theories of quantum gravities, we might
be getting the wrong kind of conclusions. This objection is often called the string lamppost
effect. It refers to the possibility that string theory is like a lamppost than only lights up
a small area, and searching within this bright spot might make us miss out on important
physics outside of the lamppost’s domain. However, if there is indeed only one theory of
quantum gravity, then the we do not have to worry about this objection. The postulate that
string theory is the only theory of quantum gravity is called the String Lamppost Principle
(SLP).
The hypothesis that the answer to the first question is yes is called the Finiteness Principle
and the hypothesis that quantum gravity is unique is called String Lamppost Principle.
In the following we will try to gain some intuition about these questions and see how
well-motivated finiteness principle and string lamppost principle are.
let us start our search of theories with the highly supersymmetric theories in Minkowski
space where we have more tools to limit the theory space. Let us start with theories with 32
supercharges which is the maximum number of supersymmetry50 .
NSU SY = 32:
The highest dimension for theories with 32 supercharges is d = 11.
50
In theories with more supersymmetry, any massless multiplet would contain a massless particle with
helicity h > 2 which violates the Weinberg-Witten theorem [71].
223
For d = 11, supersymmetry completely determines the low-energy effective action. Although
this proves the uniqueness of the low-energy theory, one could speculate that there could be
infinitely many theories that are different at high energies but identical at low energies. This
is a valid point and we cannot be sure about the uniquness of M theory, however, the fact
that the webs of string dualities which usually capture non-perturbative aspects of the theory
are pointing to the existence of a unique theory provides a strong reason to believe M theory
is unique.
Note that the question of uniqueness of quantum gravity (String Lamppost Principle)
cares about the UV physics, however, the question of finiteness is purely about the lowenergy physics. Thus, as far as finiteness is concerned, we call two theories the same if
they agree on the low-energy physics. In that sense, the 11d supergravity with N = 32 is
unique and the same is true in all the smaller dimensions because the supersymmetry fixes
the content of the theory completely.
In 10d, we can make two choices for the chirality of the supercharges. One corresponds
to type IIA and the other to type IIB. The supersymmetry argument shows the finiteness
for all theories with 32 supercharges, but how about the SLP? It turns out we get all these
supergravities in string theory. They are just toroidal compactifications of M-theory. In
fact, not only we see all the theories with 32 supercharges, but we see all of them in the
same moduli space! For example, from the IIB/M-theory duality we know that even the
type IIB supergravity corresponds to a particular limit in the moduli space of M-theory
compactifications of M theory on T 2 where the area of the T 2 goes to 0.
In string theory, theories of different dimensions can share the same moduli space. A
d + 1 dimensional theory can be thought of as a decompactification limit of a d dimensional
theory at which the SO(1, d − 1) Lorentz symmetry enhances to SO(1, d) symmetry.
So we saw that for N = 32, quantum gravity is finite and SLP is true. Now let us move
on to a more non-trivial case.
NSU SY = 16:
Theories with 16 supercharges in a Minkowski background only exist in dimensions d ≤ 10.
Let us start with d = 10. The supercharges must form a Weyl representation of the Lorentz
group. Thus, the supercharges and hence the spectrum of the theory is chiral. However,
chiral theories typically have gauge, gravity, and mixed local anomalies and cancellation of
all of these anomalies puts a strong constraint on the theories.
There are only two types of multiplets in 10d supergravity; vector multiplet and the
gravity multiplet. Therefore, the spectrum is uniquely determined by the gauge group. The
anomaly cancellation puts a very strong bound on the structure of the gauge group. There
are only 4 possibilities: {E8 × E8 , SO(32), U (1)248 × E8 , U (1)496 }. Only the first two are
224
realized in string theory, so the SLP is making a prediction that the last two are inconsistent.
We will come back to this prediction later.
Let us go down another dimension to d = 9. The two gauge theories that we know in 10d
both have rank 16. If we compactify any of them on a circle, the matter (non-gravitational
multiplets) of the 9d theory will have a gauge group of rank 17 (the extra rank comes from the
U (1) symmetry of the internal geometry). There are other ways to get a 9d supersymemtric
theory as well. For example, we can compactify the E8 × E8 Heterotic theory on S 1 /Z2 ,
where the Z2 switches the two E8 s and acts on the circle by x ∼ x + 12 . Note that the Z2 does
not have a fixed point, therefore, this is not a compactification on an interval. Otherwise, we
could not preserve the supersymmetry because the Heterotic theory is chiral. This theory will
have rank 9. We can also compactify M-theory on S 1 × S 1 moded out by Z2 which acts by a
minus sign on the first S 1 and by a half circle shift on the second S 1 . The internal manifold
in this case is a Klein bottle and the resulting theory is a 9d N=16 theory with rank 1
[196]. We can compactify these 9d theories on circle to find 8d supersymmetric theories with
ranks r ∈ {2, 10, 18}. All of the 8d constructions have a natural F-theory embedding where
F-theory is compactified on an elliptic K3. But the possible values for the ranks remains the
same. So we observe that:
• Supersymmetric 9d theories only have ranks 1,9, and 17.
• Supersymmetric 8d theories only have ranks 2,10, and 18.
But what about all of the missing ranks? Is the string lamppost missing them?
In 8 and 9 dimensions there is a nice argument based on global anomalies that fixes
the rank of gauge group modulo 2. Note that here by global anomaly, we do not mean
an anomaly of a global symmetry, but anomaly of a gauge transformation that cannot be
continuously deformed to the identity transformation. It turns out in dimensions 8 and 9,
there are diffeomorphisms that do not change the boundary, however, they could affect the
sign of the fermion measure. Let us see how this might happen. Imagine having a Dirac
spinor. If we integrate out a fermionic mode we find
ˆ
ˆ
ˆ
iS
/
Z = DφDA . . . DψDψ̄e = DφDA . . . eiSψ det[iD],
(II.7.1)
/ is the regularized product of all the
where Sψ does not depend on ψ anymore. The det[iD]
/ on the background. If we replace the Dirac spinor with a Mjorana spinor,
eigenvalues of iD
the numnber of degrees of freedom will be cut in half. Thus we find
ˆ
q
iSψ
/
Z = DφDA . . . e
det[iD].
(II.7.2)
225
/ picks up a phase under a diffeomorphism which is
Suppose the fermionic measure det[iD]
not continually connected to identity. Take the two configurations that are mapped to each
other under this diffeomorphism. They must be both included in the path integral. This is
because although they cannot go to each other continuously in the gauge orbit, they can still
continuously deform to each other outside their gauge orbit. Therefore, there is no sharp way
to impose a selection rule that would only keep one representative
p out of every equivalency
/ factor picks up a
class. But this could potentially create a problem. Suppose the det[iD]
−1 under the symmetry transformation. In that case the two gauge equivalent configurations
that are both included will cancel each other out and the path integral will vanish! This is
called a global anomaly and for a theory to be consistent, the fermionic part of the path
integral measure must be invariant under any gauge symmetry transformation that is not
homotopic to the identity transformation.
In the case of d = 8 and d = 9, the undesired scenario that we described happens for
some large diffeomorphism that is homotopic to the identity. However, the phase change is
only non-zero for odd number of Majorana fermions. Thus, the total number of Majorana
fermions in these dimensions must be even. In d = 8, 9 with 16 supercharges, the matter
content is very constrained. The only multiplets are gravity multiplet and vector multiplet,
therefore, we must have exactly r vector multiplets where r is the rank of the gauge group.
This allows us to calculate number of the Majorana fermions in terms of r. Then we can use
the global anomaly cancelation condition and we find r ≡ 0 mod 2 in d = 8 and r ≡ 1
mod 2 in d = 9 [197].
This argument partially explains the limited set of ranks that we observe in the string
theory constructions. But what about all the other missing ranks?
Let us think about finiteness now. We observe that the list of gauge group ranks always
ends at some number. Is that because we are missing infinitely many good theories, or is it
that there is a fundamental reason why the list must be finite.
Let us see if we can find a rule of thumb for the maximum allowed rank of the gauge
group. From the 9d and 8d case it seems that a toroidal compactification is the most optimal
option to obtain the maximum rank. If that is true, we would expect the rank of the gauge
group of any gravitational theories in Minkowski space with 16 supercharges to be bounded
by r ≤ 26 − d where d is the dimension of spacetime.
This upper bound is satisfied in all known string theory constructions. But could there
be a more basic explanation for it? These are all important questions and we will come back
to answer them later.
How about 6d theories? There are two kinds of theories with 16 supercharges, N = (2, 0)
or (1, 1). The first case is chiral and could have gauge and gravitational anomalies. In fact
the anomalies are so constraining for the theory that they uniquely determine the low-energy
226
theory. The next question is, does it belong to the Landscape? The answer turns out to be
yes. This is type IIB on K3. You can also get it from M theory by putting M theory on
T 5 /Z2 where the Z2 flips all the circles [127]. However, in that case you would need to assign
half-unit of flux to each one of the fixed points of T 5 /Z2 and have 16 M5 branes in the T 5
to cancel the fluxes. The moduli of the theory in the M-theory picture is determined by the
position of the M5 branes.
Exercise 1: Using the dualities that we have discussed in the class, show that type IIB on
K3 and M-theory on T 5 /Z2 where the Z2 flips the signs of all the five circles are dual to each
other.
NSU SY = 8:
Now let us reduce the supersymemtry even further. Consider the minimally supersymmetric
theory in 6d which has 8 supercharges. We can construct these theories in string theory by
putting F-theory on elliptic CY threefolds. We can also construct these theories by putting
Heterotic on K3, or type IIB on K3 orientifold. However, using the F-theory/Heterotic duality
described before, it is easy to see that all of these constructions have F-theory embedding.
The finiteness priniple, if true, would imply that there must be finite number of elliptic CY
threefolds. In fact, this is a true but very non-trivial mathematical statement. So finiteness
leads us on the right track in this case.
How about 5d theories? Eight supercharges is the minimal number of supercharges in 5d.
Thus such a theory would be an N = 1 theory in 5d. To construct 5d N = 1 theories, we can
put M theory on a Calabi–Yau threefold. Similar to the 6d case, the finiteness principle would
tell us that the number of Calabi–Yau threefolds must be finite. This is a very non-trivial
and famous conjecture by Yau. As of now, there is no known infinite family of Calabi–Yau
threefolds.
If we look at the moduli space of the CY’s, there could be singularities with some
loci (e.g. conifold). The Higgs mechanism allows us to go from one CY moduli space
to another by traversing finite distance in the field space and crossing the loci of singular
manifolds. Therefore, one can think of the moduli space of the CY compactification as a
collection of many individual moduli spaces that are glued together on lower dimensional loci
corresponding to singular CYs.
The uniqueness of quantum gravity (i.e. string lamppost principle) would imply that
the moduli space is unique and connected and thus all of the Calabi–Yau manifolds are
connected via geometric transitions through singular manifolds. This is another non-trivial
math conjecture known as Reid’s fantasy.
227
Now let us think about the low energy EFTs. Usually the field theories around the
singular loci have a limited field range. Therefore, if we impose a cutoff, the big moduli of
string theory gets chopped off into many smaller pieces, each of which is well-approximated
by a low-energy effective field theory.
Note that these EFTs from the low-energy perspective look completely different and
disconnected. However, a single theory of quantum gravity seem to be requiring many such
EFTs to be patched together.
So far we only talked about the Minkowski space, now let us discuss the finitness principle
and the string lamppost principle in AdS.
7.2
Finiteness principle in AdS
There is an obvious potential counterexample to the naive version of the finiteness principle
in AdS. There are infinitely many CFTs and infinitely many of them are expected to have
holographic dual, so we get infinitely many quantum gravities. For example different N = 4
SYM theories with different SU (N ) gauge groups all are holographic dual to AdS5 × S 5
quantum gravities. However, the cosmological constant of these theories is different since
lAdS ∼ N 2 . Another example is AdS7 × S 4 which is holographic dual of N parallel M 5
branes. But one can show that in that case too the cosmological constants are different for
different values of N .
Exercise 2: In the M-theory construction of AdS7 × S 4 , estimate lAdS in terms of N (the
number of M 5 branes).
Consider AdS7 × S 4 /Zn where the Zn acts on the sphere by rotation. The Zn action has
two fixed points at the poles. So we have two An−1 singularities on the sphere. Putting
M-theory on S 4 /Zn gives us an SU (n) × SU (n) gauge theory where each SU (n) corresponds
to one of the An−1 singularities. Since the curvature of the sphere is the same for all these
theories, they have the same AdS scale. Therefore, we have an infintie familiy of solutions
that have different matter content but same cosmological constant. One might think this is
surely a counter example to the finiteness principle in AdS!
228
An−1
Zn
An−1
Figure II.7.1: If we mod out the S 4 by Zn , we get two An singularities at the poles. Each of
these singularities will contribute an SU (n) gauge theory.
Let us take a moment to study the holographic dual of the above theory. Consider Mtheory on C2 /Zn . The locus of the singularity is seven dimensional. Suppose we want to
probe the singularity with a stack of N parallel M5 branes. The M5 brane’s wolrdvolume is
6 dimensional, therefore, it has one normal dimension in the locus of the singuliary. We can
show the normal direction to the M5 brane in the singularity as a line and the stack of M5
branes on the singulairy as a point on that line.
Since the locus is the fixed point of the Zn action, the theory on the singular locus has
a global Zn symmetry. After placing the stack of N M5 branes, we will have two SU (n)
actions, one one the left half-line and the other on the right half-line. These actions turn out
to induce an SU (n) × SU (n) global symmetry on the worldvolume theory of the stack of N
M5 branes. This SCFT is the holographic dual of M-theory on AdS7 × S 4 /Zn .
SU(n) symmetry
SU(n) symmetry
stack of N M5 branes
Singular locus
SU (n) × SU (n)
Figure II.7.2: We consider a stack of N M5 branes that probe an An n − 1 singularity which
is 7 dimensional. The line represents the normal direction to the M5 branes in the locus of
singularity.
One resolution to the apparent violation of the finiteness principle is that all of these AdS
constructions, there is an extra space which is as big as the AdS itself. Since the scales of
229
the AdS is always correlated with the extra space, we should always view them together and
cannot think of it as a lower dimensional theory. In that sense, the finiteness principle, should
be understood as the finiteness of the higher dimensional low-energy EFT which is correct
in all these examples. From this perspective, infinite families of lower dimensional theories
correspond to infinite families of defects (singularities and branes) in the same theory, which
is fine.
Note that this resolution only applies if the scale of AdS and some length scale of the
internal geometry are correlated. If one could take the internal space to be arbitrarily small
while the AdS scale is kept fixed, the resulting theory would truly be lower dimensional. Such
an AdS is called a scale-separated AdS which is in tension with the AdS distance conjecture.
There is another resolution which leads to a sharper formulation of finiteness principle
in AdS. Up to now we did not consider the cut-off of the low-energy EFT. We can think of
the finiteness principle as the following statement. For a fixed EFT cut-off Λcut-off , there are
finitely many AdS with cut-off Λcut-off .
We can use the AdS distance conjecture and holography to state the finiteness principle
in the CFT language. The AdS distance conjecture tells us that there is a tower of state with
masses m ∼ Λα in Planck units. For the EFT description to work, we need Λcut-off . m ∼ Λα .
On the other hand, we can express the cosmological constant in terms of the central charge
2
of the dual CFT as Λ ∼ c− d−2 . Combining these equations leads to
− d−2
2α
c < Λcut-off
.
(II.7.3)
The EFT cut-off also bounds the CFT central charge. This clarifies how imposing a cutoff can make the number of theories finite. Even though the inequality (II.7.3) is only for
CFTs with gravitational holographic dual, it is plausible that it is correct for all CFTs. If
so, the finiteness principle is making a prediction, that the number of CFTs with a central
charge lower than a cut-off is finite.
In retrospect, the introduction of cut-off was necessary. To see why, consider a large
number of non-supersymmetric compactifications such that their internal geometries are
different by small perturbations. For small-wavelength perturbations to the internal geometry,
the lower-dimensional EFTs will only change in the UV. Therefore, imposing a cut-off would
make the number of low-energy theories finite.
We provided strong evidence for finiteness principle in Minkowski and AdS spaces. In
the following subsection, we go back to the SLP. In the first subsection, there were a few
examples of supergravity theories that were non-anomalous but did not have any string theory
realization. In the following, we show why Swampland conditions rule out those theories.
230
7.3
String lamppost principle from brane probes
Let us start with the 10d supergravities. Anomaly allows a theory with U (1)496 gauge
symmetry. What could be the problem with it? We show that we can a lot of constraints
from the consitency of the branes in quantum gravity. But first, how do we know we have
branes?
The branes are usually required by the Swampland completeness princinple. We saw
many examples of this requirement in the section on Cobordism conjecture and also the
completeness hypothesis. However, it is not so easy to just add a brane. When we add a
brane, we want the QFTs on the worldvolume of the brane be unitary and consistent. It
turns out that the unitarity and other consistency conditions on the brane rule out many
possibilities. In some sense, the brane probes bootstrap the Swampland program.
In this section we use a stronger version of the completeness principle for supersymmetric
theories. We assume that if a brane is required by completeness, and it can be BPS, then
such a BPS brane has to exist. This is the generalization of completeness of spectrum of
gauge theories to gauged supersymmetry (supergravity). There is no known counterexample
to this statement. For example, in Narain compactification of the Heterotic string theory, in
every direction BPS of charge lattice, there is always a BPS particle.
In dimensions higher than 6, the supergravity string (that coupled to Bµν in the gravity
multiplet can be BPS. We will consider the theory living on the worldsheet of this BPS 1brane. Let us review a few key properties of 2d CFT. Consider a 2d CFT with left and right
central charges and some current algebras corresponding to global symmetry G. The current
algebra gives c = k dim(G)/(k + ĥG ). The current algebra could be either on the left or right
moving part. Suppose it is on the left, we get cG ≤ cL . This is because any extra piece that
satisfies unitarity has positive contribution to c. So, if we have an argument that the gauge
group appears as a current algebra on the worldsheet theory and that there is a bound for
the Virasoro central charge, we find a bound on the rank of the gauge group.
Also, if we have a representation, the dimension of the representation is c2 (R)/(k + ĥG ).
So, if we have a representation that is Higgsed, it should appear as a relevant operator on
the worldsheet theory with dimension less than 1 to be a relevant deformation.
Let us also review anomalies in 2d theories. We consider two classes of anomalies,
gravitational anomalies and global symmetry anomalies. Let us start with the anomaly
of global symmetry. The global symmetry we consider is the gauge symmetry of bulk which
is realized as a global symmetry on the brane. The anomaly diagram is a two point function.
Therefore, the change in the effective action of the brane under a symmetry transformation
231
with gauge parameter ǫ is
S2d → S2d − Kgauge
ˆ
tr[ǫF ],
(II.7.4)
where Kgauge is some number and F is the gauge field tensor on the worldsheet. This anomaly
is due to the non-invariance of the path integral measure of the 2d theory. On the other hand,
we know that the 2d theory is coupled to Bµν . Thus the worldsheet action has an external
´
coupling B. For the worldsheet theory to be non-anomalous, we need the change of B
under the gauge transformation be Kgauge tr[ǫF]. If we take the exterior derivative of both
expression we find
δH = Kgauge δtr[A ∧ F ],
(II.7.5)
where δ denotes the change under a gauge transformation in the bulk. Taking another exterior
derivative from both sides gives
δdH = Kgauge δ(tr[F ∧ F ]).
(II.7.6)
Similarly, for the gravitational anomaly, we find
δ(dH) = Kgravity δ(tr[R ∧ R]).
(II.7.7)
In fact the equations of motion in the bulk tell us that
1
dH = (tr R ∧ R − tr F ∧ F ).
2
(II.7.8)
The right hand side are contributions of the topological terms in the supergravity action to
the equations of motion. The presence of these terms are required by anomaly cancellation.
By mathcing the coefficients of R ∧ R and F ∧ F in the equations, we find
1
Kgauge = − ,
2
1
Kgravity = .
2
(II.7.9)
On the other hand, Kgauge and Kgravity can be calculated in terms of the worldsheet theory.
Their values are
Kgauge =
kR − kL
,
2
Kgravity =
cL − cR
.
24
(II.7.10)
So, we find
kL − kR = 1,
cL − cR = 12.
232
(II.7.11)
The study of brane probes via cancellation of the anomalies of the bulk symmetries is
called the anomaly inflow which is a very powerful method.
Exercise 3: Consider minimal supergravity in 10d. Show that if the supergravity string
(i.e. the string that couples to two-form Bµν in the gravity multiplet) is supersymmetric, it
will have (0, 8) supersymmetry.
Exercise 4: Suppose you have a p-brane where p is odd. Suppose the p-brane is charged
under a p-form gauge potential Ap+1 and dFp+2 is not identically zero (is some function of
curvature, etc.). In that case Fp+2 − dAp+1 could be a non-zero topological term. Moreover,
this term might not be gauge invariant in the sense that when integrated on a p + 2
dimensional surface M with a p + 1 dimensional boundary Σ, the result changes under
gauge transformation as
ˆ
ˆ
Fp+2 − dAp+1 =
ǫXp+1
(II.7.12)
δǫ
M
Σ
Show that Xp+1 must match the anomaly of worldvolume theory on the brane under the
action of the spacetime gauge group which realizes as a global symmetry on the brane.
Now let us go back to the 10d N = (1, 0) with a gauge group G. The anomaly inflow tells
us kl − kr = 1 and cL − cR = 12. Note that we have (0, 8) supersymmetry which has an SO(8)
R-symmetry on the righmoving side. Moreover, the level of the R-symmetry is proportional
to the central charge.
cR = 12κ,
(II.7.13)
where κ is the level of the R-symmetry. For the supergravity string, the R-symmetry has a
very physical meaning. It corresponds to the SO(8) rotation in the transverse direction to
the string. In fact, we can find κ by looking at the induced action from the R ∧ R term.
The coefficient of this term turns out to be the level κ. Thus, we find κ = 1 and cR = 12.
Note that cR = 12 is exactly the contribution of the 8 bosonic transverse modes plus their
fermionic counterparts on the worldsheet. Plugging this into (II.7.11) gives
cL = 24 & cR = 12.
(II.7.14)
Suppose we subtract the contribution of the transverse oscilations from the central charges
to define ĉL = cL − 8 and ĉR = cR − 12. The difference is due to the fact that there is no
supersymmetry on the leftmoving sector. The reduced central charges are
ĉL = 16 & ĉR = 0.
233
(II.7.15)
From cR = 0 we find that the theory on the rightmoving sector is trivial. Therefore, kR = 0.
Plugging this into (II.7.11) leads to
k̂L = 1 & k̂R = 0.
(II.7.16)
Therefore, the worldsheet theory is completely leftmoving. Moreover, from c = k dim(G)/(k+
ĥG ) we know rank(G) ≤ c which implies rank(G) ≤ 16. Therefore, the G = U (1)496 theory
or the G = E8 × U (1)248 are inconsistent and belong to the Swampland.
We can try to do the same analysis in lower dimensions. The supergravity equations of
motion tell us dH = κ2 tr R∧R− 21 tr F ∧F . Similar arguments as before tells us that cR = 12κ
and cL = 24κ. The theory on supergravity string is still (0, 8) and we have at least a U (1)
R-symmetry which corresponds to rotation in the transverse direction. The highest R-charge
S that appears on the string is related to the level by S = 2κ. Since the R-symmetry is the
spacetime rotation, the R-charge is the spin. Now we argue that κ ≤ 1. Let us compactify
the theory on a circle. We can wind the string around the circle to get a BPS particle. In
the limit where the circle shrinks to zero size, the mass of the string excitations goes to zero.
Suppose this limit exists (postulated by a strong version of distance conjecture), the limiting
theory has a massless particle with spin 2κ. According to Weinberg-Witten theorem, the
spin of this massless particles must be less than or equal to 2. Thus, κ ≤ 1.
There are two possibilities, κ = 0 or κ = 1. If κ = 1, we have cL = 24 and cR = 12. In d
spacetime dimensions, we have d − 2 transverse dimensions. Therefore, after subtracting the
contribution from the center of mass, we find cL = 26 − d. Since the central charge is less
than rank(G), we find rank ≤ 26−d. The highest rank is realized by Narain compactification
of Heterotic theory.
If κ = 0, we have a contradiction, because the central charges are 0. The resolution
is that all of our calculations was based on the assumption that the supergravity string
has (0, 8) supersymmetry. However, it could be that the worldsheet theory has enhanced
supersymmetry in the infrared. For κ = 0, the IR theory on the string must be a (8, 8)
theory. In this case, we have two R-symmetries. Using the spin argument above, we can say
the level of the R-symmetry on each side κR,L is at most 1 and the central charge of each side
is cL,R = 12κL,R . Due to the center of mass modes on each side, the central charge cannot be
0. So we find κL,R = 1 and cL,R = 12. If we subtract the contribution of the center of mass
modes, we find ĉL,R = 10 − d which must be greater than the rank of the gauge group. So
we find
κ = 0 : rank(G) ≤ 10 − 1
κ = 1 : rank(G) ≤ 26 − d.
234
(II.7.17)
Both of these bounds can be saturated. For example, 9d string theories with rank = 1
have κ = 0 (M-theory on Klein bottle).
Note that, a stack of N D3 branes with arbitrarily high N is not a counter example to
this result. Because, in that case, the gravity is not confined to the brane. The inequality
rank ≤ 26 − d applies to theories where the gravity is d dimensional.
The anomaly inflow argument gave us an upper bound on the rank, but as we discussed
before, the list of the available ranks in string theory is much more restricted. In 9d, the
available ranks are {1, 9, 17} and in 8d the available ranks are {2, 10, 18}. How can we explain
the absence of the other ranks?
An explanation for this list was given in [129] based on the cobordism conjecture. The
argument has many details but we summarize the main idea here. In [129], it was argued that
supergravities with d > 6 must have parity symmetries (sometimes more than 1). For this
parity symmetry to be broken, we must be able to compactify the theory on non-orientable
manifolds with appropriate Pin structure. We focus on compactifications on non-orientable 2d
manifolds. There are 8 different cobordism classes for these 2d manifolds that are generated
by RP2 . Cobordism conjecture tells us that each cobordism class must be trivializable. This
means, we must be able to have an end of the universe wall for the RP2 compactification.
Such a wall, would be a 7d defect. Moreover, since, we can detect the presence of this defect
from large distances (due to the RP2 boundary in the transverse directions) it must carry
a Z8 gauge charge. Therefore, if we put 8 of these defects together to cancel the gauge
charge, we should be able to find a compact singular 3-manifold with 8 defects which is
an allowed internal geometry. This is nothing other than T 3 /Z2 . Each one of the 8 fixed
points correspond to one of the defects. Therefore, the Cobordism conjecture implies that
supergravity theories in dimensions greater than 6 must have a consistent compactification on
T 3 /Z2 . The condition that the resulting lower dimensional theory be anomaly free imposes
a strong constraint on the matter content.
d = 9 : rank ≡ 1
mod 8
d = 7 : rank ≡ 1
mod 2.
d = 8 : rank ≡ 2
mod 8
(II.7.18)
This is a remarkable consistency check for SLP. However, the EFTs that we get in string
theory have more structure to them than the rank of the gauge group. For example, consider
the 8d theories. All of the known 8d theories in string theory have an F-theory construction.
That means every 8d theory comes with an elliptic K3. But where is the K3 in the low-energy
theory?
The theory in 8d has a gauge group which has 3+1 dimensional instantons. We can study
235
the classical moduli space of the gauge instantons. There are two branches in the moduli
space, Higgs branch and the Coulomb branch. The Coulomb branch corresponds to zero size
instantons (also called small instantons). The theory living on the small instantons is an
N = 2 4d theory. We assume that the rank of the gauge theory which is the dimension of
the couomb branch is 1. Thus, there is a U (1) living on the brane with a coupling τ .
From supersymmetry, we know that the geometry is hyperKähler and has one complex
dimension. Moreover, the geometry of the coulomb branch must be compact. Suppose it is
not compact. If we compactify the theory on T 3 , the eigenvalues of the Laplacian on the
moduli space correspond to the spectrum of particles in lower dimension. If the moduli space
is non-compact, the spectrum of the laplacian will be continious which is in contradiction with
the finiteness of black hole entropy. Therefore, the moduli space is a compact hyperKähler
and τ gives you an elliptic fibration of K3 over the Coulomb branch. We have thus found
the internal geometry of string theory in the EFT using Swampland conditions.
7.4
Finiteness of light species
The black hole entropy formula suggsests there is only a finite number of states at any given
energy. If we have infinite or arbitrarily large number of species, the entropy formula would
look problematic. So the fact that we do not have arbitrarily large number of species also
seem to be related to black holes. However, there is a loophole in this argument as we now
discuss.
Suppose the highest energy scale where the EFT is valid is Λ. The smallest black hole we
can describe using EFT must have a curvature R . Λ2 where R here represents the order of
magnitude of the Riemann curvature tensor. Suppose the entropy of this black hole is S, we
find
Λ<
1
(II.7.19)
1
S d−2
If we have Nspecies species of particles with masses below Λ that are sufficiently gapped (like
a KK tower), the number of high energy states is grows exponentially with & N . Therefore,
we find S ≥ N and
Λ<
1
1
.
(II.7.20)
d−2
Nspecies
This therefore suggests that the black hole entropy is not a good argument for bounding
Nspecies .
The number of species below energy E is defined such that the number of high-energy
236
states grows like & exp(cNspecies (E)). If the tower of state is very dense (like a the tower of
string excitations) the number of species is much smaller than the number of particles. For
string tower, it is known that the number of states with energy E grows like exp(cE/Ms ).
Therefore, Nspecies ∝ E/Ms . This is while the number of one-particle string excitations is
exponential. What happens for string tower is that the numebr of string excitations grows
so rapidly that the majority of the states at any given energy scale ≫ Ms are dominated by
one-particle states. This is different from the KK tower where the number of KK particles
grows polynomially with energy. However, in KK tower too we have Nspecies ∝ E/MKK . The
inequality (II.7.20) is called the species bound. It suggests that we can have arbitrarily large
number of species as long as the cut-off is small enough. We can use the inequality in the
opposite direction, which is to bound Λ from the number of species. The highest EFT cutoff
Λ is often called the species scale.
If we compactify a D dimensional manifold down to d dimensions, if the manifold is big we
d−2
D−2
vol(M ) = MP,d
get a KK tower of light species. MP,D
. The higher dimensional MP is below
the lower dimensional MP . So, there is a scale lower than the MP which prevents us from
going all the way to the higher dimensional MP . This could serve as the species scale. The
d−2
D−2
= MP,d
number of KK light states is given by N ∼ (MP,D /MKK )D−d . Then we find N MP,D
which exactly saturates the species bound. Therefore, the species bound predicts the correct
cut-off for KK theories.
Another example is 10d string theory. We have MP8 = MS8 /gS2 . We expect the species
scale to be string scale. The species scale is the radius where particles become black holes
and that is exactly the string scale. Following species bound, we find N (E = Λ) = g12 . But
s
why should that be true in string theory.
The Hagedorn entropy tells us that S(E) ∼ E/Ms . At the correspondence point E ∼
Ms /gs2 so we get the S ∼ 1/gs2 , and since this is the scale where the black hole description
should take over, everything hangs together.
Therefore, the species inequality is saturated by KK reduction and weakly coupled string
theory. Suppose we try to push the species scale up. The species bound tells us that in the
asymptotic of the field space where the number of species is large, the species scale would be
small. Therefore, to push up the cut-off as high as possible, we should explore the interior of
the moduli space. The biggest gap we can hope for is Planck mass which is realized for 11d
supergravity. In M-theory, we have a desert in term of the mass spectrum of the particles.
Species bound tells us that we should aim for a very small number of species. Let us motivate
this observation independently.
Consider F theory on elliptic threefolds. We can use D3 brane and wrap it around two
dimensional Riemann surfaces to get low dimensional BPS particles. The tension of the
branes will go like the area which is some function of Kähler moduli. In Planck units we have
237
Cij ti tj ∼ 1 where Cij is the metric on the on the Kähler moduli space. Suppose we have N
Kähler classes and a diagonal Cij , we find N t2 ∼ 1 where t is the average volume of a 2-cycle
1
in Planck units. So the masses of the excitations of the BPS string go like m ∼ T −1 ∼ N − 4
where T is the string tension. This inequality saturates the species bound (II.7.20) in d = 6.
Exercise 5: Take M-theory on a Calabi–Yau threefold. The M2 branes can wrap around
Riemann surfaces and M5 branes can wrap around divisors. Give a heuristic argument for
1
why you might expect the masses of the corresponding BPS particles to go like m ∼ N − 3
where N is the number of Kähler classes.
Exercise 6: consider D3 branes around cycles of CY threefods in 4d. The complex structures
do not receive quantum corrections. Show that the correponding particles saturate the species
bound too.
In the argument we assumed that Cij is diagonal which is not necessarily true. You can
check explicitly for the known examples, that the mass of the BPS states vs the number of
species follows a sharp line in the log-log plot but with a different slope than that of the
species bound [198].
Log10 (Nmassless )
0.0
-0.5
Log10 (ΛBPS )
-1.0
-1.5
•• ••
• •
3.0
3.5
4.0
4.5
• • ••••
•
•
•
•
•
•
• • • •• • •• • • • •• • • • • • ••
••••••••• ••• •••••• •• ••• •• •• • • •
••
•• ••••••••••••••••••••••••••••••••••••• •••• •• •• • • • • •
• ••••••••••••••• ••• • • • •
••• •• •••••••••••••••••••••••••••
••
•• ••
•••••••••••••••••
•• ••••••••••••••••••••••
•••••••••••
••••••
••••••••••••••••••••• •••••
••••••••••••••••••
••••
•••••
••
•
••••••••••••••••••••••••••
••••••••••
•
•
• ••••••••••
••
•
•
•
••
•
•
•
•
•
•
•
•
•
•
•
•
•
••
•
•
•
•
•
•
•
•
•
•••
•
•
•
•
•
••
••
••
•
•
••
•
•
•
••
••
•
••
•
•
•
•
•
•
•
•
••
•
••
•
••
•••••
••
•••
•••••
••••
•••••••••••
••••••••••••
••••••••
••••••
••••••••
••••••
• •••••••••••
•••
•••••
•••••
•••
••••
•••
•••
••••
••••••••••••••••
••••
••
••••
•••••
••••
••••
••••••
••
••••
•••••
••
•••
••••
••••
••
•••
••••
•••
•••
••
••••
•••••
•••
•••••••
••
••
••
••••
•••••••••••••••••••••
••••
••••
••••
••••
••••••
•••••
••••
••••
•••
••••••••••
••••
•••
•••••••
••
•••••••••••••••••••••••
•••••••
••••••
••••••••
••
••
••••
•••
••••
••••
••
••••
••
••
•••••••••
•••••••••••••••••••
•••
••
••••
••••
••••••
••••
•••••
••••••
••••••
•••••••
••••
•••••
••••
••••••••••••••••••
•••
•••
••
••••
•••
••
•••••
••••
•••••••
••
•••
••
••
•••••••
•••••
•••
••••
•••••
••••
••••
•••••
••
••
•••
••
••••••
•••
•••••••••
••••
••••
•••
•••
•••
•••••••••
•••••
•••••••••••
••••
•••••
•••••••••••
••••••
•••
••••••
•••••••••••
••
••••
•••••••
•••••
••
•••
•••
••••••••
•••
•••••••
•••••
•••
•••
••
••••
•
••
••••••
••••
••
••
••••
•
••
••••
••
•
•••••
••
•
•••••••••••
••
•••
•
•
••
•
•
•••
•
•
••
••
••
•
•
•
••
••••••••
••••••••••••
••••
•
•••
••
••••
•
•
••
•
•••••
•••••••••••••
•••••••
•
••
•
•
••••••
••••••
•
•
•
••••
•
••••••••
••••
•
•
•
•
•
••••••••
••
••••
•
••••
•
••
••
•••••••
•••••
••••••
••
•••••••••••••••••••••••••••••••••
•••••••••••
••
•••
•••••
••
•
•••
•
•
••
•••
•
•
••••
•••
••
•
••••
•••••••••••
••
••••••••••••••••
••••••••••••••
•••
•••••
••••••••
••••
•
•
•
••
••••
•••••
••
••
••••
•
•
•••
•••
•••••••
••••
•••
•••••
••••••••••••••
•
••••••••
•••••
•
••
••
••••••••••••••••••••••••••
••••
••••
••••
••
••••
••
•••••
•••
•
•
•
•
•
•
•
•
•
•
•••••
•
•
••
•
•••••
••••••
•••
••
•
•
•
••
••
•
•
••
••
••
••
••••••••
•
•
•
•• •
•••••
•
•
•
••
•
•
•
••••
•
•
••••
••••
•
•
•••
•
•
•
•••
••
•
••
•
•
•
•
••••
••••
•
•
••••
••
•••••••••••••••
••••
••
•••••••••••
••••••
•••••
•••••••••••••••••••••••••••••••••••••••••
••
••
•••
••
••
•
••
••
••••••••••
•••
•••
••••
••••••
•
•
••
••
••••••••••
••••••••••••••• •
•
•
•
•
••••
••
••••
•••••••
•
•
•
••••
•
•
•
•
••
••
••••••
•
•
••
••
••••••••••
•••
•••••
•••
•••
••••
••
••
••••••
••••••
••••••••••••••
••••
••••••••••••••••••••
••
•• ••••••••••••••••••••••••••••••••
••
••••••
•
•
•••••
•••••
•
•••
•••••
••••
•
•
•••••••
•
••••
•
•••••••••••••••••
•
•
•••••
•••••
••••••••••
••••••••
•••••
••
•
••••••
••••••••••••••
••••••
••
•••••••••••••••••••••
•
•
•
•••••••••••••
••
••
•
•
•
•
•
•
•
•
•
••
•
•
•••••
•
••••••••••
•
•••••••••
••••
•••••
••••••••
••••••••••••••••••••••••••••••••••••••••••••••••••••
••••••••••••
••••••••••••
•••
••••••
••••••
•••
•••••••••••••••••
•••••••••••••••••
••••••••••••••••••••••••••••••
•
••••••
•••
•
•
•••••••••••••••••••••
••••••••••••••••
••••
••••••••••
••
••••••••••••••••••••••••••••••••••••••••••••••••••••
•••••••
•
••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••
• •••••••••••••••••• •••••••
• • ••••••••
••••••••••
• •••
••
5.0
5.5
-2.0
Figure II.7.3: Plot taken from [198] shows the number of light species vs the mass scale of the
BPS states for a large family of Calabi–Yau compactifications. The red region is excluded
by the species bound.
If the curve of the BPS states continues passed its intersection with the species curve,
we get a contradiction with the species bound. Therefore, the species bound, together
with observed CY examples, seems to suggest that the curve must stop and the number
of possibilities are finite. Thus, the finiteness of vacua and massless species may indeed be
related to the finiteness of the black hole entropy.
238
It might be tempting to postulate that the finiteness of the quantum gravity path integral
is related to the finiteness principle. We claim that is in fact correct. Consider a cutoff Λ.
Compactify your theory all the way to 1 dimension of time. Given that the number of noncompact dimensions is small, the moduli no longer freeze and there is no superselection.
Therefore, we have to integrate over everything. If the number of Calabi–Yau spaces is
infinite, the zero modes would likely give rise to a divergent path integral. Therefore, the
finiteness of theories of quantum gravity is motivated by the finiteness of the quantum gravity
path-integral.
Acknowledgement
The research of A. B. and C. V. is supported by a grant from the Simons Foundation (602883,
CV) and by the NSF grant PHY-2013858. M. J. K. is supported by a Sherman Fairchild
Postdoctoral Fellowship and the U.S. Department of Energy, Office of Science, Office of High
Energy Physics, under Award Number DE-SC0011632.
References
[1] J. Polchinski. String theory. Vol. 1: An introduction to the bosonic string. Cambridge
Monographs on Mathematical Physics. Cambridge University Press, 2007.
[2] J. Polchinski. String theory. Vol. 2: Superstring theory and beyond. Cambridge
Monographs on Mathematical Physics. Cambridge University Press, 2007.
[3] E. Kiritsis. String theory in a nutshell. Princeton University Press, USA, 2019.
[4] K. Becker, M. Becker, and J. H. Schwarz. String theory and M-theory: A modern
introduction. Cambridge University Press, 2006.
[5] L. Brink, P. Di Vecchia, and Paul S. Howe. A Locally Supersymmetric and
Reparametrization Invariant Action for the Spinning String. Phys. Lett. B, 65:471–
474, 1976.
[6] Stanley Deser and B. Zumino. Consistent Supergravity. Phys. Lett. B, 62:335, 1976.
[7] Alexander M. Polyakov. Quantum geometry of bosonic strings. Phys. Lett., 103B:207–
210, 1981. 598(1981).
[8] L. D. Faddeev and V. N. Popov. Feynman Diagrams for the Yang-Mills Field. Phys.
Lett. B, 25:29–30, 1967.
239
[9] Daniel Friedan, Emil J. Martinec, and Stephen H. Shenker. Conformal Invariance,
Supersymmetry and String Theory. Nucl. Phys. B, 271:93–165, 1986.
[10] P. Di Francesco, P. Mathieu, and D. Senechal. Conformal Field Theory. Graduate
Texts in Contemporary Physics. Springer-Verlag, New York, 1997.
[11] Josep M. Pons. Noether symmetries, energy-momentum tensors and conformal
invariance in classical field theory. J. Math. Phys., 52:012904, 2011.
[12] M. A. Virasoro. Subsidiary conditions and ghosts in dual resonance models. Phys. Rev.
D, 1:2933–2936, 1970.
[13] C. Lovelace. Pomeron form-factors and dual Regge cuts. Phys. Lett. B, 34:500–506,
1971.
[14] C. Becchi, A. Rouet, and R. Stora. Renormalization of Gauge Theories. Annals Phys.,
98:287–321, 1976.
[15] I. V. Tyutin. Gauge Invariance in Field Theory and Statistical Physics in Operator
Formalism. 1975.
[16] Mitsuhiro Kato and Kaku Ogawa. Covariant Quantization of String Based on BRS
Invariance. Nucl. Phys. B, 212:443–460, 1983.
[17] P. Goddard, J. Goldstone, C. Rebbi, and Charles B. Thorn. Quantum dynamics of a
massless relativistic string. Nucl. Phys. B, 56:109–135, 1973.
[18] Curtis G. Callan, Jr., E. J. Martinec, M. J. Perry, and D. Friedan.
Background Fields. Nucl. Phys. B, 262:593–609, 1985.
Strings in
[19] Curtis G. Callan, Jr., Igor R. Klebanov, and M. J. Perry. String Theory Effective
Actions. Nucl. Phys. B, 278:78–90, 1986.
[20] D. Friedan. Nonlinear Models in Two Epsilon Dimensions. Phys. Rev. Lett., 45:1057,
1980.
[21] David Tong. String Theory. 1 2009.
[22] Joel Scherk and John H. Schwarz. Dual Models for Nonhadrons. Nucl. Phys. B,
81:118–144, 1974.
[23] Eric D’Hoker and D. H. Phong. Multiloop Amplitudes for the Bosonic Polyakov String.
Nucl. Phys. B, 269:205–234, 1986.
240
[24] M. A. Virasoro. Alternative constructions of crossing-symmetric amplitudes with regge
behavior. Phys. Rev., 177:2309–2311, 1969.
[25] Joel A. Shapiro. Electrostatic analog for the virasoro model. Phys. Lett. B, 33:361–362,
1970.
[26] I. B. Frenkel, H. Garland, and G. J. Zuckerman. Semi-infinite cohomology and string
theory. PNAS; Proceedings of the National Academy of Sciences, 83(22):8442–8446,
1986.
[27] Joseph Polchinski. Evaluation of the One Loop String Path Integral. Commun. Math.
Phys., 104:37, 1986.
[28] Michael B. Green, John H. Schwarz, and Lars Brink. N=4 Yang-Mills and N=8
Supergravity as Limits of String Theories. Nucl. Phys. B, 198:474–492, 1982.
[29] K. S. Narain, M. H. Sarmadi, and Edward Witten. A Note on Toroidal Compactification
of Heterotic String Theory. Nucl. Phys. B, 279:369–379, 1987.
[30] K. S. Narain. New Heterotic String Theories in Uncompactified Dimensions < 10.
Phys. Lett. B, 169:41–46, 1986.
[31] B. Sathiapalan. Duality in Statistical Mechanics and String Theory. Phys. Rev. Lett.,
58:1597, 1987.
[32] E. Witten. Nonabelian bosonization in two-dimensions. Commun. Math. Phys., 92:455–
472, 1984. 201(1983).
[33] J. Wess and B. Zumino. Consequences of anomalous ward identities. Phys. Lett.,
37B:95–97, 1971.
[34] E. Witten. Global aspects of current algebra. Nucl. Phys., B223:422–432, 1983.
[35] V. G. Knizhnik and A. B. Zamolodchikov. Current algebra and wess-zumino model in
two-dimensions. Nucl. Phys., B247:83–103, 1984. 690(1984).
[36] Krzysztof Gawedzki. Noncompact WZW conformal field theories. In NATO Advanced
Study Institute: New Symmetry Principles in Quantum Field Theory, pages 0247–274,
10 1991.
[37] L J. Dixon, J. A. Harvey, C. Vafa, and E. Witten. Strings on orbifolds. Nucl. Phys.,
B261:678–686, 1985. 678(1985).
[38] L. J. Dixon, J. A. Harvey, C. Vafa, and E. Witten. Strings on orbifolds. 2. Nucl. Phys.,
B274:285–314, 1986.
241
[39] Robbert Dijkgraaf, Cumrun Vafa, Erik P. Verlinde, and Herman L. Verlinde. The
operator algebra of orbifold models. Commun. Math. Phys., 123:485, 1989.
[40] Cumrun Vafa. Modular Invariance and Discrete Torsion on Orbifolds. Nucl. Phys. B,
273:592–606, 1986.
[41] K. S. Narain, M. H. Sarmadi, and C. Vafa. Asymmetric Orbifolds. Nucl. Phys. B,
288:551, 1987.
[42] Jeffrey A. Harvey, Gregory W. Moore, and C. Vafa.
COMPACTIFICATION. Nucl. Phys. B, 304:269–290, 1988.
QUASICRYSTALLINE
[43] K. S. Narain, M. H. Sarmadi, and C. Vafa. Asymmetric orbifolds: Path integral and
operator formulations. Nucl. Phys. B, 356:163–207, 1991.
[44] J. H. Conway and S. P. Norton. Monstrous moonshine. Bull. London Math. Soc.,
11(3):308–339, 1979.
[45] I. B. Frenkel, J. Lepowsky, and A. Meurman. A natural representation of the fischergriess monster with the modular function j as character. Proceedings of the National
Academy of Sciences of the United States of America, 81(10):3256–3260, 1984.
[46] A. N. Schellekens. Classification of ten-dimensional heterotic strings. Phys. Lett. B,
277:277–284, 1992.
[47] John Leech. Notes on sphere packings. Canadian Journal of Mathematics, 19:251–267,
1967.
[48] Richard E Borcherds. Monstrous moonshine and monstrous lie superalgebras. In Invent.
math. Citeseer, 1992.
[49] Harald Dorn and H. J. Otto. Two and three point functions in Liouville theory. Nucl.
Phys. B, 429:375–388, 1994.
[50] Alexander B. Zamolodchikov and Alexei B. Zamolodchikov. Structure constants and
conformal bootstrap in Liouville field theory. Nucl. Phys. B, 477:577–605, 1996.
[51] A. Neveu and J. H. Schwarz. Factorizable dual model of pions. Nucl. Phys. B, 31:86–
112, 1971.
[52] Pierre Ramond. Dual Theory for Free Fermions. Phys. Rev. D, 3:2415–2418, 1971.
[53] F. Gliozzi, Joel Scherk, and David I. Olive. Supersymmetry, Supergravity Theories and
the Dual Spinor Model. Nucl. Phys. B, 122:253–290, 1977.
242
[54] Michael B. Green and John H. Schwarz. Covariant Description of Superstrings. Phys.
Lett. B, 136:367–370, 1984.
[55] Jin Dai, R. G. Leigh, and Joseph Polchinski. New Connections Between String Theories.
Mod. Phys. Lett. A, 4:2073–2083, 1989.
[56] Joel Scherk and John H. Schwarz. Spontaneous Breaking of Supersymmetry Through
Dimensional Reduction. Phys. Lett. B, 82:60–64, 1979.
[57] Augusto Sagnotti. Open Strings and their Symmetry Groups. In NATO Advanced
Summer Institute on Nonperturbative Quantum Field Theory (Cargese Summer
Institute), 9 1987.
[58] Jack E. Paton and Hong-Mo Chan. Generalized veneziano model with isospin. Nucl.
Phys. B, 10:516–520, 1969.
[59] Petr Horava. Background Duality of Open String Models. Phys. Lett. B, 231:251–257,
1989.
[60] Joseph Polchinski. Dirichlet Branes and Ramond-Ramond charges. Phys. Rev. Lett.,
75:4724–4727, 1995.
[61] Joseph Polchinski. Combinatorics of boundaries in string theory.
50:R6041–R6045, 1994.
Phys. Rev. D,
[62] Joseph Polchinski, Shyamoli Chaudhuri, and Clifford V. Johnson. Notes on D-branes.
1 1996.
[63] Michael B. Green and John H. Schwarz. Anomaly Cancellation in Supersymmetric
D=10 Gauge Theory and Superstring Theory. Phys. Lett. B, 149:117–122, 1984.
[64] David J. Gross, Jeffrey A. Harvey, Emil J. Martinec, and Ryan Rohm. The Heterotic
String. Phys. Rev. Lett., 54:502–505, 1985.
[65] Lance J. Dixon and Jeffrey A. Harvey. String theories in ten-dimensions without spacetime supersymmetry. Nucl. Phys., B274:93–105, 1986. 93(1986).
[66] Luis Alvarez-Gaume, Paul H. Ginsparg, Gregory W. Moore, and C. Vafa. An o(16) x
o(16) heterotic string. Phys. Lett., B171:155–162, 1986.
[67] Shing-Tung Yau. On the ricci curvature of a compact kähler manifold and the
complex monge-ampére equation, i. Communications on pure and applied mathematics,
31(3):339–411, 1978.
243
[68] P. Candelas, Gary T. Horowitz, Andrew Strominger, and Edward Witten. Vacuum
configurations for superstrings. Nucl. Phys. B, 258:46–74, 1985.
[69] Richard Slansky. Group theory for unified model building. Physics reports, 79(1):1–128,
1981.
[70] W. Nahm. Supersymmetries and their Representations. Nucl. Phys. B, 135:149, 1978.
[71] Steven Weinberg and Edward Witten. Limits on Massless Particles. Phys. Lett. B,
96:59–62, 1980.
[72] E. Cremmer, B. Julia, and Joel Scherk. Supergravity Theory in Eleven-Dimensions.
Phys. Lett. B, 76:409–412, 1978.
[73] Daniel Z. Freedman and Antoine Van Proeyen. Supergravity. Cambridge Univ. Press,
Cambridge, UK, 5 2012.
[74] Clifford V. Johnson. D-branes. Cambridge Monographs on Mathematical Physics.
Cambridge University Press, 2005.
[75] Stephen H. Shenker. The Strength of nonperturbative effects in string theory. In
Cargese Study Institute: Random Surfaces, Quantum Gravity and Strings, pages 809–
819, 8 1990.
[76] Petr Horava and Edward Witten. Heterotic and type I string dynamics from elevendimensions. Nucl. Phys. B, 460:506–524, 1996.
[77] John H. Schwarz. The power of M theory. Phys. Lett. B, 367:97–103, 1996.
[78] Petr Horava. Strings on World Sheet Orbifolds. Nucl. Phys. B, 327:461–484, 1989.
[79] Petr Horava and Edward Witten. Eleven-dimensional supergravity on a manifold with
boundary. Nucl. Phys. B, 475:94–114, 1996.
[80] Cumrun Vafa. Evidence for F theory. Nucl. Phys. B, 469:403–418, 1996.
[81] A. Klemm, W. Lerche, and P. Mayr. K3 Fibrations and heterotic type II string duality.
Phys. Lett. B, 357:313–322, 1995.
[82] I. Brunner and R. Schimmrigk. F theory on Calabi-Yau fourfolds. Phys. Lett. B,
387:750–758, 1996.
[83] M. A. Awada, M. J. Duff, and C. N. Pope. N=8 Supergravity Breaks Down to N=1.
Phys. Rev. Lett., 50:294, 1983.
244
[84] Bobby Samir Acharya. M theory, Joyce orbifolds and superYang-Mills. Adv. Theor.
Math. Phys., 3:227–248, 1999.
[85] Mboyo Esole, Monica Jinwoo Kang, and Shing-Tung Yau. A New Model for Elliptic
Fibrations with a Rank One Mordell-Weil Group: I. Singular Fibers and Semi-Stable
Degenerations. 9 2014.
[86] Mboyo Esole, Patrick Jefferson, and Monica Jinwoo Kang. Euler Characteristics of
Crepant Resolutions of Weierstrass Models. Commun. Math. Phys., 371(1):99–144,
2019.
[87] Mboyo Esole, Patrick Jefferson, and Monica Jinwoo Kang. The Geometry of F4 -Models.
4 2017.
[88] Mboyo Esole, Ravi Jagadeesan, and Monica Jinwoo Kang. The Geometry of G2 ,
Spin(7), and Spin(8)-models. 9 2017.
[89] Mboyo Esole, Monica Jinwoo Kang, and Shing-Tung Yau.
Anomalies, and Phase Transitions. 12 2017.
Mordell-Weil Torsion,
[90] Mboyo Esole and Monica Jinwoo Kang. Flopping and slicing: SO(4) and Spin(4)models. Adv. Theor. Math. Phys., 23(4):1003–1066, 2019.
[91] Mboyo Esole and Monica Jinwoo Kang. The Geometry of the SU(2)× G2 -model. JHEP,
02:091, 2019.
[92] Mboyo Esole, Ravi Jagadeesan, and Monica Jinwoo Kang. 48 Crepant Paths to SU(2)×
SU(3). 5 2019.
[93] Curtis G. Callan, Jr. and Larus Thorlacius. SIGMA MODELS AND STRING
THEORY. In Theoretical Advanced Study Institute in Elementary Particle Physics:
Particles, Strings and Supernovae (TASI 88), 3 1989.
[94] Luis Alvarez-Gaume and Daniel Z. Freedman. Geometrical Structure and Ultraviolet
Finiteness in the Supersymmetric Sigma Model. Commun. Math. Phys., 80:443, 1981.
[95] M. Ademollo et al. Supersymmetric Strings and Color Confinement. Phys. Lett. B,
62:105–110, 1976.
[96] Philip Candelas, Xenia C. De La Ossa, Paul S. Green, and Linda Parkes. A Pair of
Calabi-Yau manifolds as an exactly soluble superconformal theory. Nucl. Phys. B,
359:21–74, 1991.
[97] Doron Gepner. Space-Time Supersymmetry in Compactified String Theory and
Superconformal Models. Nucl. Phys. B, 296:757, 1988.
245
[98] Edward Witten. On the Landau-Ginzburg description of N=2 minimal models. Int. J.
Mod. Phys. A, 9:4783–4800, 1994.
[99] Sven Krippendorf, Fernando Quevedo, and Oliver Schlotterer. Cambridge lectures on
supersymmetry and extra dimensions.
[100] S. J. Gates, Marcus T. Grisaru, M. Rocek, and W. Siegel. Superspace Or One Thousand
and One Lessons in Supersymmetry, volume 58 of Frontiers in Physics. 1983.
[101] E. Witten. Phases of n=2 theories in two-dimensions. Nucl. Phys., B403:159–222, 1993.
AMS/IP Stud. Adv. Math.1,143(1996).
[102] James M. Bardeen, B. Carter, and S. W. Hawking. The Four laws of black hole
mechanics. Commun. Math. Phys., 31:161–170, 1973.
[103] Jacob D. Bekenstein. Black holes and entropy. Phys. Rev. D, 7:2333–2346, 1973.
[104] S. W. Hawking. Particle Creation by Black Holes. Commun. Math. Phys., 43:199–220,
1975. [Erratum: Commun.Math.Phys. 46, 206 (1976)].
[105] Andrew Strominger and Cumrun Vafa. Microscopic origin of the Bekenstein-Hawking
entropy. Phys. Lett. B, 379:99–104, 1996.
[106] M. J. Duff and J. X. Lu. Black and super p-branes in diverse dimensions. Nucl. Phys.
B, 416:301–334, 1994.
[107] Juan Martin Maldacena. The Large N limit of superconformal field theories and
supergravity. Adv. Theor. Math. Phys., 2:231–252, 1998.
[108] Rajesh Gopakumar and Cumrun Vafa. On the gauge theory / geometry correspondence.
Adv. Theor. Math. Phys., 3:1415–1443, 1999.
[109] R. P. Feynman. Quantum theory of gravitation. Acta Phys. Polon., 24:697–722, 1963.
[110] S. W. Hawking. Gravitational radiation from colliding black holes. Phys. Rev. Lett.,
26:1344–1346, 1971.
[111] J. D. Bekenstein. Black holes and the second law. Lett. Nuovo Cim., 4:737–740, 1972.
[112] S. W. Hawking. Black Holes and Thermodynamics. Phys. Rev. D, 13:191–197, 1976.
[113] Steven Weinberg. Anthropic Bound on the Cosmological Constant. Phys. Rev. Lett.,
59:2607, 1987.
[114] Hee-Cheol Kim, Houri-Christina Tarazi, and Cumrun Vafa. Four-dimensional N = 4
SYM theory and the swampland. Phys. Rev. D, 102(2):026003, 2020.
246
[115] Tom Banks and Nathan Seiberg. Symmetries and Strings in Field Theory and Gravity.
Phys. Rev. D, 83:084019, 2011.
[116] Davide Gaiotto, Anton Kapustin, Nathan Seiberg, and Brian Willett. Generalized
Global Symmetries. JHEP, 02:172, 2015.
[117] Chi-Ming Chang, Ying-Hsuan Lin, Shu-Heng Shao, Yifan Wang, and Xi Yin.
Topological Defect Lines and Renormalization Group Flows in Two Dimensions. JHEP,
01:026, 2019.
[118] Ryan Thorngren and Yifan Wang. Fusion Category Symmetry I: Anomaly In-Flow and
Gapped Phases. 12 2019.
[119] Ying-Hsuan Lin and Shu-Heng Shao. Duality Defect of the Monster CFT. J. Phys. A,
54(6):065201, 2021.
[120] Tom Rudelius and Shu-Heng Shao. Topological Operators and Completeness of
Spectrum in Discrete Gauge Theories. JHEP, 12:172, 2020.
[121] Ben Heidenreich, Jacob McNamara, Miguel Montero, Matthew Reece, Tom Rudelius,
and Irene Valenzuela. Non-invertible global symmetries and completeness of the
spectrum. JHEP, 09:203, 2021.
[122] Gregory W. Moore and Nathan Seiberg. Classical and Quantum Conformal Field
Theory. Commun. Math. Phys., 123:177, 1989.
[123] Daniel Harlow and Hirosi Ooguri. Symmetries in quantum field theory and quantum
gravity. Commun. Math. Phys., 383(3):1669–1804, 2021.
[124] John H. Schwarz. An SL(2,Z) multiplet of type IIB superstrings. Phys. Lett. B, 360:13–
18, 1995. [Erratum: Phys.Lett.B 364, 252 (1995)].
[125] Martin Cederwall and P. K. Townsend. The Manifestly Sl(2,Z) covariant superstring.
JHEP, 09:003, 1997.
[126] Jacob McNamara and Cumrun Vafa. Cobordism Classes and the Swampland. 9 2019.
[127] Edward Witten. Five-branes and M theory on an orbifold. Nucl. Phys. B, 463:383–397,
1996.
[128] Katrin Becker and Melanie Becker. M theory on eight manifolds. Nucl. Phys. B,
477:155–167, 1996.
[129] Miguel Montero and Cumrun Vafa. Cobordism Conjecture, Anomalies, and the String
Lamppost Principle. JHEP, 01:063, 2021.
247
[130] Jacob McNamara and Cumrun Vafa. Baby Universes, Holography, and the Swampland.
4 2020.
[131] Sidney R. Coleman. Black Holes as Red Herrings: Topological Fluctuations and the
Loss of Quantum Coherence. Nucl. Phys. B, 307:867–882, 1988.
[132] Elliott Gesteau and Monica Jinwoo Kang. Holographic baby universes: an observable
story. 6 2020.
[133] Joseph Polchinski. Monopoles, duality, and string theory. Int. J. Mod. Phys. A,
19S1:145–156, 2004.
[134] Nima Arkani-Hamed, Lubos Motl, Alberto Nicolis, and Cumrun Vafa. The String
landscape, black holes and gravity as the weakest force. JHEP, 06:060, 2007.
[135] Stefano Andriolo, Daniel Junghans, Toshifumi Noumi, and Gary Shiu. A Tower Weak
Gravity Conjecture from Infrared Consistency. Fortsch. Phys., 66(5):1800020, 2018.
[136] Alek Bedroya. High energy scattering and string/black hole transition. 11 2022.
[137] Yuta Hamada, Toshifumi Noumi, and Gary Shiu. Weak Gravity Conjecture from
Unitarity and Causality. Phys. Rev. Lett., 123(5):051601, 2019.
[138] Yevgeny Kats, Lubos Motl, and Megha Padi. Higher-order corrections to mass-charge
relation of extremal black holes. JHEP, 12:068, 2007.
[139] Brando Bellazzini, Clifford Cheung, and Grant N. Remmen. Quantum Gravity
Constraints from Unitarity and Analyticity. Phys. Rev. D, 93(6):064076, 2016.
[140] Allan Adams, Nima Arkani-Hamed, Sergei Dubovsky, Alberto Nicolis, and Riccardo
Rattazzi. Causality, analyticity and an IR obstruction to UV completion. JHEP,
10:014, 2006.
[141] Brando Bellazzini, Matthew Lewandowski, and Javi Serra. Positivity of Amplitudes,
Weak Gravity Conjecture, and Modified Gravity. Phys. Rev. Lett., 123(25):251103,
2019.
[142] Nima Arkani-Hamed, Yu-tin Huang, Jin-Yu Liu, and Grant N. Remmen. Causality,
unitarity, and the weak gravity conjecture. JHEP, 03:083, 2022.
[143] Clifford Cheung, Junyu Liu, and Grant N. Remmen. Proof of the Weak Gravity
Conjecture from Black Hole Entropy. JHEP, 10:004, 2018.
[144] Miguel Montero, Thomas Van Riet, and Gerben Venken. Festina Lente: EFT
Constraints from Charged Black Hole Evaporation in de Sitter. JHEP, 01:039, 2020.
248
[145] Toby Crisford, Gary T. Horowitz, and Jorge E. Santos. Testing the Weak Gravity Cosmic Censorship Connection. Phys. Rev. D, 97(6):066005, 2018.
[146] Toby Crisford and Jorge E. Santos. Violating the Weak Cosmic Censorship Conjecture
in Four-Dimensional Anti–de Sitter Space. Phys. Rev. Lett., 118(18):181101, 2017.
[147] Hirosi Ooguri and Cumrun Vafa. Non-supersymmetric AdS and the Swampland. Adv.
Theor. Math. Phys., 21:1787–1801, 2017.
[148] Eduardo Gonzalo, L. E. Ibáñez, and I. Valenzuela. Swampland constraints on neutrino
masses. JHEP, 02:088, 2022.
[149] Riccardo Antonelli and Ivano Basile. Brane annihilation in non-supersymmetric strings.
JHEP, 11:021, 2019.
[150] Thomas W. Grimm, Chongchuo Li, and Irene Valenzuela.
Asymptotic Flux
Compactifications and the Swampland. JHEP, 06:009, 2020. [Erratum: JHEP 01,
007 (2021)].
[151] Pierre Corvilain, Thomas W. Grimm, and Irene Valenzuela. The Swampland Distance
Conjecture for Kähler moduli. JHEP, 08:075, 2019.
[152] Thomas W. Grimm, Eran Palti, and Irene Valenzuela. Infinite Distances in Field Space
and Massless Towers of States. JHEP, 08:143, 2018.
[153] Muldrow Etheredge, Ben Heidenreich, Sami Kaya, Yue Qiu, and Tom Rudelius.
Sharpening the Distance Conjecture in Diverse Dimensions. 6 2022.
[154] Hirosi Ooguri and Cumrun Vafa. On the Geometry of the String Landscape and the
Swampland. Nucl. Phys. B, 766:21–33, 2007.
[155] Seung-Joo Lee, Wolfgang Lerche, and Timo Weigand. Emergent strings from infinite
distance limits. JHEP, 02:190, 2022.
[156] Alek Bedroya and Cumrun Vafa. Trans-Planckian Censorship and the Swampland.
JHEP, 09:123, 2020.
[157] David Andriot, Niccolò Cribiori, and David Erkinger.
conjectures and the TCC bound. JHEP, 07:162, 2020.
The web of swampland
[158] Alek Bedroya. de Sitter Complementarity, TCC, and the Swampland. 10 2020.
[159] Alek Bedroya. Holographic origin of TCC and the Distance Conjecture. 11 2022.
249
[160] Yuta Hamada, Miguel Montero, Cumrun Vafa, and Irene Valenzuela. Finiteness and
the Swampland. 10 2021.
[161] Fernando Marchesano and Max Wiesner. Instantons and infinite distances. JHEP,
08:088, 2019.
[162] Dieter Lüst, Eran Palti, and Cumrun Vafa. AdS and the Swampland. Phys. Lett. B,
797:134867, 2019.
[163] Miguel Montero, Cumrun Vafa, and Irene Valenzuela. The Dark Dimension and the
Swampland. 5 2022.
[164] Eduardo Gonzalo, Miguel Montero, Georges Obied, and Cumrun Vafa. Dark Dimension
Gravitons as Dark Matter. 9 2022.
[165] Eric Perlmutter, Leonardo Rastelli, Cumrun Vafa, and Irene Valenzuela. A CFT
distance conjecture. JHEP, 10:070, 2021.
[166] Sidney R. Coleman and Frank De Luccia. Gravitational Effects on and of Vacuum
Decay. Phys. Rev. D, 21:3305, 1980.
[167] S. W. Hawking and I. G. Moss. Supercooled Phase Transitions in the Very Early
Universe. Phys. Lett. B, 110:35–38, 1982.
[168] G. von Gersdorff, M. Quiros, and A. Riotto. Scherk-Schwarz supersymmetry breaking
with radion stabilization. Nucl. Phys. B, 689:76–90, 2004.
[169] Edward Witten. Instability of the Kaluza-Klein Vacuum. Nucl. Phys. B, 195:481–492,
1982.
[170] Michael B. Green and John H. Schwarz. Supersymmetrical String Theories. Phys. Lett.
B, 109:444–448, 1982.
[171] Luis Alvarez-Gaume, Paul H. Ginsparg, Gregory W. Moore, and C. Vafa. An O(16) x
O(16) Heterotic String. Phys. Lett. B, 171:155–162, 1986.
[172] Augusto Sagnotti. Surprises in open string perturbation theory. Nucl. Phys. B Proc.
Suppl., 56:332–343, 1997.
[173] Shigeki Sugimoto. Anomaly cancellations in type I D-9 - anti-D-9 system and the
USp(32) string theory. Prog. Theor. Phys., 102:685–699, 1999.
[174] Georges Obied, Hirosi Ooguri, Lev Spodyneiko, and Cumrun Vafa. De Sitter Space
and the Swampland. 6 2018.
250
[175] Michael Dine and Nathan Seiberg. Is the Superstring Weakly Coupled? Phys. Lett. B,
162:299–302, 1985.
[176] Juan Martin Maldacena and Carlos Nunez. Supergravity description of field theories
on curved manifolds and a no go theorem. Int. J. Mod. Phys. A, 16:822–855, 2001.
[177] Mark P. Hertzberg, Shamit Kachru, Washington Taylor, and Max Tegmark.
Inflationary Constraints on Type IIA String Theory. JHEP, 12:095, 2007.
[178] Paul J. Steinhardt and Daniel Wesley. Dark Energy, Inflation and Extra Dimensions.
Phys. Rev. D, 79:104026, 2009.
[179] Timm Wrase and Marco Zagermann. On Classical de Sitter Vacua in String Theory.
Fortsch. Phys., 58:906–910, 2010.
[180] David Andriot. Open problems on classical de Sitter solutions.
67(7):1900026, 2019.
Fortsch. Phys.,
[181] David Andriot. Tachyonic de Sitter Solutions of 10d Type II Supergravities. Fortsch.
Phys., 69(7):2100063, 2021.
[182] David Andriot. On the de Sitter swampland criterion. Phys. Lett. B, 785:570–573,
2018.
[183] David Andriot and Christoph Roupec. Further refining the de Sitter swampland
conjecture. Fortsch. Phys., 67(1-2):1800105, 2019.
[184] Hirosi Ooguri, Eran Palti, Gary Shiu, and Cumrun Vafa. Distance and de Sitter
Conjectures on the Swampland. Phys. Lett. B, 788:180–184, 2019.
[185] Sumit K. Garg and Chethan Krishnan.
Swampland. JHEP, 11:075, 2019.
Bounds on Slow Roll and the de Sitter
[186] Tom Rudelius. Asymptotic scalar field cosmology in string theory. JHEP, 10:018, 2022.
[187] David Andriot and Ludwig Horer. (Quasi-) de Sitter solutions across dimensions and
the TCC bound. JHEP, 01:020, 2023.
[188] Rashmish K. Mishra. Confinement in de Sitter space and the swampland. JHEP,
01:002, 2023.
[189] Alek Bedroya, Robert Brandenberger, Marilena Loverde, and Cumrun Vafa. TransPlanckian Censorship and Inflationary Cosmology. Phys. Rev. D, 101(10):103502, 2020.
251
[190] Alek Bedroya, Miguel Montero, Cumrun Vafa, and Irene Valenzuela. de Sitter Bubbles
and the Swampland. 8 2020.
[191] Tom Rudelius. Conditions for (No) Eternal Inflation. JCAP, 08:009, 2019.
[192] Atsushi Higuchi. Forbidden Mass Range for Spin-2 Field Theory in De Sitter Spacetime. Nucl. Phys. B, 282:397–436, 1987.
[193] Tom Rudelius.
Asymptotic observables and the swampland.
104(12):126023, 2021.
Phys. Rev. D,
[194] Andrew Strominger. The dS / CFT correspondence. JHEP, 10:034, 2001.
[195] David Andriot, Ludwig Horer, and George Tringas. Negative scalar potentials and the
swampland: an Anti-Trans-Planckian Censorship Conjecture. 12 2022.
[196] Ofer Aharony, Zohar Komargodski, and Assaf Patir. The Moduli space and M(atrix)
theory of 9d N=1 backgrounds of M/string theory. JHEP, 05:073, 2007.
[197] Luis Alvarez-Gaume and Edward Witten. Gravitational Anomalies. Nucl. Phys. B,
234:269, 1984.
[198] Cody Long, Miguel Montero, Cumrun Vafa, and Irene Valenzuela. The Desert and the
Swampland. 12 2021.
252