Nucl - Phys.B v.688
Nucl - Phys.B v.688
Nucl - Phys.B v.688
www.elsevier.com/locate/npe
Two-loop superstrings
on orbifold compactifications ✩
Kenichiro Aoki a , Eric D’Hoker b , D.H. Phong c
a Hiyoshi Department of Physics, Keio University, Yokohama 223-8521, Japan
b Department of Physics and Astronomy, University of California, Los Angeles, CA 90095, USA
c Department of Mathematics, Columbia University, New York, NY 10027, USA
Abstract
The two-loop chiral measure for superstring theories compactified on Z2 reflection orbifolds
is constructed from first principles for even spin structures. This is achieved by a careful
implementation of the chiral splitting procedure in the twisted sectors and the identification of
a subtle worldsheet supersymmetric and supermoduli dependent shift in the Prym period. The
construction is generalized to compactifications which involve more general NS backgrounds
preserving worldsheet supersymmetry. The measures are unambiguous and independent of the gauge
slice.
Two applications are presented, both to superstring compactifications where 4 dimensions are Z2 -
twisted and where the GSO projection involves a chiral summation over spin structures. The first
is an orbifold by a single Z2 -twist; here, orbifolding reproduces a supersymmetric theory and it is
shown that its cosmological constant indeed vanishes. The second model is of the type proposed by
Kachru–Kumar–Silverstein and additionally imposes a Z2 -twist by the parity of worldsheet fermion
number; it is shown here that the corresponding cosmological constant does not vanish pointwise on
moduli space.
2004 Elsevier B.V. All rights reserved.
✩
Research supported in part by a Grant-in-Aid from the Ministry of Education, Science, Sports and Culture
and grants from Keio University (K.A.), and by National Science Foundation grants PHY-01-40151 (E.D.) and
DMS-02-45371 (D.P.).
E-mail addresses: ken@phys-h.keio.ac.jp (K. Aoki), dhoker@physics.ucla.edu (E. D’Hoker),
phong@cpw.math.columbia.edu (D.H. Phong).
0550-3213/$ – see front matter 2004 Elsevier B.V. All rights reserved.
doi:10.1016/j.nuclphysb.2004.04.001
4 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
1. Introduction
Over the past few years, important progress has been made on two-loop superstring
perturbation theory. A formula for the two-loop even spin structure superstring measure
has been constructed which is well-defined and independent of any choice of gauge
slice [1–5]. The independence of gauge slice choices is essential, since without it the
superstring scattering amplitudes would be ambiguous. The formula has been applied to
the evaluation of the two-loop N -point function for N 4 massless external NS–NS string
states, propagating in flat Minkowski space–time. In particular, the cosmological constant
for flat Minkowski space–time had been shown to vanish point by point on moduli space.
The original derivation of these results in [4] was carried out using a gauge where the
worldsheet gravitini are supported at two points q1 , q2 which obey the gauge condition,
Sδ (q1 , q2 ) = 0, where Sδ is the Szegö kernel (or fermion propagator) for even spin
structure δ. Recently, the results of [3] (which were expressed in a general gauge where
the measure is formulated in terms of meromorphic functions and forms) were used to
evaluate the superstring measure in a hyper-elliptic gauge by Zheng, Wu and Zhu in
[6,7], and the vanishing of the N 3-point functions was confirmed. The cancellations
of these amplitudes are examples of non-renormalization conjectures suggested by space–
time supersymmetry [8]. A bibliography of earlier work on this topic may be found in [4].
The purpose of the present paper is to derive the superstring measure (and the
cosmological constant) for orbifold compactification backgrounds produced by general
Z2 reflections. Certain orbifold models of this type provide some of the simplest
solvable examples of superstring theories with broken supersymmetry and are therefore of
considerable interest in particle physics. In particular, Kachru, Kumar and Silverstein [9]
have proposed Z2 × Z2 orbifolds of type II superstring theory with broken supersymmetry
(see also [10]), whose 1-loop cosmological constant vanishes by construction. Clearly, the
construction of any superstring theory model with broken supersymmetry and vanishing
(or very small) cosmological constant would be of great phenomenological value (see also
[11]).
In [9], arguments were presented in favor of the vanishing of the cosmological constant
to two-loop order (and higher), using the earlier string measure suggested by the picture
changing operator and BRST formalism [12,13]. Shortly thereafter, arguments were given
against the vanishing of the two-loop cosmological constant in the same model [14], also
using the picture changing operator and BRST formalism. In both approaches, special
choices for the worldsheet gravitini insertion points were made, the first in unitary gauge,
the second following [15]. By now, however, the picture changing operator and BRST
formalism is well known to be gauge slice dependent [13] (see also [16–19]) a fact that
casts doubts over the conclusions of both [9,14].
A reliable calculation of the cosmological constant requires a gauge slice independent
measure such as the one obtained in [1–4] for flat Minkowski space–time.
In the present paper, the two-loop chiral measure for superstring theories compactified
on Z2 reflection orbifolds is constructed from first principles for even spin structures. This
is achieved by a careful implementation of the chiral splitting procedure in the twisted
sectors and the identification of a subtle worldsheet supersymmetric and supermoduli
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 5
dependent shift in the Prym period. The limits of the measure are checked under various
degenerations and are found to agree with the behavior expected on physical grounds.
The construction is generalized to compactifications which involve more general NS
backgrounds, and preserve worldsheet supersymmetry. The measures are shown to be
unambiguous and independent of the gauge slice. Our original primary concern was with
the study of the models of [9]. The results obtained in this paper for the Z2 -twisted chiral
measure, however, will be indispensable ingredients in the construction of the Z2 -twisted
conformal field theory and string amplitudes to two-loop order in any orbifold model based
on Z2 reflections.
Two applications are presented, both to superstring compactifications where 4 dimen-
sions are Z2 -twisted and where the GSO projection involves a chiral summation over spin
structures, i.e., a summation carried out independently on left and right movers. The first
is an orbifold by a single Z2 twist, which is known to reproduce a supersymmetric the-
ory; it is shown here that its cosmological constant vanishes. The second model is of the
type proposed by Kachru, Kumar and Silverstein [9] and additionally imposes a Z2 twist
by the parity of worldsheet fermion number; it is shown here that the corresponding chiral
measure is non-vanishing. Therefore, the associated cosmological constant does not vanish
pointwise on moduli space. It is logically possible, though we shall argue it is unlikely, that
despite this result, a cancellation will occur for the cosmological constant when left and
right chiral blocks are assembled and the integration over moduli is carried out.
In [1–4], a detailed method was laid out for determining the two-loop superstring
measure on general space–times. Explicit formulas were obtained in those papers for the
case of flat Minkowski space–time M; they involve the matter and ghost supercurrents,
stress tensors, and partition functions.
All compactifications discussed in this paper will only affect the matter fields and will
leave the ghost fields unchanged. In orbifold compactifications C of flat Minkowski space–
time, the expressions for the matter supercurrent and stress tensor (in terms of the fields x µ
µ
and ψ± ) are identical to those of flat Minkowski space–time and will be denoted by SMm
µ
and TMm (or simply Sm and Tm when no confusion is possible). The fields x µ and ψ± will
be summed over untwisted and twisted sectors, dependent on C. Therefore, the partition
function, and more generally any vacuum expectation value will depend on C; they will
be referred to by ZC and · · ·C . (On a more general compactification background C, the
µ
expression for the supercurrent and stress tensor (in terms of x µ and ψ± ) will generally be
different from their expressions in flat space–time and will be denoted by SCm and TCm .)
Thus, the main new ingredients are the construction of the propagators for the Z2 -
µ
twisted fields x µ and ψ± , and the identification of the corresponding changes in the string
measure. These changes reflect the global geometry of the worldsheet with a quadratic
branch cut, and must be treated with care. We summarize here the main steps in our
derivation.
• The first step is to determine the Z2 -twisted bosonic propagator ∂z x(z)∂w x(w)ε for
given twist ε. Its explicit expression will be given in terms of the familiar Szegö kernel,
6 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
as well as the Prym period τε and the Prym differential ωε for twist ε. The propagator
for the Z2 -twisted fermionic field ψ+ was identified in [9].
• The next step is to carry out chiral splitting for the Z2 -twisted matter superfields. Since
the Z2 -twisted propagators are different from those of flat space–time, the effective
rules, derived in [20,21] for Minkowski space–time, will not apply directly in this
case. The Z2 -twisted case has to be worked out in its own right, and its own global
geometric features carefully accounted for. In fact, it is here that a subtle and crucial
supersymmetric correction ∆τε to the super Prym period makes its appearance.
• An important check is the independence of the gauge-fixed measure from the choice of
gauge slice. In fact, a formula will be derived for the two-loop measure in terms of the
partition function, supercurrent and stress tensor correlators for any compactification
that does not affect the ghosts, preserves worldsheet supersymmetry and has matter
central charge c = 15. The resulting measure will be shown to be unambiguous and
slice independent under these assumptions. This result was announced in [1].
• The next task is to express the Z2 -twisted superstring measure in terms of ϑ-constants.
The procedure follows [1–4], but here there is a significant new difficulty. A new,
internal momentum pε -dependent, contribution arises from the correction ∆τε to
the super Prym period. The explicit evaluation, in terms of ϑ-constants, of this
contribution is one of the more arduous steps in the paper. The resulting chiral
superstring measure depends on both the spin structure δ and the twist ε and is denoted
dµC [δ; ε](pε ).
• The limit of the chiral measure dµC [δ; ε](pε ) is computed for a variety of degener-
ations and checked versus the limiting behavior expected on physical grounds. The
limits will also be needed to investigate the behavior of the cosmological constant.
• Next, the behavior of the chiral measure dµC [δ; ε](pε ) under modular transformations
is derived. The full genus 2 modular group Sp(4, Z) will act on the twist cycle
and relate the measures for various twists. In carrying out the GSO projection, a
summation over all spin structures is to be carried out, for fixed twist. Therefore, it
is the subgroup Hε of all elements in Sp(4, Z) which leave ε invariant which will
be of greatest importance in determining all possible GSO projections consistent with
modular covariance of the chiral measure. The group Hε is studied in detail; it is found
to be generated by 5 independent elements (while Sp(4, Z) is generated by 6), the
various orbits under Hε for twists and spin structures are calculated, and the behavior
of dµC [δ; ε](pε ) is obtained.
• As a general principle, the GSO projection is carried out by summing over the
spin structures δ with suitable phases. In practice, for each specific physical model,
the conformal blocks entering the chiral string measure have to be identified and
summed over δ consistently with modular covariance. For the simplest Z2 -twisted
theories, where the orbifold group is exactly Z2 acting by reflection, the conformal
blocks are just dµC [δ; ε](pε ). However, for more complicated orbifold groups such
as Z2 × Z2 in the models of [9], the conformal blocks can be more subtle, and have
to be determined with some care. The cosmological constant for various asymmetric
Z2 -orbifold models is then determined by summing over the spin structures δ. In
particular, the cosmological constant in the models of Kachru, Kumar and Silverstein
[9] is shown to be non-vanishing pointwise on moduli space.
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 7
Fig. 1. Twisted cycles Aε and Bε and the single internal loop momentum pε .
In Section 2, standard facts about conformal field theory on genus 1 and genus 2
Riemann surfaces as well as the partition function of a Z2 -twisted boson [22] are reviewed.
A self-contained construction is presented of the Prym differential ωε (z), the Prym period
τε and the Schottky relations.1 The key result in Section 2 is for the Z2 -twisted boson
propagator on a genus 2 Riemann surface Σ. In a canonical homology basis with cycles
AI , BI , I = 1, 2, the 16 possible twists may be described in terms of half-characteristics
ε = (εI ,
I ). The bosonic field x(z) in twist sector ε satisfies the following boundary
conditions,
x(z + AI ) = (−)2εI x(z), x(z + BI ) = (−)2εI x(z). (1.1)
The corresponding Z2 -twisted propagator is characterized by the same boundary condi-
tions in both variables z and w, and the short distance singularity (z − w)−2 . It is obtained
in terms of the Szegö kernel Sδ for characteristic δ, and the normalized Prym differential
ωε ,
∂z x(z)∂w x(w) ε = Sδ + (z, w)Sδ − (z, w) − 4πi∂τε ln ϑi (0, τε )ωε (z)ωε (w). (1.2)
i i
The notation here is as follows. Given a twist ε = 0, six of the ten even spin structures δ are
such that δ + ε is also an even spin structure. These six spin structures may be parametrized
by δi± , with i = 2, 3, 4 and δi− = ε + δi+ and δi± modulo ε clearly map onto the 3 even spin
structures of genus 1. The ϑi are the corresponding genus 1 ϑ-functions for even spin
structures. The precise normalization of ωε (z) is derived in Section 2.6.
In Section 3, the chiral splitting procedure is carefully implemented in the twisted
sectors of the Z2 -twisted superstring theory. Because of the Z2 -twisting, only a single
internal loop momentum survives (instead of the two internal loop momenta in the
untwisted sector). This is illustrated in Fig. 1, where a quadratic branch cut is applied
along the cycle Cε producing double-valued behavior along cycle Dε . The only internal
loop momentum left crosses the remaining independent cycle Aε .
A subtle worldsheet supersymmetric and supermoduli dependent shift in the Prym
period is identified. The results are as follows: let AC [δ] be the left-right symmetric
compactified amplitude for fixed spin structure δ and n directions twisted by Z2 . For
1 Detailed discussions of genus 1 ϑ -functions and genus 2 spin structures, ϑ -functions, modular transforma-
tions and Thomae-type formulas are collected in Appendices A and B, respectively.
8 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
simplicity, only the 0-point function is considered here. It is well known (cf. [23, p. 967])
that the gauge-fixed expression for AC [δ] as an integral over supermoduli space is
A 2 2
AC [δ] = dm D(XB B̄C C̄) δ HA |B e−Im −Igh . (1.3)
s M2 A A
Here mA are parameters for a (3|2)-dimensional slice for the even and odd supermoduli,
HA is the corresponding super Beltrami differentials, X, B, B̄, C, C̄ are respectively the
matter and ghost superfields, and Im , Igh are respectively the matter and the ghost actions.
This expression is non-chiral, but one of its fundamental properties is that it can be
split as an integral over internal momenta of the norms squared of a chiral amplitude,
AC [δ, ε](pε ),
A 2 µ µ
AC [δ] = d 10−n pI d n pε dm AC [δ, ε](pε )2 eiπpI Ω̂I J pJ 2 .
ε
s M2 A (1.4)
µ
The chiral amplitude AC [δ, ε](pε ) is superholomorphic on supermoduli space sM2 and
may be computed in terms of the chiral partition functions ZC and ZM of the compactified
and Minkowskian theories respectively, following the method of [1–4]. Parametrizing the
genus 2 supermoduli space by the super period matrix Ω̂I J as bosonic coordinates and the
two Grassmann coordinates ζ 1 , ζ 2 as fermionic coordinates, one obtains,
ZC [δ; ε]
The super-Prym period τ̂ε and the super-period matrix Ω̂I J are related via this same
equation τ̂ε = Rε (Ω̂) and both are invariant under local worldsheet supersymmetry. The
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 9
quantity ∆τε is a subtle correction which arises from chiral splitting and is given by2
i
Γ [δ; ε] ≡ Z dζ 2 dζ 1 ∆τε (1.9)
we find
dµC [δ; ε](pε )
iπτε pε2 ZC [δ, ε] Ξ6 [δ]ϑ[δ]
4
=e + iπpε − n∂τε ln ϑi (0, τε ) Γ [δ; ε] d 3 Ω.
2
ZM [δ] 16π 6 Ψ10
(1.10)
The expression Ξ6 [δ](Ω) is the modular covariant form found in [1–4]. The key
new expression here is Γ [δ; ε], which requires a substantial calculation, especially the
determination of its overall sign. The final result is
± i ϑi4 ϑ[δi± ]2 ϑj4
Γ [δi , ε] = ±ν0 |µi . (1.11)
(2π)7 η12 ϑ[δi∓ ]2 ϑ[δj+ ]2 ϑ[δj− ]2
Here, the ± signs are correlated with one another on both sides of the equation. The ϑi ’s
are genus 1 ϑ-constants with respect to τε , while the ϑ[δ]’s are genus 2 ϑ-constants with
characteristic δ. The above expression for Γ [δi± , ε] is independent of j .
In Section 6, modular properties in the presence of a twist are studied. The subgroup Hε
that fixes a twist ε is determined: it has 48 elements, and is of the form
for suitable phases ηL [δL ; λL ]. The partition function for the asymmetric orbifold model
G is then given by
ZG = (det Im Ω)−5+n/2 dµL [λL ](pL ) ∧ dµR [λR ](pR )
pL ,pR λL ,λR
× K(λL , λR ; pL , pR ) (1.14)
where n is the number of compactified dimensions, and K(λL , λR ; pL , pR ) are suitable
constant coefficients, depending only on the labels λL , λR , pL , pR for left and right chiral
blocks. The determination of K is a difficult issue, and we shall discuss it further in a
forthcoming publication [24]. In this paper, we shall examine only the chiral models, that
is, only the issue of vanishing of the chiral blocks dµL [λL ](pL ) and dµR [λR ](pR ).
We carry out this procedure for the simplest Z2 models, where n = 4 and G is given
by Z2 reflections. The chiral blocks of this theory are of course just the conformal blocks
dµC [δ; ε](pε ), with index λ = λL = ε, of the Z2 theory we have just derived in (1.10). It
is shown that there is a unique choice of phases for each ε that is consistent with modular
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 11
covariance, given by
η[δ; ε] = 1. (1.15)
Upon carrying out the GSO projection, the term involving Γ [δ; ε] cancels due to the genus
1 Riemann identities and the chiral block reduces to
+ 2 − 2
eiπτε pL ϑ[δj ] ϑ[δj ] 3
2
The main goal of this section is to derive elements from the theory of Z2 -twisted fields
which we need. For this, we require some background on spin structures, ϑ-functions,
double covers, and Prym differentials, summarized in Sections 2.1 and 2.2. The partition
function for Z2 -twisted bosonic scalar fields has been obtained by Dijkgraaf, Verlinde,
and Verlinde [22]. It is recalled in Section 2.3. For our purpose, the main objects of
interest are the propagators. The fermionic one is easy to find, so we concentrate on the
bosonic propagator ∂z x(z)∂w x(w)ε . As a warm-up, it is derived for the case of genus 1
in Section 2.4. The rest of the section is devoted to extending this method to the case of
genus h = 2. The Schottky relations between ϑ-functions and the Prym differential are
also derived.
Given the intersection numbers (2.1), the basis is unique up to modular transformations
belonging to the modular group Sp(2h, Z). The structure of the modular group, its action
on spin structures, twists and ϑ-functions is summarized in Appendix B.3.
A spinor field on Σ requires the assignment of a spin structure on Σ, corresponding
to an element of H1 (Σ, Z2 ). In a given homology basis, a spin structure may be labeled
by a half-integer characteristics κ ≡ (κ |κ ) where κ and κ , each valued in {0, 1/2}h ,
and specifies the monodromies around A- and B-cycles respectively. The parity of the spin
structure κ is that of the integer 4κ · κ , and is modular invariant. For genus 1, the 4 spin
structures separate into 3 even and 1 odd (respectively denoted µ and ν0 ). For genus 2, the
16 spin structures separate into 10 even and 6 odd spin structures (respectively denoted by
δ and ν). An important pairing between characteristics is given by the signature,
for even spin structures, we have ϑi (z|τ ) ≡ ϑ[µi ](z|τ ), with the corresponding even spin
structure characteristics given by
µ2 = 12 0 , µ3 = (0|0), µ4 = 0 12 . (2.5)
Two fundamental building blocks for the theory of functions and forms on a Riemann
surface5 are the prime form E(z, w) and the Szegö kernel Sδ (z, w) (which we shall only
need for even spin structure δ),
z z
ϑ[ν]( w ωI , Ω) ϑ[δ]( w ωI , Ω)
E(z, w) = , Sδ (z, w) = . (2.6)
hν (z)hν (w) ϑ[δ](0, Ω)E(z, w)
The holomorphic 1/2 form hν (z) for odd spin structure ν obeys hν (z)2 = ∂I ϑ[ν](0)ωI (z).
The prime form is a holomorphic −1/2 form in z and w on the universal cover of Σ, which
behaves as z − w when z ∼ w, and is independent of ν. The Szegö kernel for even spin
structure δ is a meromorphic 1/2 form in z and w on Σ with a simple pole at z = w with
unit residue, The importance of these quantities derives from the fact that Sδ is the chiral
fermion propagator, while ∂z ∂w ln E(z, w) is the scalar propagator on Σ.
The remaining generators of H 1 (Σ, Z) will be denoted by Aε and Bε , and have intersection
numbers, #(Aε , Bε ) = #(Cε , Dε ) = 1, while all others vanish. Under the modular group,
all twists transform in two orbits, one consisting of ε = 0 and the other of all ε = 0. It
will often be convenient to use the action of the modular group on the twists to choose a
standard reference twist. A simple choice consists of taking
0 0
ε= ⇔ Aε = A1 , Bε = B1 , Cε = A2 , Dε = B2 . (2.9)
0 12
It will often be helpful to keep the notation Aε , Bε , Cε , Dε for the sake of generality.
5 For reviews of string perturbation theory, see [23,25]; for recent results on Green functions on twisted
surfaces, see [26]; for the original mathematical results, see [27].
14 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
Fig. 2. The genus 2 Riemann surface and its unramified branched double cover.
Double valued scalar fields on a surface Σ may also be viewed as single valued scalar
functions of the double cover Σ̂ of the original surface Σ. (Similarly, double valued spinor
fields may be viewed as well-defined sections of a spin bundle of the cover Σ̂.) If a function
is double valued along the cycle B2 , and has a quadratic branch cut across the conjugate
cycle A2 , the surface Σ̂ is obtained by cutting open Σ along the cycle A2 and gluing the
two boundaries of this surface to the boundaries of its image under the reflection involution
ι, as shown schematically in Fig. 2. Given any twist ε = 0, the double cover Σ̂ , as defined
above, is unique.
The double cover Σ̂ is a surface of genus 3, whose complex structure is induced by
the complex structure of Σ. Functions and forms on Σ that are Z2 -twisted (i.e., double-
valued) along the cycle B2 = Dε , become functions and forms on Σ̂ that are odd under
the reflection involution ι. Of special importance is the Prym differential, ωε , which
is a holomorphic 1-form, odd under the reflection ι. For genus 2, ωε is unique up to
normalization. By construction, its periods along the cycles A2 = Cε and B2 = Dε vanish.
Normalizing the period of ωε to 1 along A1 = Aε , its integral along B1 = Bε yields the
Prym period τε ,
ωε = 1, ωε = τε . (2.10)
Aε Bε
4 elements. Without loss of generality, we may assume that δi− = δi+ + ε. For the twist ε
of (2.9), for example, we have
µi µi
δi+ = , δi− = , (2.11)
0|0 0| 12
where µi are the genus 1 even spin structure assignments of (2.5). Clearly, a genus 2
surface with a twist ε = 0 has a canonically associated group of 6 even spin structures δi± ,
which in turn map 2 to 1 onto the 3 genus 1 even spin structures.
The period matrix ΩI J of the genus 2 surface uniquely determines the Prym period τε ,
up to the action of generators in the Torelli group;6 in this case, up to the transformations
τε → τε + 4 (see [22]). The relation between ΩI J and τε is given by the Schottky relations,
already introduced in (1.6). They may be expressed in an equivalent form as follows, for
any i = j and i, j = 2, 3, 4,
+ −
ϑ[δi+ ](0, Ω)2ϑ[δi− ](0, Ω)2 ϑ[δj ](0, Ω) ϑ[δj ](0, Ω)
2 2
= . (2.12)
ϑi (0, τε )4 ϑj (0, τε )4
Here, δi± are the group of 6 even spin structures introduced in the preceding paragraph
and ϑi are the genus 1 ϑ-functions associated with the genus 1 spin structures µi onto
which δi± map. Clearly, all three such relations are equivalent to one another. The Schottky
relations for genus 2 will be proven and their origin will be explained in Section 2.6.
The first key result of [22] of interest to us is the formula for the partition function for
a single Z2 -twisted boson on a circle of radius R. In a sector of twist ε, the worldsheet
instanton sum is given by
2
6 The Torelli group is the quotient of the full mapping class group, consisting of all equivalence classes of
disconnected diffeomorphisms of the surface Σ by the modular group.
16 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
Here, Z(Ω) is the inverse of the chiral bosonic partition function of the uncompactified
circle theory. In view of the Schottky relations (2.12), this expression is independent of the
choice of i and therefore only depends upon ε.
On the torus Σ = C/Z + τ Z, with modulus τ , a scalar field x(z) and a fermion field
ψ(z) with even spin structure δ, twisted by ε, obey the following monodromy conditions,
x(z + 1) = (−)2ε x(z), ψ(z + 1) = −(−)2δ (−)2ε ψ(z),
x(z + τ ) = (−)2ε x(z), ψ(z + τ ) = −(−)2δ (−)2ε ψ(z). (2.16)
The corresponding propagators satisfy the same monodromy conditions as the fields in
both z and w. The bosonic propagator Bε (z, w) is symmetric under interchange of z and
w, and has a double pole at z = w, while the chiral fermion propagator Sδ+ε (z, w) is
antisymmetric under interchange of z and w, and has a simple pole at z = w. Matching
monodromies and poles yields the following propagators,
ϑ1 (z − w|2τ )
Bε (z, w) = ∂x(z)∂x(w) = ∂z ∂w ln ,
ϑ1 (z − w + τ |2τ )
ϑ[δ + ε](z − w, τ )ϑ1 (0, τ )
Sδ+ε (z, w) = ψ+ (z)ψ+ (w) = . (2.17)
ϑ[δ + ε](0, τ )ϑ1 (z − w, τ )
The chiral fermion propagator is just the Szegö kernel with spin structure shifted to δ + ε.
It is convenient to recast the twisted boson propagator in terms of the Szegö kernel,
since it will turn out that it is these formulas which will lend themselves to generalization
to higher genus. Using the doubling formulas for genus 1, as well as the formulas for z-
derivatives (see Appendix A), the following relation is established (valid now for arbitrary
twist ε = 0)
Bε (z, w) = Sµ (z, w)Sµ+
(z, w), (2.18)
where µ is any even spin structure such that µ + ε is also even. There are two possible
choices for µ for any given ε = 0; e.g., if ε = (0| 12 ), one can have µ = (0|0) or µ = (0| 12 ).
Theories with worldsheet supersymmetry will be of special interest. The worldsheet
supercurrent formed of a free scalar x and a chiral fermion ψ+ with spin structure δ is
given by S = −(1/2)ψ+ ∂z x and is invariant under the Z2 twist ε. The supercurrent two-
point function is given by
1 1
S(z)S(w) = Bε (z, w)Sδ+
(z, w) = Sµ (z, w)Sµ+
(z, w)Sδ+
(z, w). (2.19)
4 4
Using the OPE for the two supercurrents yields the stress tensor for the system,
ĉ/4 T (w)/2
S(z)S(w) = + + reg . (2.20)
(z − w)3 z−w
Expanding the Szegö kernels, one find ĉ = 1, as expected. Using the heat equation,
∂z2 ϑ[µ](z|τ ) = 4πi∂τ ϑ[µ](z|τ ), the remaining contribution may be recast as follows,
ϑ[µ]ϑ[µ + ε]ϑ[δ + ε]
T ε,δ = πi∂τ ln . (2.21)
ϑ[0|0]ϑ[ 12 |0]ϑ[0| 12 ]
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 17
The corresponding partition function for one boson and one fermion of spin structure δ,
√
both twisted by ε, are obtained using the relation T ε,δ = 4π/ gδ ln Zε,δ /δg zz and are
ϑ[δ + ε](0, τ ) 1/2
Zε,δ = (2.22)
ϑ[ν + ε](0, τ )
with ν = ( 12 | 12 ) in agreement with known results [28,29].
Inspection of the monodromy equations (2.7) reveals that implementing a twist ε on the
fermion field ψ+ (z) and on its propagator simply results in replacing the spin structure δ
by δ + ε. Of interest here are only the cases where δ + ε is itself an even spin structure.7
The propagator for the twisted fermion is thus the Szegö kernel for δ + ε,
ψ+ (z)ψ+ (w) ε = Sδ+ε (z, w). (2.23)
Viewed on the surface Σ, this quantity is a double-valued section of a spin bundle for spin
structure δ; it becomes a single-valued section of a spin bundle on Σ̂.
Implementing a twist ε on a scalar field x(z) is achieved by imposing the monodromy
conditions of (2.7). It may be constructed in terms of the prime form for the covering
surface Σ̂ as was done in [22]. Actually, we shall only need the propagator for the twisted
field ∂z x,
Bε (z, w) ≡ ∂z x(z)∂w x(w) ε . (2.24)
It obeys the same monodromy conditions as x does in both z and w and has a double pole
at z = w with unit residue.
By analogy with the case of genus 1, the propagator Bε (z, w) may be more simply
constructed in terms of products of the Szegö kernels for even spin structures δi+ and δi−
of (2.11). Indeed, the products
readily produce the required double pole. Given the fact that δi+ + δi− = ε modulo integral
periods, they also manifestly exhibit the required monodromy by ε.
The differences Bi (z, w) − Bj (z, w) are holomorphic 1-forms in both z (as well as in
w), and have the same non-trivial monodromy as ∂z x. On a genus 2 surface, such a one-
form must be proportional to the Prym differential ωε (z). Since ωε is unique, there must be
one linear relation between the three Bi . Taking this into account, the twisted propagator
Bε may be expressed via any of the following three expressions,
A more detailed discussion of the Prym differential and period will be given later.
7 The cases where δ + ε is and odd spin structure will not contribute to the cosmological constant for the
orbifold models we shall consider later on.
18 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
The calculation of bi
Below, the coefficient bi will be computed using the partition function for the twisted
boson theory, derived in [22] and quoted in Section 2.3. The result, in the basis of reference
(2.11), is given by
The detailed calculation of bi proceeds as follows. The stress tensor for the twisted boson
x is computed in two different ways; first, from the propagator Bε ,
1 1
Tzz = lim Bε (z, w) − (2.28)
2 w→z (z − w)2
and second from the variation of the partition function Z qu [ε] of the twisted x field,
4π δ
Tzz = δzz ln Z qu [ε], δzz ≡ √ . (2.29)
g δg zz
The partition function Z qu [ε], was computed in [22], and was presented in the notation of
(2.11) in (2.15). Equating (2.28) and (2.29) yields the following results. In the variation
(2.29) of (2.15), the variation of Z and of the ϑ-functions ϑ[δi+ ]ϑ[δi− ] precisely matches
the contribution to (2.28) of the first term in (2.26). The remaining variation gives
1
bi ωε (z)2 = −δzz ln ϑi (0, τε ) = −(δzz τε )∂ ln ϑi (0, τε ). (2.30)
2
The variation of τε with respect to the metric is analogous to the variation of the
genus 2 period matrix, δzz ΩI J = 2πiωI (z)ωJ (z). In the next subsection (where a detailed
construction of the Prym differential will also be given), we shall derive the following
expression,
The construction of the normalized Prym differential, the Schottky relations and the
variational equation (2.31) for τε require a more concrete understanding of the Prym
differential. It is convenient to obtain the Prym differential from its relation to the Szegö
kernel,
ωε (z)ωε (w) ∼ Sδ + (z, w)Sδ − (z, w) − Sδ + (z, w)Sδ − (z, w) . (2.32)
i i j j
6
s2 = (z − pa ). (2.33)
a=1
Recall that the identification between even and odd spin structures and branch points
proceeds as follows,
δi+ = νa + νc + νd = −(νb + νe + νf ),
δi− = νb + νc + νd = −(νa + νe + νf ),
δj+ = νa + νd + νe = −(νb + νc + νf ),
δj− = νb + νd + νe = −(νa + νc + νf ),
δk+ = νa + νc + νe = −(νb + νd + νf ),
δk− = νb + νc + νe = −(νa + νd + νf ). (2.35)
We note that a, b cannot belong to the same set in the partition associated with the spin
structure δ since then, δ + ε would be odd.
Let δ be the even spin structure corresponding to the partition of the 6 branch points
into two sets A, B of 3 branch points each, as in (B.18). In the hyperelliptic representation,
the Szegö kernel for the spin structure δ is given by [27],
1 sA (z)sB (w) + sA (w)sB (z) dz 1/2 dw 1/2
Sδ (z, w) = , (2.36)
2 z−w s(z) s(w)
where the following notation is used,
8 A detailed discussion of the map between the ϑ -function and hyperelliptic formulations of genus 2 Riemann
surfaces was given in [4]; see Appendix B of this paper for a summary of the relevant facts.
20 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
The combinations of Szegö kernels that enter into (2.32) may be expressed in this
hyperelliptic formulation,
(pc − pd )(pf − pe )
Sδ + (z, w)Sδ − (z, w) − Sδ + (z, w)Sδ − (z, w) = dz dw,
j j k k 4scdef (z)scdef (w)
(pc − pf )(pe − pd )
Sδ + (z, w)Sδ − (z, w) − Sδ + (z, w)Sδ − (z, w) = dz dw,
k k i i 4scdef (z)scdef (w)
(pc − pe )(pd − pf )
Sδ + (z, w)Sδ − (z, w) − Sδ + (z, w)Sδ − (z, w) = dz dw. (2.38)
i i j j 4scdef (z)scdef (w)
The hyperelliptic form of the Prym differential is thus given by
dz
ωε (z) ∼ . (2.39)
scdef (z)
This quantity involves only 4 branch points, and by SL(2, C)-invariance depends on a
single modulus, which is the Prym period τε .
Using the heat equation 4πi∂I J ϑ = ∂I ∂J ϑ, the relation δzz ΩI J = 2πiωI (z)ωJ (z) and
ϑi π
∂τε ln (0, τε ) = i σ (µi , µj )ϑk (0, τε )4 (2.52)
ϑj 4
yields the equation announced in (2.31).
Here, Vi are vertex operators for physical states and the subscript C stands for the
functional integral evaluated in the superconformal field theory associated with C. Upon
factoring out the local symmetries of the theory, the integration over supergeometries
(EM A , ΩM ) reduces to an integral over supermoduli space [20,35], which is defined by
Let ε = 0 denote the Z2 -twist which is applied to n components of the scalar field x
and n components of the fermion fields ψ± taken to have spin structure δ. This means that
these fields satisfy the twisted boundary conditions of (2.7). The resulting chiral scalar and
chiral fermion propagators were computed in Section 2, and are given by
∂z x+ (z)∂w x+ (w) ε = Bε (z, w),
ψ+ (z)ψ+ (w) ε = Sδ+ε (z, w). (3.7)
24 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
The functional integral over n components of twisted ψ± fields is given by the determinant
of the Dirac determinant, which is holomorphically factorized (up to the Belavin–Knizhnik
anomaly [36], whose effects cancel in the full amplitude),
ϑ[δ + ε](0, Ω) n
/ n/2 =
(DetD) .
(3.8)
Z(Ω)
The quantity Z is the inverse of the chiral partition function of a free untwisted scalar;
its expression in terms of ϑ-functions was given in [2], Eq. (4.5) but will not be needed
here. The functional integral over x µ in the twisted sector is simpler than in the untwisted
sector, because no zero-mode occurs in the twisted sectors. By the Belavin–Knizhnik result
[36], the combination of determinants that holomorphically factorizes involves the inner
products of holomorphic one forms. In the twisted sector there is precisely one such form,
namely the Prym differential ωε , normalized as in (2.10). The functional determinant over
n twisted fields x is then given by the following expression,
1 ϑ[δj+ ](0, Ω)ϑ[δj− ](0, Ω) n
(Det ∆)−n/2 = . (3.9)
(2 Im τε )n/2 Z(Ω)2 ϑj (0, τ )2
The obstruction to its holomorphic factorization lies entirely in the prefactor involving
Im τε . Chiral splitting is achieved by introducing a single internal loop momentum pε , in
terms of which also the prefactor may be split,
1 2 2
n/2
= d n pε eiπτε pε . (3.10)
(2 Im τε )
The functional integral (3.3) exhibits further obstructions to chiral splitting. The first is
through the presence of the quartic fermion term χ χ̄ψ+ ψ− in the matter action Im ; the
second is through the non-holomorphic dependence of the full fermion propagator,
2π
∂z x(z)∂w x(w) ε = Bε (z, w) − ωε (z)ωε (w),
Im τε
2π
∂z x(z)∂w̄ x(w) ε = −2πδ(z, w) + ωε (z)ωε (w). (3.11)
Im τε
The first line may be established by comparing the stress tensors computed from a variation
of the scalar determinant (3.9) and from the full x-field propagator. This procedure is the
same as the one used in (2.28) and (2.29), but is now applied to the full twisted scalar
partition function and propagator. The second line in (3.11) is obtained by applying ∂w̄ to
the first line and then integrating in w; the remaining coefficient in front of ωε (z)ωε (w) is
fixed by requiring that the integration versus ωε (w) vanish.
Next, one proceeds in parallel with the proof of chiral splitting for the case of flat space–
time [21,23]. The result may be summarized in terms of a set of effective rules. The final
formula for the integration over the matter fields may be recast in the following form,
A 2 µ µ
AC [δ] = d 10−n pI d n pε dm AC [δ, ε](pε )2 eiπpI Ω̂I J pJ 2 ,
ε
s M2 A
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 25
N
AC [δ, ε](pε ) = δ HA |B Vichi
A i=1
2
d z + µ
× exp χ S(z) + pε
µ
dz ∂z x+ . (3.12)
2π z̄ +
Bε
Here, S(z) is the total worldsheet supercurrent, given by S(z) = Sm (z) + Sgh (z). The
variance of the Gaussian in the first line is given by the super-period matrix, which is a
shift of the bosonic period matrix by an even, Grassmann valued, and nilpotent element,
given by [23],
i
Ω̂I J = ΩI J − d 2 z d 2 w ωI (z)χz̄+ Sδ (z, w)χw̄+ ωJ (w). (3.13)
8π
Furthermore, Vichi is the chiral part of the vertex operator Vi . All contractions in this chiral
correlator · · ·+ are to be carried out with the help of the propagators for the chiral fields
x+ and ψ+ given in (3.7). In the present paper, the emphasis will be on the chiral measure
and the cosmological constant. Therefore, the precise form of the operators Vi and their
chiral part Vichi will not be needed and will not be presented here.
where 1x+ denotes the Z2 -twisted chiral boson partition function. Carrying out also the
ψ+ contractions of the pε -dependent terms, the following result is obtained,
2
d z + µ
exp χ S(z) + pε µ
dz ∂z x+
2π z̄ x+ ,ψ+
Bε
1 + +
Here, the product of the Z2 -twisted chiral scalar and fermion partition functions 1x+
and 1ψ+ is denoted by ZC [δ, ε]. In the above formula, all the pε -dependence may be
regrouped in terms of a Gaussian with the following variance,
i
τ̃ ≡ τ − d 2 z d 2 w χz̄+ ωε (z)Sδ+ε (z, w)χw̄+ ωε (w). (3.16)
8π
26 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
This correction has the same origin as the corrections to the period matrix that lead to the
super period matrix in the uncompactified string. Its proper interpretation here is, however,
more subtle, and will be presented in detail later.
The contributions of the ghost partition function and supercurrent correlators are
the same as they were in flat space–time and may be taken from [2]. Assembling all
contributions, the chiral measure is given by the following expression
a b(pa ) α δ(β(qα ))M ZC [δ; ε]
The process will therefore only affect the terms in (3.17) that are independent of χ . The
effect of this change of variables is a change in the worldsheet metric by means of the
Beltrami differential µ̂, defined above (up to a diffeomorphism). In any correlator, this
change may be implemented via the insertion of the stress tensor, as was explained in [2].
The combination of partition functions and finite-dimensional determinants may be treated
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 27
ζ 1 ζ 2 Ξ6 [δ](Ω)ϑ[δ](0, Ω)4
AM [δ](Ω) = Z + · , (3.21)
16π 6 Ψ10 (Ω)
where the normalized partition function Z of chiral matter, ghosts and superghosts on the
bosonic surface with χ = 0 is given by
a b(pa ) α δ(β(qα ))M
Z≡
det ωI ωJ (pa )
ϑ[δ](0)5 ϑ(p1 + p2 + p3 − 3∆) a<b E(pa , pb ) a σ (pa )3
= 15 . (3.22)
Z ϑ[δ](q1 + q2 − 2∆)E(q1, q2 )σ (q1 )2 σ (q2 )2 det ωI ωJ (pa )
All quantities entering this expression were defined in Section 2, except for the chiral scalar
partition function Z −1 and the holomorphic 1-form σ (z), which are defined by
28 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
In this subsection, the role of the quantity τ̃ε in the exponential involving the internal
loop momentum pε in (3.24) is clarified. By construction, τ̃ε is invariant under local
worldsheet supersymmetry. At first sight, this property would appear to qualify τ̃ε for the
supersymmetric generalization of the Prym period τε , but this hypothesis is invalid for the
following reasons.
Recall the Schottky relations on a bosonic Riemann surface with period matrix ΩI J ,
already presented in (1.6) and reformulated in (2.12). The solution of the Schottky relations
for τε as a function of Ω for a twist ε was denoted by the function τε = Rε (Ω) in (1.6).
Actually, this function will be multi-valued because, for given Ω, the Schottky relations
determine τ only up to a shift τε → τε + 4. This multivaluedness is required in particular
by the fact that the Dehn twist A1 B2 A−1 −1
1 B1 , which does not act on Ω, shifts τε → τε + 4,
as shown in [22].
The implications for the super-Prym period and for the quantity τ̃ε are as follows. The
genus 2 super-Riemann surface, specified by the supermoduli (ΩI J , ζ α ), uniquely projects
to a bosonic Riemann surface with period matrix Ω̂I J . This projection automatically
entails an associated super-Prym period τ̂ε , which is defined through the bosonic Schottky
relations from the super-period matrix Ω̂I J . In summary, we have the relations,
i
τ̃ε − τε = − d z d 2 w ωε (z)χz̄+ Sδ+
(z, w)χw̄+ ωε (w),
2
8π
i
Ω̂I J − ΩI J = − d 2 z d 2 w ωI (z)χz̄+ Sδ (z, w)χw̄+ ωJ (w) (3.25)
8π
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 29
In this section, the chiral superstring measure will be constructed for more general com-
pactifications than those involving Z2 twists. The total space–time for the compactifica-
tions considered here will again be denoted by C. The chiral measure will be evaluated at
fixed even spin structure. As announced in [1], under some basic but mild assumptions, it
will be shown that the chiral blocks are independent of any choices of gauge slice, just as
they were in flat space–time. The assumptions are
(1) the compactification only modifies the matter part of the theory, leaving the
superghost part unchanged;
(2) the compactification respects local worldsheet supersymmetry, so that the super-
Virasoro algebra with matter central charge c = 15 is preserved.
A simple prescription for their calculation will be given first in split gauge and then in
terms of the OPE of two supercurrents.
Chiral splitting (the fact that the superstring amplitudes are the norms squared
of supermeromorphic functions on supermoduli space) holds for the contribution of
30 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
AC [δ] = dmA 2 AC [δ; λ]2 ,
λ sM A
2
2
chi d z +
AC [δ; λ] = δ HA |B Vi exp χz̄ SC (z) . (4.1)
2π λ,+
A i
Here, the subscript λ refers to the fact that the amplitude is evaluated in the sector
associated with the super-conformal family λ. Notice that, compared to (3.12), no
additional factor depending on internal momenta is exhibited. The presence of internal
momenta amongst the labels for super-conformal blocks is indeed model dependent and is
assumed to be part of the definition of · · ·λ,+ .
In terms of AM [δ], the following expression is obtained for the chiral measure,
ZC [δ; λ] 1
Here, we have used the fact that the ghost contributions in the correlators SM (z)SM (w)M
and SC (z)SC (w)Cλ as well as in TM (z)M and TC (z)Cλ are identical and cancel out
upon taking differences, leaving only the matter correlators in (4.2). It is understood that
all parts of (4.2) are expressed with respect to the super-period matrix Ω̂I J .
9 A subscript C has been appended to the supercurrent S because for general compactifications, the
supercurrent may not assume the flat space–time form; the latter will henceforth be denoted by SM .
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 31
The slice independence of AC [δ; λ] may be deduced from the slice independence of
AM [δ], which was already established in [2,3], together with general properties of the
supercurrent and stress tensor correlators which enter into (4.2). Since the ghost parts
have cancelled out of the stress tensor and supercurrent correlators, their singularities with
the ghost insertion points have also cancelled. Therefore, both Tm (z)Cλ and Tm (z)M
are holomorphic and their difference is a holomorphic 2-form that is well-defined on the
Riemann surface, and thus the formula for AC [δ; λ] is independent of the choice of µ̂
within a given super-conformal family λ.
The supercurrent insertions are similarly independent of the points qα . Since the ghost
parts of SC and SM coincide, all the singularities in z and w with the insertion points
pa and qα are identical, and cancel upon taking the difference between the C and M
contributions. Thus, the only possible singularities in the SS correlator is when z → w.
But this singularity is precisely cancelled by the presence of the stress tensor contribution,
as was shown in the flat case in [2,3].
The mutual cancellation of these singularities is also a necessary and sufficient in order
to maintain local worldsheet supersymmetry, as shown in [2]. Indeed, each singularity
presents an obstruction to supersymmetry invariance since the supersymmetry variation
δξ χz̄+ = −2∂z̄ ξ + will pick up non-vanishing contributions at the poles. Just as in [2] for
flat space–time M, the effect of the singularity in the supercurrent correlator at z = w is
precisely cancelled by the variation δξ µ = ξ + χz̄+ of the stress tensor term. In summary,
the insertion of SCm (z)SCm (w)Cλ − SMm (z)SMm (w)M is completely singularity free
and hence AC [δ; λ] is slice independent, just as AM [δ] was.
Here, a (z, w) is the unique form of degree 1 and holomorphic in both z and w such
that the 3 holomorphic 2-forms a (z, z) are normalized by a (pb , pb ) = δab . Explicit
forms were given in [2, Eq. (1.15)]. The difference of the supercurrent and stress tensor
correlators on manifolds C and M, given by
Q(q1 , q2 ) ≡ + SCm (q1 )SCm (q2 ) Cλ − SMm (q1 )SMm (q2 ) M
1
− a (q1 , q2 )Sδ (q1 , q2 ) TCm (pa ) Cλ − TMm (pa ) M (4.6)
2 a
is holomorphic in both q1 and q2 and odd under the interchange of q1 and q2 . Therefore,
its dependence on qα is determined uniquely up to a qα -independent multiplicative factor.
The expression for AC [δ; λ] becomes,
ZC [δ; λ] ζ 1 ζ 2 ϑ[δ](0)4 Ξ6 [δ] ζ 1 ζ 2
AC [δ; λ] = Z+ · − ZQ(q 1 2 .
, q ) (4.7)
ZM [δ] 16π 6 Ψ10 4π 2
A further simplification takes place in the split gauge, defined by the following relation
between the insertion points q1 and q2 , Sδ (q1 , q2 ) = 0. In this gauge, the expression for
AC [δ; λ] simplifies to
ZC [δ; λ] ζ 1 ζ 2 ϑ[δ](0)4 Ξ6 [δ] ζ 1 ζ 2
AC [δ; λ] = Z+ · − Z SCm (q1 )SCm (q2 ) Cλ .
ZM [δ] 16π 6 Ψ10 4π 2
(4.8)
In this subsection, an alternative formula for the chiral measure is provided in terms
of simple data that may be obtained from the supercurrent OPE. This calculation may be
carried out in terms of the operators OM and OC , defined as follows,
1 TC (z) + TC (w)
SC (z)SC (w) = + (z − w)OC (w) + O(z − w)2 ,
4 z−w
1 TM (z) + TM (w)
SM (z)SM (w) = + (z − w)OM (w) + O(z − w)2 . (4.9)
4 z−w
Notice that the leading cubic singularity cancels since the central charges for M and for
C are assumed to be the same, namely c = 15. A convenient form for the chiral measure
based on the above OPE operators is obtained by letting all points pa collapse to the point
q2 and subsequently letting q1 → q2 .
Using methods similar to those employed in [3, Section 3.4], a limiting formula is
obtained for the summation against µa of the full stress tensor T (pa ) ≡ TCm (pa )C −
TMm (pa )M in terms of holomorphic Abelian differentials ωα∗ with normalization
ωα∗ (qβ ) = δαβ . In the OPE relation of (4.9), it is customary to expand with respect to the
coordinate q2 of the second operator, so we also let pa → q2 for all a = 1, 2, 3. As a result,
1 1 ∂ω2∗ (q2 )
lim a (q1 , q2 )T (pa ) = ∂T (q ) − T (q2 ). (4.10)
2 ∂ω1∗ (q2 ) ∂ω1∗ (q2 )
2
pa →q2
a
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 33
Substituting this result into the measure factor, we obtain the chiral measure
ZC ζ 1 ζ 2 ϑ[δ]4 Ξ6 [δ] ζ 1 ζ 2 ϑ[δ]8 Q̂(p3 )
AC = Z+ · − · · ,
ZM 16π 6 Ψ10 4π 2 M2ν1 ν2 ων1 (p3 )2 ων2 (p3 )2
(4.17)
as well as the contribution to the cosmological constant from the left-moving sector, by
integrating over ζ α .
The starting point is the chiral measure of (3.24), evaluated in split gauge defined by
Sδ (q1 , q2 ) = 0. In this gauge, the following simplifications occur: Ω̂I J = ΩI J , and thus
µ̂ = 0 and the matter supercurrent correlator in M vanishes because the fermion propagator
Sδ is evaluated between the points q1 and q2 . The remaining expression is given by,10
ZC [δ; ε] iπ(τε +∆τε )pε2
AC [δ; ε](pε ) = e
ZM [δ]
ζ 1 ζ 2 Ξ6 [δ]ϑ[δ](0)4 ζ 1 ζ 2
× Z+ − Z Sm 1 m 2 ε ,
(q )S (q ) (5.1)
16π 6 Ψ10 4π 2
where Z was defined in (3.22). As this formula is written in split gauge, we have
Ω̂I J = ΩI J , τ̂ε = τε and the expression for ∆τε , derived in (3.29), is
iζ 1ζ 2
∆τε = − ωε (q1 )Sδ+
(q1 , q2 )ωε (q2 ). (5.2)
4π
The focus of this paper will be on the chiral measure and the cosmological constant, both
of which receive contributions only from the top term in ζ 1 ζ 2 . The resulting chiral measure
takes the following form,
dµC [δ; ε](pε ) ≡ dζ 2 dζ 1 AC [δ; ε](pε )
2 ZC [δ, ε] Ξ6 [δ]ϑ[δ](0)4 Z
= eiπτε pε − Sm (q1 )Sm (q2 ) ε
ZM [δ] 6
16π Ψ10 4π 2
+ iπpε2 Γ [δ, ε] , (5.3)
10 For Z -twisting, the supercurrents S and S take on the same functional form (as do the stress tensors T
2 C M C
and TM ); thus, the subscripts M and C will be dropped from the operators. The subscript C on the correlator will
be replaced with the twist ε for each twisted sector.
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 35
It remains to calculate the various terms in the above expression in terms of ϑ-functions,
which will be the subject of the remainder of this section.
It will be assumed that n dimensions are being Z2 -twisted, leaving 10 − n dimensions
untwisted, and that n is at most 8. In this case, only even spin structures δ need to be taken
into account. In particular, this will be the case for the models of [9]. The chiral partition
functions ZM and ZC are well-known, and given by
ϑ[δ](0, Ω) 5
ZM [δ] = ,
Z3
+ −
ϑ[δ](0, Ω) 5−n/2 ϑ[δj ](0, Ω)ϑ[δj ](0, Ω)ϑ[δ + ε](0, Ω) n/2
ZC [δ; ε] =
Z3 Z 3 ϑj (0, τε )2
(5.5)
for any pair δj± such that ε = −δj+ + δj− .
The supercurrent correlator may be calculated in terms of the twisted scalar and fermion
propagators evaluated in Section 2,
n
Sm (q1 )Sm (q2 ) ε = Bε (q1 , q2 )Sδ+ε (q1 , q2 ), (5.6)
4
where the twisted scalar propagator is
As q1 , q2 obey the split gauge relation Sδ (q1 , q2 ) = 0, the first term on the rhs above
vanishes and we have Bε (q1 , q2 ) = bi ωε (q1 )ωε (q2 ). Using the explicit expression for bi ,
computed in (2.27), we have
Sm (q1 )Sm (q2 ) ε = −iπnωε (q1 )ωε (q2 )Sδ − (q1 , q2 )∂τε ln ϑi (0, τε ). (5.9)
i
The combination of this correlator with the factor of Z may be re-expressed conveniently
in terms of Γ [δ; ε],
Z Sm (q1 )Sm (q2 ) ε = 4π 2 nΓ [δ; ε]∂τε ln ϑi (0, τε ). (5.10)
This leads to the following formula for the chiral measure in terms of Γ [δ; ε],
36 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
2 ZC [δ, ε]
dµC [δ; ε](pε ) = eiπτε pε
ZM [δ]
Ξ6 [δ]ϑ[δ](0)4
× + iπpε
2
− n∂τε ln ϑ i (0, τ ε ) Γ [δ; ε] .
16π 6Ψ10
(5.11)
It only remains to evaluate Γ [δ; ε].
ε = −νa + νb , δ = δi+ = νa + νc + νd ,
δ + ε = δi− = νb + νc + νd . (5.12)
By choosing (abcdef ) to be an appropriate permutation of (123456), any twist and spin
structure assignment may be reached by these conventions.
The general expression for Γ [δ; ε] was given in (5.4); it involves the partition function
Z which was given in (3.22). Using the expression for Z in split gauge, calculated in
Eq. (3.15) of [4], we obtain
i
Γ [δ, ε] ≡ −Zωε (q1 )Sδ − (q1 , q2 )ωε (q2 ),
4π i
(5.15)
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 37
In view of the earlier choice δ = δi+ , and the split gauge condition Sδ (q1 , q2 ) = 0, the above
formula simplifies when z = q1 and w = q2 ,
1
ωε (q1 )ωε (q2 ) = σ (µi , µj ) S + (q1 , q2 )Sδ − (q1 , q2 ). (5.16)
π 2 ϑk4 δj j
Expressions in terms of ϑ-constants are most easily obtained by placing insertion points at
branch points. If q1 , q2 are in split gauge, then their limits to branch points are such that
q1 and q2 belong to the same set in the partition of all six branch points associated with δ.
The key difficulty in the evaluation of Γ [δ; ε] is that Z diverges as the points q1 and q2 ,
are taken to branch points, since ϑ[δ](q1 + q2 − 2∆) vanish. It turns out that one of the
three Szegö kernels arising in Γ [δ; ε] also vanishes, thereby making their ratio finite.
Sδ+
(pc , pd ) ∼ ϑ[δ + ε](νc − νd ) ∼ ϑ[νb ] = 0,
ϑ[δ](pc + pd − 2∆) = ϑ[δ](νc + νd ) ∼ ϑ[νa ] = 0. (5.18)
The split gauge condition to linear order in t yields a non-trivial condition on the growths
q̇1 and q̇2 , given by
The exponential factors K3 and K4 will be evaluated later. The final result for Γ [δ; ε] is
now obtained as follows,
i σ (µi , µj ) ϑ[δ]2
Γ [δ, ε] = · κ · κ · , (5.29)
8π 7 ϑk4 ϑ[δj ]2 ϑ[δj− ]2 ϑ[δ + ε]2
+
Neither κ, nor κ is intrinsic, but the product σ (µi , µj )κκ will be, as will be evidenced
by the fact that the final expression for Γ [δ; ε] is in terms of squares of ϑ-functions only.
5.3.1. Calculating κ
Let us summarize the definitions of the factors entering into κ,
C = −exp{−8πiνc Ωνd },
Cc = exp{−πiνc Ωνc − 2πiνc νc },
Cd = exp{−πiνd Ωνd − 2πiνd νd },
K1 ωνb (z) = ωI (z)∂I ϑ[δ + ε](νc − νd ),
K2 ωνa (z) = ωI (z)∂I ϑ[δ](νc + νd ),
K3 ϑ[δk+ ](0) = ϑ[δj+ ](νc − νd ),
K4 ϑ[δk− ](0) = ϑ[δj− ](νc − νd ). (5.31)
The K-factors may be computed starting from the following basic formula of (B.9),
Notice that under c ↔ d, K1 → −K1 while K2 → +K2 , even though these symmetries
are not manifest. Multiplying all factors yields
CK1 K3 K4
κ= = exp 4πi(νd νc + νc νd − νd νa + νc νf − νd νf + νb νc ) (5.34)
Cc2 Cd2 K2
which indeed takes the values ±1.
5.3.2. Calculating κ
A direct calculation of κ from its definition
π 4 Mab
κ ≡ ϑ[δ]2 ϑ[δi+ ]ϑ[δi− ]ϑ[δj+ ]ϑ[δj− ]ϑ[δk+ ]ϑ[δk− ] (5.35)
Mac Mcd Mda
involves the non-intrinsic sign factors, and must be computed case by case. To simplify
the process, we shall carry out this calculation for a single non-trivial twist and obtain
the expression for Γ [δ; ε] for all twists by modular invariance. This choice uniquely
determines νa and νb (up to interchange of a and b). The choice made here is the standard
one,
0 0
ε= , νa = ν2 , νb = ν4 . (5.36)
0 12
The even spin structures are fixed through the choice of the odd spin structures and (2.35).
The calculation of κ is started with the evaluation of the product of all 6 even
spin structures, ϑ[δ2+ ]ϑ[δ2− ]ϑ[δ3+ ]ϑ[δ3− ]ϑ[δ4+ ]ϑ[δ4− ]. This object is independent of the
remaining choices of c, d, e, f , since a permutation of these objects simply permutes the
various factors in the product. To evaluate it, we choose (c, d, e, f ) = (1, 3, 5, 6). The
even spin structures are then determined by (B.7) (see also (2.11)), and the ϑ-constants are
expressed in normalized form by
ϑ[δ2+ ]ϑ[δ2− ]ϑ[δ3+ ]ϑ[δ3− ]ϑ[δ4+ ]ϑ[δ4− ] = +ϑ[δ1 ]ϑ[δ2 ]ϑ[δ3 ]ϑ[δ4 ]ϑ[δ7 ]ϑ[δ8 ]. (5.38)
Next, the expressions for Mab , a, b = 1, . . . , 6 are needed. As was shown in [4], these
objects may be expressed as products of ϑ-constants, times a non-intrinsic sign factor.
Normalizing the ϑ-constants on the even spin structures in canonical form, as given by
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 41
Table 1
Calculation of κ
c d δ m2c mcd md2 κ
1 3 δ7 + − − −
1 5 δ3 + + − +
1 6 δ2 + − − −
3 5 δ1 + + − +
3 6 δ4 + − − −
5 6 δ8 + − − −
(B.5), the sign factors mab are tabulated in (B.24). The quantities needed are given by
• c = 1, d = 6 implies
+ +
δi = δ2 , δj = ν1 + ν4 + νf ,
−
δi = δ1 , δj− = ν1 + ν2 + νf .
42 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
Table 2
Calculation of the sign of Γ [δ; ε]
c d f δ µi µj µk κ κ σ (µi , µj ) Sign
1 3 5 δ7 µ2 µ4 µ3 + − + −
6 µ3 µ4 + − +
1 5 3 δ3 µ4 µ2 µ3 + + − −
6 µ3 µ2 + + −
1 6 3 δ2 µ3 µ2 µ4 − − − −
5 µ4 µ2 + − +
3 5 1 δ1 µ3 µ2 µ4 − + − +
6 µ4 µ2 + + +
3 6 1 δ4 µ4 µ2 µ3 + − − +
5 µ3 µ2 + − −
5 6 1 δ8 µ2 µ4 µ3 − − + +
3 µ3 µ4 − − +
Here, (i, j, k) is a permutation of the genus 1 spin structure indices (2, 3, 4), and δ = δiσ
with σ = ±. A more illuminating expression, whose form is more manifestly covariant
is obtained by factoring out a combination of the Z qu [ε] partition function for 4 twisted
directions. Using the familiar genus 1 relation ϑi ϑj ϑk = 2η3 , this leads to the following
final expression,
i ϑi4 ϑ[δiσ ]2 ϑj4
Γ δiσ , ε = σ ν0 |µi (5.44)
(2π)7 η12 ϑ[δ (−σ ) ]2
i
ϑ[δj+ ]2 ϑ[δj− ]2
The structure of the genus 2 modular group Sp(4, Z) and its action on spin structures,
twists and ϑ-functions is summarized in Appendix B.
The subgroup of the modular group Sp(4, Z) which acts on spin structures or on half
characteristics is Sp(4, Z2 ). It is isomorphic to the permutation group of the six branch
points and therefore has 6! = 720 elements. To determine the subgroup Hε of Sp(4, Z2 )
that leaves a given twist ε invariant, we proceed as follows. First, make a definite choice
for a twist, so that the invariance equation becomes
A B 0 0 ε D −C ε
M= , ε= , = . (6.1)
C D 0 12 ε −B A ε
This equation puts the following restrictions on the entries A12 = C12 = C22 = 0 and
A22 = 1 of the matrices. All elements of the group Hε are found by solving the symplectic
relation MJ M T = J under these restrictions. The independent generators for Hε are
generators of Hε = {M1 , M2 , M3 , T2 = ΣT Σ, S2 = SM1 SM1 }. (6.2)
The full group may be parametrized in terms of the Abelian subgroup Mε of “translations”,
Hε0 = {I, M1 , M2 , M3 , M1 M2 , M1 M3 , M2 M3 , M1 M2 M3 },
Table 3
The modular group Hε acting on even spin structures
δ M1 M2 M3 T2 S2
2 (M1 )
2 (M2 )
2 (M3 )
2 (T2 )
2 (S2 )
δ1 δ3 δ2 δ1 δ1 δ3 1 1 1 1 i
δ2 δ4 δ1 δ2 δ2 δ4 1 1 1 1 i
δ3 δ1 δ4 δ3 δ4 δ7 1 1 1 1 1
δ4 δ2 δ3 δ4 δ3 δ8 1 1 1 1 1
δ5 δ6 δ5 δ6 δ9 δ6 1 i 1 1 i
δ6 δ5 δ6 δ5 δ0 δ9 1 i 1 1 1
δ7 δ7 δ8 δ8 δ7 δ1 i 1 1 1 i
δ8 δ8 δ7 δ7 δ8 δ2 i 1 1 1 i
δ9 δ9 δ9 δ0 δ5 δ5 i i −1 1 i
δ0 δ0 δ0 δ9 δ6 δ0 i i −1 1 −1
44 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
In (2.46), a Schottky relation is derived, but a sharper form (namely the precise sign
involved in taking the square root of this relation) is needed in the present study, as
indicated in (2.12). The square root may be fixed by checking the consistency of (2.12)
with modular transformations and with degenerations, both of which are carried out in this
subsection.
The crucial ingredients in this calculation are the factors
(δ, M)2 when M ∈ Hε , since
they will determine the transformation rules for the genus 2 ϑ-constants. To compute them,
we make use of the values for
(δ, Mi )2 , i = 1, 2, 3,
(δ, S)2 ,
(δ, T )2 , and
(δ, Σ)2 , given
in (B.16), as well as the following cocycle rule, which may be derived from the definition
of
(δ, M) in terms of the ϑ-constants,
(κ, MM ) =
(κ, M ) ×
(M κ, M). (6.4)
The results are summarized in Table 3, and may be readily applied to the calculation of the
pairwise products
(δj , M)2
(δj +1 , M)2 . The results are given in Table 4.
For the choice of twist made here, (2.12) becomes,
Table 4
The modular group Hε acting on pairs δ ± of
even spin structures
j M1 M2 M3 T S2
1 1 1 1 1 −1
3 1 1 1 1 1
7 −1 1 1 1 −1
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 45
The transformation properties of ϑ-constants for genus 1, under the two canonical
generators, denoted here by T (1) and S (1) are given by
T (1) : τ → τ + 1,
(6.7)
S (1) : τ → −1/τ.
We have
ϑ2 (τ + 1) = eiπ/4 ϑ2 (τ ),
T (1) ϑ3 (τ + 1) = ϑ4 (τ ),
ϑ4 (τ + 1) = ϑ3 (τ ),
√
ϑ2 (−1/τ ) = √−iτ ϑ4 (τ ),
S (1) ϑ3 (−1/τ ) = √−iτ ϑ3 (τ ), (6.8)
ϑ4 (−1/τ ) = −iτ ϑ2 (τ ).
Therefore, it is clear that, signs and all, we have
M1 −→ T (1) ,
S2 −→ T (1) S (1) . (6.9)
Therefore, the genus 2 modular transformations M1 and S2 indeed induce ordinary genus
1 modular transformations on the Prym period. As a result, all the modular transformations
in Hε induce modular transformations on τε , which belong to the genus 1 modular group,
according to the above correspondence.
Table 5
The modular group Hε acting on twists
M ε1 ε2 ε3 ε4 ε5 ε6 ε7 ε8 ε9 ε10 ε11 ε12 ε13 ε14 ε15 ε16
M1 ε1 ε2 ε3 ε4 ε5 ε6 ε12 ε14 ε16 ε15 ε11 ε7 ε13 ε8 ε10 ε9
M2 ε1 ε2 ε3 ε4 ε11 ε13 ε7 ε8 ε15 ε16 ε5 ε12 ε6 ε14 ε9 ε10
M3 ε1 ε2 ε3 ε4 ε6 ε5 ε8 ε7 ε10 ε9 ε13 ε14 ε11 ε12 ε16 ε15
T2 ε1 ε2 ε4 ε3 ε9 ε10 ε7 ε8 ε5 ε6 ε15 ε14 ε16 ε12 ε11 ε13
S2 ε1 ε2 ε12 ε14 ε5 ε16 ε3 ε4 ε6 ε15 ε11 ε7 ε10 ε8 ε13 ε9
The non-trivial orbits O± [ε] may be characterized in a simple modular covariant manner,
υ ∈ O± [ε] ⇔ ε|υ = ±1 and υ = ε1 , ε2 . (6.12)
The first twist may be denoted ε and chosen as in (6.1). The non-trivial cases arise when
the second twist α belongs to either orbit O± [ε]. In each orbit, any representative may be
taken for the second twist α. The stabilizer groups of two twists ε and α will be denoted
by Hε,α ; their generators may be deduced from inspection of Table 5, and given by
Hε,ε3 ∼ generators {M1 , M2 , M3 },
Hε,ε5 ∼ generators {M1 , S2 , (M3 S2 T2 )}. (6.13)
The group Hε,ε3 is Abelian, but the group Hε,ε5 is non-Abelian.
Given the choice of homology basis adopted throughout this paper, the period matrix
may be parametrized as follows,
τ τ
Ω= 1 , (7.1)
τ τ2
where τ1,2 , τ ∈ C, subject to the constraint Im Ω > 0. Degenerations fall into two
classes according to whether the degeneration separates the surface into two disconnected
components or leaves the surface connected. Separating (respectively non-separating)
degenerations result from the shrinking of a homologically trivial (respectively non-trivial)
1-cycle. The limit τ → 0 is separating, while the limits τ1 → +i∞ or τ2 → +i∞ are
non-separating.
The flat space–time chiral measure depends on the spin structure and the limiting
behavior of the measure therefore depends upon the inter-relation between the spin
structure the homology of the degenerating cycle. These degenerations were worked out
explicitly—to leading order—in [4, Section 8].
The Z2 -twisted chiral measure (1.10) depends on the spin structure δ, AND on the
twist ε. Its limiting behavior will therefore depend upon the inter-relations between not
only the spin structure and the homology of the degenerating cycle, but also upon the
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 47
twist. Here, we shall derive the limit of only two representative cases, one separating, the
other non-separating; the other cases are similar, but their number is simply too large to
discuss usefully here. Finally, only the case where n = 4 dimensions are Z2 -twisted will
be discussed, since this will be the case of greatest physical interest, as discussed in the
next subsection.
This limit corresponds to letting τ → 0, while keeping τ1,2 fixed. The leading behavior
was given in [4]; the leading and subleading behaviors are as follows,
µi
Ψ10 (Ω) = 212 (2πτ )2 η(τ1 )24 η(τ2 )24 1 + 48τ 2 ∂ ln η(τ1 )∂ ln η(τ2 ) . (7.2)
Here, µi and µj represent the even spin structures on each genus 1 component; the even
spin structure which restricts to the odd spin structure on each genus 1 component will not
be needed here.
First, the limiting behavior is needed for the Prym period, since it enters into the form
of the chiral measure. Using the Schottky relation (1.6) as well as the above limiting
behaviors, we find
τε = τ1 + τ 2 ∂ ln ϑ3 (0, τ2 )ϑ4 (0, τ2 ) + O τ 4 (7.3)
ZC [δiσ , ε] σ iσ ϑi (0, τε )4
Γ δi , ε = ν0 |µi , i = 2, 3, 4, σ = ±. (7.5)
ZM [δi ]
σ (2π)7 η(τε )12
In view of the relation between τ1 and τε , the limit as τ → 0 of this quantity is regular and
is obtained simply from the above formula by replacing τε by τ1 . Putting all together, we
48 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
find
2 ν0 |µi ϑi (0, τ1 )
4
dµC δiσ , ε pεµ = eiπτε pε 6 8
2 π η(τ1 )12
−σ 1 π2 2
× 2 − 2πi σ + ∂ ln ϑ i (0, τ 1 ) − σ p .
τ ϑ2 (0, τ2 )4 2 2 ε
(7.6)
The physical analysis of this limit is as follows.
The 1/τ 2 singularity, which is familiar from the flat space–time chiral measure,
produces tachyon and massless scalar singularities in the channel connecting both genus 1
components. This is as expected, since the six spin structures δiσ restrict to even spin
structures on each genus 1 component. The summation over σ corresponds to the GSO
projection imposed on states in the NS sector which are traversing the A2 cycle. This part
of the GSO projection eliminates the corresponding tachyon intermediate state, and indeed
the partially summed measure is regular, as may be seen from
dµC [δi+ , ε] pεµ + dµC [δi− , ε] pεµ
2 ν0 |µi ϑi (0, τ1 )4
= −ieiπτε pε ∂ ln ϑi (0, τ1 ). (7.7)
25 π 7 η(τ1 )12
Notice also that a partial GSO resummation over the spin structures of the first genus 1
component vanishes to this order,
dµC δiσ , ε pεµ = 0 (7.8)
i=2,3,4
combination,
ZC [δiσ , ε] σ iσ ϑi (0, τ1 )4
Γ δ i , ε → ν0 |µ i (7.11)
ZM [δiσ ] (2π)7 η(τ1 )12
has a smooth limit. Putting all together, we obtain the following limits for the full measure,
2 σ ν0 |µi ϑi (τ/2, τ1 ) + ϑ1 (τ/2, τ1 )
4 4
dµC δiσ ; ε (pε ) = eiπτε pε + O(1). (7.12)
q 28 π 6 η(τ1 )6 ϑ1 (τ, τ1 )2
This result coincides with the untwisted case, as expected.
In this section, specific physical superstring theories are considered that are constructed
from flat Minkowski space–time orbifolded by groups acting by reflections and shift.
When the action of the orbifold group is the same (respectively different) on left and
right movers, the orbifold is referred to as symmetric (respectively asymmetric). The
models of [9], for example, are asymmetric orbifolds of type II superstring theory. For
definiteness, the present study will concentrate on type II theories with six or fewer
compactified dimensions, but the methods may be extended to heterotic orbifolds and/or
more compactified dimensions.
The construction of symmetric orbifolds may be carried out directly from the functional
integral by inserting projection operators and including twisted sectors; for completeness,
it will be briefly reviewed below. The construction of asymmetric orbifolds, on the other
hand cannot, in general, be carried out directly from the functional integral. Therefore,
properties that are known to hold generally for symmetric orbifolds (such as their behavior
under modular transformations) may or may not hold for asymmetric orbifolds. Various
methods have been proposed to circumvent these obstacles. The method of [40] uses a
doubling of the left- and right-moving degrees of freedom, while the results of [41] rather
suggest the use of operator methods. (For reviews and further references, see, e.g., [42].)
The approach taken here to the construction of asymmetric orbifold superstring theories
will be based on chiral splitting. The starting point will be the construction of the chiral
blocks, carried out separately for the left-movers and for the right-movers. Once the chiral
blocks are in hand, string amplitudes are obtained by assembling these left and right chiral
blocks in a manner consistent with the definition of the asymmetric orbifold model as well
as with modular invariance. This approach will be discussed in enough detail here so that
the cosmological constant in the KKS models can be investigated. Further study will appear
in a forthcoming publication [24].
Rn /G = TG /P̄G , (8.1)
i.e., as the coset of Rn by the full orbifold group G, or as the coset of the torus TG by the
group P̄G = G/ΛG , which is isomorphic to PG . The construction via TG /P̄G is generally
more convenient (because the groups P̄G are usually finite, while G is always infinite)
and is better suited for dealing with asymmetric orbifolds; therefore it will be adopted
throughout. The orbifold is Abelian (respectively non-Abelian) if the point group PG ∼ P̄G
is Abelian (respectively non-Abelian).
The orbifold string theory is constructed as follows. Let AI , BI , I = 1, 2 be a canonical
µ
homology basis and define the fermion fields ψ± with a spin structure δ. To the basis cycles
(AI , BI )we associate a sector (aI , bI ) of elements of P̄G which satisfies the homotopy
relation I =1,2 aI bI aI−1 bI−1 = I . (For Abelian orbifolds, this condition is automatically
fulfilled.) The sector is untwisted if aI and bI equal the identity in P̄G , and is twisted
otherwise. In a given sector λ = (aI , bI ), the fields obey the following monodromy
conditions,11
(x, ψ± )(z + AI ) = aI x(z), (−1)2δI aI ψ± (z) ,
(x, ψ± )(z + BI ) = bI x(z), (−1)2δI bI ψ± (z) . (8.2)
The string amplitude in the sector λ is given by the familiar functional integral but where
the fields now obey the monodromy conditions (8.2),
AC [δ; λ] = DEM DΩM δ(T )
A
DXµ e−Im . (8.3)
(aI ,bI )
The same procedure of gauge-fixing and chiral splitting as was carried out in (3.12) may
be applied here, and results in a set of chiral blocks.
The chiral blocks are indexed by the sector labels λ = (aI , bI ) and by the internal mo-
menta associated respectively with uncompactified (pu ) and compactified (pλ ) dimensions.
The effects due to the uncompactified internal momenta pu are the same as in flat space–
time and yield the familiar measure factor (det Im Ω)−5+n/2 ; henceforth, the dependence
on pu will suppressed. The internal momenta pλ associated with the compactified direc-
tions are discrete and will take values in the lattice Λ∗G dual to ΛG . The chiral blocks
will be denoted by dµC [δ; λ](pλ ) = dµC [δ; aI , bI ](pλ ) and the amplitude AC take on the
11 In the fermion monodromy, an extra sign may arise depending on the reference spin structure chosen in the
given homology basis. For simplicity, this factor has been omitted here.
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 51
form,
AC [δ; λ] = (det Im Ω)−5+n/2 dµC [δ; λ](pλ)2 . (8.4)
M2 pλ ∈Λ∗G
The physical problem with this amplitude is the presence of the tachyon, which in
particular causes the integration over moduli space to diverge.
In the GSO projection, left and right fermions are treated independently (via chiral
splitting, see [20,21]), and assigned spin structures δL and δR which are summed over
independently. A consistent GSO projection will systematically eliminate the tachyon in
all superstring loops. Isolating the chiral blocks introduces a phase arbitrariness. Thus, the
GSO projection requires a consistent set of phases ηL [δL ; λ] and ηR [δR ; λ], which may
depend upon the sector λ. The GSO projected amplitude for the symmetric orbifold is,
ZG = (det Im Ω)−5+n/2 dµL [λ](pλ ) ∧ dµR [λ](pλ ), (8.5)
M2 λ pλ ∈Λ∗G
As is familiar from flat space–time, the phase assignments on the left and right GSO
summation may be taken to be the same (as in type IIA) or different (as in type IIB) from
one another, even though for symmetric orbifolds the chiral blocks for fixed spin structure
for right movers are simply the complex conjugate of those for left movers.
monodromy conditions
(∂z x, ψ+ )(z + AI ) = aLI ∂z x(z), (−1)2δI aLI ψ+ (z) ,
(∂z x, ψ+ )(z + BI ) = bLI ∂z x(z), (−1)2δI bLI ψ+ (z) ,
(∂z̄ x, ψ− )(z + AI ) = aRI ∂z̄ x(z), (−1)2δ̄I aRI ψ− (z) ,
(∂z̄ x, ψ− )(z + BI ) = bRI ∂z̄ x(z), (−1)2δ̄I bRI ψ− (z) . (8.8)
These monodromy conditions are, however, only formal, as no real field x in the functional
integral will exist that satisfies conditions with aLI = aRI or bLI = bRI .
The strategy adopted here is to obtain the amplitudes to higher loop order for the
asymmetric theory from the chiral blocks of symmetric orbifold theories. Specifically, each
element f ∈ PG is written as a pair f = (fL ; fR ) of rotations of Rn . The set of all elements
fL forms a group, which will be denoted PL , while the set of all elements fR forms a group
PR . The group PG is thus viewed as a subgroup of PL ⊗ PR subject to the pairing relation
f = (fL ; fR ). The chiral blocks for the left movers are the holomorphic blocks of the
symmetric theory with orbifold group PL , while the chiral blocks for the right movers are
the antiholomorphic blocks of the symmetric theory with orbifold group PR . The internal
momenta of the left and right movers do not need to match, as is familiar from toroidal and
orbifold compactifications of the heterotic string. They will be denoted by pL and pR ; the
pair (pL , pR ) belongs to a self-dual even lattice associated with G. When the elements of
P̄G have non-trivial shifts, the self-dual lattice will, in general, depend on the sector label
λ = (aI , bI ).
Thus, the block associated with the spin structures δL and δR and sector λ = (aI , bI ),
where aI = (aLI ; aRI ) and bI = (bLI ; bRI ), and chiral sector labels λL = (aLI , bLI ),
λR = (aRI , bRI ), is given by
Just as in the case of flat Minkowski space–time (or, as explained in the previous
subsection, for symmetric orbifold compactifications) a GSO projection must be performed
to eliminate the tachyon. For most interesting asymmetric orbifolds, this projection is also
carried out in a chiral manner, i.e., summing independently over the spin structures of left
and right fermions. The relevant GSO resummed blocks are then precisely those of (8.6)
but now with different sector labels for left and right,
dµL [λL ](pL ) = ηL [δL ; λL ] dµC [δL ; λL ](pL ),
δL
dµR [λR ](pR ) = ηR [δR ; λR ], dµC [δR ; λR ](pR ). (8.9)
δR
In the next subsection, constraints will be established on the phases ηL and ηR arising from
modular symmetry.
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 53
The proposal made here for the partition function ZG of the asymmetric orbifold theory
with asymmetric orbifold group G is given by the expression
ZG = (det Im Ω)−5+n/2 K(λL , λR ; pL , pR )
pL ,pR λL ,λR
M2
× dµL [λL ](pL ) ∧ dµR [λR ](pR ). (8.10)
Here, the coefficients K(λL , λR , pL , pR ) may depend on both sets of left and right group
elements and internal momenta. Non-trivial internal momentum dependence in K will
arise in particular when the group elements of G involve non-trivial shifts. Clearly, much
of the structure of the original asymmetric group G is encoded in these coefficients. Their
evaluation can be a subtle issue, even to one-loop, and we shall return to this in [24]. In this
paper, we shall restrict attention to the pointwise vanishing of the cosmological constant,
and examine only the vanishing of each GSO resummed chiral block dµL [λL ](pL ) and
dµR [λR ](pR ).
In the models to be studied below, the point group PG will be Abelian and generated by
chiral Z2 reflections (possibly together with an action on internal quantum numbers, such
as by the operators (−)F ). In all such cases, the chiral blocks will be given (up to phases)
by the Z2 -twisted blocks dµC [δ; ε](pε ) which were derived earlier. The sector labels will
be abbreviated by λ ≡ (aI , bI ). The twist ε, which enters the chiral blocks dµC [δ; ε](pε )
will be determined by the sector labels λ, whence the notation ε = ε(λ).
It is assumed that the GSO summation over δ is carried out independently on left-
and right-movers. For the models of greatest physical interest, the number of Z2 -twisted
dimensions is 4. This is the number of twisted dimensions in the KKS model, as well as in
the orbifolding of the type II theories by a pure Z2 twist which yield again the same type II
theory. Henceforth, the number of twisted dimensions will be assumed to be 4.
In view of the above assumptions, the relevant summation is a chiral summation over
the measure calculated in (5.11) with n = 4. The result is given by (we use the abbreviation
δ = δL , λ = λL and p = pL ),
dµL [λ](p) = ηL [δ; λ] dµC δ, ε(λ) (p) = dµ(1) (2)
L [λ](p) + dµL [λ](p), (8.11)
δ
where the partial measures are defined by (still using the notation ε = ε(λ))
2 ZC [δ, ε]
dµ(1)
L [λ](p) = ηL [δ; λ]eiπτε p iπp2 − 4∂τε ln ϑi (0, τε ) Γ [δ; ε],
ZM [δ]
δ
(2)
2 ZC [δ, ε] Ξ6 [δ]ϑ[δ]4
dµL [λ](p) = ηL [δ; λ]eiπτε p . (8.12)
ZM [δ] 16π 6 Ψ10
δ
Here, ηL [δ; λ] are the left chiral GSO projection phases, which remain to be determined.
The assumption n = 4 actually leads to considerable simplifications. A key ingredient in
the chiral blocks is the ratio of chiral partition functions for the twisted space dimensions.
54 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
Constraints on the GSO phases arise from the requirement of modular invariance of
the full string measure. Under a general modular transformation M ∈ Sp(4, Z), the period
matrix Ω, the internal momenta pL , the spin structure δ, the sector label λ = (aI , bI ) and
the twist ε are transformed to Ω̃, p̃L , δ̃, λ̃ and ε̃, according to familiar rules, which are
summarized in Appendix B. Modular invariance requires the following transformation law
for the GSO resummed measure,
dµL [λ̃](p̃L , Ω̃) = ϕ(λ, M)(cτε + d)−2 dµL [λ](pL , Ω). (8.15)
The factor (cτε + d)−2 represents the effect of the modular transformation induced by M
on the Prym period τε , and ϕ is a set of phases.
The constraints on the GSO phases η are most easily established by restricting M to
(1)
the subgroup Hε of the modular group which leaves ε invariant. The two terms dµL
(2)
and dµL are functionally independent, and therefore must each result from a modular
(2)
covariant GSO summation over δ. The modular transformations of dµL are obtained by
combining the relation
8.5. Z2 orbifolds
We now consider the model where 4 directions are compactified by an orbifold group G
whose point PG group consists of single Z2 chiral reflection rL of the fields x and ψ. Since
rL2 = 1, all sectors may be labeled precisely by a single twist ε, i.e., λ = ε. As a result, the
modular covariance condition (8.19) further simplifies and becomes,
ηL [δ̃; ε] = ϕ(ε, M)ηL [δ; ε] for all M ∈ Hε . (8.21)
By a modular transformation, ε may be brought to the standard form (6.1). The spin
structures transform under Hε in the two orbits listed in (8.20), and (8.21) is to be solved
separately in each orbit.
Observe that if a modular transformation M ∈ Hε leaves any one of the spin structures
δ invariant, one has ηL [δ; ε] = ϕ(ε, M)ηL [δ; ε], and thus ϕ(ε, M) = 1. (The alternative,
ηL [δ; ε] = 0 would lead to zero GSO phases throughout the orbit, and thus a vanishing
partition function, which is excluded.) It also follows that any spin structures related by M
will have the same GSO phase ηL [δ; ε] = ηL [δ̃; ε].
Applying these observations to the case of orbit {δ1 , δ2 , δ3 , δ4 , δ7 , δ8 }, simple inspection
of Table 3 reveals that M1 , M3 and T2 have fixed points, so that ϕ(ε, M1 ) = ϕ(ε, M3 ) =
ϕ(ε, T2 ) = 1, and ηL [δ1 ; ε] = ηL [δ2 ; ε], ηL [δ3 ; ε] = ηL [δ4 ; ε], and ηL [δ7 ; ε] = ηL [δ8 ; ε].
From the first two relations, it follows that ϕ(ε, S2 ) = 1, while from the last that ϕ(ε, M2 ) =
1. Using the action of S2 and M2 it follows that ηL [δ; ε] is independent of δ within this
56 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
entire orbit, and may be set equal to ηL [δ; ε] = 1. (The analysis in the case of the orbit
{δ5 , δ6 , δ9 , δ0 } leads to the conclusion that ηL [δ; ε] is constant throughout also this orbit.)
Since the chiral measure for Z2 reflections is non-zero for only the first orbit above, the
GSO resummed measure becomes simply,
dµL [ε](pL ) = dµL [δ, ε](pL ). (8.22)
δ
Notice that with a single twist, the assignment of GSO phases consistent with modular
invariance (in the even spin structure sector) is unique, a situation that is familiar from flat
space–time [4].
(2)
For the above GSO phases, the chirally summed measure dµC vanishes pointwise on
moduli space. To prove this, the starting point is a relation, derived first in [4],
z
Ξ6 [δ]ϑ[δ] Sδ (z, w) =
4 2
Ξ6 [δ]ϑ[δ] (0)ϑ[δ]
2 2
ωI =0 (8.23)
δ δ w
for any pair of points z, w on the surface. It suffices to choose the pair to be branch points,
canonically associated with the twist via ε = −νa + νb and z = ∆ + νb and w = ∆ + νa ,
where ∆ is the Riemann class. Using (B.9) and the specific form of the standard twist ε,
the desired relation is readily obtained,
Ξ6 [δ]ϑ[δ]2 ϑ[δ + ε]2 = 0 (8.24)
δ
(1,2)
The vanishing of dµL might have been expected, on the grounds that the model obtained
by orbifolding by a single Z2 reflection group produces an orbifold superstring theory that
coincides with the original type II theory.12
12 We are grateful to Eva Silverstein and Shamit Kachru for detailed explanations of this point.
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 57
The superscripts indicate the dimension labels on which the corresponding generator acts.
The operators (−)FL and (−)FR denote the parity of left and right chirality worldsheet
fermion number respectively, with value + on NS states and − on R states. The operations
rL , rR , sL and sR are chiral reflections and shifts, defined below.
The product
√ s = sL sR is the usual shift by half a circumference. At the self-dual radius
R = 1/ 2, both operators sL and sR are of order 4. Compactification to a radius R restricts
the momentum parameters (pL , pR ) to a discrete Lorentzian lattice ΓR . To simplify the
notation, we shall not indicate this explicitly.
Given a spin structure δ, the effect of gL is to assign a sign factor depending on whether the
state traversing the corresponding cycle is NS or R. This assignment takes a simple form
in terms of the signature,
NS state traversing cycle α α|δ = +1,
R state traversing cycle α α|δ = −1. (8.30)
To check this, it suffices to work out a specific twist assignment, e.g., the standard twist
α = ε of (6.1), whose only non-zero entry is ε2 = 1/2. If δ2 = 0, the state traversing the
A2 cycle is indeed NS and the signature assignment comes out to be +1, while if δ2 = 1/2,
the state is R and we get −1.
The net effect in the sector (aI , bI ), parametrized by two twists ε and α, is to produce a
Z2 reflection of the fields x and ψ+ on the cycle ε, and an insertion of the phase α|δ. As
(aI , bI ) will run over all sectors, λ ≡ (ε, α) will run over all pairs of half-characteristics. It
follows that the set of chiral blocks for the Z2 × Z2 theory is given by
dµL [ε, α̃](p̃L , Ω̃) ≡ ϕ(ε, α, M)(cτε + d)−2 dµL [ε, α](pL , Ω). (8.33)
Here, the phase factor ϕ(ε, α, M) = α|(M)0 depends only on M and α.13 The presence
of the phase factor α|δ represents a generalization of discrete torsion [43] to the chiral
case.
By analyzing the action of the modular subgroup that leaves two twists invariant,
as done in Section 6.4, one may show that the above phase assignment is the unique
non-trivial choice consistent with modular invariance. The proof is similar to the one
given to determine the unique phase assignment for the Z2 case in Section 8.5. This
will be proven here for the case α ∈ O+ [ε] only; the other case is analogous. Upon
making the choice α = ε3 , the stabilizer of both twists, Hε,α , is generated by M1 , M2
and M3 . Acting on the spin structures {δ1 , δ2 , δ3 , δ4 , δ7 , δ8 }, M1 and M3 have fixed
points. Therefore, ϕ(ε, α; M1 ) = ϕ(ε, α; M3 ) = 1 and thus ηL [δ1 ; ε, α] = ηL [δ3 ; ε, α],
ηL [δ2 ; ε, α] = ηL [δ4 ; ε, α], and ηL [δ7 ; ε, α] = ηL [δ8 ; ε, α]. In view of the last equality,
we must have ϕ(ε, α; M2 ) = 1, and we therefore conclude that
13 To be precise, (M) is the inhomogeneous contribution in the modular transformation of spin structures
0
given in (B.12), which is common to all spin structures.
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 59
If all η are equal, we recover the case of twisting by a single Z2 . The only other linearly
independent case is when
ηL [δi ; ε, α] = +1, i = 1, 2, 3, 4,
ηL [δ7 ; ε, α] = ηL [δ8 ; ε, α] = −1 (8.35)
but this assignment is readily seen to coincide with ηL [δ; ε, α] = α|δ, which is what we
set out to prove.
For each sector λ = (ε, α), the chiral block dµC [ε, α](pL , Ω) may be analyzed by
exploiting its modular properties. Using a first modular transformation, ε may be mapped
into the standard form, familiar from (6.1),
0 0
ε= . (8.36)
0 12
Under the modular subgroup Hε (which leaves ε invariant), twists transform in 4 orbits,
as was shown in Section 6.3. The contribution to the GSO resummed measure must be
analyzed orbit-by-orbit.
(1) α ∈ O0 [ε]. This orbit corresponds to the untwisted sector and has only one element,
α = 0. Using the results of Section 7.2, one establishes dµL [ε, 0](pL, Ω) = 0.
(2) α ∈ Oε [ε]. This orbit also has only one element, α = ε. Using the results (8.20), and
(7.2), one establishes dµL [ε, ε](pL , Ω) = 0.
(3) α ∈ O− [ε]. The twists in this orbit are such that α|ε = −1. One may choose the
representative
0 0
α = 1 .
2 0
The signature with the spin structures becomes explicit: δi± |α = ±1. Therefore, the
summation over i = 2, 3, 4 vanishes and we have dµ(1) L [ε](pL , Ω) = 0. The measure
(2)
dµL [ε](pL , Ω) involves the sum
Ξ6 [δi+ ] − Ξ6 [δi− ] ϑ[δi+ ]2 ϑ[δi− ]2 .
i=2,3,4
In the separating degeneration, this sum vanishes to order τ 2 included. This may be seen
from the following limit, derived from (7.2) for each j = 3, 4,
2 2
µi µi µi
Ξ6 ϑ ϑ = O τ4 (8.37)
µj 00 0 12
i=2,3,4
using the Riemann identities of (7.9) for the summation over i = 2, 3, 4. It may be shown
that the order τ 4 term, on the other hand is non-vanishing.
60 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
(4) α ∈ O+ [ε]. The twists in this orbit are such that α|ε = +1. One may choose the
representative
1
0 2
α=
0 0
(1)
so that δ|α = (−)2δ1 . The measure dµL [ε](pL , Ω) manifestly vanishes, as the contri-
(2)
butions from α = + and α = − cancel one another out. The measure dµL [ε](pL , Ω) on
the other hand vanishes only to leading order as τ → 0, but is non-vanishing to order τ 2 .
To see this, the following limit is taken
α|δΞ6 [δ]ϑ[δ]2 ϑ[δ + ε]2
δ
= −256π 2τ 2 η(τ1 )12 η(τ2 )12 ϑ38 − ϑ48 (τ1 )ϑ24 ϑ32 ϑ42 (τ2 ) + O τ 4 . (8.38)
Thus, the chiral measure dµL [ε, α](pε ) is non-vanishing on both orbits α ∈ O± [ε].
The non-vanishing of the contribution from the orbit O− [ε] would appear to contradict
the conclusions of [9], where, instead, it is claimed to be vanishing. The arguments made
in [9] are based on the fact that, when viewed as elements of the full orbifold group G, the
generators f and g do not commute with one another. Since their commutator is a pure
translation, the elements f and g do commute, however, when viewed as elements of the
point group P̄G .
Standard arguments for symmetric orbifolds, based on functional integral methods,
guarantee the equivalence between coseting Rn by G and coseting TG by P̄G . They also
guarantee
that no contributions will arise from any sector in which the homotopy relation
−1 −1
I =1,2 I I aI bI = 1 fails to be satisfied, whether aI , bI take values in G or in P̄G .
a b
For asymmetric orbifolds, however, no direct functional integral formulation is
available. Therefore, the arguments made in the case of symmetric orbifolds may or
may not hold for asymmetric orbifolds. In particular, it is unclear how the coset theory
Rn /G should be constructed, since it is unclear how the zero mode of the field x should
be handled. Only the construction of the coset TG /P̄G is well-defined. Its construction
guarantees
that no contributions will arise from a sector in which the homotopy relation
I =1,2 a I b I aI−1 bI−1 = 1 fails to be satisfied, only when aI , bI take values in the space
group PG (but not necessarily when f and g take values in the full group G).
To summarize the situation for orbit O− [ε]: the GSO resummed chiral blocks in orbit
O− [ε] are non-vanishing. Yet, it is conceivable that assembling left and right blocks of the
asymmetric orbifold will effectively enforce the vanishing of contributions arising from
sectors in which the homotopy relation I =1,2 aI bI aI−1 bI−1 = 1 fails to be satisfied, when
aI , bI take values in the full group G. No evidence in favor of this possibility is available,
however, at this time.
In the orbit O+ [ε], the chiral GSO resummed blocks are also non-vanishing. As their
contribution is O(τ 2 ) in the separating degeneration limit, they dominate in this limit
(over the orbit O− [ε], which is O(τ 4 )). This result contradicts the claim of [9], where
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 61
the corresponding chiral contribution was found to vanish. Noticing that the sectors for
this block all satisfy the homotopy relation, no cancellation should be expected when
assembling left and right blocks.
To summarize, the non-vanishing of the contribution of orbit O+ [ε] shows that the
cosmological constant cannot vanish pointwise on moduli space and gives strong evidence
that the full cosmological constant in the KKS models is non-vanishing. The net effect
of the correct treatment of the superstring measure given here is that the cosmological
constant in KKS models involves the sum over the new modular object Ξ6 [δ], discovered
in [4], which, in contrast to the measure derived from the BRST picture changing operator
formalism does not lead to a cancellation in the chiral blocks.
Acknowledgements
We are happy to acknowledge conversations with Michael Gutperle, Per Kraus, and
Edward Witten. We are especially grateful to Shamit Kachru and Eva Silverstein for
several helpful discussions of various fine points in the theory of asymmetric orbifold
compactifications and for detailed explanations of their specific models.
Let ω and ω be the two half period of the torus and τ = ω /ω its modulus. The
Weierstrass function obeys
℘ (z)2 = 4 ℘ (z) − e1 ℘ (z) − e2 ℘ (z) − e3 . (A.1)
Recall the prime form E and the Szegö kernel Sµ for even spin structure µ at genus 1,
ϑ1 (z − w|τ ) ϑ[µ](z − w|τ )
E(z, w) = , Sµ (z, w) = . (A.2)
ϑ1 (0|τ ) ϑ[µ](0|τ )E(z, w)
The correspondence between the classical Jacobi ϑi -functions and the spin structures is
given by ϑi (z|τ ) ≡ ϑ[µi ](z|τ ) with
µ1 = 12 12 , µ2 = 12 0 , µ3 = (0|0), µ4 = 0 12 . (A.3)
The Dedekind η-function is related to the product of all ϑ-constants,
ϑ1 (0|τ ) = −2πη(τ )3 = −πϑ2 (0|τ )ϑ3 (0|τ )ϑ4 (0|τ ). (A.4)
We have the following doubling formulas,
In this appendix, we review from [4] some fundamental facts about genus 2 Riemann
surfaces, their spin structures, ϑ-functions modular properties, and relations between the
ϑ-function and the hyperelliptic formulations.
On a Riemann surface Σ of genus 2, there are 16 spin structures, of which 6 are odd
(usually denoted by ν) and 10 are even (usually denoted by δ). Each spin structure κ can be
identified with a ϑ-characteristic κ = (κ |κ ), where κ , κ ∈ {0, 1/2}2 , represented here
by column matrices. The parity of the spin structure κ is that of the integer 4κ · κ . The
signature assignment between (even or odd) spin structures κ and λ is defined by
It will be convenient to use a definite basis for the spin structures, in a given choice of
homology basis, as in [4]. The odd spin structures may be labeled as follows,
0 0 0 1 1 0
2ν1 = , 2ν3 = , 2ν5 = ,
11 11 11
1 1 1 1 1 1
2ν2 = , 2ν4 = , 2ν6 = . (B.4)
00 01 10
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 63
The pairs for which νi |νj = −1 are 14, 16, 23, 25, 35, 46; all others give νi |νj = +1.
The even spin structures may be labeled by,
0 0 0 0 0 1 0 1
2δ1 = , 2δ2 = , 2δ3 = , 2δ4 = ,
00 01 00 01
0 0 0 1 1 0 1 0
2δ5 = , 2δ6 = , 2δ7 = , 2δ8 = ,
10 10 00 01
1 0 1 1
2δ9 = , 2δ0 = . (B.5)
10 11
The pairs for which δi |δj = −1 are 25, 26, 29, 20, 37, 38, 39, 30, 45, 46, 47, 48, 58, 50,
67, 69, 70, 89; all others give +1.
For genus 2 there exist many relations between even and odd spin structures, some of
which will be needed here. First, the sum of all odd spin structures is a double period,
ν1 + ν2 + ν3 + ν4 + ν5 + ν6 = 4δ0 . (B.6)
Second, each even spin structure δ can be written as δ = νi1 + νi2 + νi3 (modulo integral
periods), where the νia , a = 1, 2, 3 are odd and pairwise distinct,
ν1 + ν2 + ν3 = δ7 + 2ν3 , ν1 + ν2 + ν4 = δ5 + 2ν4 ,
ν1 + ν2 + ν5 = δ3 + 2ν5 , ν1 + ν2 + ν6 = δ2 + 2ν6 ,
ν1 + ν3 + ν4 = δ8 + 2ν3 , ν1 + ν3 + ν5 = δ0 + 2ν1 , (B.7)
ν1 + ν3 + ν6 = δ9 + 2ν3 , ν1 + ν4 + ν5 = δ4 + 2ν5 ,
ν1 + ν4 + ν6 = δ1 + 2δ0 , ν1 + ν5 + ν6 = δ6 + 2ν5 .
Clearly, the mapping {νi1 , νi2 , νi3 } → δ is 2 to 1, with νi1 + νi2 + νi3 and its complement
4δ0 − (νi1 + νi2 + νi3 ) corresponding to the same even spin structure, in view of (B.6).
B.2. ϑ-functions
Here, ϑ is even or odd in ζ depending on the parity of the spin structure. The following
useful periodicity relations hold, in which m , m ∈ Z2 and λ , λ ∈ C2 ,
one defines the particularly important ϑ-constants, ϑ[δ] ≡ ϑ[δ](0, Ω). For every odd spin
structure, there exists a Riemann relation for ϑ-constants,
ν|δϑ[δ]4 = 0. (B.10)
δ
Modular transformations M form the infinite discrete group Sp(4, Z), defined by
T
A B A B 0 I A B 0 I
M= , = , (B.11)
C D C D −I 0 C D −I 0
where A, B, C, D are integer valued 2 × 2 matrices and the superscript T denotes
transposition. To exhibit the action of the modular group on 1/2 characteristics, it is
convenient to assemble the 1/2 characteristics into a single column of 4 entries. In this
notation, the action of the modular group on spin structures κ and twists ε is then given by
[39]
κ̃ D −C κ 1 CD T
= + diag ,
κ̃ −B A κ 2 AB T
ε̃ D −C ε
= . (B.12)
ε̃ −B A ε
Here and below, diag(M) of a n × n matrix M is an 1 × n column vector whose entries
are the diagonal entries on M. On the period matrix, modular transformations act by
Ω̃ = (AΩ + B)(CΩ + D)−1 , while on the Jacobi ϑ-functions, we have
T
ϑ[κ̃] (CΩ + D)−1 ζ, Ω̃
−1 Cζ
=
(κ, M) det(CΩ + D)1/2 eiπζ(CΩ+D) ϑ[κ](ζ, Ω), (B.13)
where κ = (κ |κ ) and κ̃ = (κ̃ |κ̃ ). The phase factor
(κ, M) depends upon both κ and
the modular transformation M and obeys
(κ, M)8 = 1. Its expression was calculated in
[39], and is given by
(δ, M) =
0 (M) exp{2πiφδ (M)}, where
1
0 (M) = exp 2πi tr(M − I ) ,
2
8
1 T 1
φδ (M) = − δ D Bδ + δ B T Cδ − δ C T Aδ
2 2
1
+ δ D T − δ C T diag AB T . (B.14)
2
The modular group is generated by the following elements
I Bi 1 0 0 0 0 1
Mi = , B1 = , B2 = , B3 = ,
0 I 0 0 0 1 1 0
0 I
S= ,
−I 0
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 65
Table 6
Modular transformations of even spin structures
δ M1 M2 M3 S Σ T
2 (δ, M1 )
2 (δ, M2 )
2 (δ, M3 )
δ1 δ3 δ2 δ1 δ1 δ1 δ1 1 1 1
δ2 δ4 δ1 δ2 δ5 δ3 δ4 1 1 1
δ3 δ1 δ4 δ3 δ7 δ2 δ3 1 1 1
δ4 δ2 δ3 δ4 δ9 δ4 δ2 1 1 1
δ5 δ6 δ5 δ6 δ2 δ7 δ5 1 i 1
δ6 δ5 δ6 δ5 δ8 δ8 δ6 1 i 1
δ7 δ7 δ8 δ8 δ3 δ5 δ9 i 1 1
δ8 δ8 δ7 δ7 δ6 δ6 δ0 i 1 1
δ9 δ9 δ9 δ0 δ4 δ9 δ7 i i −1
δ0 δ0 δ0 δ9 δ0 δ0 δ8 i i −1
σ 0 0 1
Σ= , σ= ,
0 −σ −1 0
τ+ 0 1 1 1 0
T= , τ+ = , τ− = . (B.15)
0 τ− 0 1 −1 1
The transformation laws for even spin structures under these generators are given in
Table 6; those for odd spin structures will not be needed here and may be found in [4].
We shall be most interested in the modular transformations of ϑ-constants ϑ 2 [δ] and
thus in even spin structures δ and the squares of
, which are given by
(δ, M1 ) = exp{2πiδ1(1 − δ1 )},
2
(δ, S) = −1,
2
(δ, M2 ) = exp{2πiδ2(1 − δ2 )},
2
(δ, Σ)2 = −1, (B.16)
(δ, M3 )2 = exp{−4πiδ1 δ2 },
(δ, T )2 = +1.
A convenient way of establishing these values is by first analyzing the case of the shifts
Mi , whose action may be read off from the definition of the ϑ-function. The values for the
transformations S, Σ and T may be obtained by letting the surface undergo a separating
degeneration Ω12 → 0, and using the sign assignments of genus 1 ϑ-functions. The non-
trivial entries for
2 are listed in Table 6.
parametrized by14
6
s2 = (x − pa ). (B.17)
a=1
In the hyperelliptic representation, there is another convenient way of identifying spin
structures. Each spin structure can be viewed then as a partition of the set of branch points
pa , a = 1, . . . , 6 into two disjoint subsets, in the following way.
ν odd ⇔ branch point pa ,
δ even ⇔ partition A ∪ B, A = {pa , pb , pc }, B = {pd , pe , pf }, (B.18)
where (a, b, c, d, e, f ) is a permutation of (1, 2, 3, 4, 5, 6).
As shown in [4], Thomae relations may be derived most easily in terms of the bilinear
ϑ-constants Mab ≡ Mνa νb which were defined in [4],
Mab ≡ ∂1 ϑ[νa ](0, Ω)∂2ϑ[νb ](0, Ω) − ∂2 ϑ[νa ](0, Ω)∂1 ϑ[νb ](0, Ω). (B.19)
The abbreviation ∂I ϑ[ν] ≡ ∂I ϑ[ν](0, Ω) will be convenient. The holomorphic 1-form
ωνa (z), defined for any odd spin structure νa by ωνa (z) ≡ ωI (z)∂I ϑ[νa ], has a unique
double zero at pa , which is the branch point that may be canonically associated with νa .
The following fundamental results may be derived from a comparison of holomorphic
forms in the ϑ-functions and hyperelliptic formulations,
Nνb (pa − pb ) ωνb (pa ) Mνa νb
= = , (B.20)
Nνc (pa − pc ) ωνc (pa ) Mνa νc
where Nνa depends only on a and not on b. Taking the cross ratio of four branch points
(with a, d = b, c), the normalization factors N cancel out and we get the desired identity
pa − pb pc − pd M νa νb M νc νd
· = . (B.21)
pa − pc pb − pd M νa νc M νb νd
This is a Thomae-type formula, relating ϑ-constants to rational expressions of branch
points.
In [4], it was shown that the following relation holds between Mab and the ϑ-constants
for even spin structure,
Mab = mab π 2 ϑ[νa + νb + νc ], (B.22)
c=a,b
14 It is customary to introduce a local coordinate system z(x) = (x, s(x)), which is well-defined also at the
branch points. Throughout, the formulas in the hyperelliptic representation will be understood in this way.
However, to simplify notation, the local coordinate z(x) will not be exhibited explicitly.
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 67
where m2ab = 1. The sign factor mab is not intrinsic, for two reasons. First, Mab is odd
under a ↔ b, while the product of ϑ-constants on the rhs is even. Second, the ϑ-constants
on the lhs and on the rhs arise to the first power, so that Mab is actually sensitive to
shifts in the ν’s by single periods (although they are invariant under shifts by any double
period). Here, the convention will be taken that the odd spin structures ν are normalized
as given in (B.4), while the even spin structures νa + νb + νc are truncated to the standard
normalization given in (B.5),
νa + νb + νc ≡ δi , i = 0, 1, . . . , 9. (B.23)
In the calculation of Γ [δ; ε] above, the signs are in fact needed. The sign factors are then
given as follows,
mab = −mba ,
mab = +1, ab = 15, 23, 25, 26, 35, 45, 46,
mab = −1, ab = 12, 13, 14, 16, 24, 34, 36, 56. (B.24)
The derivation was given in [4]. To obtain the results in the above form, one proceeds as
follows. Let ν0 be the unique genus 1 odd spin structure and let µ1 , µ3 and µ5 (and denote
an independent set by µ2 , µ4 and µ6 ) denote the three distinct genus 1 even spin structures,
obeying µ1 + µ3 + µ5 = ν0 , then any pair of genus 2 odd spin structures may be expressed
in the following basis,
µ1 ν µ3
ν1 = , ν2 = 0 , ν3 = . (B.25)
ν0 µ2 ν0
The result of [4] is then,
µ3 µ5 µ1 µ1
Mν1 ν2 = −π 2 ϑ ϑ ϑ ϑ ,
µ2 µ2 µ4 µ6
ν µ5 µ5 µ5
Mν1 ν3 = −π 2 σ (µ1 , µ3 )ϑ 0 ϑ ϑ ϑ , (B.26)
ν0 µ2 µ4 µ6
with the sign factor σ (µ1 , µ3 ) = −σ (µ3 , µ1 ) given as follows,
σ (0|0), 0 1 = σ 1 0 , (0|0) = σ 1 0 , 0 1 = +1.
2 2 2 2 (B.27)
Notice that the ordering of µ1 and µ2 determines the sign on the right hand side. Going
through all cases, (B.24) is readily derived.
References
[1] E. D’Hoker, D.H. Phong, Two-loop superstrings I, main formulas, Phys. Lett. B 529 (2002) 241–255, hep-
th/0110247.
[2] E. D’Hoker, D.H. Phong, Two-loop superstrings II, the chiral measure on moduli space, Nucl. Phys. B 636
(2002) 3–60, hep-th/0110283.
[3] E. D’Hoker, D.H. Phong, Two-loop superstrings III, slice independence and absence of ambiguities, Nucl.
Phys. B 636 (2002) 61–79, hep-th/0111016.
68 K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69
[4] E. D’Hoker, D.H. Phong, Two-loop superstrings IV, the cosmological constant and modular forms, Nucl.
Phys. B 639 (2002) 129–181, hep-th/0111040.
[5] E. D’Hoker, D.H. Phong, Lectures on two-loop superstrings, Hangzhou, Beijing, 2002, hep-th/0211111.
[6] Z.J. Zheng, J.B. Wu, C.J. Zhu, Two-loop superstrings in hyperelliptic language. I: the main results, Phys.
Lett. B 559 (2003) 89, hep-th/0212191;
Z.J. Zheng, J.B. Wu, C.J. Zhu, Two-loop superstrings in hyperelliptic language. II: the vanishing of the
cosmological constant and the non-renormalization theorem, Nucl. Phys. B 663 (2003) 79, hep-th/0212198;
Z.J. Zheng, J.B. Wu, C.J. Zhu, Two-loop superstrings in hyperelliptic language. III: the four-particle
amplitude, Nucl. Phys. B 663 (2003) 95, hep-th/0212219.
[7] C.J. Zhu, Two-loop computation in superstring theory, hep-th/0301018;
W.J. Bao, C.J. Zhu, Comments on two-loop four-particle amplitude in superstring theory, JHEP 0305 (2003)
056, hep-th/0303152.
[8] E. Martinec, Non-renormalization theorems and fermionic string finiteness, Phys. Lett. B 171 (1986) 189;
R. Iengo, C.J. Zhu, Notes on non-renormalization theorem in superstring theories, Phys. Lett. B 212 (1988)
309;
O. Lechtenfeld, W. Lerche, On non-renormalization theorems for four-dimensional superstrings, Phys. Lett.
B 227 (1989) 373.
[9] S. Kachru, J. Kumar, E. Silverstein, Vacuum energy cancellation in a nonsupersymmetric string, Phys. Rev.
D 59 (1999) 106004, hep-th/9807076;
S. Kachru, E. Silverstein, Self-dual nonsupersymmetric type II string compactifications, JHEP 9811 (1998)
001, hep-th/9808056;
S. Kachru, E. Silverstein, On vanishing two loop cosmological constants in nonsupersymmetric strings,
JHEP 9901 (1999) 004, hep-th/9810129.
[10] J. Harvey, String duality and non-supersymmetric strings, hep-th/9807213;
Z. Kakushadze, S.H.H. Tye, Brane world, Nucl. Phys. B 548 (1999) 180, hep-th/9809147;
G. Shiu, S.H.H. Tye, Bose–Fermi degeneracy and duality in non-supersymmetric strings, Nucl. Phys. B 542
(1999) 45, hep-th/9808095.
[11] S. Deser, B. Zumino, Broken Supersymmetry and Supergravity, Phys. Rev. Lett. 38 (1977) 1433;
E. Cremmer, B. Julia, J. Scherk, S. Ferrara, L. Girardello, P. van Nieuwenhuizen, Spontaneous symmetry
breaking and Higgs effect in supergravity without cosmological constant, Nucl. Phys. B 147 (1979) 105;
E. Cremmer, S. Ferrara, C. Kounnas, D.V. Nanopoulos, Naturally vanishing cosmological constant in N = 1
supergravity, Phys. Lett. B 133 (1983) 61;
A.B. Lahanas, D.V. Nanopoulos, The road to no-scale supergravity, Phys. Rep. 145 (1987) 1;
L. Andrianopoli, R. D’Auria, S. Ferrara, M.A. Lledo, Gauging of flat groups in four-dimensional
supergravity, JHEP 0207 (2002) 010, hep-th/0203206.
[12] D. Friedan, E. Martinec, S. Shenker, Conformal invariance, supersymmetry, and string theory, Nucl. Phys.
B 271 (1986) 93.
[13] E. Verlinde, H. Verlinde, Multiloop calculations in covariant superstring theory, Phys. Lett. B 192 (1987)
95–102;
H. Verlinde, A note on the integral over fermionic supermoduli, Utrecht Preprint No. THU-87/26, 1987,
unpublished.
[14] R. Iengo, C.J. Zhu, Evidence for non-vanishing cosmological constant in non-SUSY superstring models,
JHEP 0004 (2000) 028, hep-th/9912074.
[15] E. Gava, R. Iengo, Modular invariance and the two loop vanishing of the cosmological constant, Phys. Lett.
B 207 (1988) 283;
C.J. Zhu, Two loop computations in superstring theories, Int. J. Mod. Phys. A 4 (1989) 3877.
[16] J. Atick, J. Rabin, A. Sen, An ambiguity in fermionic string perturbation theory, Nucl. Phys. B 299 (1988)
279–294.
[17] G. Moore, A. Morozov, Some remarks on two-loop string calculations, Nucl. Phys. B 306 (1988) 387–404.
[18] H. La, P. Nelson, Unambiguous fermionic string amplitudes, Phys. Rev. Lett. 63 (1989) 24–27.
[19] J. Atick, G. Moore, A. Sen, Some global issues in string perturbation theory, Nucl. Phys. B 308 (1988) 1;
J. Atick, G. Moore, A. Sen, Catoptric tadpoles, Nucl. Phys. B 307 (1988) 221–273.
[20] E. D’Hoker, D.H. Phong, Superholomorphic anomalies and supermoduli space, Nucl. Phys. B 292 (1987)
317.
K. Aoki et al. / Nuclear Physics B 688 (2004) 3–69 69
[21] E. D’Hoker, D.H. Phong, Conformal scalar fields and chiral splitting on super Riemann surfaces, Commun.
Math. Phys. 125 (1989) 469–513.
[22] R. Dijkgraaf, E. Verlinde, H. Verlinde, c = 1 conformal field theories on Riemann surfaces, Commun. Math.
Phys. 115 (1988) 649–690.
[23] E. D’Hoker, D.H. Phong, The geometry of string perturbation theory, Rev. Mod. Phys. 60 (1988) 917–1065.
[24] K. Aoki, E. D’Hoker, D.H. Phong, On the construction of some asymmetric orbifold models, in press.
[25] E. Verlinde, H. Verlinde, Chiral bosonization, determinants and the string partition function, Nucl. Phys.
B 288 (1987) 357;
E. Verlinde, H. Verlinde, Superstring Perturbation Theory, in: M. Green, et al. (Eds.), Superstrings, vol. 88,
World Scientific, Singapore, 1989, pp. 222–240.
[26] R. Russo, S. Sciuto, Twisted determinants on higher genus Riemann surfaces, Nucl. Phys. B 669 (2003) 207;
R. Russo, S. Sciuto, Twisted determinants and bosonic open strings in an electromagnetic field, hep-
th/0312205.
[27] J. Fay, Theta Functions on Riemann Surfaces, in: Springer Lecture Notes in Mathematics, vol. 352, Springer,
Berlin, 1973.
[28] P. Ginsparg, Applied Conformal Field Theory, in: E. Brézin, J. Zinn-Justin (Eds.), Les Houches 1988,
Session XLIX, Fields, Strings and Critical Phenomena, North-Holland, Amsterdam, 1990.
[29] L. Dixon, P. Ginsparg, J.A. Harvey, ĉ = 1 superconformal field theory, Nucl. Phys. B 306 (1988) 470.
[30] A. Erdelyi (Ed.), Higher Transcendental Functions, in: Bateman Manuscript Project, vol. II, Krieger,
Melbourne, FL, 1981.
[31] F. Gliozzi, J. Scherk, D. Olive, Supersymmetry, supergravity theories and the dual spinor model, Nucl. Phys.
B 122 (1977) 253;
E. Witten, in: W.A. Bardeen, A.R. White (Eds.), Symposium on Anomalies, Geometry, Topology, World
Scientific, Singapore, 1985, p. 61;
N. Seiberg, E. Witten, Spin structures in string theory, Nucl. Phys. B 276 (1986) 272.
[32] M.B. Green, J.H. Schwarz, Supersymmetrical string theories, Phys. Lett. B 109 (1982) 444.
[33] D.J. Gross, J.A. Harvey, E.J. Martinec, R. Rohm, Heterotic string theory, II. The interacting heterotic string,
Nucl. Phys. B 267 (1986) 75.
[34] E. Martinec, Superspace geometry of superstrings, Phys. Rev. D 28 (1983) 2604;
D. Friedan, Notes on string theory and two-dimensional conformal field theory, in: M.B. Green, D.J.
Gross (Eds.), Unified String Theories: Proceedings of the 1985 Santa Barbara Workshop, World Scientific,
Singapore, 1986;
S. Chaudhuri, H. Kawai, H. Tye, Path integral formulation of closed strings, Phys. Rev. D 36 (1987) 1148.
[35] E. D’Hoker, D.H. Phong, Loop amplitudes for the fermionic string, Nucl. Phys. B 278 (1986) 225;
G. Moore, P. Nelson, J. Polchinski, Strings and supermoduli, Phys. Lett. B 169 (1986) 47.
[36] A. Belavin, V. Knizhnik, Algebraic geometry and the theory of quantized strings, Phys. Lett. B 168 (1986)
201.
[37] E. D’Hoker, D.H. Phong, Superstrings, Super Riemann Surfaces, and Supermoduli Space, in: Symposia
Mathematica, in: String Theory, vol. XXXIII, Academic Press, London, 1990, Preprint UCLA-89-TEP-32.
[38] K. Aoki, E. D’Hoker, D.H. Phong, Unitarity of closed superstring perturbation theory, Nucl. Phys. B 342
(1990) 149.
[39] J.I. Igusa, Theta Functions, Springer-Verlag, Berlin, 1972;
J.I. Igusa, On Siegel modular forms of genus two, Am. J. Math. 84 (1962) 175;
J.I. Igusa, On the graded ring of theta-constants, Am. J. Math. 86 (1964) 219.
[40] K.S. Narain, M.H. Sarmidi, C. Vafa, Nucl. Phys. B 288 (1987) 551.
[41] R. Dijkgraaf, C. Vafa, E. Verlinde, H. Verlinde, Commun. Math. Phys. 123 (1989) 485.
[42] E. Kiritsis, Introduction to string theory, hep-th/9709062;
W. Lerche, A.N. Schellekens, N.P. Warner, Phys. Rep. 177 (1989) 1.
[43] C. Vafa, Nucl. Phys. B 273 (1986) 592.
Nuclear Physics B 688 (2004) 70–100
www.elsevier.com/locate/npe
Abstract
Sen has recently drawn attention to an exact time-dependent boundary conformal field theory
with the space–time interpretation of brane creation and annihilation. An interesting limit of this
BCFT is formally equivalent to an array of D-branes located in imaginary time. This raises the
question: what is the meaning of D-branes in imaginary time? The answer we propose is that
D-branes in imaginary time define purely closed string backgrounds. In particular we prove that
the disk scattering amplitude of m closed strings off an arbitrary configuration of imaginary branes is
equivalent to a sphere amplitude with m + 1 closed string insertions. The extra puncture is a specific
closed string state, generically normalizable, that depends on the details of the brane configuration.
We study in some detail the special case of the array of imaginary D-branes related to Sen’s BCFT
and comment on its space–time interpretation. We point out that a certain limit of our set-up allows to
study classical black hole creation and suggests a relation between Choptuik’s critical behavior and
a phase-transition à la Gregory–Laflamme. We speculate that open string field theory on imaginary
D-branes is dual to string theory on the corresponding closed string background.
2004 Elsevier B.V. All rights reserved.
1. Introduction
Worldsheet duality between open and closed strings is one of the truly fundamental
ideas of string theory. Many modern developments have originated from the application
of this idea to supersymmetric configurations of D-branes. In this paper we show that the
study of non-supersymmetric, unstable D-branes gives some clues for a new intriguing
incarnation of open/closed string duality.
Our starting point is the real-time process of D-brane creation and annihilation, which
has recently attracted much attention [1–21]. In particular, Sen [2] introduced a simple
0550-3213/$ – see front matter 2004 Elsevier B.V. All rights reserved.
doi:10.1016/j.nuclphysb.2004.03.017
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 71
class of models in bosonic string theory obtained by perturbing the flat-space c = 26 CFT
with the exactly marginal deformation
λ dt cosh X0 (t) , (1.1)
where t is a coordinate on the worldsheet boundary, and λ is a free parameter in the range
0 λ 12 . This is a family of exact solutions of classical open string theory whose space–
time picture is that of an unstable brane being created at a time X0 ∼ −τ and decaying at
a time X0 ∼ +τ , with τ = − log(sin(πλ)). For λ = 12 the lifetime of the brane is zero, that
is, there is no brane to be found anywhere. Moreover, the corresponding boundary state
appears to vanish identically [2]. This is fascinating as it seems to suggest that for λ = 12
the BCFT (1.1) describes the stable closed string vacuum, where open string degrees of
freedom are absent. Somehow the boundary perturbation (1.1) with λ = 12 must get rid of
the worldsheet boundaries!
In the framework of (1.1), we have the opportunity to precisely test a scenario [22,23]
for how purely closed string amplitudes may be obtained at the tachyon vacuum. The basic
idea is that as the tachyon condenses, worldsheets with large holes are suppressed, and the
integration over moduli space should localize to the region where the holes shrink to points.
This heuristic picture was made somewhat more concrete in [23], where it was argued that
amplitudes for m external closed strings on the disk reduce at the tachyon vacuum to sphere
amplitudes with the same m closed string punctures plus an additional insertion of a zero-
momentum state (possibly a soft dilaton), a remnant of the shrunk boundary. This analysis
[23] was performed in the framework of a regulated version of vacuum string field theory
[24]. However, due to subtleties in the regulation procedure, it was difficult to make this
conclusion completely precise.
Since we wish to focus on the case λ = 12 , it is very useful to realize that at this critical
value the BCFT admits a simple description. By Wick rotation X0 → iX, one obtains the
well-known exactly marginal deformation λ cos(X) [25] (in fact this is how Sen arrived
at (1.1) in the first place), which for λ = 12 is equivalent to an infinite array of D-branes
located at X = 2π(n + 12 ) [25,26]. We could thus say that at the critical value λ = 12 ,
the time-dependent boundary deformation (1.1) becomes an array of D-branes located at
imaginary times
1
X0 = i2π n + , n ∈ Z. (1.2)
2
This is an empty statement if we do not define the meaning of D-branes in imaginary time.
Our approach to that issue is very simple: any quantity that one wishes to compute for a
configuration of D-branes in imaginary time should be obtained by Wick rotation of the
configuration in real space. This prescription gives consistent answers with an interesting
physical meaning.
A natural class of observables is given by scattering amplitudes of closed strings on the
disk in the background of the brane configuration (1.2). We find that these reduce to sphere
amplitudes with m + 1 punctures. The extra puncture is, however, not a soft dilaton as in
[23], but a non-trivial closed string state that involves the whole tower of massive modes.
Thus the BCFT (1.1) with λ = 12 describes a purely closed string background with no
72 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
physical open string degrees of freedom. This background, however, not is the closed string
vacuum, but a specific time-dependent state with non-zero energy. The detailed features of
this background are very reminiscent of ‘tachyon matter’ [3,4].
While these results were first obtained in the special case (1.2), the basic conclusion is
much more general. A generic configuration of imaginary D-branes defines a closed string
background. The details of the background depend on the details of the configuration of
imaginary D-branes. Exactly marginal open string deformations, for example deformations
that move the positions of the imaginary branes, are naturally reinterpreted as deformations
of the closed string background. The case (1.2) is seen to be very special as the closed string
state has divergent norm [12], and the background admits an additional exactly marginal
deformation which is associated with the creation of an actual brane in real time.
An outline of the paper is as follows. In Section 2 we spell out the basic prescription for
how to deal with the array of D-branes in imaginary time. We consider a more general
case in which X0 = ia(n + 12 ), where a is an arbitrary parameter, and find a general
formula for disk amplitudes associated with such an array. In Section 3 we analyze in
detail disk amplitudes for scattering of m external closed strings from the array of D-
branes in imaginary time. By an exact computation, we show that they are equivalent
to sphere amplitudes with the same m closed strings insertions plus the insertion of an
extra closed string state |W . The details of this closed string state, in particular its space–
time interpretation, are studied in Section 4. The energy of the state is finite and of order
O(gs0 ) = O(1) for any a > 2π . The case a = 2π , corresponding to λ = 12 in the BCFT
(1.1), is seen to be special as the normalization and energy of the state diverge [12].
We suggest a heuristic mechanism to cutoff this divergence, namely we point out that
the gravitational back-reaction makes the effective distance aeff = 2π + γ gs , with γ > 0,
leading to a total energy of order 1/gs .
In Section 5 it is shown that although the imaginary array of branes does not have
propagating open string degrees of freedom, open strings still play an important role.
There still exists a discrete set of on-shell open string vertex operators corresponding to
exactly marginal deformations, for example deformations that move the branes around in
imaginary time. These open string moduli are re-interpreted as closed string deformations,
according to a precise dictionary. In Section 6 we show that, subject to certain reality
conditions, one can distribute D-branes quite freely in the complex X0 plane. The reduction
of disk amplitudes to sphere amplitudes still holds in this general case. We briefly comment
on extensions to the superstring. In Section 7 we briefly discuss some ideas about the open
string field theory associated with D-branes in imaginary time and speculate that a version
of vacuum string field theory may be obtained in the limit a → ∞.
Section 8 is devoted to the case a = 2π . In this case there is an additional exactly
marginal open string deformation (which we may label as ‘cosh(X0 )’) which is not dual
to a purely closed deformation, as it introduces instead an actual D-brane in real time. We
give a treatment of (1.1) for all λ 12 by representing the boundary state as an infinite array
of some specific smeared sources, and then performing the Wick rotation. We find that for
λ < 12 a time-delay of order τ is introduced between the incoming and the outgoing parts
of the closed string wave, while for |X0 | < τ there is an actual source for the closed string
fields.
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 73
2. Preliminaries
Let us denote the scattering amplitude of some closed strings off a single D-brane
located at X = 0 by Ã. Here X is a spatial coordinate. To find the scattering amplitude
S for an array of D-branes located at imaginary time we could proceed in two steps:
(i) Find the scattering amplitude S̃ for an array of D-branes located at the real positions
X = (n + 12 )a, n ∈ Z. This is given by
∞
1
S̃(P , . . .) = Ã(P , . . .)ei(n+ 2 )aP
n=−∞
∞
= Ã(P , . . .) (−1)n 2πδ(P a − 2πn), (2.1)
n=−∞
where P is the total momentum in the X direction and the dots denote other
variables that the amplitude may depend on. This is a precise equality in the sense
of distributions. Simply put, the momentum has to be quantized due to the periodicity
in X.
(ii) Apply (inverse) Wick rotation
where now X0 and E are real. This has the effect of turning the spatial coordinate X
into a temporal coordinate X0 and of rotating the array of D-branes from the real axis
to the imaginary axis, X0 = i(n + 12 )a. So it takes us exactly to the set-up that we wish
to study. Wick rotating (2.1) we get the formal expression
∞
S(E, . . .) ≡ S̃(iE, . . .) = Ã(iE, . . .) (−1)n 2πδ(iEa − n). (2.3)
n=−∞
Naively this implies that S(E, . . .) is identically zero since for any real E the delta
functions vanish. However one has to be more careful, as Ã(iE, . . .) may blow up for
some real values of E yielding a non-zero S(E). Clearly the discussion so far has been
quite formal, for example the summation (2.1) does not commute with the Wick rotation
(2.2) in the sense that if we first Wick rotate and then sum over the array, the sum does not
converge for any real E. We need to specify an unambiguous prescription for the analytic
continuation (2.2). Let us illustrate how a natural prescription comes about in a simple
example.
2.2. Example
This is a periodic function with period a (see Fig. 1), given in a neighborhood of X = 0 by
π cosh(cX) a
G̃array (X) = , |X| . (2.7)
c sinh( ca
2 ) 2
Now we wish to Wick rotate. Of course, G̃array (X) is not an analytic function, precisely
because of the δ-function sources located at X = (n + 12 )a. So we need to specify what we
exactly mean by the analytic continuation X → −iX0 . A natural prescription is to focus
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 75
Fig. 1. A graph of G̃array (X), which has the interpretation of the field produced by an infinite array of δ-function
sources (‘D-branes’) located at X = n(a + 12 ). The dashed line represents the analytic continuation to |X| > a2 of
the branch around the origin.
In principle, all amplitudes considered in the paper could be expanded in terms of the
example studied above. It would be nice however to have a general formula that gives
S(E, . . .) in terms of Ã(P , . . .). To this end consider, for generic Ã(P , . . .),
∞ ∞
G̃array (X) = dP eiP X (−1)n 2πδ(aP − 2πn)Ã(P , . . .). (2.10)
−∞ n=−∞
76 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
Fig. 2. Integration contours in the complex P plane. The zeros of sin(aP /2) are denoted by the symbols ‘x’
along the real P axis. The black dots represent possible poles of à and the thick line represents a possible cut.
We assume that the only singularities of à are on the imaginary P axis.
In all cases that we study in the present paper Ã(P , . . .) is an analytic function with poles
or cuts only along the imaginary P axis. Moreover Ã(P , . . .) goes to zero for |P | → 0
sufficiently fast to validate to following argument.
With the help of the residues theorem we can write
1 Ã(P , . . .)
G̃array (X) = dP eiP X , (2.11)
2i sin( aP
2 )
C
where the contour C is depicted in Fig. 2. By our analyticity assumptions on Ã(P , . . .), the
curve C can be deformed to C˜ without crossing any singularities (see Fig. 2) and without
picking up any contributions from the two semi-circles at infinity (that are not shown in
Fig. 2). So we conclude, after Fourier transforming back to momentum space, that
S(E) = F (E) DiscE Ã(iE) , (2.12)
where
1
F (E) = . (2.13)
2 sinh( aE
2 )
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 77
3. Disk amplitudes
To gain some insight into the meaning of the D-brane array at imaginary times (1.2), we
now turn to a detailed analysis of disk scattering amplitudes of external closed strings. We
start by considering the simplest possible case, namely the amplitude of two closed string
tachyons. We first study this concrete example using standard methods, and then describe
a more abstract point of view, that can be generalized easily to m-point functions involving
arbitrary on-shell closed strings. A clear physical interpretation will emerge.
The simplest non-trivial case that we consider is the disk amplitude with the insertion
of two closed string tachyons. This example, which we are going to work out in full detail,
contains already much of the essential physics.
To apply the prescription derived in the previous section, we need to evaluate the
disk two-point function Ã(p1 , p2 ) for a standard D-brane. We consider a D(p − 1)-brane
with Dirichlet boundary conditions for X̃M , M = 0, . . . , 25 − p, and Neumann boundary
conditions for X̃m , m = 26 − p, . . . , 25. We are using a notation that will be natural for
the theory after the Wick rotation: X̃0 is a spatial coordinate that will become timelike
after Wick rotation. In order to have a standard D-brane with Neumann conditions in time,
we take one of the X̃m to be timelike. So the Wick rotation is actually a double Wick
rotation: it transforms X̃0 from spacelike to timelike and one of the parallel directions X̃m
from timelike to spacelike.1 In practice, the amplitude Ã(p1 , p2 ) will be written in terms
of kinematic invariants, so these distinctions are of no much consequence.
It is convenient to break up the momenta into parallel and perpendicular directions,
M m
pµ = p⊥ , p , (3.1)
s = p1
2
= p2
2
,
t = (p1 + p2 )2 = (p1⊥ + p2⊥ )2 . (3.2)
1 The case p = 0 is special, as we must choose one of the transverse directions X̃M to be timelike.
78 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
In our conventions2 α = 1, so that the on-shell condition for closed string tachyons is
p2 = 4.
The two-point disk amplitude has one modulus (four real coordinates minus three
conformal Killing vectors). We work with a slightly unconventional parametrization of this
moduli space that will be easy to generalize later when we turn to higher-point functions.
Namely we shall fix the positions of the two vertex operators and integrate over the radius
of the disk. We represent the disk as the complex domain Hρ = {z ∈ C, |z| ρ}, that is we
cut out a hole of radius ρ from the complex plane. The measure for the modulus ρ is
dρ
(b0 + b̄0 ) . (3.3)
ρ
Using the standard doubling trick, a closed string vertex operator V (p, z, z̄) is replaced by
the two chiral insertions VL (p, z) and VL (p , ρ 2 /z̄), where p = (−p⊥ , p ). For closed
string tachyons, V (p, z, z̄) = c(z)c̄(z̄) exp(ipX(z, z̄)) and VL (p, z) = c(z) exp(ipXL (z)).
Fixing the tachyon vertex operators at z = 1 and z = ∞, we have
1
1 dρ
Ã(p1 , p2 ) = dz
2πi ρ
0 |z|=ρ
× VL (p1 ; ∞)VL(p2 ; 1)b(z)VL p2 ; ρ 2 VL (p1 ; 0) , (3.4)
where the symbol
, denotes a CFT correlator on the plane. A short calculation gives
1
s−2 (t/4 − 1) (s − 1)
Ã(p1 , p2 ) = dρ ρ t /2−3 1 − ρ 2 = . (3.5)
2 (t/4 + s − 2)
0
This is of course a standard result, with a familiar interpretation [29,30]. The scattering
amplitude of a closed string off a D-brane shows the usual ‘dual’ structure with poles both
in the open and in the closed string channel. In the open string channel, the poles are located
at s = 1, 0, −1, . . . and arise from expanding around ρ = 1 where the vertex operators
approach the boundary. In the closed strings channel, we see poles at t = 4, 0, −4, . . .,
arising from expanding around ρ = 0 where the boundary shrinks to zero size.
We are now in a position to apply the prescription (2.12).3 The discontinuity with
respect to E = E1 + E2 = −i(p10 + p20 ) comes from the poles in the variable t, and so
we find that
∞
1
S(p1 , p2 ) = fk (s)δ(t/4 − 1 + k), (3.6)
2 sinh( a|E|
2 ) k=0
2 Moreover in writing a string amplitude Ã(p , . . . , p ), we treat all momenta as incoming. Finally our
1 m
convention for the Minkowski metric is ‘mostly plus’.
3 Notice that Ã(p , p ) obeys our analyticity assumptions, indeed it is an analytic function of the total
1 2
momentum P 0 with singularities (poles) only for imaginary P 0 , and behaves as 1/|P 0 |2s for large |P 0 |.
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 79
where
(−1)k (s − 1) (2 − s)(3 − s) · · · (1 + k − s)
fk (s) = = . (3.7)
2k! (s − k − 1) 2k!
Several remarks are in order. First, all the contributions to S come from the ρ → 0
region of the moduli space. The sharpest way to see this is to introduce a cut-off ρ 1.
Then the amplitude à becomes analytic in t since the poles in the closed string channel
disappear. Applying (2.12) yields S = 0 for any . This vindicates the original intuition that
the boundary state for the array of imaginary D-branes should correspond to a ‘hole of zero
size’ (an extra puncture) in the worldsheet. This intuition will be made very precise below.
Second, S has no poles in s, the open string channel! The external closed strings do not
couple to any on-shell open string degrees of freedom. Since there are no D-brane sources
in real time, this is as expected. We are describing a purely closed string background.
More precisely, (3.6) describes a sphere amplitude with the two tachyons insertions and
an additional on-shell closed string state that involves excitation at all levels of the tower of
massive modes. Indeed, the prefactor fk (s) is a polynomial of degree 2k in p , consistently
with the fact that a closed string mode at level k has up to 2k Lorentz indices.
In order to secure this result, and to prepare for the generalization to higher-point
amplitudes, we now repeat the computation in a more abstract language. We still write
the amplitude in the domain Hρ , but instead of using the doubling trick, we represent the
effect of the boundary at |z| = ρ by the insertion of a boundary state |B̃ p−1|z|=ρ . This is
the full boundary state defining the D(p − 1)-brane located at X̃M = 0.
A boundary state is a ghost number three state in the closed string Hilbert space, obeying
among other things the conditions
(QB + Q̄B )B̃ p−1 = 0, (b0 − b̄0 )B̃ p−1 = 0. (3.8)
In radial quantization, a state at radius ρ can be obtained from a state at radius ρ = 1 by
propagation in the closed string channel,
p−1
B̃ = ρ L0 +L̄0 p−1
B̃ . (3.9)
|z|=ρ |z|=1
We can then write
1
dρ
Ã(p1 , p2 ) =
|V (p1 ; ∞, ∞)V (p2 ; 1, 1)(b0 + b̄0 )ρ L0 +L̄0 B̃ p−1 |z|=1 . (3.10)
ρ
0
To proceed, we insert a complete set of states {|k, i},
dρ
1
k2
Ã(p1 , p2 ) = dk
|V (p1 ; ∞, ∞)V (p2 ; 1, 1)|k, iρ ( 2 +2li )
ρ
i 0
×
k, i|(b0 + b̄0 )B̃ p−1 |z|=1 , (3.11)
80 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
where li is the level of the state |k, i, and we integrate over the modulus ρ to get
1
Ã(p1 , p2 ) = dk
|V (p1 ; ∞, ∞)V (p2 ; 1, 1)|k, i 2
2 + 2li
k
i
p−1
×
k, i|(b0 + b̄0 )B̃ |z|=1
. (3.12)
This expression exhibits the decomposition of the amplitude into a source term from the
boundary state, a closed string propagator, and the 3-point interaction vertex. If we now
apply the master formula (2.12) we find
S(p1 , p2 ) =
|V (p1 ; ∞, ∞)V (p2 ; 1, 1)|W , (3.13)
where4
δ(k 2 /2 + 2l(i))
|W ≡ dk |k, i
k, i|(b0 + b̄0 )B p−1 |z|=1
i 2 sinh( a|E|
2 )
δ(L0 + L̄0 )
= (b0 + b̄0 )B p−1 |z|=1 . (3.14)
2 sinh( a|E|
2 )
We see that the interaction of the two tachyons with the imaginary array is captured by a
ghost number two closed string state |W . Using (3.8) and {b0 , QB } = L0 , we have
4 Notice that in the formula below we drop the ‘tilde’ on the symbol for the boundary state: |B p−1 denotes
the boundary state after double Wick rotation.
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 81
where R{·} denotes radial ordering. Inserting as before an intermediate complete set of
states, and performing the ρ integral, we find
Ã(p1 , . . . , pn ) = dk d 2 z3 · · · d 2 zm
i
×
|R V1 (p1 ; ∞, ∞)V2(p2 ; 1, 1) · · ·Vm (pm ; zm , z̄m ) |k, i
k2
+2li
ρ02
×
k, i|(b0 + b̄0 )B̃ p−1 |z|=1 . (3.18)
k2
2 + 2li
Now we extract the discontinuity with respect to the energy E. Clearly we get
contributions from the poles in the propagators ∼ k 2 /2+2l
1
. A priori, the integrals over
i
the coordinates zi may generate additional singularities. However it is not difficult to show
that extra singularities can only arise when a (proper) subset of the vertex operators have
an on-shell total momentum. We can define the amplitude by analytic continuation away
from these singular points, and then disregard these singularities. This is reminiscent of the
celebrated canceled propagator argument [31]. We can then write5 (see Fig. 3)
S(p1 , . . . , pm ) = d 2 z3 · · · d 2 zm
× V1 (p1 ; ∞, ∞)V2(p2 ; 1, 1) · · · Vm (pm ; zm , z̄m ) W(0, 0) .
(3.19)
5 Notice that the dependence on ρ drops because of the delta function δ(L + L̄ ). In other terms, the ρ
0 0 0
integral localizes to ρ → 0 and the upper limit of integration ρ0 is immaterial.
82 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
Fig. 3. Before the double Wick rotation we have a standard disk amplitude. The disk can be viewed as the region
Hρ , which is the complex plane with a hole of radius ρ. There are contributions to the scattering amplitude from
all values of ρ ρ0 , where ρ0 is the distance of the closest puncture. After the double Wick rotation the only
contribution is coming from ρ = 0. The hole shrinks to a point leaving behind an extra puncture W inserted at
the origin.
The coordinates zi , i = 3, . . . , m, are integrated over the full complex plane, so this is
the string theory amplitude for m + 1 insertions on the sphere. We can check that the
counting of moduli is consistent with this result. We started with 2m − 3 moduli for a
disk with m closed punctures and performed an explicit integration over ρ. This gives
2m − 4 = 2(m + 1) − 6, which is the number of moduli for a sphere with m + 1 punctures.
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 83
3.4. Interpretation
Let us summarize what we have learned. Disk amplitudes for m external closed strings
off the D-brane array at the imaginary times X0 = i(n + 1/2)a are completely equivalent
to (m + 1)-point amplitudes on the sphere, with the extra closed string insertion
gs d 2 z W(z, z̄),
where W is given by (3.14). The explicit factor of gs , which was omitted in the previous
formulae, comes from the relative normalization of disk amplitudes to standard sphere
amplitudes. Unlike the case of an ordinary brane, we are finding that for the imaginary array
disk amplitudes give a purely closed correction (of order gs ) to the background. There are
no D-brane sources in real time, only closed string radiation satisfying the homogeneous
wave equation.
The absence of sources in real time can also be deduced from the fact that all one-
point functions on the disk are trivially zero in our prescription. This is consistent with
the computations in [3], where the stress tensor (related to the graviton one-point function)
was found to vanish in the BCFT (1.1) for λ = 12 . There are two equivalent ways to see
this in our language. The one-point function on the disk is a smooth function of the total
energy E, so there is no discontinuity in E. Alternatively, the discussion above implies that
a disk one-point function is equal to a sphere two-point function with the extra insertion
of W; but sphere two-point functions are zero because of the infinite volume of the unfixed
moduli.
Since disk amplitudes provide the first order correction to the background, it is very
natural to expect that amplitudes with multiple boundaries give the necessary higher-order
corrections. A genus zero amplitude with m external closed strings and b boundaries,
is expected to become after Wick rotation a sphere amplitude with m + b punctures,
with b insertions of gs d 2 z W. Notice that since the boundaries are indistinguishable,
the sum over boundaries exponentiates to the insertion of exp( d 2 z gs W), which has
the interpretation of a coherent state of closed string radiation. One may investigate
this issue rigorously by representing the effect of the boundaries using boundary states,
and decomposing the moduli space of a surface with b boundaries and m closed string
punctures in terms of closed string vertices and propagators [32]. The moduli space
integration should localize to the region where each boundary shrinks to an extra puncture.
A priori, one may also expect that regions of moduli space where zero-size boundaries
collide could provide additional contributions. For example one may expect that an annulus
amplitude would reduce to a sphere amplitude with two insertions of gs W, plus a sphere
amplitude with a single insertion of a new operator gs2 W1 coming from the shrinking of
both boundaries to the same point. This issue should be investigated further.
In this section we study the physical properties of |W in more detail and make contact
with [11,12]. Our starting point is the boundary state |B p−1, which is obtained from
84 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
double Wick rotation of |B̃ p−1 . So |B p−1 is associated with a D(p − 1)-brane located at
XM = 0, in a space with metric (ηMN , δmn ). One has
∞
p−1 M
B = N δ X exp − µ†
a Sµν ã ν†
|0; k = 0. (4.1)
n n
n=1
N is a normalization constant that can be found for example in [33], and
where the dots indicate higher massive modes. The first term is the closed string tachyon,
which is an artifact of the bosonic string and is standard practice to ignore. Below we first
discuss the massless modes and then consider the massive modes.
In order to read off the dilaton and gravity wave profile from the second term we have
to undo their mixing. The unmixed dilaton and graviton (in the Einstein frame) take the
form (see, e.g., [33])
S · (φ)
hµν = Sµν − ηµν , φ = S · (φ) , (4.3)
η · (φ)
where
1
(φ) = (ηµν − kµ lν − kν lµ ), k · l = 1, l 2 = 0. (4.4)
2
Let us first look at the case p = 0 (an array of imaginary D(−1)-branes). One finds
that the Einstein metric is completely flat as all the expectation values of hµν vanish.
This means that the part of the leading term in the ADM mass, which scales like 1/gs ,
vanishes (see also [12]).6 The dilaton, on the other hand, does not vanish. So for p = 0, the
massless fields in |W consist of a spherically symmetric dilaton wave φ(r, X0 ) in 25 + 1
dimensions, whose energy is of order O(gs0 ) = O(1).
For p > 0 the 26-dimensional metric in the Einstein frame is non-trivial. However the
fields profiles are translationally invariant in the p longitudinal directions Xm , and to read
off the ADM mass we are instructed to dimensionally reduce to the 26 − p transverse
dimensions. One finds that in the (26 − p)-dimensional Einstein frame the metric is zero,
and we have again only a spherically-symmetric dilaton wave. We conclude that to order
1/gs the mass vanishes for general p.
6 This is consistent with Sen’s observation [3] that the stress tensor vanishes for the λ = 1 BCFT (1.1).
2
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 85
The 7
√ space–time profile of this dilaton wave is quite interesting [11]. For fixed radius
r = X X , the field decays exponentially fast as X → ±∞. For fixed X0 , the field
M M 0
decays as 1/r 23−p as r → ∞, just like the fields produced by an ordinary Dp-brane, but
with a different numerical coefficient. (These two asymptotic behaviors match in a region
of thickness of the order of a around the light-cone X0 = ±r.) To be precise, up to an
overall numerical constant that will not be relevant for the discussion below, the leading
asymptotic behavior as r → ∞ of the various fields is, in the 26-dimensional Einstein
frame,
(1 + K)(d − 3) − 2p 1
h00 → ,
d −2 r 23−p
δmn 2d − 2p − 5 − K 1
hmn → ,
2 d −2 r 23−p
δMN 2p + 1 + K 1
hMN → ,
2 d −2 r 23−p
d − 2p − 3 − K 1
φ→ . (4.5)
4 r 23−p
Here d = 26 and the parameter K is set to 1 for the standard Dp-brane and is set to −1 for
the background |W .
An exercise one can now do is to compute the force acting on a probe D0-brane in this
dilaton wave background [11]. One has to be a bit careful here with the exact meaning of
the ‘force’ between the brane and the wave since this is not a static set-up. The ‘force’
that can be computed using (4.5) for K = −1 is acting during a finite time interval
−T X0 T at a distance r in the limit Tr → 0. In the Einstein frame in 26 dimensions
the DBI action for the D0-brane takes the form
11 √
dX0 e− 12 φ g00 . (4.6)
From this and (4.5) with p = 0 we can deduce that the ratio between the force acted upon
the D0 by a standard brane and the ‘force’ acted on it by the dilatonic wave in |W is
Fstandard 11
12 φ(K = 1) + 12 h00 (K = 1) 12
= = . (4.7)
FW 11
12 φ(K = −1) + 1
2 h00 (K = −1) 11
This is in agreement with [11] where this ratio was calculated in a different way.
m2 = 4(n − 1). For n > 1, their field profile (proportional to 1/ sinh(a|E|/2) is strongly
peaked at energies |E| − m 1/a. This means that for any a 2π , the states are with good
approximation non-relativistic already at level n = 2, with the approximation improving at
7 From the discussion in Section 6 of [11] it is not obvious that Eq. (6.27) in that section describes just a
dilaton wave. However, this can be verified with the help of Eq. (4.3).
86 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
For a = 2π the expectation value of the energy diverges for p = 0, 1, 2. Naively, for
p > 2 the energy is finite. However, for any p the expectation values of powers
E k
will eventually diverge for sufficiently high k [12], and hence for a = 2π the uncertainty
in the energy9 is infinite for any p.
This divergence has a very natural physical interpretation. As reviewed in the
introduction and further discussed in Section 8, for a = 2π the background of imaginary
branes admits an infinitesimal deformation that introduces a D-brane source in real time,
and we should then expect that this background also has an energy of order 1/gs .
1
Ē ∼ . (4.13)
ε
So to obtain the expected scaling ∼ 1/gs the parameter b has to be equal to one. An
argument why this is plausible was given in [12] from the point of view of open string field
theory. Here we provide an alternative heuristic argument from the closed string channel,
that justifies why b = 1 and also why γ > 0.
Let us think about this issue from the point of view of the array before Wick rotation.
When gs = 0 the distance between the branes is 2π . When gs is turned on the branes will
slightly curve space–time due to their mass so that the distance between them is no longer
2π . Actually since their mass is ∼ 1/gs and the Newton constant scales like gs2 the back-
reaction is of order gs and so the proper distance between the branes is indeed as in (4.12).
γ is positive simply because the metric components in the transverse direction to the brane
scale like
γ̃
g⊥ ∼ 1 + gs , (4.14)
r 23−p
with γ̃ > 0.
It is interesting to check if this argument generalizes when we put N rather than just
one brane at each site. First, the back-reaction of the metric scales like gs N which means
that now ε ∼ gs N . Combining this with (4.13) and with the fact that |W gets multiplied
10 More precisely, as shown in [12] and further elaborated in Section 6 of this paper, the limit X0 → ∞ of
(1.1) corresponds for all λ (up to a trivial time translation) to the outgoing (X0 > 0) part of our state |W , and
symmetrically X0 → −∞ of (1.1) corresponds to the incoming (X0 < 0) part of |W .
11 For p > 0 (4.11) cannot be trusted as the higher moments of the energy (
E k −
Ek )1/k have a worse
degree of divergence than the mean value Ē =
E.
88 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
In Section 3 we showed that scattering amplitudes off the imaginary array do not have
any open string poles. This means that there are no propagating open strings degrees of
freedom, consistently with the fact that there are no branes in real time. However there still
is a discrete set of on-shell open string vertex operators, which demand an interpretation.
In this section we argue that they are dual to deformations of the closed string state |W .
For simplicity let us start by considering the case p = 0, the array of D(−1)-branes at
imaginary times X0 = i(n + 12 )a. The open string spectrum should be read off from the
theory before double Wick rotation, where we have an array of D(−1)-branes at the spatial
locations X̃0 = (n + 12 )a. For generic12 distance a, the only matter primaries of dimension
(n) (n)
one are Ṽµ = ∂ X̃µ , where µ are space–time indices and n ∈ Z labels the position of the
D(−1)-brane. In the double-Wick rotated theory the physical open string states are just the
same, up to trivial relabeling. So the most general state in the open string cohomology of
the theory of D(−1)-branes at imaginary times can be written as
25 ∞
µ
|f = f(n) ∂Xµ(n) (0)c1 |0. (5.1)
µ=0 n=−∞
Clearly these are the exactly marginal open string deformations that correspond to moving
the positions of the D(−1)-branes.
Next we can consider disk amplitudes S(p1 , . . . , pm ; f ) for m closed strings scattering
off the imaginary array, with one additional insertion of (5.1) on the boundary of the disk.
Without any open strings puncture, S(p1 , . . . , pm ) was shown in Section 3 to be a sphere
amplitude with an extra insertion of the closed string state |W . How does the addition of
|f change this conclusion?
µ
To get some insight, consider first the case f(n) = f µ for all n. This deformation
is simply a Goldstone mode associated with a rigid translation Xµ → Xµ + fµ of
the whole array. It is clear that in this simple case S(p1 , . . . , pm ; f ) is obtained from
S(p1 , . . . , pm ; f ) by replacing |W with its infinitesimal translation, |W → f µ ∂µ |W .
In this special example the open string insertion |f corresponds to a symmetry of the
vacuum broken by the closed string background |W .
In the more general case (5.1), it is not difficult to show13 that S(p1 , . . . , pm ; f ) will
still be a sphere amplitude with m + 1 punctures, where the extra closed puncture is now
δf W, the infinitesimal deformation of W obtained by displacing the branes according
to |f . Generically this deformation will change the total energy of the closed string state.
The intuitive picture is that the branes act as sources for the closed string fields. Since these
sources are not in real time, the resulting closed string fields are homogeneous solutions of
the wave equation, and we have a purely closed background. Changing the positions of the
branes changes the details of the closed string field profiles. Thus the open string moduli
(5.1) are re-interpreted as deformations of the closed string background. In the next section
this picture is made precise.
There is one important restriction on the allowed motions of the branes: the resulting
µ
closed string fields must be real. This imposes certain constraints on the moduli f(n) .
Starting from the array at X0 = in(a + 12 ), reality of the closed string fields demands
that the D(−1)-branes be moved in pairs, that is, the kth D(−1)-brane at X0 = ia(k + 12 ),
k > 0, together with its mirror partner at X0 = ia(−k + 12 ). The reality condition is
µ µ ∗
f(k) = f(−k) ∀k ∈ N, (5.2)
where ∗ denotes complex conjugation. As long as we refrain from considering branes with
complex spatial coordinates Xi , i = 1, . . . , 25, we should keep the spatial moduli f(k)
i real,
and then (5.2) implies f(k) = f(−k) . For the time coordinate X on the other hand, there
i i 0
are two possible motions for a given pair of branes, as illustrated in Fig. 4.14
The reality condition can be simply rephrased by saying that the D-brane configuration
must be symmetric under reflection with respect to real time axis. A simple way to see
this constraint is to focus for example on the massless closed string fields. The field
produced by some δ-function sources located at the origin in space and at times X0 = Yk0
is schematically [11]
1
φ(Xµ ) ∼ , (5.3)
k
[−(X − Yk )2 + (Xi )2 ]s
0 0
where the power s depends on the number of transverse directions. We are interested in
measuring the field φ(Xµ ) for real values of its arguments Xµ . It follows that for φ(Xµ )
to be real, the locations of the sources {Yk0 } must either be real or come in complex
conjugate pairs. In the next section we write down the prescription to obtain the closed
string background associated with an arbitrary configuration for imaginary branes, and it
will be apparent that the same reality condition is valid in full generality.
13 We cannot directly apply the prescription of Section 2. However, the relevant disk amplitude is simple
enough that can be evaluated directly in the theory of the spatial array X̃0 = a(n + 12 ), and then Wick rotated.
14 Interestingly, for a pair of branes before the double Wick rotation there are also two allowed exactly marginal
motions along X = −iX0 , but of course they are the independent translations of each brane along the real X axis.
90 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
Fig. 4. The two possible motion modes of a pair of D-branes in the complex X0 plane.
For p > 0 there are additional on-shell open strings since the open string tachyon can
be put on-shell provided pm pm = 1, where pm is the momentum along the Neumann
directions. These are precisely the exactly marginal deformations studied in the original
work by Callan et al. [25]. For example the deformation λ cos(Xm ), where Xm is one of
the directions along the brane, continuously interpolates between Neumann and Dirichlet
boundary conditions for the coordinate Xm . At the critical value λ = 12 , we get an array of
D(p − 2)-branes localized at Xm = 2π(n + 12 )i. The deformed boundary state is known for
all values of λ [25] and we can easily apply our prescription (3.14) to compute |W(λ ).
As λ varies, these states are a family of purely closed string backgrounds. An open
string deformation, in this case cos(Xm ), is then again re-interpreted as a closed string
deformation.
One important conclusion from the discussion in the previous section is that there is
nothing fundamentally special about the array of D-branes at X0 = i(n + 12 )a. Exactly
marginal open string deformations allow to move the positions of the branes in the complex
X0 plane, with the only constraints coming from the reality condition. By sending off most
of the branes to large imaginary time, we can also consider configurations with a finite
number of branes.
The simplest configuration consists of only one pair of branes at X0 = ±iβ (with β real,
β > 0). As in Section 2.2, we start with the pair in Euclidean space at X = ±β. The analog
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 91
of (2.7) is
2π −βc
G̃pair (X) = e cosh(cX), |X| β. (6.1)
c
Notice that the only change relative to (2.7) is in the prefactor. This was foreordained since
in a finite neighborhood of X = 0 both G̃pair (X) and G̃array (X) obey homogeneous wave
2 − c )G̃ = 0 and are even under X → −X. Wick rotation then gives the usual
d 2
equation ( dX
sourceless solution ∼ cos(cX0 ). In Fourier transform
π
Spair (E, . . .) = e−βc δ(E − c) + δ(E + c) , (6.2)
2c
which we can write
Spair (E, . . .) = Fpair (E) DiscE [A], (6.3)
where
Fpair (E) = sign(E)e−β|E| . (6.4)
This result should be compared with (2.9), (2.12), (2.13).
This formula can be generalized further by displacing the D-brane pair along real time,
that is at X0 = α ±iβ. In this case Fpair (E) → sign(E)e−β|E|+iαE . By linear superposition,
an arbitrary configuration of M D-brane pairs at positions X0 = αk ± iβk , k = 1, . . . , M,
leads again to an amplitude of the form (2.12), where now the prefactor takes the general
form
M
F (E) = sign(E)e−βk |E|+iαk E . (6.5)
k=1
It is worth pointing out that our prescription to compute the closed string fields
associated with imaginary D-branes is equivalent to the second-quantized point of view
taken in Section 3 of [12]. They propose to obtain the wavefunction for the closed string
fields at X0 = 0 by cutting open the Euclidean path integral in the presence of D-brane
sources located at Euclidean time X̃0 < 0. To evaluate the expectation value of the closed
string fields in such a wave function, one needs to construct the full configuration of sources
symmetric under X̃0 → −X̃0 (obtained by simply reflecting the sources located at X̃0 < 0
to X̃0 > 0), and read off the solution at X̃0 = 0. This is the second-quantized version of
what we do here.
An interesting open question is whether as we vary the configuration of imaginary
D-branes, the state |W spans the full closed string cohomology. It is clear that to have
any chance of success we must introduce more general boundary states than the ones
considered in this paper (for example, D-branes with magnetic and electric fields on their
worldvolume), and allow for an infinite number of imaginary branes. If the answer to
this question is positive then the open/closed string duality takes a new form since at the
fundamental level closed strings and D-branes are unified. Each closed string puncture in
a string theory amplitude could be effectively represented by a hole in the worldsheet with
appropriate boundary conditions.
6.1. Superstring
We have found an intriguing relation between D-branes in imaginary time and purely
closed backgrounds. Since D-branes admit an open string description, this suggests that one
may be able to obtain a dual description of closed string theories in terms of open strings.
We have already seen in Section 5 that the exactly marginal open string deformations have
a natural reinterpretation as deformations of the closed string background. However since
there are no propagating on-shell open string degrees of freedom on imaginary D-branes,
it is clear that if such a complete open/closed duality exists, it must involve the off-shell
open strings. In this section we offer some very brief and incomplete speculations in this
direction.
We would like to propose that the open string field theory (OSFT) on a configuration
of imaginary D-branes is dual to the corresponding closed string theory. To make sense
of this speculation we must define what is the OSFT for imaginary branes. Applying our
usual strategy, we start with OSFT on standard D-branes, and double Wick rotate. While
from a first quantized point of view it may be subtle to define the Wick-rotated open string
theory, in the second quantized approach we have the luxury of a space–time action, which
seems straightforward to analytically continue.
Let us sketch how this may come about in the example of the array of D(−1)-branes.
We start with an array of D(−1)-branes at X̃0 = (n + 12 )a. The open string field Ψj k has
Chan–Paton labels j, k ∈ Z running over the positions of the D-instantons, and the cubic
OSFT action [35] takes the form
1 1
jk 1
S[Ψ ] = − 2 Ψj k , QB Ψj k +
Ψj k , Ψj l ∗ Ψlk +
Ψjj , C . (7.1)
g0 2 jk
3
j kl j
15 Note however that one has to be careful here with the reality conditions discussed in the previous sections
[40].
94 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
Finally, the OSFT (7.1) may provide us with new clues about the string field theory
around the tachyon vacuum. As a → ∞, the energy of |W goes to zero, and we approach
the tachyon vacuum. Interestingly, in the same limit |W does not vanish completely, but it
becomes purely a zero-momentum dilaton. Thus we recover precisely the scenario of [23].
Since for any a such an open string field theory should make sense this may provide us
with a consistent regularization of vacuum string field theory, which should be related to
the one considered in [23] by some non-trivial field redefinition.
As discussed in Section 4, for a = 2π the normalization and the energy associated with
the closed string state |W diverge [12]. This singularity signals the appearance of new
open string degrees of freedom. Open strings stretched between neighboring branes in
imaginary time have conformal dimension L0 = (a/2π)2 , which equals one for a = 2π .
The infinite periodic array has the special property that a specific linear combination of
these marginal operators (the one which is invariant under translations X0 → X0 + 2π and
is even under X0 → −X0 ) is in fact exactly marginal.16 A new branch of moduli space
opens up for a = 2π . Indeed, the theory for a = 2π is equivalent to the λ = 12 critical point
of the BCFT (1.1). Turning on the exactly marginal open string deformation (‘cosh(X0 )’)
we can reduce the value of λ, from λ = 12 (the purely closed string background |W ) to
λ = 0 (the usual brane with Neumann boundary condition in time). This is the sense in
which the array for a = 2π is very special: it admits an exactly marginal deformation that
cannot be interpreted purely as a deformation of the closed background.
One can obtain many insights into the physics for λ < 1/2 by a simple extension of the
methods in Section 2. The basic idea is that in the Euclidean BCFT with X = −iX0 , taking
λ < 12 amounts to regulating the delta-function sources located at X = 2π(n + 1/2) with
smooth lumps. The precise way to make this regulation can be gleaned from the boundary
state [25] for general λ. Focusing on the oscillator free part of the boundary state,
∞
n −nτ
|B0 ∼ 1 + 2 (−1) e cos nX(0) |0, τ ≡ − log sin(πλ) . (8.1)
n=1
16 In principle also the odd operator, ‘sinh(X0 )’, is exactly marginal. However in the bosonic string this
deformation would bring us to the ‘wrong’ side of the tachyon potential for X0 < 0.
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 95
The interpretation of this formula is clear: for λ < 12 , the boundary state corresponds to an
infinite array of smeared sources j˜τ ; indeed j˜τ is a well-known representation for δ(X) in
the limit τ → 0.
If we sum the source j˜τ (X) over the infinite array and then Wick rotate X → −iX0 we
find
tanh((X0 + τ )/2) − tanh((X0 − τ )/2)
Jτ X0 = . (8.3)
4π
Already at this stage we see the crucial difference compared to λ = 12 (τ = 0). Now there
is a non-zero source localized at real time for |X0 | < τ , which is to say that an unstable D-
brane appears at X0 = −τ and disappears at X0 = τ . For later use let us record the Fourier
transform of this function,
sin(τ E)
ρτ (E) = . (8.4)
sinh(πE)
The next step is to repeat the exercise for the fields generated by this smeared array. The
Fourier transform of the Euclidean source j˜τ (X) is e−τ |P | . Therefore, in the notations of
Section 2.2, Eq. (2.4) should be replaced by
e−τ |P |
Ã(P , . . .) = . (8.5)
P 2 + c2
As in Section 2.3, we sum over the array and use the residue theorem to write the total field
as a contour integral,
1 e−τ |P |
G̃τ (X) = dP eiP X 2 . (8.6)
2i (P + c2 ) sin(πP )
C
If we now move the contour over the imaginary P axis as in Fig. 2, and Wick rotate, we
find
1 eiτ E e−iτ E
Sτ (E) = − , (8.7)
2 sinh(πE) (E + i)2 − c2 (E − i)2 − c2
which we recognize as
1
Sτ (E) = Gret (E)eiτ E − Gadv (E)e−iτ E . (8.8)
2 sinh(πE)
This way of writing the answer makes the space–time interpretation manifest (see also
[12]). For X0 < −(τ + r), we have purely incoming radiation. For X0 > τ + r, we have
purely outgoing radiation. Outside of these two cones (for (X0 )2 − r 2 < τ 2 ), the fields are
the same as the ones produces by a static source. The thickness of the transition regions
is of the order of the string length. Notice that the outgoing radiation is produced by the
rapid change of the source for X0 ∼ τ , and similarly the incoming radiation is correlated
with the change at X0 ∼ −τ . This process can be described some finely tuned incoming
closed strings that create an unstable D-brane at X0 = −τ , which then decays at X0 = τ
into outgoing closed strings.
96 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
aspects of the system. In particular, in [27] Choptuik showed, via numerical analysis, that
a universal behavior occurs at the critical point where black hole formation first occurs.
A crude summary of his results is the following. Consider a spherically symmetric wave
of a massless scalar field
ηφ r, X0 . (9.1)
The strength of the non-linear effects of the gravitational back-reaction of the wave grows
with the overall coefficient η. When η → 0 the linearized approximation is exact while
for η → ∞ a large black hole will be formed, with an exponentially small amount of
energy escaping the black hole formation as an outgoing radiation.17 Therefore, for a given
field profile φ(r, X0 ) there is a special value η∗ where the black hole formation first takes
place. While η∗ certainly depends on the details of φ, the time evolution of the system
for η → η∗ admits scaling behavior that is fixed by a certain constant, ∆. This constant
is universal in the sense that it does not depend on φ, but it can depend for example on
the number of spatial directions. Another critical exponent δ is related to the scaling of the
mass of the black hole near the transition. These fascinating results have been explored
quite extensively in the last decade.
In this section we take advantage of the fact that a spherically symmetric dilaton wave
can be viewed as a configuration of D-branes located in imaginary time to relate Choptuik’s
findings to a phase transition somewhat reminiscent of the Gregory–Laflamme instability
[28]. For simplicity we will mostly phrase the discussion in the context of the bosonic
string, ignoring as usual the closed string tachyon. The extension to the superstring of the
scaling arguments given below is straightforward.
The first step is to understand how to get from our |W a background containing only
a classical dilaton wave. It is clear that to suppress massive closed string modes we need
to take las → ∞. (Here we have restored the string length ls , that was set to one in the rest
of the paper.) However the total energy of the wave will then go to zero as 1/a 25−p in
this limit (see (4.9), and a black hole will not be formed this way. A simple way to obtain
a configuration with enough energy without exciting the massive modes is to increase the
number of branes at each point X0 = ia(n + 12 ), as in the discussion at the end of Section 4.
With N D-brane at each point the particles density, n̄/V , and the energy density = Ē/V
(V is the volume along the wave) of the wave for large a are (from (4.9)
24−2p 24−2p
n̄ N 2 ls N 2 ls
∼ , ∼ . (9.2)
V a 24−p a 25−p
Let us now state the exact conditions that must be satisfied in the limit we want to take:
(1) The string coupling gs should go to zero to suppress quantum effects, and the total
number of particles n̄ should go to infinity so that |W is well described by a classical
wave.
(2) las → 0 to ensure that the massive closed strings decouple.
17 The system is classical so this radiation is classical and should not be confused with Hawking radiation.
98 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
(3) The gravitational radius associated with the total energy of the wave
1
rG = (GN ) 23−p , (9.3)
where should be comparable to the wavelength, a (here GN is the Newton constant in
26 dimensions). If we define the dimensionless parameter
1
48−2p
rG (gs2 N 2 ls /a 25−p ) 23−p
ζ= = , (9.4)
a a
then in our set-up ζ plays the role of η in (9.1) That is, for small ζ the linearized
approximation is valid and a black hole will not be formed while for large ζ non-linear
effects are important and lead to black hole formation. So the critical value ζ ∗ is a
number of order one.
It is easy to verify that these conditions are satisfied in the following limit
24+β−p
a fixed, ls → 0, gs ∝ lsβ , N ∝ 1/ ls , β > 0. (9.5)
So far what we have done is to take a certain ‘decoupling’ limit that leaves us with
the classical picture of black-hole formation. The obvious question to ask is what does
this mean from the D-brane point of view. In particular, does anything special happens at
ζ = ζ ∗ in the array of D(p − 1)-branes located at X̃0 = (n + 12 )a, i.e., in the array before
the double Wick rotation?
The gravitational radius associated with N D-branes located at each site is
1
lG ∼ (gs N) 24−p ls . (9.6)
When lG is much smaller than the distance between the D-branes, a, the gravitational
interactions between them is small and they can be considered as separated points.
However, when lG is larger than a the gravitational interaction between them is so strong
that effectively they form a black hole along X̃0 . It is easy to see from (9.6), (9.4) that
this transition occurs at the same point as the wave to black hole transition. Since the two
processes are related by Wick rotation it is very tempting to suspect that there is a precise
numerical relation between the exponents of [27] and the exponents, yet to be found, in the
phase transition just described.
It should be stressed that we although the transition considered here is somewhat
reminiscent of the Gregory–Laflamme instability, there are also obvious differences. In the
Gregory–Laflamme case one starts with a black p-brane and finds, at the level of linearized
equations of motion, an instable mode; condensation of this tachyon is then conjectured to
lead to an array of black (p − 1)-branes. (This expectation however may even be false, see,
e.g., [42–44] for recent discussions.) In our case we start from the array and we turn on
the coupling constant to eventually form a (possibly non-uniform) black p-brane, so we
are going in the opposite direction. More crucially, we do not start with an array of black
p-branes. The D-branes are extremal to begin with in the sense that they have no finite
area horizon. In the process of turning on the coupling constant the radius of their zero
area horizon grows until they meet. At that point a combined finite area horizon will be
formed.
D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100 99
Let us illustrate this in the context of the superstring. In this case there are also no closed
string tachyon so the whole discussion makes more sense. We can consider either an array
of BPS D(p − 1)-branes, which must then be alternating as D/D̄, or an array of non-BPS
D(p − 1)-branes. Consider for example an array of D3/D̄3 in type IIB. For small ζ the D3-
branes interact weakly and so they can be treated as separated stable objects. As we turn
on ζ their gravitational radius start to touch. At that stage the fact that we have alternating
branes anti-branes is crucial since the charges can annihilated to form a black 4-brane. This
black 4-brane does not carry any 6-form charge (we are in type IIB).
It would be extremely interesting to understand Choptuik’s critical behavior from the
point of view of the dual open string field theory discussed very briefly in Section 7. Very
schematically, the way this could work is as follows. From the open string field theory
point of view the classical non-linear gravity dynamics is obtained in the loop expansion.
We expect that the lightest open string mode stretched between the branes, which at the
classical level has large positive m2 , becomes tachyonic for ζ > ζ ∗ due to quantum open
string effects. It may be possible to see this effect in a one-loop computation, in the spirit
of [45]. If this scenario is correct, the universal physics at the transition point would be
completely captured by a massless field, and we would have an explanation of gravitational
critical behavior in terms of a second order phase transition in the dual OSFT.
Acknowledgements
References
[1] M. Gutperle, A. Strominger, Spacelike branes, JHEP 0204 (2002) 018, hep-th/0202210.
[2] A. Sen, Rolling tachyon, JHEP 0204 (2002) 048, hep-th/0203211.
[3] A. Sen, Tachyon matter, JHEP 0207 (2002) 065, hep-th/0203265.
[4] A. Sen, Field theory of tachyon matter, Mod. Phys. Lett. A 17 (2002) 1797, hep-th/0204143.
[5] A. Sen, Time evolution in open string theory, JHEP 0210 (2002) 003, hep-th/0207105.
[6] P. Mukhopadhyay, A. Sen, Decay of unstable D-branes with electric field, JHEP 0211 (2002) 047, hep-
th/0208142.
[7] A. Strominger, Open string creation by S-branes, hep-th/0209090.
[8] A. Sen, Time and tachyon, hep-th/0209122.
[9] F. Larsen, A. Naqvi, S. Terashima, Rolling tachyons and decaying branes, JHEP 0302 (2003) 039, hep-
th/0212248.
[10] M. Gutperle, A. Strominger, Timelike boundary Liouville theory, hep-th/0301038.
[11] A. Maloney, A. Strominger, X. Yin, S-brane thermodynamics, hep-th/0302146.
[12] N. Lambert, H. Liu, J. Maldacena, Closed strings from decaying D-branes, hep-th/0303139.
[13] B. Chen, M. Li, F.L. Lin, Gravitational radiation of rolling tachyon, JHEP 0211 (2002) 050, hep-th/0209222.
100 D. Gaiotto et al. / Nuclear Physics B 688 (2004) 70–100
[14] S.J. Rey, S. Sugimoto, Rolling tachyon with electric and magnetic fields: T-duality approach, hep-
th/0301049.
[15] N. Moeller, B. Zwiebach, Dynamics with infinitely many time derivatives and rolling tachyons, JHEP 0210
(2002) 034, hep-th/0207107.
[16] S. Sugimoto, S. Terashima, Tachyon matter in boundary string field theory, JHEP 0207 (2002) 025, hep-
th/0205085.
[17] J.A. Minahan, Rolling the tachyon in super-BSFT, JHEP 0207 (2002) 030, hep-th/0205098.
[18] T. Okuda, S. Sugimoto, Coupling of rolling tachyon to closed strings, Nucl. Phys. B 647 (2002) 101, hep-
th/0208196.
[19] J. Kluson, Exact solutions in open bosonic string field theory and marginal deformation in CFT, hep-
th/0209255.
[20] I.Y. Aref’eva, L.V. Joukovskaya, A.S. Koshelev, Time evolution in superstring field theory on non-BPS
brane, I: rolling tachyon and energy–momentum conservation, hep-th/0301137.
[21] A. Ishida, S. Uehara, Rolling down to D-brane and tachyon matter, JHEP 0302 (2003) 050, hep-th/0301179.
[22] S.L. Shatashvili, On field theory of open strings, tachyon condensation and closed strings, hep-th/0105076.
[23] D. Gaiotto, L. Rastelli, A. Sen, B. Zwiebach, Ghost structure and closed strings in vacuum string field theory,
hep-th/0111129.
[24] L. Rastelli, A. Sen, B. Zwiebach, String field theory around the tachyon vacuum, Adv. Theor. Math. Phys. 5
(2002) 353, hep-th/0012251.
[25] C.G. Callan, I.R. Klebanov, A.W. Ludwig, J.M. Maldacena, Exact solution of a boundary conformal field
theory, Nucl. Phys. B 422 (1994) 417, hep-th/9402113.
[26] J. Polchinski, L. Thorlacius, Free fermion representation of a boundary conformal field theory, Phys. Rev.
D 50 (1994) 622, hep-th/9404008.
[27] M.W. Choptuik, Universality and scaling in gravitational collapse of a massless scalar field, Phys. Rev.
Lett. 70 (1993) 9.
[28] R. Gregory, R. Laflamme, Black strings and p-branes are unstable, Phys. Rev. Lett. 70 (1993) 2837, hep-
th/9301052.
[29] I.R. Klebanov, L. Thorlacius, The size of p-branes, Phys. Lett. B 371 (1996) 51, hep-th/9510200.
[30] A. Hashimoto, I.R. Klebanov, Decay of excited D-branes, Phys. Lett. B 381 (1996) 437, hep-th/9604065.
[31] J. Polchinski, String Theory, vol. 1, Cambridge Univ. Press, 1995.
[32] B. Zwiebach, Closed string field theory: quantum action and the B–V master equation, Nucl. Phys. B 390
(1993) 33, hep-th/9206084.
[33] P. Di Vecchia, M. Frau, I. Pesando, S. Sciuto, A. Lerda, R. Russo, Classical p-branes from boundary state,
Nucl. Phys. B 507 (1997) 259, hep-th/9707068.
[34] M. Green, J. Schwarz, E. Witten, Superstring Theory, vol. 1, Cambridge Univ. Press, 1987.
[35] E. Witten, Noncommutative geometry and string field theory, Nucl. Phys. B 268 (1986) 253.
[36] J.A. Shapiro, C.B. Thorn, BRST invariant transitions between closed and open strings, Phys. Rev. D 36
(1987) 432.
[37] J.A. Shapiro, C.B. Thorn, Closed string–open string transitions and Witten’s string field theory, Phys. Lett.
B 194 (1987) 43.
[38] B. Zwiebach, Interpolating string field theories, Mod. Phys. Lett. A 7 (1992) 1079, hep-th/9202015.
[39] A. Hashimoto, N. Itzhaki, Observables of string field theory, JHEP 0201 (2002) 028, hep-th/0111092.
[40] Work in progress.
[41] N. Drukker, Closed string amplitudes from gauge fixed string field theory, hep-th/0207266.
[42] G.T. Horowitz, K. Maeda, Fate of the black string instability, Phys. Rev. Lett. 87 (2001) 131301, hep-
th/0105111.
[43] S.S. Gubser, On non-uniform black branes, Class. Quantum Grav. 19 (2002) 4825, hep-th/0110193.
[44] B. Kol, Topology change in general relativity and the black-hole black-string transition, hep-th/0206220.
[45] S.R. Coleman, E. Weinberg, Radiative corrections as the origin of spontaneous symmetry breaking, Phys.
Rev. D 7 (1973) 1888.
Nuclear Physics B 688 (2004) 101–134
www.elsevier.com/locate/npe
Abstract
We compute the next-to-next-to-leading order (NNLO) contributions to the three splitting
functions governing the evolution of unpolarized non-singlet combinations of quark densities in
perturbative QCD. Our results agree with all partial results available in the literature. We find that
the correct leading logarithmic (LL) predictions for small momentum fractions x do not provide
a good estimate of the respective complete results. A new, unpredicted LL contribution is found
for the colour factor d abc dabc entering at three loops for the first time. We investigate the size
of the corrections and the stability of the NNLO evolution under variation of the renormalization
scale. Except for very small x the corrections are found to be rather small even for large values
of the strong coupling constant, in principle facilitating a perturbative evolution into the sub-GeV
regime.
2004 Published by Elsevier B.V.
1. Introduction
Parton distributions form indispensable ingredients for the analysis of all hard-scattering
processes involving initial-state hadrons. The dependence of these quantities on the fraction
x of the hadron momentum carried by the quark or gluon cannot be calculated in
perturbation theory. However, the scale-dependence (evolution) of the parton distributions
can be derived from first principles in terms of an expansion in powers of the strong
first order, while Pq q̄ and the flavour-independent (‘sea’) contributions Pqq and Pqsq̄ are of
v s
order αs2 . A non-vanishing difference Pqq s − P s occurs for the first time at the third order.
q q̄
This general structure leads to three independently evolving types of non-singlet
distributions: the evolution of the flavour asymmetries
±
qns,ik = qi ± q̄i − (qk ± q̄k ) (2.2)
± , is governed by
and of linear combinations thereof, hereafter generically denoted by qns
±
Pns = Pqq
v
± Pqvq̄ . (2.3)
The sum of the valence distributions of all flavours,
nf
v
qns = (qr − q̄r ), (2.4)
r=1
evolves with
s −
v
Pns = Pqq
v
− Pqvq̄ + nf Pqq − Pqsq̄ ≡ Pns + Pns
s
. (2.5)
The first moments of Pns − and P v vanish, since the first moments of the distributions q −
ns ns
v
and qns reflect conserved additive quantum numbers.
We expand the splitting functions in powers of as ≡ αs /(4π), i.e., the evolution
i (x, µ2 ), i = ±, v, are written as
equations for qns f
d αs (µ2f ) n+1 (n)i
2
qns x, µf =
i 2
Pns (x) ⊗ qns
i
x, µ2f , (2.6)
d ln µf 4π
n=0
to the splitting functions discussed above. The relative sign is the standard convention.
Note that in the older literature an additional factor of two is often included in Eq. (2.7).
The calculation follows the methods of Refs. [24–26,44,45]. We employ the optical
theorem and the operator product expansion to calculate Mellin moments of the deep-
inelastic structure functions. Since we treat the Mellin moment N as an analytical
parameter, we cannot apply the techniques of Refs. [24–26], where the M INCER program
[46,47] was used as the tool to solve the integrals. Instead, the introduction of new
techniques was necessary, and various aspects of those have already been discussed in
Refs. [45,48–50]. Here we briefly summarize our approach, focusing on some parts which
have not been presented yet. It should be emphasized that we have at our disposal a very
powerful check on all our derivations and calculations by letting, at any point, N be some
positive integer value. Then we can resort to the approach of Refs. [24–26] and, with the
help of the M INCER program, the checking of all programs greatly simplifies.
We start by constructing the diagrams for the forward Compton reactions
computation of the P µ P ν projection which is still in progress. The results for the three-
loop coefficient functions will thus be presented in a future publication [69].
To solve the integrals we apply the following strategy [45,49]. We set up a hierarchy
of classes among all diagrams depending on the topology, for instance ladder, Benz or
non-planar. Within a certain topology, we define a sub-hierarchy depending on the number
of P -dependent propagators. We define basic building blocks (BBBs) as diagrams of a
given topology in which the quark momentum P flows only through a single line in the
diagram, while composite building blocks (CBBs) denote all diagrams with more than one
P -dependent propagator. We determine reduction schemes that map the CBBs of a given
topology class to the BBBs of the same topology class or to simpler CBB topologies.
Subsequently, we use reduction identities that express the BBBs of a given topology class
in terms of BBBs of simpler topologies.
This procedure has been discussed to some extent in Refs. [45,49]. It exploits various
categories of relations between the integrals which can be derived as follows. For a generic
loop integral depending on external momenta P and Q, the first category are integration-
by-parts identities [54,70],
∂ µ
d D pn µ pj × (· · ·) = 0. (2.9)
n
∂p i
These give a number of non-trivial relations by making various choices for the pi and pj
from the loop momenta. Additionally pj can be equal to P or Q. The second category
is based on scaling arguments [45] in Mellin space. They involve applying one of the
operators
∂ ∂ ∂
Qµ , Pµ , Pµ (2.10)
∂Qµ ∂Qµ ∂P µ
both inside the integral and to the integrated result. The scaling in Mellin space tells us
the effect of these operators on the integrated result, while inside the integral we just work
out the derivative. These relations naturally involve polynomials linear in N . The fourth
operator of this kind,
∂
Qµ , (2.11)
∂P µ
cannot be used naively in this context, because it does not commute with the limit
P · P → 0. More care is needed in this case and we will come back to this shortly.
A third category of relations is obtained along the lines of the Passarino–Veltman
decomposition into form factors [71]. In Mellin space we write
µ
d D pn pi × (· · ·) = Qµ IQ + P µ IP , (2.12)
n
where IQ and IP are the two form factors. By contracting Eq. (2.12) either with Qµ
or Pµ , the IQ and IP are determined in terms of a number of integrals. Next, by taking
the derivative with respect to Qµ , the relevant identities can be obtained. Because the
momentum pi can be any of the loop momenta, Eq. (2.12) gives us as many relations as
there are loops. Again, in Mellin space, these relations contain polynomials linear in N .
106 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
The fourth and the fifth category of relations are new. Together with the form factor
relations from Eq. (2.12) they were crucial in setting up the reduction scheme for the three-
loop topologies. They are based on operators that do not commute with the limit P · P → 0.
In the fourth category, one considers the dimensionless operators
P ·Q µ ∂
O1 = Q , (2.13)
Q·Q ∂P µ
∂ ∂
O2 = P · Q µ , (2.14)
∂P ∂Qµ
(P · Q)2 ∂ ∂
O3 = . (2.15)
Q · Q ∂P ∂P µ
µ
Each individual operator Oi does not commute with the limit P · P → 0, but certain
linear combinations of the Oi do. However, one has to extend the ansatz based on scaling
arguments in N -space. Specifically, one has for the N th moment of an integral I (N)
2P · Q N (0)
I (N) = (Q · Q)α CN
Q·Q
2P · Q N−2 P · P (2)
+ (Q · Q)α CN + · · · , (2.16)
Q·Q Q·Q
(0) (2)
where the CN and CN are dimensionless functions of N , and α adjusts the mass
(2)
dimensions. The novel feature is here the term CN proportional to P · P , which one
may call higher twist. In contrast, for the relations based on Eq. (2.10) it was sufficient to
(0)
restrict the ansatz to CN .
Applying the differential operators Oi in Eqs. (2.13)–(2.15) to the ansatz (2.16), one
finds that the combinations
Solving Eq. (2.18) for IQ and IP as above, however keeping all terms P · P , substituting
into Eq. (2.19) and finally taking the limit P · P → 0, we find
∂ µ P · pi µ ∂ Q · pi P · Q − Q · QP · pi
p I= Q I + (D + N − 2) I. (2.20)
∂P µ i P ·Q ∂P µ (P · Q)2
Again, as the momentum pi can be any of the loop momenta, Eq. (2.20) gives us as many
relations with polynomials linear in N as there are loops.
Taken together, the reductions of category one to five suffice to obtain a complete
reduction scheme. In particular, the reduction equations of category two to five involve
explicitly the parameter N of the Mellin moment. They give rise to difference equations
in N for an integral I (N),
±,s
Here we present the anomalous dimensions γns (N) in the MS-scheme up to the
third order in the running coupling constant αs , expanded in powers of αs /(4π). These
quantities can be expressed in terms of harmonic sums [6,7,59,60]. Following the notation
of Ref. [59], these sums are recursively defined by
M
(±1)i
S±m (M) = (3.1)
im
i=1
108 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
and
M
(±1)i
S±m1 ,m2 ,...,mk (M) = Sm2 ,...,mk (i). (3.2)
i m1
i=1
The sum of the absolute values of the indices mk defines the weight of the harmonic sum.
In the n-loop anomalous dimensions written down below one encounters sums up to weight
2n − 1.
In order to arrive at a reasonably compact representation of our results, we employ the
abbreviation Sm ≡ Sm (N) in what follows, together with the notation
N± Sm = Sm (N ± 1), N±i Sm = Sm (N ± i) (3.3)
for arguments shifted by ±1 or a larger integer i. In this notation the well-known one-loop
(LO) anomalous dimension [1,2] reads
(0)
γns (N) = CF 2(N− + N+ )S1 − 3 , (3.4)
and the corresponding two second-order (NLO) non-singlet quantities [4,6] are given by
17 28
γns (N) = 4CA CF 2N+ S3 −
(1)+
− 2S−3 − S1
24 3
151 11
+ (N− + N+ ) S1 + 2S1,−2 − S2
18 6
1 4 11 1
+ 4CF nf + S1 − (N− + N+ ) S1 − S2
12 3 9 3
3
+ 4CF 2 4S−3 + 2S1 + 2S2 − + N− [S2 + 2S3 ]
8
− (N− + N+ )[S1 + 4S1,−2 + 2S1,2 + 2S2,1 + S3 ] , (3.5)
CA
(1)−
γns (N) = γns
(1)+
(N) + 16CF CF −
2
× (N− − N+ )[S2 − S3 ] − 2(N− + N+ − 2)S1 . (3.6)
+ (N)
The three-loop (NNLO, N2 LO) contribution to the anomalous dimension γns
corresponding to the upper sign in Eq. (2.3) reads
3 5 10 10 4 2
(2)+
γns (N) = 16CA CF nf ζ3 − + S−3 − S3 + S1,−2 − S−4 + 2S1,1
2 4 9 9 3 3
25 257 2 2 2
− S2 + S1 − S−3,1 − N+ S2,1 − S3,1 − S4
9 27 3 3 3
23
− (N+ − 1) S3 − S2
18
1237 11 317 16
− (N− + N+ ) S1,1 + S1 + S3 − S2 + S1,−2
216 18 108 9
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 109
2 1 1 1 1 1
− S1,−2,1 − S1,−3 − S1,3 − S2,1 − S2,−2 + S1 ζ3 + S3,1
3 3 2 2 3 2
1657 15 31 67
+ 16CF CA2 − ζ3 + 2S−5 + S−4 − 4S−4,1 − S−3
576 4 6 9
11 3
+ 2S−3,−2 + S−3,1 + S−2 − 6S−2 ζ3 − 2S−2,−3 + 3S−2,−2
3 2
1883 16
− 4S−2,−2,1 + 8S−2,1,−2 − S1 − 10S1,−3 − S1,−2 + 12S1,−2,1
54 3
5 1 176 13
+ 4S1,3 − 4S2,−2 − S4 + S5 + S2 + S3
2 2 9 3
+ (N− + N+ − 2)[3S1 ζ3 + 11S1,1 − 4S1,1,−2 ]
9737 19 91
+ (N− + N+ ) S1 − 3S1,−4 + S1,−3 + 8S1,−3,1 + S1,−2
432 6 9
29 19
− 6S1,−2,−2 − S1,−2,1 + 8S1,1,−3 − 16S1,1,−2,1 − 4S1,1,3 − S1,3
3 4
11
+ 4S1,3,1 + 3S1,4 + 8S2,−2,1 + 2S2,3 − S3,−2 + S3,1 − S4,1
12
1 1967 121
− 4S2,−3 + S2,−2 − S2 + S3
6 216 72
1
− (N− − N+ ) 3S2 ζ3 + 7S2,1 − 3S2,1,−2 + 2S2,−2,1 − S2,3
4
3 29 11 1
− S3,−2 − S3,1 + S4,1 + S2,−3 − S2,−2
2 6 4 2
28 2376 8 5
+ N+ S3 − S2 − S4 − S5
9 216 3 2
17 13 2
+ 16CF n2f − S1 + S2
144 27 9
2 11 1
+ (N− + N+ ) S1 − S2 + S3
9 54 18
45 151 89 134
+ 16CF 2 CA ζ3 − − 10S−5 − S−4 + 20S−4,1 + S−3
4 64 6 9
31 9
− 2S−3,−2 − S−3,1 + 2S−3,2 − S−2 + 18S−2 ζ3 + 10S−2,−3
3 2
26
− 6S−2,−2 + 8S−2,−2,1 − 28S−2,1,−2 + 46S1,−3 + S1,−2
3
28 185
− 48S1,−2,1 + S1,2 − S3 − 8S1,3 + 2S3,−2 − 4S5
3 6
133 209 242
− (N− + N+ − 2) 9S1 ζ3 − S1 + S1,1 − 14S1,1,−2 − S2
36 6 18
110 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
33 14
+ 9S2,−2 + S4 − 3S3,1 + S2,1
4 3
107 173
+ (N− + N+ ) 17S1,−4 − S1,−3 − 32S1,−3,1 − S1,−2
6 9
103
+ 16S1,−2,−2 + S1,−2,1 − 2S1,−2,2 − 36S1,1,−3 + 56S1,1,−2,1
3
109 43 11
+ 8S1,1,3 − S1,2 − 4S1,2,−2 + S1,3 − 8S1,3,1 − 11S1,4 + S2,2
9 3 3
+ 21S2,−3 − 30S2,−2,1 − 4S2,1,−2 − 5S2,3 − S4,1
31 67
+ S2,−2 − S2,1
6 9
+ (N− − N+ ) 9S2 ζ3 + 2S2,−3 + 4S2,−2,1 − 12S2,1,−2 − 2S2,3
1 11 33 59 127 1153
+ 13S4,1 + S2,−2 + S4 − S3,1 + S3 + S2,1 − S2
2 2 2 9 6 72
4 23 73 151
+ N+ 8S3,−2 + S3,1 − 2S3,2 + 14S5 + S4 + S3 + S2
3 6 3 24
23 3 4 59 4
+ 16CF 2 nf − ζ3 + S−3,1 − S2 + S−4
16 2 3 36 3
20 20 8 8 4
− S−3 + S1 − S1,−2 − S1,1 − S1,2
9 9 3 3 3
25 4 1 67 4 4
+ N+ S3 − S3,1 − S4 − (N+ − 1) S2 − S2,1 + S3
9 3 3 36 3 3
325 2 32 4
+ (N− + N+ ) S1 ζ3 − S1 − S1,−3 + S1,−2 − S1,−2,1
144 3 9 3
4 16 4 11 2 10 1
+ S1,1 + S1,2 − S1,3 + S2 − S2,−2 + S2,1 + S4
3 9 3 18 3 9 2
2 8
− S2,2 − S3
3 9
29 15
+ 16CF 3 12S−5 − − ζ3 + 9S−4 − 24S−4,1 − 4S−3,−2 + 6S−3,1
32 2
− 4S−3,2 + 3S−2 + 25S3 − 12S−2 ζ3 − 12S−2,−3 + 24S−2,1,−2
67
− 52S1,−3 + 4S1,−2 + 48S1,−2,1 − 4S3,−2 + S2 − 17S4
2
31
+ (N− + N+ − 2) 6S1 ζ3 − S1 + 35S1,1 − 12S1,1,−2 + S1,2
8
+ 10S2,−2 + S2,1 + 2S2,2 − 2S3,1 − 3S5
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 111
+ (N− + N+ ) 23S1,−3 − 22S1,−4 + 32S1,−3,1 − 2S1,−2 − 8S1,−2,−2
flavours and a typical value αs = 0.2 for the strong coupling constant (recall that the terms
up to order αsn+1 are included at Nn LO). Numerically, the colour factors take the values
CF = 4/3, CA = 3 and d abc dabc /nc = 40/9. Note that the latter normalization is different
from that employed in Ref. [58].
The NNLO corrections are rather small under these circumstances, amounting to less
than 2% for N 2. At large N the anomalous dimensions behave as
ln N 1
γns(n−1)±,v
(N) = An (ln N + γe ) − Bn − Cn +O , (3.10)
N N
where γe is the Euler–Mascheroni constant and the coefficients are specified in the next
+ = γ + / ln N , also shown in Fig. 1, approaches a constant for N → ∞.
paragraph. Thus γ̃ns ns
(n)+ (n)+
The asymptotic results are indicated by replacing γ̃ns (N = 15) by γ̃ns (N → ∞) for
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 113
+
Fig. 1. The perturbative expansion of the anomalous dimension γns (N ) for four flavours at αs = 0.2. In the right
part the leading N -dependence for large N has been divided out, and the corresponding asymptotic limits are
indicated as discussed in the text.
the respective highest term included in the curves (e.g., for n = 2 at NNLO). Obviously
the approach to the asymptotic limit is very slow. Yet the results at N → ∞, which can be
derived much easier than the full N -dependence [73], do provide a reasonable first estimate
of the corrections.
The leading large-N coefficients An , which are also relevant for the soft-gluon
(threshold) resummation [34–37], are given by
A1 = 4CF ,
67 5
A2 = 8CF − ζ2 CA − nf ,
18 9
245 67 11 11 55
A3 = 16CF CA2 − ζ2 + ζ3 + ζ22 + 16CF2 nf − + 2ζ3
24 9 6 5 24
209 10 7 1
+ 16CF CA nf − + ζ2 − ζ3 + 16CF nf −
2
. (3.11)
108 9 3 27
The nf -independent contribution to the three-loop coefficient A3 is also a new result of
the present article. Inserting the numerical values of the ζ -function and the QCD colour
factors it reads A3 |nf =0 ∼
= 1174.898, in agreement with the previous numerical estimate
of Ref. [37]. The constants Bn can be read off directly from the terms with δ(1 − x) in
Eqs. (4.5), (4.6) and (4.9). Surprisingly, the coefficients Cn in Eq. (3.10), which are also
114 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
best determined using those x-space results, turn out to be related to the An by
4. Results in x-space
(n)±,s
The splitting functions Pns (x) are obtained from the N -space results of the previous
section by an inverse Mellin transformation, which expresses these functions in terms
of harmonic polylogarithms [62–64]. The inverse Mellin transformation exploits an
isomorphism between the set of harmonic sums for even or odd N and the set of harmonic
polylogarithms. Hence it can be performed by a completely algebraic procedure [45,64],
based on the fact that harmonic sums occur as coefficients of the Taylor expansion of
harmonic polylogarithms.
Our notation for the harmonic polylogarithms Hm1 ,...,mw (x), mj = 0, ±1 follows
Ref. [64] to which the reader is referred for a detailed discussion. The lowest-weight
(w = 1) functions Hm (x) are given by
1
lnw x, if m1 , . . . , mw = 0, . . . , 0,
Hm1 ,...,mw (x) = w!x (4.2)
0 dz fm1 (z)Hm2 ,...,mw (z), else
with
1 1
f0 (x) = , f±1 (x) = . (4.3)
x 1∓x
A useful short-hand notation is
and
67 11
(1)+
Pns (x) = 4CA CF pqq (x) − ζ2 + H0 + H0,0
18 6
14
+ pqq (−x)[ζ2 + 2H−1,0 − H0,0] + (1 − x)
3
17 11
+ δ(1 − x) + ζ2 − 3ζ3
24 3
5 1 2 1 2
− 4CF nf pqq (x) + H0 + (1 − x) + δ(1 − x) + ζ2
9 3 3 12 3
3
+ 4CF 2 2pqq (x) H1,0 − H0 + H2
4
− 2pqq(−x)[ζ2 + 2H−1,0 − H0,0 ]
3
− (1 − x) 1 − H0 − H0 − (1 + x)H0,0
2
3
+ δ(1 − x) − 3ζ2 + 6ζ3 , (4.6)
8
CA
(1)−
Pns (x) = Pns
(1)+
(x) + 16CF CF − pqq (−x)[ζ2 + 2H−1,0 − H0,0 ]
2
− 2(1 − x) − (1 + x)H0 . (4.7)
Here and in Eqs. (4.9)–(4.11) we suppress the argument x of the polylogarithms and use
5 167 1 25
+ δ(1 − x) − ζ2 + ζ22 + ζ3
4 54 20 18
5 69
+ 16CA CF 2 pqq (x) ζ3 − ζ22 − H−3,0 − 3H−2 ζ2 − 14H−2,−1,0
6 20
151 41 17
+ 3H−2,0 + 5H−2,0,0 − 4H−2,2 − H 0 + H 0 ζ2 − H 0 ζ3
48 12 2
13 23 2
− H0,0 − 4H0,0 ζ2 − H0,0,0 + 5H0,0,0,0 + H3 − 24H1ζ3
4 12 3
67 31
− 16H1,−2,0 + H1,0 − 2H1,0ζ2 + H1,0,0 + 11H1,0,0,0 + 8H1,1,0,0
9 3
67 11
− 8H1,3 + H4 + H2 − 2H2 ζ2 + H2,0 + 5H2,0,0 + H3,0
9 3
1 67 31
+ pqq (−x) ζ22 − ζ2 + ζ3 + 5H−3,0 − 32H−2 ζ2 − 4H−2,−1,0
4 9 4
31 31 9
− H−2,0 + 21H−2,0,0 + 30H−2,2 − H−1 ζ2 − 42H−1ζ3 + H0
6 3 4
− 4H−1,−2,0 + 56H−1,−1ζ2 − 36H−1,−1,0,0 − 56H−1,−1,2
134 31
− H−1,0 − 42H−1,0ζ2 − H3,0 + 32H−1,3 − H−1,0,0
9 6
31 13 29 67
+ 17H−1,0,0,0 + H−1,2 + 2H−1,2,0 + H0 ζ2 + H0 ζ3 + H0,0
3 12 2 9
89 31
+ 13H0,0ζ2 + H0,0,0 − 5H0,0,0,0 − 7H2 ζ2 − H3 − 10H4
12 6
133 167
+ (1 − x) + 4H0,0,0,0 − ζ3 − 2H0ζ3 − 2H−3,0 + H−2 ζ2
36 4
77 209
+ 2H−2,−1,0 − 3H−2,0,0 + H0,0,0 − H1 − 7H1 ζ2
4 6
14
+ 4H1,0,0 + H1,0
3
43 25
+ (1 + x) ζ2 − 3ζ22 + H−2,0 − 31H−1ζ2 − 14H−1,−1,0
2 2
13 55 1457
− H−1,0 + 24H−1,2 + 23H−1,0,0 + H0 ζ2 + 5H0,0ζ2 + H0
3 2 48
1025 155
− H0,0 − H2 + H2 ζ2 − 15H3 + 2H2,0,0 − 3H4
36 6
1 2 37 242
− 5ζ2 − ζ2 + 50ζ3 − 2H−3,0 − 7H−2,0 − H0 ζ3 − H0 ζ2 − H0
2 2 9
185 28
− 2H0,0ζ2 + H0,0 − 22H0,0,0 − 4H0,0,0,0 + H2 + 6H3
6 3
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 117
151 205 247 2 211 15
+ δ(1 − x) + ζ2 ζ3 − ζ2 − ζ2 + ζ3 + ζ5
64 24 60 12 2
245 67 12 1 1043
+ 16CA2 CF pqq (x) − ζ2 + ζ22 + ζ3 + H0 + H−3,0
48 18 5 2 216
3 31
+ 4H−2,−1,0 − H−2,0 − H−2,0,0 + 2H−2,2 − H0 ζ2 + 4H0ζ3
2 12
389
+ H0,0 − 2H2,0,0 − H0,0,0,0 + 9H1 ζ3 + 6H1,−2,0 − H1,0 ζ2
72
11 31
− H1,0,0 − 3H1,0,0,0 − 4H1,1,0,0 + 4H1,3 + H0,0,0
4 12
11
+ H3 + H4
12
67 11 11
+ pqq (−x) ζ2 − ζ22 − ζ3 − H−3,0 + 8H−2ζ2 + H−2,0
18 4 6
11
− 4H−2,0,0 − 3H−1,0,0,0 + H−1 ζ2 + 12H−1ζ3 − 16H−1,−1ζ2
3
67
+ 8H−1,−1,0,0 + 16H−1,−1,2 + H−1,0 − 8H−2,2 + 11H−1,0ζ2
9
11 11 3 1
+ H−1,0,0 − H−1,2 − 8H−1,3 − H0 − H0 ζ2 − 4H0 ζ3
6 3 4 6
67 31 11
− H0,0 − 3H0,0 ζ2 − H0,0,0 + H0,0,0,0 + 2H2ζ2 + H3 + 2H4
18 12 6
1883 1 1
+ (1 − x) − H0,0,0,0 + 11H1 − H−2,−1,0 + H−3,0
108 2 2
1 1 523 13 5
− H−2 ζ2 + H−2,0,0 + H0 + H0 ζ3 − H0,0 − H0,0,0
2 2 36 3 2
+ 2H1 ζ2 − 2H1,0,0
8
+ (1 + x) 8H−1 ζ2 + 4H−1,−1,0 + H−1,0 − 5H−1,0,0 − 6H−1,2
3
13 3 43 5 11 1
− ζ2 + ζ22 − ζ3 − H−2,0 − H0 ζ2 − H2 ζ2
3 8 4 2 2 2
5 1 3
− H0,0 ζ2 + 7H2 − H2,0,0 + 3H3 + H4
4 4 4
1 1 2 8 17 19 5
+ H0,0 ζ2 + ζ2 − ζ2 + ζ3 + H−2,0 − H0 + H0 ζ2 − H0 ζ3
2 4 3 2 2 2
13 5 1
+ H0,0 + H0,0,0 + H0,0,0,0
3 2 2
1657 281 1 97 5
− δ(1 − x) − ζ2 + ζ22 + ζ3 − ζ5
576 27 8 9 2
118 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
1 1 5
+ 16CF n2f pqq (x) H0,0 − + H0
18 3 3
13 1 17 5 1
+ (1 − x) + H0 − δ(1 − x) − ζ2 + ζ3
54 9 144 27 9
1 55 5
+ 16CF 2 nf pqq (x) 5ζ3 − 4H1,0,0 − + H 0 + H 0 ζ2
3 16 8
3 10 10
+ H0,0 − H0,0,0 − H1,0 − H2 − 2H2,0 − 2H3
2 3 3
2 5 3 10
+ pqq (−x) ζ2 − ζ3 + H−2,0 + 2H−1 ζ2 + H−1,0 + H−1,0,0
3 3 2 3
1 5
− 2H−1,2 − H0 ζ2 − H0,0 − H0,0,0 + H3
2 3
10 19 4 2 4
− (1 − x) + H0,0 − H1 + H1,0 + H2
9 18 3 3 3
4 25 1 2 7
+ (1 + x) H−1,0 − H0 + H0,0,0 + H0 + H0,0
3 24 2 9 9
4 23 5 29 17
+ H2 − δ(1 − x) − ζ2 − ζ22 + ζ3
3 16 12 30 6
9 2
+ 16CF 3 pqq (x) ζ − 2H−3,0 + 6H−2 ζ2 + 12H−2,−1,0
10 2
3 3 13
− 6H−2,0,0 − H0 − H0 ζ2 + H0 ζ3 + H0,0 − 2H0,0,0,0 + 8H1,3
16 2 8
+ 12H1ζ3 + 8H1,−2,0 − 6H1,0,0 − 4H1,0,0,0 + 4H1,2,0 − 3H2,0
+ 2H2,0,0 + 4H2,1,0 + 4H2,2 + 4H3,0 + 4H3,1 + 2H4
7 9
+ pqq (−x) ζ22 − ζ3 − 6H−3,0 + 32H−2ζ2 + 8H−2,−1,0 + 3H−2,0
2 2
− 26H−2,0,0 − 28H−2,2 + 6H−1 ζ2 + 36H−1 ζ3 + 8H−1,−2,0
− 48H−1,−1ζ2 + 40H−1,−1,0,0 + 48H−1,−1,2 + 40H−1,0ζ2 + 3H−1,0,0
3 3
− 22H−1,0,0,0 − 6H−1,2 − 4H−1,2,0 − 32H−1,3 − H0 − H0 ζ2
2 2
9
− 13H0ζ3 − 14H0,0ζ2 − H0,0,0 + 6H0,0,0,0 + 6H2 ζ2 + 3H3
2
+ 2H3,0 + 12H4
31
+ (1 − x) 2H−3,0 − + 4H−2,0,0 + H0,0 ζ2 − 3H0,0,0,0 + 35H1
8
5
+ 6H1 ζ2 − H1,0 + H2,0
2
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 119
37 2 93 81
+ (1 + x) ζ2 − ζ2 − ζ3 − 15H−2,0 + 30H−1ζ2 + 12H−1,−1,0
10 4 2
539 191
− 2H−1,0 − 26H−1,0,0 − 24H−1,2 − H0 − 28H0ζ2 + H0,0
16 8
85
+ 20H0,0,0 + H2 − 3H2,0,0 − 2H3,0 + 13H3 − H4
4
67
+ 4ζ2 + 33ζ3 + 4H−3,0 + 10H−2,0 + H0 + 6H0ζ3 + 19H0ζ2
2
− 25H0,0 − 17H0,0,0 − 2H2 − H2,0 − 4H3
29 9 18 2 17
+ δ(1 − x) − 2ζ2 ζ3 + ζ2 + ζ2 + ζ3 − 15ζ5 . (4.9)
32 8 5 4
The x-space counterpart of Eq. (3.8) for the evolution of the ‘minus’ combinations (2.2) is
given by
CA 134
(2)−
Pns (x) = Pns
(2)+
(x) + 16CA CF CF − pqq (−x) ζ2 − 4ζ22 − 11ζ3
2 9
22 44
− 4H−3,0 + 32H−2 ζ2 + H−2,0 − 16H−2,0,0 − 32H−2,2 + H−1 ζ2
3 3
+ 48H−1ζ3 − 64H−1,−1ζ2 + 32H−1,−1,0,0 + 64H−1,−1,2
268 22 44
+ H−1,0 + 44H−1,0ζ2 + H−1,0,0 − 12H−1,0,0,0 − H−1,2
9 3 3
2 134
− 32H−1,3 − 3H0 − H0 ζ2 − 16H0ζ3 − H0,0 − 12H0,0ζ2
3 9
31 22
− H0,0,0 + 4H0,0,0,0 + 8H2 ζ2 + H3 + 8H4
3 3
367 1 2
+ (1 − x) + ζ2 + 2H−3,0 − 2H−2 ζ2 − 4H−2,−1,0 − 10H−2,0
18 2
140
− 2H0,0 + 2H−2,0,0 + 2H0 ζ3 + H0,0 ζ2 − H0,0,0,0 + 8H1ζ2 + H1
3
26
+ (1 + x) 32H−1ζ2 − 18ζ2 − 23ζ3 + H−1,0 − 16H−1,0,0
3
481 70
− 32H−1,2 − H0 − 29H0 ζ2 + 5H0,0,0 + 24H3 + H2
18 3
− 2ζ2 − 2ζ3 + 32H0 + 14H0ζ2 + 2H0,0,0 − 16H3
CA 20 4 8
+ 16CF nf CF − pqq (−x) 2ζ3 − ζ2 − H−2,0 − H−1 ζ2
2 9 3 3
40 4 8 2 20
− H−1,0 − H−1,0,0 + H−1,2 + H0 ζ2 + H0,0
9 3 3 3 9
4 4
+ H0,0,0 − H3
3 3
120 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
61 8 8 41 4
+ (1 − x) − H1 + (1 + x) 2H0,0 − H−1,0 + H0 − H2
9 3 3 9 3
C A
+ 16CF 2 CF − pqq (−x)[9ζ3 − 7ζ22 + 12H−3,0 − 64H−2ζ2
2
− 16H−2,−1,0 − 6H−2,0 + 52H−2,0,0 + 56H−2,2 − 12H−1 ζ2
− 72H−1ζ3 − 16H−1,−2,0 + 96H−1,−1ζ2 − 80H−1,−1,0,0
− 96H−1,−1,2 − 80H−1,0ζ2 − 6H−1,0,0 + 44H−1,0,0,0 + 12H−1,2
+ 8H−1,2,0 + 64H−1,3 + 3H0 + 3H0 ζ2 + 26H0ζ3 + 28H0,0ζ2
+ 9H0,0,0 − 12H0,0,0,0 − 12H2ζ2 − 6H3 − 4H3,0 − 24H4]
− (1 − x)[15 + 8H−3,0 + 8H−2,0,0 + 61H0 + 6H0 ζ3 + 2H0,0ζ2
− 6H0,0,0,0 + 12H1ζ2 + 60H1 + 8H1,0]
+ (1 + x)[24ζ2 + 57ζ3 + 10H−2,0 − 48H−1 ζ2 − 4H−1,0 + 40H−1,0,0
+ 48H−1,2 + 59H0ζ2 − 22H0,0 − 35H0,0,0 − 22H2 − 4H2,0 − 44H3]
+ 8ζ2 − 42ζ3 − 4H−2,0 + 42H0
− 38H0ζ2 + 14H0,0 − 16H2 + 26H0,0,0 + 24H3 . (4.10)
(2)s
Finally the Mellin inversion of γns (N) in Eq. (3.9) leads to the following result for the
(2)s
leading (third-order) difference Pns (x) of the ‘valence’ and ‘minus’ splitting functions:
d abc dabc 1 50 41 5
(2)s
Pns (x) = 16nf (1 − x) + ζ2 − ζ22 − H−3,0 + H−2 ζ2
nc 2 3 12 4
9 3 1
− H−2,0,0 + H3 + 2H−2,−1,0 + H0,0ζ2 − H1 ζ2
4 2 2
3 91
− H1,0,0 + H1
4 12
1 3 3 13 1
+ (1 + x) H−1,−1,0 − H−1 ζ2 + H0 − H−1,0 + H−1,0,0
2 2 4 6 2
3 9 29 41
+ 2H−1,2 − H−2,0 + H0 ζ2 + H0,0 + H2 − H2 ζ2
2 4 12 12
1 3
− H2,0,0 + H4
2 2
1 1
− + x 2 [3H−1 ζ2 + 2H−1,−1,0 − 2H−1,0,0 − 2H−1,2 + H1 ζ2 ]
3 x
1
+ x 2 [5ζ3 − 2H3 + 2H−2,0 + 4H0ζ2 − 2H0,0,0 + 2H1 ζ2 ]
3
91 9
+ H0 + ζ3 − ζ2 + ζ22 − H0 ζ3 − H0 ζ2 − 2H0,0ζ2
24 2
3 1 1
+ H0,0 − H0,0,0 + H0,0,0,0 + H−2,0 − H3 . (4.11)
8 4 2
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 121
Here m w nv represents all mergers of m w and nv in which the relative orders of the
elements of m w and n v are preserved. All algorithms for this algebraic procedure have
been coded in F ORM, some explicit examples are given in Refs. [64,76].
(n)±,v
The large-x behaviour of splitting functions Pns (x) reflects the large-N behaviour
of the corresponding anomalous dimensions in Eq. (3.10). Specifically, the (identical)
(2)±,v
large-x behaviour of Pns (x) is given by
(2)±,v A3
Px→1 (x) = + B3 δ(1 − x) + C3 ln(1 − x) + O(1). (4.13)
(1 − x)+
The constants A3 and C3 have been specified in Eqs. (3.11) and (3.12), respectively, while
the coefficients of δ(1 − x) are explicit in Eq. (4.9). At small x the splitting functions can
(n)±,s
be expanded in powers of ln x. For the three-loop non-singlet splitting functions Pns (x)
one finds
(2)i
Px→0 (x) = D0i ln4 x + D1i ln3 x + D2i ln2 x + D3i ln x + O(1). (4.14)
Generally, terms up to ln2k x occur at order αsk+1 . Keeping only the highest n + 1 of
these, one arrives at the Nn Lx small-x approximation. Like the large-x coefficients, these
(2)+
contributions can be readily extracted from our full results using Eq. (4.12). For Pns
we obtain
2
D0+ = CF3 ,
3
22 4
D1+ = CF2 CA − 4CF3 − CF2 nf ,
3 3
+ 121 472
D2 = − 30ζ2 CF CA +
2
+ 96ζ2 CF2 CA + [4 − 104ζ2]CF3
9 9
44 64 4
− CF CA nf − CF2 nf + CF n2f ,
9 9 9
3934 370
D3+ = − 92ζ2 CF CA2 + + 216ζ2 + 48ζ3 CF2 CA
27 9
1268
− [30 + 192ζ2 + 96ζ3]CF3 − − 8ζ2 CF CA nf
27
88 2 88
− CF nf + CF n2f , (4.15)
9 27
122 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
or, after inserting CA = 3 and CF = 4/3 and the numerical values of ζ2 and ζ3 ,
D0+ ∼
= 1.58025,
+∼
D = 29.6296 − 2.37037nf ,
1
D2+ ∼
= 295.042 − 32.1975nf + 0.592592n2f ,
D3+ ∼
= 1261.11 − 152.597nf + 4.345679n2f . (4.16)
(2)−
The corresponding coefficients for Pns are given by
10 3
D0− = −CF CA2 + 4CF2 CA − C ,
3 F
40 14 20 16
D1− = CF CA2 − CF2 CA − 4CF3 + CF2 nf − CF CA nf ,
9 9 9 9
152
D2− = [81 + 14ζ2 ]CF CA2 − + 96ζ2 CF2 CA − [60 − 104ζ2]CF3
3
196 80 4
− CF CA nf + CF2 nf + CF n2f ,
9 3 9
3442 100 1850 680
D3− = + ζ2 + 112ζ3 CF CA2 + − ζ2 − 336ζ3 CF2 CA
27 3 9 3
88 568 32 2
+ CF n2f − [286 − 192ζ2 − 224ζ3]CF3 + + CF nf
27 9 3
2252 8
− − ζ2 CF CA nf , (4.17)
27 3
and
D0− ∼
= 1.43210,
−∼
D = 35.5556 − 3.16049nf ,
1
D2− ∼
= 399.205 − 39.7037nf + 0.592592n2f ,
∼ 1465.93 − 172.693nf + 4.345679n2 .
D3− = (4.18)
f
The coefficients D0± of the leading logarithms in Eqs. (4.15) and (4.17) agree with the
predictions in Ref. [31] based of the resummation of Ref. [30]. Finally the small-x
(2)s
expansion of Pns (x) reads
d abc dabc 1
D0s = nf ,
nc 3
abc d
d abc 2
D1 =
s
nf − ,
nc 3
d abc dabc
D2s = nf (18 − 10ζ2),
nc
d abc dabc
D3s = nf (56 + 2ζ2 − 16ζ3), (4.19)
nc
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 123
(2) +
Fig. 2. The nf -independent three-loop contribution P+,0 (x) to the splitting function Pns (x), multiplied by (1−x)
for display purposes. Also shown in the left part is the uncertainty band derived in Ref. [29] from the lowest six
even-integer moments [24–26]. In the right part our exact result is compared to the small-x approximations
defined in Eq. (4.14) and the text below it.
or, inserting the QCD value of 40/9 for the group factor d abc dabc /nc ,
D0s ∼
= +1.48148nf , D1s ∼
= −2.96296nf ,
s ∼ s ∼
D = +6.89182nf ,
2 D = +178.030nf .
3 (4.20)
(2)±
The n0f and n1f parts of the functions Pns (x) in Eqs. (4.9) and (4.10) are separately
shown in Figs. 2–4 together with the approximate expressions derived in Ref. [29]
mainly from the integer-N results of Refs. [24–26]. Also shown for the non-fermionic
contributions in Figs. 2 and 3 are the successive approximations by small-x logarithms as
defined in Eq. (4.14) and the text below it. As can be seen from Eqs. (4.16) and (4.18),
(2)±
the coefficients of lnk x for Pns increase sharply with decreasing power k. Consequently
the shapes of the full results in Figs. 2 and 3 are reproduced only after all logarithmically
enhanced terms have been included. Even then the small-x approximations underestimate
(2)+ (2)−
the complete results by factors as large as 2.7 and 2.0, respectively, for Pns and Pns at
−4
x = 10 , rendering the small-x expansion (4.14) ineffective for any practically relevant
value of x. Keeping only the Lx (ln4 x) or NLx (ln4 x and ln3 x) contributions leads
to a reasonable description only at extremely small values of x. Therefore, meaningful
estimates of higher-order effects based on resumming leading (and subleading) logarithms
in the small-x limit appear to be difficult.
(2)s
The new three-loop n1f contribution Pns with the colour structure d abc dabc /nc is
graphically displayed in Fig. 5 for nf = 1. Rather unexpectedly, also this function behaves
124 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
−
Fig. 3. As Fig. 2, but for the splitting function Pns (x). The first seven odd moments underlying the previous
approximations [29] also shown in the left part have been computed in Ref. [26].
(2) ±
Fig. 4. The n1f three-loop contributions P±,1 (x) to the splitting functions Pns (x), compared to the uncertainty
bands of Ref. [29] based on the integer moments calculated in Refs. [24–26].
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 125
like ln4 x for x → 0, and here the leading small-x terms do indeed provide a reasonable
approximation. In fact, this function dominates the small-x behaviour of the non-singlet
(2)±
splitting functions, for nf = 4 being, for example, about 7 times larger than Pns (x) at
x = 10−4 . The presence of a (dominant) leading small-x logarithm in a term unpredictable
from lower-order structures appears to call into question the very concept of the small-x
resummation of the double logarithms αsk+1 ln2k x.
(2)i
In view of the length and complexity of the exact expressions for the functions Pns (x),
it is useful to have at ones disposal also compact approximate representations involving,
besides powers of x, only simple functions like the ‘+’-distribution and the end-point
logarithms
5. Numerical implications
In this section we illustrate the effect of our new three-loop splitting functions
(2)±,v ±,v
Pns (x) on the evolution (2.6) of the non-singlet combinations qns (x, µ2f ) of the
quark and antiquark distributions. For all figures we employ the same schematic, but
characteristic model distribution,
±,v
xqns x, µ20 = x a (1 − x)b (5.1)
with
a = 0.5, b = 3, (5.2)
facilitating a direct comparison of the various splitting functions contributing to Eq. (2.6).
For the same reason the reference scale is specified by an order-independent value for the
strong coupling constant usually chosen as
αs µ20 = 0.2. (5.3)
This value corresponds to µ20 25–50 GeV2 for αs (MZ2 ) = 0.114–0.120 beyond the
leading order, a scale region relevant for deep-inelastic scattering both at fixed-target
experiments and, for much smaller x, at the ep collider HERA. Our default for the number
of effectively massless flavours is nf = 4. The normalization of qns i is irrelevant for our
10−5 x 10−2 . The NNLO effects are even smaller for qns + at small x, but considerably
larger for qns v at x < 10−3 . For example, at x 10−4 , where P (2)v (x) exceeds P (2)− (x)
ns ns
by a factor of about 8 as discussed in the paragraph above Eq. (4.21), the ratio of the
corresponding corrections in Fig. 7 amounts to 2.5. Recall that the scale derivatives (2.6) do
not probe the splitting functions locally in x due to the presence of the Mellin convolution.
The numerical values for q̇ns v (x, µ2 ) are presented in Table 1 for four characteristic
0
values of x. Also illustrated in this table is the dependence of the results on the shape of
the initial distribution, the number of flavours and the value of the strong coupling constant.
The relative corrections are rather weakly dependent of the large-x power b in Eq. (5.1).
They increase at small-x with increasing small-x power a, i.e., with decreasing size of qns v.
abc s
At large x, where the nf d dabc /nc contribution Pns is negligible, the NNLO corrections
decrease with increasing nf . At small-x this decrease is overcompensated in q̇ns v by the
−3
effect of Pns . Except for very small momentum fractions x 10 (where the non-singlet
s
quark densities play a minor role for most important observables) the NNLO corrections
amount to 15% or less even for a strong coupling constant as large as αs = 0.5. Hence
the non-singlet evolution at intermediate and large x appears to remain perturbative down
to very low scales as used in the phenomenological analyses of Refs. [77,78] and in non-
perturbative studies of the initial distributions like those of Refs. [38–43].
128 S. Moch et al. / Nuclear Physics B 688 (2004) 101–134
+
Fig. 6. The perturbative expansion of the logarithmic scale derivative d ln qns /d ln µ2f for a characteristic
+
non-singlet quark distribution xqns = x 0.5 (1 − x)3 at the standard scale µr = µf .
−,v
Fig. 7. As Fig. 6, but for the scale derivatives of the two other non-singlet combinations qns .
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 129
Table 1
The LO, NLO and NNLO logarithmic derivatives q̇ns v ≡ d ln q v /d ln µ2 at four representative values of x,
ns f
together with the ratios rn = Nn LO/Nn−1 LO − 1 for the default input parameters specified in the first paragraph
of this section and some variations thereof
x LO NLO NNLO r1 r2 r2 /r1
default (Fig. 7)
10−4 6.546 × 10−2 8.424 × 10−2 9.163 × 10−2 0.287 0.088 0.31
0.002 5.632 × 10−2 6.875 × 10−2 7.041 × 10−2 0.221 0.024 0.11
0.25 −5.402 × 10−2 −6.331 × 10−2 −6.457 × 10−2 0.172 0.020 0.12
0.75 −1.949 × 10−1 −2.189 × 10−1 −2.222 × 10−1 0.123 0.015 0.12
a = 0.8
10−4 1.660 × 10−1 2.351 × 10−1 2.818 × 10−1 0.417 0.198 0.48
0.002 1.249 × 10−1 1.583 × 10−1 1.650 × 10−1 0.268 0.042 0.16
0.25 −4.352 × 10−2 −5.171 × 10−2 −5.283 × 10−2 0.188 0.022 0.12
0.75 −1.930 × 10−1 −2.168 × 10−1 −2.200 × 10−1 0.123 0.015 0.12
b=5
10−4 6.474 × 10−2 8.278 × 10−2 8.917 × 10−2 0.279 0.077 0.28
0.002 5.324 × 10−2 6.432 × 10−2 6.546 × 10−2 0.208 0.018 0.09
0.25 −7.835 × 10−2 −9.022 × 10−2 −9.180 × 10−2 0.151 0.018 0.12
0.75 −2.300 × 10−1 −2.580 × 10−1 −2.619 × 10−1 0.122 0.015 0.12
nf = 3
10−4 6.546 × 10−2 8.480 × 10−2 9.187 × 10−2 0.295 0.083 0.28
0.002 5.632 × 10−2 6.942 × 10−2 7.174 × 10−2 0.233 0.033 0.14
0.25 −5.402 × 10−2 −6.406 × 10−2 −6.588 × 10−2 0.186 0.028 0.15
0.75 −1.949 × 10−1 −2.219 × 10−1 −2.269 × 10−1 0.139 0.023 0.16
nf = 3 and αs = 0.5
10−4 1.636 × 10−1 2.845 × 10−1 3.949 × 10−1 0.739 0.388 0.53
0.002 1.408 × 10−1 2.227 × 10−1 2.589 × 10−1 0.581 0.163 0.28
0.25 −1.350 × 10−1 −1.978 × 10−1 −2.262 × 10−1 0.465 0.144 0.31
0.75 −4.871 × 10−1 −6.563 × 10−1 −7.346 × 10−1 0.347 0.119 0.34
+ +
Fig. 8. The dependence of the NLO and NNLO predictions for q̇ns ≡ d ln qns /d ln µ2f on the renormalization
scale µr for six typical values of x. The initial conditions are as in Fig. 5.
reduced by including the third-order corrections over the full x-range. At NNLO both
the points of fastest apparent convergence and the points of minimal µr -sensitivity,
+ /∂µ = 0, are rather close to the ‘natural’ choice µ = µ for the renormalization
∂ q̇ns r r f
scale.
The relative scale uncertainties of the average results, conventionally estimated by
i (x, µ2 = 1 µ2 · · · 4µ2 )] − min[q̇ i (x, µ2 = 1 µ2 · · · 4µ2 )]
max[q̇ns r 4 f f ns r 4 f f
i
q̇ns ≡ (5.5)
i (x, µ2 = 1 µ2 · · · 4µ2 )]|
2|average[q̇ns r 4 f f
is shown in Fig. 9 for all three cases i = ±, v. These uncertainty estimates amount to 2%
or less except for x 10−3 , an improvement by more than a factor of three with respect
to the corresponding NLO results. Taking into account also the apparent convergence of
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 131
Fig. 9. The renormalization scale uncertainty of the NLO and NNLO predictions for the scale derivative of qns i ,
i = ±, V , as obtained from the quantity q̇ns defined in Eq. (5.5). Here and in Figs. 6 and 7 the spikes close to
i
x = 0.1 reflect the sign-change of q̇ns
i and do not constitute appreciable absolute corrections and uncertainties.
the series in Figs. 6 and 7, it is not unreasonable to expect that the effect of the four-
loop non-singlet splitting functions—which most likely will remain uncalculated for quite
some time—will be less than 1% for x > 10−3 . This expectation is consistent with the
(3)i
Padé estimates of Pns employed in Ref. [83] for the N3 LO large-x evolution of the
deep-inelastic structure functions F2 and F3 . At very small values of x the higher-order
corrections will presumably be considerably larger.
6. Summary
together with other new features [53], in the symbolic manipulation program F ORM [52]
which we have employed to handle the almost prohibitively large intermediate expressions.
Our results have been presented in both Mellin-N and Bjorken-x space, in the latter case
we have also provided easy-to-use accurate parametrizations. Our results agree with all
partial results available in the literature, in particular we reproduce the lowest seven even-
or odd-integer moments computed before [24–26]. We also agree with the resummation
predictions [30,31] for the leading small-x logarithms ln4 x of the splitting functions
+ (x) and P − (x) governing the evolution of flavour differences of quark–antiquark sums
Pns ns
and differences. However, an unpredicted term of the same size is found also for the new
d abc dabc /nc contributions Pns s to the splitting function for the total valence distribution.
At large x we find that the coefficient of the leading integrable term ln(1 − x) at order n
is proportional to the coefficient of the (only) ‘+’-distribution 1/(1 − x)+ at order n − 1,
a result that seems to point to a yet unexplored structure.
We have investigated the numerical impact of the three-loop (NNLO) contributions on
the evolution of the various non-singlet densities. The effect of the new contribution Pns s (x)
is very small at√large x but rises sharply towards x → 0, reaching 10% for a standard
Regge-inspired x initial distributions at x 10−5 . At x > 10−3 , on the other hand, the
perturbative expansions for the scale dependences d ln qns (x, µ2f )/d ln µ2f appear to be
very well convergent. For αs = 0.2, for example, the NNLO corrections amount to 2% or
less for four flavours, a factor of about 8 less than the NLO contributions. Also the variation
of the renormalization scale leads to effects of about ±2% at NNLO in this region of x.
Corrections of this size are comparable to the dependence of the predictions on the number
of quark flavours, rendering a proper treatment of charm effects rather important even for
large-x non-singlet quantities, see Refs. [84,85] and references therein.
F ORM files of our results, and F ORTRAN subroutines of our exact and approximate
x-space splitting functions can be obtained from the preprint server http://arXiv.org by
downloading the source. Furthermore they are available from the authors upon request.
Acknowledgements
The preparations for this calculation have been started by J.V. following a suggestion
by S.A. Larin. For stimulating discussions during various stages of this project we
would like to thank, in chronological order, S.A. Larin, F.J. Yndurain, E. Remiddi,
S. Moch et al. / Nuclear Physics B 688 (2004) 101–134 133
References
[42] O. Schroeder, H. Reinhardt, H. Weigel, Nucl. Phys. A 651 (1999) 174, hep-ph/9902322.
[43] H. Weigel, E. Ruiz Arriola, L.P. Gamberg, Nucl. Phys. B 560 (1999) 383, hep-ph/9905329.
[44] D.I. Kazakov, A.V. Kotikov, Nucl. Phys. B 307 (1988) 721;
D.I. Kazakov, A.V. Kotikov, Nucl. Phys. B 345 (1990) 299, Erratum.
[45] S. Moch, J.A.M. Vermaseren, Nucl. Phys. B 573 (2000) 853, hep-ph/9912355.
[46] S.G. Gorishnii, et al., Comput. Phys. Commun. 55 (1989) 381.
[47] S.A. Larin, F.V. Tkachev, J.A.M. Vermaseren, NIKHEF-H-91-18.
[48] J.A.M. Vermaseren, S. Moch, Nucl. Phys. B (Proc. Suppl.) 89 (2000) 131, hep-ph/0004235.
[49] S. Moch, J.A.M. Vermaseren, A. Vogt, Nucl. Phys. B 646 (2002) 181, hep-ph/0209100.
[50] J.A.M. Vermaseren, S. Moch, A. Vogt, Nucl. Phys. B (Proc. Suppl.) 116 (2003) 100, hep-ph/0211296.
[51] P. Nogueira, J. Comput. Phys. 105 (1993) 279.
[52] J.A.M. Vermaseren, math-ph/0010025.
[53] J.A.M. Vermaseren, Nucl. Phys. B (Proc. Suppl.) 116 (2003) 343, hep-ph/0211297.
[54] G. ’t Hooft, M. Veltman, Nucl. Phys. B 44 (1972) 189.
[55] C.G. Bollini, J.J. Giambiagi, Nuovo Cimento B 12 (1972) 20.
[56] J.F. Ashmore, Lett. Nuovo Cimento 4 (1972) 289.
[57] G.M. Cicuta, E. Montaldi, Lett. Nuovo Cimento 4 (1972) 329.
[58] T. van Ritbergen, A.N. Schellekens, J.A.M. Vermaseren, Int. J. Mod. Phys. A 14 (1999) 41, hep-ph/9802376.
[59] J.A.M. Vermaseren, Int. J. Mod. Phys. A 14 (1999) 2037, hep-ph/9806280.
[60] J. Blümlein, S. Kurth, Phys. Rev. D 60 (1999) 014018, hep-ph/9810241.
[61] S. Moch, P. Uwer, S. Weinzierl, J. Math. Phys. 43 (2002) 3363, hep-ph/0110083.
[62] A.B. Goncharov, Math. Res. Lett. 5 (1998) 497, available at http://www.math.uiuc.edu/K-theory/0297.
[63] J.M. Borwein, et al., math.CA/9910045.
[64] E. Remiddi, J.A.M. Vermaseren, Int. J. Mod. Phys. A 15 (2000) 725, hep-ph/9905237.
[65] S. Moch, J.A.M. Vermaseren, Nucl. Phys. B (Proc. Suppl.) 89 (2000) 137, hep-ph/0006053.
[66] G. ’t Hooft, Nucl. Phys. B 61 (1973) 455.
[67] W.A. Bardeen, et al., Phys. Rev. D 18 (1978) 3998.
[68] S.A. Larin, J.A.M. Vermaseren, Phys. Lett. B 259 (1991) 345.
[69] J.A.M. Vermaseren, A. Vogt, S. Moch, in preparation.
[70] K.G. Chetyrkin, F.V. Tkachev, Nucl. Phys. B 192 (1981) 159.
[71] G. Passarino, M. Veltman, Nucl. Phys. B 160 (1979) 151.
[72] J.A. Gracey, Phys. Lett. B 322 (1994) 141, hep-ph/9401214.
[73] C.F. Berger, Phys. Rev. D 66 (2002) 116002, hep-ph/0209107.
[74] L. Lewin, Polylogarithms and Associated Functions, North-Holland, New York, 1981.
[75] T. Gehrmann, E. Remiddi, Comput. Phys. Commun. 141 (2001) 296, hep-ph/0107173.
[76] J. Blümlein, hep-ph/0311046.
[77] M. Glück, E. Reya, A. Vogt, Z. Phys. C 67 (1995) 433.
[78] M. Glück, E. Reya, A. Vogt, Eur. Phys. J. C 5 (1998) 461, hep-ph/9806404.
[79] W.E. Caswell, Phys. Rev. Lett. 33 (1974) 244.
[80] D.R.T. Jones, Nucl. Phys. B 75 (1974) 531.
[81] O.V. Tarasov, A.A. Vladimirov, A.Y. Zharkov, Phys. Lett. B 93 (1980) 429.
[82] S.A. Larin, J.A.M. Vermaseren, Phys. Lett. B 303 (1993) 334, hep-ph/9302208.
[83] W.L. van Neerven, A. Vogt, Nucl. Phys. B 603 (2001) 42, hep-ph/0103123.
[84] E. Laenen, S. Riemersma, J. Smith, W.L. van Neerven, Nucl. Phys. B 392 (1993) 162.
[85] A. Chuvakin, J. Smith, W.L. van Neerven, Phys. Rev. D 61 (2000) 096004, hep-ph/9910250.
[86] D.J. Broadhurst, A.L. Kataev, C.J. Maxwell, hep-ph/0403037.
[87] G.P. Korchemsky, Mod. Phys. Lett. A 4 (1989) 1257.
[88] J. Kodaira, L. Trentadue, Phys. Lett. B 112 (1982) 66.
Nuclear Physics B 688 (2004) 135–164
www.elsevier.com/locate/npe
Abstract
We compute the vacuum polarization on the lattice in quenched QCD using non-perturbatively
improved Wilson fermions. Above Q2 of about 2 GeV2 the results are very close to the predictions
of perturbative QCD. Below this scale we see signs of non-perturbative effects which we can describe
by the use of dispersion relations. We use our results to estimate the light quark contribution to the
muon’s anomalous magnetic moment. We find the result 446(23) × 10−10 , where the error only
includes statistical uncertainties. Finally we make some comments on the applicability of the operator
product expansion to our data.
2004 Elsevier B.V. All rights reserved.
0550-3213/$ – see front matter 2004 Elsevier B.V. All rights reserved.
doi:10.1016/j.nuclphysb.2004.03.026
136 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
1. Introduction
and Q2 ≡ −q 2 (so that Q2 > 0 for spacelike momenta, Q2 < 0 for timelike). Π can be
computed on the lattice for spacelike momenta Q2 > 0.
Π can also be calculated in perturbation theory. Π has to be additively renormalized,
even the one-loop diagram (with no gluons involved) is logarithmically divergent. This
renormalization implies that the value of Π can be shifted up and down by a constant
depending on scheme and scale without any physical effects. However, the Q2 dependence
of Π(Q2 ) is physically meaningful, and it must be independent of renormalization scheme
or regularization.
Experimentally Π can be calculated from data for the total cross section of e+ e− →
hadrons with theoretical predictions of QCD by means of the dispersion relations [1]
n n
∞
2 (−1) d R(s)
2
12π Q Π Q 2
=Q
2
ds , (3)
n! d Q2 (s + Q2 )n+1
4m2π
where
σe+ e− →hadrons(s) 3s
R(s) = = 2
σe+ e− →hadrons(s). (4)
σe+ e− →µ+ µ− (s) 4παem
The first derivative of the vacuum polarization (the n = 1 term in Eq. (3)) is referred to as
the Adler D-function [2]:
dΠ(Q2 )
D Q2 = −12π 2 Q2 . (5)
dQ2
The anomalous magnetic moment of the muon (g − 2)µ can be calculated to very high
order in QED (5 loop), and measured very precisely. (g − 2)µ is more sensitive to high-
energy physics than (g − 2)e , by a factor m2µ /m2e , so it is a more promising place to look
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 137
Fig. 1. A typical Feynman diagram contributing to the CΠ part of the vacuum polarization Π is shown as
diagram (a). A diagram contributing to the AΠ part of Π is shown in (b). In this paper we only consider diagrams
of the first type, with both photons attached to the same quark line.
for signs of new physics, but to identify new physics we need to know the conventional
contributions very accurately. QED perturbative calculations take good account of muon
and electron loops, but at the two-loop level quarks can be produced, which in turn will
produce gluons. The dominant contribution comes from photons with virtualities ∼ m2µ ,
which is a region where QCD perturbation theory will not work well.
Π(Q2 ) can be split into two contributions with a different dependency on the quark
charges, as shown in Fig. 1,
2
−12π 2 Π Q2 = ef CΠ µ2 , Q2 , mf + ef ef AΠ µ2 , Q2 , mf , mf . (6)
f f,f
The CΠ term begins with a tree-level term which is O(αs0 ), while the first contribution to
the AΠ term is O(αs3 ). Furthermore, if flavour SU (3) is a good symmetry, the contribution
to AΠ from the three light flavours (u, d, s) cancels because eu + ed + es = 0, and the only
surviving contributions to AΠ come when both f and f are heavy quarks (c, b, t). In this
paper we will concentrate on the CΠ term, both because it is larger, and because it is much
easier to measure on the lattice.
For large (spacelike) momenta CΠ (µ2 , Q2 , mf ) can be expressed by means of the
operator product expansion, OPE
CΠ µ2 , Q2 , mf = c0 µ2 , Q2 , mf
c4F (µ2 , Q2 ) c4G (µ2 , Q2 ) αs 2
+ m f ψ̄f ψ f + G
(Q2 )2 (Q2 )2 π µν
c F (µ2 , Q2 ) 1
+ 4 m f ψ̄
f fψ + O , (7)
(Q2 )2
(Q2 )3
f
where c0 , c4F , c4 F and c4G are the Wilson coefficients and µ is the renormalization scale
parameter in some renormalization scheme such as MS. The f sum extends over the
flavours of the sea quarks (internal quark loops not directly connected to the photon lines).
138 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
The Wilson coefficients can be computed in perturbation theory, while the non-perturbative
physics is encoded in the condensates mf ψ̄f ψf (µ), αs G2µν (µ), etc.
Perturbatively the functions D, Eq. (5), and R, Eq. (4), are known to four loops for
massless quarks [3,4], while the coefficient c0 is known to three loops in the massive
case [5]. The coefficients c4F , c4 F and c4G are known up to O(αs2 ) [6–8] for massless quarks.
In this paper we shall compute Π(Q2 ) on the lattice and compare the result with current
phenomenology [9]. Preliminary results of this calculation were presented in [10].
The structure of this paper is as follows. After this Introduction we discuss the lattice
setup in Sections 2 and Appendix A. The results are presented in Section 3 and Appendix
C. In Section 4 and Appendix B we compare with perturbation theory. In Sections 5 and 6
we present a simple model which describes our lattice data well. In Section 7 we use this
model to give a lattice estimate of the hadronic contribution to the muon’s anomalous
magnetic moment. Finally in Section 8 we make some comments on the applicability of
the operator product expansion, and give our conclusions in Section 9.
2. Lattice setup
We work with non-perturbatively improved Wilson fermions. For the action and
computational details, as well as for results of hadron masses and decay constants, see [11–
13]. Full details of the vacuum polarization calculation can be found in [14].
To discretize the hadronic electro-magnetic current
Jµem (x) = ef Jµf (x) (8)
f
with
2 aqµ
q̂µ = sin . (11)
a 2
f f
On the lattice the photon self-energy Π is not simply given by Jµ (x)Jν (0). This is
because the lattice Feynman rules include vertices where any number of photons couple to
a quark line (not just the single-photon vertex of the continuum) [15]. We are therefore led
to define
with
(1)
Πµν (q̂) = a 4 eiq(x+a µ̂/2−a ν̂/2) Jµf (x)Jνf (0) , (13)
x
and
(2)
Πµν (q̂) = −aδµν Jµ(2) (0) , (14)
where
1
Jµ(2) (x) = ψ̄(x + a µ̂)(1 + γµ )Uµ† (x)ψ(x)
2
+ ψ̄(x)(1 − γµ )Uµ (x)ψ(x + a µ̂) . (15)
It then follows that (12) fulfils the Ward identity
q̂µ Πµν (q̂) = q̂ν Πµν (q̂) = 0. (16)
The conserved vector current (9) is on-shell as well as off-shell improved [16] in forward
matrix elements. In non-forward matrix elements, such as the vacuum polarization, it needs
to be further improved,
imp 1
Jµ (x) = Jµ (x) + ia cCVC ∂ν ψ̄(x)σµν ψ(x) , (17)
2
where we have some freedom in choosing ∂ν on the lattice. Any choice must preserve (16),
and ideally it should keep O(a 2 ) corrections small. Considering the fact that the conserved
vector current Jµ (x) ‘lives’ at x + a µ̂/2, the natural (or naive) choice for the improved
operator would be
imp 1
Jµ (q̂) = Jµ (q̂) + iacCVC eiq(x+a µ̂/2)
2 x
1
× Tµν (x + a ν̂) + Tµν (x + a µ̂ + a ν̂)
4a
− Tµν (x − a ν̂) − Tµν (x + a µ̂ − a ν̂)
1 aqµ
= Jµ (q̂) + cCVC cos sin(aqν ) eiqx Tµν (x), (18)
2 2 x
where Tµν (x) ≡ ψ̄(x)σµν ψ(x). Unfortunately it turns out that this choice introduces very
large O(a 2 Q2 ) errors into Π . We found that
imp aqν iqx
Jµ (q̂) ≡ Jµ (q̂) + cCVC sin e Tµν (x) (19)
2 x
was a better choice of improved current, because it makes the O(a 2 Q2 ) terms much
smaller, and so we will adopt this definition. The difference between the definitions (18)
and (19) is O(a 2 Q2 ), so we are free to choose whichever definition leads to the largest
scaling region in Q2 (both agree at small Q2 ).
In Fig. 2 we show the unimproved polarization tensor along with the two different
choices of improvement term in the case of free fermions. The best agreement with
140 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Fig. 2. The effects of improving the current operator. We compare various curves, all calculated in the free-field
case at the one-loop level with q ∝ (0, 0, 0, 1) and κ = 0.123 (corresponding to am = 0.065). The dotted curves
show the continuum one-loop result, shifted vertically to match with lattice results. The dashed line shows
the one-loop lattice result without any improvement of the current. The dot-dashed curve shows the result of
improving with (18) and the solid curve the result of improving with (19). The upper two curves differ by
O(a), the difference ∼ am ln a 2 q̂ 2 . The lower lattice curves are both O(a) improved, but we see that the naive
improvement (dot-dashed curve) has very large O(a 2 ) discretization errors. Improving with (19) produces a
lattice result much closer to the continuum result.
continuum physics comes from using (19), which is the prescription we will use in the
rest of this paper.
(1) (2)
In Appendix A we give explicit expressions for Πµν (q̂) and Πµν (q̂) in terms of the
link variables and the quark propagators.
3. Lattice calculation
To facilitate the extrapolation to the chiral and continuum limits, we have made
simulations at three different values of β with three or more different κ values at each β.
The parameters are listed in Table 1. The lattice data for the vacuum polarization for the
individual momenta and β and κ values are given in Appendix C.
First, let us discuss the value we use for cCVC . From the fermion-loop contribution to the
gluon propagator, computed in [18] to O(m) in lowest order of lattice perturbation theory,
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 141
Table 1
Parameters of the lattice simulation. The improve-
ment coefficient in the fermionic action was taken
to be cSW = 1.769 for β = 6.0, cSW = 1.614 for
β = 6.2 and cSW = 1.526 for β = 6.4 [17]
β κ V # Conf.
6.0 0.1333 164 97
6.0 0.1339 164 44
6.0 0.1342 164 44
6.0 0.1345 164 44
6.0 0.1345 324 16
6.2 0.1344 244 51
6.2 0.1349 244 38
6.2 0.1352 244 51
6.4 0.1346 324 50
6.4 0.1350 324 29
6.4 0.1352 324 50
Fig. 3. Chiral extrapolation of CΠ (q̂ 2 , m) at β = 6.0. The points shown range from a 2 q̂ 2 ≈ 3.0 to ≈ 6.5. All are
calculated with cCVC = 1.
142 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Fig. 4. The slope M(q̂ 2 ) for the improved (solid points, cCVC = 1) and unimproved (open points, cCVC = 0)
vector current as a function of a 2 q̂ 2 at β = 6.0. The dashed line shows the height of the plateau in one-loop
lattice perturbation theory, setting cCVC = 1 in Eq. (20).
we obtain
(0)
c0 q̂ 2 , am = 3 ln a 2 q̂ 2 − 3am(1 − cCVC ) ln a 2 q̂ 2 − 3.25275141(5)
+ 1.19541770(1)cCVC − 7.06903716(4)cCVC2
+ am 6.46270704(30) − 5.29413266(6)cCVC
+ 1.67389761(2)cCVC
2
+ O a 2 , m2 , (20)
where am = 1/2κ − 1/2κc . We use c0(0) to refer to the lowest order, g 0 , perturbative
contribution to c0 . As said earlier, the q̂ 2 dependence of Π and c0 is physical. Therefore
in an O(a)-improved calculation there should be no O(a) terms which depend on q̂ 2 . On
the other hand a constant added to c0 has no physical effect, so there is no objection to
constant terms of O(a) in Eq. (20). We see that there is an unphysical am ln(a 2 q̂ 2 ) term in
(20) unless cCVC = 1 + O(g 2 ). In the following we take cCVC = 1 and make the ansatz
CΠ q̂ 2 , am = CΠ q̂ 2 , am = 0 + am M q̂ 2 . (21)
In Fig. 3 we show the quark mass dependence of CΠ (q̂ 2 , am) for several momenta, which
justifies assuming a linear am dependence. In Fig. 4 we show the slope M(q̂ 2 ) with and
without improvement of the conserved vector current. The derivative M should tend to a
constant when Q2
m2 . With no improvement term (open points) we see that M has
a logarithmic dependence on q̂ 2 , corresponding to an unphysical am ln(a 2 q̂ 2 ) term in
c0 . We see that for cCVC = 1 the slope is, within error bars, independent of q̂ 2 down to
small momenta. This shows that the choice cCVC = 1 has eliminated or greatly reduced
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 143
Fig. 5. A check for finite volume effects. At β = 6.0, κ = 0.1345 we have measured the polarization tensor both
on a 164 lattice (white points) and on a 324 lattice (black points). The agreement is excellent, and we conclude
that finite volume effects are negligible.
the unphysical logarithm in c0 . We conclude that the tree-level value cCVC = 1 is a good
choice.1
The lattice sizes in Table 1 were chosen such that the physical volume is approximately
equal for all three β values. As we have done simulations at β = 6.0, κ = 0.1345 on
two different lattice volumes we check for finite volume effects. We have not found any,
see Fig. 5. This figure also shows that on the larger lattice we can measure the vacuum
polarization at much lower values of Q2 , which is an important advantage. The quark
boundary conditions are antiperiodic in all four directions, while gluon and photon fields
have periodic boundary conditions in all directions.
1 If c
CVC should ever be computed non-perturbatively, or in perturbation theory, it should be kept in mind that
we have used Eq. (19) as the definition of the improvement term.
144 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Fig. 6. The deviation of lattice data from continuum perturbation theory. The data are at β = 6.0, κ = 0.1345 on
a 164 lattice (white points) and on a 324 lattice (black points). Below Q2 ∼ 2 GeV2 there is a visible deviation
from perturbation theory. The dotted line shows the effect expected from a gluon condensate as expected in the
OPE.
We have seen that perturbation theory, even when supplemented with higher twist terms
from the operator product expansion, has difficulty in explaining the low-Q2 region of the
data. Can we understand this region in some other way?
The cross section ratio R(s) is given by the cut in the vacuum polarization, or in other
words there are dispersion relations which give the vacuum polarization if we know R(s).
One way of modelling Π is to make a model for R(s), and then calculate Π from R,
using the dispersion relation Eq. (3). This should be quite robust, since Π gets contributions
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 145
Table 2
Lattice data on the vector meson used as input for the dispersion
relation fits [11,13]. Numbers in italics have been interpolated
from nearby κ values
β κ amPS amV 1/fV
6.0 0.1333 0.4122(9) 0.5503(20) 0.2055(14)
6.0 0.1339 0.3381(15) 0.5017(40) 0.2217(15)
6.0 0.1342 0.3017(13) 0.4904(40) 0.2293(14)
6.0 0.1345 0.2561(15) 0.4701(90) 0.2387(90)
6.2 0.1344 0.3034(6) 0.4015(17) 0.2210(20)
6.2 0.1349 0.2431(6) 0.3663(27) 0.2403(24)
6.2 0.1352 0.2005(9) 0.3431(60) 0.2474(47)
6.4 0.1346 0.2402(8) 0.3107(16) 0.2252(21)
6.4 0.1350 0.1933(7) 0.2800(20) 0.2423(19)
6.4 0.1352 0.1661(10) 0.2613(40) 0.2448(35)
from a large range in s, little inaccuracies in the model R get washed out, and we can hope
that even a crude representation of R will give a good result for Π . We will keep the model
simple so that we can do all the integrals analytically.
Perturbatively
αs
R(s) = ef Nc 1 +
2
+ ··· , (23)
π
f
where Nc is the number of colours (3 in our case). We know that really the low s behaviour
of R is more complicated than that, it is dominated by the ρ(770), ω(782) and φ(1020)
mesons. Following [7], let us make the following model for R. We ignore the splitting
between the ρ and ω, which comes from the AΠ -type diagrams which we have dropped.
We also treat these mesons as narrow resonances, each contributing a δ function to R. The
continuum part of R takes a while to climb up to the value in Eq. (23). We will represent
this rise by a step function at some value s0 . So, our model is
R(s) = ef2 Aδ s − m2V + BΘ(s − s0 ) , (24)
f
where we would expect B to be slightly above 3 in order to match Eq. (23).
Using the dispersion relation, this R(s) translates into a vacuum polarization
A
CΠ Q2 = B ln a 2 Q2 + a 2 s0 − + K, (25)
Q2 + m2V
where K is a constant which is not determined from the dispersion relation, and which
never appears in any physical quantity. Again we identify the continuum quantity Q2 with
the lattice quantity q̂ 2 .
The constant A can be expressed in terms of the decay constant fV , which has been
measured on the lattice. The cross section for the production of a narrow vector resonance,
V , is [7]
ΓV →e+ e−
σe+ e− →V (s) = 12π 2 δ s − m2V . (26)
mV
146 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Table 3
The result of fit I, Eq. (25), including only data with a 2 q̂ 2 < 5
β κ V a 2 s0 B K χ2
6.0 0.1333 164 0.395(32) 3.13(6) −32.98(11) 4.3
6.0 0.1339 164 0.397(43) 3.13(8) −33.14(14) 2.2
6.0 0.1342 164 0.391(39) 3.10(8) −33.18(13) 3.0
6.0 0.1345 164 0.409(61) 3.11(9) −33.30(16) 2.5
6.0 0.1345 324 0.403(79) 3.12(10) −33.30(18) 0.7
6.2 0.1344 244 0.282(21) 3.10(5) −32.94(7) 1.9
6.2 0.1349 244 0.278(24) 3.07(5) −33.06(8) 1.9
6.2 0.1352 244 0.253(25) 3.06(5) −33.10(7) 2.2
6.4 0.1346 324 0.164(13) 3.06(3) −32.68(5) 0.8
6.4 0.1350 324 0.150(14) 3.03(4) −32.76(6) 0.6
6.4 0.1352 324 0.124(12) 3.02(3) −32.78(5) 1.2
so
m2V
A = 12π 2 . (37)
fV2
We first try making fits to the lattice data for the vacuum polarization CΠ using Eq. (25)
with B, K and s0 as free parameters. To avoid problems from lattice artefacts of O(a 2 q̂ 2 )
we have only used data with a 2 q̂ 2 < 5. We will call this simple ansatz fit I. A, the weight
of the vector meson contribution, is determined by Eq. (37). The vector meson masses
and decay constants which we use are shown in Table 2. They have been taken from [11,
13]. We can compare the values for B and K with the one-loop lattice perturbation theory
result, Eq. (20), which gives B = 3 and K = −27.38 + am8.53.
A typical result is shown in the top panel of Fig. 7. As can be seen from Table 3, the
χ 2 values for the fits are low in every case. Nevertheless we see that the data deviates
quite strongly from the fit when a 2 q̂ 2 is large. The small deviations from the fit at smaller
q̂ 2 are not random—they occur in the same place in every data set, and depend on the
direction of q. Points where q is near the diagonal direction (1, 1, 1, 1) lie lower than
points measured for momenta away from this diagonal direction.
To correct for this behaviour we add terms to our fit to parameterize O(a 2 ) lattice errors.
There are two possible terms at O(a 2 ):
4
2 2 2 µ qµ
a q and a . (38)
q2
The second term has only cubic symmetry, and can give rise to a dependence of CΠ on the
direction of q, which would not occur in the continuum. To include these terms we fit with
148 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Fig. 7. A comparison between fits I and II. The data are at β = 6.0, κ = 0.1333 on a 164 lattice. Fit I only uses
data with a 2 q̂ 2 < 5 (vertical dashed line). The second fit describes the data better.
Fig. 8. The deviation of lattice data from the fit of Table 4. The data are at β = 6.0, κ = 0.1345 on a 164 lattice
(white points) and on a 324 lattice (black points). The fit describes the data extremely well.
Table 4
The result of fit II, including all data
β κ V a 2 s0 B K χ2
6.0 0.1333 164 0.357(50) 3.00(12) −32.97(17) 1.52
6.0 0.1339 164 0.357(68) 2.99(16) −33.13(23) 0.69
6.0 0.1342 164 0.354(62) 2.97(15) −33.18(21) 0.83
6.0 0.1345 164 0.37(10) 2.97(17) −33.30(25) 0.70
6.0 0.1345 324 0.35(11) 2.92(33) −33.22(28) 0.06
6.0 0.1345 164 &324 0.358(77) 2.93(11) −33.22(15) 1.49
6.2 0.1344 244 0.262(35) 3.00(12) −32.94(12) 0.14
6.2 0.1349 244 0.252(40) 2.94(13) −33.05(14) 0.12
6.2 0.1352 244 0.232(38) 2.95(12) −33.11(12) 0.13
6.4 0.1346 324 0.156(21) 3.01(11) −32.69(7) 0.08
6.4 0.1350 324 0.144(23) 2.99(12) −32.78(9) 0.10
6.4 0.1352 324 0.117(18) 2.97(10) −32.81(7) 0.11
describes the direction dependence of the data. Fits with the ansatz (39) show practically
no deviation from the data. This can also be seen in Fig. 8, where we show the deviation in
the case where we have data on two lattice sizes.
In Fig. 9 we show the results converted into physical units. The agreement between
the different β values is fair, and the value of s0 in agreement with phenomenology (for
example, [23] finds s0 = 1.66(22) GeV2 in the I = 1 channel).
150 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Fig. 9. The threshold s0 (white points) compared with m2ρ (black points). Squares are for β = 6.0, triangles for
6.2, and circles for 6.4.
Fig. 10. The parameter B from Table 4. The β values are shown by the same symbols as in Fig. 9.
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 151
In Fig. 10 we show the values for the fit parameter B. The value we find is always very
close to the tree-level value 3.
With a good description of the vacuum polarization tensor in the low-Q2 region we can
make statements about phenomenologically interesting quantities such as the contribution
of the u, d and s-quarks to the muon anomalous magnetic moment [24].
Traditionally the hadronic contribution to the muon’s anomalous magnetic moment is
found from R(s) via a dispersion relation [25] (see [26] for a recent review)
2 ∞
αem ds s
aµhad = K R(s), (41)
3π 2 s m2µ
4m2π
where
1
s x 2 (1 − x)
K = . (42)
m2µ x 2 + (1 − x)s/m2µ
0
The muon mass, mµ , is 105.7 MeV. For us it is more useful to deform the contour and find
this same number from the vacuum polarization at spacelike momenta [27]
∞ 2
α2 dQ2 Q
aµhad = em2 2
F 2
12π 2 Π(0) − Π Q2
3π Q mµ
0
2 ∞ 2
αem dQ2 Q 2
= 2
ef F CΠ Q , mf − CΠ (0, mf )
3π 2 Q2 m2µ
f 0
+ annihilation, (43)
where the kernel F is
2
Q 16m4µ
F = . (44)
m2µ (Q2 )2 (1 + 1 + 4m2µ /Q2 )4 1 + 4m2µ /Q2
for each of our data sets. We then need to extrapolate (or interpolate) to the chiral limit (for
the u and d quarks) and to the strange quark mass. As can be seen in Fig. 11 the integral is
dominated by Q2 ∼ 3m2µ , so we need to extrapolate in Q2 , as our lowest measured value
152 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Fig. 11. The product F (Q2 /m2µ )[CΠ (Q2 , mf ) − CΠ (0, mf )] for the case β = 6.0, κ = 0.1345. The shaded
region shows the uncertainty.
is at Q2 = 0.17 GeV2 . We do our extrapolation of CΠ by using the results of fit II from the
previous section.
The results are shown in Fig. 12, plotted against m2PS (the square of the pseudoscalar
mass). We see that I depends strongly on the quark mass, with heavier quarks making
a smaller contribution (as one would expect). There is also some dependence on β. To
extrapolate to the continuum we fit the data with an ansatz of the form
I f = A1 + A2 a 2 + B1 + B2 a 2 m2π . (46)
This describes the data well (χ 2 /dof = 0.53), and gives the continuum limit shown by
the dashed line in Fig. 12. The physically relevant values of I f are at the physical pion
mass (M 2 = 0.019 GeV2 ) for the u and d quarks, and at the mass of a hypothetical s̄s
pseudoscalar meson with M 2 = 2m2K − m2π = 0.468 GeV2 for I s . The values at these
points are
Fig. 12. The results of the integral (45). The β values are shown by the same symbols as in Fig. 9. The dashed
line shows our extrapolation to the continuum limit, and the points shown with stars are our results for the light
(u, d) and strange quarks.
The shortfall in the value of aµhad can probably be attributed to quenching. In particular
a quenched calculation omits the process e+ e− → ππ , which is the only contribution
allowed at very low s.
What does the success of our fit function Eq. (24) tell us about the applicability and
usefulness of the OPE?
To answer this question let us look at the large Q2 behaviour of the formula (25):
The expansion has the same form as the OPE, at least at leading order when there are no
logarithmic corrections to the Wilson coefficients. We can now look at the higher twist
terms in the expansion, and estimate how important they are in comparison with the gluon
condensate contribution.
By looking at the first few terms we can relate our parameters s0 and A to the
condensates in the OPE. In the chiral limit there is no 1/Q2 term, so
A = Bs0 . (50)
154 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Fig. 13. Our simple model for the contribution of the u and d quarks to R(s).
This says that the area under the meson δ-function is the same as the grey area in Fig. 13,
a typical sum-rule style result. Substituting this into the 1/Q4 term we get
1 αs
Bs0 m2ρ − s0 = −π 2 GG. (51)
2 π
Again, a typical sum-rule result—if there were no gluon condensate, the meson would lie
at s0 /2, exactly in the middle of the gap. Putting m2ρ = 0.6 GeV2 (the physical value) and
(αs /π)GG = 0.012 GeV4 gives s0 = 1.26 GeV2 .
Next, in Fig. 14, we plot a comparison of the threshold model, Eq. (25), perturbation
theory (just 3 ln a 2 Q2 at this level) and perturbation theory plus the gluon condensate
contribution, which scales like 1/(Q2 )2 . The physically irrelevant constant K has been
set to 0 in all cases.
The curve from the threshold model looks physically sensible, it goes to a finite value
at Q2 = 0, which is what must happen if R(s) does not extend all the way down to s = 0.
However, even though we chose our parameters A and s0 so that the threshold model
would match the gluon condensate prediction at high Q2 , the two curves do not resemble
each other closely.
Let us now subtract out the perturbative piece, to see things more clearly, Fig. 15. We
can only find a region where the gluon condensate region is important when we concentrate
on the large Q2 region. If Fig. 15 is what the real world looks like, the prospects for getting
at the gluon condensate look poor. The gluon condensate term is overwhelmed by higher
order terms in the OPE before it has a chance to get significant.
What is the conclusion of this exercise? In the region where we can easily see deviations
from perturbation theory the OPE is not very useful, because many operators of very high
dimension are all contributing, not just the leading 1/(Q2 )2 contribution coming from the
gluon condensate. To see the gluon condensate uncontaminated by higher order operators
we would need to look in the region Q2 ∼ 10 GeV2 with very accurate data (error bars at
least 2 orders of magnitude smaller than in this paper). This is unfortunately not a realistic
prospect.
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 155
Fig. 14. A comparison of the threshold model, Eq. (25) (solid line), perturbation theory (dotted line) and
perturbation theory plus the gluon condensate contribution (dashed line). In this plot we have chosen the
physically irrelevant constant K to be 0 when all quantities are expressed in GeV.
Fig. 15. The same as Fig. 14, but with the perturbative piece subtracted. Note that in this figure we are
concentrating on the behaviour at rather large Q2 where non-perturbative effects are small.
9. Conclusions
We have computed the vacuum polarization in the limit of two light flavours in quenched
QCD for three β values (6.0, 6.2 and 6.4) and on lattices as large as 324 . It was important
to improve the action and the vector current. We found good agreement with three-
loop perturbation theory in the interval 2 Q2 20 GeV2 . The lattice data show some
156 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
indication of non-perturbative effects at the lower end of the Q2 range. We can describe
these very well with a model of R(s) which includes vector mesons and threshold effects.
In order to make firm predictions, we need more data at small Q2 and at smaller lattice
spacings with high statistics.
From the low-Q2 region of the vacuum polarization we can extract a lattice value for
aµhad , the hadronic contribution to the muon’s anomalous magnetic moment. We find the
value 446(23) × 10−10 which is of the right order of magnitude, though lower than the
physical value. Our estimate could be improved by using a larger lattice size, enabling us to
reach lower Q2 which would reduce uncertainties from extrapolation. Naturally, dynamical
calculations would be very interesting.
Acknowledgements
Appendix A
(1)
We denote the quark propagator from lattice points x to y by G(x, y). For Πµν (q̂) we
then obtain
a 4 iq(x+a µ̂/2−a ν̂/2)
(1)
Πµν (q̂) = e
4 x
× Tr (1 + γν )Uν† (0)γ5 G† (x + a µ̂, 0)γ5 (1 + γµ )Uµ† (x)G(x, a ν̂)
− (1 − γν )Uν (0)γ5G† (x + a µ̂, a ν̂)γ5 (1 + γµ )Uµ† (x)G(x, 0)
− (1 + γν )Uν† (0)γ5G† (x, 0)γ5(1 − γµ )Uµ (x)G(x + a µ̂, a ν̂)
+ (1 − γν )Uν (0)γ5G† (x, a ν̂)γ5 (1 − γµ )Uµ (x)G(x + a µ̂, 0)
a5
− cCVC eiq(x+a µ̂/2)q̂λ
4 x
× Tr (1 + γµ )Uµ† (x)G(x, 0)σνλγ5 G† (x + a µ̂, 0)γ5
− (1 − γµ )Uµ (x)G(x + a µ̂, 0)σνλ γ5 G† (x, 0)γ5
a5
+ cCVC eiq(x−a ν̂/2) q̂σ
4 x
× Tr (1 + γν )Uν† (0)γ5 G† (x, 0)γ5σµσ G(x, a ν̂)
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 157
− (1 − γν )Uν (0)γ5G† (x, a ν̂)γ5 σµσ G(x, 0)
a 6 2 iqx
− cCVC e q̂λ q̂σ
4 x
× Tr σµσ G(x, 0)σνλ γ5 G† (x, 0)γ5 , (A.1)
(2)
where the improvement term has been integrated by parts. For Πµν (q̂) we obtain
a
(2)
Πµν (q̂) = δµν Tr (1 + γν )Uν† (0)G(0, a ν̂) + (1 − γν )Uν (0)G† (0, a ν̂) . (A.2)
2
(1) (2)
To compute Πµν (q̂) and Πµν (q̂) we have to do a minimum of five inversions for each
gauge field configuration (and each κ value), which makes the calculation computationally
quite expensive, in particular on 324 lattices.
Appendix B
Perturbative results
Before we describe the lattice calculation in detail, we present here the perturbative
Wilson coefficients c0 , c4F and c4G . We will work in the quenched approximation, in which
contributions from sea quarks are neglected, and which corresponds to c4 F = 0.
We write
αs (µ2 )
c0 µ2 , Q2 , m = c0(0) µ2 , Q2 , m + CF c0(1) µ2 , Q2 , m
π
αs (µ2 ) 2 2 (2) 2 2 (2)
+ CF c0 µ , Q , m + CF CA c0 µ2 , Q2 , m
π
+ ···, (B.1)
with CF = 4/3 and CA = 3 for SU(3). These coefficients can be found in Eqs. (27)–(30)
of [5] (recall that Q2 ≡ −q 2 ). In the MS scheme the coefficients c0(0) , c0(1) , c0(2) and c0(2)
read
2 2
(0) 2 9 20 4 Q2 m̄2 4m̄ 1 1 Q2
c0 µ , Q , m̄ = −
2
− ln 2 − 8 2 + + ln ,
4 9 3 µ Q Q2 4 2 m̄2
(B.2)
(1) 2 2
c0 µ , Q , m̄
9 55 Q2 4m̄2 Q2
=− − 4ζ (3) − ln 2 − 2 4 − 3 ln 2
4 12 µ Q µ
2 2 2
4m̄ 1 11 Q 3 2 Q2 3 Q2 Q2
+ + ζ(3) + ln 2 + ln − ln ln ,
Q2 24 8 m̄ 4 m̄2 2 m̄2 µ2
(B.3)
158 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
(2) 2
c0 µ , Q2 , m̄
9 143 37 1 Q2
=− − − ζ (3) + 10ζ(5) + ln 2
4 72 6 8 µ
2
4m̄ 1667 5 35 51 Q2 9 2 Q2
− 2 − ζ(3) − ζ(5) − ln 2 + ln 2 , (B.4)
Q 96 12 6 8 µ 4 µ
c0(2) µ2 , Q2 , m̄
9 44215 227 5 41 Q2 11 2 Q2 11 Q2
=− − ζ (3) − ζ(5) − ln 2 + ln 2 + ζ(3) ln 2
4 2592 18 3 8 µ 24 µ 3 µ
2 2 2
4m̄ 1447 4 85 185 Q 11 2 Q
− 2 + ζ(3) − ζ(5) − ln 2 + ln 2 , (B.5)
Q 96 3 12 24 µ 8 µ
where ζ (3) = 1.20206 · · · and ζ (5) = 1.03693 · · ·, and m̄ refers to the quark mass at the
scale µ in the MS scheme. In (B.2), (B.3) terms of O((m̄2 /Q2 )3 ) modulo logarithms have
been neglected, while in (B.4), (B.5) terms O((m̄2 /Q2 )2 ) are dropped.
The Wilson coefficients multiplying the quark and gluon condensate are [6,8]
2 αs (µ2 )
c4F (µ, q) = −12π 2 2+ + ··· ,
3 π
11 αs (µ2 )
c4G (µ, q) = −π 2 1 − + ··· . (B.6)
18 π
Note that both mq̄q and (αs /π)G2µν are renormalization group invariants, which means
that they do not depend on µ and the renormalization scheme. The vacuum polarization
Π(Q2 ) itself is not an observable, but its derivatives are. Therefore the result (7) can only
depend on µ and the scheme in terms of an integration constant (independent of Q2 ).
αs (µ2 ) αs (µ2 ) 2 365 11 Q2
D Q 2
=3 1+ + − 11ζ(3) − ln 2 . (B.7)
π π 24 4 µ
We should be able to do better than this because the massless Adler function is known to
four loops [3]. We can use this perturbative result for the Adler function to improve the
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 159
αs (Q2 ) αs (Q2 ) 2 365
R(s) = 3 ef2 1 + + − 11ζ(3)
π π 24
f
αs (Q2 ) 3 87029 1103 275 121 2
+ − ζ(3) + ζ(5) − π
π 288 4 6 48
2
αs (Q2 ) 3 55 5
+ ef − ζ(3) + O αs4 (B.8)
π 72 3
f
and
αs (Q2 )
αs (Q2 ) 2 365
D Q2 = 3 ef2 1 + + − 11ζ(3)
π π 24
f
αs (Q2 ) 3 87029 1103 275
+ − ζ(3) + ζ(5)
π 288 4 6
2
αs (Q2 ) 3
55 5
+ ef − ζ(3) + O αs4 . (B.9)
π 72 3
f
Note that although the first terms of R and D coincide, the αs3 term is different. The extra
“π 2 ” term in R arises from analytic continuation,
2 ln3 (Q2 /µ2 ) → [ln(s/µ2 )±iπ]3 , see [3].
The first part of (B.9), proportional to f ef , is the derivative of the CΠ term in (6), while
the second term, proportional to ( f ef )2 , comes from the derivative of the AΠ term.
From (B.9) we have a differential equation for c0 :
∂
Q2 2
c0 µ2 , Q2 , m = 0
∂Q
αs (Q2 ) αs (Q2 ) 2 365
=3 1+ + − 11ζ(3)
π π 24
3
2
αs (Q ) 87029 1103 275 4
+ − ζ(3) + ζ(5) + O αs . (B.10)
π 288 4 6
We can solve this by using the known β-function [30]
αs (Q2 ) i+2
∂ αs (Q2 )
Q2
=− βi (B.11)
∂Q2 π π
i
with
11 51 2857 149753 891
β0 = , β1 = , β2 = , β3 = + ζ(3) (B.12)
4 8 128 1536 64
in the case of quenched SU(3).
160 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Table 5
The vacuum polarization CΠ (q̂ 2 , am) on the 164 lattice at β = 6.0
n κ = 0.1333 κ = 0.1339 κ = 0.1342 κ = 0.1345
(1, 0, 0, 0) −38.15(11) −38.60(15) −38.84(14) −39.03(16)
(1, 1, 0, 0) −36.59(10) −36.89(13) −37.05(12) −37.17(13)
(1, 1, 1, 0) −35.52(09) −35.75(12) −35.89(11) −35.99(12)
(1, 1, 1, 1) −34.72(09) −34.91(12) −35.04(11) −35.12(12)
(2, 1, 1, 1) −32.89(08) −33.06(10) −33.15(09) −33.24(10)
(2, 2, 1, 1) −31.82(07) −31.98(10) −32.07(09) −32.15(10)
(2, 2, 2, 1) −31.07(07) −31.22(09) −31.30(08) −31.40(09)
(2, 2, 2, 2) −30.47(07) −30.62(09) −30.71(08) −30.81(09)
(3, 2, 2, 2) −29.61(06) −29.76(09) −29.85(08) −29.95(09)
(3, 3, 2, 2) −28.98(06) −29.13(08) −29.22(08) −29.32(08)
(3, 3, 3, 2) −28.47(06) −28.63(08) −28.72(07) −28.82(08)
(3, 3, 3, 3) −28.05(06) −28.20(08) −28.29(07) −28.39(08)
(4, 3, 3, 3) −27.54(05) −27.70(08) −27.79(07) −27.89(08)
(4, 4, 3, 3) −27.12(05) −27.28(07) −27.37(07) −27.47(08)
(4, 4, 4, 3) −26.76(05) −26.91(07) −27.00(07) −27.11(07)
(4, 4, 4, 4) −26.42(05) −26.58(07) −26.67(07) −26.78(07)
(5, 4, 4, 4) −26.11(05) −26.26(07) −26.36(06) −26.46(07)
(5, 5, 4, 4) −25.82(05) −25.97(07) −26.07(06) −26.17(07)
(5, 5, 5, 4) −25.55(05) −25.70(07) −25.80(06) −25.90(07)
(5, 5, 5, 5) −25.29(05) −25.44(07) −25.54(06) −25.64(07)
(6, 5, 5, 5) −25.09(05) −25.25(06) −25.34(06) −25.45(07)
(6, 6, 5, 5) −24.90(04) −25.05(06) −25.15(06) −25.25(06)
(6, 6, 6, 5) −24.71(04) −24.86(06) −24.96(06) −25.06(06)
(6, 6, 6, 6) −24.52(04) −24.67(06) −24.77(06) −24.87(06)
(7, 6, 6, 6) −24.42(04) −24.57(06) −24.67(05) −24.77(06)
(7, 7, 6, 6) −24.31(04) −24.46(06) −24.56(05) −24.66(06)
(7, 7, 7, 6) −24.19(04) −24.34(06) −24.44(05) −24.54(06)
(7, 7, 7, 7) −24.07(04) −24.22(06) −24.32(05) −24.42(06)
The solution is
c0 µ2 , Q2 , 0
12 2 3403 αs (Q2 )
= 3 ln Q2 − ln αs Q + − + 12ζ(3)
11 242 π
2
2301587 273 2
αs (Q )
+ − + ζ (3) − 25ζ(5) + O αs3 + const. (B.13)
15972 2 π
The constant of integration can be fixed by comparing with (B.1)–(B.5), giving the final
renormalization group improved result2
2 2 Q2 12 αs (Q2 ) 3403 αs (Q2 )
c0 µ , Q , 0 = 3 ln 2 − ln + − + 12ζ(3)
µ 11 αs (µ2 ) 242 π
2 Note the interesting result that due to a cancellation there is no ζ (3) term in the coefficient of α (µ2 ) and no
s
ζ (5) in the coefficient of αs (µ2 )2 . This surprised us.
QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164 161
Table 6 Table 7
The vacuum polarization CΠ (q̂ 2 , am) The vacuum polarization CΠ (q̂ 2 , am) on the 244 lattice at
on the 324 lattice at β = 6.0 β = 6.2
n κ = 0.1345 n κ = 0.1344 κ = 0.1349 κ = 0.1352
(1, 0, 0, 0) −41.64(22) (1, 0, 0, 0) −40.24(14) −40.87(17) −41.16(16)
(1, 1, 0, 0) −40.67(19) (1, 1, 0, 0) −38.78(12) −39.17(14) −39.35(12)
(1, 1, 1, 0) −39.85(17) (1, 1, 1, 0) −37.74(11) −38.03(13) −38.17(11)
(1, 1, 1, 1) −39.16(16) (1, 1, 1, 1) −36.96(10) −37.19(12) −37.30(11)
(2, 1, 1, 1) −37.58(14) (2, 1, 1, 1) −35.23(09) −35.40(10) −35.47(09)
(2, 2, 1, 1) −36.51(13) (2, 2, 1, 1) −34.16(08) −34.31(09) −34.38(08)
(2, 2, 2, 1) −35.72(13) (2, 2, 2, 1) −33.39(07) −33.54(08) −33.60(07)
(2, 2, 2, 2) −35.09(12) (2, 2, 2, 2) −32.79(07) −32.94(08) −33.00(07)
(3, 2, 2, 2) −34.20(12) (3, 2, 2, 2) −31.93(06) −32.08(07) −32.13(07)
(3, 3, 2, 2) −33.52(11) (3, 3, 2, 2) −31.29(06) −31.44(07) −31.49(06)
(3, 3, 3, 2) −32.99(11) (3, 3, 3, 2) −30.78(06) −30.94(07) −30.99(06)
(3, 3, 3, 3) −32.54(11) (3, 3, 3, 3) −30.35(06) −30.52(06) −30.57(06)
(4, 3, 3, 3) −31.97(11) (4, 3, 3, 3) −29.81(06) −29.98(06) −30.03(06)
(4, 4, 3, 3) −31.50(10) (4, 4, 3, 3) −29.37(06) −29.54(06) −29.59(06)
(4, 4, 4, 3) −31.11(10) (4, 4, 4, 3) −28.99(05) −29.17(06) −29.22(06)
(4, 4, 4, 4) −30.77(10) (4, 4, 4, 4) −28.67(05) −28.84(06) −28.90(05)
(5, 4, 4, 4) −30.36(10) (5, 4, 4, 4) −28.29(05) −28.46(06) −28.52(05)
(5, 5, 4, 4) −30.01(10) (5, 5, 4, 4) −27.96(05) −28.14(05) −28.20(05)
(5, 5, 5, 4) −29.70(10) (5, 5, 5, 4) −27.67(05) −27.85(05) −27.91(05)
(5, 5, 5, 5) −29.43(9) (5, 5, 5, 5) −27.41(05) −27.58(05) −27.64(05)
(6, 5, 5, 5) −29.11(9) (6, 5, 5, 5) −27.12(05) −27.30(05) −27.37(05)
(6, 6, 5, 5) −28.84(9) (6, 6, 5, 5) −26.87(05) −27.05(05) −27.11(05)
(6, 6, 6, 5) −28.59(9) (6, 6, 6, 5) −26.64(05) −26.82(05) −26.89(05)
(6, 6, 6, 6) −28.36(9) (6, 6, 6, 6) −26.42(05) −26.60(05) −26.66(05)
(7, 6, 6, 6) −28.11(9) (7, 6, 6, 6) −26.21(05) −26.39(05) −26.45(05)
(7, 7, 6, 6) −27.88(9) (7, 7, 6, 6) −26.01(05) −26.19(05) −26.26(05)
(7, 7, 7, 6) −27.68(9) (7, 7, 7, 6) −25.82(04) −26.00(05) −26.07(04)
(7, 7, 7, 7) −27.48(9) (7, 7, 7, 7) −25.64(04) −25.82(05) −25.89(04)
2301587 273 αs (Q2 ) 2
+ − + ζ(3) − 25ζ(5)
15972 2 π
2
151 αs (µ ) 566749 5 αs (µ2 ) 2
−5+ + − + ζ(3)
484 π 383328 3 π
3 2 3 2
+ O αs Q , αs µ (B.14)
where µ is the MS scale.
One can check that the differences between (B.14) and (B.1) are indeed O(αs3 ) by
making the substitution
αs (Q2 ) αs (µ2 ) αs (µ2 ) 2 Q2 αs (µ2 ) 3 Q2
→ − β0 ln 2 − β1 ln 2
π π π µ π µ
3
2
αs (µ ) Q 2
+ β02 ln2 2 + O αs4 (B.15)
π µ
in (B.14).
162 QCDSF Collaboration / Nuclear Physics B 688 (2004) 135–164
Table 8
The vacuum polarization CΠ (q̂ 2 , am) on the 324 lattice at
β = 6.4
n κ = 0.1346 κ = 0.1350 κ = 0.1352
(1, 0, 0, 0) −41.88(16) −42.52(22) −42.84(18)
(1, 1, 0, 0) −40.38(15) −40.77(19) −40.97(15)
(1, 1, 1, 0) −39.33(14) −39.61(17) −39.75(14)
(1, 1, 1, 1) −38.53(13) −38.74(17) −38.86(14)
(2, 1, 1, 1) −36.82(11) −36.96(13) −37.02(11)
(2, 2, 1, 1) −35.73(10) −35.84(11) −35.89(10)
(2, 2, 2, 1) −34.93(09) −35.04(10) −35.09(09)
(2, 2, 2, 2) −34.31(08) −34.42(10) −34.46(08)
(3, 2, 2, 2) −33.44(08) −33.55(08) −33.58(08)
(3, 3, 2, 2) −32.79(07) −32.89(08) −32.93(07)
(3, 3, 3, 2) −32.27(07) −32.37(07) −32.41(07)
(3, 3, 3, 3) −31.83(06) −31.94(07) −31.98(06)
(4, 3, 3, 3) −31.28(06) −31.39(06) −31.42(06)
(4, 4, 3, 3) −30.83(06) −30.94(06) −30.98(06)
(4, 4, 4, 3) −30.45(05) −30.56(06) −30.60(05)
(4, 4, 4, 4) −30.12(05) −30.23(06) −30.28(05)
(5, 4, 4, 4) −29.73(05) −29.84(05) −29.88(05)
(5, 5, 4, 4) −29.39(05) −29.51(05) −29.55(05)
(5, 5, 5, 4) −29.09(05) −29.21(05) −29.25(05)
(5, 5, 5, 5) −28.83(04) −28.95(05) −28.99(05)
(6, 5, 5, 5) −28.53(04) −28.65(05) −28.69(04)
(6, 6, 5, 5) −28.26(04) −28.39(05) −28.43(04)
(6, 6, 6, 5) −28.02(04) −28.15(04) −28.19(04)
(6, 6, 6, 6) −27.80(04) −27.93(04) −27.97(04)
(7, 6, 6, 6) −27.56(04) −27.70(04) −27.74(04)
(7, 7, 6, 6) −27.35(04) −27.48(04) −27.52(04)
(7, 7, 7, 6) −27.15(04) −27.28(04) −27.32(04)
(7, 7, 7, 7) −26.96(04) −27.10(04) −27.14(04)
What have we gained by using the renormalization group? Most importantly, the Q2
derivative of (B.14) is correct to four loops, one order better than (B.1). The uncertainties
in (B.14) are O(αs3 (Q2 ), αs3 (µ2 )) while in the original formula (B.1) terms of order
O(αs3 (µ2 ) ln3 (Q2 /µ2 )) have been neglected. So, if we are interested in describing a large
range of Q2 , Eq. (B.14) ought to be the better formula. In addition, (B.14) exhibits correct
physics, because it is the sum of a piece depending only on Q2 and a piece depending only
on µ2 , which is only approximately the case for (B.1).
Appendix C
In Tables 5–8 we give the results for CΠ (q̂, am) at the various β and κ values. We label
the lattice momenta by the vector n, q̂µ = (2/a) sin(πnµ /L). The momenta were chosen
close to the diagonal of the Brillouin zone to avoid large O(a 2 ) effects.
References
[19] S. Capitani, M. Lüscher, R. Sommer, H. Wittig, Nucl. Phys. B 544 (1999) 669;
P. Boucaud, G. Burgio, F. Di Renzo, J.P. Leroy, J. Micheli, C. Parrinello, O. Pène, C. Pittori, J. Rodriguez-
Quintero, C. Roiesnel, K. Sharkey, JHEP 0004 (2000) 006.
[20] M. Guagnelli, R. Sommer, H. Wittig, Nucl. Phys. B 535 (1998) 389.
[21] O. Dumbrajs, R. Koch, H. Pilkuhn, G.C. Oades, H. Behrens, J.J. De Swart, P. Kroll, Nucl. Phys. B 216
(1983) 277.
[22] Particle Data Group, K. Hagiwara, et al., Phys. Rev. D 66 (2002) 010001.
[23] R.A. Bertlmann, G. Launer, E. de Rafael, Nucl. Phys. B 250 (1985) 61.
[24] T. Blum, Phys. Rev. Lett. 91 (2003) 052001;
T. Blum, hep-lat/0310064.
[25] C. Bouchiat, L. Michel, J. Phys. Radium 22 (1961) 121.
[26] A. Nyffeler, hep-ph/0305135.
[27] B.E. Lautrup, A. Peterman, E. de Rafael, Phys. Rep. 3 (1972) 193.
[28] S. Eidelman, F. Jegerlehner, Z. Phys. C 67 (1995) 585.
[29] F. Jegerlehner, J. Phys. G 29 (2003) 101;
K. Hagiwara, A.D. Martin, D. Nomura, T. Teubner, Phys. Lett. B 557 (2003) 69;
K. Hagiwara, A.D. Martin, D. Nomura, T. Teubner, hep-ph/0312250;
M. Davier, S. Eidelman, A. Hocker, Z. Zhang, Eur. Phys. J. C 27 (2003) 497.
[30] T. van Ritbergen, J.A.M. Vermaseren, S.A. Larin, Phys. Lett. B 400 (1997) 379;
J.A.M. Vermaseren, S.A. Larin, T. van Ritbergen, Phys. Lett. B 405 (1997) 327.
Nuclear Physics B 688 (2004) 165–188
www.elsevier.com/locate/npe
Received 2 December 2003; received in revised form 27 February 2004; accepted 6 March 2004
Abstract
The 4-loop sunrise graph with two massless lines, two lines of equal mass M and a line of mass m,
for external invariant timelike and equal to m2 is considered. We write differential equations in
x = m/M for the Master Integrals of the problem, which we Laurent-expand in the regularizing
continuous dimension d around d = 4, and then solve exactly in x up to order (d − 4)3 included; the
result is expressed in terms of harmonic polylogarithms of argument x and maximum weight 7. As a
by product, we obtain the x = 1 value, expected to be relevant in QED 4-loop static quantities like the
electron (g − 2). The analytic results were checked by an independent precise numerical calculation.
2004 Published by Elsevier B.V.
1. Introduction
This paper is devoted to the analytic evaluation of the master integrals (MIs) associated
to the 4-loop sunrise graph with two massless lines, two massive lines of equal mass M,
another massive line of mass m, with m = M, and the external invariant timelike and equal
to m2 , as depicted in Fig. 1.
We will follow for the analytic integration the differential equation method already
proposed in [1], further developed in [2] (and then used in [3,4] and in many subsequent
applications which would be too long to report here), as well as the finite difference
method [5,6] for an independent numerical check.
The differential equation method was already followed in two similar two- and three-
loop calculations [7,8]; the fact that its use could be extended without major changes to the
present four-loop calculation witnesses for its generality and power. Among the advantages
of the method, it allows a rather clear separation of the merely algebraic part of the work
(which is, not surprisingly, always very heavy in this kind of multiloop calculations, and
can be most conveniently processed by a computer algebra program, in our case FORM [9]),
from the really analytic issues of the problem, which can then be better investigated without
the disturbance of the algebraic complexity. In our case, indeed, the heart of the analytic
calculation was the study of a homogeneous fourth order differential equation, whose
solutions turned out to be, in a remarkably simple way, either a rational fraction or repeated
quadratures of rational fractions. The required four-loop integral could then be obtained
almost mechanically by repeated quadratures in terms of harmonic polylogarithms [10].
Several other different approaches to the analytic evaluation of multiloop integrals are
available in the literature, such as the powerful asymptotic expansion method (a fairly
complete account can be found in the recent book [11]), and it would be interesting to
compare the advantages and drawbacks of the various methods for the exacting, four-loop
integration which we consider; but as the results of the present paper are new, a meaningful
comparison cannot yet be carried out.
Following, as already said above, the approach already used in [7,8], we identify the
MIs of the current problem within the continuous d-dimensional regularization, write the
system of differential equations in x = m/M satisfied by the MIs, convert it into a higher
order differential equation for a single MI, Laurent-expand in (d − 4) around d = 4, solve
the associated homogeneous equation at d = 4 (as in previous cases, the solutions of the
homogeneous equation are surprisingly simple) and then use recursively Euler’s method of
the variation of the constants for obtaining the coefficient of the (d − 4) expansion in closed
analytic form. The result involves harmonic polylogarithms (HPLs) [10] of argument x and
weight increasing with the order in d − 4. We push the analytic calculation, which works
up to virtually any order in d − 4, up to (d − 4)3 included, involving HPLs of weight up
to w = 7 included. The integration constants are fixed at x = 0; as a by product we obtain
the values at x = 1, which are relevant in the evaluation of 4-loop static quantities such
as the electron (g − 2) in QED. The result was checked and confirmed by an independent
numerical calculation performed with the method of [5,6].
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 167
The plan of the paper is the following. In Section 2 we define the master integrals and
write the differential equations; in Section 3 we study the x → 0 behaviour; in Section 4 we
work out the Laurent-expansion in d −4 and discuss the associated homogeneous equation;
in Section 5 we write down the solutions by Euler’s method of the variation of the constants
and give the x = 1 values; Section 6 deals with the independent numerical calculation by
which we check the results at x = 1 of the previous section. in Section 7 we carry out the
evaluation of the x → 0 values by direct integration in the parametric space.
Fi (d, M 2 , m2 , P 2 = −m2 )
1 d d k1 d d k2 d d k3 d d k4 Ni
= ,
(2π)4(d−2) k12 k22 (k32 + M 2 )(k42 + M 2 )[(P − k1 − k2 − k3 − k4 )2 + m2 ]
(2.1)
where the 5 numerators Ni are M 2 , k1 · k3 , p · k3 , k1 · k2 , p · k2 . In terms of the
dimensionless variable x = m/M and putting P = Mp one can introduce 5 dimensionless
functions Φi (d, x) through
dΦ3 (d, x) 3(d − 2) 3(d − 2)
=− − Φ1 (d, x) − 3Φ2 (d, x) − Φ4 (d, x)
dx 2(1 − x) 2(1 + x)
3(d − 2) 9(d − 2) 9(d − 2)
− + − Φ3 (d, x) + Φ5 (d, x) , (2.5)
x 2(1 − x) 2(1 + x)
dΦ4 (d, x) 2(d − 2)
= Φ2 (d, x) + Φ4 (d, x) , (2.6)
dx x
dΦ5 (d, x) 2(d − 2)
= Φ3 (d, x) + Φ5 (d, x) . (2.7)
dx x
At variance with the cases discussed in [7,8], the system is homogeneous; indeed, quite
in general the non homogeneous terms are given by the MIs of the “subtopologies” of the
considered graph, obtained by shrinking to a point any of its propagator lines. When that
is done for the 5-propagator “topology” of Fig. 1, one obtains the product of 4 tadpoles;
but as the considered graph has two massless propagators, at least one massless tadpole
is always present in the product; as in the d-dimensional regularization massless tadpoles
vanish, the product of the 4 tadpoles is always equal to zero—and therefore the differential
equations are homogeneous.
By inspection, one sees that Φ3 (d, x), Φ5 (d, x) appear in the r.h.s. of Eqs. (2.3)–(2.7)
only in the combination
d 4 Φ(d, x) 3
2 d Φ(d, x)
x 3 (1 − x 2 ) + x 2
1 + 5x 2
− 3(d − 4)(1 − 3x )
dx 4 dx 3
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 169
d 2 Φ(d, x)
− x 12 + 6x 2 + (d − 4)(13 + 32x 2) + (d − 4)2 (1 + 26x 2)
dx 2
+ 12 − 18x + (d − 4)(25 − 2x ) + 8(d − 4) (2 + 5x )
2 2 2 2
dΦ(d, x)
+ 3(d − 4)3 (1 + 8x 2 )
dx
+ 4x +12 + 29(d − 4) + 23(d − 4)2 + 6(d − 4)3 Φ(d, x) = 0. (2.12)
The expression of the other MIs in terms of Φ(d, x) and its first 3 x-derivatives reads
1 1 x2 2x 2 7x 2
Φ2 (d, x) = − + − + Φ(d, x)
2 6(d − 2) 5(d − 2)3 3(d − 2)2 15(d − 2)
x(1 − 7x 2 ) x(1 + 16x 2) x(11 + 28x 2 ) dΦ(d, x)
− + −
10(d − 2)3 12(d − 2)2 60(d − 2) dx
2 2
x (1 − 2x ) x (1 − 10x ) d Φ(d, x)
2 2 2
− −
5(d − 2)3 30(d − 2)2 dx 2
x 3 (1 − x 2 ) d 3 Φ(d, x)
− , (2.13)
20(d − 2)3 dx 3
x2 1x 2 3x 2
Φ3 (d, x) + Φ5 (d, x) = − − + Φ(d, x)
15(d − 2)3 3(d − 2)2 5(d − 2)
x(1 − 7x 2 ) x(1 − 8x 2 ) x(1 − 12x 2) dΦ(d, x)
+ − +
30(d − 2)3 12(d − 2)2 20(d − 2) dx
2 2
x (1 − 2x ) x (2 − 5x ) d Φ(d, x)
2 2 2
+ −
15(d − 2)3 30(d − 2)2 dx 2
x 3 (1 − x 2 ) d 3 Φ(d, x)
− , (2.14)
60(d − 2)3 dx 3
1 1 4x 2 3x 2 43x 2
Φ4 (d, x) = − − + − + Φ(d, x)
2 6(d − 2) 5(d − 2)3 (d − 2)2 15(d − 2)
2x x(14 − 30x 2) x(11 + 43x 2) dΦ(d, x)
+ − −
5(d − 2)3 5(d − 2)2 15(d − 2) dx
2 2
4x (1 − 2x ) 3x (1 − 5x ) d Φ(d, x)
2 2 2
+ −
5(d − 2)3 10(d − 2)2 dx 2
x 3 (1 − x 2 ) d 3 Φ(d, x)
+ . (2.15)
5(d − 2)3 dx 3
By inspection, one finds that the most general solution of Eq. (2.12) can be expanded
for x → 0 in the form
170 S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188
4
Φ(d, x) = x αi
A(i)
n (d)x
2n
, (3.1)
i=1 n=0
α1 = 0,
α2 = (d − 2),
α3 = −(d − 2),
α4 = (3d − 7); (3.2)
the A(i) (i)
0 (d) are the 4 integration constants, and all the other coefficients An (d) for n > 0
are determined by the differential equation Eq. (2.12) once the integration constants are
fixed.
It is interesting to recall the leading exponents of the x → 0 expansions of the solutions
(2)
of the corresponding equations encountered in [7,8]. Calling αi the exponents for the 2-
(3)
loop graph of [7] and αi those of the 3-loop graph of [8], one finds that there are always
4 exponents. The explicit values in the case of [7] are
(2)
• α1 = 0,
(2)
• α2 = (d − 2),
α3(2) = −(d − 2),
α4(2) = (d − 3) (3.3)
and in the case of [8]
(3)
α1 = 0,
• α2(3) = (d − 2),
α3(3) = −(d − 2),
(3)
α4 = (2d − 5), (3.4)
where the exponents marked by a bullet (•) correspond to the behaviours forced by the
inhomogeneous terms. The similarity between the 3 sets of behaviours is impressive: the
first 3 exponents are identical, the fourth differ in steps of d − 2 for each additional loop.
In more details, 2 of the exponents of Eqs. (3.3) correspond to the 2 independent
solutions of the associated homogeneous differential equation, which is of 2nd order,
while the other 2 are forced by the 2 independent behaviours for x → 0 developed by
the inhomogeneous terms (both products of 2 tadpoles, the first product of two tadpoles
of mass M, the second of a tadpole of mass M and a tadpole of mass m = Mx). In the
x → 0 expansion of the most general solution, one is therefore left with 2 undetermined
integration constants, corresponding to the two homogeneous solutions, while all the
terms with the behaviours of the inhomogeneous terms are fully determined by the
inhomogeneous equation itself. Similarly, in Eqs. (3.4) 3 exponents correspond to the 3
solutions of the 3rd order homogeneous equation, with the 4th exponent forced by the
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 171
substituting the ansatz Eq. (3.1) in Eq. (2.12) and dropping A(3) (4)
0 (d), A0 (d), one finds for
Φ(d, x) Eq. (2.11) the x → 0 expansion
(1) 2(2d − 5)(3d − 8) 2
Φ(d, x) = A0 (d) 1 − x + O(x ) 4
3d(d − 4)
(2) (d − 3)(d − 4)(3d − 8) 2
+ A0 (d)x d−2
1+ x + O(x ) .
4
(3.5)
2d(2d − 7)
(1)
The expansion depends on the two as yet unspecified integrations constants A0 (d),
A(2)
0 (d)—which are to be provided by an independent, explicit calculation. That is done in
Section 7, see Eq. (7.14), by direct integration in the parametric space.
Let us note here, for completeness, that in the present case the knowledge of the
regularity of the solution at x = 1 does not provide any additional information.
φ1 (x) = x 2 . (4.5)
We then substitute φ(x) = φ1 (x)ξ(x) in Eq. (4.4), obtaining the following 3rd order
equation for the derivative of ξ(x)
d3 d2
x 3 (1 − x 2 ) 3 + 3x 2 (3 − x 2 ) 2
dx dx
2 d
+ 6x(1 + 2x ) − 6(5 + 2x ) ξ (x) = 0,
2
(4.6)
dx
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 173
in agreement (of course) with the coefficients of the 4t h and 3rd x-derivative of φ(x) in
Eq. (4.4).
5. The solution of the differential equations for the coefficients of the expansion
in d − 4
With the results established in the previous section one can use Euler’s method of the
variation of the constants for solving Eq. (4.2) recursively in n, starting from n = −4. We
write Euler’s formula as
x
4
dx (n)
Φ (n)
(x) = φi (x) Φi(n) + Mi (x )K (x ) , (5.1)
W (x )
i=1 0
where the φi (x) are the solutions of the homogeneous equation given in Eqs. (4.5), (4.14),
(n)
the Φi are the as yet undetermined integration constants, the Wronskian W (x) can be
read from Eq. (4.15), the Mi (x) are the minors of the φi (x) in the determinant Eq. (4.15),
and the K (n) (x) are the inhomogeneous terms of Eq. (4.3). The constants Φi(n) are then
fixed by comparing the expansion in x for x → 0 of Eq. (5.1) with the expansion in d − 4
for d → 4 of Eq. (3.5). Explicitly, we find, for instance,
1 2
Φ (−4) (x) = − x , (5.2)
64
1 9 2 1 4 1
Φ (−3) (x) = − + x − x − x 2 H (0; x), (5.3)
384 256 192 48
41 143 2 37 4 5 6
Φ (−2) (x) = − x + x − x
4608
3072 2304 9216
3 1 1
+ − x 2 x 2 H (0; x) − x 2 H (0, 0; x), (5.4)
64 96 48
805 205 2 743 4 599 6
Φ (−1) (x) = − + x − x + x
55296
4096 27648 221184
43 11 2 5 4 2
− − x + x x H (0; x)
768 384 2304
3 1 1
+ − x 2 x 2 H (0, 0; x) − x 2 H (0, 0, 0; x), (5.5)
64 96 48
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 175
3173 1 2443 2 1
Φ (0) (x) = − π 4x2 − x − x 4 ζ(3)
663552 3840 49152 96
15649 4 34061 6 1
+ x − x + ζ(3)
331776
5308416 96
5 1 1 121 4 599 6
− − ζ (3)x + 2
x +
2
x − x H (0; x)
1536 48 768 2304 55296
1 5 1 7 2 1 5 6
+ + − x + x4 + x
96 1536 x 2 256 96 1536
× H (1, 0; x) − H (−1, 0; x)
43 2 11 4 5 6
− x − x + x H (0, 0; x)
768 384 2304
(1 − x 4 )
+ H (0, 1, 0; x) − H (0, −1, 0; x)
32
1 2
+ x H (0, 0, 1, 0; x) − H (0, 0, −1, 0; x)
16
3 1 2 2 1
+ − x x H (0, 0, 0; x) − x 2 H (0, 0, 0, 0; x). (5.6)
64 96 48
The full results become quickly too lengthy to be reported explicitly here, so we give only
(k)
the values of the integration constants up to order 3 included. We find Φ4 = 0, for (all)
−4 k 3; the other constants are:
As a byproduct, using the results of [12], we obtain the values at x = 1, i.e., the on
mass-shell values at −p2 = m2 = M 2 of Eq. (2.1), depicted in Fig. 2,
1 7 17
Φ(d, x = 1) = − + −
64(d − 4)4 256(d − 4)3 768(d − 4)2
835 7379 1
+ − − π4
73728(d − 4) 1769472 1920
6766055 53 2 7
+ (d − 4) − + π + π4
42467328 1152 7680
7 2 29
− π ζ(3) + ζ(5)
256 256
2 1449210865 1913 2 53 2 1855
+ (d − 4) − π + π ln 2 − ζ(3)
1019215872 4608 128 2304
17 49 2 203
− π4 + π ζ(3) − ζ(5)
23040 1024 1024
63 2 9 3 2 4
− π ζ(3) ln 2 − a4 π 2 − π ln 2
256 32 256
3 4 2 245 2 661
+ π ln 2 + ζ (3) + π6
256 1024 322560
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 177
182188906799 734695 2 1913 2
+ (d − 4)3 − + π − π ln 2
24461180928 331776 512
66955 583 2 2 8707 4 265 4
+ ζ(3) + π ln 2 − π + ln 2
9216 384 147456 768
265 119 2 493
+ a4 − π ζ(3) + ζ(5)
32 3072 3072
441 2 21 2 4 21 4 2
+ π ζ(3) ln 2 + π ln 2 − π ln 2
1024 1024 1024
661 63 1715 2
− π6 + a4 π 2 − ζ (3)
184320 128 4096
627 2 123 2 5 9917 2
− π ζ(3) ln2 2 − π ln 2 + π ζ(5)
512 1280 4096
107 4 3 109 4 2205 2
+ π ln 2 − π ζ(3) + ζ (3) ln 2
1536 12288 1024
45 15 6 81
+ ζ(3) ln4 2 − π ln 2 − a4 π 2 ln 2
128 2048 32
45 81 45 1395
+ a4 ζ(3) − a5 π 2 − a5 ln2 2 + ζ(5) ln2 2
16 32 16 1024
45 45 135 45 45
− a6 ln 2 + b6 ln 2 − a7 + b7 − d7
4 16 8 8 32
15 29335
+ ln7 2 + ζ(7) + O (d − 4)4 . (5.16)
1792 2048
At variance with [12], we have expressed the results in terms of the constants listed in
Table 1; the first column is the name of the constant, the second column its value as
harmonic polylogarithm of suitable argument, the third as Nielsen polylogarithm (when
available), the last the numerical value.
Once the explicit analytic expressions of the coefficients of the Laurent-expansion of
Φ(d, x) in (d − 4) are known, one can use Eqs. (2.13)–(2.15) for obtaining the coefficients
of Φ2 (d, x), Ψ3 (d, x) and Φ4 (d, x). The coefficients of the expansion of Ψ5 (d, x) are then
Table 1
Constants up to weight 7 appearing in Eq. (5.16)
Constant HPL NPl Numerical value
ζ (3) H (0, 0, 1; 1) S2,1 (1) 1.2020569031595942854
a4 H (0, 0, 0, 1; 1/2) S3,1 (1/2) 0.51747906167389938633
ζ (5) H (0, 0, 0, 0, 1; 1) S4,1 (1) 1.0369277551433699263
a5 H (0, 0, 0, 0, 1; 1/2) S4,1 (1/2) 0.50840057924226870746
a6 H (0, 0, 0, 0, 0, 1; 1/2) S5,1 (1/2) 0.50409539780398855069
b6 H (0, 0, 0, 0, 1, 1; 1/2) S4,2 (1/2) 0.0087230030575968884272
ζ (7) H (0, 0, 0, 0, 0, 0, 1; 1) S6,1 (1) 1.0083492773819228268
a7 H (0, 0, 0, 0, 0, 0, 1; 1/2) S6,1 (1/2) 0.50201456332470849457
b7 H (0, 0, 0, 0, 0, 1, 1; 1/2) S5,2 (1/2) 0.0041965726953603256975
d7 H (0, 0, 0, 0, 1, −1, −1; 1) – 0.0022500546439578516764
178 S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188
recovered by integrating in x Eq. (2.10); the quadrature is trivial to carry out in terms of
HPLs, and the integration constants are fixed by the condition Ψ5 (d, 0) = 0, which follows
at once from Eqs. (2.9), (2.2), (2.1).
In this section we will calculate Φ(d, x = 1) with high numerical precision by suitably
using the difference equation method described in [5,6] and references therein. At variance
with [6], we will apply the method directly to the Master Integral with massless lines. This
will imply some additional work for establishing the initial conditions of the difference
equations.
We define
−2d d d k1 d d k2 d d k3 d d k4
I5 (d, n) = π ,
(k12 + 1)n ((k2 − k1 )2 + 1)((k3 − k2 )2 + 1)(k4 − k3 )2 (p − k4 )2
p2 = −1, (6.1)
so that I5 (d, 1) is equal to Φ(d, x) of Eq. (2.11) at x = 1 up to a known multiplicative
factor
4
I5 (d, 1) = 4(1 + ) Φ(d = 4 − 2, x = 1). (6.2)
By combining identities obtained by integration by parts one finds that I5 (d, n) satisfies
the third order difference equation
1
I5 (d, n) = t n−1 v5 (d, t) dt, (6.4)
0
d4
4t 4 (8t + 1)(t − 1)2 v5 (d, t)
dt 4
d3
+ 4t 3 (t − 1) 24(d + 1)t 2 + (18 − 26d)t − 7d + 12 3 v5 (d, t)
dt
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 179
+ t 2 576(d − 1)t 3 + (−108d 2 − 420d + 648)t 2 + (46d 2 + 38d − 144)t
d2
+ 71d 2 − 284d + 288 v5 (d, t)
dt 2
+ t (576d − 960)t 3 + (−216d 2 + 360d)t 2 + (−18d 3 + 190d 2 − 496d + 384)t
d
+ 77d 3 − 533d 2 + 1236d − 960 v5 (d, t)
dt
+ (d − 3)(2d − 5)(3d − 8)(5d − 12)v5 (d, t) = 0. (6.5)
We will look for the solution of Eq. (6.5) in the form of a power series expansions;
inserted in Eq. (6.4) and integrated term by term it will provide very accurate values of
I5 (d, n). As the convergence is faster for larger n, we will consider “large enough” values
of the index n (see below); the repeated use “top–down” of Eq. (6.3) (i.e., using it for
expressing I5 (d, n − 3) in terms of the I5 (k) with k = n, n − 1, n − 2) will give the values
corresponding to smaller indices, till I5 (d, 1) is eventually obtained. To go on with this
program, initial conditions for v5 (d, t) are needed.
From the definition Eq. (6.1), and introducing spherical coordinates in d-dimension for
the loop momentum k1 , d d k1 = k1d−1 dk1 dΩ(d, k̂1) one has
∞
1 (k12 )d/2−1 dk12
I5 (d, n) = d f5 d, k12 , (6.6)
2
(k12 + 1)n
0
dΩ(d, k̂1)
f5 d, k12 = I4 d, 1, (p − k1 )2 , (6.7)
Ω(d)
where Ω(d) is the d-dimensional solid angle, and I4 (d, 1, (p − k1 )2 ) is the 3-loop (off
mass-shell) sunrise integral
I4 d, n, (p − k1 )2
d d k2 d d k3 d d k4
= π −3d/2 . (6.8)
((k2 − k1 ) + 1) ((k3 − k2 )2 + 1)(k4 − k3 )2 (p − k4 )2
2 n
1
1 d d
I5 (d, n) = d t n−1 (1 − t) 2 −1 t − 2 f5 d, 1−t
t dt, (6.9)
2 0
from which one gets the relation between v5 (d, t) and f5 (d, (1 − t)/t)
1
d (1 − t) 2 −1 t − 2 f5 d, 1−t
d d
v5 (d, t) = t dt. (6.10)
2
From that relation we see that we can derive boundary conditions for v5 (d, t) in the
t → 1 limit from the expansion of f5 (d, k12 ) in the k1 → 0 limit, which is easy to obtain.
Indeed, only the first denominator of Eq. (6.8) depends on k1 ; expanding for small k1 and
180 S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188
p2 = −1. (6.15)
The problem of evaluating the I4 (d, n) is similar to the original problem of evaluating
the I5 (d, n), but in fact it is much simpler, as the I4 (d, n) involve one less loop and one
less propagator. As above, by using integration-by-parts identities one finds that I4 (d, n)
satisfies the third order difference equation
6(n − 1)(n − 2)(n − 3)I4 (d, n) − (n − 2)(n − 3)(10n − 7d − 10)I4 (d, n − 1)
+ (n − 3) 2n2 + (2d − 18)n − 7d 2 + 29d − 8 I4 (d, n − 2)
+ (n − d − 1)(n − 2d + 1)(2n − 3d)I4 (d, n − 3) = 0. (6.16)
We solve the difference equation by using again the Laplace ansatz
1
I4 (d, n) = t n−1 v4 (d, t) dt, (6.17)
0
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 181
d3
2t 3 (3t + 1)(t − 1)2 v4 (d, t)
dt 3
d2
+ t 2 (t − 1)(36t 2 + (6 − 7d)t − 9d + 18) v4 (d, t)
dt 2
d
+ t 36t 3 − 14dt 2 + (−7d 2 + 33d − 36)t + 13d 2 − 61d + 72 v4 (d, t)
dt
+ (d − 3)(2d − 5)(3d − 8)v4 (d, t) = 0. (6.18)
Following the procedure used above, we write
∞
1 (k22 )d/2−1 dk22
I4 (d, n) = d f4 k22 , (6.19)
2
(k22 + 1)n
0
dΩ(d, k̂2)
f4 k22 = I3 d, (p − k2 )2 (6.20)
Ω(d)
d d k3 d d k4
I3 d, (p − k2 )2 = π −d , (6.21)
((k3 − k2 ) + 1)(k4 − k3 )2 (p − k4 )2
2
1 d d
v4 (d, t) = d (1 − t) 2 −1 t − 2 f4 d, 1−t
t . (6.22)
2
At variance with the previous case, the function f4 (d, k22 ) is not regular for k2 → 0, as at
k2 = 0 the value of the external momentum squared (p − k2)2 becomes the threshold of the
2-loop sunrise graph associated to I3 (d, p2 ). But it is not difficult to evaluate analytically
I3 (d, q 2 ) for generic off-shell q 2 by using Feynman parameters:
2(5 − d) 3 − d2 2 d2 − 1
I3 (d, q ) =
2
d 2 F1 3 − d, 2 − 2 ; 2 ; −q ,
d d 2
(d − 4) (3 − d) 2
2
with N = −(2d − 5), as in the term (q 2 + 1)2d−5 of Eq. (6.23), one obtains
f4 (d, k22 ) = a0 (d) 1 + O k22
d2 52 − d 1
+ 1 b0 (d)(k22 ) 2 (2d−5) 1 + O k22 . (6.25)
2(5 − 2d) 2 (3d − 5)
Using the variable 1/(k22 + 1) = t in Eq. (6.25) and inserting it in Eq. (6.22) one gets the
initial condition for v4 (d, t) at the singular point t = 1
a0 (d) d
v4 (d, t) = d (1 − t) 2 −1 1 + O(1 − t)
2
52 − d 1
+ 1 b0 (d)(1 − t) 2 (3d−7) 1 + O(1 − t) . (6.26)
2(5 − 2d) 2 (3d − 5)
By inspecting the equation (6.18) one gets that the behaviour at t = 0 of v4 (d, t) is
in the points t = 1, 1/2, 1/4, 1/8, 1/16 and 0 (as above, this subdivision is due to the
presence of a singular point at t = −1/8). By inspecting the equation (6.5) one gets that
the behaviour at t = 0 of v4 (d, t) is
(4)
+ c5 (d)t (−5d+12)/2, (6.29)
so that for d → 4 the integral (6.4) is surely convergent for n 5; then we calculate the
integral (6.4) for n = 5, 6, 7, 8, 9 by integrating the series term by term. By using repeatedly
“top–down” the recurrence relation (6.16) starting from n = 9, we obtain I5 (d, 6), I5 (d, 5)
(used for cross-check), I5 (d, 4), . . . , I5 (d, 1). The result, up to the coefficient of order 5 in
(d − 4) included, is
I5 (d, 1)=(1 + )4 −0.25 −4 − 0.875 −3
− 1.416666666666666666666666666667 −2
− 1.449652777777777777777777777778 −1
− 14.055442461941065705599451765901
− 90.416062304531327135375791542063
− 1170.3684076603804614545918785105 2
− 5299.6462727245240600241060624284 3
− 37132.219579420859394978093377604 4
− 144381.92488313453838475109116166 5 + O( 6 ) (6.30)
Taking into account the normalization (6.2) one finds that the first 8 terms of Eq. (6.30)
agree with Eq. (5.16).
We want only to mention that we have also independently checked the numerical
result (6.30) by calculating the master integral with all masses equal to one by difference
equations, and then by using the value so obtained as initial condition for the integration
of a differential equation in the photon mass λ from λ = 1 to λ = 0.
7. The x → 0 values
and
−1 dd l 1 1 n − d2 1
C (d) = , (7.2)
(2π)d−2 [(l − q)2 + c]n 4 3 − d2 (n) cn− d2
one gets
dd l 1 1
C −1 (d)
(2π) d−2 [(k − l) + a] (l + b)β
2 α 2
1 d d
1 α + β − d2 x 2 −β−1 (1 − x) 2 −α−1
= dx , (7.3)
4 (α)(β) 3 − d2 [k 2 + d(a, b, x)]α+β− 2
d
0
where
ax + b(1 − x)
d(a, b, x) = .
x(1 − x)
We rewrite Eq. (2.11) as
C −4 (d) dd q d d k2 d d k1
Φ(d, x) =
(2π) 4(d−2) [(p − q) + x ] (q − k2 )
2 2 2 (k2 − k1 )2
dd l
× , (7.4)
[(k1 − l)2 + 1](l 2 + 1)
and then use repeatedly Eq. (7.3), using in the order the parameters y, y1 , y2 , z for
integrating the loops l, k1 , k2 , q, obtaining
(1 − 2(d − 4))
Φ(d, x) =
1024(d − 3)(d − 4)(2d − 5)(2d − 7) 4 1 − 12 (d − 4)
1 1
d d d
−2
× dy y 2 −2 (1 − y) 2 −2 dy1 y12 (1 − y1 )d−3
0 0
1 d
−2 3
× dy2 y22 (1 − y2 ) 2 d−4 Ψ (d, x, y, y1, y2 ), (7.5)
0
where
1 3
z3− 2 d
Ψ (d, x, y, y1, y2 ) = dz , (7.6)
[(1 − z)2 x 2 + zD(y, y1 , y2 )]5−2d
0
1
D(y, y1 , y2 ) = . (7.7)
y(1 − y)(1 − y1 )(1 − y2 )
The above formulae are valid for any x; form now on we take 0 < x 1. For definiteness,
we take also d to be “just bigger” than 2 (i.e., d = 2 + η, with 0 < η 1). The z-integral
will be carried out first. To that aim, introduce an infinitesimal parameter Z, such that
0 < x 2 Z 1 (a possible choice might be Z = −x 2 ln x, but the exact value of Z will
S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188 185
Ψ1 (d, x, y, y1, y2 ) = dz
x 2 5−2d
D(y, y1 , y2 ) + z
0
Ψ1 (d, x, y, y1, y2 )
3 Z
2 z4− 2 d
=
d − 2 [x + zD(y, y1 , y2 )]
2 5−2d
0
Z 3
z3− 2 d
+ (2d − 5)x 2
dz 2
[x + zD(y, y1 , y2 )]6−2d
0
186 S. Laporta et al. / Nuclear Physics B 688 (2004) 165–188
Z 3
2d − 5 2 z3− 2 d
=2 x dz ,
d −2 [x 2 + zD(y, y1 , y2 )]6−2d
0
as the end-point contributions vanish for d just bigger than 2. We can now modify the
integration interval for Ψ1 (d, x, y, y1, y2 ) into
Z ∞ ∞
dz = dz − dz,
0 0 Z
and write, correspondingly,
Ψ1 (d, x, y, y1, y2 ) = Ψ1∞ (d, x, y, y1, y2 ) − Ψ1Z (d, x, y, y1, y2 ), (7.10)
where, thanks to the previous integration by parts, the two resulting z-integrals (from 0 to
∞ for the first, from Z to ∞ for the second) are now both convergent for d just bigger
than 2.
We start again from the second term,
∞ 3
2d − 5 2 z3− 2 d
Ψ1Z (d, x, y, y1, y2 ) = 2 x dz ;
d −2 [x 2 + zD(y, y1 , y2 )]6−2d
Z
in the denominator we can neglect x2, the resulting integral is trivial and the result can be
written as
2d − 5 1 2 x 2 d −1 ∞
Ψ1Z (d, x, y, y1, y2 ) = 2 z2 ,
d − 2 [D(y, y1 , y2 )] 6−2d d −4 z Z
By collecting the results Eqs. (7.13), (7.11), (7.10), (7.9), (7.8), one obtains the value of
the function Ψ (d, x, y, y1, y2 ) to be substituted in Eq. (7.5); recalling Eq. (7.7) one sees
that all the remaining integrations factorize and can be carried out in terms of Euler’s Beta-
functions Eq. (7.12).
It is clear that the final result for Eq. (2.11) for x → 0 consists of two terms, the first
constant (independent of x), the second proportional to x d−2 . In the notation of Eq. (3.5)
we have
(1) 3d − 11
A0 = −
8(d − 2)(d − 3)(d − 4)3 (2d − 5)(2d − 7)(3d − 8)(3d − 10)
(1 − (d − 4))(1 − 2(d − 4)) 2 1 + 12 (d − 4) 2 1 − 32 (d − 4)
× ,
4 1 − 12 (d − 4) (1 − 3(d − 4))
2(2d − 7)
A0 (2) = −
3(d − 2)2 (d − 3)(d − 4)4 (3d − 8)(3d − 10)
1 + 12 (d − 4) 1 − 32 (d − 4) 2 (1 − (d − 4))
× . (7.14)
2 1 − 12 (d − 4) (1 − 2(d − 4))
(1)
Let us observe that the term A0 is the value of the vacuum graph in Fig. 3, in agreement
with the result in Eq. (A.12) of [13] (up to a different normalization).
Acknowledgements
We are grateful to J. Vermaseren for his kind assistance in the use of the algebra
manipulating program FORM [9], by which all our calculations were carried out.
References
Abstract
It is shown that hypothetical neutrino–majoron coupling can suppress neutrino flavor oscillations
in the early universe, in contrast to the usual weak interaction case. This reopens a window for a
noticeable cosmological lepton asymmetry which is forbidden for the large mixing angle solution in
the case of standard interactions of neutrinos.
2004 Elsevier B.V. All rights reserved.
1. Introduction
0550-3213/$ – see front matter 2004 Elsevier B.V. All rights reserved.
doi:10.1016/j.nuclphysb.2004.04.002
190 A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213
equivalent to an addition of
4 2
15 ξµ,τ ξµ,τ
Nν = +2 (1)
7 π π
massless neutrino species at the BBN epoch. If the latter is bounded by Nν < 1 (safe
bound) then |ξµ,τ | < 1.5. For less conservative bound, Nν < 0.4 [3], the dimensionless
chemical potentials should be below 0.9. Still, asymmetry of the order of unity would have
noticeable cosmological effects on large scale structure formation and CMBR. The limits
quoted above are valid if the chemical potential of only one kind of neutrino is non-zero.
If a conspiracy between different chemical potentials is allowed, such that a positive effect
of ξµ or ξτ is compensated by ξe or vice versa, the limits would be somewhat weaker.
According to Ref. [4] they are: |ξe | < 0.2 and |ξµ,τ | < 2.6.1
The bounds on chemical potentials of νµ and ντ can be significantly improved because
of the strong mixing between different neutrino flavors [6]. This mixing gives rise to the
fast transformation between νe , νµ , and ντ in the early universe and leads to equilibration
of asymmetries of all neutrino species. Thus the BBN bound on any chemical potential
becomes essentially that obtained for νe [7] (see also the papers [8]):
In this case the cosmological impact of neutrino degeneracy would be negligible. However,
a compensation of ξe by other types of radiation is still possible, which was studied in
Ref. [9]. The authors of Ref. [9] put the constraints on ξe and Nν based on WMAP data
and BBN:
It is interesting to see if one could reasonably modify the standard model to allow large
muonic and/or tauonic charge asymmetries, together with a small electronic asymmetry,
to avoid conflict with BBN. This is the aim of this work. A natural generalization is to
introduce an additional interaction of neutrinos with massless or light (pseudo)Nambu–
Goldstone boson, majoron [10]. The idea to invoke majoron to modify neutrino oscillations
in primeval plasma was discussed in Refs. [11,12]. In these papers, two different
mechanisms which could block or suppress oscillations between active and hypothetical
sterile neutrinos have been considered. We have, however, some concerns about the
validity of their results which we will discuss in the next section. Let us note that in this
paper we consider an impact of neutrino majoron interactions on the oscillations between
active neutrinos and not on active–sterile oscillations, as is done in the mentioned above
papers.
1 We are not concerned in this paper with the origin of such large lepton asymmetry. Thus far, many
cosmological scenarios that explain both the small observed baryon asymmetry and a possible large lepton
asymmetry simultaneously, have been proposed [5].
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 191
and, e.g., for δm2 = 10−2 eV2 and η ∼ 1 the mixing angle in matter would be suppressed
more than by three orders of magnitude in the BBN range of temperatures, T ∼ MeV.
However, this result is valid for mixing between active and sterile neutrinos and is not
true for active–active mixing. In the last case the effective potential has large off-diagonal
matrix elements [15], Vab , with a = b, which compensate the suppression induced by the
diagonal terms. This is the reason for non-suppressed oscillations between active flavors
and for the equilibration of all leptonic asymmetries [7].
Possible additional interactions of neutrinos with majoron can be described by the
Lagrangian:
Lint ∼ χ gab νaT Cνb + h.c. , (11)
where χ is the operator of the majoron field, C is the matrix of charge conjugation, and
gab are the coupling constants. We will assume for simplicity, though it is not necessary,
that only the flavor-diagonal coupling is non-vanishing, gab ∼ gaa δab . We will consider
the coupling matrix with more general form later. Let us also assume that the majoron is
so light that its decay and inverse decay are not essential at the BBN epoch. Neutrino–
(anti)neutrino scattering through majoron exchange gives rise to a contribution to neutrino
effective potential (proportional to forward scattering amplitude) which can be estimated
as
(χ)
Vab ∼ gaa gbb T . (12)
The constants gaa should satisfy several constraints to make the mechanism operative.
(χ)
Firstly, the diagonal part of the potential Vaa should be larger than the weak potential
V (w) given by Eq. (6), while its off-diagonal components must be much smaller than
(χ) (χ)
the diagonal ones, Vab Vaa , that is, the flavor symmetry in the neutrino–majoron
interactions should be strongly broken. These two conditions would ensure suppression
of flavor changing oscillations between active neutrinos. To realize these conditions the
following constraints should be imposed:
2
−11 (a) Tae
2
gaa > 10 η and gee gaa . (13)
1 MeV
Here a labels µ or τ and the coupling of majoron to νe is assumed to be much weaker than
those to νµ and/or ντ ; Tae is the temperature at which νa are effectively transformed into νe
in the standard theory. According to calculations of Ref. [7] it takes place around Tae = 1
MeV for the large mixing angle solution to solar neutrino deficit, while the transformation
between νµ and ντ in the standard theory takes place somewhat above 10 MeV. In fact, as
we see in what follows from numerical calculations, the more accurate bound is much less
restrictive than this simple estimate (see Eq. (49) or Figs. 6 and 7 below).
The second inequality (13) implies that the off-diagonal components of the effective
potential are suppressed in comparison with the diagonal ones and the oscillations
remain blocked. This is not so in the case of the standard weak interactions which are
flavor symmetric and the large value of the denominator due to large Vaa in Eq. (4) is
compensated by the same large off-diagonal components of the effective potential.
Secondly, the coupling constants gaa should not be too large, otherwise flavor non-
conserving reactions of the type νe νa ↔ ν̄e ν̄a (or similar) would lead to equilibration of all
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 193
leptonic charges. To avoid that the rate of these reactions, Γea ∼ σea T 3 , should be smaller
than the cosmological expansion rate H ∼ T 2 /mPl , where mPl = 1.221 × 1022 MeV is the
Planck mass. Thus, to suppress e − µ or e − τ transformation through direct reactions one
needs
T
2 2
gaa gee < 10−22 . (14)
1 MeV
This conditions should be satisfied for temperatures above the BBN range, i.e., T > 1
MeV. Similarly, if we require that νa νa ↔ ν̄a ν̄a should not occur efficiently, the coupling
constants must satisfy a similar inequality with gee replaced with gaa .
There are quite strong limits on possible coupling of majoron to neutrinos which follow
from astrophysics [16,17]; discussion and references to earlier works can be found e.g.
in the book [18]. Astrophysics allows either very small or quite large coupling constants.
The former is quite evident, while the latter appears because strongly interacting majorons,
though efficiently produced inside a star, cannot propagate out and carry away the energy,
thus opening a window for large values of the coupling. It is not so for the coupling
to νe because the latter is bounded from above by the data on double beta decay [19],
gee < 3 × 10−5 . Together with the supernova bounds, the upper limit is shifted down to
gee < 4 × 10−7 [17], with a small window around (2–3) × 10−5 . So we assume in the
following that gee < 4 × 10−7 . For µ or τ the allowed regions are: gaa < (3–5) × 10−6 or
gaa > (3–5) × 10−5 . Evidently the conditions specified above can be satisfied. As reference
values we may take gee = 10−7 and gaa = 5 × 10−6 which satisfy all constraints presented
above and would lead to a suppression of the transformation of νµ or ντ into νe , thus
permitting a large muonic or tauonic asymmetry combined with a small electronic one at
BBN. Note that it is not necessary to assume a large value of gaa 10−5 to accomplish this.
As shown later, such a large coupling constant would change the scenario into something
more complex.
If majoron is lighter than the mass difference of neutrinos then a heavier neutrino would
decay into a lighter one and majoron. This process might leave traces in the flavor ratios of
the high-energy neutrinos from distant astrophysical sources [20], which can be detected by
e.g., IceCube [21]. This could be potentially sensitive to very small values of the neutrino–
majoron coupling. Existing bound [22] on life-time/mass of decaying neutrinos based on
the solar neutrino data, τ/m 10−4 s/eV, is too weak to be essential for the mechanism
discussed in the present paper.
In the subsequent sections we will present more accurate calculations but before turning
to a closer examination of the scenario, let us make a few comments on earlier papers where
majoron suppression of neutrino oscillations in the early universe have been considered.
In Ref. [11] it was assumed that there exists a coupling of an active and sterile neutrinos
to majoron. Let us note that in the version of the theory that we consider here no sterile
neutrinos are introduced: the negative helicity states are identified with neutrinos, while
the positive helicity states are identified with antineutrinos. The coupling considered in
Ref. [11] has the form:
where νa is an active neutrino flavor and νs is a sterile one. The authors argued that
the effective potential induced by the interactions of active and sterile neutrinos with
majoron strongly suppressed the oscillations. However they omitted off-diagonal Vas -term
in the effective potential which might invalidate their result. Possibly this mechanism of
oscillation suppression may operate in a more complicated version of the model.
In Ref. [12] a different mechanism of neutrino oscillation blocking by majoron has
been suggested. According to the equation of motion, the total leptonic current including
majoron and neutrino contributions is conserved:
f D 2 χ + Dµ J µ = 0, (16)
where Dµ is the covariant derivative in cosmological background, f is the vacuum
expectation value of the Higgs field responsible for the spontaneous breaking of leptonic
UL (1)-symmetry and J µ is the leptonic currents of fermions. In spatially homogeneous
case the solution to this equation is
ain 3
χ̇(t) = χ̇in − Jint /f + J t (t)/f, (17)
a(t)
where a(t) is the cosmological scale factor and the sub-index “in” indicates initial values.
We assume that χ̇in = 0. Hence the authors of Ref. [12] concluded that χ̇ = ηB nγ /f ,
where ηB = 6 × 10−10 is the baryon asymmetry of the universe. The contribution to the
neutrino effective potential from this term is δV χ = χ̇/f and, according to [12], with
f 2 = (5–9) GeV2 the potential would be strong enough to suppress the oscillations.
However, according to the usual scenarios, baryon asymmetry could be created at much
larger temperatures, much higher than this scale. Thus, after spontaneous symmetry
breaking, leading to creation of majoron, J t remains constant in comoving volume and
χ̇ = 0. A non-zero χ̇ might be created if lepton asymmetry was generated at spontaneous
breaking of leptonic UL (1) or by the neutrino oscillations themselves but both these cases
need further and more detailed investigation. So in our considerations we assume that the
mechanism of Ref. [12] is not operative.
1 ∗
Lint = χ gab ΦaT σ2 Φb + gab Φb† σ2 Φa∗
2
i ∗ †
= χ gab νaT Cνb + gab νb Cνa∗ , (19)
2
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 195
Fig. 1. s- and t-channel diagrams of neutrino–(anti)neutrino scattering through majoron exchange. Both diagrams
contribute to the off-diagonal component of the effective potential.
Fig. 2. Diagram of neutrino–majoron scattering through neutrino exchange. If majorons are not abundant in the
primeval plasma, this diagram gives negligible contribution to the effective potential.
Ep ≡ |p|,
Etot ≡ p Ep + q Eq + r Er + s Es ,
p·q
cos θpq ≡ ,
|p||q|
∗
F (p, q) ≡ gab νa (p)T Cνb (q) + gab νb (−q)† Cνa∗ (−p), (21)
2 Note that there are other diagrams if small majorana masses are taken into account. However, the amplitudes
of such diagrams are suppressed by powers of mν /Eν 1 for relativistic neutrinos.
196 A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213
= dp ai (p)up + bi (−p)† v−p e−ip t +ipx ,
0
(22)
Vp ab
= dq g ρq + ρ Tq g ab , (23)
4|p||q|
where the density matrices are defined as
aj† (p) ai (p ) = (2π)3 δ (3) (p − p )[ρp ]ij ,
†
bi (p) bj (p ) = (2π)3 δ (3) (p − p )[ρ p ]ij . (24)
If we take the coupling constant matrix gab to be diagonal, the effective potential becomes
(νν)
∗ 1 T
Vp ab
= gaa gbb dq ρq + ρ Tq ab (no summation)
4|p||q|
∗ T2
∼ gaa gbb .
|p|
This result confirms the rough estimate of the effective potential in the previous section.
The neutrino–majoron scattering shown in Fig. 2 also contributes to the effective
potential and can be calculated similarly. The result is
(νχ)
fχ (q) †
Vp ab
= dq g g ab , (25)
4|p||q|
where fχ (p) is the number density of majorons with momentum p. Note that this
expression vanishes if the abundance of majorons is negligible in comparison with
(eq)
their equilibrium value, i.e., fχ (p) fχ (p). Thus the complete effective potential for
neutrinos with momentum p induced by interactions with majorons is given by
(χ)
1 † T
Vp ab = dq g ρq + ρ Tq + fχ (q) · 1 g ab , (26)
4|p||q|
where 1 is the unit matrix in the flavor basis.
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 197
(el)
The contribution of Icoll to this integrated difference vanishes. The resulting equation takes
the form:
9|g|4
∂x eξ − eξ̄ = − 7 3 e2ξ − e2ξ̄ , (30)
2 π h
where we have used the formula presented in Appendix C. It is also assumed above that
ξ = −ξ̄ . Otherwise the contribution from L-conserving ν̄ν-annihilation would not vanish
in the collision integral. If ξ 1 we may neglect exp(−ξ ) and solve Eq. (30) as
exp[ξ(0)]
exp ξ(x) = . (31)
1 + 9|g|4 x exp[ξ(0)]/(128π 3h)
The lepton asymmetry would not be destroyed by reaction (27) if
Note that the collision integral with this amplitude squared involves an IR logarithmic
divergence, therefore we need to input a lower cutoff scale. Taking into account the
thermal effects, the dispersion relations for the majoron and the neutrinos change from
those at zero temperature. Especially, they obtain finite thermal masses, which provide the
desired infra-red cutoff. For |g| ∼ O(10−5 ), the thermal effect of the neutrino–majoron
interaction dominates over that due to the electroweak interaction, so the IR divergent part
is regularized as ∼ log(4E 2 /m2T ) with mT ∼ |g|T .
Kinetic equation governing the production of majorons is similar to Eq. (28), and has
the form:
(eq)
Demanding that nχ < nχ we obtain
which gives a constraint similar to Eq. (32). However, if we take into account quantum
statistics, the bound becomes slightly weaker for large charge asymmetry of neutrinos, as
shown in Fig. 5.
So, to summarize, the muonic or tauonic lepton asymmetry would be erased if the
neutrino–majoron interaction is sufficiently strong, i.e., |g| 10−5 . If we restrict ourselves
to a scenario that the majoron–neutrino interactions suppress neutrino oscillations and
thereby keep the large lepton asymmetries intact, the largest coupling constant should
be smaller than ∼ 10−5 . However this does not necessarily mean that the conspiracy
between the speed-up effect of ξµ,τ and a shift of β equilibrium due to ξe is impossible for
|g| 10−5 . In fact, if the L-non-conserving interactions became efficient only after muonic
and tauonic neutrinos decoupled from the thermal plasma, then the neutrino degeneracy
would still be maintained in muonic or tauonic neutrinos resulting in their larger energy
density at BBN. Majorons, as well, can contribute to additional relativistic degrees of
freedom at BBN. We will discuss these issues below.
200 A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213
Fig. 5. Upper bounds on |g| obtained from the requirement that L-violating reactions, 2ν ↔ 2ν̄ and ν + ν̄ ↔ 2χ ,
are out of equilibrium. Numerical and analytical bounds for the former reaction are shown as solid and
long-dashed lines, respectively. The short-dashed line represents the constraint obtained numerically for the latter
reaction.
As is established now, neutrino mass eigenstates are related to the flavor eigenstates
through the orthogonal matrix (we neglect a possible CP violation):
νa = Uaj νj , (37)
where a = e, µ, τ and j = 1, 2, 3. We have chosen the parameters so that in the limit of
small mixing νe ≈ ν1 , νµ ≈ ν2 , ντ ≈ ν3 . However, the mixing between neutrinos are known
to be large and they cannot be taken as a dominant single mass eigenstate.
Kinetic equations for the density matrix of oscillating neutrinos can be written as
[23,25]:
i ρ̇ = H(1) , ρ − i H(2), ρ , (38)
where the first commutator term includes the vacuum Hamiltonian and the effective
potential of neutrinos in medium calculated in the first order in Fermi coupling constant,
GF (weak interaction part) and in the second order in the coupling to majoron. The latter
is given by Eq. (26) and does not contain off-diagonal terms if only one flavor coupling
constant dominates. The weak interaction potential contains off-diagonal terms of the same
order of magnitude as diagonal ones because of universal coupling of W and Z bosons
to all neutrino flavors. Their explicit expressions can be found, e.g., in Ref. [7]. In what
follows we will skip upper indices (1) and (2) at H.
The second anti-commutator term in Eq. (38) describes breaking of coherence induced
by neutrino scattering and annihilation as well as neutrino production by collisions in
primeval plasma. It includes imaginary part of the Hamiltonian calculated in the second
order in GF and the fourth order terms in neutrino–majoron coupling g related to neutrino–
neutrino scattering through majoron exchange. If majorons were abundant in the primeval
plasma the processes of neutrino–majoron scattering should also be included.
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 201
Since the “atmospheric” neutrino mass difference is much larger than the “solar” one,
we may simplify the problem considering the process in two steps. First, oscillations
between ντ and νµ were switched-on. This started at temperatures about 10–15 MeV and
might lead to equilibration of tauonic and muonic charge asymmetries if these asymmetries
were not erased by reactions (27). Later at T 3 MeV, oscillations would start between νe
and νµ , which is a certain mixture of cντ + sνµ . This process is potentially dangerous for
BBN because it could change number density and spectrum of νe and ν̄e .
If mixing is effective only between two neutrinos, the density matrix is 2 × 2 and kinetic
equations for its components have the form:
are matrix elements of the mixing matrix; in the case under consideration Uµ2 = Uτ 3 =
cos θ ≡ c and Uµ3 = −Uτ 2 = sin θ ≡ s. The coherence breaking terms are given by the
usual collision integrals I (coll) in the equations for the diagonal components and by
Γµτ = 1.1 × 10−22 y/x 5 MeV (42)
for the non-diagonal component ρµτ = ρτ∗µ . In the expression for Γµτ we neglected the
contribution from the majoron related processes. The effective potential Vab contains
contributions from the usual weak interactions and from the neutrino–majoron interactions
(26). We assume that the weak part is dominated by the charge asymmetric contribution,
which is true in the case of a large charge asymmetry:
(w)
Vab = 1.5C1 GF T 3
dy (ρab − ρ̄ab ) ≡ B dy (ρab − ρ̄ab ) (43)
with C1 defined in Eq. (6), dy = d 3 y/(2π)3 , ρ̄ is the antineutrino density matrix, and the
coefficient 1.5 comes from (11/4) · 2 ζ(3)/π 2 , i.e., from normalization to photon number
density.
Eqs. (39)–(41) can be solved numerically but we can make reasonable estimates
analytically in the following way (for more details see, e.g., Refs. [1,26]). In the case of
strong coherence breaking, i.e., Γµτ
H , Eq. (41) can be formally solved in the stationary
point approximation, i.e., putting r.h.s. equal zero. In this way we obtain:
Hµτ (ρµµ − ρτ τ )
ρµτ =
Hµµ − Hτ τ − iΓµτ
ρµµ − ρτ τ δm232 iΓµτ − c2 δm232 /2E
≈ Vµτ + s2 1 + , (44)
Vµµ − Vτ τ 4E Vµµ − Vτ τ
where δm232 ≡ m23 − m22 , s2 = sin 2θ and c2 = cos 2θ .
This expression could be substituted into Eqs. (39) and (40) for the diagonal components
but one cannot obtain the closed equations for the latter because the potential Vµτ contains
an integral from ρµτ over momentum, see Eq. (43). We assume for simplicity that only
202 A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213
(w)
weak-interaction potential has noticeable off-diagonal components, i.e., Vµτ Vµτ . It can
be true if, e.g., |gτ τ |
|gµµ |. To express Vµτ through diagonal components we integrate
Eq. (44) and similar equation for ρ̄µτ over momentum and subtract one from the other. To
simplify the expressions let us present the diagonal part of the potential as a sum of charge
symmetric (majoron) and antisymmetric (weak) parts:
(χ)
Vµµ − Vτ τ ≡ V+ + V−(w) , (45)
(w)
where the charge asymmetric part V− can be separated into two terms containing
integrals from neutrino and antineutrino elements of the density matrix (see Eq. (43)),
V−(w) = Vν − V̄ν .
After straightforward calculations we obtain:
(χ) 2
V+ − (Vν − V̄ν )2 s2 δm232 B
Vµτ = (χ) (χ)
dy
V+ (V+ − Vν − V̄ν ) 4E
ρµµ − ρτ τ iΓµτ − δm232 c2 /2E
× (χ) 1+ (χ)
V+ + Vν − V̄ν V+ + Vν − V̄ν
ρ̄µµ − ρ̄τ τ i Γ¯µτ − δm232 c2 /2E
− (χ) 1+ (χ)
. (46)
V+ − Vν + V̄ν V+ − Vν + V̄ν
(χ) (w)
This result is valid if |V+ | |V− |. In the case of the usual weak interactions considered,
(χ) (w)
e.g., in Refs. [7,8] the situation is opposite, |V+ | |V− |, and the character of the
approximate analytical solution is completely different.
(χ) (w) (χ)
For a rough estimate let us assume that |V+ | > |V− | and |V+ | > |δm232 |/2E, while
|Vµτ | < |s2 δm32 |/2E, as follows from Eq. (46). In this case we find
2
2 2
δm32 Γµτ (ρµµ − ρτ τ )
H x∂x ρµµ = − s2 (χ) (χ) 2
− Iµµ
(coll)
. (47)
4E (Vµµ − Vτ τ )
The evolution of the difference of muonic and tauonic charges is determined by
(µτ )
L (µτ )
≡ dy Ly ≡ dy (ρµµ − ρτ τ − ρ̄µµ + ρ̄τ τ ). (48)
The collision integrals disappear from the time derivative of this difference since weak
interactions conserve leptonic charges and majoron induced processes are assumed to be
suppressed, see Section 4 and in particular Eq. (32). The evolution of L is governed by the
equation:
−4 2
dL(µτ ) |g| δm232 dy (µτ )
≈ −7
Ly (49)
dx 1.5 × 10 −3
2.5 × 10 eV 2 y
and for the majoron coupling constant bounded by the conditions presented above, L(µτ )
remains practically constant till nucleosynthesis epoch, i.e., x ∼ 1.
We have performed numerical calculations to solve Eqs. (39)–(41) in a similar way to
Ref. [7]. The coupling constant matrix gab is assumed to take the form: gab = g δaτ δbτ ,
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 203
Fig. 6. The evolutions of ξµ and ξτ for several values of |g| for the maximal mixing and δm232 = 2.5 × 10−3 eV2 .
The initial conditions are ξµ = −0.5 and ξτ = 1.0.
for simplicity. The evolution of dimensionless chemical potentials ξµ and ξτ for several
values of |g| are shown in Fig. 6. One can see that the lepton asymmetries of νµ and
ντ equilibrate around T ∼ 12 MeV when |g| = 0. We have checked that the evolution
remains the same for |g| < 10−8 . As |g| increases, the oscillations become less efficient
and completely stop for |g| 10−7 , which well agrees with the analytical estimates. It
should be also noted that the lower bound on |g| obtained from Eq. (49) does not depend
on the magnitude of the initial lepton asymmetries. We have checked that the oscillations
are similarly stuck for |g| 10−7 even if the initial lepton asymmetries are taken to be
smaller. The only difference is the temperature at which the asymmetries equilibrate when
g = 0.
We can repeat the same arguments for oscillations between νe and νµ . The squared
mass difference and mixing angle parameters for the oscillations between νe and νµ are
δm221 ≡ m22 − m21 = 7.3 × 10−5 eV2 and sin2 θ = 0.315 [27]. The coupling constant matrix
gab is similarly approximated to be gab = g δaµ δbµ . Also the effective potential induced
by the energy densities of electrons and positrons is taken into account, since here we
consider oscillations including νe . The lower bound of |g| becomes slightly relaxed due
to the smaller mass difference, as can be seen from Fig. 7. The reason why the lepton
asymmetries of νe and νµ are not equilibrated completely when g = 0 is that the mixing
angle we adopt is not maximal.
Lastly, we comment on the form of the coupling matrix, gab , for which the (νe − νµ )-
transformations are suppressed and, consequently, a large lepton asymmetry of the universe
is allowed. So far we have assumed that gab is diagonal in order to deal with both ντ − νµ
and νe − νµ oscillations similarly. However, in fact, a coupling matrix with more generic
form works as well. We would like to stress that the equilibration of the muonic and tauonic
lepton asymmetries is not harmful to our purpose, as long as these asymmetries are not
204 A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213
Fig. 7. The evolutions of ξe and ξµ for several values of |g| with sin2 θ = 0.315 and δm221 = 7.3 × 10−5 eV2 .
The initial conditions are ξe = 0.1 and ξµ = −0.5.
erased through either oscillations or by the reactions shown in Figs. 3 and 4. Therefore
we do not have to assume the hierarchical structure in gτ τ , gµτ , and gµµ , and they are
constrained only by the astrophysical bounds:
with (a, b) = {(τ, τ ), (µ, τ ), (µ, µ)}. In order to suppress the oscillations between νe and
νµ , at least one of these coupling constants should be larger than ∼ 10−7 , while |gee |,
|geµ |, and |geτ | must be much smaller than ∼ 10−7 . Thus we conclude that the coupling
matrix suitable for our purpose should satisfy:
In the simplest class of majoron models, the coupling matrix, gab , is proportional to the
neutrino mass matrix (mν )ab . In particular, for the normal mass hierarchy, m1 < m2 < m3 ,
the reconstructed mass matrix is often parametrized as [28]
0 0 λ λ2 λ λ
g ∝ mν ∝ 0 1 1 or λ 1 1 (52)
λ 1 1 λ 1 1
with λ ∼ 0.2. Therefore the coupling matrix of this type can satisfy the conditions (51). Of
course, in the more involved models, the coupling matrix is not necessarily proportional to
the neutrino mass matrix.
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 205
overshoot it. Then we do not need to suppress neutrino oscillations, therefore the coupling
constant matrix gab is allowed to take rather arbitrary values as long as the L-violating
processes come into equilibrium after decoupling of muonic and tauonic neutrinos.
Thus our examination indicates that a conspiracy between the speed-up effect and
the shift of β equilibrium is possible for |gab | 10−5 , as long as |gee |, |gea | |gab |
with a, b = µ, τ . Furthermore, even for |gee |, |gea | 10−5 , such a cancellation might be
possible in a somewhat different way. This considerably extends the allowed parameter
space, although rigorous study might be necessary to obtain further quantitative results.
In this paper we have shown that the hypothetical neutrino–majoron interaction can
suppress neutrino oscillations in the primordial plasma to prevent lepton asymmetries of all
neutrino species from being equilibrated. The exact form of the effective potential induced
by this interaction is calculated. We have found an allowed range of the coupling constant:
10−7 < |g| < 5 × 10−6 (more precisely, see Eq. (51)), which satisfies the astrophysical
bounds and makes the scenario operative. For the coupling constant in this range, νe − νµ
oscillation in the early Universe is blocked, thereby keeping the cosmological lepton
asymmetry of electron type unchanged. The upper bound comes from the requirement that
lepton number is effectively conserved, and the lower bound is obtained from the study of
the evolution of the lepton asymmetries both analytically and numerically, in two flavor
approximation. The constant matrix in the simplest class of majoron models can satisfy
the desired constraints, in the case of the normal mass hierarchy. Furthermore, even for
|g| 10−5 , the large energy density necessary to cancel the effect of ξe can be supplied
by majorons, νµ and ντ , even if muonic and/or tauonic lepton asymmetries were erased.
Thus we conclude that an addition of the majoron field to the standard model can reopen
a possibility that the effect of ξe is compensated by large ξµ,τ (or by the extra energy of
majoron itself), thereby curing a probable discrepancy between the BBN and CMBR.
Acknowledgements
A.D. is grateful to the Research Center for the Early Universe of the University of Tokyo
for the hospitality during the time when this work started. F.T. is grateful to M. Kawasaki
and K. Ichikawa for useful discussions. F.T. would like to thank the Japan Society for the
Promotion of Science for financial support.
Here we present the derivation of the effective potentials Eqs. (23) and (25). First
the notations and conventions that we adopt here, are listed. The metric is taken to be
(−, +, +, +). The gamma matrices in the chiral representation are
0 1 0 σi 1 0
γ = −i
0
, γ = −i
i
, γ5 = , (A.1)
1 0 −σi 0 0 −1
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 207
where σi are the Pauli matrices. The gamma matrices in the Dirac representation are related
to those in the chiral representation as
µ µ 1 1 1
γDirac = U γchiral U † , U = √ , (A.2)
2 1 −1
and they are given by
1 0 0 σi 0 1
γ = −i
0
, γ =i
i
, γ5 = . (A.3)
0 −1 −σi 0 1 0
The charge conjugation matrix C is defined as
−i σ2 0 (chiral),
0 −σ
C ≡ iγ 2 γ 0 = 0 σ2
2
(A.4)
−i (Dirac).
σ 0 2
Hereafter we work with the gamma matrices in the Dirac representation. The momentum
expansion of a free Dirac field ψ(x) with mass m is given by
ψ(x) = dp a(p, s)u(p, s)e−iEp t +ipx + b† (p, s)v(p, s)eiEp t −ipx
s
dp a(p, s)u(p, s) + b† (−p, s)v(−p, s) e−ip t +ipx ,
0
= (A.5)
s
where we defined p0 ≡ p Ep = p p2 + m2 with p = ±1. The sign of p is chosen so
that the positive and negative-energy solutions are reproduced. s = ± describes the sign of
the spin projection eigenvalue, and we have used the notation dp ≡ d 3 p/(2π)3 . The Dirac
spinors u(p, s) and v(p, s) take the form,
1 Ep + m
u(p, s) = χu (s),
2Ep (Ep + m) −p · σ
1 −p · σ
v(p, s) = χv (s). (A.6)
2Ep (Ep + m) Ep + m
The two-component spinors χu (s) and χv (s) are not independent but related to each other
through: χv (s) = −iσ2 χu (s)∗ . If we choose them to be eigenstates of the helicity operator
p · σ /|p|, they are
−iϕ/2 −iϕ/2
e cos θ2 −e sin θ2
χu (+) = , χ u (−) = ,
eiϕ/2 sin θ2 eiϕ/2 cos θ2
−iϕ/2 −iϕ/2
−e sin θ2 −e cos θ2
χv (+) = , χv (−) = , (A.7)
eiϕ/2 cos θ2 −eiϕ/2 sin θ2
where θ and ϕ are defined as p = |p|(sin θ cos ϕ, sin θ sin ϕ, cos θ ). For flipped momentum
−p, {θ, ϕ} should be replaced with {π − θ, ϕ + π}.
In deriving the effective potential, the neutrino field ν(x) is approximated to be a
massless left-handed field:
νi (x) = dp ai (p)up + bi (−p)† v−p e−ip t +ip x ,
0
(A.8)
208 A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213
where
1 + γ5 1 1
up ≡ u(p, −) = √ χu (−),
2 2 1
1 + γ5 i 1
u−p ≡ u(−p, −) = √ χu (+),
2 2 1
1 + γ5 1 1
vp ≡ v(p, +) = √ χu (−),
2 2 1
1 + γ5 i 1
v−p ≡ v(−p, +) = √ χu (+). (A.9)
2 2 1
The annihilation and creation operators a, b, a † and b† satisfy the anticommutation
relations.
ai (p), aj (q)† = (2π)3 δij δ (3) (p − q),
bi (p), bj (q)† = (2π)3 δij δ (3) (p − q),
others = 0. (A.10)
Similarly, the momentum expansion of the free majoron field is
χ(x) = dp aχ (p)uχ (p) + aχ (−p)† vχ (−p) e−ip t +ipx ,
0
(A.11)
where
1
uχ (p) = vχ (p) = . (A.12)
2Ep
The annihilation and creation operators aχ , aχ† satisfy the commutation relations:
Etot = p Ep + q Eq + r Er + s Es ,
p·q
cos θpq ≡ ,
|p||q|
∗
F (p, q) ≡ gab νa (p)T Cνb (q) + gab νb (−q)† Cνa∗ (−p). (A.21)
Once the effective Hamiltonian is obtained, we can calculate the effective potential for
neutrinos according to Ref. [23]. All we have to do is to evaluate
Heff (t = 0), aj (p)† ai (p) = −(2π)3 δ (3) (0)[Vp , ρp ]ij , (A.22)
210 A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213
where [Vp ]ab is the effective potential matrix. Using the anticommutation relations (A.10),
we obtain the final result:
(νν)
1 † T
Vp ab
= dq g ρq + ρ Tq g ab . (A.23)
4|p||q|
Next we derive the effective Hamiltonian, which describes neutrino–majoron scattering.
In this case we integrate out the neutrino field instead of majoron field. The equation of
motion for the neutrino is
∗
−iγ 0 γ µ ∂µ νa + iχgab Cνb∗ = 0. (A.24)
The solution is then
ipµ ∗ d 4q
ν̃a (p) = g χ̃(p + q)γ µ γ 0 C ν̃b∗ (q). (A.25)
p2 ab (2π)4
Substituting this result into Eq. (19), we obtain the effective Hamiltonian:
(νχ) i ∗ d 4 p d 4 q d 4 r d 4 s i(p+q+r+s)x
Heff (t) = − gab gbc d x 3
e
2 (2π)4 (2π)4 (2π)4 (2π)4
(q + r)µ
×χ̃(p)χ̃ (q) ν̃¯ c (−r)γ µ ν̃a (s)
(q + r)2
i
where we take both neutrino and majoron fields to be on-shell in the second equation.
Applying Eq. (A.22), we obtain the effective potential due to the neutrino–majoron
scattering:
(νχ)
fχ (q) †
Vp ab
= dq g g ab , (A.27)
4|p||q|
(χ)
where the number density of majorons with momentum p, np , is defined as
†
aχ (p) aχ (p ) = (2π)3 δ (3) (p − p ) fχ (p).
Here we calculate the invariant amplitude squared for the reaction, 2ν ↔ 2ν̄. Provided
that the diagonal coupling constants are larger than the non-diagonal ones, and that one
of the diagonal coupling constants dominates, we consider neutrinos of the flavor with
the largest coupling. Hereafter we drop the sub-indices for flavor. The diagrams, which
contribute to the reaction, are shown in Fig. 3.
A.D. Dolgov, F. Takahashi / Nuclear Physics B 688 (2004) 189–213 211
where α and β represent symbolically the initial and final states. Each matrix element for
s-, t- and u-channel diagrams is given as
g2 1 T
M(s)
p3 p4 ,p1 p2 = up2 Cup1 vpT4 Cvp3 ,
(2π) (p1 + p2 )
3 2
g2 1 T
M(t )
p3 p4 ,p1 p2 =− vp3 Cup1 vpT4 Cup2 ,
(2π) (p1 − p3 )
3 2
g2 1 T
M(u)
p3 p4 ,p1 p2 = vp4 Cup1 vpT3 Cup2 , (B.2)
(2π) (p1 − p4 )
3 2
g2 1 3
M(s)
p3 p4 ,p1 p2 + Mp3 p4 ,p1 p2 + Mp3 p4 ,p1 p2 =
(t ) (u)
√ . (B.3)
(2π)3 E1 E2 E3 E4 4
Substituting this result into Eq. (B.1), the invariant amplitude squared becomes
Here we write down the formulas for the collisional integral in the Boltzmann
approximation for completeness. For the derivation, see Ref. [29].
3 3 3
d y2 d y3 d y4 −y3 −y4 (4)
e δ (y1 + y2 − y3 − y4 ) = 4π 2 e−y1 ,
y2 y3 y4
3 3 3 3
d y1 d y2 d y3 d y4 −y3 −y4 (4)
e δ (y1 + y2 − y3 − y4 ) = 32π 3 . (C.1)
y1 y2 y3 y4
References
[25] A.D. Dolgov, Sov. J. Nucl. Phys. 33 (1981) 700, Yad. Fiz. 33 (1981) 1309 (in Russian);
M.A. Rudzsky, Astrophys. Space Sci. 165 (1990) 65;
B.H.J. McKellar, M.J. Thomson, Phys. Rev. D 49 (1994) 2710.
[26] A.D. Dolgov, F.L. Villante, Nucl. Phys. B 679 (2004) 261.
[27] G.L. Fogli, E. Lisi, A. Marrone, D. Montanino, A. Palazzo, A.M. Rotunno, eConf C030626, 2003, THAT05,
hep-ph/0310012.
[28] A.Y. Smirnov, hep-ph/0311259.
[29] A.D. Dolgov, S.H. Hansen, D.V. Semikoz, Nucl. Phys. B 503 (1997) 426.
Nuclear Physics B 688 [FS] (2004) 217–265
www.elsevier.com/locate/npe
Abstract
The influence of the noise on the long-time ageing dynamics of a quenched ferromagnetic spin
system with a non-conserved order parameter and described through a Langevin equation with a
thermal noise term and a disordered initial state is studied. If the noiseless part of the system is
Galilei-invariant and scale-invariant with dynamical exponent z = 2, the two-time linear response
function is independent of the noise and therefore has exactly the form predicted from the local
scale-invariance of the noiseless part. The two-time correlation function is exactly given in terms of
certain noiseless three- and four-point response functions. An explicit scaling form of the two-time
autocorrelation function follows. For disordered initial states, local scale-invariance is sufficient for
the equality of the autocorrelation and autoresponse exponents in phase-ordering kinetics. The results
for the scaling functions are confirmed through tests in the kinetic spherical model, the spin-wave
approximation of the XY model, the critical voter model and the free random walk.
2004 Elsevier B.V. All rights reserved.
1. Introduction
The study of ageing phenomena as they are known to occur in glassy and non-glassy
systems presents one of the great challenges in current research into strongly coupled
many-body systems far from thermal equilibrium. A common example of this kind of
system is obtained as follows. Consider a magnet at a high-temperature initial state before
0550-3213/$ – see front matter 2004 Elsevier B.V. All rights reserved.
doi:10.1016/j.nuclphysb.2004.03.028
218 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
2 This extension of dynamical scaling has an analogue in critical equilibrium systems: there global scale-
invariance can be extended to conformal invariance, see [17,24,44] for introductions.
3 In order to avoid misunderstandings, we recall that (1.7) holds for the total response function as defined in
(1.1) without any subtractions meant to extract an ‘ageing part’.
220 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
in [46,50]. The status of the scaling function fC (y) of the spin–spin correlator is less
clear, however. Building on the Ohta–Jasnow–Kawasaki approximation (see [9]) Gaussian
closure procedures [8,83] in the O(n)-model produce approximate forms for fC (y)
but we do not know of any other approach which does not involve some uncontrolled
approximation.
Given the phenomenological success of (1.7), we wish to understand better where such a
supposedly general and exact result derived from a dynamical symmetry and without using
any model-specific properties might come from. In this paper, we shall concentrate on the
important special case z = 2 which describes the phase-ordering kinetics after a quench to a
temperature T < Tc of a ferromagnet with a non-conserved order parameter [85]. We recall
that local scale-transformations are dynamical symmetries of certain differential equations,
such as the free diffusion/free Schrödinger equation for z = 2. Indeed, the maximal
dynamical symmetry of the free diffusion equation ∂t φ = D φ is known since a long time
to be the so-called Schrödinger group [41,63,70], to be defined in Section 2. Also, it is well
established that the same group also describes the dynamic symmetry of non-relativistic
free-field theory [42,57]. It also arises as dynamical symmetry in certain non-linear
Schrödinger equations [31,35,62,81,82], the Burgers equation [56,60] or the equations
of fluid dynamics [74]. If, in addition, D is also considered as a variable, Schrödinger
invariance in d dimension becomes a conformal invariance in d + 2 dimensions [48].
The classification of non-linear equations and of systems of equations admitting as a
dynamical symmetry the Schrödinger group or one of its subgroups (e.g., the Galilei group)
has received a lot of mathematical attention, see [18,30,32–34,72,82]. The extension to
dynamical exponents z = 2 needed for quenches to criticality or for glassy systems will be
left for future work.
However, the setting just outlined is not yet sufficient for the description of ageing
phenomena. Rather, we are interested in the time-dependent behaviour of spin systems
coupled to a heat bath at temperature T . It is usually admitted that after coarse-graining,
this may be modeled in terms of a Langevin equation. If there are no macroscopic
conservation laws, the Langevin equation for the coarse-grained order parameter φ =
φ(t, r) should be model A in the Hohenberg–Halperin classification [53]
∂φ(t, r) δH
= −D + η(t, r) (1.8)
∂t δφ
where H is the classical Hamiltonian, and D stands for the diffusion constant or
equivalently some relaxation rate. Thermal noise is described by a Gaussian random force
η = η(t, r) and is thus characterized by its first two moments
η(t, r) = 0, η(t, r)η(s, r
) = 2DT δ(t − s) δ(r − r
), (1.9)
where T is the bath temperature. It is well known [9,53] that this formalism describes the
relaxation of the system towards its equilibrium state given by the probability distribution
Peq ∼ e−H/T . In addition, initial conditions must be taken into account and are described
in terms of the initial correlation function
a(r − r
) := C(0, 0; r, r
) := φ(0, r)φ(0, r
) (1.10)
and where we already anticipated spatial translation invariance.
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 221
Neither the thermal noise nor the initial correlations described by a(r) are included
into the local scale-transformations as studied in [46] which come from systems such as
the free diffusion equation. In this paper, we want to show how both these sources of
fluctuations may be taken into account and we shall explicitly derive the two-time response
and correlation functions. Our analysis will be restricted to the case where z = 2 which,
for instance, is already enough to describe ageing below criticality.
As we shall show, it is useful to slightly generalize the problem and to consider the
kinetics of systems which in the simplest case may be described by a quadratic Hamiltonian
of the form
2
1 ∂φ
H[φ] = dt dr + v(t)φ ,
2
(1.11)
2 ∂r
where v(t) is a time-dependent external potential. Formally, at the level of relativistic free-
field theory, v(t) corresponds to a (time-dependent) mass squared which would measure the
distance from a critical point. Alternatively, v(t) may be viewed as a Lagrange multiplier in
order to ensure the constraint φ(t, r)φ(t, r) = 1 and we shall make this explicit through
the example for the kinetic spherical model in Section 5. In a physically more appealing
way, time-dependent potentials arise when a many-body system is brought into contact
with a heat bath whose temperature T (t) is time-dependent [78,79]. In this paper, we shall
be interested in the dynamics symmetries of Langevin equations derived from a free-field
Hamiltonian (1.11). In particular, we shall compare the situation without (i.e., T = 0) and
with thermal noise (i.e., T > 0). For simplicity, we shall refer to all Eqs. (1.8) obtained
from the Hamiltonian (1.11) as ‘free Schrödinger equations’.
The study of the dynamic symmetries of such free-field theories will yield useful
insights which we expect to extend to physically more realistic interacting field theories
where H would also contain higher than merely quadratic terms. If we identify D −1 = 2im,
it is clear that the order parameter φ is given by a noisy Schrödinger equation in an external
time-dependent potential v(t).
This paper is organized as follows. In Section 2, we first review the basics of
Schrödinger invariance in the absence of thermal noise and without initial correlations and
then show that through a gauge-transformation involving v(t), the entire phenomenology
of ageing and in particular (1.7) can be reproduced. We also consider the selection rules
which follow from Galilei invariance. In Section 3, after having reformulated the problem
in terms of the field-theoretic Martin–Siggia–Rose formalism, we study the effects of
thermal noise and/or initial correlations on free-field theory given by a Hamiltonian (1.11).
In Section 4, these results are extended to any field theory with for T = 0 and a(r) = 0
is Galilei-invariant. We find that the two-time response function R is independent of
both T and a(r) and obtain a new reduction formula (4.9) which relates C to certain
three- and four-point response functions to be evaluated in the noiseless theory and
discuss the scaling of the resulting two-time autocorrelation function. In Sections 5–7,
these results are tested in several exactly solvable systems (with an underlying free-field
theory) undergoing ageing with z = 2, namely, the kinetic spherical model, the XY-model
in spin-wave approximation, the critical voter model and the free random walk. We
conclude in Section 8. Appendix A deals with technical aspects of Gaussian integration and
Appendix B analyses a special four-point response function. In Appendix C we consider a
222 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
Since the kinematic symmetries of the free Schrödinger equation are well understood and
the realization of the Schrödinger group is explicitly known, the corresponding realization
for the case at hand, similar to (2.2), readily follows.
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 223
(0)
It turns out that the only change occurs in the companion function fg . Let fg stand
for the companion function of the free Schrödinger equation, then because of the gauge
transformation Eq. (2.4) we find
t
fg (t, r) = fg(0) (t
, r) exp −D du v(u) , (2.5)
t
where t
= t
(t) has been defined in Eq. (2.1). The generators of the Lie algebra sch1 of
this realization of Sch(1), appropriate for Eq. (2.3) with the potential v(t), read
v(t)
X−1 = −∂t + time drift,
2M
1 x v(t)
X0 = −t∂t − r∂r − + t dilatation,
2 2 2M
v(t) M 2
X1 = −t 2 ∂t − tr∂r − xt + t 2 − r special Schrödinger transformation,
2M 2
Y−1/2 = −∂r space translation,
Y1/2 = −t∂r − Mr Galilei transformation,
M0 = −M phase shift, (2.6)
where we expressed the diffusion constant D −1 = 2M as the ‘mass’ M and x denotes the
scaling dimension of the wave function φ(t, r). Of course, x = d/2 for a solution of the
free Schrödinger equation, but it will be useful to consider arbitrary values of x as well.
The non-vanishing commutators of the Lie algebra sch1 spanned by the generators (2.6)
are
n
[Xn , Xn
] = (n − n
)Xn+n
, [Xn , Ym ] = − m Yn+m ,
2
[Y1/2 , Y−1/2 ] = M0 , (2.7)
where n, n
∈ {±1, 0}, m ∈ {± 12 } and with straightforward extensions to d > 1, see [46].
δφ(t, r)
R(t, s; r, r
) :=
= φ(t, r)φ̃(s, r
) ,
δh(s, r ) h=0
C̃(t, s; r, r ) := φ̃(t, r)φ̃(s, r
) ,
(2.8)
where h is the magnetic field conjugate to the order parameter φ. Later, we shall often refer
to t as the observation time and to s as the waiting time.
Generalizing the above definition to v(t) = 0, we say that a field φ is quasiprimary
if its infinitesimal change under sch1 is given by the generators (2.6) with v(t) = 0.
A quasiprimary field φ is characterized by its scaling dimension x and its ‘mass’ M 0. In
turn, if the response field φ̃ associated to φ is also quasiprimary, it has a scaling dimension
denoted by x̃ and the ‘mass’
M̃ = −M 0. (2.9)
This important fact will be used later on. The argument leading to the result (2.9) was
discussed in detail in [48] and will not be repeated here.
If both φ and φ̃ transform as quasiprimary fields of a Schrödinger-invariant theory,
the generators (2.6) can be used to derive restrictions on the form of any multipoint
correlator and in particular determine the two-point functions completely. If Xi is any
of the generators of schd acting on the ith particle in a n-point correlator A{ti , r i } where
i = 1, . . . , n (see (2.8) for n = 2), we have a set of differential equations
C̃0 (t, s; r, r
) = 0, C0 (t, s; r, r
) = 0, (2.11)
whereas the response function is basically the gauge-transformed expression of the well-
known zero-potential Gaussian response R, i.e.,
k(t)
R0 (t, s; r, r
) = R(t, s; r, r
),
k(s)
−x M (r − r
)2
R(t, s; r, r ) = δx,x̃ r0 Θ(t − s)(t − s) exp − ,
2 (t − s)
t
1
k(t) := exp − du v(u) , (2.12)
2M
where r0 is a normalization constant and Θ is the Heaviside function which expresses
causality. As they stand, Eqs. (2.11), (2.12) hold for T = 0 and we shall from now on use
the index 0 to remind the reader of this fact.
On the other hand, if the system is not rotation-invariant, we can repeat the same
argument in any fixed direction of space and the non-universal constant M becomes
direction-dependent. Indeed, rotation invariance is broken for phase-ordering systems
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 225
defined on a lattice [86,87] for sufficiently small temperatures. Even then, local scale-
invariance still holds in every single space direction, as exemplified in the 2D and 3D Ising
models with Glauber dynamics [49].
A few comments are in order.
1. Eqs. (2.11), (2.12) provide a manifest example of the superselection rule of Galilei
invariance, also known as Bargman superselection rule [2]. Explicitly, if Φi (ti , r i ) are
Galilei-covariant fields, each with a ‘mass’ Mi , Galilei covariance implies [2,43]
Φ1 (t1 , r 1 ) · · · Φn (tn , r n ) = δM1 +···+Mn ,0 F {ti , r i } (2.13)
1 + a − λR /2
x = x̃ = 1 + a, v(t) = (2M) . (2.14)
t
4 In the systematic terminology of [17], this should be rather called a response operator, because φ̃ will
become an operator in a canonical quantization scheme of the action. The notion of a response field should have
been reserved to the canonically conjugate variable of φ̃. Since we shall not use the operator formalism here, we
shall simply, but sloppily, talk about φ̃ as a response field.
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 227
• Thermal fluctuations and initial conditions are explicitly included. To emphasize this,
notice that the noisy contributions to the MSR action are
S[φ, φ̃] − S0 [φ, φ̃] = − du dr du
dr
φ̃(u, r)κ(u, u
; r − r
)φ̃(u
, r
),
1
κ(u, u
; r) = T δ(u − u
)δ(r) + δ(u)δ(u
)a(r), (3.7)
2
where κ includes both the effects of thermal and of initial-state fluctuations.
• The response field φ̃ describes the thermal noise as can be seen from the equations of
motion derived from the free-field MSR action (3.5)
∂φ(t, r)
= φ(t, r) − v(t)φ(t, r) + 2T φ̃(t, r),
∂t
∂ φ̃(t, r)
− = φ̃(t, r) − v(t)φ̃(t, r). (3.8)
∂t
The first equation (3.8) reduces to the Langevin equation (1.8) provided one makes the
formal identification
It will be useful to split the calculation of averages into two steps as follows:
F φ(ti , r i ), φ̃(tj , r j ) = F φ(ti , r i ), φ̃(tj , r j ) exp −S[φ, φ̃] − σ [φ, φ̃] 0 ,
F φ(ti , r i ), φ̃(tj , r j ) 0
= Dφ Dφ̃ exp −S0 [φ, φ̃] F φ(ti , r i ), φ̃(tj , r j ) , (3.11)
where the notation 0 (and more generally the index 0) refers from now on to averages of
the non-fluctuating theory. This allows to make use of the Schrödinger invariance of the
noiseless theory.
In this section, we find both response and correlation functions for free-field Martin–
Siggia–Rose theory, as given by Eqs. (3.2), (3.3), (3.5).
Q(u, u
; r, r
)
1 0 δ(u − u
)δ(r − r
)(− − ∂u ) + 12 v(u)
= .
2 δ(u − u
)δ(r − r
)(− + ∂u ) + 12 v(u) −2κ(u, u
; r − r
)
(3.13)
This peculiar form of the Lagrangian density is quite suggestive as regards the Galilei
invariance. When κ = 0, that is in absence of noise, the quadratic form Q is antidiagonal.
This is one way of presenting the Bargman superselection rules and leads, in particular, to
φφ = φ̃ φ̃ = 0 which is a manifestation of Galilei invariance of the noiseless system.
The presence of noise just breaks this symmetry.
In order to study systematically the role of the noise, we shall expand around the non-
fluctuating theory. The correlation functions C0 , C̃0 and the linear response function R0
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 229
are
C0 (t, s; r, r
) = 0, C̃0 (t, s; r, r
) = 0,
k(t) −d/2 1 (r − r
)2
R0 (t, s; r, r
) = Θ(t − s) 4π(t − s) exp − . (3.14)
k(s) 4 (t − s)
This follows since the quadratic form Q is antidiagonal the field φ can only be coupled
to φ̃ and the result is just the bare propagator of the theory. It is clear that only the
antidiagonality of Q is important to derive this result and the explicit free-field form (3.12)
is not required for (3.14) to hold.
These results fully agree with the Schrödinger invariance prediction Eqs. (2.12), (2.11)
with the identifications x = x̃ = d/2, M = 1/2 and r0 = (4π)−d/2 .
A further manifestation of the Galilei invariance of the noiseless theory is the fact that
φ · · · φ φ̃ · · · φ̃ 0 = 0 (3.15)
n m
unless n = m as is easily checked. This is a further example of the Bargman superselection
rule (2.13) and will be important in what follows.
R0 (t, s, u, u
; r, r , R, R
) := φ(t, r)φ(s, r
)φ̃(u, R)φ̃(u
, R
) 0 ,
(4)
(3.19)
(4)
where R0 is a noiseless four-point function.
Repeating the same arguments as before, it is an easy task to compute the two-point
correlations of response fields. They are
C̃(t, s; r, r
) = 0. (3.20)
We emphasize that Eqs. (3.16), (3.19), (3.20) will hold for any theory satisfying the
Bargman’s superselection rule (3.15). We shall come back to this in Section 4.
The four-point function R0(4) is simple to access for free fields since it factorizes into a
product of two-point functions because of Wick’s theorem. We have
R0 (t, s, u, u
; r, r , R, R
)
(4)
Together with the explicit form of κ, this yields the final result
C(t, s; r, r
) = Cth (t, s; r, r
) + Cpr (t, s; r, r
),
Cth (t, s; r, r ) = 2T du dy R0 (t, u; r, y)R0 (s, u; r
, y),
Cpr (t, s; r, r
) = dy dy
R0 (t, 0; r, y)a(y − y
)R0 (s, 0; r
, y
), (3.22)
where we separated C in a thermal term Cth and an initial term Cpr . We clearly see that
while the only contributions to C come from the noise, R does not depend on it.
We summarize the results obtained so far as follows.
1. It is satisfying that the well-known result C̃ = φ̃ φ̃ = 0 is naturally reproduced, see
(3.20).
2. The independence equation (3.17) of the two-time response function of T and of the
initial correlations goes beyond the usual scaling arguments as reviewed in Section 1.
This explains to some extent the success of the existing confirmations of that prediction
of local scale-invariance.
3. We arrive at an explicit expression for the two-time correlators, which are obtained in
terms of a contraction of two response functions. We also see that the earlier result
C = 0 comes from neglecting both initial and thermal fluctuations.
It is useful to present these results also in momentum space. Using spatial translation
invariance, define the Fourier transform of any two-point function A(t, s; x, y) =
A(t, s; x − y) as
Â(t, s; q) := dr A(t, s; r) exp(−iq · r). (3.23)
Rd
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 231
quench T = Tc , the situation is different in that both initial and thermal fluctuation may
contribute to the long-time dynamics. From Eq. (3.27) we expect that the small-q behaviour
of the term a(q) − Tc /q 2 will determine the long-time dynamics.
We now generalize the results of the previous section to any theory whose noiseless
MSR action is Schrödinger-invariant. As we shall stay within the context of classical field
theory, the symmetries of the MSR action are the same as for the corresponding Langevin
equation (1.8). The formulation of the MSR action, using a non-conserved order parameter
described by model A dynamics [53] is almost unchanged with respect to Section 3. We
have
φ
(t
, r
) = fv (t, r)φ(t, r), φ̃
(t
, r
) = fv−1 (t, r)φ̃(t, r), (4.3)
where the companion function fv reads [70]
M 2
fv (t, r) = exp Mr · v − v t . (4.4)
2
The noisy contributions to the action transform as
S[φ
, φ̃
] − S[φ, φ̃] = −T dt dr φ̃ 2 (t, r) fv−2 (t, r) − 1 ,
1
σ [φ , φ̃ ] − σ [φ, φ̃] = − dr dr
a(r − r
)φ̃(0, r)φ̃(0, r
)
2
× fv−1 (0, r)fv−1 (0, r
) − 1 . (4.5)
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 233
Therefore, for fixed temperature and initial conditions, the noise always destroys Galilei
invariance.5 Thus, the only dynamic symmetries of the noisy Langevin equation should be
space translations and dilatations (and possibly space rotations).
Consequently, one would merely expect the following scaling forms
t (r − r
)2
C(t, s, r, r
) = s −b GC , ,
s t −s
t (r − r
)2
R(t, s; r, r
) = s −a−1 GR , , (4.6)
s t −s
where GC,R are undetermined scaling functions. For free-field theories, one would have
a = b = d/2 − 1 by dimensional counting.
The form of the scaling functions GR,C will now be determined through a generalization
of the expansions carried out in Section 3. The main results will be Eqs. (4.8) and (4.12),
(4.16), respectively.
R(t, s; r, r
) = R0 (t, s; r, r
). (4.7)
The expression of R0 has been derived earlier and using the gauge transform (2.4) we
recover exactly the same result as in Eq. (2.12), namely,
k(t) M (r − r
)2
R(t, s; r, r
) = δx,x̃ r0 Θ(t − s)(t − s)−x exp − . (4.8)
k(s) 2 (t − s)
We stress that, given only Galilei and scale-invariance of the noiseless theory, this result
should hold for any initial and final temperature Ti and Tf . At this stage, nothing has yet
been said on the time-dependence of v(t).
5 The breaking of Galilei invariance through thermal noise can be visualized as follows: consider a system in
contact with a thermal bath at constant and uniform temperature T > 0. If the system moves with respect to the
bath with a constant speed v, the apparent temperature measured in the system will depend on the angle between
the direction of measurement and v.
234 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
Combining (4.12) and (4.15) we obtain, for the first time, a generic prediction for the form
of the two-point correlation function. It is remarkable that under the condition (4.11) only
the noiseless three-point response functions are required in order to predict any two-time
autocorrelator.
It is useful to write down the autocorrelation function in the form C(t, s) = Cth (t, s) +
Cpr (t, s). Here the thermal part Cth and the preparation part Cpr are given by
x̃2 −x−d/2
t
Cth (t, s) = T s d/2+1−x−x̃2 −1
s
1
d/2−x̃2
k(t)k(s) t t/s + 1 − 2θ
× dθ 2 − θ (1 − θ ) Φ ,
k (sθ ) s t/s − 1
0
x̃2 −x−d/2
a0 k(t)k(s) d/2−x̃2−x t d/2−x̃ t t/s + 1
Cpr (t, s) = s −1 Φ ,
2 k 2 (0) s s t/s − 1
Mw 2 2
Φ(w) := dR exp − R Ψ R , (4.16)
2
and we have explicitly used s < t.
As they stand, the above expressions for C(t, s) do not yet necessarily describe a
dynamical scaling behaviour, since the form of k(t) is still completely general. We now
assume, in addition, that we are dealing with a system with dynamical scaling. In order
to reproduce the usual phenomenology of ageing systems, we must have, at least for
sufficiently large times (see Section 2)
k(t) k0 t . (4.17)
Comparing the general form of R(t, s) as given in Eq. (4.8) with the phenomenologically
expected scaling Eqs. (1.2), (1.3), we read off
λR
x = 1 + a, =1+a − , (4.18)
2
and in particular, the scaling function (1.7) is recovered. On the other hand, for C(t, s)
we find the following scaling form, written down separately for the thermal and the initial
term, where y = t/s 1 is the scaling variable
1
d/2−x̃2 y + 1 − 2θ
fCth (y) = y (y − 1) x̃2 −x−d/2
dθ θ −2
(y − θ )(1 − θ ) Φ ,
y −1
0
pr a0 d/2−x̃2+ x̃2 −x−d/2 y +1
fC (y) = y (y − 1) Φ , (4.20)
2 y −1
236 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
where Ψ0 is a constant. If this leading term should be still accurate for larger values of ρ,
the following expression for the scaling function Φ(w) is found
(λC ) 2 λC −λC
Φ(w) ≈ Ψ0 Sd w =: Φ0 w−λC , (4.27)
2 M
where Sd is the surface of the unit sphere in d dimensions. Eq. (4.27) should hold in the
w → ∞ limit. Provided this from is still valid for all w, we would obtain the following
simplified form, with y = t/s
2 −λC /2
a0 Φ0 (y + 1)2 −λC /2 2 (y + 1)
C(t, s) ≈ = Meq , T = 0, (4.28)
2 y 4y
2 ,
where we also related the leading constant to the squared equilibrium magnetization Meq
2 f (t/s) with f (1) = 1, see [39].
in order to recover the usual scaling form C(t, s) = Meq C C
which is the most general form compatible with the standard phenomenological con-
straints. An approximate form of the scaling function Φ(w) may be obtained from the
(3)
requirement that the three-point response function R0 (t, s, u; r) is non-singular for t = s.
We now have x̃2 = d/2 and find Ψ (ρ) Ψ0,c ρ x−d/2 as ρ → 0. This leads to, for w → ∞
where Φ0,c is a constant. If, in addition, we may use this form also for finite values of w,
we would obtain the simplified form
1
fC (y) ≈ Φ0,c y 1+a−λC /2
dθ θ λC −2−2a (y + 1 − 2θ )−1−a , T = Tc . (4.34)
0
In this and the next two sections we describe phenomenological tests of the predictions
of Schrödinger invariance, that is local scale-invariance with z = 2, which were derived in
Section 4 in concrete physical models of ageing behaviour.
6 A careful study shows that provided the limit N → ∞ is taken before any long-time limit, the mean spherical
constraint (5.1) and a full, non-averaged, spherical constraint lead to the same results [29].
240 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
where in the second line the limit t − s 1 was taken and Ir is a modified Bessel function.
In particular, this reproduces the known autoresponse function [38]
!
R(t, s) = R(t, s; r, r) = f (t − s)/2 g(s)/g(t). (5.8)
This is in exact agreement with Eq. (3.17) and we identify the mass M = 1/2.
We now turn to the correlator but the sake of brevity merely deal with the autocorrelator
explicitly. In the spherical model one has (with t > s) [38,75,77]
C(t, s) = φ(t, r)φ(s, r)
s
1 t +s t +s
=√ A + 2T du f − u g(u) . (5.9)
g(t)g(s) 2 2
0
In order to show how to recover this explicitly from Eq. (3.22), we write again C(t, s) =:
Cpr + Cth and discuss the two terms separately. The first one is, where we use the explicit
form (5.7) of R = R0
g(0) g(0) (r − y)2 (r − y
)2
Cpr = dy dy
(4πt 4πs)−d/2 exp − −
g(t) g(s) 4t 4s
R2d
(2π)−d/2 2 t −q
2 s+i(q+q
)r
=√ dq dq
e−q Jd , (5.10)
g(t)g(s)
R2d
where
Jd := dy dy
a(y − y
)e−iq·y−iq ·y , (5.11)
R2d
Jd = 2−d dζ dζ
a(ζ )e−i 2 ·(q+q ) e−i 2 ·(q−q )
R2d
= (2π)d δ(q + q
) dζ a(ζ )e−iq·ζ . (5.12)
Rd
We finally obtain
1 1
dq e−q
2 (t +s)
Cpr = √ dζ a(ζ )e−iq·ζ . (5.13)
g(t)g(s) (2π)d
Rd Rd
When the support of a(ζ ) is restricted to the hypercubic lattice, we therefore have, indeed,
for long times, t + s 1
A((t + s)/2)
Cpr √ , (5.14)
g(t)g(s)
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 241
in agreement with the first term in (5.9). The second term in (3.22) is analyzed as follows,
for t > s
∞
−d/2 g(u) g(u)
Cth = 2T du dy 4π(t − u)4π(s − u)
g(t) g(s)
0 Rd
(r − y)2 (r − y)2
× exp − − Θ(t − u)Θ(s − u)
4(t − u) 4(s − u)
s
2
2
g(u)
dq dq
dy e−q t −q s+(q +q )u+i(q+q )·r e−i(q+q )·y √
2 2
= 2T du
g(t)g(s)
0 R3d
s
2T −d
dq e−2q
2 ((t +s)/2−u)
=√ du g(u)(2π)
g(t)g(s)
0 Rd
s
2T t +s
√ du g(u)f −u , (5.15)
g(t)g(s) 2
0
where the last line holds for sufficiently large arguments of the function f . Taken together
with Cpr , the expected agreement between the general result Eq. (3.22) and the exact
expression (5.9) for the spherical model is thus recovered. The full space–time correlator
can be checked similarly.
It is interesting to note that for the confirmation of R, see (5.7), we need the condition
t − s 1, while for the confirmation of C, we also need s 1 (which implies t + s 1).
So far, we have not yet used the explicit form of g(t) which follows from Eq. (5.5).
Indeed, it is well known that for long times, one has [38,46,77]
Another system which can be exactly analysed is the kinetic XY model in spin-
wave approximation. As we shall see, it provides an instructive example on the correct
identification of the quasi-primary scaling fields in a given model.
where the coupling constant J has been set to unity and the sum runs over nearest
neighbours. The relaxational dynamics is assumed to be described by a Langevin equation.
We prepare the system initially a temperature Ti and quench it at time t = 0 to the final
temperature T = Tf , so that the angular variable obeys [9]
∂φ(t, r) δH[φ]
=− + η(t, r), (5.21)
∂t δφ(t, r)
where η represents an uncorrelated Gaussian noise with zero mean and variance
η(t, r)η(t
, r
) = 2Tf δ(t − t
)δ(r − r
). (5.22)
Here, we shall exclusively study the coarsening dynamics in the low-temperature
regime, that is
Ti , Tf Tc (d), (5.23)
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 243
2πη(Ti ) Ti
â(q) = Ĉ(0, 0; q) = = 2, (5.27)
q2 q
where η(Ti ) is the standard equilibrium critical exponent and the relation 2πη(Ti ) = Ti
valid in the spin-wave approximation was used, see [4,61,84].
into a bulk term Σ[φ, φ̃, h] and an initial term σ [φ, φ̃]. These two terms include the
thermal and the initial noise. Explicitly
∂φ
Σ[φ, φ̃, h] = dt dr φ̃ − φ + sin(φ)h1 − cos(φ)h2
∂t
− T dt dr φ̃(t, r)φ̃(t, r) (5.29)
and
1
σ [φ, φ̃] = − dr dr
φ̃(0, r)a(r − r
)φ̃(0, r
), (5.30)
2
where the function a(r) describes the initial conditions according to Eq. (5.27).
We now simplify the general expressions for the two-point functions. There is nothing
to do for the angular correlation function C(t, s; r, r
) = φ(t, r)φ(s, r
) and we start with
the magnetic correlation function Γ which is given by
Γ (t, s; r, r
) = Dφ Dφ̃ cos φ(t, r) − φ(s, r
) exp −S[φ, φ̃] . (5.31)
For a vanishing magnetic field the bulk action Σ[φ, φ̃, 0] is a quadratic form in the fields
φ, φ̃ which are therefore Gaussian. Standard techniques explained in Appendix A lead to
1
2
Γ (t, s; r, r ) = exp − φ(t, r) − φ(s, r ) (5.32)
2
C(t, t; r, r) + C(s, s; r
, r
)
= exp C(t, s; r, r
) − , (5.33)
2
where in the second line the argument of the exponential was expanded. This gives Γ in
terms of angular correlators C.
Next, we consider the response functions. For the angular response R, we quote from
the MSR formalism the standard result
R(t, s; r, r
) = φ(t, r)φ̃(s, r
) . (5.34)
It remains to consider the response of the spin vector S at time t and position r to some
magnetic field h(s, r
) at time s and position r
. From the definition (5.25) we have
ρ(t, s; r, r
)
cos φ(t, r) − cos φ(t, r) sin φ(t, r) − sin φ(t, r)
= lim + , (5.35)
h→0 h1 (s, r
) h2 (s, r
)
where the average · is to be taken with a magnetic field. Expanding the action (5.28) to
first order in both components h1 , h2 of the magnetic field, we find
cos φ(t, r) = cos φ(t, r) + h1 (s, r
) cos φ(t, r) sin φ(s, r
)φ̃(s, r
) ,
sin φ(t, r) = sin φ(t, r) − h2 (s, r
) sin φ(t, r) cos φ(s, r
)φ̃(s, r
) . (5.36)
It follows that the response function can be expressed as
ρ(t, s; r, r
) = φ̃(s, r
) sin φ(t, r) − φ(s, r
) . (5.37)
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 245
Since both fields φ, φ̃ are Gaussian, it can further be shown that (see Appendix A)
ρ(t, s; r, r
) = R(t, s; r, r
)Γ (t, s; r, r
), (5.38)
and we see explicitly that the relationship between R and ρ is non-trivial. That no higher
correlators than the magnetic two-point correlation function Γ enter is a consequence of
the Gaussian nature of the theory at hand.
Eqs. (5.33), (5.34), (5.38) are the main results of this subsection. Before we can evaluate
them, we need some information on the validity of the spin-wave approximation.
where Ĝ is given by
1
Ĝ(u; q) := 2
exp −q 2 (u + Λ2 ) , (5.40)
q
and we have explicitly introduced an UV-cutoff which simulates the lattice spacing (we
shall let Λ → 0 at the end). Therefore, a two-point correlation function φφ is of order
O(Ti , Tf ) whereas a response function φ φ̃ is of order O(1) in the initial and final
temperatures.
In order to discuss further the validity of the spin-wave approximation, we keep the
next-order term as well and consider the Hamiltonian
1 2 4
H[φ] = H0 + dr ∇φ(r) + g4 ∇φ(r) , (5.41)
2
where g4 is some constant. A straightforward calculation shows that to first order in g4 , the
correction to the spin-wave approximation of the two-point correlation function is given
by
3
δC(t, s; r, r ) = du dR φ(t, r)φ(s, r
)φ̃(u, R)∇ ∇φ(u, R)
2 2
Ti Tf
O , , (5.42)
Tc (d) Tc (d)
where the six-point function is factorized into two-point function by Wick’s theorem. As
a result, the spin-wave approximation is a first-order approximation in the initial and
final temperatures. Consistent expressions of two-point functions must be expanded to first
246 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
1. The 2D XY model was studied in detail in the spin-wave approximation before [4,84]
and we now show that their results, although they might at first sight appear to be
different, agree with Eqs. (5.43), (5.46), (5.47). For notational simplicity, we restrict
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 247
η(T )/4
Λ4 (t + s + Λ2 )2
Γ (t, s) = S(t, r) · S(s, r) =
(2t + Λ2 )(2s + Λ2 )(t − s + Λ2 )2
η(T ) Λ4 (t + s + Λ2 )2
= exp ln
4 (2t + Λ2 )(2s + Λ2 )(t − s + Λ2 )2
T 1
1+ ln t + s + Λ2 − ln t − s + Λ2 − ln 2t + Λ2
4π 2
1 2 (5.48)
− ln 2s + Λ + ln Λ + O T ,
2 2
2
δS(t, r)
ρ(t, s) = lim
h→0 δh(s, r)
η(T )/4
1 Λ4 (t + s + Λ2 )2
=
4π(t − s + Λ2 ) (2t + Λ2 )(2s + Λ2 )(t − s + Λ2 )2
1
= Γ (t, s), (5.49)
4π(t − s + Λ2 )
Table 1
Ageing exponents of the d-dimensional XY model in the spin-wave approximation. Here ηi,f = η(Ti,f ) =
Ti,f /(2π ) describe the initial and final correlation exponents
Angular correlation and response
a b λC λR
d =2 0 0 0 0
d >2 d/2 − 1 d/2 − 1 d d
Magnetic correlation and response
a
b
λ
C λ
R
d =2 ηf /2 ηf /2 (ηi + ηf )/2 2 + (ηi + ηf )/2
d >2 d/2 − 1 d/2 − 1 d d
4. Having discussed the values of the ageing exponents, we wish to compare the form
of the scaling functions with the predictions of local scale-invariance as derived
in Section 4. This requires, however, the correct identification of the quasiprimary
fields in our system, see Section 2. It is only the quasiprimary fields which are
expected to transform in a simple way under a local scale-transformation and the
transformation laws of more complicated fields built from quasiprimary fields must
derived accordingly.
In the case at hand, it is clear from the complicated structure of the magnetic
correlation and response functions that the magnetic order parameter S(t, r) does not
correspond to a quasiprimary field. Rather, the quasiprimary field should be identified
with the phase φ(t, r). Indeed, the form of the angular response R(t, s) is in perfect
agreement with the prediction (4.8), (4.17) of local scale-invariance. This suggests that
the response field φ̃(s, r) should be quasiprimary as well.
5. Having thus identified φ and φ̃ as quasiprimary fields of the model, it is now clear
that the angular autocorrelation function C(t, s) should be compared to the critical
dynamics correlation function as derived in Section 4. However, a direct comparison
with Eq. (4.34) is not possible, since in its derivation fully disordered initial conditions
were assumed.
We shall therefore proceed in two steps. First, we shall consider the case Ti = 0.
Because of our initial conditions (5.27) the initial correlator then vanishes and
Eq. (4.34) should now hold true. Second, we shall show that in the context of the free-
field theory underlying the spin-wave approximation of the XY model, the restriction
to uncorrelated initial states can be lifted.
We now set Ti = 0 and Tf = T . From the exponents in Table 1, the predicted
autocorrelator scaling function follows from (4.34) as
1
fC (y) = Φ0 dθ (y + 1 − 2θ )−d/2, d 2, (5.50)
0
and we see immediately that this is in agreement with the explicit angular correlator
(5.46), upon identification of Φ0 .
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 249
Finally, if we also allow for Ti > 0, there is a contribution to C(t, s) from the initial
condition. We then return to the basic result (4.9) and decompose C(t, s) = Cth + Cpr .
The thermal term Cth was treated before and the preparation term is analysed in
Appendix B, with the result
2(4π)−d/2
Cpr = Ti (t + s)1−d/2 , d > 2, (5.51)
d −2
in complete agreement with (5.46). The case d = 2 is treated similarly.
Having checked that both correlation and response functions agree with the local scale-
invariance prediction, we now inquire what can be said on the approach of the model
towards equilibrium. A convenient way to study this is through the so-called fluctuation–
dissipation ratio [21], see Section 1. Since we have seen that in the XY model angular
and magnetic observables behave quite differently, it is convenient to define two distinct
fluctuation–dissipation ratios, namely,
∂Γ (t, s) −1
Ξ (t, s) := Tf ρ(t, s) ,
∂s
∂C(t, s) −1
X(t, s) := Tf R(t, s) . (5.52)
∂s
Of particular interest will be the limit fluctuation–dissipation ratios X∞ defined in Eq. (1.5)
and similarly Ξ∞ . We have seen before that the scaling of autocorrelation and autoresponse
functions is according to the expectations of nonequilibrium critical dynamics. In this case,
according to the Godrèche–Luck conjecture [38], X∞ and Ξ∞ should be universal.
We shall use the available exact results in the XY model to test this conjecture
by studying the dependence of X and Ξ on the ratio α := Ti /Tf of initial and final
temperatures.
confirms that Ξ should be well defined. For large values of y, that is for well separated
times, we have
−d/2
Tf y −1
Ξ (y) , (5.54)
Ti − Tf 2
and therefore Ξ∞ = 0, indeed, a universal constant. It is remarkable that the asymptotic
value of Ξ (y) should be independent of d and that is agrees with the value Ξ∞ = 0
of phase-ordering kinetics in the low-temperature phase with an ordered, non-critical
equilibrium state. This kind of result should be more typical of an ordered ferromagnetic
equilibrium state as it occurs for d > 2 but is not really expected for d = 2 since the
equilibrium 2D XY model is critical even below TKT .
Finally, for large y the asymptotic form of Ξ (y) is independent on whether the system
is cooled or heated. The temperatures merely enter into a scaling amplitude.
Here the configuration C (r) is obtained from C by inverting the single spin at site r. Finally,
the transition rates for a spin reversal σr → −σr are given by
1
d
1
Wr (C) = 1− (σr σr+ek + σr σr−ek ) , (6.2)
2 2d
k=1
where the ek , k = 1, . . . , d, form an orthonormalized basis of unit vectors on the d-
dimensional hypercubic lattice.
With respect to the kinetic spherical model and the XY model studied previously, the
voter model is different since in general it does not satisfy detailed balance and therefore
will not relax to an equilibrium state. By considering a general kinetic Ising model with
a dynamics respecting the global Z2 -symmetry, it can be shown that the transition rates
(6.2) correspond to the critical point of the so-called linear voter model [73,88]. The
non-equilibrium kinetics of the critical voter model (6.2) has been studied in detail by
Dornic [26]. In d = 1 dimensions the model coincides with the kinetic Glauber–Ising
model at zero temperature (which we shall revisit in Appendix C) but for d > 1 these
two models are different. In particular, it is known that the domain growth of the voter
model is not driven by the minimization of the surface tension between the two phases
[25,26] but which is the mechanism which drives ageing in simple ferromagnets [9,11].
We are interested here in the correlation functions Cr (t) = σr (t)σ0 (t) and Cr (t, s) =
σr (t)σ0 (s) which are easily seen to satisfy the following equations of motion [26,28]
∂ ∂
Cr (t) = 2 r Cr (t), Cr (t, s) = r Cr (t, s), (6.3)
∂t ∂t
subject to the following boundary conditions
found, the form of local scale-invariance as presented here is not applicable.8 We therefore
concentrate on the dimensions 2 < d < 4. From the asymptotics of C(t, s) in (6.5) we read
off the exponents
d
b= − 1, λC = d (6.6)
2
since the exponent z = 2 is known, see [26,64]. From Eq. (4.34) we expect
1
fC (y) = Φ0,c dθ (y + 1 − 2θ )−d/2
0
Φ0,c
= (y − 1)−d/2+1 − (y + 1)−d/2+1 , (6.7)
d −2
in full agreement with (6.5) and we identify the normalization constant Φ0,c = 2γd (d/
2π)d/2 . Indeed, we see that the form (4.33)—which in principle is only valid asymptoti-
cally—is in fact exact in the critical voter model. This is not surprising in view of the
underlying free-field theory.
In conclusion, for the critical voter model with two competing steady states ageing
occurs. The scaling form of the two-time autocorrelation function is in exact agreement
with the prediction of local scale-invariance. This is the first time such an agreement is
found for a system without detailed balance.
Last, but not least, we briefly consider the simplest example of a system undergoing
ageing: the free random walk [21]. The Langevin equation describing the time-evolution
of the order parameter φ reads
∂φ(t)
= h(t) + η(t), (7.1)
∂t
where a deterministic external field h(t) has been added in order to be able to compute
response functions. The Gaussian noise η is characterized as usual by its first two moments,
see Eq. (1.9). Here, we choose the notation such that the relationship to local scale-
invariance becomes evident.
The autocorrelation and linear autoresponse functions were already calculated by
Cugliandolo et al. [21]. They obtained, with the initial condition C(t, 0) = 0
δφ(t) 1
R(t, s) = = φ(t)η(s) = Θ(t − s),
δh(s) h=0 2T
C(t, s) = φ(t)φ(s) = 2T min(t, s). (7.2)
Clearly, the system undergoes ageing, since C(t, s) does not merely depend on t − s
and furthermore, it stays forever out of equilibrium, since the fluctuation–dissipation ratio
X(t, s) = 1/2 [21].
We wish to check that the results (7.2) are compatible with the predictions of local scale-
invariance. For the autoresponse function, the first equation of (7.2) is clearly compatible
with (4.7), (4.17), (4.18), with the exponents a = −1 and λR = 0. For the autocorrelation
function, we expect from (4.12)
δ 2 φ(t)φ(s)
C(t, s) = T du , (7.3)
δh(u)2 h=0
where in view of the initial condition C(t, 0) = 0 used in Eq. (7.2) we assumed a vanishing
initial correlator.
In order to calculate the above derivative, we solve the Langevin equation for a given
field h and obtain the autocorrelation function
s t
C t, s; [h] = C t, s; [0] + dv dw h(v)h(w), (7.4)
0 0
where C(t, s; [0]) = C(t, s) is of course given by the second Eq. (7.2). The required second
derivative of the autocorrelation function becomes in a field-theoretic formulation some
three-point correlator (see Eq. (4.10)) and it is now easy to see that
(3) δ 2 C(t, s; [h])
R0 (t, s, u) = = 2Θ(t − u)Θ(s − u). (7.5)
δh(u)2 h=0
Inserting this into (7.3), the desired result for C(t, s) in Eq. (7.2) is, indeed, recovered.
We read off the exponents b = −1 and λC = 0. Of course, the exponent equalities a = b
and λC = λR are a necessary requirement for having a non-vanishing limit fluctuation–
dissipation ratio X∞ = 1/2.
In conclusion, the evidence from the two-time autocorrelation and autoresponse
function of the free random walk is fully consistent with local scale-invariance.
We have carried out such a study, for the important special case where z = 2 and
the initial state is fully disordered. Considering the Langevin equation as the classical
equation of motion of a MSR-type field theory, we have calculated the two-time correlation
and linear response functions by studying how the dynamical symmetry properties of the
noiseless part of that field theory are reflected in these noisy averages. These averages can
be written in form of a perturbative expansion around the noiseless theory and we have
shown that only a finite number of terms in these series contributes. Specifically, we have
found:
1. The two-time linear response function R = φ φ̃ involving only quasiprimary fields
is independent of both the thermal and the initial noise. This explains why the form
(1.7) of scaling function fR (y) of the linear response—previously derived from the
symmetries of the noiseless theory—has been reproduced in many different systems
with T > 0 either exactly or with a considerable numerical precision [12,16,21,38,39,
45–47,49,58,69,77].
2. We obtain the reduction formula Eq. (4.9) which expresses the two-time correlation
function in terms of certain noiseless three- and four-point response functions. For
the uncorrelated initial conditions (4.11) only a single noiseless three-point response
function is needed.
3. The scaling forms of correlation and response functions are governed by the two non-
trivial exponents λC and λR which are, in general, distinct from each other. Given
the initial correlator (4.11), local scale-invariance with z = 2 provides a sufficient
condition for the exponent equality
λC = λR . (8.1)
(8.3)
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 255
At least, these forms are consistent with the required asymptotic behaviour of fC (y)
as y → ∞, see Section 1. For free-field theories (8.2) holds for all values of w and
then (8.3) becomes exact.
In the past, approximate expressions for the scaling of magnetic correlation functions
were derived from Gaussian closure procedures for kinetic O(n)-models undergoing
phase-ordering kinetics, see [9]. This gives for the magnetic autocorrelation function
at T = 0 [7,8,83]
d/4
n 1 n+1 2 4y
fC,BPT (y) ≈ B ,
2π 2 2 (y + 1)2
d/2
1 1 n+2 4y
× 2 F1 , ; ; , (8.4)
2 2 2 (y + 1)2
where B is Euler’s beta function and 2 F1 a hypergeometric function. It is easy to
see that the scaling function (8.3) with T = 0 is recovered from (8.4) in the n → ∞
limit. In the notation of Section 5.2, Eq. (8.4) implies an exponent λ
C = d/2. This is
a typical value for a free-field theory for T < Tc (which, indeed, describes the O(n)-
model in the n → ∞ limit) but which will, in general, not hold for n < ∞ and thus
(8.4) cannot be expected to represent well the behaviour for y = t/s large.9 One might
wonder whether the long-standing difficulties in arriving at scaling functions which
cover adequately the whole range of values of y should not be related at least partially
to having worked with dynamic variables which might turn out to be not the most basic
ones of the model?
6. We have tested these predictions on the exact solutions of the kinetic spherical model
and the XY model in the spin-wave approximation. In these cases, the exponent a is
given by (1.4). In order to compare the exact model results with the prediction (8.3),
it was necessary to carefully identify the quasiprimary fields of the models. For the
spherical model, the natural magnetic order parameter and its response field could be
used as quasiprimary fields. On the other hand, for the XY model it turned out that the
coarse-grained magnetic moment is not quasiprimary but rather the angular variable is.
These examples underline the importance of the correct identification of the quasipri-
mary fields in a given model.
The explicit results in the XY model also illuminate in a new way the Godrèche–
Luck conjecture on the universality of the limit fluctuation–dissipation ratio. Further
tests on the form of the autocorrelation function in the Glauber–Ising model in d 2
dimensions are presently carried out and will be reported elsewhere.
7. We also showed that the autocorrelator in the critical voter model in 2 < d < 4
dimensions and which does not satisfy detailed balance, again agrees with (8.3). This
is the first example of a new domain of application of local scale-invariance. We also
confirmed (8.3) for the free random walk.
9 Indeed, to leading order in 1/n and for uncorrelated initial conditions [6],
d d −1
λ
C = d2 − 43 (d + 2) 2d d
9 B 1 + 2 ,1 + 2 n .
256 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
These four confirmations, although all based on an underlying free-field theory provide
further evidence in favour of a Galilei invariance of the noiseless theory.
8. The scaling of the linear response of the 1D Glauber–Ising model at T = 0 can only
be explained through a generalization of the representations of the Lie algebra of local
scale-invariance. It would be interesting to see whether a similar phenomenon could
be found in different 1D systems undergoing ageing at T = 0.
Our approach has been based in an essential way on the assumption of Galilei invariance
of the noiseless theory. However, Mazenko [67] recently studied phase-ordering kinetics
in the context of the time-dependent non-linear Ginzburg–Landau equation. He carried
out a second-order perturbative calculation around a Gaussian theory which is equivalent
to the Ohta–Jasnow–Kawasaki approximation and reports a deviation of the two-time
autoresponse function R(t, s) from the local scale-invariance prediction (1.7). A similar
difficulty had been observed before by Calabrese and Gambassi who studied non-
equilibrium dynamics (that is T = Tc ) of the O(n)-model, for both model A and model
C dynamics, through MSR field theory [13–15]. Again, already their classical action is
manifestly not Galilei-invariant.10
At face value, Mazenko’s result [67] is in disagreement with the simulational data
obtained from the 2D and 3D Glauber–Ising model with T < Tc and based on the master
equation. These do reproduce (1.7) for the autoresponse function R(t, s) [47,49,52] as well
as the extension for the spatio-temporal response R(t, s; r − r
) [49]. Could this mean that
there are subtle differences between the formulation of stochastic systems either through
a master equation or else through a coarse-grained Langevin equation11 and which affect
the formal Galilei invariance of the theory? Alternatively, if under renormalization the
dynamical exponent z = 2 remains constant, might the theory flow to a fixed point where
asymptotically Galilei invariance would hold?
All in all, based on a postulated extension of dynamical scaling to some local scale-
invariance, we have reformulated the problem of finding the scaling function of the two-
time autocorrelation function of ageing systems as one of a discussion of the properties
of certain three-point response functions of the noiseless theory. The evidence available at
present suggests that this approach might be capable of shedding a new light on the issue.
Finally, it might be of interest to search for extensions for dynamical exponents z = 2
and/or to study ageing systems with a conserved order parameter. Asymptotic information
on the two-point functions in the latter case is now becoming available [40].
We present the details of the calculations of the magnetic two-time correlation function
Γ and its associated linear response function ρ for the d-dimensional XY model in the
10 At criticality, the combined effect of thermal and initial fluctuations leads for interacting theories to a non-
trivial value of z = 2 under renormalization so that our arguments are no longer directly applicable.
11 In 1D and at T = 0, the autocorrelation exponent λ = 1 found in the Glauber–Ising model [37,65] differs
C
from the exactly known exponent λC = 0.6006 . . . [10] determined in the time-dependent Landau–Ginzburg
equation.
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 257
(A.1)
and
1
κ(u, u
; r − r) = T δ(u − u
)δ(r − r
) + δ(u)δ(u
)a(r − r
). (A.7)
2
258 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
−d/2 (r − r
)2
R0 (t, s; r, r ) = φ(t, r)φ̃(s, r ) 0 = Θ(t − s) 4π(t − s) exp − ,
4(t − s)
C0 (t, s; r, r
) = φ(t, r)φ(s, r
) 0 = 0,
C̃0 (t, s; r, r
) = φ̃(t, r)φ̃(s, r
) 0 = 0. (A.8)
Below, we shall need the equal-time response function R(t, s; r, r
)|t =s . To give to this
quantity a value, one may discretize the Langevin equation. This may be done according
to several different schemes, see [59,92]. Here, we shall use the Itô prescription which
amounts to
R0 (t, t; r, r
) = 0. (A.9)
The magnetic response function reads from (A.1)
ρ(t, s; r, r
) = φ̃(s, r
) sin φ(t, r) − φ(s, r
) exp −s[φ, φ̃] 0 (A.10)
where 0 is the average with the exp(−S0 [φ, φ̃]). Expanding the sine and using the
Newton’s binomial identity, we have
∞ 2k+1
ρ(t, s; r, r
) = ρk,p (t, s; r, r
),
k=0 p=0
(−1)k+p+1
ρk,p (t, s; r, r
) =
p!(2k + 1 − p)!
× φ̃(s, r
)φ p (t, r)φ 2k+1−p (s, r
) exp −s[φ, φ̃] 0 , (A.11)
so that ρk,p (t, s; r, r
) is given by a (2k + 2)-point function, containing only one φ̃
contribution and 2k + 1 fields φ. The Bargman superselection rule (A.8) implies that the
only contractions that will lead to non-vanishing averages come from the kth order in the
expansion of the exponential, so that we have
k
(−1)p+1
ρk,p (t, s; r, r
) = dj dj
p!(2k + 1 − p)!k!
j =1
p
× φ̃(s, r )φ (x, t)φ 2k+1−p
(s, r
)φ̃(j )κ(j, j
)φ̃(j
) 0 , (A.12)
where for clarity the notation j (respectively, j
) stands for (uj , r j ) (respectively, (u
j , r j
))
and where integrals run over this set of 2k variables.
Wick’s theorem states that the integrand decomposes into sums of products of two-point
functions. In order for a contraction not to vanish, the field φ̃(s, r
) must contract with one
of the p fields φ(t, r), which leads to
ρk,0 (t, s; r, r
) = 0,
k
(−1)p+1
ρk,p (t, s; r, r
) = dj dj
R0 (t, s; r, r
)
(p − 1)!(2k + 1 − p)!k!
j =1
p−1
× φ (t, r)φ 2k+1−p
(s, r )φ̃(j )κ(j, j
)φ̃(j
) 0 .
(A.13)
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 259
(2k)!k!
k=0 j =1
2k
× φ(t, r) − φ(s, r
) φ̃(j )κ(j, j
)φ̃(j
) 0 . (A.14)
At this stage, it will be interesting to give another equivalent expression of the magnetic
correlation function Γ (t, s; r, r
). Using the same strategy as before, but now expanding
the cosine we find
∞
Γ (t, s; r, r ) = γk (t, s; r, r
),
k=0
(−1)k 2k
γk (t, s; r, r
) = φ(t, r) − φ(s, r
) exp −s[φ, φ̃] 0 . (A.15)
(2k)!
For the same reason as before, because of mass conservation, only the expansion of order
k of the exponential term will contribute and lead to non-vanishing contractions, such that
γk (t, s; r, r
)
k
1 2k
= dj dj
φ(t, r) − φ(s, r
) φ̃(j )κ(j, j
)φ̃(j
) 0 . (A.16)
(2k)!k!
j =1
ρ(t, s; r, r
) = R0 (t, s; r, r
)Γ (t, s; r, r
). (A.17)
This is the main result of this appendix and gives Eq. (5.38) in the text.
We shall not discuss here the general solution of these equations. For our purposes it is
enough to observe that if we decompose F in the following symmetrized way
F = G(t, R)G(s, R
) + G(t, R
)G(s, R), (B.3)
which for a free-field theory would follow from Wick’s theorem, then
−x−x̃ M R2
G(t, R) = G0 t exp − (B.4)
2 t
produces a solution to both covariance conditions.
We apply this result to the preparation part Cpr of the autocorrelation function, with the
intention to use the result in the spin-wave approximation of the XY model. From Eq. (4.9)
we have
1
dR dR
a(R − R
)R0 (t, s, 0, 0; 0, 0, R, R
)
(4)
Cpr = (B.5)
2
R2d
Ti (ts)−(x+x̃)/2
= dR dR
dq
(2π)d q2
R3d
M R 2 R 2
× exp iq · (R − R
) − + (B.6)
2 t s
∞
dq
Ti e−q (t +s+z)
2
= dz (B.7)
(2π)d
0 Rd
(4π)−d/2
= Ti (t + s)1−d/2, (B.8)
d/2 − 1
where in going to (B.6) we used the initial condition (5.27) and the result (B.3), (B.4) from
above, next in going to (B.7) we specialized to a free Gaussian field (where x = x̃ = d) of
mass M = 1/2 and the last step we also assumed d > 2.
Eq. (B.8) provides the preparation term, for any initial temperature Ti , as required for
the analysis of the autocorrelation function in the XY model in Section 5.2
In the text we have seen that local scale-invariance implies the following form of the
two-time autoresponse function
1+a−λR /z
t
R(t, s) = r0 (t − s)−1−a . (C.1)
s
In spite of a nice agreement with a large variety of models, this expression is not verified
for the 1D Ising model with Glauber dynamics at T = 0.
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 261
We now show how Schrödinger invariance can be generalized such that the exact
response function (C.4) can be reproduced. Obviously, time-translation invariance is
broken in ageing systems. Therefore, as already pointed out in [45,46], the dynamical
symmetry cannot be the Schrödinger Lie algebra sch1 which contains the time-translation
generator X−1 = −∂t , but a subalgebra without this generator might be acceptable. We
consider the algebra [48]
age1 := {X0 , X1 , Y−1/2 , Y1/2 , M0 } (C.6)
and keeping the commutation relations (2.7) we now look for a more general realization of
age1 . In this way, we write the generators as {Ξ0,1 , Υ±1/2 , M0 }. These must be of the form
1 x M 2
Ξ0 = −t∂t − r∂r − , Ξ1 = −t 2 ∂t − tr∂r − xt − g(t) − r ,
2 2 2
Υ−1/2 = −∂r , Υ1/2 = −t∂r − Mr,
M0 = −M, (C.7)
262 A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265
[Ξ1 , Ξ0 ] = Ξ1 , (C.8)
which leads to
t∂t g − g = 0, (C.9)
with the solution g(t) = Kt, with K some constant. From (C.7), the dynamical exponent
z = 2. If we were to require, in addition, [Ξ1 , Ξ−1 ] = 2Ξ0 (and thereby go from age1
back to sch1 ), we would recover K = 0. Now, a quasiprimary field φ of age1 will be
characterized by a triplet (x, K, M).
We can now generalize local scale-invariance by requiring that the autoresponse
function R(t, s) formed from a quasiprimary field φ and its associated quasiprimary
response field φ̃ to transform covariantly under the generators Ξ0 and Ξ1 . It is a solution
of the system of linear partial differential equations
x x̃
t∂t + + s∂s + R(t, s) = 0,
2 2
2
t ∂t + (K + x)t + s 2 ∂s + (K̃ + x̃)s R(t, s) = 0, (C.10)
where x̃ and K̃ refer to the response field φ̃. Solving the system (C.10) gives as final result
x + x̃
a= − 1, A = a + K + K̃, λR = 2x + 2K. (C.12)
2
In contrast with the previous realization of age1 , K, K̃ = 0 is possible and then a and A
differ from each other.
Comparison with the exact result (C.4) of the 1D Glauber–Ising model now gives
complete agreement and we identify the exponents
1
a = 0, A=− , λR = 1. (C.13)
2
Of course, the values of a and λR have been obtained before [37,65] but A seems to be a
new exponent.
At present, it must remain open whether the unusual properties of the 1D Glauber–Ising
model are related to the fact that Tc = 0 and therefore the critical and low-temperature
properties might have become mixed.
Also, it remains to be seen whether the form of the autocorrelation function can be
understood from the generalized realization of the ageing algebra age1 . We hope to come
back to this elsewhere.
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 263
References
[44] M. Henkel, Phase Transitions and Conformal Invariance, Springer, Heidelberg, 1999.
[45] M. Henkel, M. Pleimling, C. Godrèche, J.-M. Luck, Phys. Rev. Lett. 87 (2001) 265701.
[46] M. Henkel, Nucl. Phys. B 641 (2002) 405.
[47] M. Henkel, M. Paeßens, M. Pleimling, Europhys. Lett. 62 (2003) 644.
[48] M. Henkel, J. Unterberger, Nucl. Phys. B 660 (2003) 407.
[49] M. Henkel, M. Pleimling, Phys. Rev. E 68 (2003) 065101(R).
[50] M. Henkel, A. Picone, M. Pleimling, J. Unterberger, cond-mat/0307649.
[51] M. Henkel, G.M. Schütz, J. Phys. A 37 (2004) 591.
[52] M. Henkel, M. Paeßens, M. Pleimling, Phys. Rev. E 69 (2004), in press, cond-mat/0310761.
[53] P.C. Hohenberg, B.I. Halperin, Rev. Mod. Phys. 49 (1977) 435.
[54] D.A. Huse, Phys. Rev. B 40 (1989) 304.
[55] C. Itzykson, J.M. Drouffe, Théorie Statistique des Champs 1, Editions du CNRS, Paris, 1989.
[56] E.V. Ivashkevich, J. Phys. A 30 (1997) L525.
[57] R. Jackiw, Phys. Today 25 (1) (1972) 23.
[58] H.K. Janssen, B. Schaub, B. Schmittmann, Z. Phys. B 73 (1989) 539.
[59] H.K. Janssen, in: G. Györgyi, et al. (Eds.), From Phase Transitions to Chaos, World Scientific, Singapore,
1992, p. 68.
[60] V.L. Katkov, Zh. Prikl. Mekh. Tekh. Fiz. 6 (1965) 105.
[61] J.M. Kosterlitz, D.J. Thouless, J. Phys. C 6 (1973) 1181.
[62] V.I. Lahno, J. Phys. A 31 (1998) 8511.
[63] S. Lie, Arch. Math. Nat. Vid. (Kristiania) 6 (1882) 328.
[64] T.M. Ligget, Stochastic Interacting Systems: Contact, Voter and Exclusion Process, Springer, Heidelberg,
1999.
[65] E. Lippiello, M. Zanetti, Phys. Rev. E 61 (2000) 3369.
[66] P.C. Martin, E.D. Siggia, H.H. Rose, Phys. Rev. A 8 (1973) 423.
[67] G.F. Mazenko, Phys. Rev. E 69 (2004) 016114.
[68] P. Mayer, P. Sollich, J. Phys. A 37 (2004) 9;
P. Mayer, L. Berthier, J.P. Garrahan, P. Sollich, Phys. Rev. E 68 (2003) 016116.
[69] T.J. Newman, A.J. Bray, J. Phys. A 23 (1990) 4491.
[70] U. Niederer, Helv. Phys. Acta 45 (1972) 802.
[71] U. Niederer, Helv. Phys. Acta 47 (1974) 167.
[72] A.G. Nikitin, R.O. Popovych, Ukr. Math. J. 53 (2001) 1255.
[73] M.J. de Oliveira, J.F.F. Mendes, M.A. Santos, J. Phys. A 26 (1993) 2317.
[74] L. O’Raifeartaigh, V.V. Sreedhar, Ann. Phys. 293 (2001) 215.
[75] M. Paeßens, M. Henkel, J. Phys. A 36 (2003) 8983.
[76] M. Perroud, Helv. Phys. Acta 50 (1977) 233.
[77] A. Picone, M. Henkel, J. Phys. A 35 (2002) 5575.
[78] A. Picone, M. Henkel, J. Richert, J. Phys. A 36 (2003) 1249.
[79] A. Picone, Thèse de doctorat, Nancy, 2004.
[80] A.M. Polyakov, Sov. Phys. JETP Lett. 12 (1970) 381.
[81] R.O. Popovych, N.M. Ivanova, H. Eshraghi, math-ph/0311039.
[82] G. Rideau, P. Winternitz, J. Math. Phys. 34 (1993) 558.
[83] F. Rojas, A.D. Rutenberg, Phys. Rev. E 60 (1999) 212.
[84] A.D. Rutenberg, A.J. Bray, Phys. Rev. E 51 (1995) R1641.
[85] A.D. Rutenberg, A.J. Bray, Phys. Rev. E 51 (1995) 5499.
[86] A.D. Rutenberg, Phys. Rev. E 54 (1996) R2181.
[87] A.D. Rutenberg, B.P. Vollmayr-Lee, Phys. Rev. Lett. 83 (1999) 3772.
[88] F. Sastre, I. Dornic, H. Chaté, Phys. Rev. Lett. 91 (2003) 267205.
[89] G. Schehr, P. Le Doussal, Phys. Rev. E 68 (2003) 046101.
[90] L.C.E. Struik, Physical Ageing in Amorphous Polymers and Other Materials, Elsevier, Amsterdam, 1978.
[91] U.C. Täuber, Critical Dynamics: A Field-Theory Approach to Equilibrium and Non-Equilibrium Scaling
Behaviour, 2004, in press.
A. Picone, M. Henkel / Nuclear Physics B 688 [FS] (2004) 217–265 265
[92] N. van Kampen, Stochastic Processes in Physics and Chemistry, second ed., North-Holland, Amsterdam,
1997.
[93] C. Yeung, M. Rao, R.C. Desai, Phys. Rev. E 53 (1996) 3073.
[94] J. Zinn-Justin, Quantum Field Theory and Critical Phenomena, fourth edition, Oxford Univ. Press, Oxford,
1989–2002.
[95] W. Zippold, R. Kühn, H. Horner, Eur. Phys. J. B 13 (2000) 531.
Nuclear Physics B 688 [FS] (2004) 266–290
www.elsevier.com/locate/npe
Abstract
For the one-dimensional Hubbard model with Aharonov–Bohm-type magnetic flux, we study
the relation between its symmetry and the number of Bethe states. First we show the existence
of solutions for Lieb–Wu equations with an arbitrary number of up-spins and one down-spin, and
exactly count the number of the Bethe states. The results are consistent with Takahashi’s string
hypothesis if the system has the so(4) symmetry. With the Aharonov–Bohm-type magnetic flux,
however, the number of Bethe states increases and the standard string hypothesis does not hold. In
fact, the so(4) symmetry reduces to the direct sum of charge-u(1) and spin-sl(2) symmetry through
the change of AB-flux strength. Next, extending Kirillov’s approach [J. Sov. Math. 30 (1985) 2298,
J. Sov. Math. 36 (1987) 115], we derive two combinatorial formulas from the relation among the
characters of so(4)- or (u(1) ⊕ sl(2))-modules. One formula reproduces Essler–Korepin–Schoutens’
combinatorial formula for counting the number of Bethe states in the so(4)-case. From the exact
analysis of the Lieb–Wu equations, we find that another formula corresponds to the spin-sl(2) case.
2004 Elsevier B.V. All rights reserved.
1. Introduction
0550-3213/$ – see front matter 2004 Elsevier B.V. All rights reserved.
doi:10.1016/j.nuclphysb.2004.04.004
A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290 267
dimensional spin- 21 isotropic Heisenberg spin chain [1], opened a new realm of mathe-
matical physics. Roughly speaking, Bethe’s work consists of three parts: (i) Bethe states
are introduced and Bethe equations, which are sufficient conditions for the Bethe states
being eigenstates, are derived; (ii) the existence of solutions for the Bethe equations are
discussed in simple cases, and the general forms of solutions are conjectured, which are
called string hypothesis; (iii) under the string hypothesis, the formula for counting the num-
ber of the Bethe states is provided, which leads to the combinatorial completeness of Bethe
ansatz. At present the Bethe ansatz method is applied to various kinds of one-dimensional
spin chains and strongly correlated electron systems [16,22,30].
In Bethe’s work, the combinatorial formula for counting the number of Bethe states
possesses a wealth of mathematical implications. In the case of the isotropic Heisenberg
spin chain, the Bethe states constructed by finite-valued solutions of Bethe equations
do not produce all the eigenstates. In fact the system has sl(2) symmetry and the
Bethe states are sl(2)-highest [6]. The eigenstates other than highest weight vectors, i.e.,
sl(2)-descendant states, are constructed by applying the lowering operator to the Bethe
states. Mathematically, the number of Bethe states is interpreted as the multiplicity of
irreducible components in the tensor products of two-dimensional highest weight sl(2)-
modules. Bethe’s formula is also extended to the generalized Heisenberg spin chains
with higher spins or sl(n) symmetry, for which the powerful tools such as Q-systems are
introduced [12,13].
The application of Bethe ansatz method to the one-dimensional Hubbard model was
given by Lieb and Wu [21]. The Bethe equations for the Hubbard model are often called
Lieb–Wu equations. Takahashi’s string hypothesis asserts that, in the thermodynamic limit,
the solutions of Lieb–Wu equations are approximated by string solutions, and the number
of Bethe states is estimated under the hypothesis [28,29]. The Hubbard model with even
sites has so(4) symmetry [7,32,33]. Essler, Korepin and Schoutens proved that, when the
system has the so(4) symmetry, the Bethe states are so(4)-highest [5]. They also showed in
a combinatorial way that all the eigenstates are obtained by taking the so(4)-descendants
of the Bethe states into account [3]. On the other hand the completeness of Bethe ansatz
for the system with odd sites, which has just sl(2) symmetry related to the spin degrees of
freedom, has not been discussed.
In this article, we study the Bethe states in the one-dimensional Hubbard model. In
particular, we deal with the system on a ring with Aharonov–Bohm-type magnetic flux.
The system has so(4) symmetry only at special values of the AB-flux strength and the so(4)
symmetry reduces to spin-sl(2) symmetry for other values. More precisely, for a generic
value of the AB-flux strength, the so(4) symmetry breaks into the direct sum of the charge-
u(1) and the spin-sl(2) symmetry. Varying the AB-flux strength, we investigate solutions
of Lieb–Wu equations. Here we recall that all the enumeration of Bethe states we have
mentioned above are based on the string hypothesis. However, the violation of the string
hypothesis is numerically observed: for the spin- 21 isotropic Heisenberg spin chain, some
of the string solutions reduce to real solutions when the number of sites is large, which
is called redistribution phenomenon [4,8,9]. Thus, without making any approximation, we
discuss the existence of Bethe ansatz solutions with the AB-flux. In particular, we show
the existence of solutions of the Lieb–Wu equations with an arbitrary number of up-spins
and one down-spin. Here we employ a graphical approach [2,24]. We exactly count the
268 A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290
number of solutions in the case and verify that the enumeration with the string hypothesis
is correct only in the so(4)-case. We find that the Lieb–Wu equations for the system with
only the spin-sl(2) symmetry have more solutions than those in the so(4)-case. Next we
study the combinatorial completeness of Bethe ansatz. We obtain the relation among the
characters of so(4)-modules through the power series identities similar to Kirillov’s [12,
13], which gives a new proof of Essler–Korepin–Schoutens’ combinatorial completeness of
Bethe ansatz [3]. We also introduce a new combinatorial formula derived from the relation
among the characters of (u(1) ⊕ sl(2))-modules. The formula suggests the combinatorial
completeness of Bethe ansatz for the system only with the spin-sl(2) symmetry, which has
not been discussed in the literature.
The Bethe ansatz solutions with the AB-flux should be quite important in the low-
dimensional physics of the one-dimensional Hubbard model. As pointed out by Kohn,
the low frequency conductivity is directly related to the shift of the energy levels due to
twisted boundary conditions [15]. Sharpening Kohn’s argument on electron systems in
any dimensions, Shastry and Sutherland discussed effective mass of the one-dimensional
Hubbard model through the twisted boundary conditions [25]. Furthermore, Kawakami
and Yang obtained an explicit expression for the effective mass of the electric conductivity
for the one-dimensional Hubbard model [10,11]. For the Bethe ansatz solutions with the
twisted boundary conditions, there are other aspects such as persistent current associated
with the AB-flux.
The article is organized as follows: in Section 2, we review the symmetry of the
one-dimensional Hubbard model and the Bethe ansatz method. We also describe how to
construct eigenstates other than the Bethe states. In Section 3 we prove the existence of
solutions for the Lieb–Wu equations with one down-spin and exactly count the number of
Bethe states in varying the strength of AB-flux. In Section 4 we study the combinatorial
formulas for counting the Bethe states in terms of the characters of so(4)- and (u(1) ⊕
sl(2))-modules. The final section is devoted to summary and concluding remarks.
The coefficients ψk,λ (x; s) in (2.2) are explicitly given in [31]. The Bethe states (2.2) are
eigenstates of the Hamiltonian (2.1) if {ki , λα } satisfy the following equations:
√
√ λβ − sin(ki + φ) − −1U/4
−1ki L
e = √ ,
1βM β
λ − sin(ki + φ) + −1U/4
√ √
λα − sin(ki + φ) − −1U/4 λα − λβ − −1U/2
√ = √ , (2.3)
1iN α
λ − sin(ki + φ) + −1U/4 β(=α) λα − λβ + −1U/2
which are coupled non-linear equations called Lieb–Wu equations [21]. The Lieb–Wu
equations have not been solved analytically. However it predicts some important results
on thermodynamic properties of the system through Takahashi’s string hypothesis [16,28–
30]. In terms of the solutions {ki , λα } of the Lieb–Wu equations (2.3), energy eigenvalues
are written as
φ φ
Hφ |k, λ; sN,M = E|k, λ; sN,M ,
1
E = −2 cos(ki + φ) + U (L − 2N). (2.4)
4
1iN
Recall that the Bethe states (2.2) give only the eigenstates with 0 2M N L. In order
to construct other eigenstates, we need to consider the symmetries of the system.
φ
Hereafter we shall sometimes abbreviate the superscript φ in |k, λ; sN,M .
2.2. Symmetries
The U -independent symmetries of the Hubbard model are classified in [7]. First we
review the symmetries connected with the spin and charge degrees of freedom [26,27,32,
33]. Define the following operators related to the spin degrees of freedom:
1 †
Sz := (ni↑ − ni↓ ), S+ := ci↑ ci↓ , S− := (S+ )† . (2.5)
2
1iL 1iL
They give a representation of the algebra sl(2) on the Fock space V . Since all the operators
{Sz , S± } commute with Hφ (2.1) for an arbitrary value of φ, it is said that the system has
270 A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290
spin-sl(2) symmetry. One finds another representation of sl(2) related to the charge degrees
of freedom,
1
ηz := (1 − ni↑ − ni↓ ),
2
1iL
√
η+ := e −1(2φ+π)i ci↓ ci↑ , η− := (η+ )† . (2.6)
1iL
Note that all the operators in (2.6) are commutative with {Sz , S± } (2.5). It is easy to see
that the operator ηz also commutes with Hφ for an arbitrary value of φ. However other
operators η± commute with Hφ only for the special values of φ satisfying L2 + Lφ π ∈ Z.
Thus the system has charge-sl(2) symmetry if L2 + Lφ π ∈ Z, and it reduces to charge-u(1)
symmetry given by ηz for other values of φ. Combining the above two kinds of sl(2)
symmetries, we see that the system has so(4)( sl(2) ⊕ sl(2)) symmetry if L2 + Lφ π ∈ Z,
and u(1) ⊕ sl(2) symmetry otherwise. For simplicity, we call the former so(4)-case and the
latter sl(2)-case. For the later discussion, we define the Casimir operators for each sl(2),
1 1
η2 := (η+ η− + η− η+ ) + ηz2 , S 2 := (S+ S− + S− S+ ) + Sz2 ,
2 2
which are employed to see the dimension of each representation.
Next we introduce the following three operators:
†
Ts := Pi↑; i↓ , Tph := cis + cis ,
1iL 1iL s=↑,↓
Tr := Pis; L−i,s ,
s=↑,↓
2
1i L−1
†
where Pis; j t := 1 − (cis − cj†t )(cis − cj t ) and x
denotes the greatest integer in x. One
notices that Ts induces a spin-reversal transformation, Tph a particle–hole transformation
and Tr a reflection of the lattice. Direct calculation shows
−1
Ts−1 Hφ Ts = Hφ , Tph Hφ Tph = Hπ−φ , Tr−1 Hφ Tr = H−φ .
For the system with even L, i.e., a bipartite lattice, we also introduce
Tb := (−1)n2i,s , Tb−1 Hφ Tb = Hφ+π .
1i L2 s=↑,↓
We construct eigenstates that are not included in the Bethe states (2.2). First we consider
the system with so(4) symmetry, i.e., both spin-sl(2) and charge-sl(2) symmetries. Here
we recall that the AB-flux parameter φ takes a special value: L2 + Lφ
π ∈ Z for even or odd L.
Then, as shown in [5] for even L with φ = 0, the Bethe states (2.2) characterized by finite-
valued solutions of the Lieb–Wu equations (2.3) are so(4)-highest, i.e., η+ |k, λ; sN,M =
S+ |k, λ; sN,M = 0. Since
Fig. 1. Bethe states and non-Bethe states for L = 5. The Bethe state with (N, M) = (3, 1) denoted by a closed
circle produces five so(4)-descendants denoted by open circles if the system has so(4) symmetry. As the so(4)
symmetry reduces to sl(2) symmetry, the six eigenstates form three doublets of sl(2). Then we need one more
Bethe state with (5, 2). Note that the eigenstate with (7, 3) is not a Bethe states and is constructed through the
transformation Tr Tph .
272 A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290
are eigenstates of Hφ (2.1) (see Fig. 1). One notices here that the Bethe states (2.2) and
their sl(2)-descendants (2.10) do not produce the eigenstates with L < N 2L since
the application of the lowering operator S− does not change the number of electrons.
Such eigenstates with L < N 2L are constructed as follows: (i) by applying the
φ
transformation Tb Tr Tph to the Bethe state |k, λ; s2L−N,L−M (L < N 2M 2L) if L
φ+π
is even, or by applying Tr Tph to the Bethe state |k, λ; s2L−N,L−M (L < N 2M 2L)
obtained by the Lieb–Wu equations (2.3) with φ + π instead of φ if L is odd, we get the
lowest weight vector of a highest weight sl(2)-module; (ii) the application of the raising
operators (S+ )n , (0 < n 2M − N) to the lowest weight vector produces other degenerate
eigenstates.
Even if we are interested only in the system with φ = 0, the Bethe ansatz method needs
the system with φ = π for odd L. One of the main purposes in this article is therefore to
discuss whether or not the above procedure produces all the eigenstates.
First we consider the real solutions. For M = 1, the Lieb–Wu equations (2.3) reduce to
√
√
−1ki L λ − sin(ki + φ) − −1U/4
e = √ (i = 1, 2, . . . , N),
λ − sin(ki + φ) + −1U/4
√
λ − sin(ki + φ) − −1U/4
√ = 1. (3.1)
1iN
λ − sin(ki + φ) + −1U/4
Fig. 2. The generic behaviour of sin(q + φ) − λ and U4 cot( qL2 ) in (3.3) under the condition U L > 8 for L = 6.
Horizontal dashed lines divide six branches as specified by (3.4). Each branch has an intersection which leads to
a solution of (3.3).
− <q <
+ ,
∈ j = 1, 2, . . . , L . (3.4)
L 2 L 2 2
Thus there exist N − 1 values of λ giving the following integer values for L
2π i q
i (λ):
1
m∈
i − + j
j = 1, 2, . . . , N − 1 .
2
1iN
Note that such {λ} and integers {m} are in one-to-one correspondence due to dqdλ
(λ)
> 0.
As a consequence, the solutions {ki = q
i (λ), λ} are specified by the set of indices {
i , m}.
L
The number of possible {
i , m} is given by ( N )(N − 1). Here we note that the number is
consistent with the formula Z(L; N, M) in [3,28].
L
Proposition 3.1. For U > 8
L, the Lieb–Wu equations (2.3) with M = 1 have ( N )(N − 1)
real solutions [2,24].
ki ∈ R/2πR (i = 1, 2, . . . , N − 2),
√ √
kN−1 = ζ − −1ξ, kN = ζ + −1ξ,
where 0 ζ < 2π and ξ > 0. Note that kN−1 and kN form a complex conjugate pair
which is referred to as (k–Λ–2)-string. Then the first set of equations in (3.1) are rewritten
in terms of real variables as follows
U ki L
sin(ki + φ) − λ = cot (i = 1, 2, . . . , N − 2), (3.6a)
4 2
U sin(ζ L)
sin(ζ + φ) cosh ξ − λ = , (3.6b)
4 cosh(ξ L) − cos(ζ L)
U sinh(ξ L)
cos(ζ + φ) sinh ξ = − . (3.6c)
4 cosh(ξ L) − cos(ζ L)
On the other hand, the second equation in (3.1) is equivalent to the following condition:
2π
ki + 2ζ = m, with m = 0, 1, . . . , NL − 1. (3.7)
L
1iN−2
In the same way as the previous case of Section 3.2, if we consider a solution of each
Eq. (3.6a) in one of the branches (3.4), and the solution can be written as a function of λ.
Given a set {
i | 1 i N − 2} of non-repeating indices specifying the branches (3.4),
we express the solutions of (3.6a) as ki = q
i (λ) (1 i N − 2). Then, from the relation
(3.7), the ζ is also written as a function of λ,
π 1
ζ = ζ (λ) := m − q
i (λ), (3.8)
L 2
1iN−2
for fixed {
i } and m.
A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290 275
For an illustration, we consider (3.6b) and (3.6c) in the case N = 2. Since ζ does not
depend on λ in the case, Eqs. (3.6b) and (3.6c) decouple into the following:
π U
λ = sin m + φ cosh ξ, sinh ξ = − f (2) (ξ ), (3.9)
L 4 cos( πL m + φ)
where
sinh(ξ L) coth(ξ L/2), for m ∈ 2Z,
f (2) (ξ ) := =
cosh(ξ L) − (−1)m tanh(ξ L/2), for m ∈ 2Z + 1.
We seek a solution of the second equation in (3.9) through graphical discussion. Since
f (2)(ξ ) > 0 when ξ > 0, we need the condition π2 < πL m + φ < 3π 2 so that the second
equation of (3.9) has a solution. If such m is even, it is straightforward that the second
equation in (3.9) determines a unique solution ξ(> 0) since limξ →∞ f (2) (ξ ) = 1. For odd
m, the equation has a unique solution if the condition
d(sinh ξ ) U df (2) UL
1= (0) < min − (0) = ,
dξ m∈2Z+1 4 cos( πL m + φ) dξ 8
π
2 < πL m+φ< 3π
2
system has the so(4) symmetry. Thus, as the so(4) symmetry reduces to the spin-sl(2)
symmetry through the change of AB-flux strength φ, the number of solutions for the Lieb–
Wu equations increases.
Let us consider the case N > 2. By inserting (3.8) into (3.6c), we have
U
sinh ξ = − fa (ξ ), (3.10a)
4 cos(ζ (λ) + φ)
sinh(ξ L) L
fa (ξ ) := , a = (−)m cos q
i (λ) . (3.10b)
cosh(ξ L) − a 2
1iN−2
Note that |a| 1. One sees that, for a fixed a, the function fa (ξ ) has the following
properties: (i) fa (0) = 0, df dξ (0) 2 , limξ →∞ fa (ξ ) = 1; (ii) if a < 0, fa (ξ ) is
a L
monotonically increasing and concave; (iii) if a > 0, fa (ξ ) has a single positive maximum
at ξ (> 0) and a single turning point at ξ (> ξ ). These properties are sufficient to discuss
the solution ξ of (3.10) for arbitrary λ. From the graphical discussion similar to the case
N = 2 (see Fig. 3), this determines a unique ξ as a function of λ under the condition
d(sinh ξ ) U dfa UL UL
1= (0) < min − (0) = min = ,
dξ |a|1 4 cos(ζ + φ) dξ |a|1 4(1 − a) 8
π
2 <ζ +φ< 3π
2
276 A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290
U
Fig. 3. The behaviour of sinh ξ and − 4 cos(ζ +φ) fa (ξ ) in (3.10a) under the conditions U L > 8 and
π < ζ + φ < 3π . The ξ and ξ respectively denote the maximum and the turning point of − U
2 2 4 cos(ζ +φ) fa (ξ ) in
the cases 0 < a < 1.
if and only if π2 < ζ (λ) + φ = πL m + φ − 12 i q
i (λ) < 3π2 . By using (3.5), it is sufficient
to have a unique solution for (3.10) that the integer m satisfies
1 L L 1 3L L
i + + − φ<m<
i − + − φ,
2 2 π 2 2 π
1iN−2 1iN−2
that is,
i (
i + 12 ) + 2 − π φ + j | j = 1, 2, . . . , L − N + 1 ,
L L L
−L
π φ ∈ Z,
m∈ 2
i (
i + 2 ) + 2 − π φ
+ j | j = 1, 2, . . . , L − N + 2 ,
1 L L L
2 −L
πφ ∈
/ Z.
(3.11)
(2)
Note that limλ→±∞ ξ(λ) = ξ , which is well-defined for the above m. We see
m− i (
i ± 12 )
that, for the values of m given in (3.11), Eq. (3.6b) with ξ(λ) and ζ(λ)
π 1
λ = sin m+φ− q
i (λ) cosh ξ(λ)
L 2
i
U sin( L2 i q
i (λ))
−
4 cos( L2 i q
i (λ)) − (−)m cosh(ξ(λ)L)
=: g q
i (λ) , ξ(λ) , (3.12)
A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290 277
Here g is a continuous and finite function with respect to λ. Hence there exists a solution
λ for Eq. (3.12).
L
Proposition 3.2. For U > L8 , the Lieb–Wu equations (2.3) with M = 1 have ( N−2 )(L −
N + 1) (k–Λ–2)-string solutions if the system has the so(4) symmetry, and they have
L
( N−2 )(L − N + 2) (k–Λ–2)-solutions, otherwise.
One notices that, only for the system with the so(4) symmetry, the number of (k–Λ–
2)-solutions is consistent with the string hypothesis [28]. Note that, for 0 < U < L8 , some
of the (k–Λ–2)-strings may disappear. In Appendix A, we numerically investigate the case
N = 2 and show that, for 0 < U < L8 , the (k–Λ–2)-strings with odd m disappear, while
additional real solutions appear [5]. For the system with only the spin-sl(2) symmetry,
the Lieb–Wu equations have more (k–Λ–2)-solutions than those expected by the string
hypothesis.
Applying the above results, we now show that all the eigenstates can be constructed
through the Bethe ansatz method in a simple case. We consider the case L = 3 and
φ = 2L
π 3π
, 2L when the system does not have so(4) symmetry but the spin-sl(2) symmetry.
We note that the completeness of eigenstates in this situation has not been discussed in the
literature.
φ
The number of Bethe states |k, λ; sN,M (0 2M N 3) is exactly calculated as
follows: the case M = 0 is trivial since the eigenstates are those of lattice free fermion
system; for the cases (N, M) = (2, 1) and (3, 1), we have obtained the following formulas
from Propositions 3.1 and 3.2:
3 N−1
, for real solutions,
(Bethe states) = N 3 15−N
N−2 1 , for (k–Λ–2)-string solutions.
φ
Since each Bethe state |k, λ; sN,M corresponds to the highest weight vector of a highest
weight sl(2)-module, we should count their sl(2)-descendant states
φ
(S− )n |k, λ; sN,M (0 < n N − 2M).
The eigenstates with 4 N 6, which are not Bethe states nor their sl(2)-descendant
states, are constructed through the transformation Tr Tph as we have described in
φ+π
Section 2.3. Indeed, by applying Tr Tph to the Bethe state |k, λ; s6−N,3−M (4 N
2M 6) of the system described by the Hamiltonian Hφ+π , we get the eigenstate
φ+π
Tr Tph |k, λ; s6−N,3−M of Hφ with 4 N 2M 6 which is the lowest weight vector
278 A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290
Table 1
π , 3π
Enumeration of eigenstates for L = 3 and φ = 2L 2L
N M 6−N 3−M Type of solutions (Bethe) sl(2) sym. (state)
0 0 real 1 1 1
1 0 real 3 2 6
2 0 real 3 3 9
2 1 real 3 1 3
2 1 (k–Λ–2)-string 3 1 3
3 0 real 1 4 4
3 1 real 2 2 4
3 1 (k–Λ–2)-string 6 2 12
4 2 2 1 real 3 1 3
4 2 2 1 (k–Λ–2)-string 3 1 3
4 3 2 0 real 3 3 9
5 3 1 0 real 3 2 6
6 3 0 0 real 1 1 1
64
Table 1 indeed shows that we obtain 64 = 43 = dim V eigenstates, which give a complete
system of the Fock space V .
The Bethe ansatz method was first introduced in the case of one-dimensional spin- 21
isotropic Heisenberg spin chain [1]. Bethe assumed the string hypothesis and estimated the
number Z(N; M) of solutions for the Bethe equations with M down-spins on an N -site
chain as
P + M
m m
Z(N; M) = , (4.1)
Mm
{M
n } m1
M= nMn
where x
denotes the greatest integer in x.
It is known that, in general, solutions of Bethe equations do not have the nature assumed
in the string hypothesis [4,8,9]. Indeed, for N > 21, some of the 2-string solutions are
redistributed to real solutions that are not counted in Z(N; M) [4]. Hence, when we
actually employ the formula Z(N; M) to show the completeness of Bethe ansatz, we must
regard such redistributed real solutions as 2-string solutions.
We apply the techniques developed in [12,13] to the Hubbard model with so(4)
symmetry. Indeed a new proof for Essler–Korepin–Schoutens’ combinatorial completeness
of Bethe ansatz [3] is obtained as a corollary of the relation among the characters of so(4)-
modules. Moreover, based on the results in the previous section, we propose the conjectural
formula related to the combinatorial completeness of Bethe ansatz for the system with
only the charge-u(1) and spin-sl(2) symmetry. In both cases, we obtain the formulas
corresponding to (4.2), which has not been established even for the so(4)-case.
Lemma 4.1. The power series ϕn (zn , . . . , zl ) has the following expression:
Pm (am ) + Mm
ϕn (zn , . . . , zl ) = znMn · · · zlMl ,
Mm
Mn ,...,Ml 0 nml
for 1 n l,
where
Pn (an ) = an − 2M + 2 (m − n)Mm .
m>n
(k)
Introduce the variables {zn | 1 n l, 0 k n} through
(k−1) −2(n−k) (k−1)
zn(0) = zn , zn(k) = 1 − zk zn , for 1 k n.
Lemma 4.2. The power series ϕ1 (z1 , . . . , zl ) is rewritten in terms of the variables {zn(k) },
−a +2M−1
ϕ1 (z1 , . . . , zl ) = 1 − zn(n−1) n .
1nl
Proof. Since
(k−1) (k−1)
ϕk zk , . . . , zl
(k−1) (k−1) −2 (k−1) (k−1) −2(l−k) (k−1)
= ψk zk ϕk+1 1 − zk zk+1 , . . . , 1 − zk zl
−a +2M−1
= 1 − zk(k−1) k (k)
ϕk+1 zk+1 , . . . , zl(k) ,
for 1 k l − 1, the lemma is proved.
Define the polynomials {Qn = Qn (t) | n ∈ Z0 } through the recursion relation,
Lemma 4.3.
Proof.
The relations (4.4a) are called the Q-system of type sl(2), which is a key object in [12].
Indeed the expression (4.4b) produces an identity among the characters of sl(2)-modules.
The Q-system also plays a significant role in the combinatorial identities associated with
the XXZ-Heisenberg spin chain and its generalizations [14,17–19].
A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290 281
Using the above lemmas, we discuss the Hubbard-case. In the similar way, we
define ϕn (zn , zn+1 , . . . , zl ) and Pn (an ) by replacing an and 2M with an and N in
ϕn (zn , zn+1 , . . . , zl ) and Pn (an ), respectively. Define
ϕ(s, t) := (1 + s)L ϕ1 s 2 t, s 4 t 2 , . . . , s 2l t l ϕ1 t, t 2 , . . . , t l .
The following is straightforward from Lemma 4.1:
L P (a ) + M Pn (an ) + Mn
n n n
ϕ(s, t) =
Nr Mn Mn
0Nr L 1nl
{Mn ,Mn }
× s Nr +2 m1 mMm t m1 m(Mm +Mm ) .
Then the coefficient of s N t M (0 2M N L) in ϕ(s, t) is expressed by
Z L, {an , an }; N, M
L Pn (an ) + Mn Pn (an ) + Mn
:= ,
Nr Mn Mn
{Nr ,Mn
,Mn } 1nl
r +2 nMn
N=N
M= n(Mn +Mn )
where
the sum runs over
all configurations {Nr , Mn , Mn 0} such that N = Nr +
2 n0 nMn and M = n1 n(Mn + Mn ). We calculate explicit forms for the power
series ϕ(s, t) after taking the special values of {an } and {an }.
Introduce
L Pn + Mn Pn + Mn
Z(L; N, M) = , (4.5a)
Nr Mn Mn
{Nr ,Mn,Mn } n1
N=N
r +2 nMn
M= n(Mn +Mn )
L Pn + Mn + n Pn + Mn
Z̃(L; N, M) = , (4.5b)
Nr Mn Mn
{Nr ,Mn,Mn } n1
N=N
r +2 nMn
M= n(Mn +Mn )
where
Pn = L − N + 2
(m − n)Mm , Pn = N − 2M + 2 (m − n)Mm .
m>n m>n
u(1 + v) v (1 − u2 v)(1 − v)
s= , t= , ds dt = du dv,
1 + u2 v (1 + v)2 (1 + u2 v)2 (1 + v)2
A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290 283
First we consider the relation (4.7a). If we express the left-hand side of (4.7a) as F (x, y),
we find the property F (x −1 , y) = F (x, y −1 ) = −F (x, y). Then the first relation (4.7a) is
284 A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290
rewritten as
L
x + x −1 + y + y −1
x L−N+1 − x −L+N−1 y N−2M+1 − y −N+2M−1
= Z(L; N, M) , (4.8)
x − x −1 y − y −1
0NL
0M N/2
where x
denotes the greatest integer in x. Let Vn be an (n + 1)-dimensional irreducible
sl(2)-module associated with the charge-sl(2) symmetry and Vn that associated with the
spin-sl(2) symmetry. We introduce an (n + 1)(m + 1)-dimensional irreducible so(4)-
module by the tensor product Vn,m = Vn ⊗ Vm . Through the representation (2.5) and
(2.6) of so(4), the Fock space V of the L-site system is isomorphic to the tensor product
(V1,0 ⊕ V0,1 )⊗L as an so(4)-module. Note that
n n m m
η2 |v = + 1 |v, S 2 |v = + 1 |v, for |v ∈ Vn,m ⊂ V .
2 2 2 2
We now decompose the Fock space V into the direct sum of Vn,m . From the relations (2.8),
each Bethe state |k, λ; sN,M corresponds to the highest weight vector belonging to
VL−N,N−2M = V2η,2S . The characters of the so(4)-module Vn,m are calculated as
x n+1 − x −n−1 y m+1 − y −m−1
ch Vn,m = .
x − x −1 y − y −1
To be precise, x = eΛ1 and y = eΛ1 where both Λ1 and Λ1 are the fundamental weight of
sl(2) and they are orthogonal to each other. One notices that, in terms of the characters, the
identity (4.8) can be rewritten as
(ch V1,0 + ch V0,1 )L = Z(L; N, M) ch VL−N,N−2M . (4.9)
0NL
0M N/2
Next we turn to the relation (4.7b). If we express the left-hand side of (4.7b) as F̃ (x, y),
we find F̃ (x, y −1 ) = −F̃ (x, y). This gives
L y N−2M+1 − y −N+2M−1
1 + x y + y −1 + x 2 = Z̃(L; N, M) x N .
y − y −1
0N2L
0M N/2
(4.11)
(N)
Let ⊂ V be a subspace of the Fock space V with N electrons and let
V (N) ⊂ V (N) Vm
be an (m + 1)-dimensional irreducible sl(2)-module related to the spin-sl(2) symmetry in
A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290 285
(N)
V (N) . The Vm can be regarded as a 1 × (m + 1)-dimensional irreducible (u(1) ⊕ sl(2))-
module by employing the representation (2.5) of sl(2) and taking u(1) = C(L − 2ηz ) with
(2.6). Here
m m
(L − 2ηz )|v = N|v, S |v =
2
+ 1 |v, for |v ∈ Vm(N) ⊂ V .
2 2
The Fock space V of the L-site Hubbard model is isomorphic to the tensor product
(V0(0) ⊕ V1(1) ⊕ V0(2) )⊗L as a (u(1) ⊕ sl(2))-module. We decompose the Fock space V
(N)
into the direct sum of Vm . From the first relation in (2.8), each Bethe state |k, λ; sN,M
(N) (N) (N)
is the highest weight vector belonging to VN−2M = V2S . The characters of Vm are
calculated as
y m+1 − y −m−1
ch Vm(N) = x N .
y − y −1
In terms of the characters, the identity (4.11) can be rewritten as
(0) (1) (2) L
(N)
ch V0 + ch V1 + ch V0 = Z(L; N, M) ch VN−2M . (4.12)
0N2L
0M N/2
(N)
Theorem 4.6 (Multiplicity formula). The multiplicity of the irreducible component VN−2M
in the tensor product (V0 ⊕ V1 ⊕ V0 )⊗L is given by Z̃(L; N, M),
(0) (1) (2)
(ii) dim V = (N − 2M + 1)Z̃(L; N, M). (4.14)
0N2L
0M N/2
The identity (i) in Corollary 4.7 reproduces the combinatorial completeness of Bethe
states for the Hubbard model with so(4) symmetry obtained by Essler, Korepin and
Schoutens [3]. The factor (L − N + 1)(N − 2M + 1) in (i) corresponds to the dimension of
the highest weight so(4)-module VL−N,N−2M with the highest weight vector |k, λ; sN,M .
In Essler–Korepin–Schoutens’ proof of (i) in Corollary 4.7, they take the sum on N
after taking that on M. In our proof, the sums on N and M are taken “simultaneously” in
the level of characters.
286 A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290
by considering the particle–hole symmetry of the system (2.7). Thus we speculate that the
identity (ii) in Corollary 4.7 accounts for the combinatorial completeness of Bethe states
for the system with the charge-u(1) and the spin-sl(2) symmetries.
The identities (4.7) also enabled us to get the explicit form of Z(L; N, M) through the
binomial theorem.
Corollary 4.8. We obtain the summation formulas for Z(L; N, M) and Z̃(L; N, M),
L+2 L L L+2
(i) Z(L; N, M) = − ,
M N −M M −1 N −M +1
L L L L
(ii) Z̃(L; N, M) = − . (4.15)
M N −M M −1 N −M +1
In the framework of Bethe ansatz, we have studied the Hubbard model with the
AB-flux that controls the symmetry of the system. In Section 3 we have shown the
existence of solutions for Lieb–Wu equations with an arbitrary number of up-spins and
one down-spin. We have found that the number of (k–Λ–2)-solutions increases as the
so(4) symmetry reduces to the spin-sl(2) symmetry (Proposition 3.2). The number of Bethe
states is consistent with the string hypothesis only in the so(4)-case. In Section 4 we have
investigated the combinatorial formulas giving the combinatorial completeness of Bethe
states. We have shown that the number of Bethe states can be interpreted as the multiplicity
of irreducible components in the tensor products of so(4)-modules (Theorem 4.5). Essler–
Korepin–Schoutens’ combinatorial formula is reproduced by the relation (4.9) among
the characters of so(4)-modules (Corollary 4.7). An advantage of our approach is that
we can obtain the summation formula (4.15) for Z(L; N, M). We have also proposed
a new combinatorial formula derived from the relation (4.12) among the characters of
(u(1) ⊕ sl(2))-modules. The formula is related to the combinatorial completeness of
Bethe ansatz in the sl(2)-case (Corollary 4.7). It should be remarked that, in Section 3,
we have not proved the uniqueness of solutions. Although the Lieb–Wu equations may
have other solutions that we have not expected, the combinatorial formulas introduced in
Section 4 give an evidence that, for M = 1, we have found solutions enough to verify the
combinatorial completeness of Bethe ansatz. The problem is open for M 2.
A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290 287
The combinatorial completeness of Bethe ansatz has not been discussed for the one-
dimensional isotropic Heisenberg spin chain with twisted boundary conditions. Kirillov’s
identity [12] also produces the following formula:
Pn + Mn + n N
Z̃(N; M) = = ,
Mn M
{M
n } n1
M= nMn
where Pn = N − 2M + 2 m(>n) (m − n)Mm . It is expected that, if the redistribution
phenomenon of solutions for Bethe equations [4,8,9] is taken into consideration, the
formula corresponds to the system with twisted boundary conditions. We remark that the
formula also appears in the different context [20].
Acknowledgements
The authors would like to thank Prof. A. Kuniba and Dr. T. Takagi for valuable
discussion. They also would like to thank Prof. M. Wadati and Dr. M. Shiroishi for helpful
comments. One of the authors (A.N.) appreciates the Research Fellowships of the Japan
Society for the Promotion of Science for Young Scientists. The present study is partially
supported by the Grant-in-Aid for Encouragement of Young Scientists (A) No. 14702012.
Fig. 4. The real parts of solutions for Lieb–Wu equations with L = 20 and N = 2. The dashed lines correspond
to the centers ζ of (k–Λ–2)-string solutions with even m, and the dots express the centers ζ of (k–Λ–2)-string
solutions with odd m and their redistribute real solutions.
288 A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290
Fig. 5. The imaginary parts ξ of solutions for Lieb–Wu equations with L = 20 and N = 2.
{
i | i = 1, 2, . . . , N} and m. But, for 0 < U < L8 , real solutions with repeating indices
may appear at the same time as (k–Λ–2)-string solutions disappear. Such redistribution of
type of solutions is observed in the isotropic Heisenberg model for a large number of sites
[4,8,9]. Here we numerically investigate such phenomenon for the Hubbard model with
L = 20 and N = 2 [3].
A. Nishino, T. Deguchi / Nuclear Physics B 688 [FS] (2004) 266–290 289
As we have already mentioned, (k–Λ–2)-string solutions with odd m may disappear for
0 < U < L8 (see Fig. 3). For each m, the critical value of U is exactly given by [3],
8 π 8
U (m)
= − cos m < .
L L L
Plotted on Fig. 4 are the center ζ = πL m (m = 11, 12, . . ., 29) of (k–Λ–2)-string solutions
and redistributed real solutions {k1 , k2 } for L = 20 and N = 2 in varying the value of U .
Plotted on Fig. 5 are their imaginary parts. The (k–Λ–2)-string solutions with odd m
disappear for U < U (m) and, at the same time, real solutions with repeating indices
{
1 ,
2 ; m} = { m2 , m2 ; m} appear. Note that, as U → 0, all the (k–Λ–2)-string solutions on
Fig. 4 approach to the wavenumbers of lattice free fermion system. Thus, when we apply
the combinatorial formulas in Corollary 4.7 to the case, we must count the number of Bethe
states by regarding the real solutions with repeating indices as (k–Λ)-string solutions.
References
[19] A. Kuniba, T. Nakanishi, Z. Tsuboi, The Bethe equation at q = 0, the Möbius inversion formula, and weight
(r)
multiplicities, III: the XN case, Lett. Math. Phys. 59 (2002) 19–31.
[20] A. Kuniba, M. Okado, T. Takagi, Y. Yamada, Vertex operators and partition functions in box–ball systems,
RIMS Kokyuroku 1302 (2003) 91–107.
[21] E.H. Lieb, F.Y. Wu, Absence of Mott transition in an exact solution of the short-range, one-band model in
one dimension, Phys. Rev. Lett. 20 (1968) 1445–1448.
[22] D.C. Mattis, The Many-Body Problem, An Encyclopedia of Exactly Solved Models in One Dimension,
World Scientific, 1993.
[23] A. Montorsi, The Hubbard Model, World Scientific, 1992.
[24] A. Nishino, T. Deguchi, Bethe-ansatz studies of energy-level crossings in the one-dimensional Hubbard
model, Phys. Rev. B 68 (2003) 075114.
[25] B.S. Shastry, B. Sutherland, Twisted boundary conditions and effective mass in Heisenberg–Ising and
Hubbard rings, Phys. Rev. Lett. 65 (1990) 243–246.
[26] M. Shiroishi, H. Ujino, M. Wadati, SO(4) symmetry of the transfer matrix for the one-dimensional Hubbard
model, J. Phys. A: Math. Gen. 31 (1998) 2341–2358.
[27] M. Shiroishi, M. Wadati, Integrable boundary conditions for the one-dimensional Hubbard model, J. Phys.
Soc. Jpn. 66 (1997) 2288–2301.
[28] M. Takahashi, One-dimensional Hubbard model at finite temperature, Prog. Theor. Phys. 47 (1972) 69–82.
[29] M. Takahashi, Low-temperature specific-heat of one-dimensional Hubbard model, Prog. Theor. Phys. 52
(1974) 103–114.
[30] M. Takahashi, Thermodynamics of One-Dimensional Solvable Models, Cambridge Univ. Press, Cambridge,
1999.
[31] F. Woynarovich, Excitations with complex wavenumber in a Hubbard chain: states with one pair of complex
wavenumbers, J. Phys. C: Solid State Phys. 15 (1982) 85–96.
[32] C.N. Yang, η pairing and off diagonal long range order in a Hubbard model, Phys. Rev. Lett. 63 (1989)
2144–2147.
[33] C.N. Yang, S.C. Zhang, SO(4) symmetry in a Hubbard model, Mod. Phys. Lett. B 4 (1990) 759.
Nuclear Physics B 688 (2004) 291–295
www.elsevier.com/locate/npe