The Economist's View of Combinatorial Games
The Economist's View of Combinatorial Games
The Economist's View of Combinatorial Games
MSRI Publications
Volume 29, 1996
Introduction
365
366 ELWYN BERLEKAMP
player with the higher bid plays Left (= Black), in return for which he pays µ
dollars to his opponent. The player with the lower bid plays Right (= White).
Each player is necessarily happy with this assignment, since it is no worse for
him than the bid he submitted.
For mnemonic purposes, the reader should think of µ as the market value.
Later, we shall see that it can also be interpreted as the mean value, or, in
thermography, as the mast value.
Our ideal economist also uses competitive auctions to determine the sizes of
the moves. Throughout the game, the economist requires that each move made
by either player must be accompanied by a fee or “tax”, which the mover must
pay to his opponent. At the beginning of the game, the tax rate is determined by
a competitive auction. Each player submits a sealed bid. The bids are opened,
and the higher one becomes the new tax rate. The player making this bid makes
the first play at the new tax rate. Then, at each turn, a player may elect to either
• pay the current tax (to his opponent) and play a move on the board, or
• pass at the current tax rate.
After two consecutive passes at the current tax rate, a new auction is held to
restart the game at a new and lower tax rate. Legal bids must be strictly less
than the prior tax rate. There is also a minimum legal bid, tmin . A player
unwilling to submit a bid ≥ tmin may pass the auction. The game ceases when
both players pass the auction.
If the players submit equal bids, the referee may let an arbitrator decide
who moves next. The arbitrator is disreputable: Each player suspects that the
arbitrator may be working in collusion with his opponent. Alternatively, when
there are equal bids for the tax rate, the referee may require that the next move
be made by the player who did not make the prior move. In this way, the referee,
at his sole discretion, can implement a “preference for alternating moves”.
In some traditional games, including Japanese Go, the minimum tax rate
tmin is defined as 0. When the game stops, the score is tallied according to the
position of the board, and the corresponding final payment of “score” is made.
This “score” payment is added to the tax payments that the players have paid
each other, and to the payment(s) to buy the more desirable color(s) initially.
Since all payments are zero-sum, one player’s net gain is his opponent’s net loss.
The player who realizes an overall net gain is the winner. In Economic games,
a tied outcome occurs whenever each player’s total net cumulative payment to
his opponent is zero.
For Winning Ways–style games, which include mathematical play on “num-
bers” beyond the conventional mathematical “stopping positions”, tmin is defined
to be −1. The minimum tax rate can also be defined as −1 for some non-Japanese
versions of Go rules.
When such a game ceases, no score remains on the board: the outcome de-
pends entirely on the net cumulative total of payments made between the players.
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 367
1. A Demonstration Game
We now hold a demonstration using the Economist’s Rules. The players are
Yonghoan Kim and David Wolfe.
The referee, EB, gives each player $5 initial capital, and they get change at
the bank, in cash and chips valued as follows:
$5, $1 = US notes
Q = US quarter
G = green chip, worth 1/4 of a unit, now set at $1
Y = yellow chip, worth 1/16 of a unit
1 In order to pay the tax for the first move, DW first gets change from YK: $1 for
1Q + 2G + 4Y. He then puts $1.50 worth of chips (4G + 8Y) onto the table for YK.
368 ELWYN BERLEKAMP
1.1. Erdős’ Book. The mathematician Paul Erdős frequently mentions “The
Book”, a supernatural reference document that contains all mathematical truths
in their most revealing forms. To expedite the remainder of this demonstration,
we allow the participants and ourselves to peek into The Book. Figure 1 expresses
Z 1 Z 3 Z 1
2∗
0| 1 || − 38 | − 12
4
2+ ∗ 1
2 −1 + 2 ∗ 1 34
1
1∗ 2
1
µ = 2 32 t = 1 15
32 µ= 3
4 t = 1 18
(albeit in perhaps unfamiliar symbols) the Book values for the two starting posi-
tions. The values also contain more information than we need to play optimally
according to the Economist’s Rules. In addition to a single good move for each
player at the current tax rate, for each position we need only its fair market
value, µ, and one of its fair tax rates, t (also called the “temperature”). These
values are shown on the bottom line of Figure 1.
The remainder of our demonstration will illustrate how players able to peek
into The Book can play optimally by following a few very simple common-sense
rules, based on two important facts:
• The value of playing Left in any region is µ.
• Although the “size of a move” in any region is not necessarily unique, it may
always safely be taken as the temperature of that region.
The Book Strategy is as follows:
0. Initially, bid µ for the privilege of playing Left.
1. At every stage of play, define the global temperature of the game as the
maximum of the temperatures of its regions. If it is your turn, proceed as
follows:
(a) If opponent’s prior move has created a region whose temperature exceeds
the current tax rate, respond by playing in the same region as the opponent’s
prior move.
(b) Otherwise, if the temperature of the game is at least as large as the current
tax rate, play in a region of maximum temperature.
(c) If the tax rate exceeds the maximum regional temperature, pass. If you
have any canonical move(s), bid the maximum regional temperature.
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 369
Since the Book Strategy focuses on this question, it is also called Sentestrat.
After the demonstration, we will discuss how Sentestrat compares with two other
strategies, Hotstrat and Thermostrat.
Once Yonghoan Kim and David Wolfe obtain access to The Book, each of
them elects to follow the Book Strategy.
9 1 3
µ= 16 t = 1 16 µ= 4 t = 1 18
9 1
µ= 16 t = 1 16 µ = 1 87 t = 1 81
Figure 2. The demonstration game: moves 1 and 2 played according to The Book.
DW begins playing a White Go stone in the center of the bottom row. This
is shown as move 1 in Figure 2. The cost of this move was $1.50, as determined
by the initial auction. According to The Book, that bid was a slight error; the
15
true Book value of the initial temperature was 1 32 . However, White’s move 1
1
has lowered the temperature of the Go region to 1 16 . The Domineering region
has temperature 1 81 , which is larger. So, now that they are able to reference The
Book, after move 1, both players decline to play at a tax rate of $1.50. Instead,
they both bid $1.125.
Since the bids are tied, the play alternates, and YK is selected to play move 2
at a cost of $1.125. Since this is less than the $1.50 worth of chips he has on
the table, YK pockets $0.375 of these chips, and moves the rest of the chips
back to DW in payment of the tax on move 2. YK plays move 2 as the Vertical
domino shown. According to The Book, this $1.125 move has increased µ for
the Domineering region from 34 to 1 78 , but it has not changed the temperature,
370 ELWYN BERLEKAMP
9 1 11 3
µ= 16 t = 1 16 µ= 16 t = 1 16
9 1
µ= 16 t = 1 16 µ = 1 87 t = 1 81
9 1 3
µ= 16 t = 1 16 µ= 4 t=1
µ = 1 85 t= 7
8 µ= 3
4 t=1
µ = 1 85 t= 7
8 µ = − 41 t=1
µ = 1 85 t= 7
8 µ= 7
8 t = 1 18
at the tax rate of 1 18 . The referee holds a new auction to determine who will
1
move next, and both players bid 1 16 . Because his opponent made the last move
1 1
at the old tax rate, YK pockets chips worth 16 , leaving chips worth 1 16 on his
side of the table. Since the auction was tied, the referee adopts the alternating
move convention and assigns the next move to YK, who pays the new tax rate by
moving all of the $1.0625 worth of chips on the table to DW as he plays move 6
on the Go position, as shown in Figure 4. According to The Book, move 6
9 1
increases the local µ to 16 + 1 16 = 1 85 , and decreases the Go temperature to
7
8 . The Domineering region is now the hottest game, with temperature t = 1.
1
As the old tax rate was 1 16 , both players elect to pass and bid t = 1. DW
1
pockets chips worth 16 , because his opponent made the last move at the old
tax rate. The value of the chips on the table is then $1, on DW’s side. The
referee assigns the next move to DW, who pays the chips back to YK and plays
move 7, followed by YK’s move 8 and DW’s move 9 (in Figure 5). The net
effect of the three-move sequence 7–9 was a payment of $1 of chips from DW
to YK, in return for a $1 change in the value of the Domineering µ in DW’s
favor.
After move 9, both players pass and bid 78 . The tax rate decreases by 18 , which
YK pockets, leaving $0.875 worth of chips on the table. These remaining chips
372 ELWYN BERLEKAMP
µ = 1 85 t= 7
8 µ = − 41 t= 3
4
10
µ = 2 21 t= 1
2 µ = − 41 t= 3
4
11
µ = 2 21 t= 1
2 µ = −1 t = − 12
12
µ=3 t=0 µ = −1 t = − 12
are transferred to DW as YK plays move 10. This move increases the Go value
of µ by the same amount as the tax paid, 2 21 − 1 85 = 78 .
After move 10, both players pass and bid 34 . The tax rate decreases by 18 ,
which DW pockets, leaving $0.75 worth of chips on the table. These remaining
chips are transferred to YK as DW plays move 11 on the Domineering board.
This move decreases the Domineering value of µ by the same amount as the tax
paid, −1 − (− 41 ) = − 34 .
After move 11, both players pass and bid 12 . The tax rate decreases by 14 ,
which YK pockets, leaving $0.50 worth of chips on the table. These remaining
chips are transferred to DW as YK plays move 12 on the Go board. This move
has the effect of increasing the Go value of µ by the same amount as the tax
paid, 3 − 2 21 = 12 .
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 373
13
µ=3 t=0 µ = −1 t = − 12
14
µ=3 t = −1 µ = −1 t = − 12
15
µ=3 t = −1 µ = − 21 t = − 21
16
µ=3 t = −1 µ = −1 t = −1
According to The Book, after move 12, the temperature of the Go region is
0 and the temperature of the Domineering region is − 21 . So both players pass
at the tax rate of 12 , and submit bids of 0 for the new tax rate. DW pockets all
$0.50 worth of chips on the table. The new tax rate is now 0, which equals the
value of the chips on the table.
DW plays move 13 in Figure 6, which is the first move at the tax rate of 0.
YK responds with move 14, at the same tax rate of 0.
After move 14, The Book reveals that both the Go position and the Domi-
neering position have negative temperatures. So both players would prefer not
to play. In fact, neither will play unless his opponent pays him to move! So
each player bids −$0.50. This means that neither player will move unless his
opponent pays him $0.50. The new tax rate is negative! So DW collects $0.50
374 ELWYN BERLEKAMP
17
µ=3 t = −1 µ=0 t = −1
18
µ=3 t = −1 µ = −1 t = −1
19
µ=3 t = −1 µ=0 t = −1
20
µ=3 t = −1 µ = −1 t = −1
in cash from YK as he plays move 15. But YK responds with move 16, in return
for which he gets his $0.50 back.
After move 16, The Book reveals that both positions have temperature equal
to negative one. Both players pass at the tax rate of − 21 , and submit bids of −1
to set the new tax rate.
DW collects $1 cash from YK for move 17 in Figure 7, but YK collects it back
for move 18. The players continue to “cash in” their positions, at the rate of
$1/move, on moves 19 and 20 in Figure 7, then 21, 22, and 23 in Figure 8.
According to Japanese-style Go rules, the play ceases after move 23. The Go
position is scored as +3 points for Black, and White is required to play Black
$3 to settle this score.
According to the mathematized (Universalist) Go rules, the game continues
after move 23. White submits no bid. Black submits the minimum bid that the
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 375
21
µ=3 t = −1 µ=0 t = −1
22
µ=3 t = −1 µ = −1 t = −1
23
µ=3 t = −1 µ=0
rules allow, which is −1. So Black wins the right to play at a tax rate of −1. In
Figure 9, he plays move 24 as a Black Go stone in the middle of the three empty
spaces, and collects $1. (According to Universalist Go rules, the Black Go stones
are now “immortal”, and cannot be captured even if Black fills in both of the
visible eyes.)
Again, White passes and submits no bid. So Black again submits the minimum
allowable bid of −1, and plays move 25 as another Black Go stone, for which
he again collects $1 from White. Again White has no option but to pass and
submit no bid. Black again submits the minimum allowable bid of −1, wins the
auction, and plays move 26 as another Black Go stone, for which he collects his
final $1 from White.
24 25
26
After move 26, both players pass. Neither player has any remaining legal
move, so neither can bid. The game finally ends, and our demonstration is
concluded.
1.2. Theorem: Book Play Ensures No Monetary Loss. After the initial
auction has determined who plays Left and who plays Right, then either player
may elect to use the Book Strategy to select his subsequent tax rate bids and
plays. The following theorem asserts that this strategy is optimal for games
played according to the Economist’s Rules:
Theorem. If Left plays Book Strategy, she ensures a total net gain of at least
µ dollars. If Right plays Book Strategy, he ensures that Left’s total net gain will
be at most µ dollars.
2. Thermography
2.1. The Economist’s Ko Rules. The game of Go includes some loopy posi-
tions. By far the most common such positions are 2-cycle loops. Locally, White
captures a stone, Black captures it back, then White again captures Black’s
stone, etc. These positions are called kos. All popular dialects of Go rules pro-
hibit such loopiness, by banning the immediate recapture of a single stone if it
would return the board to the same position it reached after mover’s prior turn.
The extension of thermography to cover kos is facilitated by the following
extensions of the Economist’s Rules:
For each region of the game, a specified player is designated as the local
Komaster. In principle, the privilege of being Komaster might be determined by
local auctions, held before the play begins. If the global game contains several
regions with potential kos, then it is quite possible for Left to be Komaster of
some regions, and Right to be Komaster of others. (These rules do not attempt to
model positions such as the classical triple ko, in which multiple interdependent
kos appear within a single region.)
According to the Economist’s Rules, kos are resolved by overruling the pref-
erence for alternating moves. Any move that repeats an overall game position
is allowed. If the mover is not the Komaster of the region wherein the move is
made, there are no further restrictions. However, if the local Komaster makes a
move that repeats the global board position (and, necessarily, the local regional
position), I call this local ko position critical. The Komaster, who just moved to
the critical ko position, is compelled to make another local move immediately.
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 377
This rule for resolving ko overrides the referee’s normal preference for alternating
moves. Instead, the Komaster makes two moves consecutively, to and from the
critical ko position, and the Komaster pays two taxes (at the same tax rate),
one tax for each move.
Scaffolds, Hills, and Caves. The final stages of the construction of a ther-
mograph begin with another pair of trajectories LS and RS, called scaffolds
(Figure 10, left). In the most general case, the Left and Right scaffolds might
cross several times. A temperature interval in which the Left scaffold exceeds
the Right scaffold is a hill, or solid region. A temperature interval in which the
Right scaffold exceeds the Left scaffold is a cave, or gaseous region. Within a
hill, the Left wall equals the Left scaffold and the Right wall equals the Right
scaffold. However, within a cave, the Left and Right walls coincide in a local
trajectory called the mast.
The lowest region of a nondegenerate Thermograph is necessarily a hill. The
interval along the v-axis, or “ground”, between the walls of this hill is the base
of the hill. The point at which the two scaffolds cross at the top of the hill is
both the summit of the hill and the drain of the cave above it. If the scaffolds
378 ELWYN BERLEKAMP
LS
S
(degenerate)
cave
RW
LW
hill
cave
RS RW
LS hill LW
Figure 10. Scaffolds (left), and the thermograph constructed therefrom (right).
cross again at the top of the cave, that point is called the chimney of the cave,
and the fulcrum of the hill above it.
Normal Masts. Within any cave, the normal mast can be easily constructed
by following the path of an ideal balloon from the drain to the chimney. When
free to do so, the balloon rises vertically. If it hits a scaffold, the balloon follows
that roof scaffold upward until it is either again free to rise vertically or until
it reaches the chimney, at which point the mast ends and the Left and Right
walls again become different, starting at the fulcrum of the adjacent hill. This is
shown on the right in Figure 10. Moving upward from t = 0, this thermograph
has a hill, a cave, another hill, and then an infinite gaseous region (a degenerate
cave) through which the mast rises vertically to infinity. All hot games have
thermographs with a hill at t = 0 and a vertical mast as t approaches infinity.
The walls give the values of the economic game with a current tax rate of t.
If the thermograph is a hill region at temperature t, then each player desires
to move first. By doing so, he can ensure a value given by the corresponding wall.
If the thermograph is a cave region at temperature t, then at least one player
prefers to pass until the tax rate becomes lower. If the mast is vertical, then both
players prefer to pass, and neither will play until the tax rate is lowered. If the
mast coincides with a roof of the cave which is a Left scaffold, then Left prefers
to play and Right prefers to pass. If it is Right’s turn, he will pass and propose
a tax cut, but Left will then reject Right’s proposal. Left will instead exercise
his right to move at the old, higher tax rate. In a subsequent subsection we will
see a specific example of this surprising phenomenon, where it is to a player’s
advantage to insist on playing at a higher tax rate rather than at a lower one.
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 379
A B
=B
24 −9
Let G and H be a pair of games that represent the two states of a ko. G is a
Left follower of H, and H is a Right follower of G. In the general case, G and H
may also have other followers; thus we may have
For such a ko, we construct two pairs of thermographs. Ĝ and Ĥ assume that Left
is the regional Komaster; Ǧ and Ȟ assume that Right is the regional Komaster.
From Ĥ, Left may play to a close relative of G, which we call G.
380 ELWYN BERLEKAMP
AL BR
B
RS
(Ǎ
)
Ǎ
)
(Aˇ
LS
24 13 0 −9
B̌ BR
LS(B̌) RS(B̌)
t = 11
L
Ǎ = B̌
24 13 2 −9
t = 11
 B̂
24 13 2 −9
Figure 12, left, shows a Go position called a seki. This position is terminal
because competent players will pass. Mathematically, its value is
24| −10||26| −8 = 0.
Its thermograph is a vertical mast at v = 0.
Figure 12, middle, shows G, a famous position from traditional Go literature
called the “thousand-year Ko”. Its game graph is shown on the right in the same
figure. From G, White can play either to the seki of value 0 just mentioned, or
to B, the ko of Figure 11. Left has two similar options, whose thermographs are
superimposed in Figure 13. From Ǧ, if t < 5, Left prefers to play to the seki;
but if t > 5, Left prefers to start the ko. Figure 14, left, shows the result of
382 ELWYN BERLEKAMP
0 0
C D A B
23 −10 24 −9
Figure 12. The seki of value 0 (left) is one option for White in the “thousand-year
Ko” (middle), whose game graph is shown on the right.
t = 11
C
t=5
23 12 0 −10
translating the maximal Right wall of ǦL into the Left scaffold of Ǧ. Figure 14,
right, shows RS(Ǧ).
The thermograph of Ǧ, shown in Figure 15, left, illustrates some features of
the thousand-year ko that most intermediate Go players find counterintuitive.
If the global temperature exceeds 11, both players will prefer to play elsewhere
rather than on Ǧ. The activation temperature of Ǧ is 11. At temperatures
between 11 and 7, White prefers to play elsewhere, but Black is eager to start
the ko from Ǧ. White will win this ko, but doing so costs him two moves and
it costs Black only one move to start the ko. Thus, at temperatures between 11
and 7, the higher the global temperature at which Black can start the kofight,
the happier she is.
ko
t
ar
R sek
pl
S(
st
ay
ˇG)
t = 11
i
t=9
LS t k
st
(G o
ar
ˇ)
sta
t=5 rt
ko
ki
se
ay
pl
10 −5 9 0 −9
t = 11
t = 7 23
t=7
Ĝ
t=3
Ǧ
10 −3 −9 23 7 23 0
t=1
X Y Z
t=0
1 0 −1 1 0 0 −1 0
For a general loopfree game, the shape of the thermograph at the base of its
mast depends on the sign of the infinitesimal by which the game differs from
384 ELWYN BERLEKAMP
a number at its freezing point, and the information contained in the values of
these infinitesimals is not present in the thermographs. We will return to this
topic in Section 2.5.
However, at sufficiently high temperatures, all thermographs become masts,
and their mast values do add. Therefore, knowing the thermographs of all of the
summands is sufficient to determine the thermograph of the sum at sufficiently
high temperatures.
Means and Temperatures. The µ-value of the initial, infinite, vertical mast,
which we call µ(G), is the “count”, or the “mean value” of G.
The temperature at the base of this mast, which we call t(G), is the “size of
the move”, or the “temperature” of G.
Placid Kos and Hyperactive Kos. A game G is said to be “hyperactive” if
µ(Ĝ) > µ(Ǧ). Such a game G necessarily contains a position K that is a ko. It
is not the size of K itself that makes G hyperactive, but special circumstances
which include the fact that K is a follower of some position of G that is cooler
than K.
A game G is said to be “placid” if µ(Ĝ) = µ(Ǧ). If G is placid, then when
played according to Economic Rules, µ(G) does not depend on kothreats. Many
kos, even many rather big kos, turn out to be placid.
Stable and Unstable Positions. Thermographs with more than a single hill
and a single (degenerate) cave are relatively rare. In the common case, stable
and unstable positions can be defined relatively simply, as follows:
Definitions. A position H of a game G is stable in G if all ancestors of H in
G have temperatures greater than H’s. H is unstable or transient in G if it has
an ancestor of lower temperature. If H is not unstable, but has one or more
ancestors of temperature equal to its own, it is called semistable.
In the more general case, in which some positions may have thermographs with
multiple hills and caves, a single position may have multiple activation temper-
atures, and so a more refined definition is required:
Refined Definitions. G is stable in G, and its activation temperature is the
base of its infinite mast. H is stable in G if its thermograph is a vertical mast at
the activation temperature of its nearest stable ancestor.
Semi-stable positions can be treated as stable or as unstable. Treating them as
stable yields the simplest and most convenient economic analyses. Treating them
as unstable leads to more refined, more complicated algorithms that exploit the
calculus of infinitesimal games to get the last move at each t, whenever possible.
Sentestrat players will choose to play “in another region” “elsewhere” only
when the region just played in has reached a stable position.
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 385
Sente and Gote. From the perspective of Sentestrat, an opponent’s move that
increases the temperature has sente; it requires our immediate response. How-
ever, this move and its response might begin a longer sequence of moves through
several transient positions before the game reaches the next stable position. The
question of whether the overall sequence had sente or gote depends on whether
the length of this alternating sequence of moves was odd or even, and that ques-
tion can usually be answered by examining the thermograph of the originating
stable state. Generally speaking, if, just below the base of its relevant mast, both
walls branch outward, then the initial move by either player is gote. If Left’s wall
drops vertically but Right’s branches outward, then Left’s initial move is sente
but Right’s is reverse sente. If Right’s wall drops vertically but Left’s branches
outward, then Right’s initial move is sente and Left’s move is reverse sente.
These correspondences between the Go player’s view of sente and gote and the
thermograph are correct whenever the relevant positions have significantly differ-
ent temperatures. However, when there are intermediate semi-stable positions,
the distinction between sente and gote may become blurred.
Mainlines and Sidelines. Let n be a large integer, and suppose that two gurus
play the sum of n copies of a game G. For each H that is a position of G, let
#(H) be the number of copies of G that pass through the position H. If #(H)
approaches infinity as n approaches infinity, then H is a “mainline” position of
G. But if #(H) remains bounded as n approaches infinity, then H is a “sideline”
position of G.
If G is a traditional gote position, then #(GL ) = #(GR ) = n/2, and both
G and GR are mainline positions. However, if G is a position in which Left’s
L
move is sente and Right’s move is reverse sente, then, depending on who plays
first from n.G, #(GR ) = 0 or 1, and #(GL ) = n or n − 1. In either case, GR is
a sideline position of G.
Thermographs of sideline positions have no effect on the means of any of their
ancestors, although they often do affect the temperature(s) of some of their near
ancestors.
We illustrate these notions with an example.
Let A be the starting position on the right, be-
fore any of the numbered stones have been played. 2 1
Then moves 1, 2, 3, and 4 show a main line of
play, through positions we’ll call B, C, D, and E.
3 4
Position E is the terminal position after all four
numbered stones have been played. F is the position if White rather than Black
had played the first stone at 1. The means and temperatures of all dominant
positions are shown in Table 1.
The only stable mainline positions are A, C, and E, each having mean 1. The
mean of A is unaffected by any small change in the mean or temperature of any
other position. Hence, to verify that the mean of A is 1, it is unnecessary to
386 ELWYN BERLEKAMP
know the values of other means and temperatures precisely; appropriate bounds
are sufficient.
In general, a stable mainline gote position will have both a mainline Black
follower and a mainline White follower. On the other hand, stable mainline sente
positions have only one mainline follower. In this example, the line of play shown
is the unique main line. Notice that the mainline moves do not alternate colors;
White plays both moves 2 and 3.
If we played a sum of n copies of A concurrently on n different Go boards,
then the local line of play would follow the main line shown above on all but
one or two boards. If White gets the first move overall, he plays one exceptional
board from A to a sideline position, F. But Black then plays another board from
A to B, and White responds by continuing to C. That happens on all n − 1
boards, successively. Black then might play a second exceptional board from C
to CL . But White then plays another C to D, and White responds by continuing
to E. That sequence, from C to D to E, then happens on all boards, successively,
until all positions are terminal.
The sum of n copies of A is only one example of a rich environment, which
we shall explore further in Section 4.3. If a single copy of A is played in any
sufficiently rich environment, the play within A will follow the main line shown
above. Heuristically, the reason that Black rather than White will make the first
move from A is that Black can move there at any tax rate between 7 and 3;
White cannot afford to play A until the tax rate is 3. After A has been played
to B and C, both players make several moves elsewhere if possible. Eventually,
any time after the tax rate drops below 1.5, White can play at C. Since Black
cannot afford to play there until the tax rate is 1, White will get in his move
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 387
first, unless the background is so impoverished that there are no suitable moves
“elsewhere”.
Smaller Thermographic Databases of G. Given a game, G, and its purported
thermograph, how much information about the positions of G is needed to verify
that the purported thermograph of G is correct? Evidently, considerably less
than the thermographs of all of the positions of G. Many positions of G are
transient. Often the only information needed about many of the followers of
transient positions is that they are “sufficiently hot”, and this bound can easily be
quantified. Precise thermographs of many sideline positions are also unnecessary;
it is required only that portions of such thermographs fall between specified
bounds.
How much information is needed to verify the purported mean of G? Evidently,
even less than is needed to verify G’s thermograph.
2.4. Top-Down Thermography. “Top-down” thermography promises to be
much more efficient than “bottom-up” thermography. The latter computes the
details of many walls of positions that the former could ignore.
A proposed top-down approach to finding the thermograph of a position G
starts by considering a reasonably large set of lines of play from G, such as
GLRL1 LRLRRLR . Most relevant lines do not follow alternating play. Indeed, em-
pirical evidence indicates that Go positions, such as the thousand-year ko of
Figure 12, which have different dominant Left options depending on the temper-
ature, are fairly rare. So the main reason that there are typically many relevant
lines of play from G is that there may be many relevant sequences of Left and
Right plays, rather than choices between HL1 and HL2 .
The proposed approach to computing the “top-down” thermograph of G as-
sumes that the given set of lines of play contains the “complete” set of G’s stable
positions. This assumption allows a program to infer that missing positions are
sufficiently hot.
One is often interested in only the high-temperature portions of thermo-
graphs. For this reason, it would eventually be desirable to have programs that
do all thermographic computations from a top-down perspective. Each trajec-
tory might be stored as a list of slopes and breakpoints, listed in top-down order.
A better programming strategy is to view each trajectory as a function whose
outputs are a list of slopes and breakpoints, generated in the top-down order.
This function is able to call other functions and itself, recursively, as needed, in
order to pursue the top-down computation of thermographs. The programming
goal is to get the next output as quickly as possible, by looking no further than
necessary.
t=0
X Y Z
t = −1
1 0 −1 1 0 0 −1 0
fuzzy positive negative
1 1 1 3
2 4 8 8
1 0 1 0 1 0 1 0
1 1 1
2
∗ 2
↑ 2
↓ 1
2
1 0 1 0 1 0 1 0
1
Figure 19. Thermographs of games infinitesimally close to 2
.
1 0 −1
Figure 20. The foundations of number-ish thermographs.
of all number-ish loopfree games, in the temperature range between 0 and −1.
Figure 21 illustrates where the thermographs of two particular numbers appear
in Figure 20.
If one views an arbitrary conventional thermograph from a positive temper-
ature perspective, it may appear to be resting on “solid ground”, consisting of
the v-axis at t = 0. However, if one probes deeper as in Figure 20, one finds that
at t = −1 all thermographs rest on a discrete set of pilings located at the integer
values. At t = − 21 , there are twice as many pilings located at half-integers, etc.
An earlier, less universal, version of Figure 20, based on overheating rather
than on the Economist’s Rules allowing negative temperature, appeared on page
172 of Winning Ways.
t=0
t = − 18
t = − 14
t = − 12
1
2
− 38
t = −1
1 0 −1
1
Figure 21. Underground thermographs of 2
and − 38 .
390 ELWYN BERLEKAMP
Thermography always gives the unique value for the count, µ(G), and a usable
value for the size of the move, t(G). We now look at some of the theorems that
these quantities satisfy.
• Sentestrat: If your opponent has just played in one of these hill regions, re-
spond directly by playing in the same region in which he just moved.
All three strategies essentially agree under other circumstances. If all thermo-
graphs are caves at temperature t, then pick one of them whose mast touches
your own scaffold at temperature t, if any such exist. When you are about to play
a move of this type, you should decline any tax cut proposal that may be pend-
ing. If the only hill move at the current temperature is banned by ko, then pass
but do not lower the bid tax rate. If there are no moves of any of the previously
mentioned types, then you should pass and propose a lower tax rate. Sentes-
trat proposes the tax rate equal to the next lowest value at which you would
be willing to move, namely, the drain of a cave or the point at which a vertical
mast touches your (necessarily jagged) scaffold inside the cave. Thermostrat
calculates other trajectories in hopes of getting a stronger result. Thermostrat
computes the sum of all of your opponent’s walls from Sentestrat’s proposed
tax rate on downwards, and to this sum it adds the width of the widest cave
at each temperature. It then picks the temperature at which this trajectory is
most favorable. Often, that is the same temperature as Sentestrat’s, but on some
examples (including one shown in Winning Ways), Thermostrat takes advantage
of an opponent’s recent mistake to outperform Sentestrat, even though both do
equally well against a good opponent.
Hotstrat’s Failure. Although it usually recommends the same moves as Sen-
testrat and Thermostrat, Hotstrat plays poorly in some situations. An example
is the following sum A + B, as played by Left, going second:
Thanks to the footnote on page 367, all payments made during our demonstration
followed these rules.
Except for kos, conventional play may be viewed as economic play with a
preference for alternating moves, and with the proviso that chips are worthless.
In a typical economic game, the pile of chips on the table moves back and
forth between the two players frequently. Only when the tax rate is lowered is
a player able to move some of these chips off of the table and into his pocket.
Economically, this is an apparent gain for the first player to move at the new
and lower temperature. The player who made the last move at the old, higher
temperature sees an immediate loss of chips. However, if she is is following
Sentestrat, which ensures no net overall economic loss, then she will eventually
receive a compensating payment later. If this occurs after the tax rate has
become negative, then that payment will be in cash.
So the conventional player who gets the last move at some temperature would
appear to receive a benefit equal to the change of temperature.
There is an alternative, heuristic view that leads to similar conclusions without
assuming that the chips are worthless. In this view, the big pile of chips on the
table at early stages of the economic game is very likely to be pocketed in many
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 393
Forecast = LW(G) + 12 t.
But if it is Right’s turn to play from G and the current global temperature is t,
then
Forecast = RW(G) − 12 t.
We note that if the players both follow Sentestrat, then the forecast does not
change during any interval of ko-free play at which the temperature remains con-
stant. But, when Left moves from G, of temperature told , to GL , of temperature
tnew , the forecast changes from
If Left followed Sentestrat, then RW(GL ) = LW(G) + told , and the effect of
Left’s move was not only to lower the temperature from told to tnew , but also to
increase the forecast by
∆Forecast = 12 |∆t|.
In general, whenever the temperature drops by ∆t, the forecast may be revised by
up to 12 |∆t| in favor of the player who made the last move at the old temperature.
Since either player may choose not to follow Sentestrat, other moves may
occur that cause the economist to revise his forecast. However, it is not hard to
see that all such forecast revisions are adverse to the player who moves. Hence,
moves that cause forecast revisions adverse to the mover are called sacrificial.
From this viewpoint, all moves can be partitioned into three sets:
1. Temperature-lowering moves improve the forecast by up to 12 |∆t|.
2. Sacrificial moves degrade the forecast.
3. All other moves leave the forecast unchanged.
Sacrificial moves may be either voluntary or compulsory, depending on whether
or not any alternative option was available. An examination of Sentestrat reveals
the only condition under which a sacrifice can be compulsory:
394 ELWYN BERLEKAMP
Theorem. The economic forecasts for G + MEE and G + SEE are the same as
the economic forecast for G. If G+SEE is played by two gurus using Conventional
Rules, the economic forecast of the final score will be precisely correct.
If our economist provides us with all of the relevant means and temperatures,
we can play G + SEE against an opposing guru. Even in the conventional game,
we can elect to play Sentestrat, subject to the proviso that whenever any move
permitted by Sentestrat involves a ko, we will prefer the ko move, and if there are
several permitted ko moves, we will play the same one (if any) that we previously
played most recently.
It is not hard to see that this policy ensures that the overall result is at least
as favorable as the forecast. If a placid ko has temperature t, then the standard
enriched environment also contains a bountiful supply of switches at the same
temperature. Our policy ensures that the ko will be resolved before all of these
switches are played. So, whenever we are faced with a koban, we have another
(switch) move at the same temperature that serves us just as well.
• the Economist’s forecast for G + Un is the same as the forecast for G, and
• the forecast of G + Un is precisely accurate.
Hyperactive Kos. Even in a very rich environment, kothreats can have a big
effect on the final score if the game contains a position that is a hyperactive ko.
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 397
Max
Amt Book Max $ Loss Chip Loss
Buyer Item Bid Value DW YK DW YK
YK Black in −Go 2 14 1
2 32 7
32
DW Black in −Go 1 34 1
2 32 9
32
YK Domin. Left 1 12 3
4
3
4
1 3 1
DW Domin. Left 2 4 4
DW First move 1 34 1 15
32
9
32
9
64
YK First move 1 14 1 15
32
7
32
7
64
43 69 9 7
Total mistakes 64 64 64 64
From the economist’s perspective, the µ-value of such a ko depends on who is its
master, and this, in turn, depends on a global tally of kothreats. And, en route
to elegance and simplicity, combinatorial game theory suppresses much kothreat
information from the canonical form. Then even more kothreat information
is suppressed when the information in the canonical form is condensed into a
thermograph. So, if there is a hyperactive ko, a precise analysis requires more
data than the mathematics has retained. Of course, µ(Ĝ) and µ(Ǧ) still provide
firm bounds; but for some positions these bounds may be quite far apart.
4.4. Dollar Accounting. Economic Rules facilitate some very detailed ac-
counting, which identifies the moves at which mistakes were made and the cost
of each mistake. Table 2 shows an example.
In these auctions, every Book bid was intermediate between the bids submit-
ted by the two players, and all maximum bidding losses were realized.
Altogether, in these auctions, the average of the two players made 1 point
worth of bidding errors, 78 in dollars, and 18 in chips. DW was better than this
average; YK was worse. The absolute difference of each from the average was
13 1
64 dollars minus 64 chips.
After all play was concluded, the final result (after converting chips back to
cash) was that DW ended with $5.1875; YK, with $4.8125. Since both players
3
played all moves according to perfect Book strategies, YK’s net loss of $ 16
resulted entirely from the fact that his bidding errors cost more than DW’s.
The economist’s accounting can be readily extended to itemize any errors that
occur in play.
398 ELWYN BERLEKAMP
Total chips
still on Temper-
DW YK the table ature
−2G − 4Y −2G − 4Y At start 4G + 8Y 1 12 (bid)
1G + 2Y (after move 1) 3G + 6Y 1 18
1
1Y (after move 5) 3G + 5Y 1 16
1Y (after move 6) 3G + 4Y 1
7
2Y (after move 9) 3G + 2Y 8
3
2Y (after move 10) 3G 4
1
1G (after move 11) 2G 2
2G (after move 12) 0 0
+ Y − Y
= + 1
16 − 1
16 Net total chips gained from playing
(both players used perfect Book play)
Table 3. Pocketed chip ledger for the demonstration game of Figures 2–6.
4.6. Forecasting Records. Economists studying history like the Big Picture,
which, for the demonstration game, is shown in Figure 22. As usual, µ is plotted
along the reversed horizontal axis; t, on the vertical. Each point plotted in this
figure shows the global [µ, t] values of the full game after the numbered move.
Figure 22 records the values of the total µ and the maximum t after moves
0–12 of the demonstration game. The sequence terminates when the temperature
reaches 0. The values of µ oscillate back and forth around the forecast, and each
decrease in the tax rate is accompanied by a revision in the forecast, as illustrated
in Table 4.
The magnitude of the revision is one half the decrease in the tax rate, and
the sign of the revision favors the player who made the last move before the tax
rate decrease.
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 399
114
3
8 2,4 1
5
→
1
6 7
temperature
9
3
10 4
→
2 11
1
4
12 13
1 5 27 1
1 16 1 16 1 32 6 + 32
1 2 3
8 1 7
8 7–9 − 16
1
7 3 13 1
8 1 8 1 16 10 + 16
3
4 2 1
4 1 7
8 11 − 81
1 1 3
2 1 2 1 4 12 + 41
0 2 2 13 . . .
Table 4. Forecasting record.
4.7. The Komi in Go. Professional Go games begin with an empty board.
By symmetry, µ = 0. The player who makes the first move is required to give
his opponent some number of points in return. This number of points is called
the komi. To prevent ties, the komi is traditionally chosen as a half-integer. The
komi is specified by the tournament directors. Values of the komi in modern
tournaments have ranged from 5.5 to 8.5 points.
400 ELWYN BERLEKAMP
5. Incentives
In loopfree combinatorial games without taxes, the best moves are those
for which the mover has a maximum incentive. If the position is G, Left’s
incentive is ∆L (G) = GL − G; Right’s incentive is ∆R (G) = G − GR . If
G = A + B + C + · · · + F, then ∆L (G) and ∆R (G) are the sets
∆L (G) = {∆L (A), ∆L (B), ∆L (C), . . . , ∆L (F)}
∆R (G) = {∆R (A), ∆R (B), ∆R (C), . . . , ∆R (F)}
It is helpful to consider the union of both sets:
∆ = ∆L , ∆R .
When temperatures are negative, both incentives equal the temperature, and
values are numbers that are equal to the means. However, when temperatures
are nonnegative, both values and incentives are necessarily games that are more
complicated than numbers. To manipulate these games, familiarity with classical
combinatorial game theory is required [Berlekamp et al. 1982; Conway 1976].
The partial ordering of the incentives plays an important role in the mathe-
matical analysis of various lines of play [Berlekamp and Kim 1996]. If ∆L (A) >
∆L (B), Left will prefer to play on A rather than on B. If ∆R (A) > ∆L (B), then
Left’s move on B is reversed through Right’s response on A. Reversibility is a
technical criterion well known in combinatorial game theory.
Every incentive is a game that has 0 as a right follower. Conversely, any game
in which 0 is a right follower is one of its own right incentives. So no incentive
can be 0 or positive.
Many pairs of incentives are incomparable. On the other hand, temperatures
of incentives, being numbers, can be totally ordered (although ties will occur in
cases of equal temperatures). Thus, a wise prelude to an investigation of the
partial ordering of the incentives is to compute each incentive’s temperature,
and then sort them accordingly. If temp(∆L (A)) > temp(∆L (B)), this does
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 401
not imply that ∆L (A) > ∆L (B), but it certainly precludes the possibility that
∆L (B) ≥ ∆L (A).
In general, temp(A − B) ≤ max{temp(A), temp(B)}. However, in the special
case of incentives, we can assert that for any G, there is at least one left incentive
such that
temp ∆L (G) = max temp(G), temp(GL ) .
This is true because if the temperature of the incentive were less, then we could
pick a value of t between the temperature of the incentive and the temperature
of G and GL . Cooling the incentive-defining equation by t would yield
∆L (Gt ) = GLt − Gt ,
where neither term on the right side is a number, but their difference is. But this
contradicts another theorem of combinatorial game theory, which states that if
H = Gt is not a number, then it has at least one left incentive that exceeds all
negative numbers.
The same arguments can be repeated for right incentives.
In earlier sections of this paper, we showed that for economic rules, when
players bid to set a new tax rate, either player can safely select the temperature
of the full board position as his bid, except in the case when he has no legal
move. This leads directly to the important conclusion that initial tax rate bids
may be submitted independently of the bids to play Left’s side of the game.
When the tax rate is fair and both players are competent, it makes no economic
difference who moves first.
However, it is in many ways more natural to view the temperature as a func-
tion of the move rather than as a function of the initial position. In that sense,
temperature is a natural function of the incentive. Sentestrat still works.
∆L (G) = GL − G
= {x|y} + {0|| −y | −x}
= {x||x−y |0, x|y |||0}
= {x−y, x|y ||0}
= H.
As shown in Figure 23, the thermographs of H can have any of three different
forms, depending on how x compares with 2y and 3y.
Corresponding to Figure 23, we have these expressions for the thermal disso-
ciations of G and H, where v = {v |0||0} and v = {0||0| −v}:
Z 14 (x+y) Z 12 (x−y)
x+y
Cases 1 and 2: G = + ∗ + ε1 ,
4
where ε1 = ∗| 12 (x−3y) + 12 (3y −x)|0 is fuzzy with 0
Z y
Case 3: G=y+ x−3y
Case 1: H=G
Z x−2y
Case 2: H=G+ ε2 ,
where ε2 = 0, 3y −x|0||x−3y − 3y −x|0||x−3y > 0
Z 1 Z y
x−y 2 (x−y)
Case 3: H=
2
+ ∗ + x−3y
(Note: We have H > G, since v |0 > v v .)
It is interesting to note that the Yedwab–Moews construction of NP-complete-
ness requires no games of Case 2, and only their coolest game is of Case 3. All
but one of their summands are of Case 1, in which left and right incentives are
equal.
404 ELWYN BERLEKAMP
x+y
4
Case 1: y ≤ x < 2y
x−y
0<x 2 H G
x+y
x−y 4
x−y
2 2
x − 2y
Case 2: 2y ≤ x < 3y H G
x−y
2
y
Case 3: 3y ≤ x
H G
x−y x−y y 0 y 0
2
References
[Berlekamp and Kim 1996] Elwyn R. Berlekamp and Yonghoan Kim, “Where is the
thousand dollar ko?”, pp. 203–226 in this volume.
[Berlekamp and Wolfe 1994] Elwyn R. Berlekamp and David Wolfe, Mathematical Go:
Chilling Gets the Last Point, A. K. Peters, Wellesley, MA, 1994. Also published in
paperback, with accompanying software, as Mathematical Go: Nightmares for the
Professional Go Player, by Ishi Press International, San Jose, CA.
[Berlekamp et al. 1982] Elwyn R. Berlekamp, John H. Conway, and Richard K. Guy,
Winning Ways for Your Mathematical Plays, Academic Press, London, 1982.
[Conway 1976] John H. Conway, On Numbers and Games, Academic Press, London,
1976.
THE ECONOMIST’S VIEW OF COMBINATORIAL GAMES 405
[Hanner 1959] Olof Hanner, “Mean play of sums of positional games”, Pacific J. Math.
9 (1959), 81–99.
[Kim 1995] Yonghoan Kim, “New values in Domineering and loopy games in Go”,
Ph.D. thesis, Mathematics Department, University of California at Berkeley, May
1995.
[Milnor 1953] John Milnor, “Sums of positional games”, pp. 291–301 in Contributions
to the Theory of Games, vol. 2 (edited by H. W. Kuhn and A. W. Tucker), Ann. of
Math. Stud. 28, Princeton University Press, Princeton, 1953.
[Moews 1993] David Moews, “On some combinatorial games connected with Go”,
Ph.D. thesis, Mathematics Department, University of California at Berkeley,
December 1993.
[Yedwab 1985] Laura J. Yedwab, “On playing well in a sum of games”, M.Sc. Thesis,
MIT, 1985. Issued as MIT/LCS/TR-348, MIT Laboratory for Computer Science,
Cambridge, MA.
Elwyn Berlekamp
Department of Mathematics
University of California at Berkeley
Berkeley, CA 94720
berlek@math.berkeley.edu