Mixed Strategy Nash Equilibrium: Sanjay Singh

Mixed Strategy Nash Equilibrium
Sanjay Singh
1
Department of Information and Communication Technology
Manipal Institute of Technology, MAHE
Manipal-576104, INDIA
sanjay.singh@manipal.edu
2
Centre for Artificial and Machine Intelligence (CAMI)
MAHE, Manipal-576104, INDIA
April 10, 2021
Sanjay Singh MSNE 1
A strategy profile with an outcome which is simultaneously the smallest number in

its row and the largest number in its column are the best response to each other, and
that both players are guaranteed to do no worse than this value.
Definition (Saddle Point)

An outcome in a matrix game (with payoff to row player) is called a saddle point if
the entry at that outcome is both less than or equal to any entry in its row, and greater
than or equal to any entry in its column.
Definition (Saddle Point Principle)

If a matrix game has a saddle point, both players should play a strategy which
contains it.
Definition (Game Value)

For any matrix game, if there is a number v such that Rose (row player) has a strategy
which guarantees that she will get at least v, and Colin (column player) has a strategy
which guarantees that Rose will get no more than v, then v is called the value of the
game.
Sanjay Singh MSNE 2

Colin
A B C D
A 4 2 5 2
Rose B 2 1 -1 -20
C 3 2 4 2
D -16 0 16 1
All four of the circled outcomes are saddle point

These are the only saddle points
The strategy profile (B, A) is not a saddle point
If the maxmin of rows and minmax of columns are the same,
then they appear at saddle point strategies
When the maxmin and minmax are different, the game has no
saddle point
Sanjay Singh MSNE 3
Only sensible plan is to use

Colin some random device to decide
A B
Rose which strategy to play
A 2 -3
B 0 3 For example, Rose may flip a
coin to decide between A and B
There is no saddle point in this Such a plan, which involves
game playing a mixture of strategies
Neither player would want to according to certain fixed
play a single strategy with probabilities, is called mixed
certainty strategy
Other player could take The contrasting plan of playing

advantage of such a choice one strategy with certainty is
called a pure strategy
Sanjay Singh MSNE 4

To analyze the effect of one or both players using mixed strategy,
we use the concept of expected value
The expected value of getting payoffs a1 , . . . , ak with respective
probabilities p1 , . . . , pk is given by
p1 a1 + p2 a2 + . . . + pk ak
Expected Value Principle

If you know that your opponent is playing a given mixed strategy, and
will continue to play it regardless of what you do, you should play
your strategy which has the largest expected value.
Sanjay Singh MSNE 5
Colin
A B
Rose
A 2 -3
B 0 3
Consider the scanario wherein Colin decides to use mixed

strategy with different probabilities
Might there be some choice of probabilities which Rose could
not take advantage of?
To find, suppose Colin plays a mixed strategy with probabilities
x for A and (1 − x) for B
The Rose’s expected value for A and B is given by:
Rose A: x(2) + (1 − x)(−3) = −3 + 5x
Rose B: x(0) + (1 − x)(3) = 3 − 5x
Sanjay Singh MSNE 6

Rose will not be able to take advantage of Colin’s mixed strategy
if these two expected values are same:
3
−3 + 5x = 3 − 3x ⇒ x =
4
3 1
If Colin plays mixed strategy A and B, Colin can assure that
4 4
3
Rose wins on average, no more than unit per game, regardless
4
of how Rose plays
Sanjay Singh MSNE 7
Figure 1: Game with oddments, and mixed strategy probability calculation
Value of the game is computed as:

2 × 6 + (−3) × 2 3
= using R1
6+2 4
0×6+3×2 3
= using R2
6+2 4
2×3+0×5 3
= using C1
3+5 4
(−3) × 3 + 3 × 5 3
= using C2
3+5 4
Sanjay Singh MSNE 8
Now consider the games with 2 × n or m × 2
If such a game doesn’t have saddle point, it has a mixed strategy
solution to one of its 2 × 2 subgames
For large m or n there could be large number of 2 × 2 subgames
to try
There is an elegant graphical way to find which 2 × 2 subgame
gives solution to the game
Sanjay Singh MSNE 9
For each Rose strategy, mark the payoff if Colin

plays A on the left axis
Payoff if Colin plays B on the right axis, and connect
them with a line
The vertical coordinate of this line above any point x
gives Rose’s expected payoff if Colin plays mixed
strategy (1 − x)A, xB
If Rose knew Colin’s mixed strategy, she would take
advantage by choosing best response (i.e outcome
would lie on the upper envelope of graph)
Colin would choose x to make the corresponding
payoff to Rose as small as possible to get the lowest
point on the upper envelope
Such a point (circled) is at the intersection of lines for
Rose A and Rose B
Sanjay Singh MSNE 10

The appropriate game to solve is
Colin Rose
A B probabilities
Rose
A 2 -3 2/7
B 0 2 5/7
Colin prob 5/7 2/7
Rose expectations if Colin plays (5/7)A, (2/7)B:
5 2 4
Rose A: (2) + (−3) =
7 7 7
5 2 4
Rose B: (0) + (2) =
7 7 7
5 2 5
Rose C: (−5) + (10) = −
7 7 7
Colin expectations if Rose plays (2/7)A, (5/7)B, (0)C:
2 5 4
Colin A: (2) + (0) + (0)(−5) =
7 7 7
2 5 4
Colin B: (−3) + (2) + (0)(10) =
7 7 7
All of Rose’s expectations are ≤ 4/7, and all of Colin’s

expectations are ≥ 4/7
Colin assures that Rose will not get get more than 4/7 and Rose
assures that she will not get less than 4/7
Thus, the value of the game is 4/7

Interpretation of these results on graph
The lowest point on the upper envelope lies

2 5
above x = ( of the way toward Colin A)
7 7
The vertical coordinate of this point (i.e
4
horizontal projection on left axis) is at ,
7
the value of the game
2
The vertical project on Rose C at x = has
7
5
vertical coordinate − , which is lower than
7
the value of the game, hence Rose C should
not be used
A 2 × n game
A B C D E
A -2 5 1 0 -4
B 3 -3 -1 3 8
Colin would try to choose her strategy to

stay on the lower envelope
Rose would want to choose x to get the
highest point on the lower envelope, which
is obtained at intersection of A and C
The reduced 2 × 2 game has only strategy
A and C for Colin
The optimal mixed strategies are
2 5 4 3
Colin : ( , 0, , 0, 0), Rose : ( , )
7 7 7 7

What is game value?
Rose expectation against Colin optimal
2 5 1
Rose A: (−2) + (1) =
7 7 7
2 5 1 The value of the game
Rose B: (3) + (−1) = 1
7 7 7 is
Colin expectation against Rose optimal 7
4 3 1 All of Colin’s
Colin A: (−2) + (3) = expectations against
7 7 7 1
4 3 11 Rose optimal are ≥
Colin B: (5) + (−3) = 7
7 7 7 Even if one had been
4 3 1 1
Colin C: (1) + (−1) = less than , we would
7 7 7 7
4 3 9 infer that there is error
Colin D: (0) + (3) = in claimed solution
7 7 7
4 3 8
Colin E: (−4) + (8) =
7 7 7
m × n game
Theorem (Minimax Theorem)

Every m × n matrix game has a solution. That is, there is a unique
number v, called the value of the game, and there are optimal (pure or
mixed) strategies for Rose and Colin such that
1 if Rose plays her optimal strategy, Rose’s expected payoff will be
≥ v, no matter what Colin does, and
2 if Colin plays her optimal strategy, Rose’s expected payoff will
be ≤ v, no matter what Rose does.
The solution of a m × n can always be found as the solution to

some k × k subgame of original game
The pure strategies which are involved in solution are called
active strategies
Active strategies should be played according to certain
probabilities, and other strategies should not be played at all
Colin
A B C
Rose
A 1 2 2
B 2 1 2
C 2 2 0
Let us solve this 3 × 3 game by method of equalizing

expectations
This game does not have a saddle point
Suppose Colin plays A,B,and C with probabilities
(x, y, 1 − x − y), the Rose’s expectations are:
Rose A: x(1) + y(2) + (1 − x − y)(2) = 2 − x
Rose B: x(2) + y(1) + (1 − x − y)(2) = 2 − y
Rose C: x(2) + y(2) + (1 − x − y)(0) = 2x + 2y
Rose will be unable to take advantage of Colin’s mixed strategy

if these expectations are equal:
2 − x = 2 − y = 2x + 2y
These equations give a system of two equations in two
unknowns:
x−y=0
2x + 3y = 2
2
On solving, we get x = y =
5
Hence,
Colin’s
expectation-equalizing mixed strategy is
2 2 1
, ,
5 5 5
By symmetry of the matrix, this is also Rose’s
expectation-equalizing mixed strategy
These strategies do give a solution, and the value of the game is
8
5
The method of expectation-equalization fails if the solution to
the 3 × 3 game involves a 1 × 1 or 2 × 2 subgame
We should check for saddle point and dominance before trying
for expectation-equalization
If equalizing expectation fails, then we should for a 2 × 2
solution
The best way to do is to use graphical analysis on three 2 × 3
subgames
A B C
A 1 2 2
B 2 1 2
C 2 2 0
A B C A B C A B C
A 1 2 2 B 2 1 2 A 1 2 2
B 2 1 2 C 2 2 0 C 2 2 0
Some definitions
Definition (Pure Strategy)

A pure strategy defines a specific move or action that a player will
follow in every possible attainable situation in a game. Such moves
may not be random, or drawn from a distribution, as in the case of
mixed strategies
Definition (Mixed Strategy)

A strategy consisting of possible moves and a probability distribution
(collection of weights) which corresponds to how frequently each
move is to be played. A player would only use a mixed strategy when
she is indifferent between several pure strategies, and when keeping
the opponent guessing is desirable - that is, when the opponent can
benefit from knowing the next move.

Mixed Strategies
Consider a strategic form game Γ = hN, (Si ), (ui )i

Item of Si are called actions or pure strategy of player i
If player i chooses a strategy in Si according to a probability
distribution, we have mixed strategy
Definition (Mixed Strategy)

Given a player i with Si as the set of pure strategies, a mixed strategy
(a.k.a randomized strategy) σi of player i is a probability distribution
over Si . That is, σi : Si 7→ [0, 1] is a mapping that assigns to each pure
strategy si ∈ Si , a probability σi (si ) such that
X
σi (si ) = 1.
si ∈Si
A pure strategy of a player, say si ∈ Si , can be considered as a mixed

strategy that assigns a probability 1 to si and probability 0 to all other
strategies of player i
Such a mixed strategy is called a degenerate mixed strategy and is
denoted by e(si ) or si
If Si = {si1 , si2 , . . . , sim }, then set of all mixed strategies of player i is
the set of probability distribution on set Si
∆(Si ) =
 
 m
X 
m
(σ , σ , . . . , σim ) ∈ R : σij ≥ 0 for j = 1, . . . , m and σij = 1
 i1 i2 
j=1
Set ∆(Si ) is called the mixed extension of Si

Using the mixed extension of strategy set, we can define a mixed
extension of the pure strategy game Γ = hN, (Si ), (ui )i
The mixed extension of Γ is denoted as ΓME = hN, (∆(Si )), (Ui )i
For i = 1, 2, . . . , n, Ui is a mapping that maps mixed strategy profiles
to real numbers: Ui : ×i∈N ∆(Si ) → R or
Ui : ∆(S1 ) × ∆(S2 ) × . . . × ∆(Sn ) 7→ R
Given σi ∈ ∆(Si ) for i = 1, 2, . . . , n we compute Ui as follows
We make the standard assumption that the randomization of
individual players are mutually independent
It implies that given a mixed strategy profile (σ1 , . . . , σn ), the
random variables σ1 , . . . , σn are mutually independent
Joint probability of a pure strategy profile (s1 , s2 , . . . , sn ) is given
by Y
σ(s1 , . . . , sn ) = σi (si )
i∈N
The payoff functions Ui are defined as

X
Ui (σ1 , . . . , σn ) = σ(s1 , . . . , sn )ui (s1 , . . . , sn )
(s1 ,...,sn )∈Si
Ui (σ1 , . . . , σn ) for i = 1, . . . , n is the expected payoff when the

profiles (s1 , . . . , sn ) are chosen according to joint distribution σ
When there is no confusion we’ll write ui instead of Ui , that is,
instead of Ui (σ1 , . . . , σn ) we’ll write ui (σ1 , . . . , σn )
Mixed Strategies in BOS Problem
2
1
A B
A 2,1 0,0
B 0,0 1,2
Suppose (σ1 , σ2 ) is a mixed strategy profile

It means that σ1 is a probability distribution on S1 = {A, B} and
σ2 is a probability distribution on S2 = {A, B}
Lets represent σ1 = (σ1 (A), σ1 (B)); σ2 = (σ2 (A), σ2 (B))
We have S = S1 × S2 = {(A, A), (A, B), (B, A), (B, B)}, we’ll
compute the payoff function u1 and u2
X
For i = 1, 2, ui (σ1 , σ2 ) = σ(s1 , s2 )ui (s1 , s2 )
(s1 ,s2 )∈S

The function u1 can be computed as
u1 (σ1 , σ2 ) = σ1 (A)σ2 (A)u1 (A, A) + σ1 (A)σ2 (B)u1 (A, B)

+ σ1 (B)σ2 (A)u1 (B, A) + σ1 (B)σ2 (B)u1 (B, B)
= 2σ1 (A)σ2 (A) + σ1 (B)σ2 (B)
= 2σ1 (A)σ2 (A) + (1 − σ1 (A))(1 − σ2 (A))
= 1 + 3σ1 (A)σ2 (A) − σ1 (A) − σ2 (A)
Similarly, we can get

u2 (σ1 , σ2 ) = 2 + 3σ1 (A)σ2 (A) − 2σ1 (A) − 2σ2 (A)
2 1 1 2
Suppose σ1 ( , ) and σ2 ( , ), then
3 3 3 3
2 2
u1 (σ1 , σ2 ) = ; u2 (σ1 , σ2 ) =
3 3
Values of u1 (σ1 , σ2 ) and u2 (σ1 , σ2 ) are same due to the structure
of payoff matrix
Digression
Inequalities
Inequalities and intervals

x>a (a, ∞)
x≥a [a, ∞)
x<a (−∞, a)
x≤a (−∞, a]
Addition and subtraction property for inequalities
If a < b then a + c < b + c
If a > b then a − c > b − c
Addition or subtraction does not change the sense of inequality
Multiplication and division property for inequality
If a < b AND c is positive then ac < bc
a b
If a < b AND c is positive then <
c c
If a < b AND c is negative then ac > bc
a b
If a < b AND c is negative then >
c c

Solving linear inequalities
Procedure for solving linear inequalities in one variable is
similar to solving basic equations
We need to be careful about the sense of the equality when
multiplying or dividing by negative numbers
Solving non-linear inequalities
Rewrite the inequality so that there is a zero on the right side
Find all linear factors of the function
To find the critical values, set each linear function to zero and
solve for x
Determine the sign of the function in the intervals formed by the
critical values
The solution will be those intervals in which the function has the
correct signs satisfying the inequality
Solve inequality x2 − 3 > 2x.

x2 − 2x − 3 > 0, which can be factored to give (x + 1)(x − 3) > 0
Setting both factors to 0, we get critical values as x = −1, and
x=3
These critical values divide the number line into 3 intervals
x < −1
−1 < x < 3
x>3
For the interval, x < −1, the value of (x + 1) and (x − 3) will be
negative
So, in the interval x < −1, the value of function x2 − 2x − 3 will
be −ve × −ve = +ve
We continue doing this for the other 2 intervals and summarize
the results in a table (given on next slide) as

Interval (x + 1) (x − 3) Sign of f (x)
x < −1 - - +
−1 < x < 3 + - -
x>3 + + +
We are solving for (x + 1)(x − 3) > 0

The intervals that satisfy this inequality will be those where f (x)
has a positive sign
Hence, the solution is either x < −1 or x > 3
BRock BScissors BPaper

ARock 0 1 -1
AScissors -2 0 2
APaper 1 -1 0
1 1 1
, . -1/3 0 1/3
3 3 3
1 1 1
, . 0 -1/4 1/4
4 4 2
2 1 2
, . 0 0 0
5 5 5
p1 , p2 , p3 p1 .0 + p2 .(−2) + p3 .1 p1 .1 + p2 .0 + p3 .(−1) p1 .(−1) + p2 .2 + p3 .0
Smallest entries in first six rows are, -1,-2,-1,-1/3, -1/4, and 0

Among the six strategies, the one with largest guarantee for player A is the
2 1 2
strategy mix, , .
5 5 5
Remember that there are infinitely many possible rows
Might there be rows with better guarantees?

Mixed Maxmin Strategy
B
A
Left Middle Right
Up A1,1 , B1,1 A1,2 , B1,2 A1,3 , B1,3
Middle A2,1 , B2,1 A2,2 , B2,2 A2,3 , B2,3
Down A3,1 , B3,1 A3,2 , B3,2 A3,3 , B3,3
Player A’s task is to maximize the minimum of three values:

p1 .A1,1 + p2 .A2,1 + p3 .A3,1
p1 .A1,2 + p2 .A2,2 + p3 .A3,2
p1 .A1,3 + p2 .A2,3 + p3 .A3,3
3
X
by choosing her probabilities p1 , p2 , and p3 such that pi = 1
i=1
We can reformulate the problem stated on last slide as

1 Player A has to choose four values v, p1 , p2 , and p3 satisfying
2 p1 ≥ 0
p2 ≥ 0
p3 ≥ 0
p1 + p2 + p3 = 1
v ≥ p1 .A1,1 + p2 .A2,1 + p3 .A3,1
v ≥ p1 .A1,2 + p2 .A2,2 + p3 .A3,2
v ≥ p1 .A1,3 + p2 .A2,3 + p3 .A3,3
3 Such that v is maximized
It is a linear program, which can be solved using simplex
method

Mixed Strategy Nash Equilibria
Definition (Mixed Strategy Nash Equilibria)
Given a strategic form game Γ = hN, (Si ), (ui )i, a mixed strategy
profile σ1∗ , . . . , σn∗ is called a Nash equilibrium if ∀i ∈ N,
ui (σi∗ , σ−i
∗ ∗
) ≥ ui (σi , σ−i ) ∀σi ∈ ∆(Si ).
The best response function bi (.) is defined as

bi (σ−i ) = {σi ∈ ∆(Si ) : ui (σi , σ−i ) ≥ ui (σi0 , σ−i
∗
) ∀σi0 ∈ ∆(Si )}
That is, given σ−i , bi (σ−i ) is the set of all mixed strategies of
player i, each of which is a best response mixed strategy of
player i when the rest of the players are playing the strategy
profile σ−i
Then, a mixed strategy profile (σ1∗ , . . . , σn∗ ) is a Nash
equilibrium iff
σi∗ ∈ bi (σ−i
∗
) ∀i ∈ N
Mixed Strategy Nash Equilibria for BOS Game
Let (σ1∗ , σ2∗ ) be a mixed strategy equilibrium of BOS game, then

u1 (σ1∗ , σ2∗ ) ≥ u1 (σ1 , σ2∗ ) ∀σ1 ∈ ∆(S1 )
u2 (σ1∗ , σ2∗ ) ≥ u2 (σ1∗ , σ2 ) ∀σ2 ∈ ∆(S2 )
Any solution that satisfies above two equations is a mixed
strategy Nash equilibrium
Recall following equations from previous example
u1 (σ1 , σ2 ) = 1 + 3σ1 (A)σ2 (A) − σ1 (A) − σ2 (A)

u2 (σ1 , σ2 ) = 2 + 3σ1 (A)σ2 (A) − 2σ1 (A) − 2σ2 (A)
Writing first two inequalities using above equation yields

3σ1∗ (A)σ2∗ (A) − σ1∗ (A) ≥ 3σ1 (A)σ2∗ (A) − σ1 (A) ∀σ1 ∈ ∆(S1 )
3σ1∗ (A)σ2∗ (A) − 2σ2∗ (A) ≥ 3σ1∗ (A)σ2 (A) − 2σ2 (A) ∀σ2 ∈ ∆(S2 )

Rewriting last two equations
3σ1∗ (A)σ2∗ (A) − σ1∗ (A) ≥ 3σ1 (A)σ2∗ (A) − σ1 (A) ∀σ1 ∈ ∆(S1 )
3σ1∗ (A)σ2∗ (A) − 2σ2∗ (A) ≥ 3σ1∗ (A)σ2 (A) − 2σ2 (A) ∀σ2 ∈ ∆(S2 )
The above equations are equivalent to:
σ1∗ (A) (3σ2∗ (A) − 1) ≥ σ1 (A) (3σ2∗ (A) − 1) ∀σ1 ∈ ∆(S1 )
σ2∗ (A) (3σ1∗ (A) − 2) ≥ σ2 (A) (3σ1∗ (A) − 2) ∀σ2 ∈ ∆(S2 )
There are three possible cases:
Case 1: 3σ2∗ (A) > 1, this leads to the pure strategy Nash equilibrium (A,A)
Case 2: 3σ2∗ (A) < 1, this leads to the pure strategy Nash equilibrium (B,B)
Case 3: 3σ2∗ (A) = 1, this leads to the mixed strategy Nash equilibrium (σ1∗ , σ2∗ )
2 ∗ 1 1 2
Hence, σ1∗ (A) = ; σ1 (B) = ; σ2∗ (A) = ; σ2∗ (B) =
3 3 3 3
Properties of Mixed Strategies

Mathematical Preliminaries
Definition (Convex set)

A set C is convex if, for any x, y ∈ C and λ ∈ R with 0 ≤ λ ≤ 1,
λx + (1 − λ)y ∈ C
Examples of a convex set (a) and a non-convex set (b)

Definition (Convex function)
A function f : Rn → R is convex if its domain (denoted D(f )) is a convex set, and if,
for all x, y ∈ D(f ) and λ ∈ R with 0 ≤ λ ≤ 1,
f (λx + (1 − λ)y) ≤ λf (x) + (1 − λ)f (y).
Graph of a convex function
Definition (Convex combination)

Given real numbers y1 , y2 , . . . , yn , a convex combination of these numbers is a
weighted sum of the form λ1 y1 + λ2 y2 + . . . + λn yn , where
n
X
0 ≤ λi ≤ 1 for i = 1, 2, . . . , n; λi = 1
i=1
Proposition
Let Γ = hN, (Si ), (ui )i be a strategic form game. Then ui (σi , σ−i ) can
be expressed as the convex combination:
X
ui (σi , σ−i ) = σi (si )ui (si , σ−i )
si ∈Si
where  
X Y
ui (si , σ−i ) =  σj (sj ) ui (si , s−i ).
s−i ∈S−i j6=i
Interpretation
Implication of this result is that the payoff for any player under mixed
strategy can be computed as a convex combination of the payoffs
obtained when the player plays pure strategies with the rest of the
players playing σ−i .

Example
Suppose N = {1, 2}, S1 = {x1 , x2 , x3 , x4 , x5 } and

X
u1 (σ1 , σ2 ) = σ1 (s1 )u1 (s1 , σ2 )
s1 ∈S1
= σ1 (x1 )u1 (x1 , σ2 ) + σ1 (x2 )u1 (x2 , σ2 )
+ σ1 (x3 )u1 (x3 , σ2 ) + σ1 (x4 )u1 (x4 , σ2 )
+ σ1 (x5 )u1 (x5 , σ2 )
Suppose S2 is a finite set

Let u1 (x1 , σ2 ) = 5; u1 (x2 , σ2 ) = u1 (x3 , σ2 ) = 10 and
u1 (x4 , σ2 ) = u1 (x5 , σ2 ) = 20
u1 (σ1 , σ2 ) =
5σ1 (x1 ) + 10σ1 (x2 ) + 10σ1 (x3 ) + 20σ1 (x4 ) + 20σ1 (x5 )
u1 (σ1 , σ2 ) =
5σ1 (x1 ) + 10σ1 (x2 ) + 10σ1 (x3 ) + 20σ1 (x4 ) + 20σ1 (x5 )
The maximum value of convex combination is 20, which is
obtained when
σ1 (x4 ) = 1 or σ1 (x5 ) = 1 or
σ1 (x4 ) + σ1 (x5 ) = 1, and
σ1 (x1 ) + σ1 (x2 ) + σ1 (x3 ) = 0 or σ1 (x1 ) = σ1 (x2 ) = σ1 (x3 ) = 0
Also, max u1 (σ1 , σ2 ) = 20
σ1 ∈∆(S1 )
max u1 (σ1 , σ2 ) = max u1 (s1 , σ2 )
σ1 ∈∆(S1 ) s1 ∈S1

Let ρ ∈ {σ1 ∈ ∆(S1 ) : u1 (σ1 , σ2 ) ≥ u1 (σ10 , σ2 ) ∀σ10 ∈ ∆(S1 )}
⇐⇒ρ(x4 ) + ρ(x5 ) = 1
⇐⇒ρ(x1 ) + ρ(x2 ) + ρ(x3 ) = 0
⇐⇒ρ(x1 ) = ρ(x2 ) = ρ(x3 ) = 0
⇐⇒ρ(y) = 0 ∀y ∈
/ arg max u1 (s1 , σ2 )
s1 ∈S1
Proposition
Given a strategic form game hN, (Si ), (ui )i, then, for any σ ∈ ×i∈N ∆(Si )
and for any player i ∈ N,
max ui (σi , σ−i ) = max ui (si , σ−i )

σi ∈∆(Si ) si ∈Si
Furthermore
ρi ∈ arg max ui (σi , σ−i )
σi ∈∆(Si )
iff
ρi (x) = 0 ∀x ∈
/ arg max ui (si , σ−i )
si ∈Si
Necessary and Sufficient Condition for a Profile to be a Mixed

Strategy Nash Equilibrium

Definition (Support of a Mixed Strategy)
Let σi be any mixed strategy of a player i. The support of σi , denoted
by δ(σi ), is the set of all pure strategies which have non-zero
probabilities under σi , that is,
δ(σi ) = {si ∈ Si : σi (si ) > 0}
Example
Consider a SFG with S1 = {a, b, c, d}, and S2 = {a, b, e, f } with
σ1 (a) = 0.2, σ1 (b) = 0.5, σ1 (c) = 0.3, σ1 (d) = 0, and
σ2 (a) = 0.6, σ2 (b) = 0, σ2 (e) = 0.4, σ2 (f ) = 0. Find the support of
mixed strategies δ(σ1 ) and δ(σ2 ).
Support for σ1 and σ2 is given by δ(σ1 ) = {a, b, c}, and
δ(σ2 ) = {a, e}.
Definition (Support of a Mixed Strategy Profile)

Let σ = (σ1 , . . . , σn ) be a mixed strategy profile with δ(σi ) as the
support of σi for i = 1, . . . , N. Then the support δ(σ) of the profile σ
is the Cartesian product of individual supports, that is
δ(σ1 ) × . . . × δ(σn ).
Example
Consider the computation from previous example and compute δ(σ).
From last example, we know that δ(σ1 ) = {a, b, c}, and
δ(σ2 ) = {a, e}.
The support of the strategy profile is given by
δ(σ) = δ(σ1 ) × σ2 (σ2 )

= {a, b, c} × {a, e}
= {(a, a), (a, e), (b, a), (b, e), (c, a), (c, e)}

Theorem (Indifference Theorem)
A mixed strategy of player A is a best response to mixed strategies of the
other players if each pure strategy in its support is a best response to the
given mixed strategies of the other players.
Theorem
The mixed strategy profile (σ1∗ , . . . , σn∗ ) is a mixed strategy Nash equilibrium
iff ∀i ∈ N,
∗
1 ui (si , σ−i ) is the same ∀si ∈ δ(σi∗ ) and
∗
2 ui (si , σ−i ) ≥ ui (s0i , σ−i
∗
) ∀si ∈ δ(σi∗ ); ∀s0i ∈
/ δ(σi∗ ).
Henceforth we’ll call to (1) and (2) as condition (1) and condition (2)
The theorem implies that the payoff for player i corresponding to any
pure strategy having non-zero probability is same
Moreover, the payoff is no less than the payoff corresponding to any
pure strategy having zero probability
Implications of Necessary and Sufficient Conditions
Given a mixed strategy Nash equilibrium, each player gets the

same payoff (as in the equilibrium) by playing any pure strategy
having positive probability in her equilibrium mixed strategy
It means player can be indifferent about which of the pure
strategies she will play (having +ve probability in her
equilibrium mixed strategy)
To verify whether or not a mixed strategy profile is a Nash
equilibrium, it is enough to consider the effects of only pure
strategy deviations (with rest of the players playing their
equilibrium strategies)

Proposition
Given si ∈ Si , let e(si ) denote the degenerate mixed strategy that
assigns probability 1 to si and probability 0 to all other strategies in
Si . The strategy profile (s∗1 , . . . , s∗n ) is a pure strategy Nash
equilibrium of the game hN, (Si ), (ui )i iff the mixed strategy profile
(e(s∗1 ), . . . , e(s∗n )) is a mixed strategy Nash equilibrium of the game
hN, (Si ), (ui )i.
Implication of this theorem is that to identify pure strategy equilibria

of the game hN, (∆(Si )), (ui )i, it is enough to look at the pure strategy
game hN, (Si ), (ui )i.
Mixed Strategy Nash Equilibria of BOS Game

2
1
A B
A 2,1 0,0
B 0,0 1,2
First we verify that (A, A) is a Nash equilibrium
For this profile we denote
σ1∗ (A) = 1; σ1∗ (B) = 0; σ2∗ (A) = 1; σ2∗ (B) = 0
Here, δ(σ1∗ ) = δ(σ2∗ ) = {A}  
X Y
We know that ui (si , σ−i ) =  σj (sj ) ui (si , s−i )
s−i ∈S−i j6=i
X
u1 (A, σ2∗ ) = σ2∗ (s2 )u1 (A, s2 )
s2 ∈{A,B}
= σ2∗ (A)u1 (A, A) + σ2∗ (B)u1 (A, B)

= 1.2 + 0.0 = 2
Similarly, we get u1 (B, σ2∗ ) = 0
Condition (1) of NS theorem is trivially true and condition (2) of
that theorem is true because u1 (A, σ2∗ ) > u1 (B, σ2∗ )
These conditions are satisfied for player 2 also
Hence, (A, A) is a Nash equilibrium
Similarly, (B, B) is also a Nash equilibrium

2 1 1 2
Candidate NE: ,, ,
3 3 3 3
2 1 1 2
Which means, σ1∗ (A) = ; σ1∗ (B) = ; σ2∗ (A) = ; σ2∗ (B) =
3 3 3 3
∗ ∗
Here, δ(σ1 ) = δ(σ2 ) = {A, B}
First we examine the situations with player 1, from last slide we
can write  
X Y
ui (si , σ−i ) =  σj (sj ) ui (si , s−i )
s−i ∈S−i j6=i
X
u1 (A, σ2∗ ) = σ2∗ (s2 )u1 (A, s2 )
s2 ∈{A,B}
= σ2∗ (A)u1 (A, A) + σ2∗ (B)u1 (A, B)

1 2 2
= (2) + (0) =
3 3 3
1 2 2
Similarly, u1 (B, σ2∗ ) = (0) + (1) =
3 3 3
∗
Con1 : ui (si , σ−i ) is the same ∀si ∈ δ(σi∗ )
2
It is satisfied, because u1 (A, σ2∗ ) = u1 (B, σ2∗ ) =
3
∗ 0 ∗ ∗ 0 ∗
Con2 :ui (si , σ−i ) ≥ ui (si , σ−i ) ∀si ∈ δ(σi ); ∀si ∈
/ δ(σi )
Condition 2 is trivially satisfied because δ(σ1∗ ) = {A, B}, the
entire set, there is nothing to compare
For player 2, from last  slide we  can write
X Y
ui (si , σ−i ) =  σj (sj ) ui (si , s−i )
s−i ∈S−i j6=i
X
u2 (σ1∗ , A) = σ1∗ (s1 )u2 (s1 , A)
s1 ∈{A,B}
= σ1∗ (A)u2 (A, A) + σ1∗ (B)u2 (B, A)

2 1 2
= (1) + (0) =
3 3 3
2 1 2
Similarly, u2 (σ1∗ , B) = (0) + (2) =
3 3 3
Condition (1) satisfied and condition (2) is trivially satisfied
Lets check if there are any other Nash equilibria

The equilibrium (A,A) corresponds to the support {A} × {A},
and equilibrium (B,B) corresponds to the support {B} × {B}
There is no Nash equilibria with support {A} × {A, B}. If player
1 plays A, then player 2 has to play only A, which leads to PSNE
(A,A). There is no way player will play B with non-zero
probability
Similarly, there is no NE with any of the following supports:
{B} × {A, B}
{A, B} × {A}
{A, B} × {B}
{B} × {A}
{A} × {B}

Lets check if there is any other NE with support {A, B} × {A, B}
Let (σ1∗ , σ2∗ ) be defined by
σ1∗ (A) = x σ1∗ (B) = 1 − x

σ2∗ (A) = y σ2∗ (B) = 1 − y
be a NE such that neither x ∈ (0, 1), and

y ∈ (0, 1)(0 < x < 1; 0 < y < 1)
u1 (A, σ2∗ ) = σ2∗ (A)u1 (A, A) + σ2∗ (B)u1 (A, B) = 2y,
u1 (B, σ2∗ ) = 1 − y
X
∗
u2 (σ1 , A) = σ1∗ (s1 )u2 (s1 , A) = x, and
s1 ∈{A,B}
u2 (σ1∗ , B)
= 2(1 − x)
As per condition (1) of the theorem
u1 (A, σ2∗ ) = u1 (B, σ2∗ )

u2 (σ1∗ , A) = u2 (σ1∗ , B)
Substituting values in condition (1) equation yields

1
2y = 1 − y =⇒ y =
3
2
x = 2(1 − x) =⇒ x =
3
This leads to the NE

2 1 1 2
σ1∗ = , ; σ2∗ = ,
3 3 3 3
Thus the game does not have any other equilibria

Maxmin and Minmax Values in Mixed Strategies
Given a SFG, the maxmin value of a player is the highest payoff the
player can guarantee himself even in the worst case when other
players are free to play any mixed strategies, formally
Definition (Maxmin Value in Mixed Strategies)
Given a strategic form game, Γ = hN, (Si ), (ui )i, the maxmin value or
security value, in mixed strategies, of a player i(i = 1, . . . , n) is given
by
vi = max min ui (σi , σ−i ).
σi ∈∆(Si ) σ−i ∈×j6=i ∆(Sj )
Any mixed strategy σi ∈ ∆(Si ) that guarantees this payoff to player i

is called a maxmin mixed strategy or security strategy of player i.
A player may have multiple maxmin mixed strategies

The payoff of a player in a MSNE is at least the maxmin value in
mixed strategies of the player, formally
Proposition
Suppose a strategic form game Γ = hN, (Si ), (ui )i has a mixed
strategy Nash equilibrium (σ1∗ , . . . , σn∗ ). Then
ui (σ1∗ , . . . , σn∗ ) ≥ vi ∀i ∈ N
where vi is the maxmin value in mixed strategies of player i

The minmax value in mixed strategies of a player i is the lowest
payoff that the other players will be able to force on player i when
they choose mixed strategies that hurt player i the most, formally
Definition (Minmax Value in Mixed Strategies)
Given a strategic form game, Γ = hN, (Si ), (ui )i, the minmax value,
in mixed strategies, of a player i(i = 1, . . . , n) is given by
vi = min max ui (σi , σ−i ).

σ−i ∈×j6=i ∆(Sj ) σi ∈∆(Si )
Any mixed strategy σ−i that forces this payoff on player i is called a
minmax mixed strategy profile against player i.
The maxmin value in mixed strategies of a player must be less than or

equal to the minmax value in mixed strategies of that player
Proposition
Consider a strategic form game Γ = hN, (Si ), (ui )i. Then
vi ≥ vi ∀i ∈ N
where vi is the maxmin value in mixed strategies of player i and vi is

the minmax value in mixed strategies of player i.
In two player SFG, the maxmin value in mixed strategies is equal

to the minmax value in mixed strategies
However, it need not be true in pure strategies

The payoff of a player in a MSNE (if one exists) is at least the
minmax value of the player, formally
Proposition
Suppose a strategic form game Γ = hN, (Si ), (ui )i has a mixed
strategy Nash equilibrium (σ1∗ , . . . , σn∗ ). Then
ui (σ1∗ , . . . , σn∗ ) ≥ vi ∀i ∈ N
where vi is the minmax value in mixed strategies of player i
Also, if a MSNE exists then ui (σ1∗ , . . . , σn∗ ) ≥ vi ≥ vi ∀i ∈ N
Domination in Mixed Strategies
Definition
Given two mixed strategies σi , σi0 ∈ ∆(Si ) of player i, we say σi strictly dominates σi0
if
ui (σi , σ−i ) > ui (σi0 , σ−i ) ∀σ−i ∈ ×j6=i ∆(Sj ).
We say σi weakly dominates σi0 if
ui (σi , σ−i ) ≥ ui (σi0 , σ−i ) ∀σ−i ∈ ×j6=i ∆(Sj )
and
ui (σi , σ−i ) > ui (σi0 , σ−i ) ∃σ−i ∈ ×j6=i ∆(Sj ).
We say σi very weakly dominates σi0 if
ui (σi , σ−i ) ≥ ui (σi0 , σ−i ) ∀σ−i ∈ ×j6=i ∆(Sj ).
In the above cases we say the strategy σi0 is strongly/weakly/very weakly dominated
by σi

Definition (Dominant Mixed Strategy Equilibrium)
If the mixed strategy σi∗ strongly(weakly or very weakly) dominates
all other strategies σi0 ∈ ∆(Si ), we say σi∗ is a strongly (weakly or
very weakly dominant) strategy of player i. A strategy profile
(σ1∗ , . . . , σn∗ ) such that σi∗ is a strictly (weakly or very weakly)
dominant strategy for player i, ∀i ∈ N, is called a strictly (weakly or
very weakly) dominant mixed strategy equilibrium.
Any dominant mixed strategy equilibrium is also a mixed

strategy Nash equilibrium
A strictly dominant mixed strategy for any player, if one exists,
is unique
2 2
1 1 2
NC C C 1
C
NC -2,-2 -10,-1 NC -10,-1
C -5,-5
C -1,-10 -5,-5 C -5,-5
Strategy NC is strictly dominated by strategy C for player 2

Player 2 will never play NC
Strategy NC for player 2 can be eliminated, resulting in second
bimatrix
Now the strategy NC of player 1 is dominated by strategy C,
which can also be eliminated leading to a degenerate payoff
matrix with a single entry corresponding to the profile (C,C),
which happens to be a strongly dominant strategy equilibrium

2
1
X Y Z
A 3,1 0,1 0,0
B 0,1 4,1 0,0
C 1,1 1,1 5,0
u2 (A, X) = u2 (A, Y) > u2 (A, Z)

u2 (B, X) = u2 (B, Y) > u2 (B, Z)
u2 (C, X) = u2 (C, Y) > u2 (C, Z)
Strategy Z is strictly dominated by X and also Y
Player 2 will never play Z
Strategy Z can be eliminated
2
1
X Y
A 3,1 0,1
B 0,1 4,1
C 1,1 1,1
u1 (A, X) > u1 (C, X) > u1 (B, X)

u1 (B, Y) > u1 (C, Y) > u1 (A, Y)
None of the pure strategy of player 1 is dominated by any of its pure
strategies
However, strategy C is strictly dominated by the mixed strategy of
player 1 that assigns equal probability to A and B
C was not dominated by any mixed strategy in original game
A strategy that was not dominated may become dominated when a
strictly dominated strategy is eliminated
Also, a pure strategy may not be dominated by any other pure
strategies but could be dominated by a mixture of those pure strategies
2
1
X Y
A 3,1 0,1
B 0,1 4,1
No more strategies can be eliminated from this game
Iterated Elimination of Dominated Strategies (IEDS)
Elimination of dominated strategies simplifies the game

Consider a strategic form game, Γ = hN, (Si ), (ui )i
Let k = 1, 2, . . . , K denotes the successive rounds in which
strictly dominated strategies are eliminated
For each player i ∈ N, define the set of strategies Sik as follows
Si1 = Si
Sik+1 ⊆ Sik for k = 1, 2, . . . , K − 1
For k = 1, 2, . . . , K − 1, all strategies si ∈ Sik \Sik+1 are strictly
dominated strategies which are eliminated in the kth round from
the game in which the set of strategies of j ∈ N is Sjk
No strategy in SiK is strictly dominated in the game in which the
set of strategies of each player j ∈ N is SjK
Set of strategy profiles {(s1 , s2 , . . . , sn ) : si ∈ SiK , i = 1, . . . , n} is
said to survive the iterated elimination of strictly dominated
strategies

Example
In last game, S1 = {A, B, C}; S2 = {X, Y, Z}
For this game
S11 = S1 = {A, B, C}; S21 = S2 = {X, Y, Z}
S12 = {A, B, C}; S22 = {X, Y}
S13 = {A, B}; S23 = {X, Y}
The strategy profiles {(A, X), (A, Y), (B, X), (B, Y)} survives the
IEDS
Iterated removal of strongly dominated strategies may

sometimes simplify NE computation
Also, many a times, IEDS may not lead to any simplification as
well
For finite games, IEDS must end in a finite number of rounds
The order of removal of strongly dominant strategies does not
affect the final outcome of the iterated elimination process
However, the order in which weakly or very weakly dominated
strategies are eliminated does influence the final outcome of the
process
Elimination of weakly or very weakly dominated strategies
yields smaller reduced games compared to elimination of
strongly dominated strategies

Mixed Strategy Nash Equilibrium: Sanjay Singh

Uploaded by

Copyright:

Available Formats

Mixed Strategy Nash Equilibrium: Sanjay Singh

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Mixed Strategy Nash Equilibrium: Sanjay Singh

Uploaded by

Copyright:

Available Formats

Mixed Strategy Nash Equilibrium

April 10, 2021

Sanjay Singh MSNE 1

A strategy profile with an outcome which is simultaneously the smallest number in

Definition (Saddle Point)

Definition (Saddle Point Principle)

Definition (Game Value)

Sanjay Singh MSNE 2

All four of the circled outcomes are saddle point

Sanjay Singh MSNE 3

Only sensible plan is to use

Other player could take The contrasting plan of playing

Sanjay Singh MSNE 4

Expected Value Principle

Sanjay Singh MSNE 5

Consider the scanario wherein Colin decides to use mixed

Sanjay Singh MSNE 6

Sanjay Singh MSNE 7

Figure 1: Game with oddments, and mixed strategy probability calculation

Value of the game is computed as:

Sanjay Singh MSNE 9

For each Rose strategy, mark the payoff if Colin

Sanjay Singh MSNE 10

All of Rose’s expectations are ≤ 4/7, and all of Colin’s

Sanjay Singh MSNE 12

The lowest point on the upper envelope lies

Sanjay Singh MSNE 13

Colin would try to choose her strategy to

Sanjay Singh MSNE 14

Sanjay Singh MSNE 15

Theorem (Minimax Theorem)

The solution of a m × n can always be found as the solution to

Let us solve this 3 × 3 game by method of equalizing

Sanjay Singh MSNE 17

Rose will be unable to take advantage of Colin’s mixed strategy

Sanjay Singh MSNE 19

Definition (Pure Strategy)

Definition (Mixed Strategy)

Sanjay Singh MSNE 20

Consider a strategic form game Γ = hN, (Si ), (ui )i

Definition (Mixed Strategy)

Sanjay Singh MSNE 21

A pure strategy of a player, say si ∈ Si , can be considered as a mixed

Set ∆(Si ) is called the mixed extension of Si

The payoff functions Ui are defined as

Ui (σ1 , . . . , σn ) for i = 1, . . . , n is the expected payoff when the

Mixed Strategies in BOS Problem

Suppose (σ1 , σ2 ) is a mixed strategy profile

Sanjay Singh MSNE 24

u1 (σ1 , σ2 ) = σ1 (A)σ2 (A)u1 (A, A) + σ1 (A)σ2 (B)u1 (A, B)

Similarly, we can get

Sanjay Singh MSNE 25

Inequalities and intervals

Sanjay Singh MSNE 26

Sanjay Singh MSNE 27

Solve inequality x2 − 3 > 2x.

Sanjay Singh MSNE 28

We are solving for (x + 1)(x − 3) > 0

Sanjay Singh MSNE 29