Mixed Strategy Nash Equilibrium: Sanjay Singh
Mixed Strategy Nash Equilibrium: Sanjay Singh
Mixed Strategy Nash Equilibrium: Sanjay Singh
Sanjay Singh
1
Department of Information and Communication Technology
Manipal Institute of Technology, MAHE
Manipal-576104, INDIA
sanjay.singh@manipal.edu
2
Centre for Artificial and Machine Intelligence (CAMI)
MAHE, Manipal-576104, INDIA
p1 a1 + p2 a2 + . . . + pk ak
Colin
A B
Rose
A 2 -3
B 0 3
A 2 × n game
A B C D E
A -2 5 1 0 -4
B 3 -3 -1 3 8
m × n game
A B C
A 1 2 2
B 2 1 2
C 2 2 0
A B C A B C A B C
A 1 2 2 B 2 1 2 A 1 2 2
B 2 1 2 C 2 2 0 C 2 2 0
Some definitions
2
1
A B
A 2,1 0,0
B 0,0 1,2
Digression
Inequalities
B
A
Left Middle Right
Up A1,1 , B1,1 A1,2 , B1,2 A1,3 , B1,3
Middle A2,1 , B2,1 A2,2 , B2,2 A2,3 , B2,3
Down A3,1 , B3,1 A3,2 , B3,2 A3,3 , B3,3
ui (σi∗ , σ−i
∗ ∗
) ≥ ui (σi , σ−i ) ∀σi ∈ ∆(Si ).
λx + (1 − λ)y ∈ C
Proposition
Let Γ = hN, (Si ), (ui )i be a strategic form game. Then ui (σi , σ−i ) can
be expressed as the convex combination:
X
ui (σi , σ−i ) = σi (si )ui (si , σ−i )
si ∈Si
where
X Y
ui (si , σ−i ) = σj (sj ) ui (si , s−i ).
s−i ∈S−i j6=i
Interpretation
Implication of this result is that the payoff for any player under mixed
strategy can be computed as a convex combination of the payoffs
obtained when the player plays pure strategies with the rest of the
players playing σ−i .
u1 (σ1 , σ2 ) =
5σ1 (x1 ) + 10σ1 (x2 ) + 10σ1 (x3 ) + 20σ1 (x4 ) + 20σ1 (x5 )
The maximum value of convex combination is 20, which is
obtained when
σ1 (x4 ) = 1 or σ1 (x5 ) = 1 or
σ1 (x4 ) + σ1 (x5 ) = 1, and
σ1 (x1 ) + σ1 (x2 ) + σ1 (x3 ) = 0 or σ1 (x1 ) = σ1 (x2 ) = σ1 (x3 ) = 0
Also, max u1 (σ1 , σ2 ) = 20
σ1 ∈∆(S1 )
max u1 (σ1 , σ2 ) = max u1 (s1 , σ2 )
σ1 ∈∆(S1 ) s1 ∈S1
Proposition
Given a strategic form game hN, (Si ), (ui )i, then, for any σ ∈ ×i∈N ∆(Si )
and for any player i ∈ N,
Furthermore
ρi ∈ arg max ui (σi , σ−i )
σi ∈∆(Si )
iff
ρi (x) = 0 ∀x ∈
/ arg max ui (si , σ−i )
si ∈Si
Example
Consider a SFG with S1 = {a, b, c, d}, and S2 = {a, b, e, f } with
σ1 (a) = 0.2, σ1 (b) = 0.5, σ1 (c) = 0.3, σ1 (d) = 0, and
σ2 (a) = 0.6, σ2 (b) = 0, σ2 (e) = 0.4, σ2 (f ) = 0. Find the support of
mixed strategies δ(σ1 ) and δ(σ2 ).
Support for σ1 and σ2 is given by δ(σ1 ) = {a, b, c}, and
δ(σ2 ) = {a, e}.
Example
Consider the computation from previous example and compute δ(σ).
From last example, we know that δ(σ1 ) = {a, b, c}, and
δ(σ2 ) = {a, e}.
The support of the strategy profile is given by
Theorem
The mixed strategy profile (σ1∗ , . . . , σn∗ ) is a mixed strategy Nash equilibrium
iff ∀i ∈ N,
∗
1 ui (si , σ−i ) is the same ∀si ∈ δ(σi∗ ) and
∗
2 ui (si , σ−i ) ≥ ui (s0i , σ−i
∗
) ∀si ∈ δ(σi∗ ); ∀s0i ∈
/ δ(σi∗ ).
Henceforth we’ll call to (1) and (2) as condition (1) and condition (2)
The theorem implies that the payoff for player i corresponding to any
pure strategy having non-zero probability is same
Moreover, the payoff is no less than the payoff corresponding to any
pure strategy having zero probability
2 1 1 2
Candidate NE: ,, ,
3 3 3 3
2 1 1 2
Which means, σ1∗ (A) = ; σ1∗ (B) = ; σ2∗ (A) = ; σ2∗ (B) =
3 3 3 3
∗ ∗
Here, δ(σ1 ) = δ(σ2 ) = {A, B}
First we examine the situations with player 1, from last slide we
can write
X Y
ui (si , σ−i ) = σj (sj ) ui (si , s−i )
s−i ∈S−i j6=i
X
u1 (A, σ2∗ ) = σ2∗ (s2 )u1 (A, s2 )
s2 ∈{A,B}
Given a SFG, the maxmin value of a player is the highest payoff the
player can guarantee himself even in the worst case when other
players are free to play any mixed strategies, formally
Definition (Maxmin Value in Mixed Strategies)
Given a strategic form game, Γ = hN, (Si ), (ui )i, the maxmin value or
security value, in mixed strategies, of a player i(i = 1, . . . , n) is given
by
vi = max min ui (σi , σ−i ).
σi ∈∆(Si ) σ−i ∈×j6=i ∆(Sj )
Proposition
Suppose a strategic form game Γ = hN, (Si ), (ui )i has a mixed
strategy Nash equilibrium (σ1∗ , . . . , σn∗ ). Then
ui (σ1∗ , . . . , σn∗ ) ≥ vi ∀i ∈ N
Any mixed strategy σ−i that forces this payoff on player i is called a
minmax mixed strategy profile against player i.
vi ≥ vi ∀i ∈ N
ui (σ1∗ , . . . , σn∗ ) ≥ vi ∀i ∈ N
Definition
Given two mixed strategies σi , σi0 ∈ ∆(Si ) of player i, we say σi strictly dominates σi0
if
ui (σi , σ−i ) > ui (σi0 , σ−i ) ∀σ−i ∈ ×j6=i ∆(Sj ).
We say σi weakly dominates σi0 if
and
ui (σi , σ−i ) > ui (σi0 , σ−i ) ∃σ−i ∈ ×j6=i ∆(Sj ).
In the above cases we say the strategy σi0 is strongly/weakly/very weakly dominated
by σi
2 2
1 1 2
NC C C 1
C
NC -2,-2 -10,-1 NC -10,-1
C -5,-5
C -1,-10 -5,-5 C -5,-5
2
1
X Y
A 3,1 0,1
B 0,1 4,1
C 1,1 1,1