In game theory a max-dominated strategy is a strategy which is not a best response to any strategy profile of the other players. This is an extension to the notion of strictly dominated strategies, which are max-dominated as well.
A strategy
si\inSi
i
s-i\inS-i
\prime | |
s | |
i\in |
Si
\prime | |
u | |
i,s |
-i)>ui(si,s-i)
si
s-i
\prime | |
s | |
i |
si
i
If a strategy
si\inSi
\prime | |
s | |
i |
\inSi
s-i\inS-i
\prime | |
s | |
i |
\prime | |
u | |
i,s |
-i)>ui(si,s-i)
Even if
si
A strategy
si\inSi
i
s-i\inS-i
\prime | |
s | |
i\in |
Si
\prime | |
u | |
i,s |
-i)\gequi(si,s-i)
si
s-i
\prime | |
s | |
i |
si
i
If a strategy
si\inSi
\prime | |
s | |
i |
\inSi
s-i\inS-i
\prime | |
s | |
i |
\prime | |
u | |
i,s |
-i)\gequi(si,s-i)
Even if
si
A game
G
More formally we say that
G
G0,...,Gr
G0=G
Gk+1
Gk
Gr
Obviously every max-solvable game has a unique pure Nash equilibrium which is the strategy profile left in
Gr
As in the previous part one can define respectively the notion of weakly max-solvable games, which are games for which a game with a single strategy profile can be reached by eliminating weakly max-dominated strategies. The main difference would be that weakly max-dominated games may have more than one pure Nash equilibrium, and that the order of elimination might result in different Nash equilibria.
The prisoner's dilemma is an example of a max-solvable game (as it is also dominance solvable). The strategy cooperate is max-dominated by the strategy defect for both players, since playing defect always gives the player a higher utility, no matter what the other player plays. To see this note that if the row player plays cooperate then the column player would prefer playing defect and go free than playing cooperate and serving one year in jail. If the row player plays defect then the column player would prefer playing defect and serve three years in jail rather than playing cooperate and serving five years in jail.
In any max-solvable game, best-reply dynamics ultimately leads to the unique pure Nash equilibrium of the game. In order to see this, all we need to do is notice that if
s1,s2,s3,...,sk
s1
s2
s1
s2
s1
s-i
s1
s1
1, 1 | 0, 0 | |
1, 0 | 0, 1 | |
0, 1 | 1, 0 |
It may come by surprise then that weakly max-solvable games do not necessarily converge to a pure Nash equilibrium when using the best-reply dynamics, as can be seen in the game on the right. If the game starts of the bottom left cell of the matrix, then the following best replay dynamics is possible: the row player moves one row up to the center row, the column player moves to the right column, the row player moves back to the bottom row, the column player moves back to the left column and so on. This obviously never converges to the unique pure Nash equilibrium of the game (which is the upper left cell in the payoff matrix).