Matrix Game Theory
Matrix Game: Two players, each makes a choice secretly and play simutaneously. And there is payoff.
Zero Sum GamePermalink
Saddle PointPermalink
is a saddle point if entry is the largest value in column and smallest value in its row.
Thm: All saddle points has the same value and appears as corners of a rectangle.
22 GamePermalink
All examples below are between two players Colin and Ros. Because it is a zero sum game, we only write Rose’s payoff, and Colin’s payoff is just the inverse.
Example:
A | B | |
A | 2 | -3 |
B | -1 | 3 |
If Colin plays : Rose A average payout is Rose A average payout is
For Colin playing minimizes Rose’s maximum average payout; similarly Rose wants to maximize her minimum average payout. So:
So she wants:
So the average payout is
Note: If there is a saddle point in the game, then it can’t be solved.
For
a | b |
c | d |
with no dominant row/column:
So for Colin whose payout is :
Zero Sum GamePermalink
Von Neumann’s Minimax Thm: Every game has a solution. Each player has a distribution of their choices such that Rose’s average minimum payout is maximized, and Colin’s maximum average payout is minimized, and two payouts are the same.
The proof is omitted not because I am lazy but for exercise (fact!). ¯_(ツ)_/¯
Some Other Lemma and PropPermalink
Let be a matrix. is the payout vector for Rose, and is for column
Lemma:
Prop: are strategies for Rose, Colin such that
So how do we find the minimax\maximin? Use linear programming, which is not interesting at all so we will just skip that.
Non-zero Sum GamePermalink
All examples below are between two players Colin and Rose, where Colin’s payoff is the column one and Rose’s payoff is the row; i.e. for , Colin has and Rose has .
Let start with a simple example
(-1,-1) | (-10, 0) |
(0, -10) | (-5, -5) |
Obviously, the point (-5, -5) is the pure Nash equilibrium.
Mix StrategyPermalink
A mix strategy is just a vector of probability of strategy.
If is a mix strategy for Colin, a best response for Rose is any with if is not a max entry.
A Nash Equilibrium is a pair of mix strategy such that each is a best response for the other.
Thm: for game, is a Nash Eq if and have equal entries.
Definitions, Axioms, and Nash’s TheormPermalink
We know that Nash equilibrium is not necessary global optimal. For the example above, they could have picked (-1, -1), which is better than (-5, -5), but they end up with the worse one. Therefore, we might want someone to pick a position for them, instead of letting them come up with a solution themselves.
Given a game, the God selects a position for R and C that is a “fair”. To be a “fair” decision the outcome should be:
- Pareto Optimal: No with
- should be at least their maximin strategy with repsect to their own game (or they will just pick the maximin strategy)
We call the numbers in the matrix utility, because it is not necessarily be money.
Also, we can map the matrix geometrically, where each payoff is just a point. I won’t show it here, because it should be obvious. After placing all the points, we can draw a polygon by connecting those points. So the boundary on the North-East direction of the convex hull is pareto optimal. If the maximin solution point is inside the convex hull, then the boundary of the convex hull on the North-East direction of the maximin solution (which is a point) is called negotiation set.
Then we have the following axioms: Any “fair” decision should be:
- In negotiation set.
- If either player’s utilities, say Roses’, is tranformed by , then is a fair decision in the new game.
- If the payoff polygon is symmetric about the line , then is on this line.
- Suppose is a payoff polygon, and a fixed “status quo” point, which could be the maximin or other “fall back” point, is given. Suppose is another polygon contained in where , , then is the fair decision for .
Nash’s TheormPermalink
Nash’s Thm:
There is only one point that satisfies all of the above axioms.
If , then maximizes , or
The proof is omitted not because I am lazy but for exercise (fact!). ¯_(ツ)_/¯
Comments