Matrix Game Theory

September 10, 2017 4 minute read

Matrix Game: Two players, each makes a choice secretly and play simutaneously. And there is payoff.

Zero Sum GamePermalink

Saddle PointPermalink

$(X, Y)$ is a saddle point if entry $x$ is the largest value in column $X$ and smallest value in its row.

Thm: All saddle points has the same value and appears as corners of a rectangle.

2 $\times$ 2 GamePermalink

All examples below are between two players Colin and Ros. Because it is a zero sum game, we only write Rose’s payoff, and Colin’s payoff is just the inverse.

Example:

	A	B
A	2	-3
B	-1	3

If Colin plays $\frac{2}{3} A, \frac{1}{3} B$ : Rose A average payout is $2 \times \frac{2}{3} - 3 \times \frac{1}{3} = \frac{1}{3}$ Rose A average payout is $- 1 \times \frac{2}{3} + 3 \times \frac{1}{3} = \frac{1}{3}$

For Colin playing $\frac{2}{3} A, \frac{1}{3} B$ minimizes Rose’s maximum average payout; similarly Rose wants to maximize her minimum average payout. So:

m a x_{y \in [0, 1]} {m i n {(2 y - 1 (1 - y), - 3 y + 3 (1 - y))}}

So she wants:

2 y - 1 (1 - y) = - 3 y + 3 (1 - y) y = \frac{4}{9}

So the average payout is $2 y - (1 - y) = \frac{1}{3}$

Note: If there is a saddle point in the game, then it can’t be solved.

For

a	b
c	d

with no dominant row/column:

a > c ⟺ b \leq d a \leq c ⟺ b > d

So for Colin whose payout is $x$ :

a x + b (1 - x) = c x + d (1 - x) (a - b - c + d) x = d - b x = \frac{d - b}{a - c + d - b}

$m \times n$ Zero Sum GamePermalink

Von Neumann’s Minimax Thm: Every $m \times n$ game has a solution. Each player has a distribution of their choices such that Rose’s average minimum payout is maximized, and Colin’s maximum average payout is minimized, and two payouts are the same.

The proof is omitted not because I am lazy but for exercise (fact!). ¯_(ツ)_/¯

Some Other Lemma and PropPermalink

Let $A$ be a $m \times n$ matrix. $P$ is the payout vector for Rose, and $q$ is for column

\begin{aligned} P & = [\begin{matrix} p_{1} \\ p_{2} \\ ⋮ \\ p_{m} \end{matrix}] \end{aligned} \in [0, 1]^{m} s . t . \sum_{i = 1}^{m} p_{i} = 1 P^{T} A = [p^{T} A_{1}, p^{T} A_{2}, \dots, p^{T} A_{n}] s . t . i^{t h} e n t r y i s R o s e^{'} s a v e r a g e p a y o f f w h e n C o l i n p l a y s i^{t h} s t r a t e g y .

\begin{aligned} A q & = [\begin{matrix} A_{1} q \\ A_{2} q \\ ⋮ \\ A_{m} q \end{matrix}] \end{aligned} r = m a x i m i n = m a x (m i n (p^{T} A_{i})) (f o r R o s e) c = m i n i m a x = m i n (m a x (A q)) (f o r C o l i n)

Lemma:

f o r a n y s t r a t e g y : \forall p, q s . t . p, q \in [0, 1]^{m}, \sum p_{i} = \sum q_{i} = q m i n_{i = 1, 2, \dots n} ((p^{T} A)_{i}) \leq p^{T} A q m a x_{i = 1, 2, \dots m} ((A q)_{i}) \geq p^{T} A q ⟹ m i n (p^{T} A) \leq m a x (A q) ⟹ m a x (m i n (p^{T} A)) \leq m i n (m a x (A q)) ⟹ r \leq c

Prop: $p, q$ are strategies for Rose, Colin such that

m i n (p^{T} A) = m a x (A q) = V T h e n I f p_{i} > 0 ⟹ (A q)_{i} = V I f q_{i} > 0 ⟹ (p^{T} A)_{i} = V

So how do we find the minimax\maximin? Use linear programming, which is not interesting at all so we will just skip that.

Non-zero Sum GamePermalink

All examples below are between two players Colin and Rose, where Colin’s payoff is the column one and Rose’s payoff is the row; i.e. for $(X, Y)$ , Colin has $X$ and Rose has $Y$ .

Let start with a simple example

(-1,-1)	(-10, 0)
(0, -10)	(-5, -5)

Obviously, the point (-5, -5) is the pure Nash equilibrium.

Mix StrategyPermalink

A mix strategy is just a vector of probability of strategy.

If $q$ is a mix strategy for Colin, a best response for Rose is any $p$ with $p_{i} = 0$ if $(R q)$ is not a max entry.

A Nash Equilibrium is a pair $(p, q)$ of mix strategy such that each is a best response for the other.

Thm: for $2 \times 2$ game, $(p, q)$ is a Nash Eq if $p^{T} C$ and $R q$ have equal entries.

Definitions, Axioms, and Nash’s TheormPermalink

We know that Nash equilibrium is not necessary global optimal. For the example above, they could have picked (-1, -1), which is better than (-5, -5), but they end up with the worse one. Therefore, we might want someone to pick a position for them, instead of letting them come up with a solution themselves.

Given a game, the God selects a position for R and C that is a “fair”. To be a “fair” decision $(x *, y *)$ the outcome should be:

Pareto Optimal: No $(x, y)$ with $(x, y) \geq (x *, y *)$
$(x *, y *)$ should be at least their maximin strategy with repsect to their own game (or they will just pick the maximin strategy)

We call the numbers in the matrix utility, because it is not necessarily be money.

Also, we can map the matrix geometrically, where each payoff is just a point. I won’t show it here, because it should be obvious. After placing all the points, we can draw a polygon by connecting those points. So the boundary on the North-East direction of the convex hull is pareto optimal. If the maximin solution point is inside the convex hull, then the boundary of the convex hull on the North-East direction of the maximin solution (which is a point) is called negotiation set.

Then we have the following axioms: Any “fair” decision should be:

In negotiation set.
If either player’s utilities, say Roses’, is tranformed by $g (x) = m x + n, m > 0$ , then $(g (x *), y *)$ is a fair decision in the new game.
If the payoff polygon is symmetric about the line $x = y$ , then $(x *, y *)$ is on this line.
Suppose $P$ is a payoff polygon, and a fixed “status quo” point, which could be the maximin or other “fall back” point, $S Q$ is given. Suppose $Q$ is another polygon contained in $P$ where $S Q$ , $(x *, y *) \in Q$ , then $(x *, y *)$ is the fair decision for $Q$ .

Nash’s TheormPermalink

Nash’s Thm:

There is only one point $(x *, y *)$ that satisfies all of the above axioms.

If $S Q = (x_{0}, y_{0})$ , then $(x *, y *)$ maximizes $(x - x_{0}) (y - y_{0})$ , or $(x *, y *) = (x_{0}, y_{0})$

The proof is omitted not because I am lazy but for exercise (fact!). ¯_(ツ)_/¯

Share on

X Facebook LinkedIn Bluesky

Yanxi Chen

Matrix Game Theory

Zero Sum GamePermalink

Saddle PointPermalink

2 $\times$ 2 GamePermalink

$m \times n$ Zero Sum GamePermalink

Some Other Lemma and PropPermalink

Non-zero Sum GamePermalink

Mix StrategyPermalink

Definitions, Axioms, and Nash’s TheormPermalink

Nash’s TheormPermalink

Share on

Comments

You May Also Enjoy

潮汕牛肉丸改良版

鸡肝酱Pâté

萝卜/芋头糕

潮汕肉丸

Yanxi Chen

Zero Sum GamePermalink

Saddle PointPermalink

2××2 GamePermalink

m×nm×n Zero Sum GamePermalink

Some Other Lemma and PropPermalink

Non-zero Sum GamePermalink

Mix StrategyPermalink

Definitions, Axioms, and Nash’s TheormPermalink

Nash’s TheormPermalink

Share on

Comments

You May Also Enjoy

潮汕牛肉丸改良版

鸡肝酱Pâté

萝卜/芋头糕

潮汕肉丸

2 $\times$ 2 GamePermalink

$m \times n$ Zero Sum GamePermalink