The allied forces were commanded by General George C. Kenney. His
objective
was to do as much damage as possible to the Japanese convoy, but he had
to find them first. He could either start searching north of New
Britain
or south. Assume that it will take one day to either find
the
convoy or determine that they took another route under stormy
conditions,
and assume that it will take almost no time at all to find the convoy
under
sunny conditions ,but one day to determine if the convoy took another
route
under sunny conditions. With these assumptions, we can construct
the following payoff matrix. Note that only the payoffs,
presented
as number of days of bombing, for the allied forces are shown.
Since
the game is zero sum, the payoff for the Japanese forces is
simply
the opposite of the allied payoff.
|
|
|
|
|
|
|
|
|
|
|
|
This payoff matrix indicates that if the Japanese sail north and the allies search south, then the allies will spend one day determining that the Japanese did not sail south and then spend another day finding the Japanese ships to the north. This results in only one day of bombing. Similar reasoning applies to the other entries.
So what do you do? The maximin solution for the allied forces
is to search north, and the minimax solution for the Japanese forces is
to sail north. Additionally, the "sail and search north" solution
is a Nash equilibrium since there is no incentive for either combatant
to change its strategy from this strategy. Since every other
option
in the payoff matrix comes with an incentive to change strategies,
there
is no other equilibrium solution. For this problem, the minimax
solution
and the Nash equilibrium coincide. When this occurs, the
resulting
payoff is referred to as the value of the game. In our
example,
2 days of bombing is the game's value.
You might wonder why this payoff is called
the value of the game?
The answer to this lies in the fact that, for zero-sum games, the
minimax value for the minimizing player and the maximin value for the
maximizing player are the same. Thus, finding the minimax value
yields a unique number that represents the value of the game. In
the proof of the minimax theorem, you will encounter theorem 1 which
states that if an equilibrium exists than the minimax value equals the
maximin value.
For historical purposes, these choices were indeed the ones made by the forces; the convoy was sighted about one day after it sailed and the Japanese suffered severe losses. Eventually (1944), the Japanese forces on the island were isolated and, though the force withered, the force continued guerilla warfare until the war ended in 1945.
|
|
|
|
|
|
|
|
|
|
|
|
For example, if the fighter attacked from the sun and the bomber crew was looking up then the fighter had a 95 percent chance of surviving. If they attacked from below and the bomber crew was looking down then they had 0 percent chance of surviving.
Is there a Nash equilibrium? Let's consider the options. For (Sun,Up) the fighter payoff is 0.95 but the fighter has an incentive to attack from the bottom because the payoff increases to 1. Thus, (Sun, Up) is not an equilibrium point. For (Bottom, Up) the fighter payoff is 1 but the bomber crew has an incentive to look down because the fighter payoff decreases to 0. Thus, (Bottom, Up) is not an equilibrium point. It is left to you to think through (Sun, Down) and (Bottom, Down). When you are done thinking, you should find out that there is no equilibrium solution.
What value results when the fighter employs maximin and the bomber employs minimax? For the fighter, the maximin solution is to attack from the sun and the maximin value(in terms of cost) is 0.95. For the bomber, either strategy is a minimax solution and both minimax values are 1 (corresponding to a maximin payoff of -1).
How can we rationally address this problem? What should the bomber do and what should the fighter do? In answering this question, we will need to employ the idea of a mixed strategy.
The essence of a mixed strategy is that we perform one action with a certain probability and other actions with certain probabilities. Sometimes mixed strategies work better than the pure minimax strategies because the mixed strategies are better at minimizing (expected) risk.
We begin with a formal statement of a two person, zero sum game. Please note that I am trying to stick closely to the terminology presented in previous lectures, but that it is notationally easier to denote P2's set of actions as B instead of A2. Because I can't get HTML to display all of the symbols I need, please refer to the pdf version of the proof.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
We'll adopt P1's perspective. Suppose that P1's optimal mixed strategy is to play strategy a1 p percent of the time and play a2 1-p percent of the time. Then, on the average, payoffs to P1 against the three strategies of P2 are
M(p,b1) = 0p + 1(1-p) = 1-p against strategy b1Let's plot these expected payoffs as a function of p.
M(p,b2) = 5/6p+1/2(1-p) = 1/2 + 1/3p against strategy b2
M(p,b3) = 1/2p + 3/4(1-p) = 3/4-1/4p against strategy b3.
Recall that P1 wants to maximize the minimum payoff that it will receive. This minimum payoff corresponds to the lowest M as p changes. For points to the left of about 0.4 the function M(p,b2) is minimum. For points to the right of about 0.4 the function M(p,b1) is minimum. The maximum of these minimal functions occurs at their intersection, that is at the point where
M(p,b1) = M(p,b2)which occurs when p=3/8=.375.
1-p = 1/2 + 1/3p
What is the optimal mixed strategy for P2? Can you do this exercise for the Fighters and Bombers game?