The theory of the firm and industry equilibrium: 7.2 Strategic games

7.2 Strategic games

Definition of a strategic game

To describe a situation in which decision-makers interact, we need to specify

who the decision-makers are
what each decision-maker can do
each decision-maker's payoff from each possible outcome.

A strategic game is one way to specify these components.

Definition: Strategic game

A strategic game consists of

a set of players
for each player, a set of actions (sometimes called strategies)
for each player, a payoff function that gives the player's payoff to each list of the players' actions.

An essential feature of this definition is that each player's payoff depends on the list of all the other players' actions. In particular, a player's payoff does not depend only on her own action.

When using the theory of strategic games to study oligopoly, we will specify the components as follows:

Players: The set of firms.
Actions: The set of outputs, or the set of prices, or the advertising budgets, or any other variable chosen by the firm, or any combination of these variables.
Payoffs: The firms' profits.

However, the notion of a strategic game can be—and has been—used to study a wide variety of situations, from tariff wars between countries to electoral competition to the design of legal regimes to sibling rivalry to the mating habits of hermaphroditic fish. In particular, the definition of a strategic game does not put any restrictions on the nature of the players' actions. For example, an action can be a single variable (like an output, or price), or can be a list of variables (like an (output, price) pair), or can be a complicated contingency plan (if X happens, choose x, while if Y happens, choose y, ...).

A list of actions, one for each player in the game, is called an action profile (or, sometimes, a strategy profile or strategy combination).

We can compactly represent a strategic game with two players in which each player has finitely many actions in a table, like the following one.

	L	R
T	2,2	0,3
B	3,0	1,1

This table represents a strategic game in which player 1's actions are T and B and player 2's actions are L and R. The first number in each box is player 1's payoff to the pair of actions that define the box, while the second number in each box is player 2's payoff to the pair of actions that define the box. Thus, for example, if player 1 chooses the action B and player 2 chooses the action L then player 1's payoff is 3 and player 2's payoff is 0.

Nash equilibrium

Definition

What actions will be chosen by the players in a strategic game? We assume that

each player chooses the action that is best for her, given her beliefs about the other players' actions.

How do players form beliefs about each other? We consider here the case in which every player is experienced: she has played the game sufficiently many times that she knows the actions the other players will choose. Thus we assume that

every player's belief about the other players' actions is correct.

The notion of equilibrium that embodies these two principles is called Nash equilibrium (after John Nash, who suggested it in the early 1950s). (The notion is sometimes referred to as a “Cournot-Nash equilibrium”.)

Definition: Nash equilibrium of strategic game: A Nash equilibrium of a strategic game is an action profile (list of actions, one for each player) with the property that no player can increase her payoff by choosing a different action, given the other players' actions.

Note that nothing in the definition suggests that a strategic game necessarily has a Nash equilibrium, or that if it does, it has only one. A strategic game may have no Nash equilibrium, may have a single Nash equilibrium, or may have many Nash equilibria.

Finding Nash equilibria: games with finitely many actions for each player

Consider the game

	L	R
T	2,2	0,3
B	3,0	1,1

There are four action profiles ((T,L), (T,R), (B,L), and (B,R)); we can examine each in turn to check whether it is a Nash equilibrium.

(T,L): By choosing B rather than T, player 1 obtains a payoff of 3 rather than 2, given player 2's action. Thus (T,L) is not a Nash equilibrium. [Player 2 also can increase her payoff (from 2 to 3) by choosing R rather than L.]
(T,R): By choosing B rather than T, player 1 obtains a payoff of 1 rather than 0, given player 2's action. Thus (T,R) is not a Nash equilibrium.
(B,L): By choosing R rather than L, player 2 obtains a payoff of 1 rather than 0, given player 1's action. Thus (B,L) is not a Nash equilibrium.
(B,R): Neither player can increase her payoff by choosing an action different from her current one. Thus this action profile is a Nash equilibrium.

We conclude that the game has a unique Nash equilibrium, (B,R).

Notice that in this equilibrium both players are worse off than they are in the action profile (T,L). Thus they would like to achieve (T,L); but their individual incentives point them to (B,R).

This game is called the Prisoner's dilemma; it has been used to model a wide variety of situations. The story that gives the game its name is the following. Two suspects in a major crime are in separate cells. There is enough evidence to convict each of them of a minor offense, but not enough evidence to convict either of them of the major crime unless one of them acts as an informer against the other (finks). If they are both quiet, each will be convicted of the minor offense and spend one year in prison. If one and only one of them finks, she will be freed and used as a witness against the other, who will spend four years in prison. If they both fink, each will spend three years in prison.

Assign each player the payoff of 0 for a four-year jail term, the payoff of 1 for a three-year term, the payoff of 2 for a one-year term, and the payoff of 3 for freedom, and associate T and B for player 1 with the actions Quiet and Fink, and L and R for player 2 with the actions Quiet and Fink. Then the game above represents this situation.

We conclude from our analysis of the Nash equilibrium of this game that the outcome will be that both players Fink and wind up in jail for three years.

Procedure for finding Nash equilibria of strategic game in which each player has finitely many actions: Check each action pair to see if it has the property that each player's action maximizes her payoff given the other players' actions.

Example: coordination between players with different preferences

Two firms are merging into two divisions of a large firm, and have to choose the computer system to use. In the past the firms have used different systems, I and A; each prefers the system it has used in the past. They will both be better off if they use the same system then if they continue to use different systems.

We can model this situation by the following two-player strategic game.

	I	A
I	2,1	0,0
A	0,0	1,2

To find the Nash equilibria of this game, we can examine each action profile in turn.

(I,I): Neither player can increase her payoff by choosing an action different from her current one. Thus this action profile is a Nash equilibrium.
(I,A): By choosing A rather than I, player 1 obtains a payoff of 1 rather than 0, given player 2's action. Thus this action profile is not a Nash equilibrium. [Also, player 2 can increase her payoff by choosing I rather than A.]
(A,I): By choosing I rather than A, player 1 obtains a payoff of 2 rather than 0, given player 2's action. Thus this action profile is not a Nash equilibrium. [Also, player 2 can increase her payoff by choosing A rather than I.]
(A,A): Neither player can increase her payoff by choosing an action different from her current one. Thus this action profile is a Nash equilibrium.

We conclude that the game has two Nash equilibria, (I,I) and (A,A).

Example: players with opposing preferences

An established firm and a newcomer to the market of fixed size have to choose the appearance for a product. Each firm can choose between two different appearances for the product; call them X and Y. The established producer prefers the newcomer's product to look different from its own (so that its customers will not be tempted to buy the newcomer's product) while the newcomer prefers that the products look alike.

We can model this situation by the following two-player strategic game.

	X	Y
X	2,1	1,2
Y	1,2	2,1

To find the Nash equilibria of this game, we can examine each action profile in turn.

(X,X): Firm 2 can increase its payoff from 1 to 2 by choosing the action Y rather than the action X. Thus this action profile is not a Nash equilibrium.
(X,Y): Firm 1 can increase its payoff from 1 to 2 by choosing the action Y rather than the action X. Thus this action profile is not a Nash equilibrium.
(Y,X): Firm 1 can increase its payoff from 1 to 2 by choosing the action X rather than the action Y. Thus this action profile is not a Nash equilibrium.
(Y,Y): Firm 2 can increase its payoff from 1 to 2 by choosing the action X rather than the action Y. Thus this action profile is not a Nash equilibrium.

We conclude that the game has no Nash equilibrium!

Finding Nash equilibria: best response functions

In a game in which each player has infinitely many possible actions, we cannot find a Nash equilibrium by examining all action profiles in turn. To develop an alternative method of finding Nash equilibria, we first reformulate the definition of a Nash equilibrium for a two-player game. (The general definition above applies to games with any number of players; for simplicity now I restrict to games with two players.)

Call the action of player 1 that maximizes her payoff, given that player 2's action is a₂, player 1's best response to a₂. Similarly, call the action of player 2 that maximizes her payoff, given that player 1's action is a₁, player 2's best response to a₁. (I am assuming that each player has a single best response.)

Given this definition of best responses, a pair (a₁, a₂) of actions is a Nash equilibrium if and only if

player 1's action a₁ is a best response to player 2's action a₂
and player 2's action a₂ is a best response to player 1's action a₁.

That is, in order to find a Nash equilibrium we need to find a pair (a₁, a₂) of actions such that a₁ is a best response to a₂, and vice versa.

If we denote player 1's best response to a₂ by b₁(a₂) and player 2's best response to a₁ by b₂(a₁) then we can write the condition for a Nash equilibrium more compactly:

the pair (a₁, a₂) of actions is a Nash equilibrium if and only if a₁ = b₁(a₂) and a₂ = b₂(a₁).

The method of finding the players' best response functions and then solving the two simultaneous equations is most useful when considering a game in which each player has infinitely many actions, but it can be applied also to a game in which each player has finitely many actions. Consider, for example, the Prisoner's dilemma:

	L	R
T	2,2	0,3
B	3,0	1,1

Player 1's best response to L is B, and her best response to R is also B. Similarly, player 2's best response to T is R and her best response to B is R. Thus we have

b₁(L) = B and b₁(R) = B
and b₂(T) = R and b₂(B) = R.

We see that the only pair of actions (a₁, a₂) with the property that a₁ = b₁(a₂) and a₂ = b₂(a₁) is (B,R): the Nash equilibrium that we found previously.

Procedure for finding Nash equilibria of strategic game using best response functions

Find each player's best response function by finding the action that maximizes its payoff for any given action of the other player. Denote the best response function of player i by b_i.
Find the pair (a₁, a₂) of actions with the property that player 1's action is a best response to player 2's action, and player 2's action is a best response to player 1's action: a₁ = b₁(a₂) and a₂ = b₂(a₁).

Example

Consider the strategic game in which

the players are two firms
each player can choose its amount of advertising (any nonnegative number)
if firm 1 chooses the amount a₁ of advertising and firm 2 chooses the amount a₂ of advertising then the payoff (profit) of firm 1 is
a₁(c + a₂ − a₁)
and the payoff (profit) of firm 2 is
u₂(a₁, a₂) = a₂(c + a₁ − a₂),
where c is a positive constant.

What are the Nash equilibria?

Find the firms' best response functions. To find the best response of firm 1 to any action a₂ of firm 2, fix a₂ and solve
max_a_₁a₁(c + a₂ − a₁).
The derivative is c + a₂ − 2a₁, so the maximizer is a₁ = (c + a₂)/2. Thus firm 1's best response function is given by
b₁(a₂) = (c + a₂)/2.
Similarly, firm 2's best response function is given by
b₂(a₁) = (c + a₁)/2.
A Nash equilibrium is a pair (a*₁,a*₂) such that a*₁ = b₁(a*₂) and a*₂ = b₂(a*₁). Thus a Nash equilibrium is a solution of the equations
a*₁ = (c + a*₂)/2
a*₂ = (c + a*₁)/2.
Substituting the second equation in the first equation, we get (a*₁, a*₂) = (c, c).

We conclude that the game has a unique Nash equilibrium, in which each firm's amount of advertising is c.

Your first name*
Your last name*
Your email address*
Comment*
Enter the first six letters of the alphabet*	(to help establish that you are human)