The Price of Anarchy (PoA)[1] is a concept in economics and game theory that measures how the efficiency of a system degrades due to selfish behavior of its agents. It is a general notion that can be extended to diverse systems and notions of efficiency. For example, consider the system of transportation of a city and many agents trying to go from some initial location to a destination. Here, efficiency means the average time for an agent to reach the destination. In the 'centralized' solution, a central authority can tell each agent which path to take in order to minimize the average travel time. In the 'decentralized' version, each agent chooses its own path. The Price of Anarchy measures the ratio between average travel time in the two cases.
Usually the system is modeled as a game and the efficiency is some function of the outcomes (e.g. maximum delay in a network, congestion in a transportation system, social welfare in an auction, etc.). Different concepts of equilibrium can be used to model the selfish behavior of the agents, among which the most common is the Nash equilibrium. Different flavors of Nash equilibrium lead to variations of the notion of Price of Anarchy as Pure Price of Anarchy (for deterministic equilibria), Mixed Price of Anarchy (for randomized equilibria), and Bayes–Nash Price of Anarchy (for games with incomplete information). Solution concepts other than Nash equilibrium lead to variations such as the Price of Sinking.[2]
The term Price of Anarchy was first used by Elias Koutsoupias and Christos Papadimitriou, but the idea of measuring inefficiency of equilibrium is older.[3] The concept in its current form was designed to be the analogue of the 'approximation ratio' in an approximation algorithm or the 'competitive ratio' in an online algorithm. This is in the context of the current trend of analyzing games using algorithmic lenses (algorithmic game theory).
Consider a game
G=(N,S,u)
N
Si
ui:S → R
S=S1 x ... x Sn
\operatorname{Welf}:S → R
\operatorname{Welf}(s)=\sumiui(s),
\operatorname{Welf}(s)=miniui(s),
We can define a subset
Equil\subseteqS
PoA=
maxs\operatorname{Welf | |
(s)}{min |
s\operatorname{Welf}(s)}
If, instead of a 'welfare' which we want to 'maximize', the function measure efficiency is a 'cost function'
\operatorname{Cost}:S → R
PoA=
maxs\operatorname{Cost | |
(s)}{min |
s\operatorname{Cost}(s)}
A related notion is that of the Price of Stability (PoS) which measures the ratio between the optimal 'centralized' solution and the 'best equilibrium':
PoS=
maxs\operatorname{Welf | |
(s)}{max |
s\operatorname{Welf}(s)}
or in the case of cost functions:
PoS=
mins\operatorname{Cost | |
(s)}{min |
s\operatorname{Cost}(s)}
We know that
1\leqPoS\leqPoA
Consider the 2x2 game called prisoner's dilemma, given by the following cost matrix:
Cooperate | Defect | ||
---|---|---|---|
Cooperate | 1, 1 | 7, 0 | |
Defect | 0, 7 | 5, 5 |
and let the cost function be
C(s1,s2)=u1(s1,s2)+u2(s1,s2).
Cequil=5+5=10
Cmin=1+1=2
Cequil/Cmin=10/2=5
Since the game has a unique Nash equilibrium, the PoS is equal to the PoA and it is 5 too.
A more natural example is the one of job scheduling. There are
N
M
Each machine has a speed
s1,\ldots,sM>0.
w1,\ldots,wN>0.
Ai=\{1,2,\ldots,M\}.
j
L | ||||||||||
|
.
The cost for player
i
ci(a)=L
ai |
(a),
MS(a)=maxjLj(a)
We consider two concepts of equilibrium: pure Nash and mixed Nash. It should be clear that mixed PoA ≥ pure PoA, because any pure Nash equilibrium is also a mixed Nash equilibrium (this inequality can be strict: e.g. when
N=2
w1=w2=1
M=2
s1=s2=1
\sigma1=\sigma2=(1/2,1/2)
\leq4/3
Claim. For each job scheduling game, there exists at least one pure-strategy Nash equilibrium.
Proof. We would like to take a socially optimal action profile
a*
M
We claim that this is a pure-strategy Nash equilibrium. Reasoning by contradiction, suppose that some player
i
j
k
k
j
j
j
a
Claim. For each job scheduling game, the pure PoA is at most
M
Proof. It is easy to upper-bound the welfare obtained at any mixed-strategy Nash equilibrium
\sigma
w(\sigma)\leq
\sumi{wi | |
Consider, for clarity of exposition, any pure-strategy action profile
a
w(a)\geq
\sumi{wi | |
Since the above holds for the social optimum as well, comparing the ratios
w(\sigma)
w(a)
See main article: Braess's paradox.
Consider a road network as shown in the adjacent diagram on which 4000 drivers wish to travel from point Start to End. The travel time in minutes on the Start–A road is the number of travelers (T) divided by 100, and on Start–B is a constant 45 minutes (likewise with the roads across from them). If the dashed road does not exist (so the traffic network has 4 roads in total), the time needed to drive Start–A–End route with
a
\tfrac{a}{100}+45
b
\tfrac{b}{100}+45
a+b=4000
a=b=2000
\tfrac{2000}{100}+45=65
Now suppose the dashed line A–B is a road with an extremely short travel time of approximately 0 minutes. Suppose that the road is opened and one driver tries Start–A–B–End. To his surprise he finds that his time is
\tfrac{2000}{100}+\tfrac{2001}{100}=40.01
\tfrac{2500}{100}+\tfrac{4000}{100}=65
45+\tfrac{4000}{100}=85
\tfrac{4000}{100}+\tfrac{4000}{100}=80
The routing problem introduced in the Braess's paradox can be generalized to many different flows traversing the same graph at the same time.
Definition (Generalized flow). Let
G=(V,E)
L
w
R=\{r1,r2,...,rk, | ri>0\}
\Gamma=\{(s1,t1),(s2,t2),...,(sk,tk)\}\subseteq(V x V)
f\Gamma,
p\mapsto\Re
p
si
ti
\in\Gamma
\sum | |
p:si → ti |
{fp}=ri \forall(si,ti)\in\Gamma.
The flow traversing a specific edge of
G
fe,\Gamma,=\sump:{fp}.
For succinctness, we write
fe
\Gamma,R
Definition (Nash-equilibrium flow). A flow
f\Gamma,
\forall(si,ti)\in\Gamma
\forallp,q
si
ti
fp>0 ⇒ \sume{le(fe)}\leq\sume{le(fe)}.
This definition is closely related to what we said about the support of mixed-strategy Nash equilibria in normal-form games.
Definition (Conditional welfare of a flow). Let
f\Gamma,
* | |
f | |
\Gamma,R |
G
\Gamma
R
f
f*
f
wf(f*)=\sume
* | |
{f | |
e |
⋅ le(fe)}
Fact 1. Given a Nash-equilibrium flow
f
f*
w(f)=wf(f)\leqwf(f*)
Proof (By contradiction). Assume that
wf(f*)<wf(f)
k | |
\sum | |
i=1 |
\sum | |
p:si → ti |
* | |
f | |
p |
⋅ \sumele(fe)<
k | |
\sum | |
i=1 |
\sum | |
p:si → ti |
fp ⋅ \sumele(fe)
f
f*
\Gamma,R
\sum | |
p:si → ti |
fp=
\sum | |
p:si → ti |
* | |
f | |
p |
=ri \foralli.
Therefore, there must be a pair
(si,ti)
p,q
si
ti
* | |
f | |
p |
>fp
* | |
f | |
q |
<fq
\sumele(fe)<\sumele(fe).
In other words, the flow
f*
f
si
ti
f*
f
f
Note that Fact 1 does not assume any particular structure on the set
L
Fact 2. Given any two real numbers
x
y
x ⋅ y\leqx2+y2/4
Proof. This is another way to express the true inequality
(x-y/2)2\geq0
Theorem. The pure PoA of any generalized routing problem
(G,L)
\leq4/3
Proof. Note that this theorem is equivalent to saying that for each Nash-equilibrium flow
f
w(f)\leq(4/3) ⋅
min | |
f* |
\{w(f*)\}
f*
wf(f*)=\sume
* | |
f | |
e |
(ae ⋅ fe+be)
=\sume(aefe
* | |
f | |
e |
)+\sume
* | |
f | |
e |
be.
By using Fact 2, we have that
wf(f*)\leq\sume\left(ae ⋅ \left(
* | |
(f | |
e |
)2+
2 | |
(f | |
e) |
/4\right)\right)+\sume
* | |
f | |
e |
⋅ be
=\left(\sumeae(f
* | |
e |
)2+
* | |
f | |
e |
be\right)+\sumeae
2 | |
(f | |
e) |
/4
\leqw(f*)+
w(f) | |
4 |
,
since
(1/4) ⋅ w(f)=(1/4) ⋅ \sumefe(aefe+be)
=(1/4) ⋅ \sume(fe)2+\underbrace{(1/4) ⋅ \sumefebe
We can conclude that
wf(f*)\leqw(f*)+w(f)/4
Note that in the proof we have made extensive use of the assumption that the functions in
L
Theorem. Given a generalized routing problem with graph
G
d
\leqd+1
Note that the PoA can grow with
d
x=1-1/{\sqrt{d+1}}
w=\left(1-
1 | |
\sqrt{d+1 |
=\left(\left(1-
1 | |
\sqrt{d+1 |
\leqe-\sqrt{d+1
This quantity tends to zero when
d
PoA upper bounds can be obtained if the game is shown to satisfy a so-called smoothness inequality. More precisely, a cost-minimimization game is (λ,μ)-smooth (with λ ≥ 0 and μ < 1) if the inequality
n | |
\sum | |
i=1 |
Ci\left(a
* | |
i |
,a-i\right)\leqλC\left(a*\right)+\muC(a)
holds for any outcome a and a*. In this case, the PoA is upper bounded by λ/(1 − μ).[4]
For cost-sharing games with concave cost functions, the optimal cost-sharing rule that optimizes the price of anarchy, followed by the price of stability, is precisely the Shapley value cost-sharing rule.[5] (A symmetrical statement is similarly valid for utility-sharing games with convex utility functions.) In mechanism design, this means that the Shapley value solution concept is optimal for these sets of games.
Moreover, for these (finite) games it was proven that every equilibrium which achieves the PoA bound is fragile, in the sense that the agents demonstrate a state of indifference between their equilibrium action and the action they would pursue in a system-optimal outcome.[6]