Atkinson–Stiglitz theorem explained

The Atkinson–Stiglitz theorem is a theorem of public economics. It implies that no indirect taxes need to be employed where the utility function is separable between labor and all commodities. Non-linear income taxation can be used by the government and was in a seminal article by Joseph Stiglitz and Anthony Atkinson in 1976.[1] The Atkinson–Stiglitz theorem is an important theoretical result in public economics, spawning a broad literature that delimited the conditions under which the theorem holds. For example, Emmanuel Saez, a French-American professor and economist demonstrated that the Atkinson–Stiglitz theorem does not hold if households have heterogeneous preferences rather than homogeneous ones.[2] [3]

In practice, the Atkinson–Stiglitz theorem has often been invoked in the debate on optimal capital income taxation. As capital income taxation can be interpreted as the taxation of future consumption over the taxation of present consumption, the theorem implies that governments should abstain from capital income taxation if non-linear income taxation were an option since capital income taxation would not improve equity by comparison to the non-linear income tax, while additionally distorting savings.

Optimal taxation

For an individual whose wage is

w

, the budget constraint is calculated by

\sumjqjxj=\sumj(xj+tj(xj))=wL-T(wL),

where

qi

and

xi

are the price and the purchase of the

i

-th commodity, respectively. To maximize the utility function, the first-order condition is:

Uj=

(1+t'j)(-UL)
w(1-T')

(j=1,2,...,N).

The government maximizes the social welfare function, and so
infty
\int
0

\left[wL-\sumjxj-\overline{R}\right]dF=0.

Then a density function

f

is used to express the Hamiltonian:

H=\left[G(U)-λ\left\lbracewL-\sumjxj-\overline{R}\right\rbrace\right]f-\mu\thetaUL.

Taking its variation about

xj

, the condition for its maximum is used.

-λ\left[\left(

\partialx1
\partialxj

\right)U+1\right]-

\mu\theta
f

\left[

\partial2U
\partialx1\partialL

\left(

\partialx1
\partialxj

\right)U+

\partial2U
\partialxj\partialL

\right]=0.

Then the following relation holds:

\left(

\partialx1
\partialxj

\right)U=-

Uj
U1

=-

1+t'j
1+t'1

.

Substituting this relation into the above condition yields:

λ\left[

1+t'j
1+t'1

-1\right]=

\mu\thetaUj
f

\left[

\partial2U
\partialL\partialxj

1
Uj

-

\partial2U
\partialL\partialx1

1
U1

\right]=

\mu\thetaUj
f
\partial
\partialL

\left(ln{Uj

} - \ln \right) \;,and the following is obtained:

λ\left[

1+t'j
1+t'1

-1\right]=

\mu\thetaUj
f
\partial
\partialL

\left(ln{

Uj
U1

}\right).

Note that there is no loss of generality in setting

t'1

zero, therefore

t'1=0

is put. Since

Uj=(1+t'j)\alpha

,
t'j
1+t'j

=

\mu\theta\alpha
λf
\partial
\partialL

\left(ln{

Uj
U1

}\right).

Thus, no indirect taxation needs to be employed, i.e.

tj=0

, provided that the utility function is weakly separable between labor and all consumption goods.

Other Approaches

Joseph Stiglitz explains why indirect taxation is unnecessary, viewing the Atkinson–Stiglitz theorem from a different perspective.[4]

Basic concepts

Suppose that those who are in category 2 are the more able. Then, two conditions are imposed for Pareto efficient taxation at which a government aims. The first condition is that the utility of category 1 is equal to or more than a given level:

\overline{U}1\leV1(C1,Y1).

The second condition is that the government revenue

R

, which is equal to or more than the revenue requirement

\overline{R}

, is increased by a given amount:

R=-(C1-Y1)N1-(C2-Y2)N2,

\overline{R}\leR,

where

N1

and

N2

indicate the number of individuals of each type. Under these conditions, the government needs to maximize the utility

V2(C2,Y2)

of category 2. Then writing down the Lagrange function for this problem:

l{L}=V2(C2,Y2)+\muV1(C1,Y1)+λ2(V2(C2,Y2)-V2(C1,Y1))+λ1(V1(C1,Y1)-V1(C2,Y2))+\gamma\left(-(C1-Y1)N1-(C2-Y2)N2-\overline{R}\right),

ensuring the satisfaction of the self-selection constraints, the first-order conditions are:

\mu

\partialV1
\partialC1

-λ2

\partialV2
\partialC1

+λ1

\partialV1
\partialC1

-\gammaN1=0,

\mu

\partialV1
\partialY1

-λ2

\partialV2
\partialY1

+λ1

\partialV1
\partialY1

+\gammaN1=0,

\partialV2
\partialC2

+λ2

\partialV2
\partialC2

-λ1

\partialV1
\partialC2

-\gammaN2=0,

\partialV2
\partialY2

+λ2

\partialV2
\partialY2

-λ1

\partialV1
\partialY2

+\gammaN2=0.

For the case where

λ1=0

and

λ2=0

:
\partialVi/\partialYi
\partialVi/\partialCi

+1=0,

for

i=1,2

, therefore the government can achieve a lump-sum taxation. For the case where

λ1=0

and

λ2>0

:
\partialV2/\partialY2
\partialV2/\partialC2

+1=0,

the marginal tax rate for category 2 is zero. As to category 1:
\partialV1/\partialY2
\partialV1/\partialC1

=-

1-λ2(\partialV2/\partialY1)/N1\gamma
1+λ2(\partialV2/\partialC1)/N1\gamma

.

If

\deltai=

\partialVi/\partialY1
\partialVi/\partialC1

,(i=1,2)

, the marginal tax rate for category 1 is

\delta1+1

.

Also, note the following equation:

\delta1=-\left(

1-\nu\delta2
1+\nu

\right),

where

\nu

is denoted by:

\nu=

λ2(\partialV2/\partialC1)
N1\gamma

.

Therefore, by assumption,

\delta1<\delta2

, and so

-1<\delta1<\delta2

can be directly proven. Accordingly, the marginal tax rate for category 1 is positive.

For the case where

λ1>0

, and

λ2=0

, the marginal tax rate for category 2 is negative. The lump-sum tax imposed on an individual of category 1 would become larger than that for category 2 if the lump-sum tax were feasible.

Various commodities

Consider a case where income level and several commodities are observable. Each individual's consumption function is expressed in a vector form as:

bf{C}1=\sumjC1jbf{e}j

bf{C}2=\sumjC2jbf{e}j.

In this case, the government's budget constraint is:

R\leq

2
\sum
k=1

(YkNk)-N1\sumjC1j-N2\sumjC2j.

Then:

\mu

\partialV1
\partialC1j

-λ2

\partialV2
\partialC1j

+λ1

\partialV1
\partialC1j

-\gammaN1=0,

\mu

\partialV1
\partialY1

-λ2

\partialV2
\partialY1

+λ1

\partialV1
\partialY1

+\gammaN1=0,

\partialV2
\partialC2j

+λ2

\partialV2
\partialC2j

-λ1

\partialV1
\partialC2j

-\gammaN2=0,

\partialV2
\partialY2

+λ2

\partialV2
\partialY2

-λ1

\partialV1
\partialY2

+\gammaN2=0.

Here,

λ1=0

and

λ2>0

. Therefore, it follows that:
\partialV2
\partialC2j
\partialV2
\partialC2n

=1,

\partialV2
\partialC2j
\partialV2
\partialY2

=1.

Suppose all individuals have the same indifference curve in C-L plane. The separability between leisure and consumption can be expressed as:
\partial2Uk
\partialCkj\partialLk

=0,

yielding
\partialV1
\partialC1j

=

\partialV2
\partialC1j

.

As a result:
\partialV1
\partialC1j
\partialV1
\partialC1n

=1.

Thus, Stiglitz stated it is unnecessary to impose taxes on commodities.

Conditions for randomization

Consider a scenario in which individuals with high abilities, who typically earn higher incomes as a reflection of their skills, downplay their abilities. In this case, it could be argued that the government needs to randomize the taxes imposed on the low ability individuals, to increase the effectiveness of screening. It is possible that under certain conditions the taxes can be randomized without damaging the low-ability individuals. For the case where an individual chooses to show their ability, a tax schedule is related to

\lbrace

*
C
2

,

*
Y
2

\rbrace

. For the case where an individual chooses to hide their ability, there are two tax schedule possibilities:

\lbrace

*
C
1

,

*
Y
1

\rbrace

and

\lbrace

**
C
1

,

**
Y
1

\rbrace

. The randomization is done so that the risk of the former case should differ from that of the latter.

To avoid hitting the low ability group, the mean consumption must be shifted upwards at each

Y

. As the consumption is maximized, a higher

\overline{C}1

is set for a higher

\overline{Y}1

. Then the relations between those variables are:
*
C
1

=\overline{C}1+h,

*
Y
1

=\overline{Y}1+λh

**
C
1

=\overline{C}1-h,

**
Y
1

=\overline{Y}1-λh.

The utility function is

V2(

*
C
1

,

*
Y
1

)

and

V2

**
(C
1

,

**
Y
1

)

, therefore the condition for the optimum is:
V
2C*

(d\overline{C}1+dh)+

V
2Y*

(d\overline{Y}1+λdh)+

V
2C**

(d\overline{C}1-dh)+

V
2Y**

(d\overline{Y}1-λdh)=0,

and likewise:
V
1C*

(d\overline{C}1+dh)+

V
1Y*

(d\overline{Y}1+λdh)+

V
1C**

(d\overline{C}1-dh)+

V
1Y**

(d\overline{Y}1-λdh)=0.

And accordingly:

\begin{bmatrix} SV2&SV2\\ SV1&SV1\end{bmatrix} \begin{bmatrix} d\overline{C}\\ d\overline{Y} \end{bmatrix}=-\begin{bmatrix} DV2C+λDV2\\ DV1C+λDV1\end{bmatrix}dh,

where

SVk=

V
kC*

+

V
kC**

and

SVk=

V
kY*

+

V
kY**

and

k=1,2

. Similarly,

DVk=

V
kC*

-

V
kC**

and

DVk=

V
kY*

-

V
kY**

.

Then:

\limh

d(\overline{Y
-

\overline{C})}{dh}=

F1-F2
(-2)(MRS1-MRS2)

,

where

MRSk=-(

\partialVk
\partialC1

)-1

\partialVk
\partialY1

. As to

F1,F2

are denoted by

F1=(

\partialV2
\partialC1

)-1M2(1-MRS1)

and

F2=(

\partialV1
\partialC1

)-1M1(1-MRS2)

. Also,

Mk

is defined by

Mk=DVk+λDVk

. The first derivative of

\overline{Y}-\overline{C}

with regard to

h

, at

h=0

, is zero because

Mk=0

, and so its second derivative needs to be calculated.
d2(\overline{Y
-

\overline{C})}{dh2

} = H_ + H_ \;, where

H1=

d(F1-F2)
dh
1
-2(MRS1-MRS2)

and

H2=(-1)

d(\overline{Y
-

\overline{C})}{dh}

dln{(-2)(MRS1-MRS2)
}{d

h}

. And so

H2

disappears at

h=0

. Then:
d2(\overline{Y
-

\overline{C})}{dh2

} = \frac \; \; .

I1=(V2CC+2λV2+λ2V2)(

\partialV2
\partialC1

)-1(1-MRS1)

I2=(-1)(V1CC+2λV1+λ2V1)(

\partialV1
\partialC1

)-1(1-MRS2)

Since

MRS2<MRS1<1

, the condition under which randomization is desirable is calculated:

(V2CC+2λV2+λ2V2)(

V
1C1

+

V
2Y1

)-(V1CC+2λV1+λ2V2)(

V
2C1

+

V
2Y1

)<0.

See also

Notes and References

  1. Atkinson . A. B. . Stiglitz . J. E. . 1976 . The Design of Tax Structure: Direct versus Indirect Taxation . . 6 . 1–2 . 55–75 [p. 74] . 10.1016/0047-2727(76)90041-4 .
  2. Saez . E. . 2002 . The Desirability of Commodity Taxation under Non-linear Income Taxation and Heterogeneous Tastes . Journal of Public Economics . 83 . 2 . 217–230 . 10.1016/S0047-2727(00)00159-6 .
  3. Book: Boadway . R. W. . Pestieau . P. . 2003 . Indirect Taxation and Redistribution: The Scope of the Atkinson-Stiglitz Theorem . Economics for an Imperfect World: Essays in Honor of Joseph E. Stiglitz . MIT Press . 0-262-01205-7 . 387–403 .
  4. J.E. Stiglitz, Journal of Public Economics, 17 (1982) 213-124, North-Holland