The three prisoners problem appeared in Martin Gardner's "Mathematical Games" column in Scientific American in 1959.[1] [2] It is mathematically equivalent to the Monty Hall problem with car and goat replaced respectively with freedom and execution.[3]
Three prisoners, A, B, and C, are in separate cells and sentenced to death. The governor has selected one of them at random to be pardoned. The warden knows which one is pardoned, but is not allowed to tell. Prisoner A begs the warden to let him know the identity of one of the two who are going to be executed. "If B is to be pardoned, give me C's name. If C is to be pardoned, give me B's name. And if I'm to be pardoned, secretly flip a coin to decide whether to name B or C."
The warden tells A that B is to be executed. Prisoner A is pleased because he believes that his probability of surviving has gone up from to, as it is now between him and C. Prisoner A secretly tells C the news, who reasons that A's chance of being pardoned is unchanged at, but he is pleased because his own chance has gone up to . Which prisoner is correct?
The answer is that prisoner A did not gain any information about his own fate, since he already knew that the warden would give him the name of someone else. Prisoner A, prior to hearing from the warden, estimates his chances of being pardoned as, the same as both B and C. As the warden says B will be executed, it is either because C will be pardoned (chance), or A will be pardoned (chance) and the coin to decide whether to name B or C the warden flipped came up B (chance; for an overall × = chance B was named because A will be pardoned). Hence, after hearing that B will be executed, the estimate of A's chance of being pardoned is half that of C. This means his chances of being pardoned, now knowing B is not, again are, but C has a chance of being pardoned.
The explanation above may be summarised in the following table. As the warden is asked by A, he can only answer B or C to be executed (or "not pardoned").
Being pardoned | Warden: "not B" | Warden: "not C" | Sum |
---|---|---|---|
A | 1/6 | 1/6 | 1/3 |
B | 0 | 1/3 | 1/3 |
C | 1/3 | 0 | 1/3 |
As the warden has answered that B will not be pardoned, the solution comes from the second column "not B". It appears that the odds for A vs. C to be pardoned are 1:2.
Call
A
B
C
b
\begin{align} P(A|b)&=
P(b|A)P(A) | |
P(b|A)P(A)+P(b|B)P(B)+P(b|C)P(C) |
\\ &=
\tfrac12 x \tfrac13 | |
\tfrac12 x \tfrac13+0 x \tfrac13+1 x \tfrac13 |
=
13. \end{align} | |
The probability of C being pardoned, on the other hand, is:
\begin{align} P(C|b)&=
P(b|C)P(C) | |
P(b|A)P(A)+P(b|B)P(B)+P(b|C)P(C) |
\\ &=
1 x \tfrac13 | |
\tfrac12 x \tfrac13+0 x \tfrac13+1 x \tfrac13 |
=
23. \end{align} | |
The crucial difference making A and C unequal is that
P(b|A)=\tfrac12
P(b|C)=1
P(b|A)=\tfrac12
P(b|C)=1
Prisoner A only has a chance of pardon. Knowing whether B or C will be executed does not change his chance. After he hears B will be executed, Prisoner A realizes that if he will not get the pardon himself it must only be going to C. That means there is a 2/3 chance for C to get a pardon. This is comparable to the Monty Hall problem.
The following scenarios may arise:
With the stipulation that the warden will choose randomly, in the of the time that A is to be pardoned, there is a chance he will say B and chance he will say C. This means that taken overall, of the time ([that A is pardoned] × [that warden says B]), the warden will say B because A will be pardoned, and of the time ([that A is pardoned] × [that warden says C]) he will say C because A is being pardoned. This adds up to the total of of the time (+) A is being pardoned, which is accurate.
It is now clear that if the warden answers B to A (of all cases), then of the time C is pardoned and A will still be executed (case 4), and only of the time A is pardoned (case 1). Hence C's chances are / = and A's are / = .
The key to this problem is that the warden may not reveal the name of a prisoner who will be pardoned. If we eliminate this requirement, it can demonstrate the original problem in another way. The only change in this example is that prisoner A asks the warden to reveal the fate of one of the other prisoners (not specifying one that will be executed). In this case, the warden flips a coin and chooses one of B and C to reveal the fate of. The cases are as follows:
Each scenario has a probability. The original three prisoners problem can be seen in this light: The warden in that problem still has these six cases, each with a probability of occurring. However, the warden in the original case cannot reveal the fate of a pardoned prisoner. Therefore, in case 3 for example, since saying "B is pardoned" is not an option, the warden says "C is executed" instead (making it the same as case 4). That leaves cases 4 and 5 each with a probability of occurring and leaves us with the same probability as before.
The tendency of people to provide the answer 1/2 is likely due to a tendency to ignore context that may seem unimpactful. For example, how the question is posed to the warden can affect the answer. This can be shown by considering a modified case, where
P(A)= | 1 |
4 |
,P(B)=
1 | |
4 |
,P(C)=
1 | |
2 |
\begin{align} P(A|b)&=
\tfrac12 x \tfrac14 | |
\tfrac12 x \tfrac14+0 x \tfrac14+1 x \tfrac12 |
=
15. \end{align} | |
However, if A simply asks if B will be executed, and the warden responds with "yes", the probability that A is pardoned becomes:
\begin{align} P(A|b)&=
1 x \tfrac14 | |
1 x \tfrac14+0 x \tfrac14+1 x \tfrac12 |
=
13. \end{align} | |
A similar assumption is that A plans beforehand to ask the warden for this information. A similar case to the above arises if A does not plan to ask the warden anything and the warden simply informs him that he will be executing B.[5]
Another likely overlooked assumption is that the warden has a probabilistic choice. Let us define
p
P(A|b)
\begin{align} P(A|b)&=
p | |
p+1 |
\end{align}
If we assume that
p=1
P(A|b)= | 12 |
p= | 12 |
P(A|b)= | 13 |
Judea Pearl (1988) used a variant of this example to demonstrate that belief updates must depend not merely on the facts observed but also on the experiment (i.e., query) that led to those facts.[7]