Holevo's theorem explained

Holevo's theorem is an important limitative theorem in quantum computing, an interdisciplinary field of physics and computer science. It is sometimes called Holevo's bound, since it establishes an upper bound to the amount of information that can be known about a quantum state (accessible information). It was published by Alexander Holevo in 1973.

Statement of the theorem

Suppose Alice wants to send a classical message to Bob by encoding it into a quantum state, and suppose she can prepare a state from some fixed set

\{\rho1,...,\rhon\}

, with the i-th state prepared with probability

pi

. Let

X

be the classical register containing the choice of state made by Alice. Bob's objective is to recover the value of

X

from measurement results on the state he received. Let

Y

be the classical register containing Bob's measurement outcome. Note that

Y

is therefore a random variable whose probability distribution depends on Bob's choice of measurement.

Holevo's theorem bounds the amount of correlation between the classical registers

X

and

Y

, regardless of Bob's measurement choice, in terms of the Holevo information. This is useful in practice because the Holevo information does not depend on the measurement choice, and therefore its computation does not require performing an optimization over the possible measurements.

More precisely, define the accessible information between

X

and

Y

as the (classical) mutual information between the two registers maximized over all possible choices of measurements on Bob's side:I_(X:Y) = \sup_ I(X:Y|\_i),where
B
I(X:Y|\{\Pi
i\}

i)

is the (classical) mutual information of the joint probability distribution given by

pij=pi

B
\operatorname{Tr}(\Pi
j

\rhoi)

. There is currently no known formula to analytically solve the optimization in the definition of accessible information in the general case. Nonetheless, we always have the upper bound:I_ (X : Y) \leq \chi(\eta) \equiv S\left(\sum_i p_i \rho_i\right) - \sum_i p_i S(\rho_i),where

η\equiv\{(pi,\rhoi)\}i

is the ensemble of states Alice is using to send information, and

S

is the von Neumann entropy. This

\chi(η)

is called the Holevo information or Holevo χ quantity.

Note that the Holevo information also equals the quantum mutual information of the classical-quantum state corresponding to the ensemble:\chi(\eta) = I\left(\sum_i p_i |i\rangle\!\langle i|\otimes \rho_i\right),with

I(\rhoAB)\equivS(\rhoA)+S(\rhoB)-S(\rhoAB)

the quantum mutual information of the bipartite state

\rhoAB

. It follows that Holevo's theorem can be concisely summarized as a bound on the accessible information in terms of the quantum mutual information for classical-quantum states.

Proof

Consider the composite system that describes the entire communication process, which involves Alice's classical input

X

, the quantum system

Q

, and Bob's classical output

Y

. The classical input

X

can be written as a classical register

\rhoX:=

n
\sum\nolimits
x=1

px|x\rangle\langlex|

with respect to some orthonormal basis
n
\{|x\rangle\}
x=1
. By writing

X

in this manner, the von Neumann entropy

S(X)

of the state

\rhoX

corresponds to the Shannon entropy

H(X)

of the probability distribution

\{px\}

n
x=1
:

S(X) =-\operatorname{tr}\left(\rhoXlog\rhoX\right) =

n
-\operatorname{tr}\left(\sum
x=1

pxlogpx|x\rangle\langlex|\right) =

n
-\sum
x=1

pxlogpx =H(X).

The initial state of the system, where Alice prepares the state

\rhox

with probability

px

, is described by

\rhoXQ:=

n
\sum
x=1

px|x\rangle\langlex|\rhox.

Afterwards, Alice sends the quantum state to Bob. As Bob only has access to the quantum system

Q

but not the input

X

, he receives a mixed state of the form

\rho:=

XQ
\operatorname{tr}
X\left(\rho

\right)=

n
\sum\nolimits
x=1

px\rhox

. Bob measures this state with respect to the POVM elements

\{Ey\}

m
y=1
, and the probabilities

\{qy\}

m
y=1
of measuring the outcomes

y=1,2,...,m

form the classical output

Y

. This measurement process can be described as a quantum instrument

l{E}Q(\rhox)=

m
\sum
y=1

qy|x\rhoy|x|y\rangle\langley|,

where

qy|x=\operatorname{tr}\left(Ey\rhox\right)

is the probability of outcome

y

given the state

\rhox

, while

\rhoy|x=W\sqrt{Ey}\rhox\sqrt{E

\dagger/q
y|x
for some unitary

W

is the normalised post-measurement state. Then, the state of the entire system after the measurement process is

\rhoXQ'Y:=\left[l{I}Xl{E}Q\right]\left(\rhoXQ\right)=

m
\sum
y=1

pxqy|x|x\rangle\langlex|\rhoy|x|y\rangle\langley|.

Here

l{I}X

is the identity channel on the system

X

. Since

l{E}Q

is a quantum channel, and the quantum mutual information is monotonic under completely positive trace-preserving maps,[1]

S(X:Q'Y)\leqS(X:Q)

. Additionally, as the partial trace over

Q'

is also completely positive and trace-preserving,

S(X:Y)\leqS(X:Q'Y)

. These two inequalities give

S(X:Y)\leqS(X:Q).

On the left-hand side, the quantities of interest depend only on

\rhoXY:=\operatorname{tr}Q'\left(\rhoXQ'Y\right)=

m
\sum
y=1

pxqy|x|x\rangle\langlex||y\rangle\langley| =

m
\sum
y=1

px,y|x,y\rangle\langlex,y|,

with joint probabilities

px,y=pxqy|x

. Clearly,

\rhoXY

and

\rhoY:=

XY
\operatorname{tr}
X(\rho

)

, which are in the same form as

\rhoX

, describe classical registers. Hence,

S(X:Y)=S(X)+S(Y)-S(XY)=H(X)+H(Y)-H(XY)=I(X:Y).

Meanwhile,

S(X:Q)

depends on the term

log\rhoXQ=

n
log\left(\sum
x=1

px|x\rangle\langlex|\rhox\right) =

n
\sum
x=1

|x\rangle\langlex|log\left(px\rhox\right) =

n
\sum
x=1

logpx|x\rangle\langlex|IQ+

n
\sum
x=1

|x\rangle\langlex|log\rhox,

where

IQ

is the identity operator on the quantum system

Q

. Then, the right-hand side is

\begin{aligned} S(X:Q)&=S(X)+S(Q)-S(XQ)\\ &=S(X)+S(\rho)+\operatorname{tr}\left(\rhoXQlog\rhoXQ\right)\\ &=S(X)+S(\rho)+

n
\operatorname{tr}\left(\sum
x=1

pxlogpx|x\rangle\langlex|\rhox\right)+

n
\operatorname{tr}\left(\sum
x=1

px|x\rangle\langlex|\rhoxlog\rhox\right)\\ &=S(X)+S(\rho)+

n
\underbrace{\operatorname{tr}\left(\sum
x=1

pxlogpx|x\rangle\langlex|\right)}-S(X)+

n
\operatorname{tr}\left(\sum
x=1

px\rhoxlog\rhox\right)\\ &=S(\rho)+

n
\sum
x=1

px\underbrace{\operatorname{tr}\left(\rhoxlog\rhox\right)}

-S(\rhox)

\\ &=S(\rho)-

n
\sum
x=1

pxS(\rhox), \end{aligned}

which completes the proof.

Comments and remarks

In essence, the Holevo bound proves that given n qubits, although they can "carry" a larger amount of (classical) information (thanks to quantum superposition), the amount of classical information that can be retrieved, i.e. accessed, can be only up to n classical (non-quantum encoded) bits. It was also established, both theoretically and experimentally, that there are computations where quantum bits carry more information through the process of the computation than is possible classically.[2]

See also

Further reading

Notes and References

  1. Book: Preskill, John . June 2016. Quantum Information . Chapter 10. Quantum Shannon Theory . https://authors.library.caltech.edu/66493/2/chap10_15.pdf . 23–24 . 30 June 2021.
  2. 10.1038/s41567-021-01271-7 . 17. 894–897 . Maslov. Dmitri. Kim. Jin-Sung. Bravyi. Sergey. Yoder. Theodore J.. Sheldon. Sarah. Quantum advantage for computations with limited space. Nature Physics. 2021-06-28 . 8. 2008.06478. 2021NatPh..17..894M. 221136153.