The van Cittert–Zernike theorem, named after physicists Pieter Hendrik van Cittert and Frits Zernike,[1] is a formula in coherence theory that states that under certain conditions the Fourier transform of the intensity distribution function of a distant, incoherent source is equal to its complex visibility.[2] [3] This implies that the wavefront from an incoherent source will appear mostly coherent at large distances. Intuitively, this can be understood by considering the wavefronts created by two incoherent sources. If we measure the wavefront immediately in front of one of the sources, our measurement will be dominated by the nearby source. If we make the same measurement far from the sources, our measurement will no longer be dominated by a single source; both sources will contribute almost equally to the wavefront at large distances.
This reasoning can be easily visualized by dropping two stones in the center of a calm pond. Near the center of the pond, the disturbance created by the two stones will be very complicated. As the disturbance propagates towards the edge of the pond, however, the waves will smooth out and will appear to be nearly circular.
The van Cittert–Zernike theorem has important implications for radio astronomy. With the exception of pulsars and masers, all astronomical sources are spatially incoherent. Nevertheless, because they are observed at distances large enough to satisfy the van Cittert–Zernike theorem, these objects exhibit a non-zero degree of coherence at different points in the imaging plane. By measuring the degree of coherence at different points in the imaging plane (the so-called "visibility function") of an astronomical object, a radio astronomer can thereby reconstruct the source's brightness distribution and make a two-dimensional map of the source's appearance.
Consider two very distant parallel planes, both perpendicular to the line of sight, and let's call them source plane and observation plane; If
\Gamma12(u,v,0)
\Gamma12(u,v,0)=\iintI(l,m)e-2\pidldm
where
l
m
u
v
I
This theorem will remain confusing to some engineers or scientists because of its statistical nature and difference from simple correlation or even covariance processing methods. A good reference (which still might not clarify the issue for some users, but does have a great sketch to drive the method home) is Goodman, starting on page 207.[7]
E(t)
\Gamma12(\tau)=\limT
1 | |
2T |
T | |
\int | |
-T |
E1(t)
*(t-\tau) | |
E | |
2 |
dt
where
\tau
E(t)
\tau
Normalization of the mutual coherence function to the product of the square roots of the intensities of the two electric fields yields the complex degree of (second-order) coherence (correlation coefficient function):
\gamma12(\tau)=
\Gamma12(\tau) | |
\sqrt{I1 |
\sqrt{I2}}
Let
XY
xy
P1
P2
(l,m)
P1
P2
P1
E1(l,m,t)=A\left(l,m,t-
R1 | |
c |
\right)
| ||||||||||
R1 |
where
R1
P1
\omega
A
P2
E2(l,m,t)=A\left(l,m,t-
R2 | |
c |
\right)
| ||||||||||||
R2 |
Let us now calculate the time-averaged cross-correlation between the electric field at
P1
P2
\langleE1(l,m,t)
*(l, | |
E | |
2 |
m,t) \rangle= \langleA\left(l,m,t-
R1 | |
c |
\right)A*\left(l,m,t-
R2 | |
c |
\right) \rangle x
| ||||||||||
R1 |
x
| ||||||||||
R2 |
Because the quantity in the angle brackets is time-averaged an arbitrary offset to the temporal term of the amplitudes may be added as long as the same offset is added to both. Let us now add
R1 | |
c |
\langleE1(l,m,t)
*(l, | |
E | |
2 |
m,t) \rangle= \langleA(l,m,t)A*\left(l,m,t-
R2-R1 | |
c |
\right) \rangle x
| ||||||||||||
R1R2 |
But if the source is in the far field then the difference between
R1
R2
t
t
P1
P2
\langleE1(l,m,t)
*(l, | |
E | |
2 |
m,t)\rangle=\langleA(l,m,t)A*(l,m,t)\rangle x
| ||||||||||
R1R2 |
Now,
\langleA(l,m,t)A*(l,m,t)\rangle
I(l,m)
\langleE1(l,m,t)
*(l, | |
E | |
2 |
m,t)\rangle=I(l,m)
| ||||||||||
R1R2 |
To calculate the mutual coherence function from this expression, simply integrate over the entire source.
\Gamma12(u,v,0)=\iintrm{source
Note that cross terms of the form
\langleA1(l,m,t)
* | |
A | |
2 |
(l,m,t)\rangle
Next rewrite the
R2-R1
u,v,l
m
P1=(x1,y1)
P2=(x2,y2)
R1=\sqrt{R2+
2 | |
x | |
1 |
+
2} | |
y | |
1 |
R2=\sqrt{R2+
2 | |
x | |
2 |
+
2} | |
y | |
2 |
where
R
R1
R2
R2-R1=R\sqrt{1+
| |||||||
R2 |
+
| |||||||
R2 |
But because
x1,x2,y1
y2
R
R2-R1=R\left(1+
1 | |
2 |
\left(
| ||||||||||||||||
R2 |
\right)\right)-R\left(1+
1 | |
2 |
\left(
| ||||||||||||||||
R2 |
\right)\right)
which, after some algebraic manipulation, simplifies to
R2-R1=
1 | |
2R |
\left((x2-x1)(x2+x1)+(y2-y1)(y2+y1)\right)
Now,
1 | |
2 |
(x2+x1)
x
P1
P2
1 | |
2R |
(x2+x1)
l
m=
1 | |
2R |
(y2+y1)
u
x
P1
P2
u=
\omega | |
2\pic |
(x1-x2)
Similarly,
v
P1
P2
y
v=
\omega | |
2\pic |
(y1-y2)
Hence
R2-R1=
2\pic | |
\omega |
(ul+vm)
Because
x1,x2,y1,
y2
R
R1\simeqR2\simeqR
dS
R2dldm
\Gamma12(u,v,0)=\iintrm{source
Which reduces to
\Gamma12(u,v,0)=\iintrm{source
But the limits of these two integrals can be extended to cover the entire plane of the source as long as the source's intensity function is set to be zero over these regions. Hence,
\Gamma12(u,v,0)=\iintI(l,m)e-2dldm
which is the two-dimensional Fourier transform of the intensity function. This completes the proof.
The van Cittert–Zernike theorem rests on a number of assumptions, all of which are approximately true for nearly all astronomical sources. The most important assumptions of the theorem and their relevance to astronomical sources are discussed here.
A spatially coherent source does not obey the van Cittert–Zernike theorem. To see why this is, suppose we observe a source consisting of two points,
a
b
P1
P2
P1
E1=Ea1+Eb1
and at
P2
E2=Ea2+Eb2
so the mutual coherence function is
\langleE1(t)
*(t | |
E | |
2 |
-\tau)\rangle=\langle(Ea1(t)+Eb1(t))(
*(t | |
E | |
a2 |
-\tau)+
*(t | |
E | |
b2 |
-\tau))\rangle
Which becomes
\langleE1(t)
*(t | |
E | |
2 |
-\tau)\rangle=\langleEa1(t)
*(t | |
E | |
a2 |
-\tau)\rangle+\langleEa1(t)
*(t | |
E | |
b2 |
-\tau)\rangle+\langleEb1(t)
*(t | |
E | |
a2 |
-\tau)\rangle+\langleEb1(t)
*(t | |
E | |
b2 |
-\tau)\rangle
If points
a
b
This assumption holds for most astronomical sources. Pulsars and masers are the only astronomical sources which exhibit coherence.
In the proof of the theorem we assume that
R\ggx1-x2
R\ggy1-y2
D
R\gg
D2 | |
λ |
Using a reasonable baseline of 20 km for the Very Large Array at a wavelength of 1 cm, the far field distance is of order
4 x 1010
In the derivation of the van Cittert–Zernike theorem we write the direction cosines
l
m
1 | |
2 |
(x1+x2)/R
1 | |
2 |
(y1+y2)/R
R\gg
1 | |
2 |
(x1+x2)
R\gg
1 | |
2 |
(y1+y2)
Because most astronomical sources subtend very small angles on the sky (typically much less than a degree), this assumption of the theorem is easily fulfilled in the domain of radio astronomy.
The van Cittert–Zernike theorem assumes that the source is quasi-monochromatic. That is, if the source emits light over a range of frequencies,
\Delta\nu
\nu
\Delta\nu | |
\nu |
\lesssim1
Moreover, the bandwidth must be narrow enough that
\Delta\nu | |
\nu |
\ll
1 | |
lu |
where
l
u
(R2-R1)/c
t
This requirement implies that a radio astronomer must restrict signals through a bandpass filter. Because radio telescopes almost always pass the signal through a relatively narrow bandpass filter, this assumption is typically satisfied in practice.
We assume that our source lies in a two-dimensional plane. In reality, astronomical sources are three-dimensional. However, because they are in the far field, their angular distribution does not change with distance. Therefore, when we measure an astronomical source, its three-dimensional structure becomes projected upon a two-dimensional plane. This means that the van Cittert–Zernike theorem may be applied to measurements of astronomical sources, but we cannot determine structure along the line of sight with such measurements.
The van Cittert–Zernike theorem assumes that the medium between the source and the imaging plane is homogeneous. If the medium is not homogeneous then light from one region of the source will be differentially refracted relative to other regions of the source due to the difference in light travel time through the medium. In the case of a heterogeneous medium one must use a generalization of the van Cittert–Zernike theorem, called Hopkins's formula.
Because the wavefront does not pass through a perfectly uniform medium as it travels through the interstellar (and possibly intergalactic) medium and into the Earth's atmosphere, the van Cittert–Zernike theorem does not hold exactly true for astronomical sources. In practice, however, variations in the refractive index of the interstellar and intergalactic media and Earth's atmosphere are small enough that the theorem is approximately true to within any reasonable experimental error. Such variations in the refractive index of the medium result only in slight perturbations from the case of a wavefront traveling through a homogeneous medium.
Suppose we have a situation identical to that considered when the van Cittert–Zernike theorem was derived, except that the medium is now heterogeneous. We therefore introduce the transmission function of the medium,
K(l,m,P,\nu)
\Gamma12(l,m,0)=λ2\iintI(l,m)K(l,m,P1,\nu)K*(l,m,P2,\nu)dS
If we define
U(l,m,P1)\equiviλK(l,m,P1,\nu)\sqrt{I(l,m)}
then the mutual coherence function becomes
\Gamma12(l,m,0)=\iintU(l,m,P1)U*(l,m,P2)dS
which is Hopkins's generalization of the van Cittert–Zernike theorem.[8] In the special case of a homogeneous medium, the transmission function becomes
K(l,m,P,\nu)=-
ieikR | |
λR |
in which case the mutual coherence function reduces to the Fourier transform of the brightness distribution of the source. The primary advantage of Hopkins's formula is that one may calculate the mutual coherence function of a source indirectly by measuring its brightness distribution.
The van Cittert–Zernike theorem is crucial to the measurement of the brightness distribution of a source. With two telescopes, a radio astronomer (or an infrared or submillimeter astronomer) can measure the correlation between the electric field at the two dishes due to some point from the source. By measuring this correlation for many points on the source, the astronomer can reconstruct the visibility function of the source. By applying the van Cittert–Zernike theorem, the astronomer can then take the inverse Fourier transform of the visibility function to discover the brightness distribution of the source. This technique is known as aperture synthesis or synthesis imaging.
In practice, radio astronomers rarely recover the brightness distribution of a source by directly taking the inverse Fourier transform of a measured visibility function. Such a process would require a sufficient number of samples to satisfy the Nyquist sampling theorem; this is many more observations than are needed to approximately reconstruct the brightness distribution of the source. Astronomers therefore take advantage of physical constraints on the brightness distribution of astronomical sources to reduce the number of observations which must be made. Because the brightness distribution must be real and positive everywhere, the visibility function cannot take on arbitrary values in unsampled regions. Thus, a non-linear deconvolution algorithm like CLEAN or Maximum Entropy may be used to approximately reconstruct the brightness distribution of the source from a limited number of observations.[9]
The van Cittert–Zernike theorem also places constraints on the sensitivity of an adaptive optics system. In an adaptive optics (AO) system, a distorted wavefront is provided and must be transformed to a distortion-free wavefront. An AO system must make a number of different corrections to remove the distortions from the wavefront. One such correction involves splitting the wavefront into two identical wavefronts and shifting one by some physical distance
s
The sensitivity of this technique is limited by the van Cittert–Zernike theorem.[11] If an extended source is imaged, the contrast between the fringes will be reduced by a factor proportional to the Fourier transform of the brightness distribution of the source.[12] The van Cittert–Zernike theorem implies that the mutual coherence of an extended source imaged by an AO system will be the Fourier transform of its brightness distribution. An extended source will therefore change the mutual coherence of the fringes, reducing their contrast.
The van Cittert–Zernike theorem can be used to calculate the partial spatial coherence of radiation from a free-electron laser.