VEGAS algorithm explained
The VEGAS algorithm, due to G. Peter Lepage,[1][2][3] is a method for reducing error in Monte Carlo simulations by using a known or approximate probability distribution function to concentrate sampling in those regions of the integrand that make the greatest contribution to the final integral.
The VEGAS algorithm is based on importance sampling. It samples points from the probability distribution described by the function $|f|$, so that the points are concentrated in the regions that make the largest contribution to the integral. The GNU Scientific Library (GSL) provides a VEGAS routine.
Sampling method
In general, if the Monte Carlo integral of $f$ over a volume $\Omega$ is sampled with points distributed according to a probability distribution described by the function $g$, we obtain an estimate $E_g(f;N)$,
$$E_g(f;N) = \frac{1}{N} \sum_{i=1}^{N} \frac{f(x_i)}{g(x_i)}.$$
The variance of the new estimate is then
$$\mathrm{Var}_g(f;N) = \mathrm{Var}(f/g;N),$$
where $\mathrm{Var}(f;N)$ is the variance of the original estimate,
$$\mathrm{Var}(f;N) = E(f^2;N) - (E(f;N))^2.$$
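The estimate and its variance can be illustrated with a small numerical experiment. The following is a minimal sketch, not part of the VEGAS algorithm itself: the integrand $f(x) = 3x^2$ on $[0, 1]$, the sampling density $g(x) = 2x$, and the helper name `importance_estimate` are all assumptions chosen for illustration, and only the Python standard library is used.

```python
import math
import random


def importance_estimate(f, g, sample_g, n, rng):
    """Return (estimate, per-sample variance) for the integral of f.

    Points are drawn from the density g via sample_g, and each point
    contributes the weight f(x) / g(x).
    """
    weights = []
    for _ in range(n):
        x = sample_g(rng)
        weights.append(f(x) / g(x))
    mean = sum(weights) / n                      # E_g(f; N)
    mean_sq = sum(w * w for w in weights) / n    # E((f/g)^2; N)
    return mean, mean_sq - mean * mean           # Var(f/g; N)


def f(x):
    return 3.0 * x * x  # integrates to exactly 1 on [0, 1]


rng = random.Random(0)

# Plain Monte Carlo: g is the uniform density on [0, 1].
plain_mean, plain_var = importance_estimate(
    f, lambda x: 1.0, lambda r: r.random(), 100_000, rng)

# Importance sampling: g(x) = 2x, sampled by inverting its CDF x**2.
# 1 - random() lies in (0, 1], so g(x) is never zero.
imp_mean, imp_var = importance_estimate(
    f, lambda x: 2.0 * x, lambda r: math.sqrt(1.0 - r.random()), 100_000, rng)

print(f"plain:      estimate={plain_mean:.4f}  variance={plain_var:.4f}")
print(f"importance: estimate={imp_mean:.4f}  variance={imp_var:.4f}")
```

For these particular choices the analytic per-sample variances are 0.8 for uniform sampling and 0.125 for $g(x) = 2x$, so the printed variances should land near those values while both estimates stay close to 1.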
If the probability distribution is chosen as
$$g = \frac{|f|}{\int_\Omega |f(x)|\,dx}$$
then it can be shown that, for an integrand that does not change sign, the variance $\mathrm{Var}_g(f;N)$ vanishes and the error in the estimate is zero. In practice it is not possible to sample from the exact distribution $g$ for an arbitrary function, so importance sampling algorithms aim to produce efficient approximations to the desired distribution.
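The vanishing variance can be checked in a few lines. The derivation below is a sketch under the assumption that $f \ge 0$ on $\Omega$, so that $|f| = f$; the symbol $I$ for the exact integral is notation introduced here, not taken from the references.

```latex
\begin{align*}
g(x) &= \frac{f(x)}{I}, \qquad I = \int_\Omega f(x)\,dx, \\
\frac{f(x_i)}{g(x_i)} &= I \quad \text{for every sampled point } x_i, \\
E_g(f;N) &= \frac{1}{N}\sum_{i=1}^{N} I = I, \\
\mathrm{Var}_g(f;N) &= E\!\left((f/g)^2;N\right) - \left(E(f/g;N)\right)^2 = I^2 - I^2 = 0.
\end{align*}
```

Every sample returns the same value $I$, so the estimate is exact for any $N$. Constructing this $g$, however, requires knowing $I$, which is the quantity being sought.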
Approximation of probability distribution
The VEGAS algorithm approximates the exact distribution by making a number of passes over the integration region while histogramming the function $f$. Each histogram is used to define a sampling distribution for the next pass. Asymptotically this procedure converges to the desired distribution. To avoid the number of histogram bins growing like $K^d$ with dimension $d$ (for $K$ bins per axis), the probability distribution is approximated by a separable function:
$$g(x_1, x_2, \ldots) = g_1(x_1)\, g_2(x_2) \cdots$$
so that the number of bins required is only $Kd$. This is equivalent to locating the peaks of the function from the projections of the integrand onto the coordinate axes. The efficiency of VEGAS depends on the validity of this assumption. It is most efficient when the peaks of the integrand are well-localized. If an integrand can be rewritten in a form that is approximately separable, this will increase the efficiency of integration with VEGAS.
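The pass-by-pass adaptation described above can be sketched in a few dozen lines of Python. This is a simplified illustration, not Lepage's published implementation or the GSL routine: the function names `refine` and `vegas_sketch`, the damping exponent `alpha`, the use of squared weights as the histogrammed quantity, the inverse-variance combination of passes, and the Gaussian test integrand are all assumptions made for this example. Production implementations additionally smooth the histograms and use a different damping rule. The sketch integrates over the unit hypercube and keeps one 1-D grid per axis, so with $K = 50$ and $d = 2$ it stores 100 bins rather than 2500.

```python
import math
import random


def refine(edges, d):
    """Move bin edges so every new bin holds an equal share of d.

    d[i] is the importance of old bin i, treated as spread uniformly
    across that bin.
    """
    k = len(d)
    target = sum(d) / k
    new_edges = [edges[0]]
    acc, i = 0.0, 0  # importance left of old bin i, and that bin's index
    for n in range(1, k):
        want = n * target
        while acc + d[i] < want:
            acc += d[i]
            i += 1
        frac = (want - acc) / d[i]
        new_edges.append(edges[i] + frac * (edges[i + 1] - edges[i]))
    new_edges.append(edges[-1])
    return new_edges


def vegas_sketch(f, dim, n_bins=50, n_passes=10, n_points=5000,
                 alpha=0.5, seed=0):
    """Integrate f over [0, 1]^dim with a separable adaptive grid.

    Returns (estimate, standard deviation, per-axis bin edges).
    """
    rng = random.Random(seed)
    # One 1-D grid per axis: dim * n_bins bins, not n_bins ** dim.
    grids = [[b / n_bins for b in range(n_bins + 1)] for _ in range(dim)]
    inv_var_sum = weighted_sum = 0.0
    for _pass in range(n_passes):
        hist = [[0.0] * n_bins for _ in range(dim)]
        total = total_sq = 0.0
        for _point in range(n_points):
            x, bins, g = [], [], 1.0
            for edges in grids:
                b = rng.randrange(n_bins)        # every bin equally likely
                width = edges[b + 1] - edges[b]
                x.append(edges[b] + rng.random() * width)
                bins.append(b)
                g *= 1.0 / (n_bins * width)      # separable density g1*g2*...
            w = f(x) / g
            total += w
            total_sq += w * w
            for axis, b in enumerate(bins):
                hist[axis][b] += w * w           # projection onto each axis
        mean = total / n_points
        # Variance of this pass's mean, floored to avoid division by zero.
        var = max((total_sq / n_points - mean * mean) / (n_points - 1), 1e-300)
        inv_var_sum += 1.0 / var
        weighted_sum += mean / var
        for axis in range(dim):
            s = sum(hist[axis])
            if s > 0.0:
                damped = [(h / s) ** alpha for h in hist[axis]]
                grids[axis] = refine(grids[axis], damped)
    return weighted_sum / inv_var_sum, math.sqrt(1.0 / inv_var_sum), grids


SIGMA = 0.05


def peak(x):
    """Product of narrow Gaussians centred at 0.5; integral is very nearly 1."""
    return math.prod(
        math.exp(-(t - 0.5) ** 2 / (2 * SIGMA ** 2))
        / (SIGMA * math.sqrt(2 * math.pi))
        for t in x)


estimate, sdev, grids = vegas_sketch(peak, dim=2)
print(f"estimate = {estimate:.4f} +/- {sdev:.4f}")
```

Because the test integrand is an exact product of one-dimensional functions, the separable approximation fits it well and the bins crowd around 0.5 on both axes. An integrand whose peak lies along a diagonal would project poorly onto the coordinate axes, and a grid of this kind would adapt far less effectively.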
Notes and References
- Lepage, G.P. (May 1978). "A New Algorithm for Adaptive Multidimensional Integration". Journal of Computational Physics. 27 (2): 192–203. doi:10.1016/0021-9991(78)90004-9. Bibcode:1978JCoPh..27..192L.
- Lepage, G.P. (March 1980). "VEGAS: An Adaptive Multi-dimensional Integration Program". Cornell Preprint CLNS 80-447.
- Ohl, T. (July 1999). "Vegas revisited: Adaptive Monte Carlo integration beyond factorization". Computer Physics Communications. 120 (1): 13–19. doi:10.1016/S0010-4655(99)00209-X. arXiv:hep-ph/9806432. Bibcode:1999CoPhC.120...13O. S2CID 18194240.