Shrinkage Fields (image restoration) explained
Shrinkage fields is a random field-based machine learning technique that aims to perform high quality image restoration (denoising and deblurring) using low computational overhead.
Method
The restored image
is predicted from a corrupted observation
after training on a set of sample images
.
A shrinkage (mapping) function
}\left(v\right)=_^_\exp \left(-\frac^\right) is directly modeled as a linear combination of
radial basis function kernels, where
is the shared precision parameter,
denotes the (equidistant) kernel positions, and M is the number of Gaussian kernels.
Because the shrinkage function is directly modeled, the optimization procedure is reduced to a single quadratic minimization per iteration, denoted as the prediction of a shrinkage field
{g}\Theta\left(x\right)={l{F}}-1\left\lbrack
{K}Ty+{\sum
{f}{\pii
}\left(_x\right)\right)}\right\rbrack =^\eta where
denotes the
discrete Fourier transform and
is the 2D convolution
with
point spread function filter,
is an optical transfer function defined as the discrete Fourier transform of
, and
is the complex conjugate of
.
is learned as
}\left(_\right) for each iteration
with the initial case
, this forms a cascade of Gaussian
conditional random fields (or cascade of shrinkage fields (
CSF)). Loss-minimization is used to learn the model parameters
}_=_^.
The learning objective function is defined as
}_\right)=_^l\left(_^;_^\right), where
is a
differentiable loss function which is greedily minimized using training data
{\left\lbrace
,{y}\left(s\right),{k}\left(s\right)\right\rbrace
and
.
Performance
Preliminary tests by the author suggest that RTF5[1] obtains slightly better denoising performance than
}_^, followed by
}_^,
}_^,
}_^, and
BM3D.
BM3D denoising speed falls between that of
}_^ and
}_^, RTF being an order of magnitude slower.
Advantages
- Results are comparable to those obtained by BM3D (reference in state of the art denoising since its inception in 2007)
- Minimal runtime compared to other high-performance methods (potentially applicable within embedded devices)
- Parallelizable (e.g.: possible GPU implementation)
- Predictability:
runtime where
is the number of pixels
- Fast training even with CPU
Implementations
- A reference implementation has been written in MATLAB and released under the BSD 2-Clause license: shrinkage-fields
See also
References
Notes and References
- Regression Tree Fields – An Efficient, Non-parametric Approach to Image Labeling Problems . Jancsary . Jeremy. Nowozin . Sebastian . Sharp. Toby. Rother. Carsten . 10 April 2012 . IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE Computer Society . Providence, RI, USA . 10.1109/CVPR.2012.6247950 .