Kernel (image processing) explained

In image processing, a kernel, convolution matrix, or mask is a small matrix used for blurring, sharpening, embossing, edge detection, and more. This is accomplished by convolving the kernel with an image. More simply, when each pixel in the output image is a function of the nearby pixels (including itself) in the input image, the kernel is that function.

Details

The general expression of a convolution is

g(x,y) = \omega * f(x,y) = \sum_{i=-a}^{a} \sum_{j=-b}^{b} \omega(i,j) f(x-i, y-j),

where g(x,y) is the filtered image, f(x,y) is the original image, and \omega is the filter kernel. Every element of the filter kernel is considered by -a \leq i \leq a and -b \leq j \leq b.
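The double sum can be written out directly as nested loops. The following is a minimal sketch rather than a reference implementation: the function name convolve2d, the row-major image[y][x] layout, the centered origin, and the fill value used outside the image are all assumptions made for illustration.

```python
def convolve2d(image, kernel, fill=0.0):
    """Direct 2D convolution: g(x, y) = sum_i sum_j w(i, j) * f(x - i, y - j).

    image  : list of rows (image[y][x]), all rows the same length
    kernel : list of rows with odd height and width, origin at the center
    fill   : value assumed for pixels outside the image (illustrative choice)
    """
    height, width = len(image), len(image[0])
    b, a = len(kernel) // 2, len(kernel[0]) // 2  # j in [-b, b], i in [-a, a]
    output = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            total = 0.0
            for j in range(-b, b + 1):
                for i in range(-a, a + 1):
                    sx, sy = x - i, y - j  # minus signs as in f(x - i, y - j)
                    inside = 0 <= sx < width and 0 <= sy < height
                    value = image[sy][sx] if inside else fill
                    total += kernel[j + b][i + a] * value
            output[y][x] = total
    return output


image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
identity = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
shift = [[0, 0, 0], [0, 0, 1], [0, 0, 0]]  # only w(i=1, j=0) is nonzero

unchanged = convolve2d(image, identity)
shifted = convolve2d(image, shift)  # g(x, y) = f(x - 1, y)
```

With the shift kernel every output pixel takes the value of its left neighbour, which shows the effect of the minus signs in f(x-i, y-j): the kernel is applied flipped relative to the image.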

Depending on the element values, a kernel can cause a wide range of effects:

| Operation | Kernel ω |
| --- | --- |
| Identity | \begin{bmatrix} 0& 0& 0\\ 0& 1& 0\\ 0& 0& 0 \end{bmatrix} |
| Ridge or edge detection | \begin{bmatrix} 0&-1&0\\ -1& 4&-1\\ 0&-1&0 \end{bmatrix} |
| Ridge or edge detection | \begin{bmatrix} -1&-1&-1\\ -1& 8&-1\\ -1&-1&-1 \end{bmatrix} |
| Sharpen | \begin{bmatrix} 0&-1& 0\\ -1& 5&-1\\ 0&-1& 0 \end{bmatrix} |
| Box blur (normalized) | \frac{1}{9} \begin{bmatrix} 1& 1& 1\\ 1& 1& 1\\ 1& 1& 1 \end{bmatrix} |
| Gaussian blur 3 × 3 (approximation) | \frac{1}{16} \begin{bmatrix} 1& 2& 1\\ 2& 4& 2\\ 1& 2& 1 \end{bmatrix} |
| Gaussian blur 5 × 5 (approximation) | \frac{1}{256} \begin{bmatrix} 1&4&6&4&1\\ 4&16&24&16&4\\ 6&24&36&24&6\\ 4&16&24&16&4\\ 1&4&6&4&1 \end{bmatrix} |
| Unsharp masking 5 × 5 (based on Gaussian blur with amount 1 and threshold 0, with no image mask) | \frac{-1}{256} \begin{bmatrix} 1&4& 6&4&1\\ 4&16& 24&16&4\\ 6&24&-476&24&6\\ 4&16& 24&16&4\\ 1&4& 6&4&1 \end{bmatrix} |

The above are just a few examples of effects achievable by convolving kernels and images.
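Several of these kernels are simple combinations of one another, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names (scale, add, outer, centered_identity) are hypothetical, and exact fractions are used so that the comparisons are not affected by rounding. It rebuilds the sharpen kernel as identity plus edge detection, the Gaussian approximations as outer products of binomial weights, and the unsharp-masking kernel as original + (original − blurred), which is the usual reading of "amount 1".

```python
from fractions import Fraction


def scale(kernel, factor):
    """Multiply every kernel element by a scalar."""
    return [[factor * value for value in row] for row in kernel]


def add(first, second):
    """Add two kernels of the same size element by element."""
    return [[p + q for p, q in zip(r1, r2)] for r1, r2 in zip(first, second)]


def outer(column, row):
    """Outer product of two 1D weight lists, giving a 2D kernel."""
    return [[c * r for r in row] for c in column]


def centered_identity(size):
    """Identity kernel of odd size: 1 at the center, 0 elsewhere."""
    mid = size // 2
    return [[int(x == mid and y == mid) for x in range(size)]
            for y in range(size)]


edge = [[0, -1, 0], [-1, 4, -1], [0, -1, 0]]
sharpen = add(centered_identity(3), edge)  # identity + edge detection

gaussian3 = outer([1, 2, 1], [1, 2, 1])              # before the 1/16 factor
gaussian5 = outer([1, 4, 6, 4, 1], [1, 4, 6, 4, 1])  # before the 1/256 factor

# Unsharp masking with amount 1: original + (original - blurred)
blur5 = scale(gaussian5, Fraction(1, 256))
unsharp = add(scale(centered_identity(5), 2), scale(blur5, -1))
```

The center of unsharp works out to 2 − 36/256 = 476/256, which is where the −476 entry (together with the −1/256 factor) in the table comes from.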

Origin

The origin is the position of the kernel which is above (conceptually) the current output pixel. This could be outside of the actual kernel, though usually it corresponds to one of the kernel elements. For a symmetric kernel, the origin is usually the center element.

Convolution

Edge handling

Kernel convolution usually requires values from pixels outside of the image boundaries. There are a variety of methods for handling image edges.

Extend

The nearest border pixels are conceptually extended as far as necessary to provide values for the convolution. Corner pixels are extended in 90° wedges. Other edge pixels are extended in lines.

Wrap

The image is conceptually wrapped (or tiled) and values are taken from the opposite edge or corner.

Mirror

The image is conceptually mirrored at the edges. For example, attempting to read a pixel 3 units outside an edge reads one 3 units inside the edge instead.

Crop / Avoid overlap

Any pixel in the output image which would require values from beyond the edge is skipped. Equivalently, the kernel is only moved to positions where values from outside the image are never required. This method can result in the output image being slightly smaller, with the edges having been cropped. Machine learning mainly uses this approach. Example: with a 10 × 10 kernel and a 32 × 32 image, the result image is 23 × 23 (32 − 10 + 1 = 23 along each axis).

Kernel Crop

Any kernel element that would fall outside the input image is not used, and the normalization is adjusted to compensate.

Constant

A constant value is used for pixels outside of the image. Usually black is used, or sometimes gray; the choice generally depends on the application.
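The index-based strategies above can be sketched as a single lookup that maps an out-of-range coordinate back into the image. This is an illustration under stated assumptions, not a standard API: the names resolve_index, sample, and cropped_size are hypothetical, and "mirror" is taken to reflect about the border pixel itself (so index −3 reads index 3), which is one of two common conventions.

```python
def resolve_index(i, n, mode):
    """Map a possibly out-of-range index i onto 0..n-1 along one axis.

    Returns None for "constant", meaning the caller should use its fill value.
    """
    if 0 <= i < n:
        return i
    if mode == "extend":    # clamp to the nearest border pixel
        return min(max(i, 0), n - 1)
    if mode == "wrap":      # tile the image periodically
        return i % n
    if mode == "mirror":    # reflect about the border pixel: -3 -> 3
        if n == 1:
            return 0
        period = 2 * (n - 1)
        i %= period
        return i if i < n else period - i
    if mode == "constant":
        return None
    raise ValueError(f"unknown edge mode: {mode}")


def sample(image, x, y, mode="extend", fill=0):
    """Read image[y][x], applying the chosen edge rule outside the image."""
    sy = resolve_index(y, len(image), mode)
    sx = resolve_index(x, len(image[0]), mode)
    if sx is None or sy is None:
        return fill
    return image[sy][sx]


def cropped_size(image_size, kernel_size):
    """Output size along one axis when the kernel must stay inside the image."""
    return image_size - kernel_size + 1


row = [[10, 20, 30, 40, 50]]
three_outside = {mode: sample(row, -3, 0, mode)
                 for mode in ("extend", "wrap", "mirror", "constant")}
```

Reading three pixels to the left of the row gives a different value for each rule, and cropped_size(32, 10) reproduces the 23 × 23 example for the crop method.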

Normalization

Normalization is defined as the division of each element in the kernel by the sum of all kernel elements, so that the sum of the elements of a normalized kernel is unity. This will ensure the average pixel in the modified image is as bright as the average pixel in the original image.
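As a small worked example, normalization can be sketched as follows. The function name normalize is hypothetical, integer kernel entries are assumed so that exact fractions can be used, and the explicit error for zero-sum kernels is a design choice made here: kernels such as the edge-detection examples sum to zero, so dividing by the sum is not defined for them.

```python
from fractions import Fraction


def normalize(kernel):
    """Divide each element by the sum of all elements (integer entries assumed)."""
    total = sum(sum(row) for row in kernel)
    if total == 0:
        raise ValueError("kernel sums to zero and cannot be normalized")
    return [[Fraction(value, total) for value in row] for row in kernel]


box = normalize([[1, 1, 1], [1, 1, 1], [1, 1, 1]])       # every element 1/9
gaussian = normalize([[1, 2, 1], [2, 4, 2], [1, 2, 1]])  # elements over 16
gaussian_sum = sum(sum(row) for row in gaussian)         # unity after normalizing
```

The resulting factors match the 1/9 and 1/16 that appear in front of the box blur and 3 × 3 Gaussian blur kernels.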

Optimisation

Fast convolution algorithms include:

separable convolution

Separable convolution

2D convolution with an M × N kernel requires M × N multiplications for each sample (pixel). If the kernel is separable, the computation can be reduced to M + N multiplications per sample by performing two 1D convolutions instead of one 2D convolution, which can significantly decrease the amount of computation. [2]
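The equivalence can be checked on a small example. The sketch below is illustrative and makes several assumptions: the function names are hypothetical, pixels outside the image are treated as zero, and the kernel is built as the outer product of a column of weights and a row of weights, which is what makes it separable.

```python
def convolve_rows(image, taps, fill=0):
    """1D convolution along x: g(x, y) = sum_i taps(i) * f(x - i, y)."""
    a = len(taps) // 2
    width = len(image[0])
    return [[sum(taps[i + a] * (row[x - i] if 0 <= x - i < width else fill)
                 for i in range(-a, a + 1))
             for x in range(width)]
            for row in image]


def transpose(image):
    """Swap rows and columns so a row pass can act on columns."""
    return [list(column) for column in zip(*image)]


def convolve_separable(image, column_taps, row_taps):
    """Horizontal pass with row_taps, then vertical pass with column_taps."""
    horizontal = convolve_rows(image, row_taps)
    return transpose(convolve_rows(transpose(horizontal), column_taps))


def convolve_2d(image, kernel, fill=0):
    """Direct 2D convolution, used here only as the reference result."""
    b, a = len(kernel) // 2, len(kernel[0]) // 2
    height, width = len(image), len(image[0])

    def pixel(x, y):
        return image[y][x] if 0 <= x < width and 0 <= y < height else fill

    return [[sum(kernel[j + b][i + a] * pixel(x - i, y - j)
                 for j in range(-b, b + 1) for i in range(-a, a + 1))
             for x in range(width)]
            for y in range(height)]


column_taps, row_taps = [1, 2, 1], [1, 2, 1]
kernel = [[c * r for r in row_taps] for c in column_taps]  # 3 x 3 Gaussian weights
image = [[3, 1, 4, 1], [5, 9, 2, 6], [5, 3, 5, 8], [9, 7, 9, 3]]

same_result = convolve_separable(image, column_taps, row_taps) == convolve_2d(image, kernel)
multiplications_2d = len(column_taps) * len(row_taps)         # M x N per pixel
multiplications_separable = len(column_taps) + len(row_taps)  # M + N per pixel
```

For this 3 × 3 kernel the saving is modest (9 versus 6 multiplications per pixel), but it grows with kernel size: a 5 × 5 separable kernel needs 10 instead of 25.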

Implementation

Here is a concrete convolution implementation written in the GLSL shading language:

// author : csblo
// Work made just by consulting :
// https://en.wikipedia.org/wiki/Kernel_(image_processing)

// Define kernels
#define identity mat3(0, 0, 0, 0, 1, 0, 0, 0, 0)
#define edge0 mat3(1, 0, -1, 0, 0, 0, -1, 0, 1)
#define edge1 mat3(0, 1, 0, 1, -4, 1, 0, 1, 0)
#define edge2 mat3(-1, -1, -1, -1, 8, -1, -1, -1, -1)
#define sharpen mat3(0, -1, 0, -1, 5, -1, 0, -1, 0)
#define box_blur mat3(1, 1, 1, 1, 1, 1, 1, 1, 1) * 0.1111
#define gaussian_blur mat3(1, 2, 1, 2, 4, 2, 1, 2, 1) * 0.0625
#define emboss mat3(-2, -1, 0, -1, 1, 1, 0, 1, 2)

// Find coordinate of matrix element from index
vec2 kpos(int index)

// Extract region of dimension 3x3 from sampler centered in uv
// sampler : texture sampler
// uv : current coordinates on sampler
// return : an array of mat3, each index corresponding with a color channel
mat3[3] region3x3(sampler2D sampler, vec2 uv)

// Convolve a texture with kernel
// kernel : kernel used for convolution
// sampler : texture sampler
// uv : current coordinates on sampler
vec3 convolution(mat3 kernel, sampler2D sampler, vec2 uv)

void mainImage(out vec4 fragColor, in vec2 fragCoord)

Sources

Ludwig, Jamie (n.d.). Image Convolution. Portland State University.
Lecarme, Olivier; Delvare, Karine (January 2013). The Book of GIMP: A Complete Guide to Nearly Everything. No Starch Press. p. 429. ISBN 978-1593273835.
Gumster, Jason van; Shimonski, Robert (March 2012). GIMP Bible. John Wiley & Sons. pp. 438-442. ISBN 978-0470523971.
Stockman, George C.; Shapiro, Linda G. (February 2001). Computer Vision. Prentice Hall. pp. 53-54. ISBN 978-0130307965.

Notes and References

1. "Example of 2D Convolution".
2. "Convolution". www.songho.ca. 2022-11-19.