The growth function, also called the shatter coefficient or the shattering number, measures the richness of a set family or class of function. It is especially used in the context of statistical learning theory, where it is used to study properties of statistical learning methods.The term 'growth function' was coined by Vapnik and Chervonenkis in their 1968 paper, where they also proved many of its properties.It is a basic concept in machine learning.[1]
Let
H
C
The intersection-size (also called the index) of
H
C
|H\capC|
Cm
m
2m
C
H
The growth function measures the size of
H\capC
|C|
Equivalently, let
H
C
m
H
C
C
H
The growth function measures the size of
HC
|C|
1. The domain is the real line
R
H
\{x>x0\midx\inR\}
x0\inR
C
m
H\capC
m+1
C
C
\operatorname{Growth}(H,m)=m+1
H
2. The domain is the segment
[0,1]
H
C
m
H\capC
C
2m
\operatorname{Growth}(H,m)=2m
3. The domain is the Euclidean space
Rn
H
x ⋅ \phi\geq1
\phi
\operatorname{Growth}(H,m)=\operatorname{Comp}(n,m)
4. The domain is the real line
R
H
\{x\in[x0,x1]|x\inR\}
x0,x1\inR
C
m
H\capC
m
C
{m+1\choose2}+1
\operatorname{Growth}(H,m)={m+1\choose2}+1
The main property that makes the growth function interesting is that it can be either polynomial or exponential - nothing in-between.
The following is a property of the intersection-size:
Cm
m
n\leqm
|H\capCm|\geq\operatorname{Comp}(n,m)
Cn\subseteqCm
n
|H\capCn|=2n
This implies the following property of the Growth function.For every family
H
\operatorname{Growth}(H,m)=2m
\operatorname{Growth}(H,m)
\operatorname{Comp}(n,m)\leqmn+1
n
\operatorname{Growth}(H,n)<2n
For any finite
H
\operatorname{Growth}(H,m)\leq|H|
C
H\capC
|H|
H
For any nonempty
H
\operatorname{Growth}(H,m)\leq2m
We say that a set-family
H
C
C
H\capC=2C
H
C
m
\operatorname{Growth}(H,C)=2m
Define the Cartesian intersection of two set-families as:
H1otimesH2:=\{h1\caph2\midh1\inH1,h2\inH2\}
\operatorname{Growth}(H1otimesH2,m)\leq\operatorname{Growth}(H1,m) ⋅ \operatorname{Growth}(H2,m)
For every two set-families:[1]
\operatorname{Growth}(H1\cupH2,m)\leq\operatorname{Growth}(H1,m)+\operatorname{Growth}(H2,m)
The VC dimension of
H
\operatorname{VCDim}(H)=n-1
d
\operatorname{Growth}(H,d)=2d
\operatorname{VCDim}(H)=infty
So
\operatorname{VCDim}(H)\geqd
\operatorname{Growth}(H,d)=2d
The growth function can be regarded as a refinement of the concept of VC dimension. The VC dimension only tells us whether
\operatorname{Growth}(H,d)
2d
\operatorname{Growth}(H,m)
m
Another connection between the growth function and the VC dimension is given by the Sauer–Shelah lemma:
If
\operatorname{VCDim}(H)=d
for all
m
\operatorname{Growth}(H,m)\leq
d | |
\sum | |
i=0 |
{m\choosei}
for all
m>d+1
\operatorname{Growth}(H,m)\leq(em/d)d=O(md)
so when the VC dimension is finite, the growth function grows polynomially with
m
m>d
H
d
\operatorname{Growth}(H,m)=
d | |
\sum | |
i=0 |
{m\choosei}
While the growth-function is related to the maximum intersection-size,the entropy is related to the average intersection size:
\operatorname{Entropy}(H,m)=
E | |
|Cm|=m |
[log2(|H\capCm|)]
H
|H\cap(C1\cupC2)|\leq|H\capC1| ⋅ |H\capC2|
\operatorname{Entropy}(H,m1+m2) \leq \operatorname{Entropy}(H,m1)+\operatorname{Entropy}(H,m2)
\operatorname{Entropy}(H,m)/m
c\in[0,1]
m\toinfty
Moreover, the random-variable
log2{|H\capCm|/m}
c
Let
\Omega
\Pr
H
\Omega
Suppose we choose a set
Cm
m
\Omega
P
h\inH
Cm
|h\capCm|/m
\Pr[h]
D(h,Cm):=||h\capCm|/m-\Pr[h]|
which is equivalent to:
In words: the probability that for all events in
H
H
A corollary of this is that, if the growth function is polynomial in
m
n
\operatorname{Growth}(H,m)\leqmn+1
m\toinfty
H