The redundancy principle in biology[1] [2] [3] [4] [5] [6] [7] [8] [9] expresses the need of many copies of the same entity (cells, molecules, ions) to fulfill a biological function. Examples are numerous: disproportionate numbers of spermatozoa during fertilization compared to one egg, large number of neurotransmitters released during neuronal communication compared to the number of receptors, large numbers of released calcium ions during transient in cells, and many more in molecular and cellular transduction or gene activation and cell signaling. This redundancy is particularly relevant when the sites of activation are physically separated from the initial position of the molecular messengers. The redundancy is often generated for the purpose of resolving the time constraint of fast-activating pathways. It can be expressed in terms of the theory of extreme statistics to determine its laws and quantify how the shortest paths are selected. The main goal is to estimate these large numbers from physical principles and mathematical derivations.
When a large distance separates the source and the target (a small activation site), the redundancy principle explains that this geometrical gap can be compensated by large number. Had nature used less copies than normal, activation would have taken a much longer time, as finding a small target by chance is a rare event and falls into narrow escape problems.[10]
The time for the fastest particles to reach a target in the context of redundancy depends on the numbers and the local geometry of the target. In most of the time, it is the rate of activation. This rate should be used instead of the classical Smoluchowski's rate describing the mean arrival time, but not the fastest. The statistics of the minimal time to activation set kinetic laws in biology, which can be quite different from the ones associated to average times.
The motion of a particle located at position
Xt
dXt=\sqrt{2D}
dB | ||||
|
F(x)dt,
where
D
\gamma
F(x)
Bt
\begin{align} xn+1= \begin{cases}xn-a,&withprobabilityl(xn)\ xn+b,&withprobabilityr(xn)\end{cases} \end{align}
r(x)= | 1 |
1+\betax |
,
r(x)+l(x)=1
X |
=v0\bfu,
\bfu
X0\in\partial\Omega
X |
=v0\bfv,
\bfv
X0
\bfu
\Omega
The mathematical analysis of large numbers of molecules, which are obviously redundant in the traditional activation theory, is used to compute the in vivo time scale of stochastic chemical reactions. The computation relies on asymptotics or probabilistic approaches to estimate the mean time of the fastest to reach a small target in various geometries.[15] [16] [17]
With N non-interacting i.i.d. Brownian trajectories (ions) in a bounded domain Ω that bind at a site, the shortest arrival time is by definition
\tau1=min(t1,\ldots,tN),
ti
Pr(\tau1>t)
Pr(\tau1
N(t | |
>t)=Pr | |
1>t) |
Pr\{t1>t\}
\Omega
\partialp(x,t) | |
\partialt |
=D\Deltap(x,t)\hbox{for}x\in\Omega,t>0
\begin{align}p(x,0)=&p0(x)\hbox{for}x\in\Omega\\
\partialp | |
\partialn |
(x,t)&=0\hbox{for}x\in\partial\Omegar\\ p(x,t)&=0\hbox{for}x\in\partial\Omegaa, \end{align}
where the boundary
\partial\Omega
\partial\Omegai\subset\partial\Omega
\partial\Omegaa=cup\limits
NR | |
i=1 |
\partial\Omegai, \partial\Omegar=\partial\Omega-\partial\Omegaa
\Pr\{t1>t\}=\int\limits\Omegap(x,t)dx,
\Pr\{\tau1=t\}=
d | |
dt |
\Pr\{\tau1<t\}=N(\Pr\{t1>t\})N-1\Pr\limits\{t1=t \},
\Pr\{t1=t\}=
{\oint | |
\partial\Omegaa |
\Pr\{t1=t\}=NR
{\oint | |
\partial\Omega1 |
The probability density function (pdf) of the arrival time is
\Pr\{\tau1=t\}=NNR\left[\int\limits\Omegap(x,t)dx\right]N-1
\oint\limits | |
\partial\Omega1 |
\partialp(x,t) | |
\partialn |
dSx,
\bar{\tau}1
infty | |
=\int\limits\limits | |
0 |
\Pr\{\tau1>t\}dt=
infty | |
\int\limits | |
0 |
\left[\Pr\{t1>t\}\right]Ndt.
\Pr\{t1>t\}
The short-time asymptotic of the diffusion equation is based on the ray method approximation. For an semi-interval
[0,infty[
\begin{align} | \partial(x,t) |
\partialt |
&=D
\partial2p(x,t) | |
\partialx2 |
forx>0, t>0\\ p(x,0)&=\delta(x-a) for x>0, p(0,t)=0 fort>0, \end{align}
that is
p(x,t)=
1 | |
\sqrt{4D\pit |
The survival probability with D=1 is
\Pr\{t1>t
infty | ||
\}=\int\limits\limits | p(x,t)dx=1- | |
0 |
2 | |
\sqrt{\pi |
2 | |
\sqrt{\pi |
\bar{\tau}1
infty | |
=\int\limits\limits | |
0 |
\left[\Pr\{t1>t\}\right]Ndt ≈
infty | |
\int\limits\limits | |
0 |
\exp\left\{Nln\left(1-
e-(a/\sqrt{4t)2 | |
leading (the main contribution of the integral is near 0) to
\bar{\tau}1 ≈
a2 | |||||
|
This result is reminiscent of using the Gumbel's law. Similarly, escape from the interval [0,a] is computed from the infinite sum
p(x,t|y)=
1 | |
\sqrt{4D\pit |
\Pr\{t1>t|y
a | |
\}=\int\limits\limits | |
0 |
p(x,t|y)dxds\sim1-max
2\sqrt{t | |
\delta=
\bar{\tau}1=
infty | |
\int\limits\limits | |
0 |
\left[\Pr\{t1>t\}\right]Ndt ≈
infty | |
\int\limits\limits | |
0 |
\exp\left\{Nln\left(1-
8\sqrt{t | |
The arrival times of the fastest among many Brownian motions are expressed in terms of the shortest distance from the source S to the absorbing window A, measured by the distance
\deltamin=d(S,A),
\bar\taud1 ≈
| |||||||
|
\right)},\hbox{indim1,validfor}N\gg1 ,
\bar\taud3 ≈
| ||||||||||||||||
|
These formulas show that the expected arrival time of the fastest particle is in dimension 1 and 2, O(1/\log(N)). They should be used instead of the classical forward rate in models of activation in biochemical reactions. The method to derive formulas is based on short-time asymptotic and the Green's function representation of the Helmholtz equation. Note that other distributions could lead to other decays with respect N.
The optimal paths for the fastest can be found using the Wencell-Freidlin functional in the Large-deviation theory. These paths correspond to the short-time asymptotics of the diffusion equation from a source to a target. In general, the exact solution is hard to find, especially for a space containing various distribution of obstacles.
The Wiener integral representation of the pdf for a pure Brownian motion is obtained for a zero drift and diffusion tensor
\sigma=D
\partial\Omegaa
Pr\{xN(t1,M)\in\Omega,{x}N(t2,M)\in\Omega,...,xM(t)=x,t\leqT\leqt+\Deltat|x(0)=y\}
=[\int\limits\Omega … \int\limits\limits\Omega
M | |
\prod | |
j=1 |
d{y | |
j}{\sqrt{(2\pi |
\Deltat)n\det{\sigma}(x)(tj-1,M))}} \exp\{-
1 | |
2\Deltat |
\left[{y}j-x(tj-1,N)-{a}({x}(tj-1,N))\Deltat\right]T{\sigma}-1(x(tj-1,N))\left[{y}j-x(tj-1,N)-{a}(x(tj-1,N))\Deltat\right]\}
where
\Deltat=t/M,tj,N=j\Deltat, x(t0,N)=y\hbox{and}{y}j=x(tj,N)
\partial\Omegaa.
\langle\tau(n)
infty | |
\rangle=\int\limits\limits | |
0 |
\exp\left\{nlog\int\limits\Omegap(x,t|y)dx\right\}dt
infty | |
=\int | |
0 |
\tau\sigmaPr\{\hbox{Path}\sigma\inSn(y),\tau\sigma=t\}dt,
where
Sn(y)
\Omega
Pr\{\hbox{Path}\sigma\inSn\}
Sn
\partial\Omegaa
\epsilon= | |\partial\Omegaa| |
|\partial\Omega| |
\ll1
Sn(y)
\tildeSn(y)
Pr\{\hbox{Path}\sigma\in\tildeSn(y)|t<\tau\sigma<t+dt
infty | |
\}=\sum | |
m=0 |
Pr\{\hbox{Path}\sigma\in\tildeSn(y)|m,t<\tau\sigma<t+dt\}Pr\{msteps\}
Pr\{msteps\}=Pr\{thepathsof\tildeSn(y)exitinmsteps\}
\tildeSn(y)
\Deltat
Pr\{x(s)|s\in[0,t]\} ≈ \exp
t | ||
\left(-\int | | | |
0 |
x| |
2ds\right).
The Survival probability conditioned on starting at y is given by the Wiener representation:
S(t|x0)=\intx\indx
x(t)=x | |
\int | |
x(0) |
{lD}(x)\exp
t | ||
\left(-\int | | | |
0 |
x| |
2ds\right),
where
{lD}(x)
(\sigma1,..\sigman)
\Deltat
Pr\{\sigma1,..\sigman\inSn(y)|m,\tau\sigma=m\Deltat\}=
\left(\int\limits | |
y0=y |
… \int\limits{yj\in\Omega} \int\limits{yn\in\partial\Omegaa}
1 | |
(4D\Deltat)dm/2 |
m | |
\prod | |
j=1 |
\exp \{-
1 | |
4D\Deltat |
\left[|{y}j-{y}j-1)|2\right]\} \right)n
≈ \left(
1 | |
(4D\Deltat)dm/2 |
n\int | |
\right) | |
x |
{lD} (x)\exp
m\Deltat | |
\{-n\int\limits | |
0 |
x |
2ds \}
Pr\{\hbox{Path}\sigma\in\tildeSn(y)|m,\tau\sigma=m\Deltat\}=\left(\int\limits{y0=y} … \int\limits{yj\in\Omega}\int\limits{yn\in\partial\Omegaa}
1 | |
(4D\Deltat)dm/2 |
m | |
\prod | |
j=1 |
\exp(-
1 | |
4D\Deltat |
\left[|{y}j-{y}j-1)|2\right])\right)n
≈ \intdx
x(t)=x | |
\int | |
x(0)=y |
{D}(x)\exp(-n
m\Deltat | |
\int\limits | |
0 |
x |
2ds),
where the integral is taken over all paths starting at y(0) and exiting at time
m\Deltat
E=min | |
X\inlPt |
T | |
\int\limits | |
0 |
x |
2ds,
where the integration is taken over the ensemble of regular paths
lPt
\Omega
\partial\Omegaa
lPT=\{P(0)=y,P(T)\in\partial\Omegaa\hbox{and}P(s)\in\Omega\hbox{and}0\leqs\leqT\}.
This formal argument shows that the random paths associated to the fastest exit time are concentrated near the shortest paths. Indeed, the Euler-Lagrange equations for the extremal problem are the classical geodesics between y and a point in the narrow window
\partial\Omegaa
The formula for the fastest escape can generalize to the case where the absorbing window is located in funnel cusp and the initial particles are distributed outside the cusp. The cusp has a size
\epsilon
\tau(n) ≈
\pi2R3 | |||||
|
{\tilde\epsilon})2log(
2n | |
\sqrt{\pi |
\tilde\epsilon=
\epsilon | |
R |
\epsilon
How nature sets the disproportionate numbers of particles remain unclear, but can be found using the theory of diffusion. One example is the number of neurotransmitters around 2000 to 3000 released during synaptic transmission, that are set to compensate the low copy number of receptors, so the probability of activation is restored to one.[19] [20]
In natural processes these large numbers should not be considered wasteful, but are necessary for generating the fastest possible response and make possible rare events that otherwise would never happen. This property is universal, ranging from the molecular scale to the population level.[21]
Nature's strategy for optimizing the response time is not necessarily defined by the physics of the motion of an individual particle, but rather by the extreme statistics, that select the shortest paths. In addition, the search for a small activation site selects the particle to arrive first: although these trajectories are rare, they are the ones that set the time scale. We may need to reconsider our estimation toward numbers when punctioning nature in agreement with the redundant principle that quantifies the request to achieve the biological function.[21]