The theta model, or Ermentrout–Kopell canonical model, is a biological neuron model originally developed to mathematically describe neurons in the animal Aplysia.[1] The model is particularly well-suited to describe neural bursting, which is characterized by periodic transitions between rapid oscillations in the membrane potential followed by quiescence. This bursting behavior is often found in neurons responsible for controlling and maintaining steady rhythms such as breathing,[2] swimming,[3] and digesting.[4] Of the three main classes of bursting neurons (square wave bursting, parabolic bursting, and elliptic bursting),[5] [6] the theta model describes parabolic bursting, which is characterized by a parabolic frequency curve during each burst.[7]
The model consists of one variable that describes the membrane potential of a neuron along with an input current.[8] The single variable of the theta model obeys relatively simple equations, allowing for analytic, or closed-form solutions, which are useful for understanding the properties of parabolic bursting neurons.[9] [7] In contrast, other biophysically accurate neural models such as the Hodgkin–Huxley model and Morris–Lecar model consist of multiple variables that cannot be solved analytically, requiring numerical integration to solve.[9]
Similar models include the quadratic integrate and fire (QIF) model, which differs from the theta model only by a change of variables[10] [8] [11] [12] [13] and Plant's model,[14] which consists of Hodgkin–Huxley type equations and also differs from the theta model by a series of coordinate transformations.[15]
Despite its simplicity, the theta model offers enough complexity in its dynamics that it has been used for a wide range of theoretical neuroscience research[16] [17] as well as in research beyond biology, such as in artificial intelligence.[18]
Bursting is "an oscillation in which an observable [part] of the system, such as voltage or chemical concentration, changes periodically between an active phase of rapid spike oscillations (the fast sub-system) and a phase of quiescence".[19] Bursting comes in three distinct forms: square-wave bursting, parabolic bursting, and elliptic bursting.[5] [6] There exist some models that do not fit neatly into these categories by qualitative observation, but it is possible to sort such models by their topology (i.e. such models can be sorted "by the structure of the fast subsystem").[19]
All three forms of bursting are capable of beating and periodic bursting.[14] Periodic bursting (or just bursting) is of more interest because many phenomena are controlled by, or arise from, bursting. For example, bursting due to a changing membrane potential is common in various neurons, including but not limited to cortical chattering neurons, thalamacortical neurons,[20] and pacemaker neurons. Pacemakers in general are known to burst and synchronize as a population, thus generating a robust rhythm that can maintain repetitive tasks like breathing, walking, and eating.[21] [22] Beating occurs when a cell bursts continuously with no periodic quiescent periods,[23] but beating is often considered to be an extreme case and is rarely of primary interest.
Bursting cells are important for motor generation and synchronization.[20] For example, the pre-Bötzinger complex in the mammalian brain stem contains many bursting neurons that control autonomous breathing rhythms.[2] [24] Various neocortical neurons (i.e. cells of the neocortex) are capable of bursting, which "contribute significantly to [the] network behavior [of neocortical neurons]".[25] The R15 neuron of the abdominal ganglion in Aplyisa, hypothesized to be a neurosecretory cell (i.e. a cell that produces hormones), is known to produce bursts characteristic of neurosecretory cells.[26] In particular, it is known to produce parabolic bursts.
Since many biological processes involve bursting behavior, there is a wealth of various bursting models in scientific literature. For instance, there exist several models for interneurons[27] and cortical spiking neurons.[28] However, the literature on parabolic bursting models is relatively scarce.
Parabolic bursting models are mathematical models that mimic parabolic bursting in real biological systems. Each burst of a parabolic burster has a characteristic feature in the burst structure itself – the frequency at the beginning and end of the burst is low relative to the frequency in the middle of the burst.[5] A frequency plot of one burst resembles a parabola, hence the name "parabolic burst". Furthermore, unlike elliptic or square-wave bursting, there is a slow modulating wave which, at its peak, excites the cell enough to generate a burst and inhibits the cell in regions near its minimum. As a result, the neuron periodically transitions between bursting and quiescence.
Parabolic bursting has been studied most extensively in the R15 neuron, which is one of six types of neurons of the Aplysia abdominal ganglion[29] and one of thirty neurons comprising the abdominal ganglion.[30] The Aplysia abdominal ganglion was studied and extensively characterized because its relatively large neurons and proximity of the neurons to the surface of the ganglion made it an ideal and "valuable preparation for cellular electrophysical studies".[31]
Early attempts to model parabolic bursting were for specific applications, often related to studies of the R15 neuron. This is especially true of R. E. Plant[14] [32] and Carpenter,[33] whose combined works comprise the bulk of parabolic bursting models prior to Ermentrout and Kopell's canonical model.
Though there was no specific mention of the term "parabolic bursting" in Plant's papers, Plant's model(s) do involve a slow, modulating oscillation which control bursting in the model(s).[14] [32] This is, by definition, parabolic bursting. Both of Plant's papers on the topic involve a model derived from the Hodgkin–Huxley equations and include extra conductances, which only add to the complexity of the model.
Carpenter developed her model primarily for a square wave burster.[33] The model was capable of producing a small variety of square wave bursts and produced parabolic bursts as a consequence of adding an extra conductance. However, the model applied to only spatial propagation down axons and not situations where oscillations are limited to a small region in space (i.e. it was not suited for "space-clamped" situations).
The lack of a simple, generalizable, space-clamped, parabolic bursting model motivated Ermentrout and Kopell to develop the theta model.
It is possible to describe a multitude of parabolic bursting cells by deriving a simple mathematical model, called a canonical model. Derivation of the Ermentrout and Kopell canonical model begins with the general form for parabolic bursting, and notation will be fixed to clarify the discussion. The letters
f
g
h
I
x
y
\theta
\varepsilon
p
q
In the following generalized system of equations for parabolic bursting, the values of
f
h
y
g
y |
x |
y |
x |
x
y
x |
=f(x)+\varepsilon2g(x,y,\varepsilon),
y |
=\varepsilonh(x,y,\varepsilon),
where
x
p
x\inRp
y
q
y\inRq
\varepsilon
f
g
h
x |
=f(x)
R2
x=0
y |
=h(0,y,0)
\varepsilon=0
x |
=f(x)
x=0
x=0
y |
=h(0,y,0)
The theta model can be used in place of any parabolic bursting model that satisfies the assumptions above.
The theta model is a reduction of the generalized system from the previous section and takes the form,
d\theta | |
dt |
=1-\cos\theta+(1+\cos\theta)I(t), \theta\inS1.
This model is one of the simplest excitable neuron models.[18] The state variable
\theta
I(t)
\theta
\theta=\pi
The theta model is capable of a single saddle-node bifurcation and can be shown to be the "normal form for the saddle-node on a limit cycle bifurcation."[8] When
(I<0)
R2
S1
R2
(I>0)
\theta |
I(t)=0
Near the bifurcation point, the theta model resembles the quadratic integrate and fire model:
dx | |
dt |
=x2+I.
For I > 0, the solutions of this equation blow up in finite time. By resetting the trajectory
x(t)
-infty
+infty
T=
\pi | |
\sqrt{I |
Therefore, the period diverges as
I → 0+
When
I(t)
I(t):=\sin(\alphat)
\alpha
\alphat\in(0,\pi)
I(t)
\theta
\pi
\alphat
\pi
\alphat
\alphat=\pi/2
\alphat=\pi
\theta
\theta=\pi
\alphat\in(\pi,2\pi)
The derivation comes in the form of two lemmas in Ermentrout and Kopell (1986). Lemma 1, in summary, states that when viewing the general equations above in a subset
S1 x R2
x1 |
=\overline{f}(x1)+\varepsilon2\overline{g}(x1,y,\varepsilon) x1\inS1,
y |
=\varepsilon\overline{h}(x1,y,\varepsilon) y\inRq.
By lemma 2 in Ermentrout and Kopell 1986, "There exists a change of coordinates... and a constant, c, such that in new coordinates, the two equations above converge pointwise as
\varepsilon → 0
\theta |
=(1-\cos\theta)+(1+\cos\theta)\overline{g}(0,y,0),
y |
=
1 | |
c |
\overline{h}(0,y,0),
for all
\theta ≠ \pi
\theta=\pi
I(t):=\overline{g}(0,y,0)
In general, given a scalar phase model of the form
\theta |
=f(\theta)+g(\theta)S(t),
where
S(t)
However, the theta model is a special case of such an oscillator and happens to have a closed-form solution for the PRC. The theta model is recovered by defining
f
g
f(\theta)=(1-\cos\theta)+I(1+\cos\theta),
g(\theta)=(1+\cos\theta).
In the appendix of Ermentrout 1996, the PRC is shown to be
Z(\theta)=K(1+\cos\theta)
The authors of Soto-Treviño et al. (1996) discuss in great detail the similarities between Plant's (1976) model and the theta model. At first glance, the mechanisms of bursting in both systems are very different: In Plant's model, there are two slow oscillations – one for conductance of a specific current and one for the concentration of calcium. The calcium oscillations are active only when the membrane potential is capable of oscillating. This contrasts heavily against the theta model in which one slow wave modulates the burst of the neuron and the slow wave has no dependence upon the bursts. Despite these differences, the theta model is shown to be similar to Plant's (1976) model by a series of coordinate transformations. In the process, Soto-Trevino, et al. discovered that the theta model was more general than originally believed.
The quadratic integrate-and-fire (QIF) model was created by Latham et al. in 2000 to explore the many questions related to networks of neurons with low firing rates.[12] It was unclear to Latham et al. why networks of neurons with "standard" parameters were unable to generate sustained low frequency firing rates, while networks with low firing rates were often seen in biological systems.
According to Gerstner and Kistler (2002), the quadratic integrate-and-fire (QIF) model is given by the following differential equation:
\tau
u |
=a0(u-urest)(u-uc)+RmI,
where
a0
u
urest
uc
Rm
\tau
uc>urest
I=0
I
u
ur
The theta model is very similar to the QIF model since the theta model differs from the QIF model by means of a simple coordinate transform.[10] [12] By scaling the voltage appropriately and letting
\DeltaI
u |
=u2+\DeltaI.
Similarly, the theta model can be rewritten as
\theta |
=1-\cos\theta+(1+\cos\theta)\DeltaI.
The following proof will show that the QIF model becomes the theta model given an appropriate choice for the coordinate transform.
Define
u(t)=\tan(\theta/2)
d\tan(x)/dx=1/\cos2(x)
u |
=
1 | |||||||
|
1 | |
2 |
\theta |
=u2+\DeltaI.
An additional substitution and rearranging in terms of
\theta
\theta |
=2\left[
| |||||
\cos | \left( |
\theta2\right) | |
+ |
| ||||
\cos |
\right]=2\left[\sin2\left(
\theta | |
2\right) |
+\cos2\left(
\theta | |
2\right)\Delta |
I\right].
Using the trigonometric identities
\cos2(x/2)=
1+\cos(x) | |
2 |
\sin2(x/2)=
1-\cos(x) | |
2 |
\theta |
\theta |
=2\left[
1-\cos\theta | |
2 |
+\left(
1+\cos\theta | |
2 |
\right)\DeltaI\right]=1-\cos\theta+(1+\cos\theta)\DeltaI.
Therefore, there exists a change of coordinates, namely
u(t)=\tan(\theta/2)
Though the theta model was originally used to model slow cytoplasmic oscillations that modulate fast membrane oscillations in a single cell, Ermentrout and Kopell found that the theta model could be applied just as easily to systems of two electrically coupled cells such that the slow oscillations of one cell modulates the bursts of the other.[7] Such cells serve as the central pattern generator (CPG) of the pyloric system in the lobster stomatograstic ganglion.[36] In such a system, a slow oscillator, called the anterior burster (AB) cell, modulates the bursting cell called the pyloric dilator (PD), resulting in parabolic bursts.[7]
A group led by Boergers,[16] used the theta model to explain why exposure to multiple simultaneous stimuli can reduce the response of the visual cortex below the normal response from a single (preferred) stimulus. Their computational results showed that this may happen due to strong stimulation of a large group of inhibitory neurons. This effect not only inhibits neighboring populations, but has the extra consequence of leaving the inhibitory neurons in disarray, thus increasing the effectiveness of inhibition.
Osan et al. (2002) found that in a network of theta neurons, there exist two different types of waves that propagate smoothly over the network, given a sufficiently large coupling strength.[17] Such traveling waves are of interest because they are frequently observed in pharmacologically treated brain slices, but are hard to measure in intact animals brains.[17] The authors used a network of theta models in favor of a network of leaky integrate-and-fire (LIF) models due to two primary advantages: first, the theta model is continuous, and second, the theta model retains information about "the delay between the crossing of the spiking threshold and the actual firing of an action potential". The LIF fails to satisfy both conditions.
The theta model can also be applied to research beyond the realm of biology. McKennoch et al. (2008) derived a steepest gradient descent learning rule based on theta neuron dynamics.[18] Their model is based on the assumption that "intrinsic neuron dynamics are sufficient to achieve consistent time coding, with no need to involve the precise shape of postsynaptic currents..." contrary to similar models like SpikeProp and Tempotron, which depend heavily on the shape of the postsynaptic potential (PSP). Not only could the multilayer theta network perform just about as well as Tempotron learning, but the rule trained the multilayer theta network to perform certain tasks neither SpikeProp nor Tempotron were capable of.
According to Kopell and Ermentrout (2004), a limitation of the theta model lies in its relative difficulty in electrically coupling two theta neurons. It is possible to create large networks of theta neurons – and much research has been done with such networks – but it may be advantageous to use Quadratic Integrate-and-Fire (QIF) neurons, which allow for electrical coupling in a "straightforward way".[37]