The self-consistent mean field (SCMF) method is an adaptation of mean field theory used in protein structure prediction to determine the optimal amino acid side chain packing given a fixed protein backbone. It is faster but less accurate than dead-end elimination and is generally used in situations where the protein of interest is too large for the problem to be tractable by DEE.
Like dead-end elimination, the SCMF method explores conformational space by discretizing the dihedral angles of each side chain into a set of rotamers for each position in the protein sequence. The method iteratively develops a probabilistic description of the relative population of each possible rotamer at each position, and the probability of a given structure is defined as a function of the probabilities of its individual rotamer components.
The basic requirements for an effective SCMF implementation are:
The process is generally initialized with a uniform probability distribution over the rotamers—that is, if there are
p
kth
A | |
r | |
k |
1/p
The energy of an individual rotamer
rk
N
p
i
Mi
E
Pi
A | |
(r | |
k |
)
Mi
A | |
(r | |
k |
)=Ek
A | |
(r | |
k |
)+
N | |
\sum | |
x=1 |
p | |
\sum | |
y=1 |
Pi-1
y | |
(r | |
x |
)Exy
A | |
(r | |
k |
,
y | |
r | |
x |
)
These mean-field energies are used to update the probabilities through the Boltzmann law:
Pi
A | |
(r | |
k |
)=\left(\exp\left(-
| |||||||||||||
kT |
p | ||
\right)\right)\left(\sum | \exp\left(- | |
y=1 |
| |||||||||||||
kT |
\right)\right)-1
k
T
Although computing the system energy is not required in carrying out the SCMF method, it is useful to know the overall energies of the converged results. The system energy
Msys
Msys=Msingle+Mpair
Msingle=
N | |
\sum | |
x=1 |
p | |
\sum | |
y=1 |
y | |
P(r | |
x |
)Ex
y | |
(r | |
x |
)
Mpair=
N | |
\sum | |
x=1 |
p | |
\sum | |
y=1 |
N | |
\sum | |
a=x+1 |
p | |
\sum | |
b=1 |
y | |
\left(P(r | |
x |
b | |
)P(r | |
a |
)Exy
y | |
(r | |
x |
,
b | |
r | |
a |
)\right)
Perfect convergence for the SCMF method would result in a probability of 1 for exactly one rotamer at each position
k
Unlike dead-end elimination, SCMF is not guaranteed to converge on the optimal solution. However, it is deterministic (as in, it will converge to the same solution every time given the same initial conditions), unlike alternatives that rely on Monte Carlo analysis. By comparison to DEE, which is guaranteed to find the optimal solution, SCMF is faster but less accurate overall; it is significantly better at identifying correct side chain conformations in the protein's core than it is on identifying correct surface conformations. Geometric packing constraints are less restrictive on the surface and thus provide fewer boundaries to the conformational search.