In quantum mechanics, the Hellmann–Feynman theorem relates the derivative of the total energy with respect to a parameter to the expectation value of the derivative of the Hamiltonian with respect to that same parameter. According to the theorem, once the spatial distribution of the electrons has been determined by solving the Schrödinger equation, all the forces in the system can be calculated using classical electrostatics.
The theorem has been proven independently by many authors, including Paul Güttinger (1932),[1] Wolfgang Pauli (1933),[2] Hans Hellmann (1937)[3] and Richard Feynman (1939).[4]
The theorem states
where
\hat{H}λ
λ
|\psiλ\rangle
λ
Eλ
|\psiλ\rangle
\hat{H}λ|\psiλ\rangle=Eλ|\psiλ\rangle
Note that there is a breakdown of the Hellmann-Feynman theorem close to quantum critical points in the thermodynamic limit.[5]
This proof of the Hellmann–Feynman theorem requires that the wave function be an eigenfunction of the Hamiltonian under consideration; however, it is also possible to prove more generally that the theorem holds for non-eigenfunction wave functions which are stationary (partial derivative is zero) for all relevant variables (such as orbital rotations). The Hartree–Fock wavefunction is an important example of an approximate eigenfunction that still satisfies the Hellmann–Feynman theorem. Notable example of where the Hellmann–Feynman is not applicable is for example finite-order Møller–Plesset perturbation theory, which is not variational.[6]
The proof also employs an identity of normalized wavefunctions – that derivatives of the overlap of a wave function with itself must be zero. Using Dirac's bra–ket notation these two conditions are written as
\hat{H}λ|\psiλ\rangle=Eλ|\psiλ\rangle,
\langle\psiλ|\psiλ\rangle=1 ⇒
d | |
dλ |
\langle\psiλ|\psiλ\rangle=0.
The proof then follows through an application of the derivative product rule to the expectation value of the Hamiltonian viewed as a function of
λ
\begin{align} | dEλ |
dλ |
&=
d | |
dλ |
\langle\psiλ|\hat{H}λ|\psiλ\rangle\\ &=\langle
d\psiλ | |
dλ |
|\hat{H}λ|\psiλ\rangle+\langle\psiλ|\hat{H}λ|
d\psiλ | |
dλ |
\rangle+
\langle\psi | ||||
|
The Hellmann–Feynman theorem is actually a direct, and to some extent trivial, consequence of the variational principle (the Rayleigh–Ritz variational principle) from which the Schrödinger equation may be derived. This is why the Hellmann–Feynman theorem holds for wave-functions (such as the Hartree–Fock wave-function) that, though not eigenfunctions of the Hamiltonian, do derive from a variational principle. This is also why it holds, e.g., in density functional theory, which is not wave-function based and for which the standard derivation does not apply.
According to the Rayleigh–Ritz variational principle, the eigenfunctions of the Schrödinger equation are stationary points of the functional (which is nicknamed Schrödinger functional for brevity):The eigenvalues are the values that the Schrödinger functional takes at the stationary points:where
\psiλ
The most common application of the Hellmann–Feynman theorem is the calculation of intramolecular forces in molecules. This allows for the calculation of equilibrium geometries – the nuclear coordinates where the forces acting upon the nuclei, due to the electrons and other nuclei, vanish. The parameter
λ
1\leqi\leqN
\{ri\}
1\leq\alpha\leqM
\{R\alpha=\{X\alpha,Y\alpha,Z\alpha\}\}
Z\alpha
\hat{H}=\hat{T}+\hat{U}-
N | |
\sum | |
i=1 |
M | |
\sum | |
\alpha=1 |
Z\alpha | |
|ri-R\alpha| |
+
M | |
\sum | |
\alpha |
M | |
\sum | |
\beta>\alpha |
Z\alphaZ\beta | |
|R\alpha-R\beta| |
.
The
x
F | |
X\gamma |
=-
\partialE | |
\partialX\gamma |
=-\langle\psi|
\partial\hat{H | |
Only two components of the Hamiltonian contribute to the required derivative – the electron-nucleus and nucleus-nucleus terms. Differentiating the Hamiltonian yields[7]
\begin{align} | \partial\hat{H |
^ |
^ |
Insertion of this in to the Hellmann–Feynman theorem returns the
x
\rho(r)
F | |
X\gamma |
=Z\gamma\left(\intdr \rho(r)
x-X\gamma | |
|r-R\gamma|3 |
-
M | |
\sum | |
\alpha ≠ \gamma |
Z\alpha
X\alpha-X\gamma | |
|R\alpha-R\gamma|3 |
\right).
An alternative approach for applying the Hellmann–Feynman theorem is to promote a fixed or discrete parameter which appears in a Hamiltonian to be a continuous variable solely for the mathematical purpose of taking a derivative. Possible parameters are physical constants or discrete quantum numbers. As an example, the radial Schrödinger equation for a hydrogen-like atom is
\hat{H}l=-
\hbar2 | \left( | |
2\mur2 |
d | |
dr |
\left(r2
d | |
dr |
\right)-l(l+1)\right)-
Ze2 | |
r |
,
l
l
\partial\hat{H | |
l |
The Hellmann–Feynman theorem then allows for the determination of the expectation value of
1 | |
r2 |
\begin{align} \langle\psinl|
1 | |
r2 |
|\psinl\rangle&=
2\mu | |
\hbar2 |
1 | |
2l+1 |
\langle\psinl|
\partial\hat{H | |
l |
In order to compute the energy derivative, the way
n
l
n-l+1
\partialn/\partiall=1
In the end of Feynman's paper, he states that, "Van der Waals' forces can also be interpreted as arising from charge distributions with higher concentration between the nuclei. The Schrödinger perturbation theory for two interacting atoms at a separation
R
1/R7
1/R7
For a general time-dependent wavefunction satisfying the time-dependent Schrödinger equation, the Hellmann–Feynman theorem is not valid.However, the following identity holds:[9] [10]
\langle\Psi | ||||
|
|\Psiλ(t)\rangle=i\hbar
\partial | |
\partialt |
\langle\Psi | ||||
|
\rangle
i\hbar | \partial\Psiλ(t) |
\partialt |
=Hλ\Psiλ(t)
The proof only relies on the Schrödinger equation and the assumption that partial derivatives with respect to λ and t can be interchanged.
\begin{align} \langle\Psi | ||||
|
|\Psiλ(t)\rangle&=
\partial | |
\partialλ |
\langle\Psiλ(t)|Hλ|\Psiλ(t)\rangle -\langle
\partial\Psiλ(t) | |
\partialλ |
|Hλ|\Psiλ(t)\rangle -\langle\Psiλ(t)|H
|
\rangle\\ &=i\hbar
\partial | |
\partialλ |
\langle\Psi | ||||
|
\rangle -i\hbar\langle
\partial\Psiλ(t) | | | |
\partialλ |
\partial\Psiλ(t) | |
\partialt |
\rangle +i\hbar\langle
\partial\Psiλ(t) | | | |
\partialt |
\partial\Psiλ(t) | |
\partialλ |
\rangle\\ &=i\hbar\langle\Psiλ(t)|
| |||||||
\partialλ\partialt |
\rangle +i\hbar\langle
\partial\Psiλ(t) | | | |
\partialt |
\partial\Psiλ(t) | |
\partialλ |
\rangle\\ &=i\hbar
\partial | |
\partialt |
\langle\Psi | ||||
|
\rangle \end{align}