The Journal of Chemical Physics, Vol. 112, No. 9, pp. 4419–4420, 1 March 2000
©2000 American Institute of Physics. All rights reserved.
Previous section: TITLE PAGE
Next section: REFERENCES

In a recent paper Schautz and Flad1 studied the dipole moment of CO using various first-principles computational methods. As part of this study they considered the applicability of the Hellmann–Feynman theorem within the fixed-node diffusion quantum Monte Carlo (DMC) method.2,3,4 The purpose of this comment is to point out that the conclusion reached by Schautz and Flad1 regarding the applicability of the Hellmann–Feynman theorem within fixed-node DMC is incorrect.

In the DMC method stochastic evolution of the imaginary-time Schrödinger equation is used to project out the lowest energy many-electron wave function. Such projector methods suffer from a fermion sign problem in which the wave function decays towards the bosonic ground state. To enforce the fermion symmetry the fixed-node approximation5,6 is normally used. In a fixed-node DMC calculation the Schrödinger equation is solved separately in each nodal pocket with the boundary condition that the wave function in the alphath pocket, psialpha, is zero on the surface of the pocket. For simplicity we consider only ground-state wave functions which satisfy the tiling property7,8 that all nodal pockets are equivalent and related by the permutation symmetry. The pocket wave function is zero outside the pocket and has a discontinuous derivative at the surface of the pocket and, therefore, satisfies8

<i>H</i>-hat  <i>psi</i><sub><i>alpha</i></sub> = <i>E</i><sub>D</sub><i>psi</i><sub><i>alpha</i></sub> + <i>g</i>[<i>psi</i><sub><i>alpha</i></sub>]  <i>delta</i>(<b>r</b> – <b>r</b><sub>n</sub>[<i>psi</i><sub><i>alpha</i></sub>]),

where rn[psialpha] is the surface of pocket alpha. The delta function term, which was neglected by Schautz and Flad,1 arises from the action of the kinetic-energy operator on the discontinuity in the derivative of the wave function at the pocket surface. Operating on this equation with the antisymmetrizing operator, Â, gives

<i>H</i>-hat  <i>Psi</i> = <i>E</i><sub>D</sub><i>Psi</i> + <i>h</i>[<i>Psi</i>]  <i>delta</i>(<b>r</b> – <b>r</b><sub>n</sub>[<i>Psi</i>]),

where Psi= Âpsialpha is the antisymmetric DMC wave function and rn[Psi] is the nodal surface of Psi. The nodal surface is fixed by the choice of the trial wave function, PhiT. The DMC energy may then be calculated from

<i>E</i><sub>D</sub> = (([integral]<i>Phi</i><sub><i>T</i></sub><i>H</i>-hat  <i>Psi</i>  <i>d</i><b>r</b>)/([integral]<i>Phi</i><sub><i>T</i></sub><i>Psi</i>  <i>d</i><b>r</b>)).

If the nodal surface of PhiT is exact then Psi has no gradient discontinuities on the nodal surface and <i>H</i>-hat Psi0 = E0Psi0, where Psi0 and E0 are the exact wave function and energy, but if the nodal surface is inexact then h[Psi] is nonzero and normally of order PsiPsi0. On substituting Eq. (2) into Eq. (3) we find that the nodal term, hdelta, does not contribute to the energy because PhiT is zero on the nodal surface. However, the nodal term can contribute to derivatives of ED. A simple way to see this is to note that the DMC energy can also be evaluated as the expectation value with the fixed-node DMC wave function

<i>E</i><sub>D</sub> = (([integral]<i>Psi</i> <i>H</i>-hat  <i>Psi</i>  <i>d</i><b>r</b>)/([integral]<i>Psi</i> <i>Psi</i>  <i>d</i><b>r</b>)).

The derivative with respect to a parameter, lambda, is

(([partial-derivative]<i>E</i><sub>D</sub>)/([partial-derivative] <i>lambda</i>)) = (([integral]<i>Psi</i>([partial-derivative]<i>H</i>-hat/[partial-derivative] <i>lambda</i>)<i>Psi</i>  <i>d</i><b>r</b>)/([integral]<i>Psi</i> <i>Psi</i>  <i>d</i><b>r</b>)) + 2(([integral]([partial-derivative] <i>Psi</i>/[partial-derivative] <i>lambda</i>)<i>H</i>-hat  <i>Psi</i>  <i>d</i><b>r</b>)/([integral]<i>Psi</i> <i>Psi</i>  <i>d</i><b>r</b>)) – 2<i>E</i><sub>D</sub>(([integral]<i>Psi</i>([partial-derivative] <i>Psi</i>/[partial-derivative] <i>lambda</i>) <i>d</i><b>r</b>)/([integral]<i>Psi</i> <i>Psi</i>  <i>d</i><b>r</b>)),

and using Eq. (2) we obtain

(([partial-derivative]<i>E</i><sub>D</sub>)/([partial-derivative] <i>lambda</i>)) = (([integral]<i>Psi</i>([partial-derivative]<i>H</i>-hat/[partial-derivative] <i>lambda</i>)<i>Psi</i>  <i>d</i><b>r</b>)/([integral]<i>Psi</i> <i>Psi</i>  <i>d</i><b>r</b>)) + 2(([integral]([partial-derivative] <i>Psi</i>/[partial-derivative] <i>lambda</i>)<i>h</i> <i>delta</i>  <i>d</i><b>r</b>)/([integral]<i>Psi</i> <i>Psi</i>  <i>d</i><b>r</b>)),

which is the Hellmann–Feynman expression with an additional nodal term. While hdelta is determined by the nodes of Psi (or equivalently those of PhiT), [partial-derivative]Psi/[partial-derivative]lambda depends on how we choose the nodes to vary with lambda. We, therefore, expect the nodal term in Eq. (6) to be nonzero in general. If we choose the nodes to be independent of lambda then [partial-derivative]Psi/[partial-derivative]lambda= 0 on the nodal surface and the contribution from the nodal term is zero. In this case the Hellmann–Feynman theorem holds, as correctly stated by Schautz and Flad.1 However, if the nodes of the trial wave function vary with lambda then the nodal term will normally be nonzero and the Hellmann–Feynman expression does not give the exact derivative of the DMC energy of Eqs. (3) and (4), in contradiction to Schautz and Flad.1

We conclude that Schautz and Flad are correct in stating that the Hellmann–Feynman expression evaluated with the fixed-node DMC wave function is equal to the derivative of the fixed-node DMC energy with respect to a parameter lambda if the nodes are independent of lambda, but this does not hold in general if the nodal surface depends on lambda.


Previous section: TITLE PAGE
Next section: REFERENCES