Ewald summation


Ewald summation, named after Paul Peter Ewald, is a method for computing long-range interactions (e.g. electrostatic interactions) in periodic systems. It was first developed as a method for calculating the electrostatic energies of ionic crystals, and is now commonly used for calculating long-range interactions in computational chemistry. Ewald summation is a special case of the Poisson summation formula, replacing the summation of interaction energies in real space with an equivalent summation in Fourier space. In this method, the long-range interaction is divided into two parts: a short-range contribution, and a long-range contribution which does not have a singularity. The short-range contribution is calculated in real space, whereas the long-range contribution is calculated using a Fourier transform. The advantage of this method is the rapid convergence of the energy compared with that of a direct summation. As a result, the method combines high accuracy with reasonable speed, and it is the de facto standard method for calculating long-range interactions in periodic systems. The method requires charge neutrality of the molecular system to accurately calculate the total Coulombic interaction. A study of the truncation errors introduced in the energy and force calculations of disordered point-charge systems is provided by Kolafa and Perram.<ref>Kolafa, Jiri; Perram, John W. (September 1992). "Cutoff Errors in the Ewald Summation Formulae for Point Charge Systems". Molecular Simulation. 9 (5): 351–368. doi:10.1080/08927029208049126.</ref>

Derivation

Ewald summation rewrites the interaction potential as the sum of two terms, <math display="block">\varphi(\mathbf{r}) \ \stackrel{\mathrm{def}}{=}\ \varphi_{sr}(\mathbf{r}) + \varphi_{\ell r}(\mathbf{r}),</math> where <math>\varphi_{sr}(\mathbf{r})</math> represents the short-range term whose sum quickly converges in real space and <math>\varphi_{\ell r}(\mathbf{r})</math> represents the long-range term whose sum quickly converges in Fourier (reciprocal) space. The long-ranged part should be finite for all arguments (most notably r = 0) but may have any convenient mathematical form, most typically a Gaussian distribution. The method assumes that the short-range part can be summed easily; hence, the problem becomes the summation of the long-range term. Due to the use of the Fourier sum, the method implicitly assumes that the system under study is infinitely periodic (a sensible assumption for the interiors of crystals). One repeating unit of this hypothetical periodic system is called a unit cell. One such cell is chosen as the "central cell" for reference and the remaining cells are called images.
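As an illustration of such a splitting, the following minimal sketch uses the common Gaussian-screening choice for the Coulomb potential <math>1/r</math>, namely <math>\varphi_{sr}(r) = \operatorname{erfc}(\alpha r)/r</math> and <math>\varphi_{\ell r}(r) = \operatorname{erf}(\alpha r)/r</math>. The splitting parameter <code>alpha</code> and the function names <code>phi_sr</code> and <code>phi_lr</code> are assumptions made for this example, not notation from the text above.

```python
import math

def phi_sr(r, alpha):
    """Short-range part: decays like a Gaussian but is singular at r = 0."""
    return math.erfc(alpha * r) / r

def phi_lr(r, alpha):
    """Long-range part: smooth everywhere and finite at r = 0."""
    if r < 1e-8:
        # Limit of erf(alpha * r) / r as r -> 0
        return 2.0 * alpha / math.sqrt(math.pi)
    return math.erf(alpha * r) / r

alpha = 1.0  # assumed splitting parameter (inverse length)
for r in (0.5, 1.0, 2.0, 4.0):
    sr, lr = phi_sr(r, alpha), phi_lr(r, alpha)
    # The two parts always add up to the bare Coulomb potential 1/r
    print(f"r={r:4.1f}  sr={sr:.3e}  lr={lr:.6f}  sum={sr + lr:.6f}  1/r={1.0 / r:.6f}")
```

The short-range part becomes negligible after a few multiples of <math>1/\alpha</math>, which is what makes truncating its real-space sum safe, while the long-range part stays finite at the origin.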

The long-range interaction energy is the sum of interaction energies between the charges of a central unit cell and all the charges of the lattice. Hence, it can be represented as a double integral over two charge density fields representing the fields of the unit cell and the crystal lattice <math display="block"> E_{\ell r} = \iint d\mathbf{r}\, d\mathbf{r}^\prime\, \rho_\text{TOT}(\mathbf{r}) \rho_{uc}(\mathbf{r}^\prime) \ \varphi_{\ell r}(\mathbf{r} - \mathbf{r}^\prime) </math> where the unit-cell charge density field <math>\rho_{uc}(\mathbf{r})</math> is a sum over the positions <math>\mathbf{r}_k</math> of the charges <math>q_k</math> in the central unit cell <math display="block"> \rho_{uc}(\mathbf{r}) \ \stackrel{\mathrm{def}}{=}\ \sum_{\mathrm{charges}\ k} q_k \delta(\mathbf{r} - \mathbf{r}_k) </math> and the total charge density field <math>\rho_\text{TOT}(\mathbf{r})</math> is the same sum over the unit-cell charges <math>q_{k}</math> and their periodic images <math display="block"> \rho_\text{TOT}(\mathbf{r}) \ \stackrel{\mathrm{def}}{=}\ \sum_{n_1, n_2, n_3} \sum_{\mathrm{charges}\ k} q_k \delta(\mathbf{r} - \mathbf{r}_k - n_1 \mathbf{a}_1 - n_2 \mathbf{a}_2 - n_3 \mathbf{a}_3) </math>

Here, <math>\delta(\mathbf{x})</math> is the Dirac delta function, <math>\mathbf{a}_1</math>, <math>\mathbf{a}_2</math> and <math>\mathbf{a}_3</math> are the lattice vectors and <math>n_1</math>, <math>n_2</math> and <math>n_3</math> range over all integers. The total field <math>\rho_\text{TOT}(\mathbf{r})</math> can be represented as a convolution of <math>\rho_{uc}(\mathbf{r})</math> with a lattice function <math>L(\mathbf{r})</math> <math display="block"> L(\mathbf{r}) \ \stackrel{\mathrm{def}}{=}\ \sum_{n_1, n_2, n_3} \delta(\mathbf{r} - n_1 \mathbf{a}_1 - n_2 \mathbf{a}_2 - n_3 \mathbf{a}_3) </math>

Since this is a convolution, the Fourier transformation of <math>\rho_\text{TOT}(\mathbf{r})</math> is a product <math display="block"> \tilde{\rho}_\text{TOT}(\mathbf{k}) = \tilde{L}(\mathbf{k}) \tilde{\rho}_{uc}(\mathbf{k}) </math> where the Fourier transform of the lattice function is another sum over delta functions <math display="block"> \tilde{L}(\mathbf{k}) = \frac{\left(2\pi \right)^{3}}{\Omega} \sum_{m_1, m_2, m_3} \delta(\mathbf{k} - m_1 \mathbf{b}_1 - m_2 \mathbf{b}_2 - m_3 \mathbf{b}_3) </math> where the reciprocal space vectors are defined <math>\mathbf{b}_1 \ \stackrel{\mathrm{def}}{=}\ 2 \pi \frac{\mathbf{a}_2 \times \mathbf{a}_3}{\Omega}</math> (and cyclic permutations) where <math>\Omega \ \stackrel{\mathrm{def}}{=}\ \mathbf{a}_1 \cdot \left( \mathbf{a}_2 \times \mathbf{a}_3 \right)</math> is the volume of the central unit cell (if it is geometrically a parallelepiped, which is often but not necessarily the case). Note that both <math>L(\mathbf{r})</math> and <math>\tilde{L}(\mathbf{k})</math> are real, even functions.
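The reciprocal vectors defined above can be computed directly from the lattice vectors. The sketch below is a dependency-free illustration; the helper names and the face-centred cubic example cell are assumptions chosen for the demonstration.

```python
import math

def cross(u, v):
    """Cross product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    """Dot product of two 3-vectors."""
    return sum(ui * vi for ui, vi in zip(u, v))

def reciprocal_vectors(a1, a2, a3):
    """Return (b1, b2, b3, omega) such that b_i . a_j = 2 pi delta_ij."""
    omega = dot(a1, cross(a2, a3))  # signed volume of the unit cell
    scale = 2.0 * math.pi / omega
    b1 = tuple(scale * c for c in cross(a2, a3))
    b2 = tuple(scale * c for c in cross(a3, a1))  # cyclic permutation
    b3 = tuple(scale * c for c in cross(a1, a2))  # cyclic permutation
    return b1, b2, b3, omega

# Assumed example: primitive vectors of a face-centred cubic lattice (a = 1)
a1, a2, a3 = (0.0, 0.5, 0.5), (0.5, 0.0, 0.5), (0.5, 0.5, 0.0)
b1, b2, b3, omega = reciprocal_vectors(a1, a2, a3)
print("cell volume:", omega)
print("b1:", b1)
```

The defining property <math>\mathbf{a}_i \cdot \mathbf{b}_j = 2\pi \delta_{ij}</math> is a convenient check on any implementation.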

For brevity, define an effective single-particle potential <math display="block"> v(\mathbf{r}) \ \stackrel{\mathrm{def}}{=}\ \int d\mathbf{r}^{\prime}\, \rho_{uc}(\mathbf{r}^\prime) \ \varphi_{\ell r}(\mathbf{r} - \mathbf{r}^\prime) </math>

Since this is also a convolution, the Fourier transformation of the same equation is a product <math display="block"> \tilde{V}(\mathbf{k}) = \tilde{\rho}_{uc}(\mathbf{k}) \tilde{\Phi}(\mathbf{k}) </math> where <math>\tilde{\Phi}(\mathbf{k})</math> is the Fourier transform of <math>\varphi_{\ell r}(\mathbf{r})</math>, and the Fourier transform is defined by <math display="block"> \tilde{V}(\mathbf{k}) \ \stackrel{\mathrm{def}}{=}\ \int d\mathbf{r} \ v(\mathbf{r}) \ e^{-i\mathbf{k} \cdot \mathbf{r}} </math>

The energy can now be written as a single field integral <math display="block"> E_{\ell r} = \int d\mathbf{r} \ \rho_\text{TOT}(\mathbf{r}) \ v(\mathbf{r}) </math>

Using the Plancherel theorem, the energy can also be summed in Fourier space <math display="block"> E_{\ell r} = \int \frac{d\mathbf{k}}{\left(2\pi\right)^3} \ \tilde{\rho}_\text{TOT}^*(\mathbf{k}) \tilde{V}(\mathbf{k}) = \int \frac{d\mathbf{k}}{\left(2\pi\right)^3} \tilde{L}^*(\mathbf{k}) \left| \tilde{\rho}_{uc}(\mathbf{k})\right|^2 \tilde{\Phi}(\mathbf{k}) = \frac{1}{\Omega} \sum_{m_1, m_2, m_3} \left| \tilde{\rho}_{uc}(\mathbf{k})\right|^2 \tilde{\Phi}(\mathbf{k}) </math>

where <math>\mathbf{k} = m_1 \mathbf{b}_1 + m_2 \mathbf{b}_2 + m_3 \mathbf{b}_3</math> in the final summation.

This is the essential result. Once <math>\tilde{\rho}_{uc}(\mathbf{k})</math> is calculated, the summation/integration over <math>\mathbf{k}</math> is straightforward and should converge quickly. The most common reason for lack of convergence is a poorly defined unit cell, which must be charge neutral to avoid infinite sums.
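To make the result concrete, the following sketch evaluates an Ewald sum for the rock-salt (NaCl) structure and extracts its Madelung constant. It rests on several assumptions that are not spelled out in the derivation above: Gaussian units, a cubic cell, and the Gaussian-screening split <math>\varphi_{\ell r}(r) = \operatorname{erf}(\alpha r)/r</math>, whose Fourier transform is <math>\tilde{\Phi}(\mathbf{k}) = 4\pi e^{-k^2/4\alpha^2}/k^2</math>. It also follows the usual conventions of including a factor <math>1/2</math> so that each pair is counted once, dropping the <math>\mathbf{k} = 0</math> term for a neutral cell, and subtracting the interaction of each charge with its own smooth cloud. The function name <code>ewald_energy</code> and the cutoffs <code>n_real</code> and <code>m_max</code> are hypothetical.

```python
import cmath
import math
from itertools import product

def ewald_energy(charges, positions, box, alpha, n_real=2, m_max=6):
    """Coulomb energy per cubic cell of side `box` (Gaussian units, assumed)."""
    n_ions = len(charges)
    # Short-range part: direct sum of erfc-screened interactions over nearby images.
    e_real = 0.0
    for n in product(range(-n_real, n_real + 1), repeat=3):
        for i in range(n_ions):
            for j in range(n_ions):
                if i == j and n == (0, 0, 0):
                    continue  # a charge does not interact with itself
                d = [positions[i][c] - positions[j][c] + n[c] * box for c in range(3)]
                r = math.sqrt(sum(x * x for x in d))
                e_real += 0.5 * charges[i] * charges[j] * math.erfc(alpha * r) / r
    # Long-range part: (1/Omega) sum_k |rho_uc(k)|^2 Phi(k), halved to count pairs once.
    omega = box ** 3
    e_recip = 0.0
    for m in product(range(-m_max, m_max + 1), repeat=3):
        if m == (0, 0, 0):
            continue  # k = 0 is dropped; this requires a charge-neutral cell
        k = [2.0 * math.pi * mc / box for mc in m]
        k2 = sum(x * x for x in k)
        rho_k = sum(q * cmath.exp(-1j * sum(kc * rc for kc, rc in zip(k, pos)))
                    for q, pos in zip(charges, positions))
        phi_k = 4.0 * math.pi / k2 * math.exp(-k2 / (4.0 * alpha ** 2))
        e_recip += 0.5 * abs(rho_k) ** 2 * phi_k / omega
    # The Fourier sum includes each charge interacting with its own cloud; remove it.
    e_self = alpha / math.sqrt(math.pi) * sum(q * q for q in charges)
    return e_real + e_recip - e_self

# Conventional NaCl cell with lattice constant 1: four cations, four anions.
cations = [(0.0, 0.0, 0.0), (0.5, 0.5, 0.0), (0.5, 0.0, 0.5), (0.0, 0.5, 0.5)]
anions = [(0.5, 0.0, 0.0), (0.0, 0.5, 0.0), (0.0, 0.0, 0.5), (0.5, 0.5, 0.5)]
positions = cations + anions
charges = [1.0] * 4 + [-1.0] * 4

energy = ewald_energy(charges, positions, box=1.0, alpha=3.0)
# Four ion pairs per cell, nearest-neighbour distance 0.5: E = -4 M / 0.5
madelung = -energy * 0.5 / 4.0
print("Madelung constant estimate:", madelung)
```

The estimate can be compared with the commonly quoted NaCl Madelung constant of about 1.7476. A useful sanity check is that the total energy does not depend on <code>alpha</code>, as long as both sums are carried far enough to converge.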

Particle mesh Ewald (PME) method

Ewald summation was developed as a method in theoretical physics, long before the advent of computers. However, the Ewald method has enjoyed widespread use since the 1970s in computer simulations of particle systems, especially those whose particles interact via an inverse square force law such as gravity or electrostatics. Recently, PME has also been used to calculate the <math>r^{-6}</math> part of the Lennard-Jones potential in order to eliminate artifacts due to truncation.<ref>Di Pierro, M.; Elber, R.; Leimkuhler, B. (2015), "A Stochastic Algorithm for the Isobaric-Isothermal Ensemble with Ewald Summations for all Long Range Forces.", Journal of Chemical Theory and Computation, 11 (12): 5624–5637, doi:10.1021/acs.jctc.5b00648, PMC 4890727, PMID 26616351</ref> Applications include simulations of plasmas, galaxies and molecules.

In the particle mesh method, just as in standard Ewald summation, the generic interaction potential is separated into two terms <math>\varphi(\mathbf{r}) \ \stackrel{\mathrm{def}}{=}\ \varphi_{sr}(\mathbf{r}) + \varphi_{\ell r}(\mathbf{r})</math>. The basic idea of particle mesh Ewald summation is to replace the direct summation of interaction energies between point particles <math display="block"> E_\text{TOT} = \sum_{i,j} \varphi(\mathbf{r}_{j} - \mathbf{r}_i) = E_{sr} + E_{\ell r} </math> with two summations, a direct sum <math>E_{sr}</math> of the short-ranged potential in real space <math display="block"> E_{sr} = \sum_{i,j} \varphi_{sr}(\mathbf{r}_j - \mathbf{r}_i) </math> (this is the particle part of particle mesh Ewald) and a summation in Fourier space of the long-ranged part <math display="block"> E_{\ell r} = \sum_{\mathbf{k}} \tilde{\Phi}_{\ell r}(\mathbf{k}) \left| \tilde{\rho}(\mathbf{k}) \right|^2 </math>

where <math>\tilde{\Phi}_{\ell r}</math> and <math>\tilde{\rho}(\mathbf{k})</math> represent the Fourier transforms of the potential and the charge density (this is the Ewald part). Since both summations converge quickly in their respective spaces (real and Fourier), they may be truncated with little loss of accuracy and great improvement in required computational time. To evaluate the Fourier transform <math>\tilde{\rho}(\mathbf{k})</math> of the charge density field efficiently, one uses the fast Fourier transform, which requires that the density field be evaluated on a discrete lattice in space (this is the mesh part).
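The mesh step can be sketched in one dimension. The example below spreads point charges onto a periodic mesh with linear (cloud-in-cell) weights and compares one Fourier mode of the mesh density with the exact structure factor. This is a simplified illustration under stated assumptions: production PME codes generally use higher-order interpolation in three dimensions, and a naive discrete Fourier transform stands in for the fast Fourier transform here only to keep the example dependency-free. All function names and particle data are hypothetical.

```python
import cmath
import math

def assign_charges_to_mesh(charges, positions, box, n_mesh):
    """Spread point charges onto a periodic 1D mesh with linear weights."""
    h = box / n_mesh
    mesh = [0.0] * n_mesh
    for q, x in zip(charges, positions):
        s = (x % box) / h           # position in units of the mesh spacing
        left = int(s) % n_mesh      # mesh point at or below the charge
        frac = s - int(s)           # fractional distance to that mesh point
        mesh[left] += q * (1.0 - frac)
        mesh[(left + 1) % n_mesh] += q * frac  # periodic wrap-around
    return mesh

def mesh_structure_factor(mesh, mode):
    """One Fourier mode of the mesh density (naive DFT, stands in for an FFT)."""
    n = len(mesh)
    return sum(v * cmath.exp(-2j * math.pi * mode * p / n) for p, v in enumerate(mesh))

def exact_structure_factor(charges, positions, box, mode):
    """Exact Fourier mode of the point-charge density, for comparison."""
    return sum(q * cmath.exp(-2j * math.pi * mode * x / box)
               for q, x in zip(charges, positions))

# Hypothetical neutral set of charges in a box of length 1
box = 1.0
charges = [1.0, -1.0, 1.0, -1.0]
positions = [0.13, 0.52, 0.77, 0.91]
mesh = assign_charges_to_mesh(charges, positions, box, n_mesh=64)

for mode in (1, 2, 3):
    approx = mesh_structure_factor(mesh, mode)
    exact = exact_structure_factor(charges, positions, box, mode)
    print(f"mode {mode}: mesh={approx:.4f}  exact={exact:.4f}")
```

The interpolation conserves total charge exactly, and its error grows with the wavenumber, which is one reason the smooth long-range part is the one evaluated on the mesh.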

Due to the periodicity assumption implicit in Ewald summation, applications of the PME method to physical systems require the imposition of periodic symmetry. Thus, the method is best suited to systems that can be simulated as infinite in spatial extent. In molecular dynamics simulations this is normally accomplished by deliberately constructing a charge-neutral unit cell that can be infinitely "tiled" to form images; however, to properly account for the effects of this approximation, these images are reincorporated back into the original simulation cell. The overall effect is called a periodic boundary condition. To visualize this most clearly, think of a unit cube; the upper face is effectively in contact with the lower face, the right with the left face, and the front with the back face. As a result, the unit cell size must be carefully chosen to be large enough to avoid improper motion correlations between two faces "in contact", but still small enough to be computationally feasible. The definition of the cutoff between short- and long-range interactions can also introduce artifacts.
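In simulation codes, periodic boundary conditions are commonly implemented by wrapping coordinates back into the cell and by measuring separations to the nearest periodic image. The sketch below assumes a cubic cell of side <code>box</code>; the function names are hypothetical.

```python
import math

def wrap(x, box):
    """Map a coordinate back into the central cell [0, box)."""
    return x % box

def minimum_image(dx, box):
    """Shift a separation by whole box lengths so it lies within half a box."""
    return dx - box * round(dx / box)

def minimum_image_vector(ri, rj, box):
    """Separation vector from rj to the nearest periodic image of ri."""
    return tuple(minimum_image(a - b, box) for a, b in zip(ri, rj))

box = 1.0
ri = (0.05, 0.50, 0.95)
rj = (0.95, 0.50, 0.05)
d = minimum_image_vector(ri, rj, box)
# The two particles sit near opposite faces, so they are close through the boundary
print("separation:", d, "distance:", math.sqrt(sum(c * c for c in d)))
```

This is the sense in which opposite faces of the cell are "in contact": a particle leaving through one face re-enters through the opposite one.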

The restriction of the density field to a mesh makes the PME method more efficient for systems with "smooth" variations in density, or continuous potential functions. Localized systems or those with large fluctuations in density may be treated more efficiently with the fast multipole method of Greengard and Rokhlin.

Dipole term

The electrostatic energy of a polar crystal (i.e. a crystal with a net dipole <math>\mathbf{p}_{uc}</math> in the unit cell) is conditionally convergent, i.e. it depends on the order of the summation. For example, if the dipole-dipole interactions of a central unit cell with the surrounding unit cells are summed over an ever-increasing cube, the energy converges to a different value than if the interaction energies are summed spherically. Roughly speaking, this conditional convergence arises because (1) the number of interacting dipoles on a shell of radius <math>R</math> grows like <math display="inline">R^2</math>; (2) the strength of a single dipole-dipole interaction falls like <math display="inline">1 / {R^3}</math>; and (3) the mathematical summation <math display="inline">\sum_{n=1}^{\infty} \frac{1}{n}</math> diverges.
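As a rough order-of-magnitude version of this argument, assume the dipoles are spread with a uniform number density <math>\nu</math> (a continuum approximation introduced here for illustration) and ignore the angular dependence of the dipole-dipole interaction. Summing the magnitudes of the contributions shell by shell between an inner radius <math>R_0</math> and an outer radius <math>R_\text{max}</math> then gives <math display="block"> \int_{R_0}^{R_\text{max}} 4\pi \nu R^2 \cdot \frac{1}{R^3} \, dR = 4\pi \nu \ln \frac{R_\text{max}}{R_0} </math> which grows without bound as <math>R_\text{max} \to \infty</math>. The sum is therefore not absolutely convergent, and any finite value it takes relies on cancellation between contributions of opposite sign, which depends on the order (and hence the shape) in which the cells are added.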

This somewhat surprising result can be reconciled with the finite energy of real crystals because such crystals are not infinite, i.e. they have a particular boundary. More specifically, the boundary of a polar crystal has an effective surface charge density <math>\sigma = \mathbf{P} \cdot \mathbf{n}</math> where <math>\mathbf{n}</math> is the surface normal vector and <math>\mathbf{P}</math> represents the net dipole moment per volume. The interaction energy <math>U</math> of the dipole in a central unit cell with that surface charge density can be written<ref>Herce, HD; Garcia, AE; Darden, T (28 March 2007). "The electrostatic surface term: (I) periodic systems". The Journal of Chemical Physics. 126 (12): 124106. Bibcode:2007JChPh.126l4106H. doi:10.1063/1.2714527. PMID 17411107.</ref> <math display="block"> U = \frac{1}{2V_{uc}} \int \frac{\left( \mathbf{p}_{uc}\cdot \mathbf{r} \right) \left( \mathbf{p}_{uc} \cdot \mathbf{n} \right)}{r^3} \, dS </math> where <math>\mathbf{p}_{uc}</math> and <math>V_{uc}</math> are the net dipole moment and volume of the unit cell, <math>dS</math> is an infinitesimal area on the crystal surface and <math>\mathbf{r}</math> is the vector from the central unit cell to the infinitesimal area. This formula results from integrating the energy <math> dU = -\mathbf{p}_{uc} \cdot d\mathbf{E}</math> where <math>d\mathbf{E}</math> represents the infinitesimal electric field generated by an infinitesimal surface charge <math>dq \ \stackrel{\mathrm{def}}{=}\ \sigma dS</math> (Coulomb's law) <math display="block"> d\mathbf{E} \ \stackrel{\mathrm{def}}{=}\ \left( \frac{-1}{4\pi\epsilon} \right) \frac{dq \ \mathbf{r}}{r^3} = \left( \frac{-1}{4\pi\epsilon} \right) \frac{\sigma\, dS \ \mathbf{r} }{r^3} </math> The negative sign derives from the definition of <math>\mathbf{r}</math>, which points towards the charge, not away from it.

History

The Ewald summation was developed by Paul Peter Ewald in 1921 to determine the electrostatic energy (and, hence, the Madelung constant) of ionic crystals.

Scaling

Generally, different Ewald summation methods give different time complexities. Direct calculation gives <math>O(N^2)</math>, where <math>N</math> is the number of atoms in the system. The PME method gives <math>O(N\,\log N)</math>.<ref name="Darden1993">Darden, Tom; York, Darrin; Pedersen, Lee (1993-06-15). "Particle mesh Ewald: An N ⋅log( N ) method for Ewald sums in large systems". The Journal of Chemical Physics. 98 (12): 10089–10092. Bibcode:1993JChPh..9810089D. doi:10.1063/1.464397. ISSN 0021-9606.</ref>


References

<references group="" responsive="1"></references>