22 |
|
simulation package of this size and scope would not have been possible |
23 |
|
without the collaborative efforts of my colleagues: Charles |
24 |
|
F.~Vardeman II, Teng Lin, Christopher J.~Fennell and J.~Daniel |
25 |
< |
Gezelter. Although my contributions to {\sc oopse} are significant, |
26 |
< |
consideration of my work apart from the others, would not give a |
25 |
> |
Gezelter. Although my contributions to {\sc oopse} are major, |
26 |
> |
consideration of my work apart from the others would not give a |
27 |
|
complete description to the package's capabilities. As such, all |
28 |
|
contributions to {\sc oopse} to date are presented in this chapter. |
29 |
|
|
30 |
< |
Charles Vardeman is responsible for the parallelization of {\sc oopse} |
31 |
< |
(Sec.~\ref{oopseSec:parallelization}) as well as the inclusion of the |
32 |
< |
embedded-atom potential (Sec.~\ref{oopseSec:eam}). Teng Lin's |
33 |
< |
contributions include refinement of the periodic boundary conditions |
30 |
> |
Charles Vardeman is responsible for the parallelization of the long |
31 |
> |
range forces in {\sc oopse} (Sec.~\ref{oopseSec:parallelization}) as |
32 |
> |
well as the inclusion of the embedded-atom potential for transition |
33 |
> |
metals (Sec.~\ref{oopseSec:eam}). Teng Lin's contributions include |
34 |
> |
refinement of the periodic boundary conditions |
35 |
|
(Sec.~\ref{oopseSec:pbc}), the z-constraint method |
36 |
|
(Sec.~\ref{oopseSec:zcons}), refinement of the property analysis |
37 |
|
programs (Sec.~\ref{oopseSec:props}), and development in the extended |
38 |
< |
state integrators (Sec.~\ref{oopseSec:noseHooverThermo}). Christopher |
38 |
> |
system integrators (Sec.~\ref{oopseSec:noseHooverThermo}). Christopher |
39 |
|
Fennell worked on the symplectic integrator |
40 |
|
(Sec.~\ref{oopseSec:integrate}) and the refinement of the {\sc ssd} |
41 |
|
water model (Sec.~\ref{oopseSec:SSD}). Daniel Gezelter lent his |
49 |
|
of the Lennard-Jones (Sec.~\ref{sec:LJPot}) and {\sc duff} |
50 |
|
(Sec.~\ref{oopseSec:DUFF}) force fields, and initial implementation of |
51 |
|
the property analysis (Sec.~\ref{oopseSec:props}) and system |
52 |
< |
initialization (Sec.~\ref{oopseSec:initCoords}) utility programs. |
52 |
> |
initialization (Sec.~\ref{oopseSec:initCoords}) utility programs. {\sc |
53 |
> |
oopse}, like many other Molecular Dynamics programs, is a work in |
54 |
> |
progress, and will continue to be so for many graduate student |
55 |
> |
lifetimes. |
56 |
|
|
57 |
|
\section{\label{sec:intro}Introduction} |
58 |
|
|
70 |
|
|
71 |
|
Despite their utility, problems with these packages arise when |
72 |
|
researchers try to develop techniques or energetic models that the |
73 |
< |
code was not originally designed to do. Examples of uncommonly |
73 |
> |
code was not originally designed to simulate. Examples of uncommonly |
74 |
|
implemented techniques and energetics include; dipole-dipole |
75 |
|
interactions, rigid body dynamics, and metallic embedded |
76 |
|
potentials. When faced with these obstacles, a researcher must either |
79 |
|
simulation code capable of implementing the types of models upon which |
80 |
|
our research is based. |
81 |
|
|
82 |
< |
Having written {\sc oopse} we are implementing the concept of Open |
83 |
< |
Source development, and releasing our source code into the public |
84 |
< |
domain. It is our intent that by doing so, other researchers might |
85 |
< |
benefit from our work, and add their own contributions to the |
86 |
< |
package. The license under which {\sc oopse} is distributed allows any |
87 |
< |
researcher to download and modify the source code for their own |
88 |
< |
use. In this way further development of {\sc oopse} is not limited to |
89 |
< |
only the models of interest to ourselves, but also those of the |
90 |
< |
community of scientists who contribute back to the project. |
82 |
> |
In developing {\sc oopse}, we have adhered to the precepts of Open |
83 |
> |
Source development, and are releasing our source code with a |
84 |
> |
permissive license. It is our intent that by doing so, other |
85 |
> |
researchers might benefit from our work, and add their own |
86 |
> |
contributions to the package. The license under which {\sc oopse} is |
87 |
> |
distributed allows any researcher to download and modify the source |
88 |
> |
code for their own use. In this way further development of {\sc oopse} |
89 |
> |
is not limited to only the models of interest to ourselves, but also |
90 |
> |
those of the community of scientists who contribute back to the |
91 |
> |
project. |
92 |
|
|
93 |
|
We have structured this chapter to first discuss the empirical energy |
94 |
|
functions that {\sc oopse } implements in |
95 |
|
Sec.~\ref{oopseSec:empiricalEnergy}. Following that is a discussion of |
96 |
|
the various input and output files associated with the package |
97 |
< |
(Sec.~\ref{oopseSec:IOfiles}). In Sec.~\ref{oopseSec:mechanics} |
97 |
> |
(Sec.~\ref{oopseSec:IOfiles}). Sec.~\ref{oopseSec:mechanics} |
98 |
|
elucidates the various Molecular Dynamics algorithms {\sc oopse} |
99 |
|
implements in the integration of the Newtonian equations of |
100 |
|
motion. Basic analysis of the trajectories obtained from the |
101 |
|
simulation is discussed in Sec.~\ref{oopseSec:props}. Program design |
102 |
< |
considerations as well as the software distribution license is |
103 |
< |
presented in Sec.~\ref{oopseSec:design}. And lastly, |
99 |
< |
Sec.~\ref{oopseSec:conclusion} concludes the chapter. |
102 |
> |
considerations are presented in Sec.~\ref{oopseSec:design}. And |
103 |
> |
lastly, Sec.~\ref{oopseSec:conclusion} concludes the chapter. |
104 |
|
|
105 |
|
\section{\label{oopseSec:empiricalEnergy}The Empirical Energy Functions} |
106 |
|
|
113 |
|
methyl and carbonyl groups. The atoms are also capable of having |
114 |
|
directional components associated with them (\emph{e.g.}~permanent |
115 |
|
dipoles). Charges, permanent dipoles, and Lennard-Jones parameters for |
116 |
< |
a given atom type are set in the force field parameter files |
116 |
> |
a given atom type are set in the force field parameter files. |
117 |
|
|
118 |
|
\begin{lstlisting}[float,caption={[Specifier for molecules and atoms] A sample specification of an Ar molecule},label=sch:AtmMole] |
119 |
|
molecule{ |
133 |
|
identities of all the atoms and rigid bodies associated with |
134 |
|
themselves, and are responsible for the evaluation of their own |
135 |
|
internal interactions (\emph{i.e.}~bonds, bends, and torsions). Scheme |
136 |
< |
\ref{sch:AtmMole} shows how one creates a molecule in a |
136 |
> |
\ref{sch:AtmMole} shows how one creates a molecule in a ``model'' or |
137 |
|
\texttt{.mdl} file. The position of the atoms given in the |
138 |
|
declaration are relative to the origin of the molecule, and is used |
139 |
|
when creating a system containing the molecule. |
143 |
|
to handle rigid body dynamics. Rigid bodies are non-spherical |
144 |
|
particles or collections of particles that have a constant internal |
145 |
|
potential and move collectively.\cite{Goldstein01} They are not |
146 |
< |
included in most simulation packages because of the requirement to |
147 |
< |
propagate the orientational degrees of freedom. Until recently, |
148 |
< |
integrators which propagate orientational motion have been lacking. |
146 |
> |
included in most simulation packages because of the algorithmic |
147 |
> |
complexity involved in propagating orientational degrees of |
148 |
> |
freedom. Until recently, integrators which propagate orientational |
149 |
> |
motion have been much worse than those available for translational |
150 |
> |
motion. |
151 |
|
|
152 |
|
Moving a rigid body involves determination of both the force and |
153 |
|
torque applied by the surroundings, which directly affect the |
156 |
|
first be calculated for all the internal particles. The total force on |
157 |
|
the rigid body is simply the sum of these external forces. |
158 |
|
Accumulation of the total torque on the rigid body is more complex |
159 |
< |
than the force in that it is the torque applied on the center of mass |
160 |
< |
that dictates rotational motion. The torque on rigid body {\it i} is |
159 |
> |
than the force because the torque is applied to the center of mass of |
160 |
> |
the rigid body. The torque on rigid body $i$ is |
161 |
|
\begin{equation} |
162 |
|
\boldsymbol{\tau}_i= |
163 |
< |
\sum_{a}(\mathbf{r}_{ia}-\mathbf{r}_i)\times \mathbf{f}_{ia} |
164 |
< |
+ \boldsymbol{\tau}_{ia}, |
163 |
> |
\sum_{a}\biggl[(\mathbf{r}_{ia}-\mathbf{r}_i)\times \mathbf{f}_{ia} |
164 |
> |
+ \boldsymbol{\tau}_{ia}\biggr] |
165 |
|
\label{eq:torqueAccumulate} |
166 |
|
\end{equation} |
167 |
|
where $\boldsymbol{\tau}_i$ and $\mathbf{r}_i$ are the torque on and |
170 |
|
position of, and torque on the component particles of the rigid body. |
171 |
|
|
172 |
|
The summation of the total torque is done in the body fixed axis of |
173 |
< |
the rigid body. In order to move between the space fixed and body |
173 |
> |
each rigid body. In order to move between the space fixed and body |
174 |
|
fixed coordinate axes, parameters describing the orientation must be |
175 |
|
maintained for each rigid body. At a minimum, the rotation matrix |
176 |
|
(\textbf{A}) can be described by the three Euler angles ($\phi, |
185 |
|
systems.\cite{Evans77} |
186 |
|
|
187 |
|
{\sc oopse} utilizes a relatively new scheme that propagates the |
188 |
< |
entire nine parameter rotation matrix internally. Further discussion |
188 |
> |
entire nine parameter rotation matrix. Further discussion |
189 |
|
on this choice can be found in Sec.~\ref{oopseSec:integrate}. An example |
190 |
|
definition of a rigid body can be seen in Scheme |
191 |
|
\ref{sch:rigidBody}. The positions in the atom definitions are the |
239 |
|
|
240 |
|
\begin{lstlisting}[float,caption={[Invocation of the Lennard-Jones force field] A sample system using the Lennard-Jones force field.},label={sch:LJFF}] |
241 |
|
|
236 |
– |
/* |
237 |
– |
* The Ar molecule is specified |
238 |
– |
* external to the.bass file |
239 |
– |
*/ |
240 |
– |
|
242 |
|
#include "argon.mdl" |
243 |
|
|
244 |
|
nComponents = 1; |
247 |
|
nMol = 108; |
248 |
|
} |
249 |
|
|
249 |
– |
/* |
250 |
– |
* The initial configuration is generated |
251 |
– |
* before the simulation is invoked. |
252 |
– |
*/ |
253 |
– |
|
250 |
|
initialConfig = "./argon.init"; |
251 |
|
|
252 |
|
forceField = "LJ"; |
255 |
|
Because this potential is calculated between all pairs, the force |
256 |
|
evaluation can become computationally expensive for large systems. To |
257 |
|
keep the pair evaluations to a manageable number, {\sc oopse} employs |
258 |
< |
a cut-off radius.\cite{allen87:csl} The cutoff radius is set to be |
258 |
> |
a cut-off radius.\cite{allen87:csl} The cutoff radius can either be |
259 |
> |
specified in the \texttt{.bass} file, or left as its default value of |
260 |
|
$2.5\sigma_{ii}$, where $\sigma_{ii}$ is the largest Lennard-Jones |
261 |
|
length parameter present in the simulation. Truncating the calculation |
262 |
|
at $r_{\text{cut}}$ introduces a discontinuity into the potential |
263 |
< |
energy. To offset this discontinuity, the energy value at |
264 |
< |
$r_{\text{cut}}$ is subtracted from the potential. This causes the |
265 |
< |
potential to go to zero smoothly at the cut-off radius. |
263 |
> |
energy and the force. To offset this discontinuity in the potential, |
264 |
> |
the energy value at $r_{\text{cut}}$ is subtracted from the |
265 |
> |
potential. This causes the potential to go to zero smoothly at the |
266 |
> |
cut-off radius, and preserves conservation of energy in integrating |
267 |
> |
the equations of motion. |
268 |
|
|
269 |
|
Interactions between dissimilar particles requires the generation of |
270 |
|
cross term parameters for $\sigma$ and $\epsilon$. These are |
280 |
|
\label{eq:epsilonMix} |
281 |
|
\end{equation} |
282 |
|
|
284 |
– |
|
285 |
– |
|
283 |
|
\subsection{\label{oopseSec:DUFF}Dipolar Unified-Atom Force Field} |
284 |
|
|
285 |
|
The dipolar unified-atom force field ({\sc duff}) was developed to |
293 |
|
charges. Charge-neutral distributions were replaced with dipoles, |
294 |
|
while most atoms and groups of atoms were reduced to Lennard-Jones |
295 |
|
interaction sites. This simplification cuts the length scale of long |
296 |
< |
range interactions from $\frac{1}{r}$ to $\frac{1}{r^3}$, allowing us |
297 |
< |
to avoid the computationally expensive Ewald sum. Instead, we can use |
298 |
< |
neighbor-lists, reaction field, and cutoff radii for the dipolar |
299 |
< |
interactions. |
296 |
> |
range interactions from $\frac{1}{r}$ to $\frac{1}{r^3}$, and allows |
297 |
> |
us to avoid the computationally expensive Ewald sum. Instead, we can |
298 |
> |
use neighbor-lists and cutoff radii for the dipolar interactions, or |
299 |
> |
include a reaction field to mimic larger range interactions. |
300 |
|
|
301 |
|
As an example, lipid head-groups in {\sc duff} are represented as |
302 |
< |
point dipole interaction sites. By placing a dipole of 20.6~Debye at |
303 |
< |
the head group center of mass, our model mimics the head group of |
304 |
< |
phosphatidylcholine.\cite{Cevc87} Additionally, a large Lennard-Jones |
305 |
< |
site is located at the pseudoatom's center of mass. The model is |
306 |
< |
illustrated by the dark grey atom in Fig.~\ref{oopseFig:lipidModel}. The |
307 |
< |
water model we use to complement the dipoles of the lipids is our |
308 |
< |
reparameterization of the soft sticky dipole (SSD) model of Ichiye |
302 |
> |
point dipole interaction sites. By placing a dipole at the head group |
303 |
> |
center of mass, our model mimics the charge separation found in common |
304 |
> |
phospholipids such as phosphatidylcholine.\cite{Cevc87} Additionally, |
305 |
> |
a large Lennard-Jones site is located at the pseudoatom's center of |
306 |
> |
mass. The model is illustrated by the red atom in |
307 |
> |
Fig.~\ref{oopseFig:lipidModel}. The water model we use to complement |
308 |
> |
the dipoles of the lipids is our reparameterization of the soft sticky |
309 |
> |
dipole (SSD) model of Ichiye |
310 |
|
\emph{et al.}\cite{liu96:new_model} |
311 |
|
|
312 |
|
\begin{figure} |
313 |
|
\centering |
314 |
|
\includegraphics[width=\linewidth]{lipidModel.eps} |
315 |
< |
\caption{A representation of the lipid model. $\phi$ is the torsion angle, $\theta$ % |
315 |
> |
\caption[A representation of a lipid model in {\sc duff}]{A representation of the lipid model. $\phi$ is the torsion angle, $\theta$ % |
316 |
|
is the bend angle, $\mu$ is the dipole moment of the head group, and n |
317 |
|
is the chain length.} |
318 |
|
\label{oopseFig:lipidModel} |
326 |
|
equilibria using Gibbs ensemble Monte Carlo simulation |
327 |
|
techniques.\cite{Siepmann1998} One of the advantages of TraPPE is that |
328 |
|
it generalizes the types of atoms in an alkyl chain to keep the number |
329 |
< |
of pseudoatoms to a minimum; the parameters for an atom such as |
329 |
> |
of pseudoatoms to a minimum; the parameters for a unified atom such as |
330 |
|
$\text{CH}_2$ do not change depending on what species are bonded to |
331 |
|
it. |
332 |
|
|
333 |
|
TraPPE also constrains all bonds to be of fixed length. Typically, |
334 |
|
bond vibrations are the fastest motions in a molecular dynamic |
335 |
|
simulation. Small time steps between force evaluations must be used to |
336 |
< |
ensure adequate sampling of the bond potential to ensure conservation |
337 |
< |
of energy. By constraining the bond lengths, larger time steps may be |
338 |
< |
used when integrating the equations of motion. A simulation using {\sc |
339 |
< |
duff} is illustrated in Scheme \ref{sch:DUFF}. |
336 |
> |
ensure adequate energy conservation in the bond degrees of freedom. By |
337 |
> |
constraining the bond lengths, larger time steps may be used when |
338 |
> |
integrating the equations of motion. A simulation using {\sc duff} is |
339 |
> |
illustrated in Scheme \ref{sch:DUFF}. |
340 |
|
|
341 |
|
\begin{lstlisting}[float,caption={[Invocation of {\sc duff}]Sample \texttt{.bass} file showing a simulation utilizing {\sc duff}},label={sch:DUFF}] |
342 |
|
|
405 |
|
+ c_3[1 + \cos(3\phi)] |
406 |
|
\label{eq:origTorsionPot} |
407 |
|
\end{equation} |
408 |
< |
Here $\phi$ is the angle defined by four bonded neighbors $i$, |
411 |
< |
$j$, $k$, and $l$ (again, see Fig.~\ref{oopseFig:lipidModel}). For |
412 |
< |
computational efficiency, the torsion potential has been recast after |
413 |
< |
the method of {\sc charmm},\cite{Brooks83} in which the angle series is |
414 |
< |
converted to a power series of the form: |
408 |
> |
Where: |
409 |
|
\begin{equation} |
410 |
+ |
\cos\phi = (\hat{\mathbf{r}}_{ij} \times \hat{\mathbf{r}}_{jk}) \cdot |
411 |
+ |
(\hat{\mathbf{r}}_{jk} \times \hat{\mathbf{r}}_{kl}) |
412 |
+ |
\label{eq:torsPhi} |
413 |
+ |
\end{equation} |
414 |
+ |
Here, $\hat{\mathbf{r}}_{\alpha\beta}$ are the set of unit bond |
415 |
+ |
vectors between atoms $i$, $j$, $k$, and $l$. For computational |
416 |
+ |
efficiency, the torsion potential has been recast after the method of |
417 |
+ |
{\sc charmm},\cite{Brooks83} in which the angle series is converted to |
418 |
+ |
a power series of the form: |
419 |
+ |
\begin{equation} |
420 |
|
V_{\text{torsion}}(\phi) = |
421 |
|
k_3 \cos^3 \phi + k_2 \cos^2 \phi + k_1 \cos \phi + k_0 |
422 |
|
\label{eq:torsionPot} |
456 |
|
\boldsymbol{\Omega}_{j}) = \frac{|\mu_i||\mu_j|}{4\pi\epsilon_{0}r_{ij}^{3}} \biggl[ |
457 |
|
\boldsymbol{\hat{u}}_{i} \cdot \boldsymbol{\hat{u}}_{j} |
458 |
|
- |
459 |
< |
\frac{3(\boldsymbol{\hat{u}}_i \cdot \mathbf{r}_{ij}) % |
460 |
< |
(\boldsymbol{\hat{u}}_j \cdot \mathbf{r}_{ij}) } |
457 |
< |
{r^{2}_{ij}} \biggr] |
459 |
> |
3(\boldsymbol{\hat{u}}_i \cdot \hat{\mathbf{r}}_{ij}) % |
460 |
> |
(\boldsymbol{\hat{u}}_j \cdot \hat{\mathbf{r}}_{ij}) \biggr] |
461 |
|
\label{eq:dipolePot} |
462 |
|
\end{equation} |
463 |
|
Here $\mathbf{r}_{ij}$ is the vector starting at atom $i$ pointing |
469 |
|
unit vector pointing along $\mathbf{r}_{ij}$ |
470 |
|
($\boldsymbol{\hat{r}}_{ij}=\mathbf{r}_{ij}/|\mathbf{r}_{ij}|$). |
471 |
|
|
472 |
+ |
To improve computational efficiency of the dipole-dipole interactions, |
473 |
+ |
{\sc oopse} employs an electrostatic cutoff radius. This parameter can |
474 |
+ |
be set in the \texttt{.bass} file, and controls the length scale over |
475 |
+ |
which dipole interactions are felt. To compensate for the |
476 |
+ |
discontinuity in the potential and the forces at the cutoff radius, we |
477 |
+ |
have implemented a switching function to smoothly scale the |
478 |
+ |
dipole-dipole interaction at the cutoff. |
479 |
+ |
\begin{equation} |
480 |
+ |
S(r_{ij}) = |
481 |
+ |
\begin{cases} |
482 |
+ |
1 & \text{if $r_{ij} \le r_t$},\\ |
483 |
+ |
\frac{(r_{\text{cut}} + 2r_{ij} - 3r_t)(r_{\text{cut}} - r_{ij})^2} |
484 |
+ |
{(r_{\text{cut}} - r_t)^2} |
485 |
+ |
& \text{if $r_t < r_{ij} \le r_{\text{cut}}$}, \\ |
486 |
+ |
0 & \text{if $r_{ij} > r_{\text{cut}}$.} |
487 |
+ |
\end{cases} |
488 |
+ |
\label{eq:dipoleSwitching} |
489 |
+ |
\end{equation} |
490 |
+ |
Here $S(r_{ij})$ scales the potential at a given $r_{ij}$, and $r_t$ |
491 |
+ |
is the taper radius some given thickness less than the electrostatic |
492 |
+ |
cutoff. The switching thickness can be set in the \texttt{.bass} file. |
493 |
|
|
494 |
< |
\subsubsection{\label{oopseSec:SSD}The {\sc duff} Water Models: SSD/E and SSD/RF} |
494 |
> |
\subsection{\label{oopseSec:SSD}The {\sc duff} Water Models: SSD/E and SSD/RF} |
495 |
|
|
496 |
|
In the interest of computational efficiency, the default solvent used |
497 |
|
by {\sc oopse} is the extended Soft Sticky Dipole (SSD/E) water |
551 |
|
can be found in the original SSD |
552 |
|
articles.\cite{liu96:new_model,liu96:monte_carlo,chandra99:ssd_md,Ichiye03} |
553 |
|
|
554 |
< |
Since SSD is a single-point {\it dipolar} model, the force |
554 |
> |
Since SSD/E is a single-point {\it dipolar} model, the force |
555 |
|
calculations are simplified significantly relative to the standard |
556 |
|
{\it charged} multi-point models. In the original Monte Carlo |
557 |
|
simulations using this model, Ichiye {\it et al.} reported that using |
558 |
|
SSD decreased computer time by a factor of 6-7 compared to other |
559 |
|
models.\cite{liu96:new_model} What is most impressive is that these savings |
560 |
|
did not come at the expense of accurate depiction of the liquid state |
561 |
< |
properties. Indeed, SSD maintains reasonable agreement with the Soper |
561 |
> |
properties. Indeed, SSD/E maintains reasonable agreement with the Head-Gordon |
562 |
|
diffraction data for the structural features of liquid |
563 |
< |
water.\cite{Soper86,liu96:new_model} Additionally, the dynamical properties |
564 |
< |
exhibited by SSD agree with experiment better than those of more |
563 |
> |
water.\cite{hura00,liu96:new_model} Additionally, the dynamical properties |
564 |
> |
exhibited by SSD/E agree with experiment better than those of more |
565 |
|
computationally expensive models (like TIP3P and |
566 |
|
SPC/E).\cite{chandra99:ssd_md} The combination of speed and accurate depiction |
567 |
< |
of solvent properties makes SSD a very attractive model for the |
567 |
> |
of solvent properties makes SSD/E a very attractive model for the |
568 |
|
simulation of large scale biochemical simulations. |
569 |
|
|
570 |
|
Recent constant pressure simulations revealed issues in the original |
575 |
|
of a reaction field long-range interaction correction is desired, it |
576 |
|
is recommended that the parameters be modified to those of the SSD/RF |
577 |
|
model. Solvent parameters can be easily modified in an accompanying |
578 |
< |
{\sc BASS} file as illustrated in the scheme below. A table of the |
578 |
> |
\texttt{.bass} file as illustrated in the scheme below. A table of the |
579 |
|
parameter values and the drawbacks and benefits of the different |
580 |
|
density corrected SSD models can be found in |
581 |
|
reference~\cite{Gezelter04}. |
595 |
|
forceField = "DUFF"; |
596 |
|
|
597 |
|
/* |
574 |
– |
* The reactionField flag toggles reaction |
575 |
– |
* field corrections. |
576 |
– |
*/ |
577 |
– |
|
578 |
– |
reactionField = false; // defaults to false |
579 |
– |
dielectric = 80.0; // dielectric for reaction field |
580 |
– |
|
581 |
– |
/* |
598 |
|
* The following two flags set the cutoff |
599 |
|
* radius for the electrostatic forces |
600 |
|
* as well as the skin thickness of the switching |
613 |
|
capacity to simulate metallic systems, including some that have |
614 |
|
parallel computational abilities\cite{plimpton93}. Potentials that |
615 |
|
describe bonding transition metal |
616 |
< |
systems\cite{Finnis84,Ercolessi88,Chen90,Qi99,Ercolessi02} have a |
616 |
> |
systems\cite{Finnis84,Ercolessi88,Chen90,Qi99,Ercolessi02} have an |
617 |
|
attractive interaction which models ``Embedding'' |
618 |
|
a positively charged metal ion in the electron density due to the |
619 |
|
free valance ``sea'' of electrons created by the surrounding atoms in |
620 |
< |
the system. A mostly repulsive pairwise part of the potential |
620 |
> |
the system. A mostly-repulsive pairwise part of the potential |
621 |
|
describes the interaction of the positively charged metal core ions |
622 |
|
with one another. A particular potential description called the |
623 |
|
Embedded Atom Method\cite{Daw84,FBD86,johnson89,Lu97}({\sc eam}) that has |
624 |
|
particularly wide adoption has been selected for inclusion in {\sc oopse}. A |
625 |
< |
good review of {\sc eam} and other metallic potential formulations was done |
625 |
> |
good review of {\sc eam} and other metallic potential formulations was written |
626 |
|
by Voter.\cite{voter} |
627 |
|
|
628 |
|
The {\sc eam} potential has the form: |
630 |
|
V & = & \sum_{i} F_{i}\left[\rho_{i}\right] + \sum_{i} \sum_{j \neq i} |
631 |
|
\phi_{ij}({\bf r}_{ij}) \\ |
632 |
|
\rho_{i} & = & \sum_{j \neq i} f_{j}({\bf r}_{ij}) |
633 |
< |
\end{eqnarray}S |
634 |
< |
|
635 |
< |
where $F_{i} $ is the embedding function that equates the energy required to embed a |
636 |
< |
positively-charged core ion $i$ into a linear superposition of |
637 |
< |
spherically averaged atomic electron densities given by |
638 |
< |
$\rho_{i}$. $\phi_{ij}$ is a primarily repulsive pairwise interaction |
639 |
< |
between atoms $i$ and $j$. In the original formulation of |
640 |
< |
{\sc eam}\cite{Daw84}, $\phi_{ij}$ was an entirely repulsive term, however |
641 |
< |
in later refinements to EAM have shown that non-uniqueness between $F$ |
642 |
< |
and $\phi$ allow for more general forms for $\phi$.\cite{Daw89} |
643 |
< |
There is a cutoff distance, $r_{cut}$, which limits the |
628 |
< |
summations in the {\sc eam} equation to the few dozen atoms |
633 |
> |
\end{eqnarray} |
634 |
> |
where $F_{i} $ is the embedding function that equates the energy |
635 |
> |
required to embed a positively-charged core ion $i$ into a linear |
636 |
> |
superposition of spherically averaged atomic electron densities given |
637 |
> |
by $\rho_{i}$. $\phi_{ij}$ is a primarily repulsive pairwise |
638 |
> |
interaction between atoms $i$ and $j$. In the original formulation of |
639 |
> |
{\sc eam}\cite{Daw84}, $\phi_{ij}$ was an entirely repulsive term, |
640 |
> |
however in later refinements to {\sc eam} have shown that non-uniqueness |
641 |
> |
between $F$ and $\phi$ allow for more general forms for |
642 |
> |
$\phi$.\cite{Daw89} There is a cutoff distance, $r_{cut}$, which |
643 |
> |
limits the summations in the {\sc eam} equation to the few dozen atoms |
644 |
|
surrounding atom $i$ for both the density $\rho$ and pairwise $\phi$ |
645 |
< |
interactions. Foiles et al. fit EAM potentials for fcc metals Cu, Ag, Au, Ni, Pd, Pt and alloys of these metals\cite{FBD86}. These potential fits are in the DYNAMO 86 format and are included with {\sc oopse}. |
645 |
> |
interactions. Foiles \emph{et al}.~fit {\sc eam} potentials for the fcc |
646 |
> |
metals Cu, Ag, Au, Ni, Pd, Pt and alloys of these metals.\cite{FBD86} |
647 |
> |
These fits, are included in {\sc oopse}. |
648 |
|
|
632 |
– |
|
649 |
|
\subsection{\label{oopseSec:pbc}Periodic Boundary Conditions} |
650 |
|
|
651 |
|
\newcommand{\roundme}{\operatorname{round}} |
652 |
|
|
653 |
< |
\textit{Periodic boundary conditions} are widely used to simulate truly |
654 |
< |
macroscopic systems with a relatively small number of particles. The |
655 |
< |
simulation box is replicated throughout space to form an infinite lattice. |
656 |
< |
During the simulation, when a particle moves in the primary cell, its image in |
657 |
< |
other boxes move in exactly the same direction with exactly the same |
658 |
< |
orientation.Thus, as a particle leaves the primary cell, one of its images |
659 |
< |
will enter through the opposite face.If the simulation box is large enough to |
660 |
< |
avoid \textquotedblleft feeling\textquotedblright\ the symmetries of the |
661 |
< |
periodic lattice, surface effects can be ignored. Cubic, orthorhombic and |
662 |
< |
parallelepiped are the available periodic cells In OOPSE. We use a matrix to |
663 |
< |
describe the property of the simulation box. Therefore, both the size and |
648 |
< |
shape of the simulation box can be changed during the simulation. The |
649 |
< |
transformation from box space vector $\mathbf{s}$ to its corresponding real |
650 |
< |
space vector $\mathbf{r}$ is defined by |
653 |
> |
\textit{Periodic boundary conditions} are widely used to simulate bulk properties with a relatively small number of particles. The |
654 |
> |
simulation box is replicated throughout space to form an infinite |
655 |
> |
lattice. During the simulation, when a particle moves in the primary |
656 |
> |
cell, its image in other cells move in exactly the same direction with |
657 |
> |
exactly the same orientation. Thus, as a particle leaves the primary |
658 |
> |
cell, one of its images will enter through the opposite face. If the |
659 |
> |
simulation box is large enough to avoid ``feeling'' the symmetries of |
660 |
> |
the periodic lattice, surface effects can be ignored. The available |
661 |
> |
periodic cells in OOPSE are cubic, orthorhombic and parallelepiped. We |
662 |
> |
use a $3 \times 3$ matrix, $\mathsf{H}$, to describe the shape and |
663 |
> |
size of the simulation box. $\mathsf{H}$ is defined: |
664 |
|
\begin{equation} |
665 |
< |
\mathbf{r}=\underline{\mathbf{H}}\cdot\mathbf{s}% |
665 |
> |
\mathsf{H} = ( \mathbf{h}_x, \mathbf{h}_y, \mathbf{h}_z ) |
666 |
|
\end{equation} |
667 |
+ |
Where $\mathbf{h}_j$ is the column vector of the $j$th axis of the |
668 |
+ |
box. During the course of the simulation both the size and shape of |
669 |
+ |
the box can be changed to allow volume fluctuations when constraining |
670 |
+ |
the pressure. |
671 |
|
|
672 |
< |
|
673 |
< |
where $H=(h_{x},h_{y},h_{z})$ is a transformation matrix made up of the three |
674 |
< |
box axis vectors. $h_{x},h_{y}$ and $h_{z}$ represent the three sides of the |
675 |
< |
simulation box respectively. |
676 |
< |
|
677 |
< |
To find the minimum image of a vector $\mathbf{r}$, we convert the real vector |
678 |
< |
to its corresponding vector in box space first, \bigskip% |
679 |
< |
\begin{equation} |
680 |
< |
\mathbf{s}=\underline{\mathbf{H}}^{-1}\cdot\mathbf{r}% |
681 |
< |
\end{equation} |
682 |
< |
And then, each element of $\mathbf{s}$ is wrapped to lie between -0.5 to 0.5, |
672 |
> |
A real space vector, $\mathbf{r}$ can be transformed in to a box space |
673 |
> |
vector, $\mathbf{s}$, and back through the following transformations: |
674 |
> |
\begin{align} |
675 |
> |
\mathbf{s} &= \mathsf{H}^{-1} \mathbf{r} \\ |
676 |
> |
\mathbf{r} &= \mathsf{H} \mathbf{s} |
677 |
> |
\end{align} |
678 |
> |
The vector $\mathbf{s}$ is now a vector expressed as the number of box |
679 |
> |
lengths in the $\mathbf{h}_x$, $\mathbf{h}_y$, and $\mathbf{h}_z$ |
680 |
> |
directions. To find the minimum image of a vector $\mathbf{r}$, we |
681 |
> |
first convert it to its corresponding vector in box space, and then, |
682 |
> |
cast each element to lie on the in the range $[-0.5,0.5]$: |
683 |
|
\begin{equation} |
684 |
|
s_{i}^{\prime}=s_{i}-\roundme(s_{i}) |
685 |
|
\end{equation} |
686 |
< |
where |
687 |
< |
|
671 |
< |
% |
672 |
< |
|
686 |
> |
Where $s_i$ is the $i$th element of $\mathbf{s}$, and |
687 |
> |
$\roundme(s_i)$is given by |
688 |
|
\begin{equation} |
689 |
< |
\roundme(x)=\left\{ |
690 |
< |
\begin{array}{cc}% |
691 |
< |
\lfloor{x+0.5}\rfloor & \text{if \ }x\geqslant 0 \\ |
692 |
< |
\lceil{x-0.5}\rceil & \text{otherwise}% |
693 |
< |
\end{array} |
679 |
< |
\right. |
689 |
> |
\roundme(x) = |
690 |
> |
\begin{cases} |
691 |
> |
\lfloor x+0.5 \rfloor & \text{if $x \ge 0$} \\ |
692 |
> |
\lceil x-0.5 \rceil & \text{if $x < 0$ } |
693 |
> |
\end{cases} |
694 |
|
\end{equation} |
695 |
+ |
Here $\lfloor x \rfloor$ is the floor operator, and gives the largest |
696 |
+ |
integer value that is not greater than $x$, and $\lceil x \rceil$ is |
697 |
+ |
the ceiling operator, and gives the smallest integer that is not less |
698 |
+ |
than $x$. For example, $\roundme(3.6)=4$, $\roundme(3.1)=3$, |
699 |
+ |
$\roundme(-3.6)=-4$, $\roundme(-3.1)=-3$. |
700 |
|
|
682 |
– |
|
683 |
– |
For example, $\roundme(3.6)=4$,$\roundme(3.1)=3$, $\roundme(-3.6)=-4$, $\roundme(-3.1)=-3$. |
684 |
– |
|
701 |
|
Finally, we obtain the minimum image coordinates $\mathbf{r}^{\prime}$ by |
702 |
< |
transforming back to real space,% |
687 |
< |
|
702 |
> |
transforming back to real space, |
703 |
|
\begin{equation} |
704 |
< |
\mathbf{r}^{\prime}=\underline{\mathbf{H}}^{-1}\cdot\mathbf{s}^{\prime}% |
704 |
> |
\mathbf{r}^{\prime}=\mathsf{H}^{-1}\mathbf{s}^{\prime}% |
705 |
|
\end{equation} |
706 |
+ |
In this way, particles are allowed to diffuse freely in $\mathbf{r}$, |
707 |
+ |
but their minimum images, $\mathbf{r}^{\prime}$ are used to compute |
708 |
+ |
the inter-atomic forces. |
709 |
|
|
710 |
|
|
711 |
|
\section{\label{oopseSec:IOfiles}Input and Output Files} |
712 |
|
|
713 |
|
\subsection{{\sc bass} and Model Files} |
714 |
|
|
715 |
< |
Every {\sc oopse} simulation begins with a {\sc bass} file. {\sc bass} |
716 |
< |
(\underline{B}izarre \underline{A}tom \underline{S}imulation |
717 |
< |
\underline{S}yntax) is a script syntax that is parsed by {\sc oopse} at |
718 |
< |
runtime. The {\sc bass} file allows for the user to completely describe the |
719 |
< |
system they are to simulate, as well as tailor {\sc oopse}'s behavior during |
720 |
< |
the simulation. {\sc bass} files are denoted with the extension |
715 |
> |
Every {\sc oopse} simulation begins with a Bizarre Atom Simulation |
716 |
> |
Syntax ({\sc bass}) file. {\sc bass} is a script syntax that is parsed |
717 |
> |
by {\sc oopse} at runtime. The {\sc bass} file allows for the user to |
718 |
> |
completely describe the system they wish to simulate, as well as tailor |
719 |
> |
{\sc oopse}'s behavior during the simulation. {\sc bass} files are |
720 |
> |
denoted with the extension |
721 |
|
\texttt{.bass}, an example file is shown in |
722 |
< |
Fig.~\ref{fig:bassExample}. |
722 |
> |
Scheme~\ref{sch:bassExample}. |
723 |
|
|
724 |
< |
\begin{figure} |
707 |
< |
\centering |
708 |
< |
\framebox[\linewidth]{\rule{0cm}{0.75\linewidth}I'm a {\sc bass} file!} |
709 |
< |
\caption{Here is an example \texttt{.bass} file} |
710 |
< |
\label{fig:bassExample} |
711 |
< |
\end{figure} |
724 |
> |
\begin{lstlisting}[float,caption={[An example of a complete {\sc bass} file] An example showing a complete {\sc bass} file.},label={sch:bassExample}] |
725 |
|
|
726 |
+ |
molecule{ |
727 |
+ |
name = "Ar"; |
728 |
+ |
nAtoms = 1; |
729 |
+ |
atom[0]{ |
730 |
+ |
type="Ar"; |
731 |
+ |
position( 0.0, 0.0, 0.0 ); |
732 |
+ |
} |
733 |
+ |
} |
734 |
+ |
|
735 |
+ |
nComponents = 1; |
736 |
+ |
component{ |
737 |
+ |
type = "Ar"; |
738 |
+ |
nMol = 108; |
739 |
+ |
} |
740 |
+ |
|
741 |
+ |
initialConfig = "./argon.init"; |
742 |
+ |
|
743 |
+ |
forceField = "LJ"; |
744 |
+ |
ensemble = "NVE"; // specify the simulation ensemble |
745 |
+ |
dt = 1.0; // the time step for integration |
746 |
+ |
runTime = 1e3; // the total simulation run time |
747 |
+ |
sampleTime = 100; // trajectory file frequency |
748 |
+ |
statusTime = 50; // statistics file frequency |
749 |
+ |
|
750 |
+ |
\end{lstlisting} |
751 |
+ |
|
752 |
|
Within the \texttt{.bass} file it is necessary to provide a complete |
753 |
|
description of the molecule before it is actually placed in the |
754 |
< |
simulation. The {\sc bass} syntax was originally developed with this goal in |
755 |
< |
mind, and allows for the specification of all the atoms in a molecular |
756 |
< |
prototype, as well as any bonds, bends, or torsions. These |
754 |
> |
simulation. The {\sc bass} syntax was originally developed with this |
755 |
> |
goal in mind, and allows for the specification of all the atoms in a |
756 |
> |
molecular prototype, as well as any bonds, bends, or torsions. These |
757 |
|
descriptions can become lengthy for complex molecules, and it would be |
758 |
< |
inconvenient to duplicate the simulation at the beginning of each {\sc bass} |
759 |
< |
script. Addressing this issue {\sc bass} allows for the inclusion of model |
760 |
< |
files at the top of a \texttt{.bass} file. These model files, denoted |
761 |
< |
with the \texttt{.mdl} extension, allow the user to describe a |
762 |
< |
molecular prototype once, then simply include it into each simulation |
763 |
< |
containing that molecule. |
758 |
> |
inconvenient to duplicate the simulation at the beginning of each {\sc |
759 |
> |
bass} script. Addressing this issue {\sc bass} allows for the |
760 |
> |
inclusion of model files at the top of a \texttt{.bass} file. These |
761 |
> |
model files, denoted with the \texttt{.mdl} extension, allow the user |
762 |
> |
to describe a molecular prototype once, then simply include it into |
763 |
> |
each simulation containing that molecule. Returning to the example in |
764 |
> |
Scheme~\ref{sch:bassExample}, the \texttt{.mdl} file's contents would |
765 |
> |
be Scheme~\ref{sch:mdlExample}, and the new \texttt{.bass} file would |
766 |
> |
become Scheme~\ref{sch:bassExPrime}. |
767 |
|
|
768 |
+ |
\begin{lstlisting}[float,caption={An example \texttt{.mdl} file.},label={sch:mdlExample}] |
769 |
+ |
|
770 |
+ |
molecule{ |
771 |
+ |
name = "Ar"; |
772 |
+ |
nAtoms = 1; |
773 |
+ |
atom[0]{ |
774 |
+ |
type="Ar"; |
775 |
+ |
position( 0.0, 0.0, 0.0 ); |
776 |
+ |
} |
777 |
+ |
} |
778 |
+ |
|
779 |
+ |
\end{lstlisting} |
780 |
+ |
|
781 |
+ |
\begin{lstlisting}[float,caption={Revised {\sc bass} example.},label={sch:bassExPrime}] |
782 |
+ |
|
783 |
+ |
#include "argon.mdl" |
784 |
+ |
|
785 |
+ |
nComponents = 1; |
786 |
+ |
component{ |
787 |
+ |
type = "Ar"; |
788 |
+ |
nMol = 108; |
789 |
+ |
} |
790 |
+ |
|
791 |
+ |
initialConfig = "./argon.init"; |
792 |
+ |
|
793 |
+ |
forceField = "LJ"; |
794 |
+ |
ensemble = "NVE"; |
795 |
+ |
dt = 1.0; |
796 |
+ |
runTime = 1e3; |
797 |
+ |
sampleTime = 100; |
798 |
+ |
statusTime = 50; |
799 |
+ |
|
800 |
+ |
\end{lstlisting} |
801 |
+ |
|
802 |
|
\subsection{\label{oopseSec:coordFiles}Coordinate Files} |
803 |
|
|
804 |
|
The standard format for storage of a systems coordinates is a modified |
805 |
|
xyz-file syntax, the exact details of which can be seen in |
806 |
< |
App.~\ref{appCoordFormat}. As all bonding and molecular information is |
807 |
< |
stored in the \texttt{.bass} and \texttt{.mdl} files, the coordinate |
808 |
< |
files are simply the complete set of coordinates for each atom at a |
809 |
< |
given simulation time. |
806 |
> |
Scheme~\ref{sch:dumpFormat}. As all bonding and molecular information |
807 |
> |
is stored in the \texttt{.bass} and \texttt{.mdl} files, the |
808 |
> |
coordinate files are simply the complete set of coordinates for each |
809 |
> |
atom at a given simulation time. One important note, although the |
810 |
> |
simulation propagates the complete rotation matrix, directional |
811 |
> |
entities are written out using quanternions, to save space in the |
812 |
> |
output files. |
813 |
|
|
814 |
< |
There are three major files used by {\sc oopse} written in the coordinate |
815 |
< |
format, they are as follows: the initialization file, the simulation |
816 |
< |
trajectory file, and the final coordinates of the simulation. The |
817 |
< |
initialization file is necessary for {\sc oopse} to start the simulation |
818 |
< |
with the proper coordinates. It is typically denoted with the |
819 |
< |
extension \texttt{.init}. The trajectory file is created at the |
820 |
< |
beginning of the simulation, and is used to store snapshots of the |
821 |
< |
simulation at regular intervals. The first frame is a duplication of |
822 |
< |
the \texttt{.init} file, and each subsequent frame is appended to the |
823 |
< |
file at an interval specified in the \texttt{.bass} file. The |
824 |
< |
trajectory file is given the extension \texttt{.dump}. The final |
825 |
< |
coordinate file is the end of run or \texttt{.eor} file. The |
814 |
> |
\begin{lstlisting}[float,caption={[The format of the coordinate files]Shows the format of the coordinate files. The fist line is the number of atoms. The second line begins with the time stamp followed by the three $\mathsf{H}$ column vectors. It is important to note, that for extended system ensembles, additional information pertinent to the integrators may be stored on this line as well.. The next lines are the atomic coordinates for all atoms in the system. First is the name followed by position, velocity, quanternions, and lastly angular velocities.},label=sch:dumpFormat] |
815 |
> |
|
816 |
> |
nAtoms |
817 |
> |
time; Hxx Hyx Hzx; Hxy Hyy Hzy; Hxz Hyz Hzz; |
818 |
> |
Name1 x y z vx vy vz q0 q1 q2 q3 jx jy jz |
819 |
> |
Name2 x y z vx vy vz q0 q1 q2 q3 jx jy jz |
820 |
> |
etc... |
821 |
> |
|
822 |
> |
\end{lstlisting} |
823 |
> |
|
824 |
> |
|
825 |
> |
There are three major files used by {\sc oopse} written in the |
826 |
> |
coordinate format, they are as follows: the initialization file |
827 |
> |
(\texttt{.init}), the simulation trajectory file (\texttt{.dump}), and |
828 |
> |
the final coordinates of the simulation. The initialization file is |
829 |
> |
necessary for {\sc oopse} to start the simulation with the proper |
830 |
> |
coordinates, and is generated before the simulation run. The |
831 |
> |
trajectory file is created at the beginning of the simulation, and is |
832 |
> |
used to store snapshots of the simulation at regular intervals. The |
833 |
> |
first frame is a duplication of the |
834 |
> |
\texttt{.init} file, and each subsequent frame is appended to the file |
835 |
> |
at an interval specified in the \texttt{.bass} file with the |
836 |
> |
\texttt{sampleTime} flag. The final coordinate file is the end of run file. The |
837 |
|
\texttt{.eor} file stores the final configuration of the system for a |
838 |
|
given simulation. The file is updated at the same time as the |
839 |
< |
\texttt{.dump} file. However, it only contains the most recent |
839 |
> |
\texttt{.dump} file, however, it only contains the most recent |
840 |
|
frame. In this way, an \texttt{.eor} file may be used as the |
841 |
< |
initialization file to a second simulation in order to continue or |
842 |
< |
recover the previous simulation. |
841 |
> |
initialization file to a second simulation in order to continue a |
842 |
> |
simulation or recover one from a processor that has crashed during the |
843 |
> |
course of the run. |
844 |
|
|
845 |
|
\subsection{\label{oopseSec:initCoords}Generation of Initial Coordinates} |
846 |
|
|
847 |
< |
As was stated in Sec.~\ref{oopseSec:coordFiles}, an initialization file |
848 |
< |
is needed to provide the starting coordinates for a simulation. The |
849 |
< |
{\sc oopse} package provides a program called \texttt{sysBuilder} to aid in |
850 |
< |
the creation of the \texttt{.init} file. \texttt{sysBuilder} is {\sc bass} |
851 |
< |
aware, and will recognize arguments and parameters in the |
852 |
< |
\texttt{.bass} file that would otherwise be ignored by the |
853 |
< |
simulation. The program itself is under continual development, and is |
763 |
< |
offered here as a helper tool only. |
847 |
> |
As was stated in Sec.~\ref{oopseSec:coordFiles}, an initialization |
848 |
> |
file is needed to provide the starting coordinates for a |
849 |
> |
simulation. The {\sc oopse} package provides several system building |
850 |
> |
programs to aid in the creation of the \texttt{.init} |
851 |
> |
file. The programs use {\sc bass}, and will recognize |
852 |
> |
arguments and parameters in the \texttt{.bass} file that would |
853 |
> |
otherwise be ignored by the simulation. |
854 |
|
|
855 |
|
\subsection{The Statistics File} |
856 |
|
|
857 |
< |
The last output file generated by {\sc oopse} is the statistics file. This |
858 |
< |
file records such statistical quantities as the instantaneous |
859 |
< |
temperature, volume, pressure, etc. It is written out with the |
860 |
< |
frequency specified in the \texttt{.bass} file. The file allows the |
861 |
< |
user to observe the system variables as a function of simulation time |
862 |
< |
while the simulation is in progress. One useful function the |
863 |
< |
statistics file serves is to monitor the conserved quantity of a given |
864 |
< |
simulation ensemble, this allows the user to observe the stability of |
865 |
< |
the integrator. The statistics file is denoted with the \texttt{.stat} |
866 |
< |
file extension. |
857 |
> |
The last output file generated by {\sc oopse} is the statistics |
858 |
> |
file. This file records such statistical quantities as the |
859 |
> |
instantaneous temperature, volume, pressure, etc. It is written out |
860 |
> |
with the frequency specified in the \texttt{.bass} file with the |
861 |
> |
\texttt{statusTime} keyword. The file allows the user to observe the |
862 |
> |
system variables as a function of simulation time while the simulation |
863 |
> |
is in progress. One useful function the statistics file serves is to |
864 |
> |
monitor the conserved quantity of a given simulation ensemble, this |
865 |
> |
allows the user to observe the stability of the integrator. The |
866 |
> |
statistics file is denoted with the \texttt{.stat} file extension. |
867 |
|
|
868 |
|
\section{\label{oopseSec:mechanics}Mechanics} |
869 |
|
|
871 |
|
|
872 |
|
Integration of the equations of motion was carried out using the |
873 |
|
symplectic splitting method proposed by Dullweber \emph{et |
874 |
< |
al.}.\cite{Dullweber1997} The reason for this integrator selection |
875 |
< |
deals with poor energy conservation of rigid body systems using |
876 |
< |
quaternions. While quaternions work well for orientational motion in |
877 |
< |
alternate ensembles, the microcanonical ensemble has a constant energy |
878 |
< |
requirement that is quite sensitive to errors in the equations of |
879 |
< |
motion. The original implementation of this code utilized quaternions |
880 |
< |
for rotational motion propagation; however, a detailed investigation |
881 |
< |
showed that they resulted in a steady drift in the total energy, |
882 |
< |
something that has been observed by others.\cite{Laird97} |
874 |
> |
al.}.\cite{Dullweber1997} The reason for the selection of this |
875 |
> |
integrator, is the poor energy conservation of rigid body systems |
876 |
> |
using quaternion dynamics. While quaternions work well for |
877 |
> |
orientational motion in alternate ensembles, the microcanonical |
878 |
> |
ensemble has a constant energy requirement that is quite sensitive to |
879 |
> |
errors in the equations of motion. The original implementation of {\sc |
880 |
> |
oopse} utilized quaternions for rotational motion propagation; |
881 |
> |
however, a detailed investigation showed that they resulted in a |
882 |
> |
steady drift in the total energy, something that has been observed by |
883 |
> |
others.\cite{Laird97} |
884 |
|
|
885 |
|
The key difference in the integration method proposed by Dullweber |
886 |
< |
\emph{et al.} is that the entire rotation matrix is propagated from |
887 |
< |
one time step to the next. In the past, this would not have been as |
888 |
< |
feasible a option, being that the rotation matrix for a single body is |
889 |
< |
nine elements long as opposed to 3 or 4 elements for Euler angles and |
890 |
< |
quaternions respectively. System memory has become much less of an |
891 |
< |
issue in recent times, and this has resulted in substantial benefits |
892 |
< |
in energy conservation. There is still the issue of 5 or 6 additional |
802 |
< |
elements for describing the orientation of each particle, which will |
803 |
< |
increase dump files substantially. Simply translating the rotation |
804 |
< |
matrix into its component Euler angles or quaternions for storage |
805 |
< |
purposes relieves this burden. |
886 |
> |
\emph{et al}.~({\sc dlm}) is that the entire rotation matrix is propagated from |
887 |
> |
one time step to the next. In the past, this would not have been a |
888 |
> |
feasible option, since the rotation matrix for a single body is nine |
889 |
> |
elements long as opposed to three or four elements for Euler angles |
890 |
> |
and quaternions respectively. System memory has become much less of an |
891 |
> |
issue in recent times, and the {\sc dlm} method has used memory in |
892 |
> |
exchange for substantial benefits in energy conservation. |
893 |
|
|
894 |
< |
The symplectic splitting method allows for Verlet style integration of |
895 |
< |
both linear and angular motion of rigid bodies. In the integration |
896 |
< |
method, the orientational propagation involves a sequence of matrix |
894 |
> |
The {\sc dlm} method allows for Verlet style integration of both |
895 |
> |
linear and angular motion of rigid bodies. In the integration method, |
896 |
> |
the orientational propagation involves a sequence of matrix |
897 |
|
evaluations to update the rotation matrix.\cite{Dullweber1997} These |
898 |
< |
matrix rotations end up being more costly computationally than the |
899 |
< |
simpler arithmetic quaternion propagation. With the same time step, a |
900 |
< |
1000 SSD particle simulation shows an average 7\% increase in |
901 |
< |
computation time using the symplectic step method in place of |
902 |
< |
quaternions. This cost is more than justified when comparing the |
903 |
< |
energy conservation of the two methods as illustrated in figure |
817 |
< |
\ref{timestep}. |
898 |
> |
matrix rotations are more costly computationally than the simpler |
899 |
> |
arithmetic quaternion propagation. With the same time step, a 1000 SSD |
900 |
> |
particle simulation shows an average 7\% increase in computation time |
901 |
> |
using the {\sc dlm} method in place of quaternions. This cost is more |
902 |
> |
than justified when comparing the energy conservation of the two |
903 |
> |
methods as illustrated in Fig.~\ref{timestep}. |
904 |
|
|
905 |
|
\begin{figure} |
906 |
|
\centering |
907 |
|
\includegraphics[width=\linewidth]{timeStep.eps} |
908 |
< |
\caption{Energy conservation using quaternion based integration versus |
909 |
< |
the symplectic step method proposed by Dullweber \emph{et al.} with |
908 |
> |
\caption[Energy conservation for quaternion versus {\sc dlm} dynamics]{Energy conservation using quaternion based integration versus |
909 |
> |
the {\sc dlm} method with |
910 |
|
increasing time step. For each time step, the dotted line is total |
911 |
< |
energy using the symplectic step integrator, and the solid line comes |
911 |
> |
energy using the {\sc dlm} integrator, and the solid line comes |
912 |
|
from the quaternion integrator. The larger time step plots are shifted |
913 |
|
up from the true energy baseline for clarity.} |
914 |
|
\label{timestep} |
915 |
|
\end{figure} |
916 |
|
|
917 |
< |
In figure \ref{timestep}, the resulting energy drift at various time |
918 |
< |
steps for both the symplectic step and quaternion integration schemes |
917 |
> |
In Fig.~\ref{timestep}, the resulting energy drift at various time |
918 |
> |
steps for both the {\sc dlm} and quaternion integration schemes |
919 |
|
is compared. All of the 1000 SSD particle simulations started with the |
920 |
|
same configuration, and the only difference was the method for |
921 |
|
handling rotational motion. At time steps of 0.1 and 0.5 fs, both |
922 |
|
methods for propagating particle rotation conserve energy fairly well, |
923 |
|
with the quaternion method showing a slight energy drift over time in |
924 |
|
the 0.5 fs time step simulation. At time steps of 1 and 2 fs, the |
925 |
< |
energy conservation benefits of the symplectic step method are clearly |
925 |
> |
energy conservation benefits of the {\sc dlm} method are clearly |
926 |
|
demonstrated. Thus, while maintaining the same degree of energy |
927 |
|
conservation, one can take considerably longer time steps, leading to |
928 |
|
an overall reduction in computation time. |
931 |
|
time steps up to three femtoseconds. A slight energy drift on the |
932 |
|
order of 0.012 kcal/mol per nanosecond was observed at a time step of |
933 |
|
four femtoseconds, and as expected, this drift increases dramatically |
934 |
< |
with increasing time step. To insure accuracy in the constant energy |
849 |
< |
simulations, time steps were set at 2 fs and kept at this value for |
850 |
< |
constant pressure simulations as well. |
934 |
> |
with increasing time step. |
935 |
|
|
936 |
|
|
937 |
|
\subsection{\label{sec:extended}Extended Systems for other Ensembles} |
940 |
|
{\sc oopse} implements a |
941 |
|
|
942 |
|
|
943 |
< |
\subsubsection{\label{oopseSec:noseHooverThermo}Nose-Hoover Thermostatting} |
943 |
> |
\subsection{\label{oopseSec:noseHooverThermo}Nose-Hoover Thermostatting} |
944 |
|
|
945 |
|
To mimic the effects of being in a constant temperature ({\sc nvt}) |
946 |
|
ensemble, {\sc oopse} uses the Nose-Hoover extended system |
967 |
|
to values of a few ps. Within a {\sc bass} file, $\tau_{T}$ could be |
968 |
|
set to 1 ps using the {\tt tauThermostat = 1000; } command. |
969 |
|
|
970 |
+ |
\subsection{\label{oopseSec:rattle}The {\sc rattle} Method for Bond |
971 |
+ |
Constraints} |
972 |
|
|
973 |
< |
\subsection{\label{oopseSec:zcons}Z-Constraint Method} |
973 |
> |
In order to satisfy the constraints of fixed bond lengths within {\sc |
974 |
> |
oopse}, we have implemented the {\sc rattle} algorithm of |
975 |
> |
Andersen.\cite{andersen83} The algorithm is a velocity verlet |
976 |
> |
formulation of the {\sc shake} method\cite{ryckaert77} of iteratively |
977 |
> |
solving the Lagrange multipliers of constraint. The system of lagrange |
978 |
> |
multipliers allows one to reformulate the equations of motion with |
979 |
> |
explicit constraint forces.\cite{fowles99:lagrange} |
980 |
|
|
981 |
< |
Based on fluctuation-dissipation theorem,\bigskip\ force auto-correlation |
982 |
< |
method was developed to investigate the dynamics of ions inside the ion |
983 |
< |
channels.\cite{Roux91} Time-dependent friction coefficient can be calculated |
984 |
< |
from the deviation of the instantaneous force from its mean force. |
981 |
> |
Consider a system described by coordinates $q_1$ and $q_2$ subject to an |
982 |
> |
equation of constraint: |
983 |
> |
\begin{equation} |
984 |
> |
\sigma(q_1, q_2,t) = 0 |
985 |
> |
\label{oopseEq:lm1} |
986 |
> |
\end{equation} |
987 |
> |
The Lagrange formulation of the equations of motion can be written: |
988 |
> |
\begin{equation} |
989 |
> |
\delta\int_{t_1}^{t_2}L\, dt = |
990 |
> |
\int_{t_1}^{t_2} \sum_i \biggl [ \frac{\partial L}{\partial q_i} |
991 |
> |
- \frac{d}{dt}\biggl(\frac{\partial L}{\partial \dot{q}_i} |
992 |
> |
\biggr ) \biggr] \delta q_i \, dt = 0 |
993 |
> |
\label{oopseEq:lm2} |
994 |
> |
\end{equation} |
995 |
> |
Here, $\delta q_i$ is not independent for each $q$, as $q_1$ and $q_2$ |
996 |
> |
are linked by $\sigma$. However, $\sigma$ is fixed at any given |
997 |
> |
instant of time, giving: |
998 |
> |
\begin{align} |
999 |
> |
\delta\sigma &= \biggl( \frac{\partial\sigma}{\partial q_1} \delta q_1 % |
1000 |
> |
+ \frac{\partial\sigma}{\partial q_2} \delta q_2 \biggr) = 0 \\ |
1001 |
> |
% |
1002 |
> |
\frac{\partial\sigma}{\partial q_1} \delta q_1 &= % |
1003 |
> |
- \frac{\partial\sigma}{\partial q_2} \delta q_2 \\ |
1004 |
> |
% |
1005 |
> |
\delta q_2 &= - \biggl(\frac{\partial\sigma}{\partial q_1} \bigg / % |
1006 |
> |
\frac{\partial\sigma}{\partial q_2} \biggr) \delta q_1 |
1007 |
> |
\end{align} |
1008 |
> |
Substituted back into Eq.~\ref{oopseEq:lm2}, |
1009 |
> |
\begin{equation} |
1010 |
> |
\int_{t_1}^{t_2}\biggl [ \biggl(\frac{\partial L}{\partial q_1} |
1011 |
> |
- \frac{d}{dt}\,\frac{\partial L}{\partial \dot{q}_1} |
1012 |
> |
\biggr) |
1013 |
> |
- \biggl( \frac{\partial L}{\partial q_1} |
1014 |
> |
- \frac{d}{dt}\,\frac{\partial L}{\partial \dot{q}_1} |
1015 |
> |
\biggr) \biggl(\frac{\partial\sigma}{\partial q_1} \bigg / % |
1016 |
> |
\frac{\partial\sigma}{\partial q_2} \biggr)\biggr] \delta q_1 \, dt = 0 |
1017 |
> |
\label{oopseEq:lm3} |
1018 |
> |
\end{equation} |
1019 |
> |
Leading to, |
1020 |
> |
\begin{equation} |
1021 |
> |
\frac{\biggl(\frac{\partial L}{\partial q_1} |
1022 |
> |
- \frac{d}{dt}\,\frac{\partial L}{\partial \dot{q}_1} |
1023 |
> |
\biggr)}{\frac{\partial\sigma}{\partial q_1}} = |
1024 |
> |
\frac{\biggl(\frac{\partial L}{\partial q_2} |
1025 |
> |
- \frac{d}{dt}\,\frac{\partial L}{\partial \dot{q}_2} |
1026 |
> |
\biggr)}{\frac{\partial\sigma}{\partial q_2}} |
1027 |
> |
\label{oopseEq:lm4} |
1028 |
> |
\end{equation} |
1029 |
> |
This relation can only be statisfied, if both are equal to a single |
1030 |
> |
function $-\lambda(t)$, |
1031 |
> |
\begin{align} |
1032 |
> |
\frac{\biggl(\frac{\partial L}{\partial q_1} |
1033 |
> |
- \frac{d}{dt}\,\frac{\partial L}{\partial \dot{q}_1} |
1034 |
> |
\biggr)}{\frac{\partial\sigma}{\partial q_1}} &= -\lambda(t) \\ |
1035 |
> |
% |
1036 |
> |
\frac{\partial L}{\partial q_1} |
1037 |
> |
- \frac{d}{dt}\,\frac{\partial L}{\partial \dot{q}_1} &= |
1038 |
> |
-\lambda(t)\,\frac{\partial\sigma}{\partial q_1} \\ |
1039 |
> |
% |
1040 |
> |
\frac{\partial L}{\partial q_1} |
1041 |
> |
- \frac{d}{dt}\,\frac{\partial L}{\partial \dot{q}_1} |
1042 |
> |
+ \mathcal{G}_i &= 0 |
1043 |
> |
\end{align} |
1044 |
> |
Where $\mathcal{G}_i$, the force of constraint on $i$, is: |
1045 |
> |
\begin{equation} |
1046 |
> |
\mathcal{G}_i = \lambda(t)\,\frac{\partial\sigma}{\partial q_1} |
1047 |
> |
\label{oopseEq:lm5} |
1048 |
> |
\end{equation} |
1049 |
|
|
1050 |
+ |
In a simulation, this would involve the solution of a set of $(m + n)$ |
1051 |
+ |
number of equations. Where $m$ is the number of constraints, and $n$ |
1052 |
+ |
is the number of constrained coordinates. In practice, this is not |
1053 |
+ |
done, as the matrix inversion necessary to solve the system of |
1054 |
+ |
equations would be very time consuming to solve. Additionally, the |
1055 |
+ |
numerical error in the solution of the set of $\lambda$'s would be |
1056 |
+ |
compounded by the error inherent in propagating by the Velocity Verlet |
1057 |
+ |
algorithm ($\Delta t^4$). The Verlet propagation error is negligible |
1058 |
+ |
in an unconstrained system, as one is interested in the statistics of |
1059 |
+ |
the run, and not that the run be numerically exact to the ``true'' |
1060 |
+ |
integration. This relates back to the ergodic hypothesis that a time |
1061 |
+ |
integral of a valid trajectory will still give the correct ensemble |
1062 |
+ |
average. However, in the case of constraints, if the equations of |
1063 |
+ |
motion leave the ``true'' trajectory, they are departing from the |
1064 |
+ |
constrained surface. The method that is used, is to iteratively solve |
1065 |
+ |
for $\lambda(t)$ at each time step. |
1066 |
+ |
|
1067 |
+ |
In {\sc rattle} the equations of motion are modified subject to the |
1068 |
+ |
following two constraints: |
1069 |
+ |
\begin{align} |
1070 |
+ |
\sigma_{ij}[\mathbf{r}(t)] \equiv |
1071 |
+ |
[ \mathbf{r}_i(t) - \mathbf{r}_j(t)]^2 - d_{ij}^2 &= 0 % |
1072 |
+ |
\label{oopseEq:c1} \\ |
1073 |
+ |
% |
1074 |
+ |
[\mathbf{\dot{r}}_i(t) - \mathbf{\dot{r}}_j(t)] \cdot |
1075 |
+ |
[\mathbf{r}_i(t) - \mathbf{r}_j(t)] &= 0 \label{oopseEq:c2} |
1076 |
+ |
\end{align} |
1077 |
+ |
Eq.~\ref{oopseEq:c1} is the set of bond constraints, where $d_{ij}$ is |
1078 |
+ |
the constrained distance between atom $i$ and |
1079 |
+ |
$j$. Eq.~\ref{oopseEq:c2} constrains the velocities of $i$ and $j$ to |
1080 |
+ |
be perpendicular to the bond vector, so that the bond can neither grow |
1081 |
+ |
nor shrink. The constrained dynamics equations become: |
1082 |
+ |
\begin{equation} |
1083 |
+ |
m_i \mathbf{\ddot{r}}_i = \mathbf{F}_i + \mathbf{\mathcal{G}}_i |
1084 |
+ |
\label{oopseEq:r1} |
1085 |
+ |
\end{equation} |
1086 |
+ |
Where,$\mathbf{\mathcal{G}}_i$ are the forces of constraint on $i$, |
1087 |
+ |
and are defined: |
1088 |
+ |
\begin{equation} |
1089 |
+ |
\mathbf{\mathcal{G}}_i = - \sum_j \lambda_{ij}(t)\,\nabla \sigma_{ij} |
1090 |
+ |
\label{oopseEq:r2} |
1091 |
+ |
\end{equation} |
1092 |
+ |
|
1093 |
+ |
In Velocity Verlet, if $\Delta t = h$, the propagation can be written: |
1094 |
+ |
\begin{align} |
1095 |
+ |
\mathbf{r}_i(t+h) &= |
1096 |
+ |
\mathbf{r}_i(t) + h\mathbf{\dot{r}}(t) + |
1097 |
+ |
\frac{h^2}{2m_i}\,\Bigl[ \mathbf{F}_i(t) + |
1098 |
+ |
\mathbf{\mathcal{G}}_{Ri}(t) \Bigr] \label{oopseEq:vv1} \\ |
1099 |
+ |
% |
1100 |
+ |
\mathbf{\dot{r}}_i(t+h) &= |
1101 |
+ |
\mathbf{\dot{r}}_i(t) + \frac{h}{2m_i} |
1102 |
+ |
\Bigl[ \mathbf{F}_i(t) + \mathbf{\mathcal{G}}_{Ri}(t) + |
1103 |
+ |
\mathbf{F}_i(t+h) + \mathbf{\mathcal{G}}_{Vi}(t+h) \Bigr] % |
1104 |
+ |
\label{oopseEq:vv2} |
1105 |
+ |
\end{align} |
1106 |
+ |
Where: |
1107 |
+ |
\begin{align} |
1108 |
+ |
\mathbf{\mathcal{G}}_{Ri}(t) &= |
1109 |
+ |
-2 \sum_j \lambda_{Rij}(t) \mathbf{r}_{ij}(t) \\ |
1110 |
+ |
% |
1111 |
+ |
\mathbf{\mathcal{G}}_{Vi}(t+h) &= |
1112 |
+ |
-2 \sum_j \lambda_{Vij}(t+h) \mathbf{r}(t+h) |
1113 |
+ |
\end{align} |
1114 |
+ |
Next, define: |
1115 |
+ |
\begin{align} |
1116 |
+ |
g_{ij} &= h \lambda_{Rij}(t) \\ |
1117 |
+ |
k_{ij} &= h \lambda_{Vij}(t+h) \\ |
1118 |
+ |
\mathbf{q}_i &= \mathbf{\dot{r}}_i(t) + \frac{h}{2m_i} \mathbf{F}_i(t) |
1119 |
+ |
- \frac{1}{m_i}\sum_j g_{ij}\mathbf{r}_{ij}(t) |
1120 |
+ |
\end{align} |
1121 |
+ |
Using these definitions, Eq.~\ref{oopseEq:vv1} and \ref{oopseEq:vv2} |
1122 |
+ |
can be rewritten as, |
1123 |
+ |
\begin{align} |
1124 |
+ |
\mathbf{r}_i(t+h) &= \mathbf{r}_i(t) + h \mathbf{q}_i \\ |
1125 |
+ |
% |
1126 |
+ |
\mathbf{\dot{r}}(t+h) &= \mathbf{q}_i + \frac{h}{2m_i}\mathbf{F}_i(t+h) |
1127 |
+ |
-\frac{1}{m_i}\sum_j k_{ij} \mathbf{r}_{ij}(t+h) |
1128 |
+ |
\end{align} |
1129 |
+ |
|
1130 |
+ |
To integrate the equations of motion, the {\sc rattle} algorithm first |
1131 |
+ |
solves for $\mathbf{r}(t+h)$. Let, |
1132 |
+ |
\begin{equation} |
1133 |
+ |
\mathbf{q}_i = \mathbf{\dot{r}}(t) + \frac{h}{2m_i}\mathbf{F}_i(t) |
1134 |
+ |
\end{equation} |
1135 |
+ |
Here $\mathbf{q}_i$ corresponds to an initial unconstrained move. Next |
1136 |
+ |
pick a constraint $j$, and let, |
1137 |
+ |
\begin{equation} |
1138 |
+ |
\mathbf{s} = \mathbf{r}_i(t) + h\mathbf{q}_i(t) |
1139 |
+ |
- \mathbf{r}_j(t) + h\mathbf{q}_j(t) |
1140 |
+ |
\label{oopseEq:ra1} |
1141 |
+ |
\end{equation} |
1142 |
+ |
If |
1143 |
+ |
\begin{equation} |
1144 |
+ |
\Big| |\mathbf{s}|^2 - d_{ij}^2 \Big| > \text{tolerance}, |
1145 |
+ |
\end{equation} |
1146 |
+ |
then the constraint is unsatisfied, and corrections are made to the |
1147 |
+ |
positions. First we define a test corrected configuration as, |
1148 |
+ |
\begin{align} |
1149 |
+ |
\mathbf{r}_i^T(t+h) = \mathbf{r}_i(t) + h\biggl[\mathbf{q}_i - |
1150 |
+ |
g_{ij}\,\frac{\mathbf{r}_{ij}(t)}{m_i} \biggr] \\ |
1151 |
+ |
% |
1152 |
+ |
\mathbf{r}_j^T(t+h) = \mathbf{r}_j(t) + h\biggl[\mathbf{q}_j + |
1153 |
+ |
g_{ij}\,\frac{\mathbf{r}_{ij}(t)}{m_j} \biggr] |
1154 |
+ |
\end{align} |
1155 |
+ |
And we chose $g_{ij}$ such that, $|\mathbf{r}_i^T - \mathbf{r}_j^T|^2 |
1156 |
+ |
= d_{ij}^2$. Solving the quadratic for $g_{ij}$ we obtain the |
1157 |
+ |
approximation, |
1158 |
+ |
\begin{equation} |
1159 |
+ |
g_{ij} = \frac{(s^2 - d^2)}{2h[\mathbf{s}\cdot\mathbf{r}_{ij}(t)] |
1160 |
+ |
(\frac{1}{m_i} + \frac{1}{m_j})} |
1161 |
+ |
\end{equation} |
1162 |
+ |
Although not an exact solution for $g_{ij}$, as this is an iterative |
1163 |
+ |
scheme overall, the eventual solution will converge. With a trial |
1164 |
+ |
$g_{ij}$, the new $\mathbf{q}$'s become, |
1165 |
+ |
\begin{align} |
1166 |
+ |
\mathbf{q}_i &= \mathbf{q}^{\text{old}}_i - g_{ij}\, |
1167 |
+ |
\frac{\mathbf{r}_{ij}(t)}{m_i} \\ |
1168 |
+ |
% |
1169 |
+ |
\mathbf{q}_j &= \mathbf{q}^{\text{old}}_j + g_{ij}\, |
1170 |
+ |
\frac{\mathbf{r}_{ij}(t)}{m_j} |
1171 |
+ |
\end{align} |
1172 |
+ |
The whole algorithm is then repeated from Eq.~\ref{oopseEq:ra1} until |
1173 |
+ |
all constraints are satisfied. |
1174 |
+ |
|
1175 |
+ |
The second step of {\sc rattle}, is to then update the velocities. The |
1176 |
+ |
step starts with, |
1177 |
+ |
\begin{equation} |
1178 |
+ |
\mathbf{\dot{r}}_i(t+h) = \mathbf{q}_i + \frac{h}{2m_i}\mathbf{F}_i(t+h) |
1179 |
+ |
\end{equation} |
1180 |
+ |
Next we pick a constraint $j$, and calculate the dot product $\ell$. |
1181 |
+ |
\begin{equation} |
1182 |
+ |
\ell = \mathbf{r}_{ij}(t+h) \cdot \mathbf{\dot{r}}_{ij}(t+h) |
1183 |
+ |
\label{oopseEq:rv1} |
1184 |
+ |
\end{equation} |
1185 |
+ |
Here if constraint Eq.~\ref{oopseEq:c2} holds, $\ell$ should be |
1186 |
+ |
zero. Therefore if $\ell$ is greater than some tolerance, then |
1187 |
+ |
corrections are made to the $i$ and $j$ velocities. |
1188 |
+ |
\begin{align} |
1189 |
+ |
\mathbf{\dot{r}}_i^T &= \mathbf{\dot{r}}_i(t+h) - k_{ij} |
1190 |
+ |
\frac{\mathbf{\dot{r}}_{ij}(t+h)}{m_i} \\ |
1191 |
|
% |
1192 |
+ |
\mathbf{\dot{r}}_j^T &= \mathbf{\dot{r}}_j(t+h) + k_{ij} |
1193 |
+ |
\frac{\mathbf{\dot{r}}_{ij}(t+h)}{m_j} |
1194 |
+ |
\end{align} |
1195 |
+ |
Like in the previous step, we select a value for $k_{ij}$ such that |
1196 |
+ |
$\ell$ is zero. |
1197 |
+ |
\begin{equation} |
1198 |
+ |
k_{ij} = \frac{\ell}{d^2_{ij}(\frac{1}{m_i} + \frac{1}{m_j})} |
1199 |
+ |
\end{equation} |
1200 |
+ |
The test velocities, $\mathbf{\dot{r}}^T_i$ and |
1201 |
+ |
$\mathbf{\dot{r}}^T_j$, then replace their respective velocities, and |
1202 |
+ |
the algorithm is iterated from Eq.~\ref{oopseEq:rv1} until all |
1203 |
+ |
constraints are satisfied. |
1204 |
|
|
1205 |
+ |
|
1206 |
+ |
\subsection{\label{oopseSec:zcons}Z-Constraint Method} |
1207 |
+ |
|
1208 |
+ |
Based on the fluctuation-dissipation theorem, a force auto-correlation |
1209 |
+ |
method was developed by Roux and Karplus to investigate the dynamics |
1210 |
+ |
of ions inside ion channels.\cite{Roux91} The time-dependent friction |
1211 |
+ |
coefficient can be calculated from the deviation of the instantaneous |
1212 |
+ |
force from its mean force. |
1213 |
|
\begin{equation} |
1214 |
|
\xi(z,t)=\langle\delta F(z,t)\delta F(z,0)\rangle/k_{B}T |
1215 |
|
\end{equation} |
1219 |
|
\end{equation} |
1220 |
|
|
1221 |
|
|
1222 |
< |
If the time-dependent friction decay rapidly, static friction coefficient can |
1223 |
< |
be approximated by% |
907 |
< |
|
1222 |
> |
If the time-dependent friction decays rapidly, the static friction |
1223 |
> |
coefficient can be approximated by |
1224 |
|
\begin{equation} |
1225 |
|
\xi^{static}(z)=\int_{0}^{\infty}\langle\delta F(z,t)\delta F(z,0)\rangle dt |
1226 |
|
\end{equation} |
1227 |
< |
|
912 |
< |
|
913 |
< |
Hence, diffusion constant can be estimated by |
1227 |
> |
Therefore, the diffusion constant can then be estimated by |
1228 |
|
\begin{equation} |
1229 |
|
D(z)=\frac{k_{B}T}{\xi^{static}(z)}=\frac{(k_{B}T)^{2}}{\int_{0}^{\infty |
1230 |
|
}\langle\delta F(z,t)\delta F(z,0)\rangle dt}% |
1231 |
|
\end{equation} |
1232 |
|
|
1233 |
< |
|
1234 |
< |
\bigskip Z-Constraint method, which fixed the z coordinates of the molecules |
1235 |
< |
with respect to the center of the mass of the system, was proposed to obtain |
1236 |
< |
the forces required in force auto-correlation method.\cite{Marrink94} However, |
1237 |
< |
simply resetting the coordinate will move the center of the mass of the whole |
1238 |
< |
system. To avoid this problem, a new method was used at {\sc oopse}. Instead of |
1239 |
< |
resetting the coordinate, we reset the forces of z-constraint molecules as |
1240 |
< |
well as subtract the total constraint forces from the rest of the system after |
1241 |
< |
force calculation at each time step. |
1233 |
> |
The Z-Constraint method, which fixes the z coordinates of the |
1234 |
> |
molecules with respect to the center of the mass of the system, has |
1235 |
> |
been a method suggested to obtain the forces required for the force |
1236 |
> |
auto-correlation calculation.\cite{Marrink94} However, simply resetting the |
1237 |
> |
coordinate will move the center of the mass of the whole system. To |
1238 |
> |
avoid this problem, a new method was used in {\sc oopse}. Instead of |
1239 |
> |
resetting the coordinate, we reset the forces of z-constraint |
1240 |
> |
molecules as well as subtract the total constraint forces from the |
1241 |
> |
rest of the system after force calculation at each time step. |
1242 |
|
\begin{align} |
1243 |
|
F_{\alpha i}&=0\\ |
1244 |
|
V_{\alpha i}&=V_{\alpha i}-\frac{\sum\limits_{i}M_{_{\alpha i}}V_{\alpha i}}{\sum\limits_{i}M_{_{\alpha i}}}\\ |
1246 |
|
V_{\alpha i}&=V_{\alpha i}-\frac{\sum\limits_{\alpha}\sum\limits_{i}M_{_{\alpha i}}V_{\alpha i}}{\sum\limits_{\alpha}\sum\limits_{i}M_{_{\alpha i}}} |
1247 |
|
\end{align} |
1248 |
|
|
1249 |
< |
At the very beginning of the simulation, the molecules may not be at its |
1250 |
< |
constraint position. To move the z-constraint molecule to the specified |
1251 |
< |
position, a simple harmonic potential is used% |
938 |
< |
|
1249 |
> |
At the very beginning of the simulation, the molecules may not be at their |
1250 |
> |
constrained positions. To move a z-constrained molecule to its specified |
1251 |
> |
position, a simple harmonic potential is used |
1252 |
|
\begin{equation} |
1253 |
< |
U(t)=\frac{1}{2}k_{Harmonic}(z(t)-z_{cons})^{2}% |
1253 |
> |
U(t)=\frac{1}{2}k_{\text{Harmonic}}(z(t)-z_{\text{cons}})^{2}% |
1254 |
|
\end{equation} |
1255 |
< |
where $k_{Harmonic}$\bigskip\ is the harmonic force constant, $z(t)$ is |
1256 |
< |
current z coordinate of the center of mass of the z-constraint molecule, and |
1257 |
< |
$z_{cons}$ is the restraint position. Therefore, the harmonic force operated |
1258 |
< |
on the z-constraint molecule at time $t$ can be calculated by% |
1255 |
> |
where $k_{\text{Harmonic}}$ is the harmonic force constant, $z(t)$ is the |
1256 |
> |
current $z$ coordinate of the center of mass of the constrained molecule, and |
1257 |
> |
$z_{\text{cons}}$ is the constrained position. The harmonic force operating |
1258 |
> |
on the z-constrained molecule at time $t$ can be calculated by |
1259 |
|
\begin{equation} |
1260 |
< |
F_{z_{Harmonic}}(t)=-\frac{\partial U(t)}{\partial z(t)}=-k_{Harmonic}% |
1261 |
< |
(z(t)-z_{cons}) |
1260 |
> |
F_{z_{\text{Harmonic}}}(t)=-\frac{\partial U(t)}{\partial z(t)}= |
1261 |
> |
-k_{\text{Harmonic}}(z(t)-z_{\text{cons}}) |
1262 |
|
\end{equation} |
950 |
– |
Worthy of mention, other kinds of potential functions can also be used to |
951 |
– |
drive the z-constraint molecule. |
1263 |
|
|
1264 |
|
\section{\label{oopseSec:props}Trajectory Analysis} |
1265 |
|
|
1266 |
|
\subsection{\label{oopseSec:staticProps}Static Property Analysis} |
1267 |
|
|
1268 |
|
The static properties of the trajectories are analyzed with the |
1269 |
< |
program \texttt{staticProps}. The code is capable of calculating the following |
1270 |
< |
pair correlations between species A and B: |
1271 |
< |
\begin{itemize} |
1272 |
< |
\item $g_{\text{AB}}(r)$: Eq.~\ref{eq:gofr} |
962 |
< |
\item $g_{\text{AB}}(r, \cos \theta)$: Eq.~\ref{eq:gofrCosTheta} |
963 |
< |
\item $g_{\text{AB}}(r, \cos \omega)$: Eq.~\ref{eq:gofrCosOmega} |
964 |
< |
\item $g_{\text{AB}}(x, y, z)$: Eq.~\ref{eq:gofrXYZ} |
965 |
< |
\item $\langle \cos \omega \rangle_{\text{AB}}(r)$: |
966 |
< |
Eq.~\ref{eq:cosOmegaOfR} |
967 |
< |
\end{itemize} |
1269 |
> |
program \texttt{staticProps}. The code is capable of calculating a |
1270 |
> |
number of pair correlations between species A and B. Some of which |
1271 |
> |
only apply to directional entities. The summary of pair correlations |
1272 |
> |
can be found in Table~\ref{oopseTb:gofrs} |
1273 |
|
|
1274 |
+ |
\begin{table} |
1275 |
+ |
\caption[The list of pair correlations in \texttt{staticProps}]{The different pair correlations in \texttt{staticProps} along with whether atom A or B must be directional.} |
1276 |
+ |
\label{oopseTb:gofrs} |
1277 |
+ |
\begin{center} |
1278 |
+ |
\begin{tabular}{|l|c|c|} |
1279 |
+ |
\hline |
1280 |
+ |
Name & Equation & Directional Atom \\ \hline |
1281 |
+ |
$g_{\text{AB}}(r)$ & Eq.~\ref{eq:gofr} & neither \\ \hline |
1282 |
+ |
$g_{\text{AB}}(r, \cos \theta)$ & Eq.~\ref{eq:gofrCosTheta} & A \\ \hline |
1283 |
+ |
$g_{\text{AB}}(r, \cos \omega)$ & Eq.~\ref{eq:gofrCosOmega} & both \\ \hline |
1284 |
+ |
$g_{\text{AB}}(x, y, z)$ & Eq.~\ref{eq:gofrXYZ} & neither \\ \hline |
1285 |
+ |
$\langle \cos \omega \rangle_{\text{AB}}(r)$ & Eq.~\ref{eq:cosOmegaOfR} &% |
1286 |
+ |
both \\ \hline |
1287 |
+ |
\end{tabular} |
1288 |
+ |
\end{center} |
1289 |
+ |
\end{table} |
1290 |
+ |
|
1291 |
|
The first pair correlation, $g_{\text{AB}}(r)$, is defined as follows: |
1292 |
|
\begin{equation} |
1293 |
|
g_{\text{AB}}(r) = \frac{V}{N_{\text{A}}N_{\text{B}}}\langle %% |
1365 |
|
correlation that gives the average correlation of two directional |
1366 |
|
entities as a function of their distance from each other. |
1367 |
|
|
1046 |
– |
All static properties are calculated on a frame by frame basis. The |
1047 |
– |
trajectory is read a single frame at a time, and the appropriate |
1048 |
– |
calculations are done on each frame. Once one frame is finished, the |
1049 |
– |
next frame is read in, and a running average of the property being |
1050 |
– |
calculated is accumulated in each frame. The program allows for the |
1051 |
– |
user to specify more than one property be calculated in single run, |
1052 |
– |
preventing the need to read a file multiple times. |
1053 |
– |
|
1368 |
|
\subsection{\label{dynamicProps}Dynamic Property Analysis} |
1369 |
|
|
1370 |
|
The dynamic properties of a trajectory are calculated with the program |
1371 |
< |
\texttt{dynamicProps}. The program will calculate the following properties: |
1371 |
> |
\texttt{dynamicProps}. The program calculates the following properties: |
1372 |
|
\begin{gather} |
1373 |
|
\langle | \mathbf{r}(t) - \mathbf{r}(0) |^2 \rangle \label{eq:rms}\\ |
1374 |
|
\langle \mathbf{v}(t) \cdot \mathbf{v}(0) \rangle \label{eq:velCorr} \\ |
1375 |
|
\langle \mathbf{j}(t) \cdot \mathbf{j}(0) \rangle \label{eq:angularVelCorr} |
1376 |
|
\end{gather} |
1377 |
|
|
1378 |
< |
Eq.~\ref{eq:rms} is the root mean square displacement |
1379 |
< |
function. Eq.~\ref{eq:velCorr} and Eq.~\ref{eq:angularVelCorr} are the |
1380 |
< |
velocity and angular velocity correlation functions respectively. The |
1381 |
< |
latter is only applicable to directional species in the simulation. |
1378 |
> |
Eq.~\ref{eq:rms} is the root mean square displacement function. Which |
1379 |
> |
allows one to observe the average displacement of an atom as a |
1380 |
> |
function of time. The quantity is useful when calculating diffusion |
1381 |
> |
coefficients because of the Einstein Relation, which is valid at long |
1382 |
> |
times.\cite{allen87:csl} |
1383 |
> |
\begin{equation} |
1384 |
> |
2tD = \langle | \mathbf{r}(t) - \mathbf{r}(0) |^2 \rangle |
1385 |
> |
\label{oopseEq:einstein} |
1386 |
> |
\end{equation} |
1387 |
|
|
1388 |
< |
The \texttt{dynamicProps} program handles he file in a manner different from |
1389 |
< |
\texttt{staticProps}. As the properties calculated by this program are time |
1390 |
< |
dependent, multiple frames must be read in simultaneously by the |
1391 |
< |
program. For small trajectories this is no problem, and the entire |
1392 |
< |
trajectory is read into memory. However, for long trajectories of |
1074 |
< |
large systems, the files can be quite large. In order to accommodate |
1075 |
< |
large files, \texttt{dynamicProps} adopts a scheme whereby two blocks of memory |
1076 |
< |
are allocated to read in several frames each. |
1388 |
> |
Eq.~\ref{eq:velCorr} and \ref{eq:angularVelCorr} are the translational |
1389 |
> |
velocity and angular velocity correlation functions respectively. The |
1390 |
> |
latter is only applicable to directional species in the |
1391 |
> |
simulation. The velocity autocorrelation functions are useful when |
1392 |
> |
determining vibrational information about the system of interest. |
1393 |
|
|
1078 |
– |
In this two block scheme, the correlation functions are first |
1079 |
– |
calculated within each memory block, then the cross correlations |
1080 |
– |
between the frames contained within the two blocks are |
1081 |
– |
calculated. Once completed, the memory blocks are incremented, and the |
1082 |
– |
process is repeated. A diagram illustrating the process is shown in |
1083 |
– |
Fig.~\ref{oopseFig:dynamicPropsMemory}. As was the case with |
1084 |
– |
\texttt{staticProps}, multiple properties may be calculated in a |
1085 |
– |
single run to avoid multiple reads on the same file. |
1086 |
– |
|
1087 |
– |
|
1088 |
– |
|
1394 |
|
\section{\label{oopseSec:design}Program Design} |
1395 |
|
|
1396 |
|
\subsection{\label{sec:architecture} {\sc oopse} Architecture} |
1429 |
|
developed to utilize the routines provided by \texttt{libBASS} and |
1430 |
|
\texttt{libmdtools}. The main program of the package is \texttt{oopse} |
1431 |
|
and the corresponding parallel version \texttt{oopse\_MPI}. These two |
1432 |
< |
programs will take the \texttt{.bass} file, and create then integrate |
1432 |
> |
programs will take the \texttt{.bass} file, and create (and integrate) |
1433 |
|
the simulation specified in the script. The two analysis programs |
1434 |
|
\texttt{staticProps} and \texttt{dynamicProps} utilize the core |
1435 |
|
libraries to initialize and read in trajectories from previously |
1441 |
|
|
1442 |
|
\subsection{\label{oopseSec:parallelization} Parallelization of {\sc oopse}} |
1443 |
|
|
1444 |
< |
Although processor power is continually growing month by month, it is |
1445 |
< |
still unreasonable to simulate systems of more then a 1000 atoms on a |
1446 |
< |
single processor. To facilitate study of larger system sizes or |
1447 |
< |
smaller systems on long time scales in a reasonable period of time, |
1448 |
< |
parallel methods were developed allowing multiple CPU's to share the |
1449 |
< |
simulation workload. Three general categories of parallel |
1450 |
< |
decomposition method's have been developed including atomic, spatial |
1451 |
< |
and force decomposition methods. |
1444 |
> |
Although processor power is continually growing roughly following |
1445 |
> |
Moore's Law, it is still unreasonable to simulate systems of more then |
1446 |
> |
a 1000 atoms on a single processor. To facilitate study of larger |
1447 |
> |
system sizes or smaller systems on long time scales in a reasonable |
1448 |
> |
period of time, parallel methods were developed allowing multiple |
1449 |
> |
CPU's to share the simulation workload. Three general categories of |
1450 |
> |
parallel decomposition methods have been developed including atomic, |
1451 |
> |
spatial and force decomposition methods. |
1452 |
|
|
1453 |
< |
Algorithmically simplest of the three method's is atomic decomposition |
1453 |
> |
Algorithmically simplest of the three methods is atomic decomposition |
1454 |
|
where N particles in a simulation are split among P processors for the |
1455 |
|
duration of the simulation. Computational cost scales as an optimal |
1456 |
|
$O(N/P)$ for atomic decomposition. Unfortunately all processors must |
1457 |
< |
communicate positions and forces with all other processors leading |
1458 |
< |
communication to scale as an unfavorable $O(N)$ independent of the |
1459 |
< |
number of processors. This communication bottleneck led to the |
1460 |
< |
development of spatial and force decomposition methods in which |
1461 |
< |
communication among processors scales much more favorably. Spatial or |
1462 |
< |
domain decomposition divides the physical spatial domain into 3D boxes |
1463 |
< |
in which each processor is responsible for calculation of forces and |
1464 |
< |
positions of particles located in its box. Particles are reassigned to |
1465 |
< |
different processors as they move through simulation space. To |
1466 |
< |
calculate forces on a given particle, a processor must know the |
1467 |
< |
positions of particles within some cutoff radius located on nearby |
1468 |
< |
processors instead of the positions of particles on all |
1469 |
< |
processors. Both communication between processors and computation |
1470 |
< |
scale as $O(N/P)$ in the spatial method. However, spatial |
1457 |
> |
communicate positions and forces with all other processors at every |
1458 |
> |
force evaluation, leading communication costs to scale as an |
1459 |
> |
unfavorable $O(N)$, \emph{independent of the number of processors}. This |
1460 |
> |
communication bottleneck led to the development of spatial and force |
1461 |
> |
decomposition methods in which communication among processors scales |
1462 |
> |
much more favorably. Spatial or domain decomposition divides the |
1463 |
> |
physical spatial domain into 3D boxes in which each processor is |
1464 |
> |
responsible for calculation of forces and positions of particles |
1465 |
> |
located in its box. Particles are reassigned to different processors |
1466 |
> |
as they move through simulation space. To calculate forces on a given |
1467 |
> |
particle, a processor must know the positions of particles within some |
1468 |
> |
cutoff radius located on nearby processors instead of the positions of |
1469 |
> |
particles on all processors. Both communication between processors and |
1470 |
> |
computation scale as $O(N/P)$ in the spatial method. However, spatial |
1471 |
|
decomposition adds algorithmic complexity to the simulation code and |
1472 |
|
is not very efficient for small N since the overall communication |
1473 |
|
scales as the surface to volume ratio $(N/P)^{2/3}$ in three |
1474 |
|
dimensions. |
1475 |
|
|
1476 |
< |
Force decomposition assigns particles to processors based on a block |
1477 |
< |
decomposition of the force matrix. Processors are split into a |
1478 |
< |
optimally square grid forming row and column processor groups. Forces |
1479 |
< |
are calculated on particles in a given row by particles located in |
1480 |
< |
that processors column assignment. Force decomposition is less complex |
1481 |
< |
to implement then the spatial method but still scales computationally |
1482 |
< |
as $O(N/P)$ and scales as $(N/\sqrt{p})$ in communication |
1483 |
< |
cost. Plimpton also found that force decompositions scales more |
1484 |
< |
favorably then spatial decomposition up to 10,000 atoms and favorably |
1485 |
< |
competes with spatial methods for up to 100,000 atoms. |
1476 |
> |
The parallelization method used in {\sc oopse} is the force |
1477 |
> |
decomposition method. Force decomposition assigns particles to |
1478 |
> |
processors based on a block decomposition of the force |
1479 |
> |
matrix. Processors are split into an optimally square grid forming row |
1480 |
> |
and column processor groups. Forces are calculated on particles in a |
1481 |
> |
given row by particles located in that processors column |
1482 |
> |
assignment. Force decomposition is less complex to implement than the |
1483 |
> |
spatial method but still scales computationally as $O(N/P)$ and scales |
1484 |
> |
as $O(N/\sqrt{P})$ in communication cost. Plimpton has also found that |
1485 |
> |
force decompositions scale more favorably than spatial decompositions |
1486 |
> |
for systems up to 10,000 atoms and favorably compete with spatial |
1487 |
> |
methods up to 100,000 atoms.\cite{plimpton95} |
1488 |
|
|
1489 |
|
\subsection{\label{oopseSec:memAlloc}Memory Issues in Trajectory Analysis} |
1490 |
|
|
1491 |
|
For large simulations, the trajectory files can sometimes reach sizes |
1492 |
|
in excess of several gigabytes. In order to effectively analyze that |
1493 |
< |
amount of data+, two memory management schemes have been devised for |
1493 |
> |
amount of data, two memory management schemes have been devised for |
1494 |
|
\texttt{staticProps} and for \texttt{dynamicProps}. The first scheme, |
1495 |
|
developed for \texttt{staticProps}, is the simplest. As each frame's |
1496 |
|
statistics are calculated independent of each other, memory is |
1498 |
|
complete for the snapshot. To prevent multiple passes through a |
1499 |
|
potentially large file, \texttt{staticProps} is capable of calculating |
1500 |
|
all requested correlations per frame with only a single pair loop in |
1501 |
< |
each frame and a single read through of the file. |
1501 |
> |
each frame and a single read of the file. |
1502 |
|
|
1503 |
|
The second, more advanced memory scheme, is used by |
1504 |
|
\texttt{dynamicProps}. Here, the program must have multiple frames in |
1508 |
|
in blocks. The number of frames in each block is specified by the |
1509 |
|
user, and upon reading a block of the trajectory, |
1510 |
|
\texttt{dynamicProps} will calculate all of the time correlation frame |
1511 |
< |
pairs within the block. After in block correlations are complete, a |
1511 |
> |
pairs within the block. After in-block correlations are complete, a |
1512 |
|
second block of the trajectory is read, and the cross correlations are |
1513 |
|
calculated between the two blocks. this second block is then freed and |
1514 |
|
then incremented and the process repeated until the end of the |
1526 |
|
\label{oopseFig:dynamicPropsMemory} |
1527 |
|
\end{figure} |
1528 |
|
|
1222 |
– |
\subsection{\label{openSource}Open Source and Distribution License} |
1223 |
– |
|
1529 |
|
\section{\label{oopseSec:conclusion}Conclusion} |
1530 |
|
|
1531 |
|
We have presented the design and implementation of our open source |
1532 |
< |
simulation package {\sc oopse}. The package offers novel |
1533 |
< |
capabilities to the field of Molecular Dynamics simulation packages in |
1534 |
< |
the form of dipolar force fields, and symplectic integration of rigid |
1535 |
< |
body dynamics. It is capable of scaling across multiple processors |
1536 |
< |
through the use of MPI. It also implements several integration |
1537 |
< |
ensembles allowing the end user control over temperature and |
1538 |
< |
pressure. In addition, it is capable of integrating constrained |
1539 |
< |
dynamics through both the {\sc rattle} algorithm and the z-constraint |
1540 |
< |
method. |
1532 |
> |
simulation package {\sc oopse}. The package offers novel capabilities |
1533 |
> |
to the field of Molecular Dynamics simulation packages in the form of |
1534 |
> |
dipolar force fields, and symplectic integration of rigid body |
1535 |
> |
dynamics. It is capable of scaling across multiple processors through |
1536 |
> |
the use of force based decomposition using MPI. It also implements |
1537 |
> |
several advanced integrators allowing the end user control over |
1538 |
> |
temperature and pressure. In addition, it is capable of integrating |
1539 |
> |
constrained dynamics through both the {\sc rattle} algorithm and the |
1540 |
> |
z-constraint method. |
1541 |
|
|
1542 |
|
These features are all brought together in a single open-source |
1543 |
< |
development package. This allows researchers to not only benefit from |
1543 |
> |
program. Allowing researchers to not only benefit from |
1544 |
|
{\sc oopse}, but also contribute to {\sc oopse}'s development as |
1545 |
|
well.Documentation and source code for {\sc oopse} can be downloaded |
1546 |
|
from \texttt{http://www.openscience.org/oopse/}. |