312 |
|
\begin{figure} |
313 |
|
\centering |
314 |
|
\includegraphics[width=\linewidth]{lipidModel.eps} |
315 |
< |
\caption{A representation of the lipid model. $\phi$ is the torsion angle, $\theta$ % |
315 |
> |
\caption[A representation of a lipid model in {\sc duff}]{A representation of the lipid model. $\phi$ is the torsion angle, $\theta$ % |
316 |
|
is the bend angle, $\mu$ is the dipole moment of the head group, and n |
317 |
|
is the chain length.} |
318 |
|
\label{oopseFig:lipidModel} |
631 |
|
\phi_{ij}({\bf r}_{ij}) \\ |
632 |
|
\rho_{i} & = & \sum_{j \neq i} f_{j}({\bf r}_{ij}) |
633 |
|
\end{eqnarray} |
634 |
< |
where $F_{i} $ is the embedding function that equates the energy required to embed a |
635 |
< |
positively-charged core ion $i$ into a linear superposition of |
636 |
< |
spherically averaged atomic electron densities given by |
637 |
< |
$\rho_{i}$. $\phi_{ij}$ is a primarily repulsive pairwise interaction |
638 |
< |
between atoms $i$ and $j$. In the original formulation of |
639 |
< |
{\sc eam}\cite{Daw84}, $\phi_{ij}$ was an entirely repulsive term, however |
640 |
< |
in later refinements to EAM have shown that non-uniqueness between $F$ |
641 |
< |
and $\phi$ allow for more general forms for $\phi$.\cite{Daw89} |
642 |
< |
There is a cutoff distance, $r_{cut}$, which limits the |
643 |
< |
summations in the {\sc eam} equation to the few dozen atoms |
634 |
> |
where $F_{i} $ is the embedding function that equates the energy |
635 |
> |
required to embed a positively-charged core ion $i$ into a linear |
636 |
> |
superposition of spherically averaged atomic electron densities given |
637 |
> |
by $\rho_{i}$. $\phi_{ij}$ is a primarily repulsive pairwise |
638 |
> |
interaction between atoms $i$ and $j$. In the original formulation of |
639 |
> |
{\sc eam}\cite{Daw84}, $\phi_{ij}$ was an entirely repulsive term, |
640 |
> |
however in later refinements to {\sc eam} have shown that non-uniqueness |
641 |
> |
between $F$ and $\phi$ allow for more general forms for |
642 |
> |
$\phi$.\cite{Daw89} There is a cutoff distance, $r_{cut}$, which |
643 |
> |
limits the summations in the {\sc eam} equation to the few dozen atoms |
644 |
|
surrounding atom $i$ for both the density $\rho$ and pairwise $\phi$ |
645 |
< |
interactions. Foiles et al. fit EAM potentials for fcc metals Cu, Ag, Au, Ni, Pd, Pt and alloys of these metals\cite{FBD86}. These potential fits are in the DYNAMO 86 format and are included with {\sc oopse}. |
645 |
> |
interactions. Foiles \emph{et al}.~fit {\sc eam} potentials for the fcc |
646 |
> |
metals Cu, Ag, Au, Ni, Pd, Pt and alloys of these metals.\cite{FBD86} |
647 |
> |
These fits, are included in {\sc oopse}. |
648 |
|
|
647 |
– |
|
649 |
|
\subsection{\label{oopseSec:pbc}Periodic Boundary Conditions} |
650 |
|
|
651 |
|
\newcommand{\roundme}{\operatorname{round}} |
659 |
|
simulation box is large enough to avoid ``feeling'' the symmetries of |
660 |
|
the periodic lattice, surface effects can be ignored. The available |
661 |
|
periodic cells in OOPSE are cubic, orthorhombic and parallelepiped. We |
662 |
< |
use a $3 \times 3$ matrix, $\mathbf{H}$, to describe the shape and |
663 |
< |
size of the simulation box. $\mathbf{H}$ is defined: |
662 |
> |
use a $3 \times 3$ matrix, $\mathsf{H}$, to describe the shape and |
663 |
> |
size of the simulation box. $\mathsf{H}$ is defined: |
664 |
|
\begin{equation} |
665 |
< |
\mathbf{H} = ( \mathbf{h}_x, \mathbf{h}_y, \mathbf{h}_z ) |
665 |
> |
\mathsf{H} = ( \mathbf{h}_x, \mathbf{h}_y, \mathbf{h}_z ) |
666 |
|
\end{equation} |
667 |
|
Where $\mathbf{h}_j$ is the column vector of the $j$th axis of the |
668 |
|
box. During the course of the simulation both the size and shape of |
669 |
< |
the box can be changed to allow volume fluctations when constraining |
669 |
> |
the box can be changed to allow volume fluctuations when constraining |
670 |
|
the pressure. |
671 |
|
|
672 |
|
A real space vector, $\mathbf{r}$ can be transformed in to a box space |
673 |
|
vector, $\mathbf{s}$, and back through the following transformations: |
674 |
|
\begin{align} |
675 |
< |
\mathbf{s} &= \mathbf{H}^{-1} \mathbf{r} \\ |
676 |
< |
\mathbf{r} &= \mathbf{H} \mathbf{s} |
675 |
> |
\mathbf{s} &= \mathsf{H}^{-1} \mathbf{r} \\ |
676 |
> |
\mathbf{r} &= \mathsf{H} \mathbf{s} |
677 |
|
\end{align} |
678 |
|
The vector $\mathbf{s}$ is now a vector expressed as the number of box |
679 |
|
lengths in the $\mathbf{h}_x$, $\mathbf{h}_y$, and $\mathbf{h}_z$ |
701 |
|
Finally, we obtain the minimum image coordinates $\mathbf{r}^{\prime}$ by |
702 |
|
transforming back to real space, |
703 |
|
\begin{equation} |
704 |
< |
\mathbf{r}^{\prime}=\mathbf{H}^{-1}\mathbf{s}^{\prime}% |
704 |
> |
\mathbf{r}^{\prime}=\mathsf{H}^{-1}\mathbf{s}^{\prime}% |
705 |
|
\end{equation} |
706 |
|
In this way, particles are allowed to diffuse freely in $\mathbf{r}$, |
707 |
|
but their minimum images, $\mathbf{r}^{\prime}$ are used to compute |
708 |
< |
the interatomic forces. |
708 |
> |
the inter-atomic forces. |
709 |
|
|
710 |
|
|
711 |
|
\section{\label{oopseSec:IOfiles}Input and Output Files} |
741 |
|
initialConfig = "./argon.init"; |
742 |
|
|
743 |
|
forceField = "LJ"; |
744 |
< |
ensemble = "NVE"; // specify the simulation enesemble |
744 |
> |
ensemble = "NVE"; // specify the simulation ensemble |
745 |
|
dt = 1.0; // the time step for integration |
746 |
|
runTime = 1e3; // the total simulation run time |
747 |
|
sampleTime = 100; // trajectory file frequency |
782 |
|
|
783 |
|
#include "argon.mdl" |
784 |
|
|
784 |
– |
molecule{ |
785 |
– |
name = "Ar"; |
786 |
– |
nAtoms = 1; |
787 |
– |
atom[0]{ |
788 |
– |
type="Ar"; |
789 |
– |
position( 0.0, 0.0, 0.0 ); |
790 |
– |
} |
791 |
– |
} |
792 |
– |
|
785 |
|
nComponents = 1; |
786 |
|
component{ |
787 |
|
type = "Ar"; |
811 |
|
entities are written out using quanternions, to save space in the |
812 |
|
output files. |
813 |
|
|
814 |
< |
\begin{lstlisting}[float,caption={[The format of the coordinate files]Shows the format of the coordinate files. The fist line is the number of atoms. The second line begins with the time stamp followed by the three $\mathbf{H}$ column vectors. The next lines are the atomic coordinates for all atoms in the system. First is the name followed by position, velocity, quanternions, and lastly angular momentum.},label=sch:dumpFormat] |
814 |
> |
\begin{lstlisting}[float,caption={[The format of the coordinate files]Shows the format of the coordinate files. The fist line is the number of atoms. The second line begins with the time stamp followed by the three $\mathsf{H}$ column vectors. It is important to note, that for extended system ensembles, additional information pertinent to the integrators may be stored on this line as well.. The next lines are the atomic coordinates for all atoms in the system. First is the name followed by position, velocity, quanternions, and lastly angular velocities.},label=sch:dumpFormat] |
815 |
|
|
816 |
|
nAtoms |
817 |
|
time; Hxx Hyx Hzx; Hxy Hyy Hzy; Hxz Hyz Hzz; |
846 |
|
|
847 |
|
As was stated in Sec.~\ref{oopseSec:coordFiles}, an initialization |
848 |
|
file is needed to provide the starting coordinates for a |
849 |
< |
simulation. The {\sc oopse} package provides a program called |
850 |
< |
\texttt{sysBuilder} to aid in the creation of the \texttt{.init} |
851 |
< |
file. \texttt{sysBuilder} uses {\sc bass}, and will recognize |
849 |
> |
simulation. The {\sc oopse} package provides several system building |
850 |
> |
programs to aid in the creation of the \texttt{.init} |
851 |
> |
file. The programs use {\sc bass}, and will recognize |
852 |
|
arguments and parameters in the \texttt{.bass} file that would |
853 |
|
otherwise be ignored by the simulation. |
854 |
|
|
867 |
|
|
868 |
|
\section{\label{oopseSec:mechanics}Mechanics} |
869 |
|
|
878 |
– |
\subsection{\label{oopseSec:integrate}Integrating the Equations of Motion: the Symplectic Step Integrator} |
870 |
|
|
871 |
< |
Integration of the equations of motion was carried out using the |
872 |
< |
symplectic splitting method proposed by Dullweber \emph{et |
873 |
< |
al.}.\cite{Dullweber1997} The reason for the selection of this |
874 |
< |
integrator, is the poor energy conservation of rigid body systems |
875 |
< |
using quaternion dynamics. While quaternions work well for |
876 |
< |
orientational motion in alternate ensembles, the microcanonical |
877 |
< |
ensemble has a constant energy requirement that is quite sensitive to |
878 |
< |
errors in the equations of motion. The original implementation of {\sc |
879 |
< |
oopse} utilized quaternions for rotational motion propagation; |
880 |
< |
however, a detailed investigation showed that they resulted in a |
881 |
< |
steady drift in the total energy, something that has been observed by |
882 |
< |
others.\cite{Laird97} |
871 |
> |
\section{\label{sec:mechanics}Mechanics} |
872 |
> |
|
873 |
> |
\subsection{\label{oopseSec:integrate}Integrating the Equations of Motion: the |
874 |
> |
DLM method} |
875 |
> |
|
876 |
> |
The default method for integrating the equations of motion in {\sc |
877 |
> |
oopse} is a velocity-Verlet version of the symplectic splitting method |
878 |
> |
proposed by Dullweber, Leimkuhler and McLachlan |
879 |
> |
(DLM).\cite{Dullweber1997} When there are no directional atoms or |
880 |
> |
rigid bodies present in the simulation, this integrator becomes the |
881 |
> |
standard velocity-Verlet integrator which is known to sample the |
882 |
> |
microcanonical (NVE) ensemble.\cite{} |
883 |
|
|
884 |
+ |
Previous integration methods for orientational motion have problems |
885 |
+ |
that are avoided in the DLM method. Direct propagation of the Euler |
886 |
+ |
angles has a known $1/\sin\theta$ divergence in the equations of |
887 |
+ |
motion for $\phi$ and $\psi$,\cite{allen87:csl} leading to |
888 |
+ |
numerical instabilities any time one of the directional atoms or rigid |
889 |
+ |
bodies has an orientation near $\theta=0$ or $\theta=\pi$. More |
890 |
+ |
modern quaternion-based integration methods have relatively poor |
891 |
+ |
energy conservation. While quaternions work well for orientational |
892 |
+ |
motion in other ensembles, the microcanonical ensemble has a |
893 |
+ |
constant energy requirement that is quite sensitive to errors in the |
894 |
+ |
equations of motion. An earlier implementation of {\sc oopse} |
895 |
+ |
utilized quaternions for propagation of rotational motion; however, a |
896 |
+ |
detailed investigation showed that they resulted in a steady drift in |
897 |
+ |
the total energy, something that has been observed by |
898 |
+ |
Laird {\it et al.}\cite{Laird97} |
899 |
+ |
|
900 |
|
The key difference in the integration method proposed by Dullweber |
901 |
< |
\emph{et al}.~({\sc dlm}) is that the entire rotation matrix is propagated from |
902 |
< |
one time step to the next. In the past, this would not have been a |
903 |
< |
feasible option, since the rotation matrix for a single body is nine |
904 |
< |
elements long as opposed to three or four elements for Euler angles |
905 |
< |
and quaternions respectively. System memory has become much less of an |
906 |
< |
issue in recent times, and the {\sc dlm} method has used memory in |
907 |
< |
exchange for substantial benefits in energy conservation. |
901 |
> |
\emph{et al.} is that the entire $3 \times 3$ rotation matrix is |
902 |
> |
propagated from one time step to the next. In the past, this would not |
903 |
> |
have been feasible, since the rotation matrix for a single body has |
904 |
> |
nine elements compared with the more memory-efficient methods (using |
905 |
> |
three Euler angles or 4 quaternions). Computer memory has become much |
906 |
> |
less costly in recent years, and this can be translated into |
907 |
> |
substantial benefits in energy conservation. |
908 |
|
|
909 |
< |
The {\sc dlm} method allows for Verlet style integration of both |
910 |
< |
linear and angular motion of rigid bodies. In the integration method, |
911 |
< |
the orientational propagation involves a sequence of matrix |
912 |
< |
evaluations to update the rotation matrix.\cite{Dullweber1997} These |
913 |
< |
matrix rotations are more costly computationally than the simpler |
914 |
< |
arithmetic quaternion propagation. With the same time step, a 1000 SSD |
915 |
< |
particle simulation shows an average 7\% increase in computation time |
916 |
< |
using the {\sc dlm} method in place of quaternions. This cost is more |
917 |
< |
than justified when comparing the energy conservation of the two |
918 |
< |
methods as illustrated in Fig.~\ref{timestep}. |
909 |
> |
The basic equations of motion being integrated are derived from the |
910 |
> |
Hamiltonian for conservative systems containing rigid bodies, |
911 |
> |
\begin{equation} |
912 |
> |
H = \sum_{i} \left( \frac{1}{2} m_i {\bf v}_i^T \cdot {\bf v}_i + |
913 |
> |
\frac{1}{2} {\bf j}_i^T \cdot \overleftrightarrow{\mathsf{I}}_i^{-1} \cdot |
914 |
> |
{\bf j}_i \right) + |
915 |
> |
V\left(\left\{{\bf r}\right\}, \left\{\mathsf{A}\right\}\right) |
916 |
> |
\end{equation} |
917 |
> |
where ${\bf r}_i$ and ${\bf v}_i$ are the cartesian position vector |
918 |
> |
and velocity of the center of mass of particle $i$, and ${\bf j}_i$ |
919 |
> |
and $\overleftrightarrow{\mathsf{I}}_i$ are the body-fixed angular |
920 |
> |
momentum and moment of inertia tensor, respectively. $\mathsf{A}_i$ |
921 |
> |
is the $3 \times 3$ rotation matrix describing the instantaneous |
922 |
> |
orientation of the particle. $V$ is the potential energy function |
923 |
> |
which may depend on both the positions $\left\{{\bf r}\right\}$ and |
924 |
> |
orientations $\left\{\mathsf{A}\right\}$ of all particles. The |
925 |
> |
equations of motion for the particle centers of mass are derived from |
926 |
> |
Hamilton's equations and are quite simple, |
927 |
> |
\begin{eqnarray} |
928 |
> |
\dot{{\bf r}} & = & {\bf v} \\ |
929 |
> |
\dot{{\bf v}} & = & \frac{{\bf f}}{m} |
930 |
> |
\end{eqnarray} |
931 |
> |
where ${\bf f}$ is the instantaneous force on the center of mass |
932 |
> |
of the particle, |
933 |
> |
\begin{equation} |
934 |
> |
{\bf f} = - \frac{\partial}{\partial |
935 |
> |
{\bf r}} V(\left\{{\bf r}(t)\right\}, \left\{\mathsf{A}(t)\right\}). |
936 |
> |
\end{equation} |
937 |
|
|
938 |
+ |
The equations of motion for the orientational degrees of freedom are |
939 |
+ |
\begin{eqnarray} |
940 |
+ |
\dot{\mathsf{A}} & = & \mathsf{A} \cdot |
941 |
+ |
\mbox{ skew}\left(\overleftrightarrow{\mathsf{I}}^{-1} \cdot {\bf j}\right) \\ |
942 |
+ |
\dot{{\bf j}} & = & {\bf j} \times \left( \overleftrightarrow{\mathsf{I}}^{-1} |
943 |
+ |
\cdot {\bf j} \right) - \mbox{ rot}\left(\mathsf{A}^{T} \cdot \frac{\partial |
944 |
+ |
V}{\partial \mathsf{A}} \right) |
945 |
+ |
\end{eqnarray} |
946 |
+ |
In these equations of motion, the $\mbox{skew}$ matrix of a vector |
947 |
+ |
${\bf v} = \left( v_1, v_2, v_3 \right)$ is defined: |
948 |
+ |
\begin{equation} |
949 |
+ |
\mbox{skew}\left( {\bf v} \right) := \left( |
950 |
+ |
\begin{array}{ccc} |
951 |
+ |
0 & v_3 & - v_2 \\ |
952 |
+ |
-v_3 & 0 & v_1 \\ |
953 |
+ |
v_2 & -v_1 & 0 |
954 |
+ |
\end{array} |
955 |
+ |
\right) |
956 |
+ |
\end{equation} |
957 |
+ |
The $\mbox{rot}$ notation refers to the mapping of the $3 \times 3$ |
958 |
+ |
rotation matrix to a vector of orientations by first computing the |
959 |
+ |
skew-symmetric part $\left(\mathsf{A} - \mathsf{A}^{T}\right)$ and |
960 |
+ |
then associating this with a length 3 vector by inverting the |
961 |
+ |
$\mbox{skew}$ function above: |
962 |
+ |
\begin{equation} |
963 |
+ |
\mbox{rot}\left(\mathsf{A}\right) := \mbox{ skew}^{-1}\left(\mathsf{A} |
964 |
+ |
- \mathsf{A}^{T} \right) |
965 |
+ |
\end{equation} |
966 |
+ |
Written this way, the $\mbox{rot}$ operation creates a set of |
967 |
+ |
conjugate angle coordinates to the body-fixed angular momenta |
968 |
+ |
represented by ${\bf j}$. This equation of motion for angular momenta |
969 |
+ |
is equivalent to the more familiar body-fixed forms, |
970 |
+ |
\begin{eqnarray} |
971 |
+ |
\dot{j_{x}} & = & \tau^b_x(t) + |
972 |
+ |
\left(\overleftrightarrow{\mathsf{I}}_{yy} - \overleftrightarrow{\mathsf{I}}_{zz} \right) j_y j_z \\ |
973 |
+ |
\dot{j_{y}} & = & \tau^b_y(t) + |
974 |
+ |
\left(\overleftrightarrow{\mathsf{I}}_{zz} - \overleftrightarrow{\mathsf{I}}_{xx} \right) j_z j_x \\ |
975 |
+ |
\dot{j_{z}} & = & \tau^b_z(t) + |
976 |
+ |
\left(\overleftrightarrow{\mathsf{I}}_{xx} - \overleftrightarrow{\mathsf{I}}_{yy} \right) j_x j_y |
977 |
+ |
\end{eqnarray} |
978 |
+ |
which utilize the body-fixed torques, ${\bf \tau}^b$. Torques are |
979 |
+ |
most easily derived in the space-fixed frame, |
980 |
+ |
\begin{equation} |
981 |
+ |
{\bf \tau}^b(t) = \mathsf{A}(t) \cdot {\bf \tau}^s(t) |
982 |
+ |
\end{equation} |
983 |
+ |
where the torques are either derived from the forces on the |
984 |
+ |
constituent atoms of the rigid body, or for directional atoms, |
985 |
+ |
directly from derivatives of the potential energy, |
986 |
+ |
\begin{equation} |
987 |
+ |
{\bf \tau}^s(t) = - \hat{\bf u}(t) \times \left( \frac{\partial} |
988 |
+ |
{\partial \hat{\bf u}} V\left(\left\{ {\bf r}(t) \right\}, \left\{ |
989 |
+ |
\mathsf{A}(t) \right\}\right) \right). |
990 |
+ |
\end{equation} |
991 |
+ |
Here $\hat{\bf u}$ is a unit vector pointing along the principal axis |
992 |
+ |
of the particle in the space-fixed frame. |
993 |
+ |
|
994 |
+ |
The DLM method uses a Trotter factorization of the orientational |
995 |
+ |
propagator. This has three effects: |
996 |
+ |
\begin{enumerate} |
997 |
+ |
\item the integrator is area-preserving in phase space (i.e. it is |
998 |
+ |
{\it symplectic}), |
999 |
+ |
\item the integrator is time-{\it reversible}, making it suitable for Hybrid |
1000 |
+ |
Monte Carlo applications, and |
1001 |
+ |
\item the error for a single time step is of order $O\left(h^3\right)$ |
1002 |
+ |
for timesteps of length $h$. |
1003 |
+ |
\end{enumerate} |
1004 |
+ |
|
1005 |
+ |
The integration of the equations of motion is carried out in a |
1006 |
+ |
velocity-Verlet style 2-part algorithm: |
1007 |
+ |
|
1008 |
+ |
{\tt moveA:} |
1009 |
+ |
\begin{eqnarray} |
1010 |
+ |
{\bf v}\left(t + \delta t / 2\right) & \leftarrow & {\bf |
1011 |
+ |
v}(t) + \frac{\delta t}{2} \left( {\bf f}(t) / m \right) \\ |
1012 |
+ |
{\bf r}(t + \delta t) & \leftarrow & {\bf r}(t) + \delta t {\bf |
1013 |
+ |
v}\left(t + \delta t / 2 \right) \\ |
1014 |
+ |
{\bf j}\left(t + \delta t / 2 \right) & \leftarrow & {\bf |
1015 |
+ |
j}(t) + \frac{\delta t}{2} {\bf \tau}^b(t) \\ |
1016 |
+ |
\mathsf{A}(t + \delta t) & \leftarrow & \mathrm{rot}\left( \delta t |
1017 |
+ |
{\bf j}(t + \delta t / 2) \cdot \overleftrightarrow{\mathsf{I}}^{-1} |
1018 |
+ |
\right) |
1019 |
+ |
\end{eqnarray} |
1020 |
+ |
|
1021 |
+ |
In this context, the $\mathrm{rot}$ function is the reversible product |
1022 |
+ |
of the three body-fixed rotations, |
1023 |
+ |
\begin{equation} |
1024 |
+ |
\mathrm{rot}({\bf a}) = \mathsf{G}_x(a_x / 2) \cdot |
1025 |
+ |
\mathsf{G}_y(a_y / 2) \cdot \mathsf{G}_z(a_z) \cdot \mathsf{G}_y(a_y / |
1026 |
+ |
2) \cdot \mathsf{G}_x(a_x /2) |
1027 |
+ |
\end{equation} |
1028 |
+ |
where each rotational propagator, $\mathsf{G}_\alpha(\theta)$, rotates |
1029 |
+ |
both the rotation matrix ($\mathsf{A}$) and the body-fixed angular |
1030 |
+ |
momentum (${\bf j}$) by an angle $\theta$ around body-fixed axis |
1031 |
+ |
$\alpha$, |
1032 |
+ |
\begin{equation} |
1033 |
+ |
\mathsf{G}_\alpha( \theta ) = \left\{ |
1034 |
+ |
\begin{array}{lcl} |
1035 |
+ |
\mathsf{A}(t) & \leftarrow & \mathsf{A}(0) \cdot \mathsf{R}_\alpha(\theta)^T \\ |
1036 |
+ |
{\bf j}(t) & \leftarrow & \mathsf{R}_\alpha(\theta) \cdot {\bf j}(0) |
1037 |
+ |
\end{array} |
1038 |
+ |
\right. |
1039 |
+ |
\end{equation} |
1040 |
+ |
$\mathsf{R}_\alpha$ is a quadratic approximation to |
1041 |
+ |
the single-axis rotation matrix. For example, in the small-angle |
1042 |
+ |
limit, the rotation matrix around the body-fixed x-axis can be |
1043 |
+ |
approximated as |
1044 |
+ |
\begin{equation} |
1045 |
+ |
\mathsf{R}_x(\theta) \approx \left( |
1046 |
+ |
\begin{array}{ccc} |
1047 |
+ |
1 & 0 & 0 \\ |
1048 |
+ |
0 & \frac{1-\theta^2 / 4}{1 + \theta^2 / 4} & -\frac{\theta}{1+ |
1049 |
+ |
\theta^2 / 4} \\ |
1050 |
+ |
0 & \frac{\theta}{1+ |
1051 |
+ |
\theta^2 / 4} & \frac{1-\theta^2 / 4}{1 + \theta^2 / 4} |
1052 |
+ |
\end{array} |
1053 |
+ |
\right). |
1054 |
+ |
\end{equation} |
1055 |
+ |
All other rotations follow in a straightforward manner. |
1056 |
+ |
|
1057 |
+ |
After the first part of the propagation, the forces and body-fixed |
1058 |
+ |
torques are calculated at the new positions and orientations |
1059 |
+ |
|
1060 |
+ |
{\tt doForces:} |
1061 |
+ |
\begin{eqnarray} |
1062 |
+ |
{\bf f}(t + \delta t) & \leftarrow & - \left(\frac{\partial V}{\partial {\bf |
1063 |
+ |
r}}\right)_{{\bf r}(t + \delta t)} \\ |
1064 |
+ |
{\bf \tau}^{s}(t + \delta t) & \leftarrow & {\bf u}(t + \delta t) |
1065 |
+ |
\times \frac{\partial V}{\partial {\bf u}} \\ |
1066 |
+ |
{\bf \tau}^{b}(t + \delta t) & \leftarrow & \mathsf{A}(t + \delta t) |
1067 |
+ |
\cdot {\bf \tau}^s(t + \delta t) |
1068 |
+ |
\end{eqnarray} |
1069 |
+ |
|
1070 |
+ |
{\sc oopse} automatically updates ${\bf u}$ when the rotation matrix |
1071 |
+ |
$\mathsf{A}$ is calculated in {\tt moveA}. Once the forces and |
1072 |
+ |
torques have been obtained at the new time step, the velocities can be |
1073 |
+ |
advanced to the same time value. |
1074 |
+ |
|
1075 |
+ |
{\tt moveB:} |
1076 |
+ |
\begin{eqnarray} |
1077 |
+ |
{\bf v}\left(t + \delta t \right) & \leftarrow & {\bf |
1078 |
+ |
v}\left(t + \delta t / 2 \right) + \frac{\delta t}{2} \left( |
1079 |
+ |
{\bf f}(t + \delta t) / m \right) \\ |
1080 |
+ |
{\bf j}\left(t + \delta t \right) & \leftarrow & {\bf |
1081 |
+ |
j}\left(t + \delta t / 2 \right) + \frac{\delta t}{2} {\bf |
1082 |
+ |
\tau}^b(t + \delta t) |
1083 |
+ |
\end{eqnarray} |
1084 |
+ |
|
1085 |
+ |
The matrix rotations used in the DLM method end up being more costly |
1086 |
+ |
computationally than the simpler arithmetic quaternion |
1087 |
+ |
propagation. With the same time step, a 1000-molecule water simulation |
1088 |
+ |
shows an average 7\% increase in computation time using the DLM method |
1089 |
+ |
in place of quaternions. This cost is more than justified when |
1090 |
+ |
comparing the energy conservation of the two methods as illustrated in |
1091 |
+ |
figure \ref{timestep}. |
1092 |
+ |
|
1093 |
|
\begin{figure} |
1094 |
|
\centering |
1095 |
|
\includegraphics[width=\linewidth]{timeStep.eps} |
1096 |
< |
\caption[Energy conservation for quaternion versus {\sc dlm} dynamics]{Energy conservation using quaternion based integration versus |
1097 |
< |
the {\sc dlm} method with |
1098 |
< |
increasing time step. For each time step, the dotted line is total |
1099 |
< |
energy using the {\sc dlm} integrator, and the solid line comes |
1100 |
< |
from the quaternion integrator. The larger time step plots are shifted |
1101 |
< |
up from the true energy baseline for clarity.} |
1096 |
> |
\caption[Energy conservation for quaternion versus DLM dynamics]{Energy conservation using quaternion based integration versus |
1097 |
> |
the method proposed by Dullweber \emph{et al.} with increasing time |
1098 |
> |
step. For each time step, the dotted line is total energy using the |
1099 |
> |
DLM integrator, and the solid line comes from the quaternion |
1100 |
> |
integrator. The larger time step plots are shifted up from the true |
1101 |
> |
energy baseline for clarity.} |
1102 |
|
\label{timestep} |
1103 |
|
\end{figure} |
1104 |
|
|
1105 |
< |
In Fig.~\ref{timestep}, the resulting energy drift at various time |
1106 |
< |
steps for both the {\sc dlm} and quaternion integration schemes |
1107 |
< |
is compared. All of the 1000 SSD particle simulations started with the |
1105 |
> |
In figure \ref{timestep}, the resulting energy drift at various time |
1106 |
> |
steps for both the DLM and quaternion integration schemes is |
1107 |
> |
compared. All of the 1000 molecule water simulations started with the |
1108 |
|
same configuration, and the only difference was the method for |
1109 |
|
handling rotational motion. At time steps of 0.1 and 0.5 fs, both |
1110 |
< |
methods for propagating particle rotation conserve energy fairly well, |
1110 |
> |
methods for propagating molecule rotation conserve energy fairly well, |
1111 |
|
with the quaternion method showing a slight energy drift over time in |
1112 |
|
the 0.5 fs time step simulation. At time steps of 1 and 2 fs, the |
1113 |
< |
energy conservation benefits of the {\sc dlm} method are clearly |
1113 |
> |
energy conservation benefits of the DLM method are clearly |
1114 |
|
demonstrated. Thus, while maintaining the same degree of energy |
1115 |
|
conservation, one can take considerably longer time steps, leading to |
1116 |
|
an overall reduction in computation time. |
1117 |
|
|
1118 |
< |
Energy drift in these SSD particle simulations was unnoticeable for |
1119 |
< |
time steps up to three femtoseconds. A slight energy drift on the |
940 |
< |
order of 0.012 kcal/mol per nanosecond was observed at a time step of |
941 |
< |
four femtoseconds, and as expected, this drift increases dramatically |
942 |
< |
with increasing time step. |
1118 |
> |
There is only one specific keyword relevant to the default integrator, |
1119 |
> |
and that is the time step for integrating the equations of motion. |
1120 |
|
|
1121 |
+ |
\begin{center} |
1122 |
+ |
\begin{tabular}{llll} |
1123 |
+ |
{\bf variable} & {\bf {\tt .bass} keyword} & {\bf units} & {\bf |
1124 |
+ |
default value} \\ |
1125 |
+ |
$\delta t$ & {\tt dt = 2.0;} & fs & none |
1126 |
+ |
\end{tabular} |
1127 |
+ |
\end{center} |
1128 |
|
|
1129 |
|
\subsection{\label{sec:extended}Extended Systems for other Ensembles} |
1130 |
|
|
1131 |
+ |
{\sc oopse} implements a number of extended system integrators for |
1132 |
+ |
sampling from other ensembles relevant to chemical physics. The |
1133 |
+ |
integrator can selected with the {\tt ensemble} keyword in the |
1134 |
+ |
{\tt .bass} file: |
1135 |
|
|
1136 |
< |
{\sc oopse} implements a |
1136 |
> |
\begin{center} |
1137 |
> |
\begin{tabular}{lll} |
1138 |
> |
{\bf Integrator} & {\bf Ensemble} & {\bf {\tt .bass} line} \\ |
1139 |
> |
NVE & microcanonical & {\tt ensemble = ``NVE''; } \\ |
1140 |
> |
NVT & canonical & {\tt ensemble = ``NVT''; } \\ |
1141 |
> |
NPTi & isobaric-isothermal (with isotropic volume changes) & {\tt |
1142 |
> |
ensemble = ``NPTi'';} \\ |
1143 |
> |
NPTf & isobaric-isothermal (with changes to box shape) & {\tt |
1144 |
> |
ensemble = ``NPTf'';} \\ |
1145 |
> |
NPTxyz & approximate isobaric-isothermal & {\tt ensemble = |
1146 |
> |
``NPTxyz'';} \\ |
1147 |
> |
& (with separate barostats on each box dimension) & |
1148 |
> |
\end{tabular} |
1149 |
> |
\end{center} |
1150 |
|
|
1151 |
+ |
The relatively well-known Nos\'e-Hoover thermostat is implemented in |
1152 |
+ |
{\sc oopse}'s NVT integrator. This method couples an extra degree of |
1153 |
+ |
freedom (the thermostat) to the kinetic energy of the system, and has |
1154 |
+ |
been shown to sample the canonical distribution in the system degrees |
1155 |
+ |
of freedom while conserving a quantity that is, to within a constant, |
1156 |
+ |
the Helmholtz free energy. |
1157 |
|
|
1158 |
< |
\subsection{\label{oopseSec:noseHooverThermo}Nose-Hoover Thermostatting} |
1158 |
> |
NPT algorithms attempt to maintain constant pressure in the system by |
1159 |
> |
coupling the volume of the system to a barostat. {\sc oopse} contains |
1160 |
> |
three different constant pressure algorithms. The first two, NPTi and |
1161 |
> |
NPTf have been shown to conserve a quantity that is, to within a |
1162 |
> |
constant, the Gibbs free energy. The Melchionna modification to the |
1163 |
> |
Hoover barostat is implemented in both NPTi and NPTf. NPTi allows |
1164 |
> |
only isotropic changes in the simulation box, while box {\it shape} |
1165 |
> |
variations are allowed in NPTf. The NPTxyz integrator has {\it not} |
1166 |
> |
been shown to sample from the isobaric-isothermal ensemble. It is |
1167 |
> |
useful, however, in that it maintains orthogonality for the axes of |
1168 |
> |
the simulation box while attempting to equalize pressure along the |
1169 |
> |
three perpendicular directions in the box. |
1170 |
|
|
1171 |
< |
To mimic the effects of being in a constant temperature ({\sc nvt}) |
1172 |
< |
ensemble, {\sc oopse} uses the Nose-Hoover extended system |
1173 |
< |
approach.\cite{Hoover85} In this method, the equations of motion for |
1174 |
< |
the particle positions and velocities are |
1171 |
> |
Each of the extended system integrators requires additional keywords |
1172 |
> |
to set target values for the thermodynamic state variables that are |
1173 |
> |
being held constant. Keywords are also required to set the |
1174 |
> |
characteristic decay times for the dynamics of the extended |
1175 |
> |
variables. |
1176 |
> |
|
1177 |
> |
\begin{tabular}{llll} |
1178 |
> |
{\bf variable} & {\bf {\tt .bass} keyword} & {\bf units} & {\bf |
1179 |
> |
default value} \\ |
1180 |
> |
$T_{\mathrm{target}}$ & {\tt targetTemperature = 300;} & K & none \\ |
1181 |
> |
$P_{\mathrm{target}}$ & {\tt targetPressure = 1;} & atm & none \\ |
1182 |
> |
$\tau_T$ & {\tt tauThermostat = 1e3;} & fs & none \\ |
1183 |
> |
$\tau_B$ & {\tt tauBarostat = 5e3;} & fs & none \\ |
1184 |
> |
& {\tt resetTime = 200;} & fs & none \\ |
1185 |
> |
& {\tt useInitialExtendedSystemState = ``true'';} & logical & |
1186 |
> |
false |
1187 |
> |
\end{tabular} |
1188 |
> |
|
1189 |
> |
Two additional keywords can be used to either clear the extended |
1190 |
> |
system variables periodically ({\tt resetTime}), or to maintain the |
1191 |
> |
state of the extended system variables between simulations ({\tt |
1192 |
> |
useInitialExtendedSystemState}). More details on these variables |
1193 |
> |
and their use in the integrators follows below. |
1194 |
> |
|
1195 |
> |
\subsubsection{\label{oopseSec:noseHooverThermo}Nos\'{e}-Hoover Thermostatting} |
1196 |
> |
|
1197 |
> |
The Nos\'e-Hoover equations of motion are given by\cite{Hoover85} |
1198 |
|
\begin{eqnarray} |
1199 |
|
\dot{{\bf r}} & = & {\bf v} \\ |
1200 |
< |
\dot{{\bf v}} & = & \frac{{\bf f}}{m} - \chi {\bf v} |
1200 |
> |
\dot{{\bf v}} & = & \frac{{\bf f}}{m} - \chi {\bf v} \\ |
1201 |
> |
\dot{\mathsf{A}} & = & \mathsf{A} \cdot |
1202 |
> |
\mbox{ skew}\left(\overleftrightarrow{\mathsf{I}}^{-1} \cdot {\bf j}\right) \\ |
1203 |
> |
\dot{{\bf j}} & = & {\bf j} \times \left( \overleftrightarrow{\mathsf{I}}^{-1} |
1204 |
> |
\cdot {\bf j} \right) - \mbox{ rot}\left(\mathsf{A}^{T} \cdot \frac{\partial |
1205 |
> |
V}{\partial \mathsf{A}} \right) - \chi {\bf j} |
1206 |
|
\label{eq:nosehoovereom} |
1207 |
|
\end{eqnarray} |
1208 |
|
|
1209 |
|
$\chi$ is an ``extra'' variable included in the extended system, and |
1210 |
|
it is propagated using the first order equation of motion |
1211 |
|
\begin{equation} |
1212 |
< |
\dot{\chi} = \frac{1}{\tau_{T}} \left( \frac{T}{T_{target}} - 1 \right) |
1212 |
> |
\dot{\chi} = \frac{1}{\tau_{T}^2} \left( \frac{T}{T_{\mathrm{target}}} - 1 \right). |
1213 |
|
\label{eq:nosehooverext} |
1214 |
|
\end{equation} |
969 |
– |
where $T_{target}$ is the target temperature for the simulation, and |
970 |
– |
$\tau_{T}$ is a time constant for the thermostat. |
1215 |
|
|
1216 |
< |
To select the Nose-Hoover {\sc nvt} ensemble, the {\tt ensemble = NVT;} |
1217 |
< |
command would be used in the simulation's {\sc bass} file. There is |
1218 |
< |
some subtlety in choosing values for $\tau_{T}$, and it is usually set |
1219 |
< |
to values of a few ps. Within a {\sc bass} file, $\tau_{T}$ could be |
1220 |
< |
set to 1 ps using the {\tt tauThermostat = 1000; } command. |
1216 |
> |
The instantaneous temperature $T$ is proportional to the total kinetic |
1217 |
> |
energy (both translational and orientational) and is given by |
1218 |
> |
\begin{equation} |
1219 |
> |
T = \frac{2 K}{f k_B} |
1220 |
> |
\end{equation} |
1221 |
> |
Here, $f$ is the total number of degrees of freedom in the system, |
1222 |
> |
\begin{equation} |
1223 |
> |
f = 3 N + 3 N_{\mathrm{orient}} - N_{\mathrm{constraints}} |
1224 |
> |
\end{equation} |
1225 |
> |
and $K$ is the total kinetic energy, |
1226 |
> |
\begin{equation} |
1227 |
> |
K = \sum_{i=1}^{N} \frac{1}{2} m_i {\bf v}_i^T \cdot {\bf v}_i + |
1228 |
> |
\sum_{i=1}^{N_{\mathrm{orient}}} \frac{1}{2} {\bf j}_i^T \cdot |
1229 |
> |
\overleftrightarrow{\mathsf{I}}_i^{-1} \cdot {\bf j}_i |
1230 |
> |
\end{equation} |
1231 |
> |
|
1232 |
> |
In eq.(\ref{eq:nosehooverext}), $\tau_T$ is the time constant for |
1233 |
> |
relaxation of the temperature to the target value. To set values for |
1234 |
> |
$\tau_T$ or $T_{\mathrm{target}}$ in a simulation, one would use the |
1235 |
> |
{\tt tauThermostat} and {\tt targetTemperature} keywords in the {\tt |
1236 |
> |
.bass} file. The units for {\tt tauThermostat} are fs, and the units |
1237 |
> |
for the {\tt targetTemperature} are degrees K. The integration of |
1238 |
> |
the equations of motion is carried out in a velocity-Verlet style 2 |
1239 |
> |
part algorithm: |
1240 |
> |
|
1241 |
> |
{\tt moveA:} |
1242 |
> |
\begin{eqnarray} |
1243 |
> |
T(t) & \leftarrow & \left\{{\bf v}(t)\right\}, \left\{{\bf j}(t)\right\} \\ |
1244 |
> |
{\bf v}\left(t + \delta t / 2\right) & \leftarrow & {\bf |
1245 |
> |
v}(t) + \frac{\delta t}{2} \left( \frac{{\bf f}(t)}{m} - {\bf v}(t) |
1246 |
> |
\chi(t)\right) \\ |
1247 |
> |
{\bf r}(t + \delta t) & \leftarrow & {\bf r}(t) + \delta t {\bf |
1248 |
> |
v}\left(t + \delta t / 2 \right) \\ |
1249 |
> |
{\bf j}\left(t + \delta t / 2 \right) & \leftarrow & {\bf |
1250 |
> |
j}(t) + \frac{\delta t}{2} \left( {\bf \tau}^b(t) - {\bf j}(t) |
1251 |
> |
\chi(t) \right) \\ |
1252 |
> |
\mathsf{A}(t + \delta t) & \leftarrow & \mathrm{rot}\left(\delta t * |
1253 |
> |
{\bf j}(t + \delta t / 2) \overleftrightarrow{\mathsf{I}}^{-1} \right) \\ |
1254 |
> |
\chi\left(t + \delta t / 2 \right) & \leftarrow & \chi(t) + |
1255 |
> |
\frac{\delta t}{2 \tau_T^2} \left( \frac{T(t)}{T_{\mathrm{target}}} - 1 |
1256 |
> |
\right) |
1257 |
> |
\end{eqnarray} |
1258 |
> |
|
1259 |
> |
Here $\mathrm{rot}(\delta t * {\bf j} |
1260 |
> |
\overleftrightarrow{\mathsf{I}}^{-1})$ is the same symplectic Trotter |
1261 |
> |
factorization of the three rotation operations that was discussed in |
1262 |
> |
the section on the DLM integrator. Note that this operation modifies |
1263 |
> |
both the rotation matrix $\mathsf{A}$ and the angular momentum ${\bf |
1264 |
> |
j}$. {\tt moveA} propagates velocities by a half time step, and |
1265 |
> |
positional degrees of freedom by a full time step. The new positions |
1266 |
> |
(and orientations) are then used to calculate a new set of forces and |
1267 |
> |
torques in exactly the same way they are calculated in the {\tt |
1268 |
> |
doForces} portion of the DLM integrator. |
1269 |
> |
|
1270 |
> |
Once the forces and torques have been obtained at the new time step, |
1271 |
> |
the temperature, velocities, and the extended system variable can be |
1272 |
> |
advanced to the same time value. |
1273 |
> |
|
1274 |
> |
{\tt moveB:} |
1275 |
> |
\begin{eqnarray} |
1276 |
> |
T(t + \delta t) & \leftarrow & \left\{{\bf v}(t + \delta t)\right\}, |
1277 |
> |
\left\{{\bf j}(t + \delta t)\right\} \\ |
1278 |
> |
\chi\left(t + \delta t \right) & \leftarrow & \chi\left(t + \delta t / |
1279 |
> |
2 \right) + \frac{\delta t}{2 \tau_T^2} \left( \frac{T(t+\delta |
1280 |
> |
t)}{T_{\mathrm{target}}} - 1 \right) \\ |
1281 |
> |
{\bf v}\left(t + \delta t \right) & \leftarrow & {\bf |
1282 |
> |
v}\left(t + \delta t / 2 \right) + \frac{\delta t}{2} \left( |
1283 |
> |
\frac{{\bf f}(t + \delta t)}{m} - {\bf v}(t + \delta t) |
1284 |
> |
\chi(t \delta t)\right) \\ |
1285 |
> |
{\bf j}\left(t + \delta t \right) & \leftarrow & {\bf |
1286 |
> |
j}\left(t + \delta t / 2 \right) + \frac{\delta t}{2} \left( {\bf |
1287 |
> |
\tau}^b(t + \delta t) - {\bf j}(t + \delta t) |
1288 |
> |
\chi(t + \delta t) \right) |
1289 |
> |
\end{eqnarray} |
1290 |
> |
|
1291 |
> |
Since ${\bf v}(t + \delta t)$ and ${\bf j}(t + \delta t)$ are required |
1292 |
> |
to caclculate $T(t + \delta t)$ as well as $\chi(t + \delta t)$, they |
1293 |
> |
indirectly depend on their own values at time $t + \delta t$. {\tt |
1294 |
> |
moveB} is therefore done in an iterative fashion until $\chi(t + |
1295 |
> |
\delta t)$ becomes self-consistent. The relative tolerance for the |
1296 |
> |
self-consistency check defaults to a value of $\mbox{10}^{-6}$, but |
1297 |
> |
{\sc oopse} will terminate the iteration after 4 loops even if the |
1298 |
> |
consistency check has not been satisfied. |
1299 |
> |
|
1300 |
> |
The Nos\'e-Hoover algorithm is known to conserve a Hamiltonian for the |
1301 |
> |
extended system that is, to within a constant, identical to the |
1302 |
> |
Helmholtz free energy, |
1303 |
> |
\begin{equation} |
1304 |
> |
H_{\mathrm{NVT}} = V + K + f k_B T_{\mathrm{target}} \left( |
1305 |
> |
\frac{\tau_{T}^2 \chi^2(t)}{2} + \int_{0}^{t} \chi(t^\prime) dt^\prime |
1306 |
> |
\right) |
1307 |
> |
\end{equation} |
1308 |
> |
Poor choices of $\delta t$ or $\tau_T$ can result in non-conservation |
1309 |
> |
of $H_{\mathrm{NVT}}$, so the conserved quantity is maintained in the |
1310 |
> |
last column of the {\tt .stat} file to allow checks on the quality of |
1311 |
> |
the integration. |
1312 |
> |
|
1313 |
> |
Bond constraints are applied at the end of both the {\tt moveA} and |
1314 |
> |
{\tt moveB} portions of the algorithm. Details on the constraint |
1315 |
> |
algorithms are given in section \ref{oopseSec:rattle}. |
1316 |
> |
|
1317 |
> |
\subsubsection{\label{sec:NPTi}Constant-pressure integration with |
1318 |
> |
isotropic box deformations (NPTi)} |
1319 |
> |
|
1320 |
> |
To carry out isobaric-isothermal ensemble calculations {\sc oopse} |
1321 |
> |
implements the Melchionna modifications to the Nos\'e-Hoover-Andersen |
1322 |
> |
equations of motion,\cite{melchionna93} |
1323 |
> |
|
1324 |
> |
\begin{eqnarray} |
1325 |
> |
\dot{{\bf r}} & = & {\bf v} + \eta \left( {\bf r} - {\bf R}_0 \right) \\ |
1326 |
> |
\dot{{\bf v}} & = & \frac{{\bf f}}{m} - (\eta + \chi) {\bf v} \\ |
1327 |
> |
\dot{\mathsf{A}} & = & \mathsf{A} \cdot |
1328 |
> |
\mbox{ skew}\left(\overleftrightarrow{I}^{-1} \cdot {\bf j}\right) \\ |
1329 |
> |
\dot{{\bf j}} & = & {\bf j} \times \left( \overleftrightarrow{I}^{-1} |
1330 |
> |
\cdot {\bf j} \right) - \mbox{ rot}\left(\mathsf{A}^{T} \cdot \frac{\partial |
1331 |
> |
V}{\partial \mathsf{A}} \right) - \chi {\bf j} \\ |
1332 |
> |
\dot{\chi} & = & \frac{1}{\tau_{T}^2} \left( |
1333 |
> |
\frac{T}{T_{\mathrm{target}}} - 1 \right) \\ |
1334 |
> |
\dot{\eta} & = & \frac{1}{\tau_{B}^2 f k_B T_{\mathrm{target}}} V \left( P - |
1335 |
> |
P_{\mathrm{target}} \right) \\ |
1336 |
> |
\dot{\mathcal{V}} & = & 3 \mathcal{V} \eta |
1337 |
> |
\label{eq:melchionna1} |
1338 |
> |
\end{eqnarray} |
1339 |
> |
|
1340 |
> |
$\chi$ and $\eta$ are the ``extra'' degrees of freedom in the extended |
1341 |
> |
system. $\chi$ is a thermostat, and it has the same function as it |
1342 |
> |
does in the Nos\'e-Hoover NVT integrator. $\eta$ is a barostat which |
1343 |
> |
controls changes to the volume of the simulation box. ${\bf R}_0$ is |
1344 |
> |
the location of the center of mass for the entire system, and |
1345 |
> |
$\mathcal{V}$ is the volume of the simulation box. At any time, the |
1346 |
> |
volume can be calculated from the determinant of the matrix which |
1347 |
> |
describes the box shape: |
1348 |
> |
\begin{equation} |
1349 |
> |
\mathcal{V} = \det(\mathsf{H}) |
1350 |
> |
\end{equation} |
1351 |
> |
|
1352 |
> |
The NPTi integrator requires an instantaneous pressure. This quantity |
1353 |
> |
is calculated via the pressure tensor, |
1354 |
> |
\begin{equation} |
1355 |
> |
\overleftrightarrow{\mathsf{P}}(t) = \frac{1}{\mathcal{V}(t)} \left( |
1356 |
> |
\sum_{i=1}^{N} m_i {\bf v}_i(t) \otimes {\bf v}_i(t) \right) + |
1357 |
> |
\overleftrightarrow{\mathsf{W}}(t) |
1358 |
> |
\end{equation} |
1359 |
> |
The kinetic contribution to the pressure tensor utilizes the {\it |
1360 |
> |
outer} product of the velocities denoted by the $\otimes$ symbol. The |
1361 |
> |
stress tensor is calculated from another outer product of the |
1362 |
> |
inter-atomic separation vectors (${\bf r}_{ij} = {\bf r}_j - {\bf |
1363 |
> |
r}_i$) with the forces between the same two atoms, |
1364 |
> |
\begin{equation} |
1365 |
> |
\overleftrightarrow{\mathsf{W}}(t) = \sum_{i} \sum_{j>i} {\bf r}_{ij}(t) |
1366 |
> |
\otimes {\bf f}_{ij}(t) |
1367 |
> |
\end{equation} |
1368 |
> |
The instantaneous pressure is then simply obtained from the trace of |
1369 |
> |
the Pressure tensor, |
1370 |
> |
\begin{equation} |
1371 |
> |
P(t) = \frac{1}{3} \mathrm{Tr} \left( \overleftrightarrow{\mathsf{P}}(t) |
1372 |
> |
\right) |
1373 |
> |
\end{equation} |
1374 |
> |
|
1375 |
> |
In eq.(\ref{eq:melchionna1}), $\tau_B$ is the time constant for |
1376 |
> |
relaxation of the pressure to the target value. To set values for |
1377 |
> |
$\tau_B$ or $P_{\mathrm{target}}$ in a simulation, one would use the |
1378 |
> |
{\tt tauBarostat} and {\tt targetPressure} keywords in the {\tt .bass} |
1379 |
> |
file. The units for {\tt tauBarostat} are fs, and the units for the |
1380 |
> |
{\tt targetPressure} are atmospheres. Like in the NVT integrator, the |
1381 |
> |
integration of the equations of motion is carried out in a |
1382 |
> |
velocity-Verlet style 2 part algorithm: |
1383 |
> |
|
1384 |
> |
{\tt moveA:} |
1385 |
> |
\begin{eqnarray} |
1386 |
> |
T(t) & \leftarrow & \left\{{\bf v}(t)\right\}, \left\{{\bf j}(t)\right\} \\ |
1387 |
> |
P(t) & \leftarrow & \left\{{\bf r}(t)\right\}, \left\{{\bf v}(t)\right\}, \left\{{\bf f}(t)\right\} \\ |
1388 |
> |
{\bf v}\left(t + \delta t / 2\right) & \leftarrow & {\bf |
1389 |
> |
v}(t) + \frac{\delta t}{2} \left( \frac{{\bf f}(t)}{m} - {\bf v}(t) |
1390 |
> |
\left(\chi(t) + \eta(t) \right) \right) \\ |
1391 |
> |
{\bf j}\left(t + \delta t / 2 \right) & \leftarrow & {\bf |
1392 |
> |
j}(t) + \frac{\delta t}{2} \left( {\bf \tau}^b(t) - {\bf j}(t) |
1393 |
> |
\chi(t) \right) \\ |
1394 |
> |
\mathsf{A}(t + \delta t) & \leftarrow & \mathrm{rot}\left(\delta t * |
1395 |
> |
{\bf j}(t + \delta t / 2) \overleftrightarrow{\mathsf{I}}^{-1} \right) \\ |
1396 |
> |
\chi\left(t + \delta t / 2 \right) & \leftarrow & \chi(t) + |
1397 |
> |
\frac{\delta t}{2 \tau_T^2} \left( \frac{T(t)}{T_{\mathrm{target}}} - 1 |
1398 |
> |
\right) \\ |
1399 |
> |
\eta(t + \delta t / 2) & \leftarrow & \eta(t) + \frac{\delta t \mathcal{V}(t)}{2 N k_B |
1400 |
> |
T(t) \tau_B^2} \left( P(t) - P_{\mathrm{target}} \right) \\ |
1401 |
> |
{\bf r}(t + \delta t) & \leftarrow & {\bf r}(t) + \delta t \left\{ {\bf |
1402 |
> |
v}\left(t + \delta t / 2 \right) + \eta(t + \delta t / 2)\left[ {\bf |
1403 |
> |
r}(t + \delta t) - {\bf R}_0 \right] \right\} \\ |
1404 |
> |
\mathsf{H}(t + \delta t) & \leftarrow & e^{-\delta t \eta(t + \delta t |
1405 |
> |
/ 2)} \mathsf{H}(t) |
1406 |
> |
\end{eqnarray} |
1407 |
> |
|
1408 |
> |
Most of these equations are identical to their counterparts in the NVT |
1409 |
> |
integrator, but the propagation of positions to time $t + \delta t$ |
1410 |
> |
depends on the positions at the same time. {\sc oopse} carries out |
1411 |
> |
this step iteratively (with a limit of 5 passes through the iterative |
1412 |
> |
loop). Also, the simulation box $\mathsf{H}$ is scaled uniformly for |
1413 |
> |
one full time step by an exponential factor that depends on the value |
1414 |
> |
of $\eta$ at time $t + |
1415 |
> |
\delta t / 2$. Reshaping the box uniformly also scales the volume of |
1416 |
> |
the box by |
1417 |
> |
\begin{equation} |
1418 |
> |
\mathcal{V}(t + \delta t) \leftarrow e^{ - 3 \delta t \eta(t + \delta t /2)} |
1419 |
> |
\mathcal{V}(t) |
1420 |
> |
\end{equation} |
1421 |
> |
|
1422 |
> |
The {\tt doForces} step for the NPTi integrator is exactly the same as |
1423 |
> |
in both the DLM and NVT integrators. Once the forces and torques have |
1424 |
> |
been obtained at the new time step, the velocities can be advanced to |
1425 |
> |
the same time value. |
1426 |
> |
|
1427 |
> |
{\tt moveB:} |
1428 |
> |
\begin{eqnarray} |
1429 |
> |
T(t + \delta t) & \leftarrow & \left\{{\bf v}(t + \delta t)\right\}, |
1430 |
> |
\left\{{\bf j}(t + \delta t)\right\} \\ |
1431 |
> |
P(t + \delta t) & \leftarrow & \left\{{\bf r}(t + \delta t)\right\}, |
1432 |
> |
\left\{{\bf v}(t + \delta t)\right\}, \left\{{\bf f}(t + \delta t)\right\} \\ |
1433 |
> |
\chi\left(t + \delta t \right) & \leftarrow & \chi\left(t + \delta t / |
1434 |
> |
2 \right) + \frac{\delta t}{2 \tau_T^2} \left( \frac{T(t+\delta |
1435 |
> |
t)}{T_{\mathrm{target}}} - 1 \right) \\ |
1436 |
> |
\eta(t + \delta t) & \leftarrow & \eta(t + \delta t / 2) + |
1437 |
> |
\frac{\delta t \mathcal{V}(t + \delta t)}{2 N k_B T(t + \delta t) \tau_B^2} |
1438 |
> |
\left( P(t + \delta t) - P_{\mathrm{target}} |
1439 |
> |
\right) \\ |
1440 |
> |
{\bf v}\left(t + \delta t \right) & \leftarrow & {\bf |
1441 |
> |
v}\left(t + \delta t / 2 \right) + \frac{\delta t}{2} \left( |
1442 |
> |
\frac{{\bf f}(t + \delta t)}{m} - {\bf v}(t + \delta t) |
1443 |
> |
(\chi(t + \delta t) + \eta(t + \delta t)) \right) \\ |
1444 |
> |
{\bf j}\left(t + \delta t \right) & \leftarrow & {\bf |
1445 |
> |
j}\left(t + \delta t / 2 \right) + \frac{\delta t}{2} \left( {\bf |
1446 |
> |
\tau}^b(t + \delta t) - {\bf j}(t + \delta t) |
1447 |
> |
\chi(t + \delta t) \right) |
1448 |
> |
\end{eqnarray} |
1449 |
> |
|
1450 |
> |
Once again, since ${\bf v}(t + \delta t)$ and ${\bf j}(t + \delta t)$ |
1451 |
> |
are required to caclculate $T(t + \delta t)$, $P(t + \delta t)$, $\chi(t + |
1452 |
> |
\delta t)$, and $\eta(t + \delta t)$, they indirectly depend on their |
1453 |
> |
own values at time $t + \delta t$. {\tt moveB} is therefore done in |
1454 |
> |
an iterative fashion until $\chi(t + \delta t)$ and $\eta(t + \delta |
1455 |
> |
t)$ become self-consistent. The relative tolerance for the |
1456 |
> |
self-consistency check defaults to a value of $\mbox{10}^{-6}$, but |
1457 |
> |
{\sc oopse} will terminate the iteration after 4 loops even if the |
1458 |
> |
consistency check has not been satisfied. |
1459 |
> |
|
1460 |
> |
The Melchionna modification of the Nos\'e-Hoover-Andersen algorithm is |
1461 |
> |
known to conserve a Hamiltonian for the extended system that is, to |
1462 |
> |
within a constant, identical to the Gibbs free energy, |
1463 |
> |
\begin{equation} |
1464 |
> |
H_{\mathrm{NPTi}} = V + K + f k_B T_{\mathrm{target}} \left( |
1465 |
> |
\frac{\tau_{T}^2 \chi^2(t)}{2} + \int_{0}^{t} \chi(t^\prime) dt^\prime |
1466 |
> |
\right) + P_{\mathrm{target}} \mathcal{V}(t). |
1467 |
> |
\end{equation} |
1468 |
> |
Poor choices of $\delta t$, $\tau_T$, or $\tau_B$ can result in |
1469 |
> |
non-conservation of $H_{\mathrm{NPTi}}$, so the conserved quantity is |
1470 |
> |
maintained in the last column of the {\tt .stat} file to allow checks |
1471 |
> |
on the quality of the integration. It is also known that this |
1472 |
> |
algorithm samples the equilibrium distribution for the enthalpy |
1473 |
> |
(including contributions for the thermostat and barostat), |
1474 |
> |
\begin{equation} |
1475 |
> |
H_{\mathrm{NPTi}} = V + K + \frac{f k_B T_{\mathrm{target}}}{2} \left( |
1476 |
> |
\chi^2 \tau_T^2 + \eta^2 \tau_B^2 \right) + P_{\mathrm{target}} |
1477 |
> |
\mathcal{V}(t). |
1478 |
> |
\end{equation} |
1479 |
> |
|
1480 |
> |
Bond constraints are applied at the end of both the {\tt moveA} and |
1481 |
> |
{\tt moveB} portions of the algorithm. Details on the constraint |
1482 |
> |
algorithms are given in section \ref{oopseSec:rattle}. |
1483 |
> |
|
1484 |
> |
\subsubsection{\label{sec:NPTf}Constant-pressure integration with a |
1485 |
> |
flexible box (NPTf)} |
1486 |
> |
|
1487 |
> |
There is a relatively simple generalization of the |
1488 |
> |
Nos\'e-Hoover-Andersen method to include changes in the simulation box |
1489 |
> |
{\it shape} as well as in the volume of the box. This method utilizes |
1490 |
> |
the full $3 \times 3$ pressure tensor and introduces a tensor of |
1491 |
> |
extended variables ($\overleftrightarrow{\eta}$) to control changes to |
1492 |
> |
the box shape. The equations of motion for this method are |
1493 |
> |
\begin{eqnarray} |
1494 |
> |
\dot{{\bf r}} & = & {\bf v} + \overleftrightarrow{\eta} \cdot \left( {\bf r} - {\bf R}_0 \right) \\ |
1495 |
> |
\dot{{\bf v}} & = & \frac{{\bf f}}{m} - (\overleftrightarrow{\eta} + |
1496 |
> |
\chi \mathsf{1}) {\bf v} \\ |
1497 |
> |
\dot{\mathsf{A}} & = & \mathsf{A} \cdot |
1498 |
> |
\mbox{ skew}\left(\overleftrightarrow{I}^{-1} \cdot {\bf j}\right) \\ |
1499 |
> |
\dot{{\bf j}} & = & {\bf j} \times \left( \overleftrightarrow{I}^{-1} |
1500 |
> |
\cdot {\bf j} \right) - \mbox{ rot}\left(\mathsf{A}^{T} \cdot \frac{\partial |
1501 |
> |
V}{\partial \mathsf{A}} \right) - \chi {\bf j} \\ |
1502 |
> |
\dot{\chi} & = & \frac{1}{\tau_{T}^2} \left( |
1503 |
> |
\frac{T}{T_{\mathrm{target}}} - 1 \right) \\ |
1504 |
> |
\dot{\overleftrightarrow{eta}} & = & \frac{1}{\tau_{B}^2 f k_B |
1505 |
> |
T_{\mathrm{target}}} V \left( \overleftrightarrow{\mathsf{P}} - P_{\mathrm{target}}\mathsf{1} \right) \\ |
1506 |
> |
\dot{\mathsf{H}} & = & \overleftrightarrow{\eta} \cdot \mathsf{H} |
1507 |
> |
\label{eq:melchionna2} |
1508 |
> |
\end{eqnarray} |
1509 |
> |
|
1510 |
> |
Here, $\mathsf{1}$ is the unit matrix and $\overleftrightarrow{\mathsf{P}}$ |
1511 |
> |
is the pressure tensor. Again, the volume, $\mathcal{V} = \det |
1512 |
> |
\mathsf{H}$. |
1513 |
> |
|
1514 |
> |
The propagation of the equations of motion is nearly identical to the |
1515 |
> |
NPTi integration: |
1516 |
> |
|
1517 |
> |
{\tt moveA:} |
1518 |
> |
\begin{eqnarray} |
1519 |
> |
T(t) & \leftarrow & \left\{{\bf v}(t)\right\}, \left\{{\bf j}(t)\right\} \\ |
1520 |
> |
\overleftrightarrow{\mathsf{P}}(t) & \leftarrow & \left\{{\bf r}(t)\right\}, \left\{{\bf v}(t)\right\}, \left\{{\bf f}(t)\right\} \\ |
1521 |
> |
{\bf v}\left(t + \delta t / 2\right) & \leftarrow & {\bf |
1522 |
> |
v}(t) + \frac{\delta t}{2} \left( \frac{{\bf f}(t)}{m} - |
1523 |
> |
\left(\chi(t)\mathsf{1} + \overleftrightarrow{\eta}(t) \right) \cdot |
1524 |
> |
{\bf v}(t) \right) \\ |
1525 |
> |
{\bf j}\left(t + \delta t / 2 \right) & \leftarrow & {\bf |
1526 |
> |
j}(t) + \frac{\delta t}{2} \left( {\bf \tau}^b(t) - {\bf j}(t) |
1527 |
> |
\chi(t) \right) \\ |
1528 |
> |
\mathsf{A}(t + \delta t) & \leftarrow & \mathrm{rot}\left(\delta t * |
1529 |
> |
{\bf j}(t + \delta t / 2) \overleftrightarrow{\mathsf{I}}^{-1} \right) \\ |
1530 |
> |
\chi\left(t + \delta t / 2 \right) & \leftarrow & \chi(t) + |
1531 |
> |
\frac{\delta t}{2 \tau_T^2} \left( \frac{T(t)}{T_{\mathrm{target}}} - 1 |
1532 |
> |
\right) \\ |
1533 |
> |
\overleftrightarrow{\eta}(t + \delta t / 2) & \leftarrow & \overleftrightarrow{\eta}(t) + \frac{\delta t \mathcal{V}(t)}{2 N k_B |
1534 |
> |
T(t) \tau_B^2} \left( \overleftrightarrow{\mathsf{P}}(t) - P_{\mathrm{target}}\mathsf{1} \right) \\ |
1535 |
> |
{\bf r}(t + \delta t) & \leftarrow & {\bf r}(t) + \delta t \left\{ {\bf |
1536 |
> |
v}\left(t + \delta t / 2 \right) + \overleftrightarrow{\eta}(t + |
1537 |
> |
\delta t / 2) \cdot \left[ {\bf |
1538 |
> |
r}(t + \delta t) - {\bf R}_0 \right] \right\} \\ |
1539 |
> |
\mathsf{H}(t + \delta t) & \leftarrow & \mathsf{H}(t) \cdot e^{-\delta t |
1540 |
> |
\overleftrightarrow{\eta}(t + \delta t / 2)} |
1541 |
> |
\end{eqnarray} |
1542 |
> |
{\sc oopse} uses a power series expansion truncated at second order |
1543 |
> |
for the exponential operation which scales the simulation box. |
1544 |
> |
|
1545 |
> |
The {\tt moveB} portion of the algorithm is largely unchanged from the |
1546 |
> |
NPTi integrator: |
1547 |
> |
|
1548 |
> |
{\tt moveB:} |
1549 |
> |
\begin{eqnarray} |
1550 |
> |
T(t + \delta t) & \leftarrow & \left\{{\bf v}(t + \delta t)\right\}, |
1551 |
> |
\left\{{\bf j}(t + \delta t)\right\} \\ |
1552 |
> |
\overleftrightarrow{\mathsf{P}}(t + \delta t) & \leftarrow & \left\{{\bf r}(t + \delta t)\right\}, |
1553 |
> |
\left\{{\bf v}(t + \delta t)\right\}, \left\{{\bf f}(t + \delta t)\right\} \\ |
1554 |
> |
\chi\left(t + \delta t \right) & \leftarrow & \chi\left(t + \delta t / |
1555 |
> |
2 \right) + \frac{\delta t}{2 \tau_T^2} \left( \frac{T(t+\delta |
1556 |
> |
t)}{T_{\mathrm{target}}} - 1 \right) \\ |
1557 |
> |
\overleftrightarrow{\eta}(t + \delta t) & \leftarrow & \overleftrightarrow{\eta}(t + \delta t / 2) + |
1558 |
> |
\frac{\delta t \mathcal{V}(t + \delta t)}{2 N k_B T(t + \delta t) \tau_B^2} |
1559 |
> |
\left( \overleftrightarrow{P}(t + \delta t) - P_{\mathrm{target}}\mathsf{1} |
1560 |
> |
\right) \\ |
1561 |
> |
{\bf v}\left(t + \delta t \right) & \leftarrow & {\bf |
1562 |
> |
v}\left(t + \delta t / 2 \right) + \frac{\delta t}{2} \left( |
1563 |
> |
\frac{{\bf f}(t + \delta t)}{m} - |
1564 |
> |
(\chi(t + \delta t)\mathsf{1} + \overleftrightarrow{\eta}(t + \delta |
1565 |
> |
t)) \right) \cdot {\bf v}(t + \delta t) \\ |
1566 |
> |
{\bf j}\left(t + \delta t \right) & \leftarrow & {\bf |
1567 |
> |
j}\left(t + \delta t / 2 \right) + \frac{\delta t}{2} \left( {\bf |
1568 |
> |
\tau}^b(t + \delta t) - {\bf j}(t + \delta t) |
1569 |
> |
\chi(t + \delta t) \right) |
1570 |
> |
\end{eqnarray} |
1571 |
|
|
1572 |
+ |
The iterative schemes for both {\tt moveA} and {\tt moveB} are |
1573 |
+ |
identical to those described for the NPTi integrator. |
1574 |
+ |
|
1575 |
+ |
The NPTf integrator is known to conserve the following Hamiltonian: |
1576 |
+ |
\begin{equation} |
1577 |
+ |
H_{\mathrm{NPTf}} = V + K + f k_B T_{\mathrm{target}} \left( |
1578 |
+ |
\frac{\tau_{T}^2 \chi^2(t)}{2} + \int_{0}^{t} \chi(t^\prime) dt^\prime |
1579 |
+ |
\right) + P_{\mathrm{target}} \mathcal{V}(t) + \frac{f k_B |
1580 |
+ |
T_{\mathrm{target}}}{2} |
1581 |
+ |
\mathrm{Tr}\left[\overleftrightarrow{\eta}(t)\right]^2 \tau_B^2. |
1582 |
+ |
\end{equation} |
1583 |
+ |
|
1584 |
+ |
This integrator must be used with care, particularly in liquid |
1585 |
+ |
simulations. Liquids have very small restoring forces in the |
1586 |
+ |
off-diagonal directions, and the simulation box can very quickly form |
1587 |
+ |
elongated and sheared geometries which become smaller than the |
1588 |
+ |
electrostatic or Lennard-Jones cutoff radii. It finds most use in |
1589 |
+ |
simulating crystals or liquid crystals which assume non-orthorhombic |
1590 |
+ |
geometries. |
1591 |
+ |
|
1592 |
+ |
\subsubsection{\label{nptxyz}Constant pressure in 3 axes (NPTxyz)} |
1593 |
+ |
|
1594 |
+ |
There is one additional extended system integrator which is somewhat |
1595 |
+ |
simpler than the NPTf method described above. In this case, the three |
1596 |
+ |
axes have independent barostats which each attempt to preserve the |
1597 |
+ |
target pressure along the box walls perpendicular to that particular |
1598 |
+ |
axis. The lengths of the box axes are allowed to fluctuate |
1599 |
+ |
independently, but the angle between the box axes does not change. |
1600 |
+ |
The equations of motion are identical to those described above, but |
1601 |
+ |
only the {\it diagonal} elements of $\overleftrightarrow{\eta}$ are |
1602 |
+ |
computed. The off-diagonal elements are set to zero (even when the |
1603 |
+ |
pressure tensor has non-zero off-diagonal elements). |
1604 |
+ |
|
1605 |
+ |
It should be noted that the NPTxyz integrator is {\it not} known to |
1606 |
+ |
preserve any Hamiltonian of interest to the chemical physics |
1607 |
+ |
community. The integrator is extremely useful, however, in generating |
1608 |
+ |
initial conditions for other integration methods. It {\it is} suitable |
1609 |
+ |
for use with liquid simulations, or in cases where there is |
1610 |
+ |
orientational anisotropy in the system (i.e. in lipid bilayer |
1611 |
+ |
simulations). |
1612 |
+ |
|
1613 |
|
\subsection{\label{oopseSec:rattle}The {\sc rattle} Method for Bond |
1614 |
|
Constraints} |
1615 |
|
|
1619 |
|
formulation of the {\sc shake} method\cite{ryckaert77} of iteratively |
1620 |
|
solving the Lagrange multipliers of constraint. The system of lagrange |
1621 |
|
multipliers allows one to reformulate the equations of motion with |
1622 |
< |
explicit constraint forces on the equations of |
988 |
< |
motion.\cite{fowles99:lagrange} |
1622 |
> |
explicit constraint forces.\cite{fowles99:lagrange} |
1623 |
|
|
1624 |
< |
Consider a system described by qoordinates $q_1$ and $q_2$ subject to an |
1624 |
> |
Consider a system described by coordinates $q_1$ and $q_2$ subject to an |
1625 |
|
equation of constraint: |
1626 |
|
\begin{equation} |
1627 |
|
\sigma(q_1, q_2,t) = 0 |
1693 |
|
In a simulation, this would involve the solution of a set of $(m + n)$ |
1694 |
|
number of equations. Where $m$ is the number of constraints, and $n$ |
1695 |
|
is the number of constrained coordinates. In practice, this is not |
1696 |
< |
done, as the matrix inversion neccassary to solve the system of |
1696 |
> |
done, as the matrix inversion necessary to solve the system of |
1697 |
|
equations would be very time consuming to solve. Additionally, the |
1698 |
|
numerical error in the solution of the set of $\lambda$'s would be |
1699 |
|
compounded by the error inherent in propagating by the Velocity Verlet |
1700 |
< |
algorithm ($\Delta t^4$). The verlet propagation error is negligible |
1701 |
< |
in an unconstrained system, as one is interested in the statisitics of |
1700 |
> |
algorithm ($\Delta t^4$). The Verlet propagation error is negligible |
1701 |
> |
in an unconstrained system, as one is interested in the statistics of |
1702 |
|
the run, and not that the run be numerically exact to the ``true'' |
1703 |
|
integration. This relates back to the ergodic hypothesis that a time |
1704 |
< |
integral of a valid trajectory will still give the correct enesemble |
1704 |
> |
integral of a valid trajectory will still give the correct ensemble |
1705 |
|
average. However, in the case of constraints, if the equations of |
1706 |
|
motion leave the ``true'' trajectory, they are departing from the |
1707 |
|
constrained surface. The method that is used, is to iteratively solve |
1720 |
|
Eq.~\ref{oopseEq:c1} is the set of bond constraints, where $d_{ij}$ is |
1721 |
|
the constrained distance between atom $i$ and |
1722 |
|
$j$. Eq.~\ref{oopseEq:c2} constrains the velocities of $i$ and $j$ to |
1723 |
< |
be perpindicular to the bond vector, so that the bond can neither grow |
1723 |
> |
be perpendicular to the bond vector, so that the bond can neither grow |
1724 |
|
nor shrink. The constrained dynamics equations become: |
1725 |
|
\begin{equation} |
1726 |
|
m_i \mathbf{\ddot{r}}_i = \mathbf{F}_i + \mathbf{\mathcal{G}}_i |
1727 |
|
\label{oopseEq:r1} |
1728 |
|
\end{equation} |
1729 |
< |
Where, |
1729 |
> |
Where,$\mathbf{\mathcal{G}}_i$ are the forces of constraint on $i$, |
1730 |
> |
and are defined: |
1731 |
|
\begin{equation} |
1732 |
|
\mathbf{\mathcal{G}}_i = - \sum_j \lambda_{ij}(t)\,\nabla \sigma_{ij} |
1733 |
|
\label{oopseEq:r2} |
1746 |
|
\mathbf{F}_i(t+h) + \mathbf{\mathcal{G}}_{Vi}(t+h) \Bigr] % |
1747 |
|
\label{oopseEq:vv2} |
1748 |
|
\end{align} |
1749 |
+ |
Where: |
1750 |
+ |
\begin{align} |
1751 |
+ |
\mathbf{\mathcal{G}}_{Ri}(t) &= |
1752 |
+ |
-2 \sum_j \lambda_{Rij}(t) \mathbf{r}_{ij}(t) \\ |
1753 |
+ |
% |
1754 |
+ |
\mathbf{\mathcal{G}}_{Vi}(t+h) &= |
1755 |
+ |
-2 \sum_j \lambda_{Vij}(t+h) \mathbf{r}(t+h) |
1756 |
+ |
\end{align} |
1757 |
+ |
Next, define: |
1758 |
+ |
\begin{align} |
1759 |
+ |
g_{ij} &= h \lambda_{Rij}(t) \\ |
1760 |
+ |
k_{ij} &= h \lambda_{Vij}(t+h) \\ |
1761 |
+ |
\mathbf{q}_i &= \mathbf{\dot{r}}_i(t) + \frac{h}{2m_i} \mathbf{F}_i(t) |
1762 |
+ |
- \frac{1}{m_i}\sum_j g_{ij}\mathbf{r}_{ij}(t) |
1763 |
+ |
\end{align} |
1764 |
+ |
Using these definitions, Eq.~\ref{oopseEq:vv1} and \ref{oopseEq:vv2} |
1765 |
+ |
can be rewritten as, |
1766 |
+ |
\begin{align} |
1767 |
+ |
\mathbf{r}_i(t+h) &= \mathbf{r}_i(t) + h \mathbf{q}_i \\ |
1768 |
+ |
% |
1769 |
+ |
\mathbf{\dot{r}}(t+h) &= \mathbf{q}_i + \frac{h}{2m_i}\mathbf{F}_i(t+h) |
1770 |
+ |
-\frac{1}{m_i}\sum_j k_{ij} \mathbf{r}_{ij}(t+h) |
1771 |
+ |
\end{align} |
1772 |
|
|
1773 |
+ |
To integrate the equations of motion, the {\sc rattle} algorithm first |
1774 |
+ |
solves for $\mathbf{r}(t+h)$. Let, |
1775 |
+ |
\begin{equation} |
1776 |
+ |
\mathbf{q}_i = \mathbf{\dot{r}}(t) + \frac{h}{2m_i}\mathbf{F}_i(t) |
1777 |
+ |
\end{equation} |
1778 |
+ |
Here $\mathbf{q}_i$ corresponds to an initial unconstrained move. Next |
1779 |
+ |
pick a constraint $j$, and let, |
1780 |
+ |
\begin{equation} |
1781 |
+ |
\mathbf{s} = \mathbf{r}_i(t) + h\mathbf{q}_i(t) |
1782 |
+ |
- \mathbf{r}_j(t) + h\mathbf{q}_j(t) |
1783 |
+ |
\label{oopseEq:ra1} |
1784 |
+ |
\end{equation} |
1785 |
+ |
If |
1786 |
+ |
\begin{equation} |
1787 |
+ |
\Big| |\mathbf{s}|^2 - d_{ij}^2 \Big| > \text{tolerance}, |
1788 |
+ |
\end{equation} |
1789 |
+ |
then the constraint is unsatisfied, and corrections are made to the |
1790 |
+ |
positions. First we define a test corrected configuration as, |
1791 |
+ |
\begin{align} |
1792 |
+ |
\mathbf{r}_i^T(t+h) = \mathbf{r}_i(t) + h\biggl[\mathbf{q}_i - |
1793 |
+ |
g_{ij}\,\frac{\mathbf{r}_{ij}(t)}{m_i} \biggr] \\ |
1794 |
+ |
% |
1795 |
+ |
\mathbf{r}_j^T(t+h) = \mathbf{r}_j(t) + h\biggl[\mathbf{q}_j + |
1796 |
+ |
g_{ij}\,\frac{\mathbf{r}_{ij}(t)}{m_j} \biggr] |
1797 |
+ |
\end{align} |
1798 |
+ |
And we chose $g_{ij}$ such that, $|\mathbf{r}_i^T - \mathbf{r}_j^T|^2 |
1799 |
+ |
= d_{ij}^2$. Solving the quadratic for $g_{ij}$ we obtain the |
1800 |
+ |
approximation, |
1801 |
+ |
\begin{equation} |
1802 |
+ |
g_{ij} = \frac{(s^2 - d^2)}{2h[\mathbf{s}\cdot\mathbf{r}_{ij}(t)] |
1803 |
+ |
(\frac{1}{m_i} + \frac{1}{m_j})} |
1804 |
+ |
\end{equation} |
1805 |
+ |
Although not an exact solution for $g_{ij}$, as this is an iterative |
1806 |
+ |
scheme overall, the eventual solution will converge. With a trial |
1807 |
+ |
$g_{ij}$, the new $\mathbf{q}$'s become, |
1808 |
+ |
\begin{align} |
1809 |
+ |
\mathbf{q}_i &= \mathbf{q}^{\text{old}}_i - g_{ij}\, |
1810 |
+ |
\frac{\mathbf{r}_{ij}(t)}{m_i} \\ |
1811 |
+ |
% |
1812 |
+ |
\mathbf{q}_j &= \mathbf{q}^{\text{old}}_j + g_{ij}\, |
1813 |
+ |
\frac{\mathbf{r}_{ij}(t)}{m_j} |
1814 |
+ |
\end{align} |
1815 |
+ |
The whole algorithm is then repeated from Eq.~\ref{oopseEq:ra1} until |
1816 |
+ |
all constraints are satisfied. |
1817 |
|
|
1818 |
+ |
The second step of {\sc rattle}, is to then update the velocities. The |
1819 |
+ |
step starts with, |
1820 |
+ |
\begin{equation} |
1821 |
+ |
\mathbf{\dot{r}}_i(t+h) = \mathbf{q}_i + \frac{h}{2m_i}\mathbf{F}_i(t+h) |
1822 |
+ |
\end{equation} |
1823 |
+ |
Next we pick a constraint $j$, and calculate the dot product $\ell$. |
1824 |
+ |
\begin{equation} |
1825 |
+ |
\ell = \mathbf{r}_{ij}(t+h) \cdot \mathbf{\dot{r}}_{ij}(t+h) |
1826 |
+ |
\label{oopseEq:rv1} |
1827 |
+ |
\end{equation} |
1828 |
+ |
Here if constraint Eq.~\ref{oopseEq:c2} holds, $\ell$ should be |
1829 |
+ |
zero. Therefore if $\ell$ is greater than some tolerance, then |
1830 |
+ |
corrections are made to the $i$ and $j$ velocities. |
1831 |
+ |
\begin{align} |
1832 |
+ |
\mathbf{\dot{r}}_i^T &= \mathbf{\dot{r}}_i(t+h) - k_{ij} |
1833 |
+ |
\frac{\mathbf{\dot{r}}_{ij}(t+h)}{m_i} \\ |
1834 |
+ |
% |
1835 |
+ |
\mathbf{\dot{r}}_j^T &= \mathbf{\dot{r}}_j(t+h) + k_{ij} |
1836 |
+ |
\frac{\mathbf{\dot{r}}_{ij}(t+h)}{m_j} |
1837 |
+ |
\end{align} |
1838 |
+ |
Like in the previous step, we select a value for $k_{ij}$ such that |
1839 |
+ |
$\ell$ is zero. |
1840 |
+ |
\begin{equation} |
1841 |
+ |
k_{ij} = \frac{\ell}{d^2_{ij}(\frac{1}{m_i} + \frac{1}{m_j})} |
1842 |
+ |
\end{equation} |
1843 |
+ |
The test velocities, $\mathbf{\dot{r}}^T_i$ and |
1844 |
+ |
$\mathbf{\dot{r}}^T_j$, then replace their respective velocities, and |
1845 |
+ |
the algorithm is iterated from Eq.~\ref{oopseEq:rv1} until all |
1846 |
+ |
constraints are satisfied. |
1847 |
|
|
1848 |
+ |
|
1849 |
|
\subsection{\label{oopseSec:zcons}Z-Constraint Method} |
1850 |
|
|
1851 |
< |
Based on fluctuation-dissipation theorem, a force auto-correlation |
1852 |
< |
method was developed to investigate the dynamics of ions inside the ion |
1853 |
< |
channels.\cite{Roux91} Time-dependent friction coefficient can be calculated |
1854 |
< |
from the deviation of the instantaneous force from its mean force. |
1855 |
< |
|
1124 |
< |
% |
1125 |
< |
|
1851 |
> |
Based on the fluctuation-dissipation theorem, a force auto-correlation |
1852 |
> |
method was developed by Roux and Karplus to investigate the dynamics |
1853 |
> |
of ions inside ion channels.\cite{Roux91} The time-dependent friction |
1854 |
> |
coefficient can be calculated from the deviation of the instantaneous |
1855 |
> |
force from its mean force. |
1856 |
|
\begin{equation} |
1857 |
|
\xi(z,t)=\langle\delta F(z,t)\delta F(z,0)\rangle/k_{B}T |
1858 |
|
\end{equation} |
1862 |
|
\end{equation} |
1863 |
|
|
1864 |
|
|
1865 |
< |
If the time-dependent friction decay rapidly, static friction coefficient can |
1866 |
< |
be approximated by% |
1137 |
< |
|
1865 |
> |
If the time-dependent friction decays rapidly, the static friction |
1866 |
> |
coefficient can be approximated by |
1867 |
|
\begin{equation} |
1868 |
< |
\xi^{static}(z)=\int_{0}^{\infty}\langle\delta F(z,t)\delta F(z,0)\rangle dt |
1868 |
> |
\xi_{\text{static}}(z)=\int_{0}^{\infty}\langle\delta F(z,t)\delta F(z,0)\rangle dt |
1869 |
|
\end{equation} |
1870 |
< |
|
1871 |
< |
|
1143 |
< |
Hence, diffusion constant can be estimated by |
1870 |
> |
Allowing diffusion constant to then be calculated through the |
1871 |
> |
Einstein relation:\cite{Marrink94} |
1872 |
|
\begin{equation} |
1873 |
< |
D(z)=\frac{k_{B}T}{\xi^{static}(z)}=\frac{(k_{B}T)^{2}}{\int_{0}^{\infty |
1873 |
> |
D(z)=\frac{k_{B}T}{\xi_{\text{static}}(z)}=\frac{(k_{B}T)^{2}}{\int_{0}^{\infty |
1874 |
|
}\langle\delta F(z,t)\delta F(z,0)\rangle dt}% |
1875 |
|
\end{equation} |
1876 |
|
|
1877 |
+ |
The Z-Constraint method, which fixes the z coordinates of the |
1878 |
+ |
molecules with respect to the center of the mass of the system, has |
1879 |
+ |
been a method suggested to obtain the forces required for the force |
1880 |
+ |
auto-correlation calculation.\cite{Marrink94} However, simply resetting the |
1881 |
+ |
coordinate will move the center of the mass of the whole system. To |
1882 |
+ |
avoid this problem, a new method was used in {\sc oopse}. Instead of |
1883 |
+ |
resetting the coordinate, we reset the forces of z-constrained |
1884 |
+ |
molecules as well as subtract the total constraint forces from the |
1885 |
+ |
rest of the system after the force calculation at each time step. |
1886 |
|
|
1887 |
< |
\bigskip Z-Constraint method, which fixed the z coordinates of the molecules |
1888 |
< |
with respect to the center of the mass of the system, was proposed to obtain |
1889 |
< |
the forces required in force auto-correlation method.\cite{Marrink94} However, |
1890 |
< |
simply resetting the coordinate will move the center of the mass of the whole |
1891 |
< |
system. To avoid this problem, a new method was used at {\sc oopse}. Instead of |
1892 |
< |
resetting the coordinate, we reset the forces of z-constraint molecules as |
1893 |
< |
well as subtract the total constraint forces from the rest of the system after |
1894 |
< |
force calculation at each time step. |
1895 |
< |
\begin{align} |
1896 |
< |
F_{\alpha i}&=0\\ |
1897 |
< |
V_{\alpha i}&=V_{\alpha i}-\frac{\sum\limits_{i}M_{_{\alpha i}}V_{\alpha i}}{\sum\limits_{i}M_{_{\alpha i}}}\\ |
1898 |
< |
F_{\alpha i}&=F_{\alpha i}-\frac{M_{_{\alpha i}}}{\sum\limits_{\alpha}\sum\limits_{i}M_{_{\alpha i}}}\sum\limits_{\beta}F_{\beta}\\ |
1899 |
< |
V_{\alpha i}&=V_{\alpha i}-\frac{\sum\limits_{\alpha}\sum\limits_{i}M_{_{\alpha i}}V_{\alpha i}}{\sum\limits_{\alpha}\sum\limits_{i}M_{_{\alpha i}}} |
1900 |
< |
\end{align} |
1887 |
> |
After the force calculation, define $G_\alpha$ as |
1888 |
> |
\begin{equation} |
1889 |
> |
G_{\alpha} = \sum_i F_{\alpha i} |
1890 |
> |
\label{oopseEq:zc1} |
1891 |
> |
\end{equation} |
1892 |
> |
Where $F_{\alpha i}$ is the force in the z direction of atom $i$ in |
1893 |
> |
z-constrained molecule $\alpha$. The forces of the z constrained |
1894 |
> |
molecule are then set to: |
1895 |
> |
\begin{equation} |
1896 |
> |
F_{\alpha i} = F_{\alpha i} - |
1897 |
> |
\frac{m_{\alpha i} G_{\alpha}}{\sum_i m_{\alpha i}} |
1898 |
> |
\end{equation} |
1899 |
> |
Here, $m_{\alpha i}$ is the mass of atom $i$ in the z-constrained |
1900 |
> |
molecule. Having rescaled the forces, the velocities must also be |
1901 |
> |
rescaled to subtract out any center of mass velocity in the z |
1902 |
> |
direction. |
1903 |
> |
\begin{equation} |
1904 |
> |
v_{\alpha i} = v_{\alpha i} - |
1905 |
> |
\frac{\sum_i m_{\alpha i} v_{\alpha i}}{\sum_i m_{\alpha i}} |
1906 |
> |
\end{equation} |
1907 |
> |
Where $v_{\alpha i}$ is the velocity of atom $i$ in the z direction. |
1908 |
> |
Lastly, all of the accumulated z constrained forces must be subtracted |
1909 |
> |
from the system to keep the system center of mass from drifting. |
1910 |
> |
\begin{equation} |
1911 |
> |
F_{\beta i} = F_{\beta i} - \frac{m_{\beta i} \sum_{\alpha} G_{\alpha}} |
1912 |
> |
{\sum_{\beta}\sum_i m_{\beta i}} |
1913 |
> |
\end{equation} |
1914 |
> |
Where $\beta$ are all of the unconstrained molecules in the system. |
1915 |
|
|
1916 |
< |
At the very beginning of the simulation, the molecules may not be at its |
1917 |
< |
constraint position. To move the z-constraint molecule to the specified |
1918 |
< |
position, a simple harmonic potential is used% |
1168 |
< |
|
1916 |
> |
At the very beginning of the simulation, the molecules may not be at their |
1917 |
> |
constrained positions. To move a z-constrained molecule to its specified |
1918 |
> |
position, a simple harmonic potential is used |
1919 |
|
\begin{equation} |
1920 |
< |
U(t)=\frac{1}{2}k_{Harmonic}(z(t)-z_{cons})^{2}% |
1920 |
> |
U(t)=\frac{1}{2}k_{\text{Harmonic}}(z(t)-z_{\text{cons}})^{2}% |
1921 |
|
\end{equation} |
1922 |
< |
where $k_{Harmonic}$\bigskip\ is the harmonic force constant, $z(t)$ is |
1923 |
< |
current z coordinate of the center of mass of the z-constraint molecule, and |
1924 |
< |
$z_{cons}$ is the restraint position. Therefore, the harmonic force operated |
1925 |
< |
on the z-constraint molecule at time $t$ can be calculated by% |
1922 |
> |
where $k_{\text{Harmonic}}$ is the harmonic force constant, $z(t)$ is the |
1923 |
> |
current $z$ coordinate of the center of mass of the constrained molecule, and |
1924 |
> |
$z_{\text{cons}}$ is the constrained position. The harmonic force operating |
1925 |
> |
on the z-constrained molecule at time $t$ can be calculated by |
1926 |
|
\begin{equation} |
1927 |
< |
F_{z_{Harmonic}}(t)=-\frac{\partial U(t)}{\partial z(t)}=-k_{Harmonic}% |
1928 |
< |
(z(t)-z_{cons}) |
1927 |
> |
F_{z_{\text{Harmonic}}}(t)=-\frac{\partial U(t)}{\partial z(t)}= |
1928 |
> |
-k_{\text{Harmonic}}(z(t)-z_{\text{cons}}) |
1929 |
|
\end{equation} |
1180 |
– |
Worthy of mention, other kinds of potential functions can also be used to |
1181 |
– |
drive the z-constraint molecule. |
1930 |
|
|
1931 |
|
\section{\label{oopseSec:props}Trajectory Analysis} |
1932 |
|
|
1933 |
|
\subsection{\label{oopseSec:staticProps}Static Property Analysis} |
1934 |
|
|
1935 |
|
The static properties of the trajectories are analyzed with the |
1936 |
< |
program \texttt{staticProps}. The code is capable of calculating the following |
1937 |
< |
pair correlations between species A and B: |
1938 |
< |
\begin{itemize} |
1939 |
< |
\item $g_{\text{AB}}(r)$: Eq.~\ref{eq:gofr} |
1940 |
< |
\item $g_{\text{AB}}(r, \cos \theta)$: Eq.~\ref{eq:gofrCosTheta} |
1941 |
< |
\item $g_{\text{AB}}(r, \cos \omega)$: Eq.~\ref{eq:gofrCosOmega} |
1942 |
< |
\item $g_{\text{AB}}(x, y, z)$: Eq.~\ref{eq:gofrXYZ} |
1943 |
< |
\item $\langle \cos \omega \rangle_{\text{AB}}(r)$: |
1944 |
< |
Eq.~\ref{eq:cosOmegaOfR} |
1945 |
< |
\end{itemize} |
1936 |
> |
program \texttt{staticProps}. The code is capable of calculating a |
1937 |
> |
number of pair correlations between species A and B. Some of which |
1938 |
> |
only apply to directional entities. The summary of pair correlations |
1939 |
> |
can be found in Table~\ref{oopseTb:gofrs} |
1940 |
> |
|
1941 |
> |
\begin{table} |
1942 |
> |
\caption[The list of pair correlations in \texttt{staticProps}]{The different pair correlations in \texttt{staticProps} along with whether atom A or B must be directional.} |
1943 |
> |
\label{oopseTb:gofrs} |
1944 |
> |
\begin{center} |
1945 |
> |
\begin{tabular}{|l|c|c|} |
1946 |
> |
\hline |
1947 |
> |
Name & Equation & Directional Atom \\ \hline |
1948 |
> |
$g_{\text{AB}}(r)$ & Eq.~\ref{eq:gofr} & neither \\ \hline |
1949 |
> |
$g_{\text{AB}}(r, \cos \theta)$ & Eq.~\ref{eq:gofrCosTheta} & A \\ \hline |
1950 |
> |
$g_{\text{AB}}(r, \cos \omega)$ & Eq.~\ref{eq:gofrCosOmega} & both \\ \hline |
1951 |
> |
$g_{\text{AB}}(x, y, z)$ & Eq.~\ref{eq:gofrXYZ} & neither \\ \hline |
1952 |
> |
$\langle \cos \omega \rangle_{\text{AB}}(r)$ & Eq.~\ref{eq:cosOmegaOfR} &% |
1953 |
> |
both \\ \hline |
1954 |
> |
\end{tabular} |
1955 |
> |
\end{center} |
1956 |
> |
\end{table} |
1957 |
|
|
1958 |
|
The first pair correlation, $g_{\text{AB}}(r)$, is defined as follows: |
1959 |
|
\begin{equation} |
2032 |
|
correlation that gives the average correlation of two directional |
2033 |
|
entities as a function of their distance from each other. |
2034 |
|
|
1276 |
– |
All static properties are calculated on a frame by frame basis. The |
1277 |
– |
trajectory is read a single frame at a time, and the appropriate |
1278 |
– |
calculations are done on each frame. Once one frame is finished, the |
1279 |
– |
next frame is read in, and a running average of the property being |
1280 |
– |
calculated is accumulated in each frame. The program allows for the |
1281 |
– |
user to specify more than one property be calculated in single run, |
1282 |
– |
preventing the need to read a file multiple times. |
1283 |
– |
|
2035 |
|
\subsection{\label{dynamicProps}Dynamic Property Analysis} |
2036 |
|
|
2037 |
|
The dynamic properties of a trajectory are calculated with the program |
2038 |
< |
\texttt{dynamicProps}. The program will calculate the following properties: |
2038 |
> |
\texttt{dynamicProps}. The program calculates the following properties: |
2039 |
|
\begin{gather} |
2040 |
|
\langle | \mathbf{r}(t) - \mathbf{r}(0) |^2 \rangle \label{eq:rms}\\ |
2041 |
|
\langle \mathbf{v}(t) \cdot \mathbf{v}(0) \rangle \label{eq:velCorr} \\ |
2042 |
|
\langle \mathbf{j}(t) \cdot \mathbf{j}(0) \rangle \label{eq:angularVelCorr} |
2043 |
|
\end{gather} |
2044 |
|
|
2045 |
< |
Eq.~\ref{eq:rms} is the root mean square displacement |
2046 |
< |
function. Eq.~\ref{eq:velCorr} and Eq.~\ref{eq:angularVelCorr} are the |
2045 |
> |
Eq.~\ref{eq:rms} is the root mean square displacement function. Which |
2046 |
> |
allows one to observe the average displacement of an atom as a |
2047 |
> |
function of time. The quantity is useful when calculating diffusion |
2048 |
> |
coefficients because of the Einstein Relation, which is valid at long |
2049 |
> |
times.\cite{allen87:csl} |
2050 |
> |
\begin{equation} |
2051 |
> |
2tD = \langle | \mathbf{r}(t) - \mathbf{r}(0) |^2 \rangle |
2052 |
> |
\label{oopseEq:einstein} |
2053 |
> |
\end{equation} |
2054 |
> |
|
2055 |
> |
Eq.~\ref{eq:velCorr} and \ref{eq:angularVelCorr} are the translational |
2056 |
|
velocity and angular velocity correlation functions respectively. The |
2057 |
< |
latter is only applicable to directional species in the simulation. |
2057 |
> |
latter is only applicable to directional species in the |
2058 |
> |
simulation. The velocity autocorrelation functions are useful when |
2059 |
> |
determining vibrational information about the system of interest. |
2060 |
|
|
1299 |
– |
The \texttt{dynamicProps} program handles he file in a manner different from |
1300 |
– |
\texttt{staticProps}. As the properties calculated by this program are time |
1301 |
– |
dependent, multiple frames must be read in simultaneously by the |
1302 |
– |
program. For small trajectories this is no problem, and the entire |
1303 |
– |
trajectory is read into memory. However, for long trajectories of |
1304 |
– |
large systems, the files can be quite large. In order to accommodate |
1305 |
– |
large files, \texttt{dynamicProps} adopts a scheme whereby two blocks of memory |
1306 |
– |
are allocated to read in several frames each. |
1307 |
– |
|
1308 |
– |
In this two block scheme, the correlation functions are first |
1309 |
– |
calculated within each memory block, then the cross correlations |
1310 |
– |
between the frames contained within the two blocks are |
1311 |
– |
calculated. Once completed, the memory blocks are incremented, and the |
1312 |
– |
process is repeated. A diagram illustrating the process is shown in |
1313 |
– |
Fig.~\ref{oopseFig:dynamicPropsMemory}. As was the case with |
1314 |
– |
\texttt{staticProps}, multiple properties may be calculated in a |
1315 |
– |
single run to avoid multiple reads on the same file. |
1316 |
– |
|
1317 |
– |
|
1318 |
– |
|
2061 |
|
\section{\label{oopseSec:design}Program Design} |
2062 |
|
|
2063 |
|
\subsection{\label{sec:architecture} {\sc oopse} Architecture} |
2096 |
|
developed to utilize the routines provided by \texttt{libBASS} and |
2097 |
|
\texttt{libmdtools}. The main program of the package is \texttt{oopse} |
2098 |
|
and the corresponding parallel version \texttt{oopse\_MPI}. These two |
2099 |
< |
programs will take the \texttt{.bass} file, and create then integrate |
2099 |
> |
programs will take the \texttt{.bass} file, and create (and integrate) |
2100 |
|
the simulation specified in the script. The two analysis programs |
2101 |
|
\texttt{staticProps} and \texttt{dynamicProps} utilize the core |
2102 |
|
libraries to initialize and read in trajectories from previously |
2108 |
|
|
2109 |
|
\subsection{\label{oopseSec:parallelization} Parallelization of {\sc oopse}} |
2110 |
|
|
2111 |
< |
Although processor power is continually growing month by month, it is |
2112 |
< |
still unreasonable to simulate systems of more then a 1000 atoms on a |
2113 |
< |
single processor. To facilitate study of larger system sizes or |
2114 |
< |
smaller systems on long time scales in a reasonable period of time, |
2115 |
< |
parallel methods were developed allowing multiple CPU's to share the |
2116 |
< |
simulation workload. Three general categories of parallel |
2117 |
< |
decomposition method's have been developed including atomic, spatial |
2118 |
< |
and force decomposition methods. |
2111 |
> |
Although processor power is continually growing roughly following |
2112 |
> |
Moore's Law, it is still unreasonable to simulate systems of more then |
2113 |
> |
a 1000 atoms on a single processor. To facilitate study of larger |
2114 |
> |
system sizes or smaller systems on long time scales in a reasonable |
2115 |
> |
period of time, parallel methods were developed allowing multiple |
2116 |
> |
CPU's to share the simulation workload. Three general categories of |
2117 |
> |
parallel decomposition methods have been developed including atomic, |
2118 |
> |
spatial and force decomposition methods. |
2119 |
|
|
2120 |
< |
Algorithmically simplest of the three method's is atomic decomposition |
2120 |
> |
Algorithmically simplest of the three methods is atomic decomposition |
2121 |
|
where N particles in a simulation are split among P processors for the |
2122 |
|
duration of the simulation. Computational cost scales as an optimal |
2123 |
|
$O(N/P)$ for atomic decomposition. Unfortunately all processors must |
2124 |
< |
communicate positions and forces with all other processors leading |
2125 |
< |
communication to scale as an unfavorable $O(N)$ independent of the |
2126 |
< |
number of processors. This communication bottleneck led to the |
2127 |
< |
development of spatial and force decomposition methods in which |
2128 |
< |
communication among processors scales much more favorably. Spatial or |
2129 |
< |
domain decomposition divides the physical spatial domain into 3D boxes |
2130 |
< |
in which each processor is responsible for calculation of forces and |
2131 |
< |
positions of particles located in its box. Particles are reassigned to |
2132 |
< |
different processors as they move through simulation space. To |
2133 |
< |
calculate forces on a given particle, a processor must know the |
2134 |
< |
positions of particles within some cutoff radius located on nearby |
2135 |
< |
processors instead of the positions of particles on all |
2136 |
< |
processors. Both communication between processors and computation |
2137 |
< |
scale as $O(N/P)$ in the spatial method. However, spatial |
2124 |
> |
communicate positions and forces with all other processors at every |
2125 |
> |
force evaluation, leading communication costs to scale as an |
2126 |
> |
unfavorable $O(N)$, \emph{independent of the number of processors}. This |
2127 |
> |
communication bottleneck led to the development of spatial and force |
2128 |
> |
decomposition methods in which communication among processors scales |
2129 |
> |
much more favorably. Spatial or domain decomposition divides the |
2130 |
> |
physical spatial domain into 3D boxes in which each processor is |
2131 |
> |
responsible for calculation of forces and positions of particles |
2132 |
> |
located in its box. Particles are reassigned to different processors |
2133 |
> |
as they move through simulation space. To calculate forces on a given |
2134 |
> |
particle, a processor must know the positions of particles within some |
2135 |
> |
cutoff radius located on nearby processors instead of the positions of |
2136 |
> |
particles on all processors. Both communication between processors and |
2137 |
> |
computation scale as $O(N/P)$ in the spatial method. However, spatial |
2138 |
|
decomposition adds algorithmic complexity to the simulation code and |
2139 |
|
is not very efficient for small N since the overall communication |
2140 |
|
scales as the surface to volume ratio $(N/P)^{2/3}$ in three |
2141 |
|
dimensions. |
2142 |
|
|
2143 |
< |
Force decomposition assigns particles to processors based on a block |
2144 |
< |
decomposition of the force matrix. Processors are split into a |
2145 |
< |
optimally square grid forming row and column processor groups. Forces |
2146 |
< |
are calculated on particles in a given row by particles located in |
2147 |
< |
that processors column assignment. Force decomposition is less complex |
2148 |
< |
to implement then the spatial method but still scales computationally |
2149 |
< |
as $O(N/P)$ and scales as $(N/\sqrt{p})$ in communication |
2150 |
< |
cost. Plimpton also found that force decompositions scales more |
2151 |
< |
favorably then spatial decomposition up to 10,000 atoms and favorably |
2152 |
< |
competes with spatial methods for up to 100,000 atoms. |
2143 |
> |
The parallelization method used in {\sc oopse} is the force |
2144 |
> |
decomposition method. Force decomposition assigns particles to |
2145 |
> |
processors based on a block decomposition of the force |
2146 |
> |
matrix. Processors are split into an optimally square grid forming row |
2147 |
> |
and column processor groups. Forces are calculated on particles in a |
2148 |
> |
given row by particles located in that processors column |
2149 |
> |
assignment. Force decomposition is less complex to implement than the |
2150 |
> |
spatial method but still scales computationally as $O(N/P)$ and scales |
2151 |
> |
as $O(N/\sqrt{P})$ in communication cost. Plimpton has also found that |
2152 |
> |
force decompositions scale more favorably than spatial decompositions |
2153 |
> |
for systems up to 10,000 atoms and favorably compete with spatial |
2154 |
> |
methods up to 100,000 atoms.\cite{plimpton95} |
2155 |
|
|
2156 |
|
\subsection{\label{oopseSec:memAlloc}Memory Issues in Trajectory Analysis} |
2157 |
|
|
2158 |
|
For large simulations, the trajectory files can sometimes reach sizes |
2159 |
|
in excess of several gigabytes. In order to effectively analyze that |
2160 |
< |
amount of data+, two memory management schemes have been devised for |
2160 |
> |
amount of data, two memory management schemes have been devised for |
2161 |
|
\texttt{staticProps} and for \texttt{dynamicProps}. The first scheme, |
2162 |
|
developed for \texttt{staticProps}, is the simplest. As each frame's |
2163 |
|
statistics are calculated independent of each other, memory is |
2165 |
|
complete for the snapshot. To prevent multiple passes through a |
2166 |
|
potentially large file, \texttt{staticProps} is capable of calculating |
2167 |
|
all requested correlations per frame with only a single pair loop in |
2168 |
< |
each frame and a single read through of the file. |
2168 |
> |
each frame and a single read of the file. |
2169 |
|
|
2170 |
|
The second, more advanced memory scheme, is used by |
2171 |
|
\texttt{dynamicProps}. Here, the program must have multiple frames in |
2175 |
|
in blocks. The number of frames in each block is specified by the |
2176 |
|
user, and upon reading a block of the trajectory, |
2177 |
|
\texttt{dynamicProps} will calculate all of the time correlation frame |
2178 |
< |
pairs within the block. After in block correlations are complete, a |
2178 |
> |
pairs within the block. After in-block correlations are complete, a |
2179 |
|
second block of the trajectory is read, and the cross correlations are |
2180 |
|
calculated between the two blocks. this second block is then freed and |
2181 |
|
then incremented and the process repeated until the end of the |
2193 |
|
\label{oopseFig:dynamicPropsMemory} |
2194 |
|
\end{figure} |
2195 |
|
|
1452 |
– |
\subsection{\label{openSource}Open Source and Distribution License} |
1453 |
– |
|
2196 |
|
\section{\label{oopseSec:conclusion}Conclusion} |
2197 |
|
|
2198 |
|
We have presented the design and implementation of our open source |
2199 |
< |
simulation package {\sc oopse}. The package offers novel |
2200 |
< |
capabilities to the field of Molecular Dynamics simulation packages in |
2201 |
< |
the form of dipolar force fields, and symplectic integration of rigid |
2202 |
< |
body dynamics. It is capable of scaling across multiple processors |
2203 |
< |
through the use of MPI. It also implements several integration |
2204 |
< |
ensembles allowing the end user control over temperature and |
2205 |
< |
pressure. In addition, it is capable of integrating constrained |
2206 |
< |
dynamics through both the {\sc rattle} algorithm and the z-constraint |
2207 |
< |
method. |
2199 |
> |
simulation package {\sc oopse}. The package offers novel capabilities |
2200 |
> |
to the field of Molecular Dynamics simulation packages in the form of |
2201 |
> |
dipolar force fields, and symplectic integration of rigid body |
2202 |
> |
dynamics. It is capable of scaling across multiple processors through |
2203 |
> |
the use of force based decomposition using MPI. It also implements |
2204 |
> |
several advanced integrators allowing the end user control over |
2205 |
> |
temperature and pressure. In addition, it is capable of integrating |
2206 |
> |
constrained dynamics through both the {\sc rattle} algorithm and the |
2207 |
> |
z-constraint method. |
2208 |
|
|
2209 |
|
These features are all brought together in a single open-source |
2210 |
< |
development package. This allows researchers to not only benefit from |
2210 |
> |
program. Allowing researchers to not only benefit from |
2211 |
|
{\sc oopse}, but also contribute to {\sc oopse}'s development as |
2212 |
|
well.Documentation and source code for {\sc oopse} can be downloaded |
2213 |
|
from \texttt{http://www.openscience.org/oopse/}. |