Molecular Dynamics Guide:
Simulating Atomic Motion

Everything you need to understand molecular dynamics — from Newton's equations and force fields, through thermostats, barostats, and enhanced sampling, to professional trajectory analysis.

📑 Contents

  1. 01Newton's Equations
  2. 02Force Fields
  3. 03Integrators
  4. 04Ensembles, Thermostats, Barostats
  5. 05Water Models
  6. 06Complete MD Workflow
  7. 07Trajectory Analysis
  8. 08Enhanced Sampling
  9. 09Software Ecosystem

Overview

Molecular Dynamics (MD) simulation is one of the most powerful tools in the computational chemist's arsenal. Where quantum mechanics gives us the static electronic structure of a molecule, MD gives us its dynamic behavior over time — how atoms move, how proteins breathe, how a drug molecule enters a binding pocket, and how a membrane deforms under stress. MD treats atoms as classical particles governed by Newton's equations of motion, updated at each femtosecond timestep to generate a physically realistic movie of molecular motion.

📌
What MD Can Tell You That Other Methods Cannot

MD reveals dynamic properties: protein conformational changes, binding and unbinding kinetics, membrane permeability, solvent effects, allosteric communication pathways, and thermal stability — all fundamentally inaccessible to static methods like crystal structure analysis or docking alone.

⚛️ Newton's Equations of Motion

Molecular dynamics applies classical Newtonian mechanics to molecular systems. For each atom i with mass mᵢ, position rᵢ, and forces Fᵢ acting on it:

Newton's Second Law — The Engine of MD Fᵢ = mᵢ × aᵢ = mᵢ × d²rᵢ/dt²
Force from Potential Energy Fᵢ = −∇ᵢV(r₁, r₂, …, rₙ)

🔗 Force Fields — Describing Atomic Interactions

A force field is a mathematical model that describes the potential energy of a molecular system as a function of atomic positions. It is the single most important choice in any MD simulation — the force field determines how "realistic" the physics is.

Total Potential Energy Decomposition V = V_bonds + V_angles + V_dihedrals + V_improper + V_vdW + V_electrostatics
TermPhysical MeaningMathematical Form
Bond stretchingRestoring force for covalent bond length deviationsV = ½k_b(r − r₀)²
Angle bendingRestoring force for deviations in bond anglesV = ½k_θ(θ − θ₀)²
Torsion/DihedralRotational barriers around single bondsV = Σₙ Vₙ/2(1+cos(nφ−δ))
Van der WaalsShort-range repulsion + London dispersion4ε[(σ/r)¹² − (σ/r)⁶]
ElectrostaticsCoulombic interaction between partial chargesV = q₁q₂/(4πε₀r)

Common Force Fields and When to Use Them

AMBER ff19SBAMBER family

State-of-the-art for proteins. Excellent backbone φ/ψ parameters. Best for protein folding, dynamics, and drug-protein simulations.

CHARMM36mCHARMM family

Excellent for proteins, lipids, and nucleic acids. Industry standard for membrane simulations.

GAFF2AMBER / small mol.

General AMBER Force Field for small organic molecules and drug-like compounds. Used to parameterize ligands in drug-protein MD.

OPLS-AAOPLS family

Optimized Potentials for Liquid Simulations. Excellent for organic molecules, solution thermodynamics, and liquid simulations.

GROMOS 54A7GROMOS family

United-atom representation. Computationally efficient; good for carbohydrates and lipids in GROMACS.

ReaxFFReactive FF

Reactive force field allowing bond breaking and formation. Used for materials science, combustion, and reactive systems.

🔢 Integrators — Propagating the Trajectory

An integrator is the numerical algorithm that updates positions and velocities at each timestep. The most widely used in MD is the Velocity Verlet integrator:

Velocity Verlet — The Standard MD Integrator r(t+Δt) = r(t) + v(t)Δt + ½a(t)Δt² v(t+Δt) = v(t) + ½[a(t) + a(t+Δt)]Δt
⏱️
Timestep Choice is Critical

Δt must be much smaller than the fastest motion in the system (N-H bond vibration ≈ 10 fs period). Standard: Δt = 1–2 fs with hydrogen bond constraints (LINCS/SHAKE). Larger timesteps (4–5 fs) require hydrogen mass repartitioning (HMR). Too-large timesteps → energy drift → simulation instability ("explosion").

🌡️ Statistical Ensembles, Thermostats & Barostats

MD simulations mimic different thermodynamic conditions by controlling which variables are fixed:

EnsembleFixed VariablesThermostatBarostatUse Case
NVEN, V, ENoneNoneTesting integration accuracy
NVTN, V, TRequiredNoneNVT equilibration; heating/cooling
NPTN, P, TRequiredRequiredStandard production runs (lab conditions)
NμTμ fixedRequiredNoneGrand canonical; open systems

Thermostats — Controlling Temperature

V-rescale (Velocity Rescaling): Stochastically rescales velocities; correct canonical ensemble. Default for equilibration in GROMACS.
Nosé-Hoover: Extended system thermostat; rigorous canonical ensemble. Preferred for production runs.
Langevin Dynamics: Adds random forces and friction to mimic solvent coupling. Widely used in AMBER and NAMD.
Berendsen: Simple, fast convergence — but does NOT produce correct canonical ensemble. Only for initial equilibration.

Barostats — Controlling Pressure

Parrinello-Rahman: Correct NPT ensemble. Gold standard for production MD in GROMACS.
Monte Carlo Barostat: Default in AMBER/OpenMM for NPT. Robust and easy to use.
Berendsen: Fast pressure equilibration — use for NPT equilibration only, not production.

💧 Water Models

Biological simulations require explicit water molecules, and the water model dramatically affects protein dynamics, solvation energies, and transport properties:

ModelSitesDiffusion Coeff.Best Paired With
TIP3P3 (rigid)~5.5 × 10⁻⁹ m²/s (too fast)CHARMM, AMBER ff14SB. Fast and widely used.
TIP4P-Ew4 (rigid)~2.4 × 10⁻⁹ m²/sAMBER, better electrostatics than TIP3P
SPC/E3 (rigid)~2.5 × 10⁻⁹ m²/sGROMOS, OPLS. Good density and diffusion
TIP4P/20054 (rigid)~2.1 × 10⁻⁹ m²/sBest overall water model; reproduces many properties
OPC4 (rigid)~2.3 × 10⁻⁹ m²/sAMBER ff19SB. Best for protein dynamics
🖼️
Fig. 1 — A protein (shown as ribbon/surface) surrounded by explicit water molecules in a periodic simulation box — the standard setup for biomolecular MD.

🔄 Complete MD Simulation Workflow

1

Prepare the System

Download PDB structure. Remove water/heteroatoms. Model missing residues. Protonate at correct pH. Parameterize ligands with GAFF2 or CGenFF.

2

Build the Simulation Box

Create periodic boundary conditions (PBC) box. Solvate with explicit water. Neutralize charge with counterions. Add physiological NaCl (0.15 M). Check system composition.

3

Energy Minimization

Steepest descent to remove steric clashes. Continue until max force < 1000 kJ/mol/nm. Critical step — skip this and the simulation will explode.

4

NVT Equilibration (100–200 ps)

Heat system to target temperature (300 K or 310 K). Apply position restraints on heavy atoms. Use V-rescale thermostat. Monitor temperature convergence.

5

NPT Equilibration (100–500 ps)

Equilibrate density and pressure (1 bar). Release position restraints gradually. Use Parrinello-Rahman barostat + Nosé-Hoover thermostat. Monitor density convergence (~1.0 g/cm³).

6

Production MD Run (10 ns – μs)

Full unconstrained simulation. No position restraints. Save coordinates every 10–100 ps. Use checkpoint files. GPU acceleration recommended.

7

Trajectory Analysis

RMSD, RMSF, Rg, H-bonds, SASA, PCA, clustering, MM-PBSA. Visualize in VMD or PyMOL. Confirm equilibration before extracting properties.

📊 Trajectory Analysis — What to Measure

RMSD

Structural Stability
√[1/N Σᵢ(rᵢ−r₀ᵢ)²]

Measures structural drift from reference. Plateau = stable simulation.

RMSF

Residue Flexibility
√[⟨(rᵢ−⟨rᵢ⟩)²⟩]

Per-residue fluctuation. High = flexible loops. Low = rigid secondary structure.

Radius of Gyration

Compactness
√[1/M Σᵢmᵢ(rᵢ−r_cm)²]

Overall compactness. Increasing Rg = unfolding. Stable = well-folded protein.

Hydrogen Bonds

Interactions
dist <3.5Å, angle >120°

Count of H-bonds over time. Key metric for binding stability and secondary structure.

PCA / Clustering

Conformations
Cov → eigenvectors → PC

Identifies dominant motions and representative conformations visited during simulation.

MM-PBSA / MM-GBSA

Binding Free Energy
ΔG = ΔE_MM + ΔG_sol

Post-processing binding free energy estimation. Widely used for lead compound ranking.

🚀 Enhanced Sampling Methods

Standard MD is limited to microsecond timescales on modern hardware. Many biological events occur on millisecond-to-second timescales. Enhanced sampling methods accelerate conformational space exploration:

Replica Exchange MD (REMD): Multiple replicas at different temperatures. Exchanges allow escaping local energy minima. Best for small proteins and peptide folding.
Metadynamics: History-dependent bias along collective variables (CVs) discourages revisiting sampled states. Excellent for free energy surfaces.
Umbrella Sampling: Harmonic bias along reaction coordinate. Combined with WHAM to reconstruct PMF. Gold standard for binding/unbinding pathways.
Steered MD (SMD): External force pulls ligand out of binding pocket. Combined with Jarzynski equality for free energy estimates.
Gaussian Accelerated MD (GaMD): Automatically applies boost potential without predefined CVs. Excellent for drug binding/unbinding on long timescales.

💻 Software Ecosystem

SoftwareStrengthLicenceGPU Support
GROMACSSpeed, efficiency — most widely used academic MD engineFree (LGPL)Excellent (CUDA)
AMBERGold standard for biomolecules; excellent force fieldsCommercialExcellent (pmemd.cuda)
NAMDOutstanding parallel scaling on large clusters; steered MDFree (academic)Good
OpenMMPython API; highly flexible; custom forces; ML potentialsFree (MIT)Excellent
CHARMMComprehensive; excellent for method developmentCommercialGood
DesmondFastest GPU MD; used in Schrödinger SuiteCommercialExcellent

References & Further Reading

  1. Frenkel, D. & Smit, B. (2002). Understanding Molecular Simulation (2nd ed.). Academic Press.
  2. Allen, M. P. & Tildesley, D. J. (2017). Computer Simulation of Liquids (2nd ed.). Oxford University Press.
  3. Case, D. A. et al. (2021). "Amber 2021." University of California, San Francisco.
  4. Abraham, M. J. et al. (2015). "GROMACS: High performance molecular simulations." SoftwareX, 1–2, 19–25.
  5. Hollingsworth, S. A. & Dror, R. O. (2018). "Molecular Dynamics Simulation for All." Neuron, 99(6), 1129–1143.
  6. Huggins, D. J. et al. (2019). "Biomolecular simulations: From dynamics and mechanisms to computational assays." WIREs Comput. Mol. Sci., 9, e1393.

🎓 The Bottom Line: Molecular dynamics is not just a computational technique — it is a microscope for watching molecules move in real time. Machine-learned force fields (ANI, AMOEBA+, OpenFF) are closing the gap between DFT accuracy and MD speed. AI-accelerated MD is pushing accessible timescales toward biological relevance. The next decade promises microsecond-to-millisecond simulations of full protein complexes on commodity hardware.

← Previous Article Next Article →