The objective of this tutorial is to
learn how to use the MMTSB Tool Set for carrying out and analyzing DNA simulations with CHARMM. Specifically we will simulate the sequence d(CCCCCCTTTTTT)2.
In this tutorial the B-DNA dodecamer d(CGCGAATTCGCG)2 is used as starting point. A crystal structure
is available from the Protein Data Bank
with the PDB code
1BNA. A picture is shown below.
|
1. Initial Structure
Begin by downloading/copying the PDB file (1BNA). Extract the two nucleic acid strands with:
convpdb.pl -nsel nucleic 1BNA.pdb > 1bna.dna.pdb
Since we want to simulate d(CCCCCCTTTTTT)2 for which a structure is not available we will mutate the nucleotides in the 1BNA structure:
mutateNA.pl -seq A1:B12:CCCCCCTTTTTT 1bna.dna.pdb > ct.dna.pdb
This commmand expects the first base on the first strand (A1) as well as the first complementary base on the second strand (B12) followed by the new sequence (CCCCCCTTTTTT) as parameters.
2. Energy Minimization
We begin with an energy minimization in
distance dependent dielectric to approximate solvent:
minCHARMM.pl -par dielec=rdie,epsilon=4,minsteps=50 \
-log min.log ct.dna.pdb > ct.min.pdb
This script will carry out steepest descent minimization first and then a more effective
minimization method (Adopted-Basis Newton-Raphson) over 50 steps.
Check the CHARMM log file min.log and look at the minimized structure with VMD
to make sure everything is ok.
3. Explicit Solvent Simulation
Next, we will setup the simulation box for an explicit solvent simulation.
First, we need to solvate the DNA in a box of water. We will use a rectangular box to match the
dimensions of DNA better:
convpdb.pl -solvate -cutoff 9 ct.min.pdb > ct.solvated.pdb
Write down the box dimensions that were reported.
Second, we will add counterions to balance the charge of the DNA. Each phosphate has a charge of -1, but
no phosphate group is present with the default termini for the 5' nucleotide of each strand. Therefore,
the total charge is -22. You can check with the following command:
enerCHARMM.pl -charge ct.min.pdb
Counterions (we will use sodium) are added with the following command:
convpdb.pl -ions SOD:22 ct.solvated.pdb > ct.water.ions.pdb
We now have a complete initial system from which we can start a simulation. Begin by minimizing
the entire system. Replace the box dimensions obtained earlier in the following command:
minCHARMM.pl -par minsteps=50,boxx=xxx,boxy=yyy,boxz=zzz \
ct.water.ions.pdb > ct.water.ions.min.pdb
Check the resulting system again with VMD before beginning the molecular dynamics simulations.
In order to equilibrate the DNA system we will need to slowly heat the system with the following series
of commands. Again, replace the correct box dimensions of the system:
mdCHARMM.pl -par dynsteps=500,boxx=xxx,boxy=yyy,boxz=zzz,dyntemp=100 \
-final ct.md1.pdb ct.water.ions.min.pdb
mdCHARMM.pl -par dynsteps=500,boxx=xxx,boxy=yyy,boxz=zzz,dyntemp=200 \
-final ct.md2.pdb ct.md1.pdb
mdCHARMM.pl -par dynsteps=500,boxx=xxx,boxy=yyy,boxz=zzz,dyntemp=250 \
-final ct.md3.pdb ct.md2.pdb
mdCHARMM.pl -par dynsteps=500,boxx=xxx,boxy=yyy,boxz=zzz,dyntemp=300 \
-final ct.md4.pdb ct.md3.pdb
If everything went well so far we are now ready to run a short "production" simulation. This step may take about an hour to complete:
mdCHARMM.pl -par dynsteps=5000,boxx=xxx,boxy=yyy,boxz=zzz \
-par dyntemp=300,dynoutfrq=10 -trajout ct.md5.dcd \
-log ct.md5.log -final ct.md5.pdb ct.md4.pdb
4. Analysis
We can analyze the trajectory with
analyzeCHARMM.pl.
First, it is interesting to examine the value of the sugar pucker,
a key characteristic of A- vs. B-DNA. The following command will calculate
the pucker for all of the nucleotides in the first strand (chain A):
analyzeCHARMM.pl -sel A: -pucker -pdb ct.md4.pdb ct.md4.dcd
You will find that the output actually contains 12 pairs of values per time step. The
first value is the phase (100-180 means B-form, 0-40 means A-form), the second value
is the amplitude.
Did any of the sugar puckers leave the B-form from the intial crystal structure?
Second, the width of the minor groove can be monitored from the C1'-C1' distance of
complementary bases on opposite strands. For example, the distance between the C1' of THY7 (first strand) and C1' of ADE6 (second strand) would measure the minor groove width at the center of the helix. The following
command can be used to obtain a time series of this distance (the back quotes are needed
to mask the prime character from the shell).
analyzeCHARMM.pl -dist -dsel A:7.C1\' B:6.C1\' -pdb ct.md4.pdb ct.md4.dcd
How do these values compare with the minor groove near either one of the termini?