Go to the previous, next section.

Parameter Files and Their Origins

There are number of parameter files available. Immutable versions are stored in the directory, `CGDATA'. The file names used for these files consists of an alphabetic part followed by a number, e.g. `PARAM5'. There are two copies of each file; one with extension, `.INP', which is a character files used as an command file to generate the binary file, with extension, `.MOD'. The `.INP' is meant for human eyes; the `.MOD' files is meant for CONGEN to read. The numeric part of each name is its version number. In general, one should use the highest version number of a file.

There are four types of parameter files available; protein, DNA, AMBER, and obsolete. The protein parameter files are called `PARAMn'. The DNA parameter files are called `PARMDNAn'. The AMBER parameters files begin with `AMBER', and the obsolete files are the other files which begin with `PARM'.

This appendix begins with an essay by David J. States describing the development of the `PARAMn'. The remainder of the appendix lists the titles from all the parameter files.

Development of the PARAM1 Parameter Set

by David J. States

This essay describes the PARAM1 parameterization for the CHARMM molecular modelling program, compiled in May of 1981 by David States. My goal was to obtain a documented set of parameters which described the experimentally observable properties of small molecules as well as possible in the context of the CHARMM potential energy form. This was achieved by a combination of explicit least squares fitting where possible, and trial and error model building where it was not. Extended atom representations were used for aliphatic and aromatic hydrogens with the exception of benzene, where it was necessary to model the system using both extended and all hydrogen potentials. Explicit hydrogens were used for all protonated heteroatoms. Parameters have been included in the parameter file for atom types used in the PROT and ALLH topology files by generalizing the most appropriate HPRO parameter (ex. using the C-NH1 bond constants for C-NH1E).

Geometric Constants

The first step in the process of obtaining bond, angle, and dihedral parameters was to establish the mean bond and angle geometries. Bond lengths were taken from Bruce Gelin's Ph. D. thesis (the dipeptide modelling section), and agree quite well with other tabulations of standard bond lengths such as the CRC Handbook and Karplus and Porter. Angles were chosen as the average value observed in protein crystal structures available at better than 2.0 angstroms resolution on the Brookhaven Protein Data Bank tape. This includes the following structures:

beta trypsin (benzamidine inhibited)
Fehlhammer and Bode (1975) J. Mol. Biol. 98: 683.

beta trypsin complexed with pancreatic trypsin inhibitor
Ruehlman et al (1973) J. Mol. Biol. 77: 417

pancreatic trypsin inhibitor
Deisenhofer and Steigemann (1975) Acta Cryst. B 31: 238

trypsin inhibited with diisopropyl phosphate
Chambers and Stroud (1979) Acta Cryst. B 35: 1861

actinidin
Baker and Dodson (1977) J. Mol. Biol. 115: 263

parvalbumin
Moews and Ketsinger (1975) J. Mol. Biol. 91: 201

erythrocrurin - deoxy, carbon-monoxy, aquomet, and cyanomet
Steigemann and Weber (1979) J. Mol. Biol. 127: 309

flavodoxin - oxidized and semiquinone
Smith et al (1977) J. Mol. Biol. 117: 195

These coordinates and their sequences were read off the tape and tables of the angle geometries built using the CHARMM program. The average values for each angle type were collected over all of these structures weighted equally. This compilation allowed the averaging of twenty to fifteen hundred occurrences for each angle type, with variances ranging from less than one degree to as much as fifteen degrees, and ranges of two to sixty five degrees. In general these average values are within two degrees of the `PARMFIX10' values. It should be noted that all of the above structures are crystallographically refined. A certain amount of bias is therefore inevitable, but I feel justified by the higher resolutions that these methods achieve.

Force Constants

Having established these geometric parameters, the next step was the definition of force constants. Force constants describing simple hydrocarbons and amides were obtained by least squares fitting to the vibrational spectra of propane, butane, pentane, and N-methyl acetamide (Snyder et al and Warshel, Levitt, and Lifson). This fitting was carried out in stages, first choosing parameters and evaluating the resulting normal modes until a set was obtained with correctly ordered vibration frequencies. These parameters were then refined by least squares fitting to vibrational spectra of all the isotopic variants available. This fitting was performed with a user subroutine linked to CHARMM that called the CHARMM energy and vibrational analysis routines for normal mode analysis, and used an IMSL minimization routine (ZXSSQ). This fitting algorithm uses finite differences to construct an approximate first derivative vector and second derivative matrix, requiring a rapidly increasing number of function evaluations as the number of free variables rises. To reduce the complexity of this fitting, parameters were grouped (bonds vs. angles vs. dihedrals and impropers) and minimized separately until good agreement was obtained over the entire range of observed frequencies. The experimental and calculated normal modes for N-methyl acetamide are shown in the first table. The most prominent differences between the current and the previous parameterization are seen in the low frequency and out of plane bending modes.

The dihedral angle potential for alkanes was found to be 1.6 kcal/mole by this procedure in good agreement with the value reported in the dipeptide and acetylcholine work of Gelin (1.8 kcal/mole). It should be noted that the previous parameter sets used a much lower value (0.2 kcal/mole). The old value resulted in essentially unhindered rotation between trans and gauche isomers and a very small gauche well depth (0.03 kcal/mole). The new parameters are in much better agreement with experimental observations as can be seen in the second table.

For the aromatic systems it was not possible to perform this least squares fitting because the simple potential energy form that we are using is not able to correctly order the various normal modes (Varsanyi and Dollish et al). This is apparently a result of the strong valence interactions between internal coordinates, and presumably could be overcome using a potential form that includes valence cross terms. The large number of modes (30) in benzene and the strong interaction of carbon and hydrogen motions also complicated the analysis of this system. For these reasons, benzene was analyzed by manually adjusting parameters for a model including all hydrogens. The heavy atom parameters obtained from this system agree with those used previously, and were kept fixed in modelling more elaborate aromatic systems.

Using the CHARMM potential energy function, aromatic out of plane bending behavior is determined by both the arrangement and parameterization of the improper dihedrals. For histidine, phenylalanine, and tyrosine the current parameterization and arrangement gave good agreement for the intrinsic ring modes (based on the benzene modelling), and they were retained. The ring substituent out of plane modes were not well described, and it was necessary to parameterize the system using the lowest out of plane bending modes for toluene (217 cm-1) and phenol (241 cm-1) to fix new constants for the ring CB and ring hydroxyl bends respectively. The present improper dihedral was retained, but in both cases the force constants required large increases (from 25 kcal/mol/rad2 to 120 and 150 respectively).

Tryptophan required both rearrangement and reparameterization of the improper dihedrals to obtain reasonable agreement with the skeletal out of plane bend (400 to 500 cm-1 in purines and napthalene) without creating very high frequency modes involving the ring intersection. The new arrangement replaces the improper dihedrals centered on the atoms at the intersection of the rings (CD2 and CE2) with terms that are trans two fold dihedrals beginning on one ring, running across the bridging atoms and ending on the opposite ring as shown below:

Tryptophan

                 CZ2           NE1
               *     *       *     *
              *       *     *       *
           CH2          CE2          *
            |            |            *
            |            |             CD1
            |            |            *
            |            |           *
           CZ3          CD2         *
              *        *    *      *
               *      *      *    *
                 CE3           CG
                                |
                                |
                               CB

           PARAM1                    PARMFIXn

        CZ2                        CZ2----------NE1
           *                          *        *
            *                          *      *
             CE2                        *  CE2
              |                          *
              |                           *
             CD2                           CD2
                *
                 *
                  CG

                  NE1
                 *
                *
             CE2                           CE2
              |                           *
              |                          *
             CD2                        *  CD2
            *                          *      *
           *                          *        *
        CE3                        CE3-----------CG

In addition, two improper dihedrals were added as shown below.

 CH2--------CE2                             CE2 *
             |                               |     *
             |        * CD1                  |        * CD1
             |     *                         |
            CD2 *                 CZ3-------CD2

This new arrangement distributes the out of plane bending forces evenly across the ring in a manner analogous to the delocalized electronic structure of the system, and does so without excessively elevating the frequency of CD2/CE2 out of plane modes. With the `PARMFIX10' parameters these modes had frequencies of about 1000 cm-1, but the skeletal out of plane bend occurred at 120 cm-1. Simply raising the force constants without rearranging the dihedrals resulted in a bridge atoms out of plane mode at 3000 cm-1! Aside from being unrealistic, this high frequency would force the use of small step sizes to retain accuracy in the dynamics integration.

Hydrogen Bonds and Non-bonded Interactions

My effort concentrated on the bond, angle, and dihedral force constants. The non-bond interaction parameters from Bruce Gelin's thesis were used without modification or cutoffs for the model systems. These nonbond parameters are the same as those found in `PARMFIX10'.

The hydrogen bond geometries and well depths were obtained from a table of typical values found in Vinogradov and Linell. Because the van der Waals and electrostatic terms contribute significantly to the total hydrogen bond energy, the hydrogen bond parameters were adjusted to give an energy minimized water dimer with the desired geometry and well depth. The other terms were shifted similarly, although by smaller amounts because the van der Waals interaction is not as large in the longer nitrogen-oxygen and nitrogen-nitrogen pairs. No hydrogen bonds were present in the model systems used above.

Summary and Discussion

`PARAM1' is a refined set of parameters for the CHARMM potential energy form that describes the molecular mechanics of systems containing the covalent structures found in biological molecules. While an effort has been made to include different model compounds and conformations to generalize these parameters, they are primarily based on simple amides and hydrocarbons and intended for use in protein simulations. For non-protein systems it would be advisable to verify the properties of the constituents using model compounds (such as sugars and nucleotides) before macromolecular simulations are attempted. The accuracy of the geometric and force constants presented in this set is difficult to estimate. For some parameters such as bond stretching constants isolated modes could easily be identified and the parameter set to an accuracy of five percent or better. Conversely, angle bending and dihedral constants could not always be associated with particular modes, and significant correlations between the various parameters were observed during fitting. Even where least squares fitting was possible, none of these force constants should be regarded as accurate to better than twenty per cent. One corollary of this interdependence is the danger in naively readjusting parameters without repeating the analysis of all of the various model systems affected.

Given the number and extent of the changes presented in this work, the question of how they will affect results naturally arises. It should be noted that almost all of the changes presented here will tend to make molecules more rigid than they were previously. This will have implications in both the accuracy of the dynamics algorithm at a given step size, and the magnitude of the expected fluctuations. The behavior of the peptide bonds should not be markedly changed since the net effect of the parameter change on their vibrational modes is small. The aliphatic sidechains and coupling of motions through angle and dihedral terms is expected to show more prominent effects.

References:

Dollish, Fately, and Bentley (1975) Characteristic Raman Frequencies of Organic Compounds Wiley and Sons, New York.

Gelin Ph. D. Thesis (Chapters 2 and 4, Tables 12 to 15)

Gelin and Karplus (1975) JACS 97: 6996

Snyder and Schachtscheider (1965) Spectrochimica Acta 21: 169-195

Varsanyi (1974) Assignments to Vibrational Spectra of 700 Benzene Derivatives Halsted Press, Budapest.

Vinogradov and Linell (1971) Hydrogen Bonding van Nostrand.

Warshel and Lifson (1970) J. Chem. Phys. 53: 582

Warshel, Levitt, and Lifson (1971) J. Mol. Spect. 33: 84

Table I -- N-Methyl Acetamide Vibrational Analysis

     mode         Experimental   New    Old
____________________________________________
peptide torsion        192        189    162
    N bend             289        281    291
  C in plane           437        444    375
C out of plane         600        551    312
   Amide IV            628        621    544
NH out of plane        725        730    390
  C C stretch          883        866    864
  C N stretch         1120       1115   1112
   Amide III          1323       1314   1262
   Amide II           1570       1597   1580
    Amide I           1660       1661   1675
  N H stretch         3200       3193   2969

Table II -- n-Butane Dihedral

Adiabatic Potential Energy Surface for the n-Butane Dihedral

Angle       Experimental            New             Old
____________________________________________________________
180.0           0.0                -0.23           -0.23
165.0                               0.21            0.00
150.0                               1.32            0.27
135.0                               2.49            0.65
120.0           3.4                 3.04            0.86
105.0                               2.69            0.87
 90.0                               1.82            0.84
 75.0           0.8                 1.32            1.05
 60.0                               1.67            1.65
 45.0                               2.97            2.61
 30.0                               4.92            3.68
 15.0                               6.73            4.54
  0.0           6.1                 7.43            4.87

Table III -- Comparison of Force Constants

            Term   New Set   Old Set
____________________________________
        C O bond    580       600
       C CH bond    405       400
       C NH bond    471       500
      NH CH bond    422       450
       NH H bond    405       350

    CH C O angle     85        40
   CH C NH angle     20        25
    NH C O angle     65        50
   C NH CH angle     77.5      60
    C NH H angle     35        30
   CH NH H angle     30        30
  CH CH CH angle     45        30

    C NH torsion      8.0       7.0
   CH CH torsion      1.6       0.2

      C improper    105        25
      N improper     50        25
toluene improper    120        25
 phenol improper    150        25

PARAM2 -- Modification of PARAM1 for CO and Heme

`PARAM2' has new parameters added pertaining to carbon monoxide binding to heme proteins.

Bonds

CM-OM
carbon monoxide stretch, see Caughey et al (Biochemical and Clinical Aspects of Hemoglobin Abnormalities).

FE-CM
Iron-carbon monoxide stretch, see Kroeker et al (J. Chem. Phys. 72: 4846, 1980) Values for the chemisorption of CO onto Fe matrix, data based on tunnelling spectroscopy; no data for proteins

FE-NR
Fe-proximal histidine N epsilon stretch, see K. Nagai et al (J. Mol. Biol. 136: 271, 1980); Kincaid et al (Proc. Nat. Acad. Sci. USA 76: 549, 1979) No change in equilibrium bond distance from that in Gelin's thesis.

Angles

FE-CM-OM
Iron carbon monoxide bend, see Kroeker et al (vide supra)

NP-FE-CM
Porphyrin nitrogen Fe carbon monoxide bend. No data available, values selected to be consistent with that for proximal histidine (5th ligand) nitrogen Fe porphyrin nitrogen bend.

Dihedrals

X-FE-CM-X
no data available, values selected to be consistent with that for X-FE-NR-X (i.e. just as the heme can rotate without great hindrance about the Iron-proximal histidine bond so can the CO rotate freely about the Iron-carbon bond)

Polarizabilities Etc.

The values for CM same as that for carbonyl carbon, and the values for OM same as that for carbonyl oxygen.

PARAM3

`PARAM3' is a modification of `PARAM2' which has peptide geometries from Ramachandran et al BBA 359: 298 (1974), and torsions from Hagler et al J. Amer. Chem. Soc. 98: 4600 (1976).

PARAM4

`PARAM4' is a modification of `PARAM3' which has a few additional parameters for carbon monoxide and molecular oxygen as well as revised torsions for better agreement to infrared spectra for phenol, toluene, and napthalene.

PARAM5

`PARAM5' is a modification of `PARAM4' which has a few additional parameters for ring carbons, and those needed for the all hydrogen topology (`ALLH') files.

AMBERPARM

The AMBER parameter set(38) is available as `AMBERPARM'.

This parameter file is now obsolete, and is replaced by the AMBER 94 parameter file, which is described in the next section.

AMBER94PARM

The AMBER 94 parameter set(39) is available as `AMBER94PARM'. This parameter file must be used with the AMBER 94 topology file, see section AMBER94RTF.

Several atoms in the AMBER94 potential have zero van der Waals radii. This causes problems for CONGEN because of the possibility that such atoms may get close enough to another atom for electrostatic fusion to occur. Therefore, such atoms have been given a small van der Waals radius and a small van der Waals repulsive interaction. Please see the file for the exact value.

This potential file has some features which you should be aware of. First, the amide torsion terms (as found in the aspargine and glutamine sidechains) are designed so that the minimum energy is non-zero, and the shape is correct only when both hydrogens are considered. Second, there is no explicit hydrogen bond energy. Such interactions are accounted for by the electrostatic energy. Third, zero energy improper torsion terms have been added for chiral centers so that the analysis tables, see section Static Properties of Internal Coordinates, will show the chirality and allow the user to check it.

Obsolete Parameters for Proteins

PARMEXEL

Title found in the run creating the parameters:

*   PARAMETER FILE FOR EXEL VERSION 11 CREATION RUN
*   HYDROGEN BOND STRENGTHS INCREASED BY 1.0 KCAL/MOLE FROM PARMFIX11
*   LENGTHS DECREASED BY 0.1-0.2 ANGSTROM WALLWORK ACTA. CRYST. (66) 15:758
*       7-Apr-1981  21:33:52 - DJS

Title written to the binary file:

*   PARAMETER  VERSION 10 -- EXTENDED AND EXPLICIT SET -- 27 ATOM TYPES
*   COMPILED BY WFVG, SNS AND BDO -- MARCH 1979
*   HYDROGEN BOND PARAMETERS CHANGED FROM VERSION 5 -- ONLY O-O,O-N,N-N
*   H BOND STRENGTHS INCREASED 1.0 KCAL/MOLE FOR EXEL LENGTHS SHORTENED
*   NONBONDED PARAMETERS FOR HYDROGEN (A-0.044,N-1,R-0.8 == EMIN-0.1)
*   PHIS INVOLVING C(CARBONYL) AND NITROGENS MADE CONSISTENT IN VER. 8
*   O-H BOND FORCE CONSTANTS CHANGED TO 400.0 IN VERSION 9
*   ADDED ANGLE FOR N-TERMINAL PROLINE -- BRUC
*   LAST UPDATE 7-Apr-1981  21:20:20

PARMFIX10

Title written to the binary file:

*   PARAMETER  VERSION 10 -- EXTENDED AND EXPLICIT SET -- 27 ATOM TYPES
*   COMPILED BY WFVG, SNS AND BDO -- MARCH 1979
*   HYDROGEN BOND PARAMETERS CHANGED FROM VERSION 5 -- ONLY O-O,O-N,N-N
*   NONBONDED PARAMETERS FOR HYDROGEN (A-0.044,N-1,R-0.8 == EMIN-0.1)
*   PHIS INVOLVING C(CARBONYL) AND NITROGENS MADE CONSISTENT IN VER. 8
*   O-H BOND FORCE CONSTANTS CHANGED TO 400.0 IN VERSION 9
*   ADDED ANGLE FOR N-TERMINAL PROLINE -- BRUC
*   LAST UPDATE 16-Mar-1981  21:54:36

Additional angle for proline cropped up when a bug was found in PATEXH and explicit hydrogen topology file. The angle added was CH1E-NH3-CH2E which was given the parameters for HC-NH3-CH1E which appeared to be the closest angle to it. --Bruc

PARMFIX9

Title written to the binary file:

*   PARAMETER  VERSION 9
*   EXTENDED AND EXPLICIT SET -- 27 ATOM TYPES
*   COMPILED BY WFVG, SNS AND BDO -- MARCH 1979
*   HYDROGEN BOND PARAMETERS CHANGED FROM VERSION 5 -- ONLY O-O,O-N,N-N
*   NONBONDED PARAMETERS FOR HYDROGEN (A-0.044,N-1,R-0.8 == EMIN-0.1)
*   PHIS INVOLVING C(CARBONYL) AND NITROGENS MADE CONSISTENT IN VER. 8
*   O-H BOND FORCE CONSTANTS CHANGED TO 400.0 IN VERSION 9
*   LAST UPDATE 5/4/80 

PARMFIX8

Title written to the binary file:

*   PARAMETER  VERSION 8
*   EXTENDED AND EXPLICIT SET -- 27 ATOM TYPES
*   COMPILED BY WFVG, SNS AND BDO -- MARCH 1979
*   HYDROGEN BOND PARAMETERS CHANGED FROM VERSION 5 -- ONLY O-O,O-N,N-N
*   NONBONDED PARAMETERS FOR HYDROGEN (A-0.044,N-1,R-0.8 == EMIN-0.1)
*   PHIS INVOLVING C(CARBONYL) AND NITROGENS MADE CONSISTENT IN VER. 8
*   LAST UPDATE 3/23/80 

PARMFIX7

Title written to the current binary file:

*   PARAMETER  VERSION 7
*   EXTENDED AND EXPLICIT SET -- 27 ATOM TYPES
*   COMPILED BY WFVG, SNS AND BDO -- MARCH 1979
*   HYDROGEN BOND PARAMETERS CHANGED FROM VERSION 5 -- ONLY O-O,O-N,N-N
*   NONBONDED PARAMETERS FOR HYDROGEN (A-0.044,N-1,R-0.8 == EMIN-0.1)
*   LAST UPDATE /1/24/80 

PARMFIX5 (Unreadable)

Title written to the binary file:

*   PARAMETER  LIST  -  VERSION 5
*   EXTENDED AND EXPLICIT SET -- 27 ATOM TYPES
*   COMPILED BY WFVG, SNS AND BDO -- MARCH 1979
*   LAST UPDATE 4/10/79 

PARMFIX2 (Unreadable)

Title written to the binary file:

*   EXTENDED ATOM PARAMETERS -- VERSION 2
*   15 ATOM TYPES -- THIS PARAMETER FILE WAS THE MOST LIKELY
*   FILE WITH WHICH THE LONG ( 100 PICOSECS ) PTI DYNAMICS RUN
*   WAS MADE -- IT HAS SOME LARGE ANGLE FORCE CONSTANTS AND NO
*   PARAMETERS FOR IMPROPER TORSIONS -- PARAMETERS NOT NEEDED IN
*   PTI HAVE ALSO BEEN ADDED -- 3/15/79 -- BDO AND SNS

PARMALLH

This file is obsolete. The latest version of `PARAMn' should be used for all hydrogen topology files.

Title in the input file generating the binary file.:

*   PARAMETER FILE FOR ALL EXPLICIT HYDROGENS BASED ON CFF PARAMETERS
*   IN BRUCE GELIN'S THESIS (PARMFIX7 VALUES HAVE TRAILING 0'S)
*   BOND ANGLE PARAMETERS INCLUDE THE HARMONIC CONTRIBUTION FROM UREY-BRADLEY
*   F TERM ( (R13-R0)**2 )
*   DIHEDRALS INCLUDE MULTIPLICITY FROM CFF
*   MODIFIED TO INCLUDE HA AND CT
*   HIS AND TRYP INCLUDED 4/13/80 DJS

Title written to the parameter file:

*   ALLH PARAMETER SET MODIFIED FROM GELIN THESIS 3/4/80
*   MODIFIED TO INCLUDE HA AND CT   -- 2/24/80 -- DJS

DNA Parameter Files

The DNA parameter files have had very limited usage. Do not trust the results using them unless you satisfy yourself that the parameters are reasonable.

The DNA parameters are inconsistent with the protein parameters. Do not mix proteins and DNA together. You will run into problems with chemical type codes being reused and general confusion and errors.

PARMDNA4

`PARMDNA4' is the same as `PARMDNA3' except a sodium ion has been added to the nonbonded parameters.

Creation title:

*   PARAMETER FILE -- VERSION 4 -- CREATION RUN -- 10/20/81 -- BRB

Title in the file:

*   PARAMETER  VERSION 4
*   DNA

PARMDNA3

Titles from the creation run:

*   PARAMETER FILE -- VERSION 2 -- CREATION RUN -- 10/20/81 -- BRB
*   BROMINATED CYT AND WATER ADDED 10/20/81

Title in the file:

*   PARAMETER  VERSION 2
*   PARAMETERS FROM UCSF EXCEPT FOR HBONDS AND IMPHIS
*   DNA

PARMDNA1

Title from the input file:

*   PARAMETER FILE -- VERSION 1 -- CREATION RUN -- 1/29/81 -- BRB

Title written to the binary file:

*   PARAMETER  VERSION 1
*   PARAMETERS FROM UCSF EXCEPT FOR HBONDS AND IMPHIS
*   DNA

Go to the previous, next section.