Conformational Searching Opcodes
The entire contents of the input file are used as the “seed” for the search for most search methods. At the start of a run, the full contents of this file is read, and the search proceeds as if all the structures in the input file had been found in the current search. This allows a new search to use the results of a previously run, partial search as its input. Of course, for a new search, the input file will contain a single structure.
The output file from the search contains in the header lines of the individual structures the number of times each structure was found. Therefore, the full benefits of usage-directedness are obtained in a subsequently run search. This degeneracy information is ignored, for technical reasons, during a network-distributed search. In a distributed search, or in any search seeded from an output file produced by an earlier version of MacroModel, the structures are treated as if they each occurred exactly once in a previous search. Debug (DEBG) flag 360, if set, instructs the program to ignore the degeneracy information even if it would otherwise utilize it.
Searches may be updated during program execution. When a search is updated, the information in the temporary file (filename.tmp) is written to the output file, after discarding structures that are not within the specified energetic window of the current global minimum. A summary of the job progress is also printed, which includes information on the number of times various structures were found and the convergence of these structures during minimization. Automatic, periodic updates can be controlled with the MCOP command.
For a discussion of conformational search protocols, see MacroModel Conformational Searches. As discussed there, it is advisable to perform the search without exhaustive minimization of all found structures and to subsequently reminimize the surviving structures.
This can be done without initiating a new MacroModel job, by inserting into the .com file a RWND command following the MINI command that terminates the search setup, then inserting a BGIN/END loop within which a READ/MINI sequence is specified. The COMP atoms used in the search will continue to be active during the reminimization.
The maximum number of conformations that can be stored when using MCMM, LMCS, SPMC, or MULT is no longer limited to 10000. See the respective command descriptions and MacroModel Program Capacity for further details.
The opcodes in this section are linked below:
LMCS — Low-Mode Conformational Search
LMC2 — Low-Mode Conformational search for large molecules
LOOP — protein LOOP conformation generation
MCMM — Monte Carlo Multiple Minimum
SPMC — Systematic Pseudo-Monte Carlo search
MCSS — Monte Carlo Structure Selection
FLAP — use FLAP algorithm for ring conformations
MCRC — Monte Carlo Ring Conformations
MCTS — Monte Carlo Torsion Selection
MCSM — Monte Carlo Single Minimum
MCNV — Monte Carlo Number of Variables
SEED — random number generator SEED
TORS — variable TORSion selection
MOLS — variable MOLeculeS selection
MCMF — Monte Carlo Maximum constraint Failures
AUTO — AUTOmatic setup
Perform automatic setup for MCMM, MINI, SPMC, LMCS, LMC2, and combinations of MCMM with LMCS or LMC2, on the current structure. This procedure takes into account the current substructure information. The setup for MCMM is compatible with MCMM/LMCS and MCMM/LMC2. In addition, it can be applied in a serial manner (across different molecules in the input structure file) in MCMM, MCMM/LMCS, and MCMM/LMC2 serial searches, but not in pure LMC2 serial searches.
AUTO is incompatible with LOOP and must be used carefully with MBAE. In addition, AUTO is incompatible with frozen atoms unless a substructure is being used.
Note: Serial searches can generate large numbers of output structures, and the resulting output structure file can be large enough to exceed file size limits.
AUTO should be preceded by the MSYM, MCMM, MCNV, MCOP, LMCS, LMC2, SPMC, SUBS, and READ opcodes, if present, and is best placed just before the MINI line in the .com file.
The type of calculation in use is detected automatically. For MINI calculations AUTO sets up comparison atoms, chiral atoms, and torsional constraints unless instructed not to. For MCMM calculations AUTO sets up comparison atoms, chiral atoms, torsional constraints, molecule moves, variable torsions, and ring closures. For LMCS and LMC2 calculations, molecule moves, variable torsions, and ring closures are set up but not used, unless the setup is turned off. Series of bonded atoms that connect fixed or frozen atoms in substructures have ring closure bonds created within them to prevent rotation of fixed or frozen regions. Failure to find such a ring closure turns off all variable torsions in the SUBS atoms connecting such regions.
AUTO arg8 controls the extent to which variable torsions are applied to the input structures. This option is relevant mostly to amide and ester linkages, and supplies a greater degree of control over sampling of standard and nonstandard amides and esters. arg8=1 applies torsion moves to the amide/ester linkage of nonstandard groups like anhydrides, carbamates, hydrazones, and so on. Normal esters and amides are not sampled with this setting, but a torsional constraint is applied to them to ensure that the relative conformation of these groups is maintained. arg8=2 adds torsion moves to sample normal esters and amides in addition to amide and ester linkages of nonstandard groups. arg8=3 adds torsion moves to C=N and N=N bonds, in addition to all amide and ester derivatives. arg8=4 in addition removes torsional constraints involving hydrogen atoms.
In automatic setup, the maximum number of torsions varied at the same time is set to the minimum of arg2 of MCNV and the number of torsions present in the current system. If MCNV arg2 has not been set then the number of torsions present in the current system is used. In serial calculations, the number of torsions varied at the same time is determined for each system.
|
arg1 |
AUTO On/Off |
|
−1 |
Turn off automatic setup. |
|
0 |
Turn on automatic setup and generate a new setup using the current system. |
|
1 |
Turn on automatic setup. If automatic setup is already on, use the existing setup. |
|
arg2 |
Comparison atom setup |
|
−1 |
Use existing list of comparison atoms. |
|
0 |
Use previous setting. If none exists, arg2 is set to 2. |
|
1 |
Create a new list of comparison atoms containing all non-hydrogen atoms. |
|
2 |
Create a new list of comparison atoms containing all non-hydrogen atoms and hydrogen atoms bound to oxygen atoms. |
|
3 |
Create a new list of comparison atoms containing all of the atoms. |
|
4 |
Create a new list of comparison atoms containing all non-hydrogen atoms. Also use dihedral angle differences involving polar H atoms (bonded to a O, S or N atom) in redundant conformer elimination. If the dihedral angles in two conformers differ by more than a threshold (controlled by |
|
5 |
Create a new list of comparison atoms containing all ring atoms. If no ring atoms are found, use heavy atoms and hydrogen atoms bonded to oxygen atoms. |
|
arg3 |
Chiral atom setup |
|
−1 |
Use existing list of chiral atoms. |
|
0 |
Use previous setting. If none exists, arg3 is set to 1. |
|
1 |
Create a new list of chiral atoms (overrides CHIG for the current structure). |
|
arg4 |
Torsion checks |
|
−1 |
Use existing list of torsion checks. |
|
0 |
Use previous setting. If none exists, arg4 is set to 1. |
|
1 |
Create a new list of torsion checks for carbon-carbon double bonds, amides, and esters. |
|
2 |
Create a new list of torsion checks for carbon-carbon double bonds only. |
|
arg5 |
Torsional selection |
|
−1 |
Use existing list of torsions. |
|
0 |
Use previous setting. If none exists arg5 is set to 1. |
|
1 |
Create a new list of torsions to sample. |
|
arg6 |
Serial calculation |
|
−1 |
Turn off serial mode for MCMM calculations. |
|
0 |
Use previous setting. |
|
1 |
Turn on serial mode for MCMM calculations. Serial MCMM calculations are off by default. If LMCS serial calculations have already been requested, then serial MCMM calculations are also turned on automatically when the |
|
arg7 |
Minimum ring size for ring closures and torsional sampling |
|
0 |
Use the previous setting. If there is no previous setting, use the default setting. |
|
> 3 |
Attempt to sample rings with at least this many atoms as members. The default value is 5 (five-membered rings or larger). |
|
arg8 |
Sampling of torsions around bonds involving planar groups |
|
Specify sampling of torsions around amide and ester linkages, azo (N=N) and imino (C=N) groups. |
|
|
0 |
Do not sample torsions around planar groups. |
|
1 |
Sample torsions around nonstandard amide and ester linkages; do not sample other planar groups. |
|
2 |
Sample all amide and ester linkages; do not sample azo and imino linkages. |
|
3 |
Sample all amide and ester linkages, and sample azo and imino linkages. |
|
4 |
Sample all of the above and include hydrogen atoms in the sampling. |
Related DEBG flags: 520, 521.
AUOP — AUto OPtions
When AUTO is used in conjunction with an MCMM, mixed MCMM/LMCS, mixed MCMM/LMC2, SPMC, or CGEN conformational search, AUOP may be used to modify the number of steps depending on some measure of the number of degrees of freedom present. AUOP should be used immediately before the AUTO command.
For CGEN by default the number of moves can also depend on ConfGen’s internal estimate for the number of degrees of freedom. See CGO2 arg3 for more information.
|
arg1 |
Number of times global minimum is found |
|
> 0 |
The search will terminate when the global minimum is found this many times. Not recommended for ConfGen searches. |
|
arg2 |
Exclude backbone torsions from torsion list |
|
0 |
Include protein backbone torsions in the conformational search. |
|
1 |
Exclude protein backbone torsions from the conformational search. |
|
arg3 |
Exclude variable chirality atoms from chiral atom list |
|
0,1 |
Use full list of chiral atoms. |
|
2 |
Exclude atoms whose chirality can vary in solution. This currently only excludes nitrogen atoms bonded to 4 atoms one of which much be a hydrogen atom. |
|
arg5 |
Number of steps |
|
< 1 |
The search will use the number of steps specified by the search command itself, e.g. arg1 of MCMM . |
|
> 1 |
The search will use (number of degrees of freedom) * arg5 steps if this is less than the number of steps specified by the search command itself. For MCMM, mixed MCMM/LMCS, mixed MCMM/LMC2 and SPMC searches the number of degrees of freedom is given by the number of torsions (TORS) and molecules moved (MOLS) identified by AUTO. For CGEN jobs, the number of degrees of freedom can be estimated this way or from ConfGen’s internal estimate for the number of degrees of freedom, depending on the value of CGO2 arg3. |
LMCS — Low-Mode Conformational Search
This method is described by Kolossváry and Guida [30, 31]. In tests so far, it has proved highly efficient, and it has the advantage that ring structures and variable torsion angles do not have to be specified. LMCS works by exploring the low-frequency eigenvectors of the system, which are expected to follow “soft” degrees of freedom, such as torsions. It is, however, helpful to specify independently movable molecules, by means of MOLS commands. Like the MCMM procedure, LMCS may be seeded or restarted with multiple input structures, and LMCS may also be used with distributed MacroModel. Most features that work with MCMM also work with LMCS. COMP, CHIG, DEMX, and MCSS commands should ordinarily be specified, and several MCOP arguments take on special meaning when used with LMCS.
The AUTO opcode can be used for automatic setup of LMCS and MCMM/LMCS calculations.
LMCS comes in two varieties: LMCS-SHAKE and mode-following. LMCS-SHAKE follows the low-mode vector, but every 2 Å it applies a few steps of steepest descent minimization to relieve any strain due to distorted bond lengths and bond angles introduced by the move.
The mode-following (eigenvector-following) option allows the user to apply the original low-mode search concept: “Since the potential energy hypersurface is a network of interconnected minima and saddle points, we reasoned that one could utilize a procedure that relies on eigenvector following for conformational searching. Thus, one could initiate the search by starting with any local minimum. By using one of the eigenvector-following techniques, one could locate a saddle point associated with this minimum and then the other minimum associated with this saddle point. By application of the eigenvector-following technique to the second minimum or to a different eigenvector of the first minimum, additional minima could be located which could then be used to find additional saddles, etc.” (quoted from the J. Comp. Chem. article cited above).
Mode-following uses the same eigenvector-following saddle point search method as in the SDLP command (see details there). If LMCS mode-following locates a saddle point, it proceeds as follows. The eigenvector for the negative eigenvalue is computed, followed by a short move along that eigenvector away from the starting structure. The resulting structure, which is slightly lower in energy than the saddle point structure, is then energy-minimized to locate the minimum energy structure on the other side of the saddle point.
If LMCS mode-following cannot locate a saddle point, the starting (minimum) structure is restored and is automatically “rejected by starting geometry.” Thus, only minima found via saddle points are stored. This makes LMCS mode-following useful for the mapping of conformational interconversions in a local region of the potential energy surface. The recommended procedure for this is to set LMCS arg5=−1 (every step mode-following), and MCSS arg1=0 (random walk structure selection) or LMCS arg4=1 (local search mode).
Note: LMCS mode-following is not intended for general conformational searches. Simple LMCS or LMCS-SHAKE (LMCS arg5>=0) is much more efficient for this purpose. LMCS mode-following is only recommended for local searches exploring the conformational interconversions of a molecule. DEBG 920 saves all the intermediate structures during LMCS-SHAKE or mode-following in the output structure file and colors them so that you can conveniently visualize the different multiple LMCS moves.
LMCS can be combined with MCMM search. Tests have confirmed that the most efficient use of LMCS is to allow explicit torsional rotation of key torsion bonds, especially in acyclic structures. Therefore, you can combine LMCS with TORS commands (with RCA4 if necessary) without restriction. MOLS commands can also be applied to translate/rotate independently movable molecules. See also the MCOP command.
|
arg1 |
Number of Monte Carlo steps to be carried out before stopping |
|
0 |
Carry out the search until arg2 structures have been found. We do not recommend the use of this default. |
|
arg2 |
Maximum number of structures to retain while running |
|
0 |
The maximum number of conformations to retain while running is the value specified in arg1 or 10,000, whichever is greater. |
|
> 0 |
The maximum number of conformations to retain while running. |
|
arg3 |
Number of low-frequency modes |
|
|
|
|
0 |
Default: 10 (pure modes). |
|
< 0 |
|
|
arg4 |
Search mode |
|
0 |
Global search (each Monte Carlo step begins with the preceding Monte Carlo structure, providing the structure is within 100 kJ of the global minimum). This is equivalent to MCSS (arg1=0, arg2=0, arg4=100.0). |
|
1 |
Local search (each Monte Carlo step begins with the original structure). Generally used with small coordinate variations in TORS or MOLS to find other minima which are closely related to the starting structure. |
|
arg5 |
Control of multiple LMCS steps |
|
0 |
Default: Single LMCS leap (MacroModel 6.0 behavior). |
|
> 0 |
Frequency of using LMCS-SHAKE (must be in range 0 to 1). |
|
< 0 |
|arg5| is frequency of using mode-following (arg5 must be in range 0 to −1). |
|
arg6 |
Allowable interatomic approach distance |
|
Fraction of sum of van der Waals radii used as a closest atomic approach limit. If a van der Waals pair comes closer together than this as the result of an LMCS move, the move is rejected before minimization. |
|
|
0 |
Default: 0.25 |
|
> 0 |
Other fraction. |
|
arg7 |
Minimum distance (Å) |
|
In an LMCS move, a random total travelling distance is selected between the specified minimum and maximum values. The distances specified here and in arg6 correspond to the motion of the fastest-moving atom. |
|
|
It is often useful to perform a short conformational search with the default values of arg7 and arg8, and, based on the results, adjust these arguments accordingly. If many conformations minimize back to the starting conformation, increase arg7 (and perhaps arg8). If many conformations are ruled out because of distorted sp3 carbons, decrease arg8 (and perhaps arg7). The information needed to make these choices will become visible by specifying MCOP arg1=1. |
|
|
0 |
Default: 3 Å |
|
> 0 |
Other value to be used. |
|
arg8 |
Maximum distance (Å) |
|
0 |
Default: 6 Å |
|
> 0 |
Other value to be used. |
LMC2 — Low-Mode Conformational search for large molecules
This method, termed LLMOD, is described by Kolossváry and Keseru [32]. LLMOD has been developed for large-scale conformational searching, such as protein loop optimization, homology model refinement, and fully flexible docking for induced fit modeling, only to mention a few applications [33]. LLMOD is very similar to LMOD, which is implemented in MacroModel as the LMCS command, but LLMOD utilizes the ARPACK package to compute low-mode eigenvectors of a Hessian matrix that is only referenced implicitly, through its product with a series of vectors. The Hessian × vector product can be calculated by a number of methods. LLMOD is the first conformational search method that can be applied to fully flexible, unconstrained protein structures.
LMC2 is implemented to extend the basic use of LMCS to molecules of virtually any size. Note that two main features of LMCS, LMCS-SHAKE and LMCS mode-following, are not implemented in LMC2. The command arguments of LMC2 are fairly similar to those of LMCS. Like LMCS, LMC2 can also be used in conjunction with the MCMM command to explicitly alter key torsion bonds via TORS commands (with RCA4 to open macrocyclic structures if necessary) and apply explicit translation/rotation of a ligand with the MOLS command for docking. LMC2 is primarily designed for fully flexible molecules, but the use of frozen atoms is allowed, and recommended in cases where parts of a macromolecule can be treated rigidly. Also note that you can set certain parameters controlling the ARPACK package via the ARPK command.
The AUTO opcode can be used for automatic setup of LMC2 and MCMM/LMC2 calculations. It can be used for MCMM/LMC2 serial searches but not for pure LMC2 serial searches.
LMCS is more efficient than LMC2 when the number of atoms is small enough for the entire Hessian to be stored in computer memory without swapping.
Note: It has proven to be good practice and time-saving to minimize molecular structures during a conformational search to a gradient that is not too low and only re-minimize entirely the final set of low-energy conformations. In that respect, LMC2 needs special attention. It is still recommended to follow this scheme, e.g., for unconstrained proteins, a gradient RMS of 1 kJ/mol/Å is sufficient during the search, but the very first structure, for which the low-modes are computed, must be pre-minimized to a low, < 0.1 kJ/mol/Å gradient RMS in order to derive reasonable modes. Pre-minimization can either be done in a separate MacroModel job, or the RWND command can be used to fully minimize the first structure, rewind the output file and start the LMC2 conformational search from there.
Note: Large-scale low-mode calculations can use a large amount of memory. If memory issues are encountered, consider eliminating the use of solvation. In addition, reducing non-bonded cutoffs also reduces the memory required for the computation.
|
arg1 |
Number of Monte Carlo steps to be carried out before stopping |
|
0 |
Carry out the search until arg2 structures have been found. We do not recommend the use of this default. |
|
arg2 |
Maximum number of structures to retain while running |
|
0 |
The maximum number of conformations to retain while running is the value specified in arg1 or 10,000, whichever is greater. |
|
> 0 |
This is the maximum number of conformations to retain while running. |
|
arg3 |
Number of low-frequency modes |
|
|
|
|
0 |
Default: 30 (pure modes). |
|
< 0 |
|
|
arg4 |
Search mode |
|
0 |
Global search: each Monte Carlo step begins with the preceding Monte Carlo structure, providing the structure is within 100 kJ of the global minimum. This is equivalent to MCSS (arg1=0, arg2=0, arg4=100.0). |
|
1 |
Local search: each Monte Carlo step begins with the original structure. Generally used with small coordinate variations in TORS or MOLS to find other minima which are closely related to the starting structure. |
|
arg5 |
Determines when low-mode eigenvectors are calculated |
|
ARPACK calculations on proteins are time consuming. The time span of a single low-mode calculation can vary from a few minutes to an overnight job. It is highly recommended that eigenvectors be re-calculated only when it is absolutely necessary, i.e., when there is evidence that low modes calculated for a particular conformation cannot be transferred to other conformations. In other words, as long as a single set of low modes can produce efficient search directions starting from a number of different local minima, there is no need to calculate a different set of modes for every new conformation. |
|
|
0 |
Default: Calculate the low modes for the very first minimized structure only and use those modes throughout the entire search (recommended). |
|
> 0 |
Low-mode eigenvectors are recalculated every time the search finds a new global minimum. The modes of the current global minimum are always used. |
|
< 0 |
Low modes are recalculated for every new structure (LMCS behavior). |
|
arg6 |
Allowable interatomic approach distance |
|
Fraction of sum of van der Waals radii used as a closest atomic approach limit. If a van der Waals pair comes closer together than this as the result of an LMCS move, the move is rejected before minimization. |
|
|
0 |
Default: 0.25 |
|
> 0 |
Other fraction. |
|
arg7 |
Minimum distance (Å) |
|
Same as LMCS, except that the default values are different, depending on arg3. |
|
|
0 |
Default: 3 (Å) for single mode, arg3 >= 0 |
|
1 |
(Å) for mixed mode, arg3 < 0 |
|
> 0 |
Other value to be used. |
|
arg8 |
Maximum distance (Å) |
|
Same as LMCS, except that the default values are different, depending on arg3. |
|
|
0 |
Default: 6 (Å) for single mode, arg3 >= 0 |
|
2 |
(Å) for mixed mode, arg3 < 0 |
|
> 0 |
Other value to be used. |
LOOP — protein LOOP conformation generation
LOOP is a conformational search method for protein loops that works in conjunction with the MINI opcode in a manner similar to MCMM.
Given a single protein structure LOOP attempts to generate candidate loop conformations using either the specified loop from the protein structure provided or an alternate user specified sequence for the loop. LOOP rapidly generates a candidate structure with the appropriate bond lengths and angles. In addition, heavy atoms in this structure (i.e., atoms heavier than hydrogen) will not approach each other more closely than a specified distance. These candidate structures are then energy minimized and can be checked for uniqueness in a manner analogous to that for MCMM. A more general discussion of the use of LOOP is available in Protein Loop Construction with MacroModel.
LOOP generates candidate structures in two stages:
-
Initial loop generation via random selection of rotatable dihedral angles without concern for loop closure
-
“Tweak,” an iterative process involving the systematic adjustment of the rotatable bonds to attain loop closure [34, 35] while eliminating very close encounters amongst heavy atoms.
In the loop generation stage, the N-terminus and C-terminus halves of the loop are constructed separately, starting from their attachment points, by growing the chain out to the next rotatable bond. Then a random dihedral angle is chosen for this bond and the heavy atoms positioned unambiguously as a result are checked for close approaches with heavy atoms in the rest of the protein (not the loop itself). If there are such close approaches, the process of selecting a new random angle and checking for close approaches is repeated until either Ntry such attempts are made or no close approaches result. If Ntry such attempts fail then the loop construction process retreats back to the previous rotatable bond for which a new random angle is chosen. When a dihedral angle has been chosen successfully, the process focuses on the next rotatable bond out from the attachment point. Rotatable bonds in side chains are also sampled in this manner. The loop generation stage produces a random structure for the protein loop that is not connected in the middle and does not have close approaches between the loop atoms and the rest of the protein, but there may such close approaches within the loop itself.
In the second stage the Tweak algorithm is used to bring the middle of the loop together in the right geometry. Briefly, this algorithm proceeds by iteratively selecting a set of minimal changes to the rotatable bonds that lead toward loop closure based upon a linear approximation. Loop closure is specified by a set of target distances that must be met to within a tolerance of 0.001 Angstroms. Along the way close approaches involving heavy atoms of the loop atoms and the rest of the protein are removed by temporarily introducing additional target distances between the overlapping atoms. When the set of distances for closing the loop are met to within 1 Angstrom the Tweak algorithm also attempts to remove close approaches amongst the heavy atoms within the loop. By using a minimal set of changes to the rotatable bonds the Tweak algorithm retains as much of the original, random loop conformation in torsional space as possible and thus provides a mechanism for generating a diverse set of candidate loop structures.
Limitations:
-
Only one loop can be treated, although successive calculations may target different loops.
-
An all-atom representation of the protein must be used.
-
The protein loop and the residues that it is immediately attached to must consist of alpha amino acids.
-
Rings (e.g., the one in proline) are treated as rigid.
-
No disulfide bonds are permitted within the loop or between the loop and the rest of the protein.
-
No atoms in the loop can be frozen.
Using LOOP usually results in a renumbering of the atoms within the structure such that the loop atoms become the highest numbered atoms in the structure. LOOP tries to make this work by automatically shifting the atom numbers provided in SUBS, FXAT, FXDI, FXBA, FXTA, COMP, and CHIG commands. There are also routines for automatically generating COMP and CHIG atoms for the loop atoms as described for arg3, below. To facilitate subsequent studies using the structures generated, a new substructure file, out_filename.sbc, containing the shifted atom numbers is generated automatically. In addition, shifted COMP and CHIG commands suitable for use in .com files are written in the .log file.
If comparison atoms (COMP) are specified explicitly or implicitly (see arg 3) then MSYM must be used.
|
arg1 |
N-terminus atom number for the loop |
|
This is the atom number for the peptidic Nitrogen atom that joins the loop to the protein at the N-terminus of the loop. This must be nonzero. |
|
|
arg2 |
C-terminus atom number of the loop |
|
This is the atom number for the peptidic Carbon atom that joins the loop to the protein at the C-terminus of the loop. This must be nonzero. |
|
|
arg3 |
Type of LOOP Generation |
|
−1 |
Turn off |
|
0 |
Generates alternate conformations for the loop specified by args 1 and 2. In this mode the first structure minimized is that originally provided. |
|
1 |
Remove the loop specified by args 1 and 2 and replace it with the amino acid sequence specified in an in_filename |
|
arg4 |
The number of loop conformations to generate |
|
0 |
Default: 100 |
|
arg5 |
The maximum number of loop conformations to keep |
|
0 |
Default: 10000 |
|
arg6 |
Allowable interaction approach distance |
|
If a given pair of atoms is closer than arg6 times the sum of their van der Waals radii after loop generation just prior to minimization then the structure is rejected and another loop structure is generated. |
|
|
0 |
Default: 0.25 |
|
arg7 |
Automatic generation of COMP and CHIG atom lists for the loop |
|
−1 |
Don’t generate COMP or CHIG commands for the atoms in the loop. |
|
0 |
Automatically generate COMP commands for the heavy atoms in the loop and CHIG commands for atoms in the loop. |
|
1 |
Automatically generate COMP commands and CHIG commands for all the atoms in the loop. |
|
2 |
Automatically generate COMP commands for the heavy atoms in the loop and do not generate any CHIG commands. |
|
3 |
Automatically generate COMP commands for all the atoms and do not generate any CHIG commands. |
Related DEBG flags: 555
LPOP — LooP OPtions
LPOP provides a means to modify some of the parameters related to protein loop generation with the LOOP opcode. If no LPOP commands are provided the default values for the parameters are used. Multiple LPOP commands (e.g., with different arg1 values) may be used. It is not expected that these parameters would need to be modified from their default values under normal conditions so typically one would not need to include a LPOP command in the .com file.
|
arg1 |
Parameter set modified by LPOP |
||||||||||||||||||||||||||||
|
This argument controls which set of parameters is modified by the remaining arguments to |
|||||||||||||||||||||||||||||
|
0 |
Tweak parameters
|
||||||||||||||||||||||||||||
|
1 |
Initial construction parameters
|
||||||||||||||||||||||||||||
|
2 |
Geometric parameters
|
Related DEBG flags: 555
ARPK — ARPacK parameters
This opcode applies to LMC2, VBR2, and LTNCG (MINI arg1=11)
The ARPACK package was developed by Danny Sorensen, Richard Lehoucq, Chao Yang, and Kristi Maschhoff at Rice University [36, 37]. Instead of attempting to solve a huge eigenproblem directly, ARPACK solves it indirectly by solving a series of small problems of much lower dimensionality and references the huge matrix only implicitly through its product with some vectors. ARPACK is ideally suited for LLMOD calculations, because direct calculation of the low-mode eigenvectors of a huge Hessian matrix of a protein molecule is prohibitive. Instead, LLMOD utilizes ARPACK to compute the low-mode eigenvectors indirectly. The ARPK command allows a user to fine tune ARPACK parameters. Note that the VBR2 command also uses ARPACK to calculate vibrational modes.
DEBG 999 can be used to provide greater verbosity for ARPK.
|
arg1 |
Select a method to calculate Hessian × vector |
|
0 |
Default: The Hessian is calculated analytically, but it is stored in a sparse vector representation, not in matrix form, and allows for fast matrix-vector multiplication, which scales only linearly with the number of dimensions. This is the recommended method. |
|
1 |
H×v is calculated by a finite difference formula based on gradients (see Kolossváry and Keseru [32]). This method does not need second derivatives. It is fast, but is often unstable and converges only very slowly. |
|
2 |
H×v is calculated using a LBFGS (see MINI arg1=10) representation of an approximation to the inverse Hessian matrix. This method is the fastest, but it can only be used to calculate the three lowest modes. Higher modes are usually only computational artifacts: they are totally useless for LLMOD. Check arg4 if you choose this method. |
|
3 |
This method is a variant of arg1=1 using spectral transformation to boost convergence. Extremely slow, not recommended. If you choose to try it, check arg6 and arg7. |
|
4 |
Same as arg1=0, but sparse Hessian storage is built on top of an explicit Hessian matrix. It is faster than arg1=0, but explicit Hessian storage makes it prohibitive for more than approx. 1000 atoms. (The Hessian storage scheme is the same used for TNCG minimization (MINI arg1=9). |
|
5 |
H×v is calculated with exact arithmetic. The Hessian is not stored in any form, its elements are always recalculated as they are needed for computing vk(i)=H(i,j)*vl(j) and vk(j)=H(i,j)*vl(i). This is the most accurate method, but it is prohibitively slow on large molecules because of the burden of recalculating the Hessian numerous times. |
|
arg2 |
Dimension of “small” problems to solve (length of Arnoldi factorization) |
|
0 |
Default: 300 |
|
> 0 |
Other value to be used. arg2 >= requested number of modes + 1 (LMC2 arg3, VBR2 arg2), recommended value: arg2 = 10 * requested number of modes |
|
arg3 |
Number of “small” problems to solve (number of Arnoldi iterations) |
|
0 |
Default: 450 |
|
> 0 |
Other value to be used (maximum 25,000). |
|
arg4 |
This argument is multipurpose. |
|
If arg1=2, then arg4 specifies the number of extra LBFGS steps applied after minimization has reached convergence, to build a more accurate LBFGS representation of the Hessian matrix. |
|
|
0 |
Default: 500 |
|
> 0 |
Other value to be used. |
|
If arg1=0, then arg4 is ignored. The amount of memory used for sparse Hessian is increased automatically as needed. |
|
|
If arg1 is not 0 or 2, then arg4 is used to specify the amount of memory used for sparse Hessian storage. |
|
|
0 |
Default: 20 (MB) |
|
> 0 |
Other value to be used. |
|
Note that the user-defined memory is evenly distributed between storage of Hessian values and storage of corresponding (i, j) indices. |
|
|
arg5 |
Accuracy of ARPACK calculation in terms of ABS(λc − λt) < arg5* ABS(λc) where λc is a calculated eigenvalue and λt is the true eigenvalue. |
|
0 |
Default: 0.0001 |
|
< 0 |
Machine precision (double precision). |
|
> 0 |
Other value to be used. |
|
arg6 |
Eigenvalue shift of spectral transformation. Used with arg1=3. |
|
0 |
Default: 5.0 |
|
> 0 |
Other value to be used. |
|
Spectral transformation transforms the eigenvalue spectrum such that the closest eigenvalue to arg5 becomes the largest eigenvalue and therefore, most likely the first eigenvalue to be found by ARPACK. |
|
|
arg7 |
Conjugate gradient tolerance criterion for spectral transformation. Used with arg1=3. |
|
0 |
Default: 0.01 |
|
> 0 |
Other value to be used. |
|
Spectral transformation involves minimum residual conjugate gradient iteration for which arg7 sets the convergence limit. |
|
|
arg8 |
Sparse Hessian cutoff value. Used with arg1=0 and arg1=4. (Similar to MINI/arg6). |
|
0 |
Default: 0.001 |
|
> 0 |
Other value to be used. |
|
Used with arg1=0, any Hij term that is less than arg8 in magnitude is excluded from the sparse Hessian representation. A single Hij element is the sum of a number of Hij terms coming from different interactions that map to the same coordinate indices (i, j). On the other hand, used with arg1=4, arg8 is interpreted the same way as arg6 in the MINI command, i.e., only those elements of the Hessian matrix that are greater than arg8 are used in the sparse Hessian representation. |
MCMM — Monte Carlo Multiple Minimum
MCMM [38, 39] is our recommended conformational search method. The input structure will be modified by random changes in torsion angles and/or molecular position as specified by the TORS or MOLS commands. Ordinarily, whether a single structure or multiple structures appear in the input file, they will first all be read in, minimized and treated as if already found by the MCMM procedure. This allows a new search to be initialized from the output of a previous search, by using the output file of the old search as input for the new one. However, if the necessary READ and MINI commands are placed within a BGIN/END loop, then a separate search is carried out for each input structure. This is called a serial search. For such serial calculations, the AUTO opcode is also needed.
The TORS command is used to specify dihedral angles to be varied; the MOLS command specifies relative positions of multiple molecules, as in an enzyme-substrate docking procedure. In addition, RCA4 commands and LIGB commands can be used to open rings and break ligand bonds, respectively, before performing torsional or relative molecular motion. CHIG commands should be specified for ring-closure atoms as well as to retain chirality about other centers. TORC commands may be used to hold double-bond configurations constant.
The AUTO opcode can be used for automatic setup of any kind of MCMM calculation.
Only unique structures will be retained, as in the MULT conformational searches. Use DEMX to set an energetic window to select low energy conformations. It is usually found that not all structures converge in minimization during a conformational search. A MULT minimization of the output file is recommended to achieve convergence for the final result.
While the default search method is random walk, we find that the usage-directed search (MCSS arg1=2) gives improved search performance.
|
arg1 |
Number of Monte Carlo steps to be carried out before stopping |
|
0 |
Carry out the search until arg2 structures have been found. We do not recommend the use of this default. |
|
arg2 |
Maximum number of structures to retain while running |
|
0 |
The maximum number of conformations to retain while running is the value specified in arg1 or 10,000, whichever is greater. |
|
> 0 |
This is the maximum number of conformations to retain while running. |
|
arg3 |
Number of variables altered in each step |
|
The number of torsional angles varied and/or molecules moved in each step (default: 3). The value specified is randomly varied by +/−1 to prevent concentration of the search in local areas of conformational space. The default (3) thus alters 2–4 variables at each step. |
|
|
Greater control over this parameter may be obtained using the MCNV command; this is the most common procedure. |
|
|
arg4 |
Search mode |
|
0 |
Global search (each Monte Carlo step begins with the preceding Monte Carlo structure, providing the structure is within 100 kJ of the global minimum). This is equivalent to MCSS (arg1=0, arg2=0, arg4=100.0). |
|
1 |
Local search (each Monte Carlo step begins with the original structure). Generally used with small coordinate variations in TORS or MOLS to find other minima which are closely related to the starting structure. |
|
arg6 |
Allowable interatomic approach distance |
|
Fraction of sum of van der Waals radii which is used as a closest atomic approach limit (default: 0.25). |
SPMC — Systematic Pseudo-Monte Carlo search
Similar to MCMM, but invokes systematic search in place of random search. The search begins at low torsional resolution (120°), searches all angles without duplicating coverage, then doubles the resolution, etc. This method has the advantage of not retracing its path and consequently converges the final stages of the conformational search more efficiently than MCMM. Like MCMM, the method is effectively open-ended: it will search conformational space until stopped by the user or with arg1.
SPMC searches are conducted starting from a single input structure. However, if the necessary READ and MINI opcodes are placed within a BGIN/END loop, then a separate search is carried out ion each input structure. This is called a serial search. AUTO may be used to set up single SPMC searches and is required for serial SPMC searches.
It is suggested that torsional memory (MCSS arg3) be activated when using SPMC to prevent retracing of points in conformational space when starting from different starting geometries. If rings are being varied (i.e., RCA4 commands are being used), geometrical preoptimization (MCOP arg2) should also be activated.
Use with MCNV arg1=1 and arg2=N−1, where N is the number of variable torsions plus the number of molecules being independently translated/rotated with MOLS commands.
Details of the method are provided by Goodman and Still [40].
The arguments are the same as for MCMM, except arg3:
|
arg3 |
Maximum resolution for torsional alterations |
|
Default: 24, implying angular resolution of 360° / 24 = 15°. |
MCOP — Monte Carlo OPtions
This command alters the data written to the .log file, and also the geometrical optimization routine. If MCOP is omitted, this is equivalent to setting arg1=250 and arg2=0.
Note: Despite its name, MCOP specifies parameters for use in low-mode (LMCS and LMC2) and LOOP calculations, as well as Monte Carlo searches. Arguments 4 and 5 only apply to LMCS and LMC2 jobs. Starting in MacroModel 6.5, these arguments have new meanings and arguments 6 and 7, which pertained to the earlier LMCS methodology, have been eliminated.
|
arg1 |
Number of steps between printout to log-file |
|
0 |
Print to log file every 250 Monte Carlo steps. |
|
1 |
Print to log file every step. |
|
n |
Print to log file every n steps. |
|
arg2 |
Geometrical preoptimization |
|
0 |
Off. |
|
1 |
On. Preoptimizes variable internal coordinates to improve ring closure distances. Recommended for SPMC of ring systems. |
|
arg3 |
Frequency of updating a conformational search |
|
When an update is performed, the following actions take place:
|
|
|
0 |
Default: Perform an update every tenth of a run, but not more often than every ten steps nor less often than every 500 steps. |
|
n |
Perform an update every n steps. |
|
arg4 |
LMCS serial job |
|
0 |
Not an LMCS serial job. A single low-mode search is performed, with all input structures treated as seed structures, as if found in previous iterations. |
Note: arg4=0 is unsuitable for non-conformers.
|
≠0 |
LMCS serial job; this implies that a separate conformational search will be performed for each structure in the input file. This takes advantage of the ability of LMCS to define fruitful search directions without specification of variable torsions. An LMCS serial job can be run only when there are no commands specifying atom numbers—such as TORS or CHIG—which might translate to incorrect specifications in the different input structures. |
|
arg5 |
Probability of taking a TORS/MOLS step |
|
0 |
If this is an LMCS job, all steps will be LMCS steps. |
|
n |
If this is an LMCS job and there are TORS or MOLS commands present, this fraction of moves will be TORS or MOLS (i.e., not LMCS) moves. |
|
arg6 |
Maximum number of conformers to write to the output structure file for each search performed |
|
For example, in a serial search, the search for each input structure would produce at most the number of structures specified here. |
|
|
Using a value other than 0 or 1 for this argument is not permitted in a |
|
|
0 |
Use setting from the previous |
|
If no previous |
|
|
> 0 |
Save up to this many structures per search. |
|
arg7 |
The number of conformers read in from the input structure file to seed each search in a serial search. |
|
This argument is primarily intended for use in |
|
|
Do not use this argument in LOOP calculations. |
|
|
Do not use this argument in |
|
|
0, 1 |
Seed each search with one structure. |
|
> 1 |
Seed each search with this many input structures. |
MCSS — Monte Carlo Structure Selection
This command allows the program to select starting geometries for Monte Carlo search steps in several different ways. Arg1 selects between random walk and two usage-related criteria. Arg2 affects weighting among selected structures for low energy geometries. Arg5 gives an optional energy window which prevents structures which are high in energy from being chosen.
This command is active only when doing global searching (i.e., when MCMM or SPMC arg4 = 0). In any case, structures must be within 100 kJ of the current global minimum to be candidates for starting geometry selection.
|
arg1 |
Starting structure selection criterion |
|
0 |
Random walk. Most recent structure will be chosen whose energies are allowed by args2 and 5. |
|
1 |
Use-directed. The least used structures will be used as starting geometries if their energies are allowed by args2 and 5. “Use” is defined as (times used as starting structure) − (times resulting structure is kept). |
|
2 |
Use-directed. The least used structures will be used as starting geometries if their energies are allowed by args2 and 5. Use defined simply as (times used as starting structure). |
|
arg2 |
Energetic window modifier |
|
0 |
Arg5 will be used directly; recommended. |
|
1 |
Arg5 will be multiplied by a random number between 0 and 1. |
|
arg3 |
Torsional memory selection |
|
0 |
Torsional memory is not used. |
|
> 0 |
Torsional memory is used. Structures are considered identical if all torsions match within RES/arg3 where RES is the operative search resolution (smallest value = 2). |
|
arg5 |
Energetic window, kJ/mol |
|
Current structure will be used as a starting geometry for a subsequent step only if its energy is within arg5 kJ/mol of the lowest energy structure yet found. A good choice for arg5 is simply the value of the overall energetic window being used in DEMX. Default: 100 kJ/mol. |
FLAP — use FLAP algorithm for ring conformations
Use the FLAP ring vertex reflection algorithm [54] to generate ring conformations in MCMM, low-mode, or mixed MCMM/low-mode conformational searches. This opcode must be added before the relevant MCMM or LMCS opcode.
|
arg5 |
Probability of using a FLAP step |
|
Specify the fraction of conformational search steps that will use FLAP. If used in conjunction with MCRC, the sum of the FLAP and template probabilities must not be greater than 1. |
|
|
0 |
Use the default value of 0.5 |
|
> 0 |
Use this value. Should be between 0 and 1. |
MCRC — Monte Carlo Ring Conformations
Use templates of ring conformations in MCMM, low-mode, or mixed MCMM/low-mode conformational searches. This opcode must be added before the relevant MCMM or LMCS opcode.
|
arg1 |
Use of templates |
|
0 |
Use templates in conformational search (default) |
|
−1 |
Do not use templates in conformational search |
|
arg5 |
Probability of using templates |
|
Specify the fraction of conformational search steps that will use templates. If used in conjunction with FLAP, the sum of the FLAP and template probabilities must not be greater than 1. |
|
|
0 |
Use the default value of 0.5 |
|
> 0 |
Use this value. Should be between 0 and 1. |
MCTS — Monte Carlo Torsion Selection
Experience has shown that this command is not terribly useful.
Use in conjunction with MCMM to favor torsion angle selection near local torsional minima for the angles being varied. One MCTS command is required for each torsion which is to be effected. Arg1-4 are the atoms defining a particular torsion angle (i.e., arg2-3 should appear as a rotatable bond in a TORS command). Arg5-7 are 1-fold, 2-fold, and 3-fold torsional barriers which are used to compute local torsional energies as part of the test for an allowable value of the randomly selected Monte Carlo angular change. After a Monte Carlo variation of a torsion angle described by arg1-4 of this command is performed, a local torsional energy (ET) is computed based on the value of the dihedral angle, τ , using the usual MM2 formula:
ET = (V1/2)(1+cos τ) + (V2/2)(1−cos 2τ ) + (V3/2)(1+cos 3τ ) − ET,min
where ET,min is the minimum possible value of ET. If ET is greater than 1, the torsion is rejected and a new random torsion angle is chosen. Otherwise, ET is compared with a random number between 0 and 1 and, if ET is larger than that number, a new random torsion angle is chosen. This scheme selects for local torsions which are low in energy. For sp3-sp3 linkages, one can favor the gauche and anti conformers (the minima) by using a V1 (arg5) and V3 (arg7)of 0.25. This will cause totally eclipsed (0-degree torsion) torsions to be strongly disfavored, 120-degree torsions to be moderately disfavored and the gauche and anti conformations to be favored. By choosing V1-V3 with care and using a random number for comparison, even high energy geometries are occasionally explored.
|
arg1-4 |
Atoms defining the torsion |
|
arg5 |
V1 (positive number gives anti minima) |
|
arg6 |
V2 (positive number gives eclipsed minima) |
|
arg7 |
V3 (positive number gives staggered minima) |
MCSM — Monte Carlo Single Minimum
This procedure is similar to that described by Li and Scheraga [41] and is a global search for the single lowest energy structure. The search can be conducted with or without minimization (depending on arg3). While this method is good at finding a single low energy conformer, there is no guarantee that it will locate the true global minimum energy conformer.
Monte Carlo internal coordinate conformational search is performed with minimizations on a single structure. Variable internal coordinates are specified by TORS and/or MOLS.
MCSM can be used with cyclic structures providing that either ring bonds are not varied or that ring closure commands (RCA4) are used for each ring in which variable torsions are used. During the run, random variations will be applied to 2-4 randomly selected dihedral angles from the TORS lists or molecular translations/rotations from the MOLS lists for each Monte Carlo step. If some other range of variable coordinates is desired, the range is set with the MCNV command.
If used within a BGIN/END loop, all structures in the input file will be read and the global minimum found will be listed to the output file at the end of every MC iteration set for each input structure. The ultimate global minimum would be found by examining the energies of each of the output structures.
Unlike MCMM, MCSM requires no MINI command. It must appear after a READ command.
In general, Monte Carlo searches should use CHIG commands to maintain all chiral centers and TORC commands to hold double bond geometries constant. If chirality or double-bond geometry is lost in any step, then following steps will be wrong if the resulting geometry is used for subsequent steps.
|
arg1 |
Minimization mode |
|
0 |
Steepest descent. |
|
1 |
PR conjugate gradient (best general method). |
|
3 |
Variable Metric (not recommended with |
|
4 |
Full matrix NR (not recommended with |
|
9 |
TNCG (good for flexible structures). |
|
arg2 |
Line-search control for second-derivative methods (arg1=4 or 9) |
|
0 |
No line searching (best choice). |
|
1 |
Line searching on. |
|
arg3 |
Maximum number of minimization iterations |
|
arg4 |
Number of Monte Carlo cycles (default: 100) |
|
arg5 |
Initial temperature (K) |
|
0.0 |
Metropolis sampling will not be done; every structure will be used as the next starting point |
|
arg6 |
Final Temperature (K) |
|
0.0 |
Continuous sampling at the arg5 temperature will be done throughout the run. If a nonzero temperature is supplied, cooling from the arg5 to the arg6 temperature will be carried out during the run. This slow cooling is equivalent to simulated annealing. |
|
arg7 |
Step size buffer |
|
|
As in MINI arg5. |
|
arg8 |
TNCG Hessian cutoff |
|
|
As in MINI arg6. |
MCNV — Monte Carlo Number of Variables
This command resets the degrees of freedom (number of torsion angles varied plus number of molecules moved in space) altered in a single search step. If this command is not used, the value is taken from arg3 of the MCMM command. This command overrides the MCMM arg3.
For MCMM on single unsymmetrical molecules, we find it best to specify the range as 1 to N, where N is the number of variable dihedral angles, when TORS commands are being used. It is best to use a range of values, rather than a single value. When MOLS is being used, N should be incremented by one. When ZMAT is being used, N should take account of all degrees of freedom.
Used with MCMM, MCSD, MCLO, IMPS, and SPMC commands.
In the context of MC or MC(JBW) simulations this command sets the number of degrees of freedom to be changed at each MC step or randomized at each MC(JBW) step. A degree of freedom is either a bond length, bond angle or torsion or a molecular translation or rotation along or about a single axis. This number should be set in such a way to provide a compromise between acceptance rate and conformational interconversions. When args 1 and 2 of this command differ, MCNV defines a range for the number of degrees of freedom to be changed at each step. When the two arguments are identical, MCNV defines an exact number for the degrees of freedom to be changed at each step. In both cases, the initial values will be modified during the run by the adaptive mechanism unless debug 103 is defined.
For MC, MC(JBW) or MCSD simulations, a number of degrees of freedom must be defined and consequently, the MCNV command should be present; however, its two arguments can be 0, in which case the program will provide a default range, from 1 to maximum number of the degrees of freedom of the molecules. Such a range is probably not efficient. For MC(JBW)/SD, no randomization in the JBW part is needed and all the randomization can effectively be done by the SD part of the simulation. Doing so will increase the number of conformational interconversions. In order to achieve that, the MCNV command should be omitted and arg 2 of the MCSD command should be set to a negative number.
|
arg1 |
Minimum number of degrees of freedom altered |
|
0 |
Default: 1 |
|
arg2 |
Maximum number of degrees of freedom changed in a MC step |
|
0 |
Default: number of variable degrees of freedom in the system. |
|
arg4 |
Cluster torsions varied at one time |
|
This is no longer seen as a useful option. |
|
|
0 |
Do not cluster torsions; recommended. |
|
1 |
All torsions rotated will be in a contiguous group as defined by the ordering of torsions in the TORS commands. |
|
2 |
Allow a single intervening unused torsion. |
|
n |
If, at a given stage, m torsions are rotated, these are selected from a contiguous group of (m+n−1) torsions selected from the list taken from the TORS commands. |
SEED — random number generator SEED
This command sets a starting value for the random number generator. Used to start the Monte Carlo random number generator at a different point so that repeated Monte Carlo or molecular dynamics runs give different results. The starting value should not exceed 700 000; if it does it will be divided by 2 until it is less than 700 000.
|
arg1 |
Seed value |
|
−1 |
Use a value generated from the time and date. |
|
0 |
Use the default value. |
|
> 0 |
Use the value specified (must be less than 700 000), or use positive values less than 78593 if the MacroModel random-number generator is used (DEBG |
TORS — variable TORSion selection
Each TORS command specifies up to two torsions, using the numbers of the two central atoms. These will be used as variable dihedral angles by the MCMM, SPMC or MCSM commands. The actual number of torsions which will be varied during a single Monte Carlo step depends on the search method, but the number varied will be taken from the list specified in TORS commands. A given random torsional variation will be plus or minus a random number selected from the range extending from the arg5 to the arg6 specification.
Variable torsions within rings require the ring-closure commands RCA4 in addition to TORS commands. It is advisable to specify at least two variable torsions within each ring containing RCA4 ring closures. The atoms of a ring closures (args 2 and 3 in RCA4) must not be listed as variable torsions.
The minimum and maximum angular increment (arg5 and arg6) refer to the torsions given in arg1-4. It is possible to use a different angular increment for each torsion by using only arg1, arg2, arg5, and arg6 and a different TORS command for each torsion. For global searching, arg5 and arg6 of 0.0° and 180.0° are appropriate values. If you wish to focus the search on conformations having only small angular variations from the starting conformation, a value of 30.0° for arg6 could be used.
If you are searching multicyclic ring systems, you should include CHIG commands for any substituted atoms at the ring closure atoms (arg2 and arg3 in RCA4) to assure maintenance of stereochemistry.
When doing substructure MC searching, always order the pairs of atoms defining torsions such that the second atom of each pair is not connected to any fixed atoms (FXAT) except via the first atom (in the torsional movements, the chain connected to the second atom is the one that will actually be moved). If both ends are anchored by FXAT commands, then a ring closure (RCA4) command will be necessary. TORS and MOLS commands can be used together.
|
arg1-2 |
Atom numbers specifying first torsion |
|
arg3-4 |
Atom numbers specifying second torsion |
|
arg5 |
Minimum dihedral angle variation |
|
A positive number in degrees (default: 0.0°). |
|
|
arg6 |
Maximum dihedral angle variation |
|
A positive number in degrees (default: 180.0°). |
TRES — Torsional RESolution
This command may be used in the SPMC systematic pseudo Monte Carlo searches to alter the initial resolution of the search around a particular torsion angle. This command must come after the TORS command.
|
arg1-2 |
Atoms defining the torsional angle |
|
These must have been listed already with a TORS command. |
|
|
arg3 |
Resolution for this angle |
|
The value in degrees of the initial moves will be 360°/arg3; thus arg3=3 (the default) gives 120° resolution. |
MOLS — variable MOLeculeS selection
This command selects molecules to be independently rotated and/or translated during a conformational search. It is used for configurational/conformational searches of complexes. In particular, given a docked bimolecular complex, MOLS can be used to translate and rotate the smaller molecule within the binding site of the larger one in order to explore possible binding geometries. MOLS and TORS commands can be used together; this allows the internal geometry of the separate molecules to be explored together with the relative orientation. If there are N molecules in the system, it suffices to specify N−1 of them in MOLS commands to ensure that all are adjusted.
MOLS commands must come after TORS commands.
During a search, random molecules are selected for motion. For each molecule, rotation about all three axes and translation along all three axes are performed by amounts selected randomly from within the ranges specified in arg5-6 and arg7-8. MOLS has the same relationship to LIGB as TORS has to RCA4: the bonds specified in a LIGB command are broken before the MOLS-specified molecular motion is carried out, then the LIGB bonds are remade prior to minimization of the resulting structure.
Note: During a long enough search, if pair-list cutoffs are in effect, as is normal (see EXNB), one molecule may eventually wander far enough away from another that no nonbonded energies between them exist anymore. In this situation, further searching just explores random spatial dispositions of this pair with no energetic contribution from their mutual interaction. Unless already-found binding conformations are lower in energy by at least the DEMX-specified energy, this will lead to essentially an infinite and fruitless search of conformational space. To avoid this, use FXDI to constrain the distance between atom pairs spanning pairs of molecules to some maximum distance. This maximum distance should be specified as the half-width of an FXDI potential. In practice, we do this only if we encounter difficulties without doing so.
|
arg1-4 |
Atom in a molecule to be moved |
|
Each nonzero atom given specifies independent motion of the entire molecule containing the atom. Thus, to specify independent motion of two molecules, put an atom number from the first in arg1 and an atom number from the second in arg2. |
|
|
< 0 |
Perform rotations about the atom number given. |
|
> 0 |
Perform rotations about the center of mass of the molecule containing the atom. |
|
arg5 |
Minimum rotational variation |
|
Angle in degrees (default: 0.0). |
|
|
arg6 |
Maximum rotational variation |
|
Angle in degrees (default: 0.0, i.e., no rotation). 180° is a reasonable value. |
|
|
arg7 |
Minimum translational variation |
|
Movement in Angstroms (default: 0.0). |
|
|
arg8 |
Maximum translational variation |
|
Movement in Angstroms (default: 0.0). A value in the range 3-5 Å is reasonable value. |
LIGB — LIGand Bonds
Used chiefly for configurational searches of inorganic complexes in conjunction with the VDWB command.
This command defines bonds to be broken, creating molecular fragments which will be moved independently during the search. A MOLS command must be present for each fragment to be moved. For a bidentate ligand, there will be two LIGB bonds specified for the single MOLS command that moves the ligand; for a tridentate ligand, there will be three, and so on.
This procedure allows an MCMM search to find, for example, both mer and fac isomers of an octahedral complex with stoichiometry MA3B3. LIGB can also be used to extend a conformational search to a configurational search; for example, by specifying bonds to chiral carbons as LIGB bonds, the R and S configurations about these carbons will be explored.
|
arg1 |
First atom in bond to be broken (typically a metal atom) |
|
arg2 |
Bond(s) to be broken |
|
0 |
Add all bonds to arg1 to the |
|
> 0 |
Add the arg1-arg2 bond to the |
|
< 0 |
Remove the bond between arg1 and |arg2| from the |
RCA4 — Ring Closure Atoms
This command directs the program to temporarily break a bond to sever a ring for the purpose of Monte Carlo torsion angle searching. The first 4 arguments are four atoms within the ring which comprise a ring-closure torsion angle. One RCA4 command is necessary for each ring with dihedral angles specified in a TORS command. Used with MCMM, SPMC, IMPS, and MCSM.
With MCMM and MCSM, and for small-medium rings, arg5 and arg6 should be approximately 0.5 and 2.5. For large rings, arg5 and arg6 should be ca 0.1 and 5.0.
With SPMC and for small-medium rings, arg5 and arg6 should be approximately 1.0 and 2.0. For larger rings, arg5 and arg6 should be ca 0.5 and 3.0-4.0. The exact choice is not very important but has a minor effect on search efficiency.
It is forbidden that an arg2 or arg3 atom be used in more than one RCA4 command (common ring closure atoms are not allowed). The closure angle arguments (arg7 and arg8) are optional and refer to both closure angles 1-2-3 and 2-3-4. Our tests so far indicate that there is little reason to use arg7 and arg8 closure angle constraints.
|
arg1-4 |
Atom numbers within ring |
|
Four contiguous atoms within a ring. The closure bond is the one between arg2 and arg3. |
|
|
arg5 |
Minimum allowable ring closure distance (Å) (default: 0) |
|
arg6 |
Maximum allowable ring closure distance (Å) |
|
The default is essentially infinite, so that ring-closure distance never precludes minimization on a structure produced by Monte Carlo variations. |
|
|
arg7 |
Minimum allowable closure angle (°) |
|
Default = 0.0 |
|
|
arg8 |
Maximum allowable closure angle (°) |
|
The default is 180°, so that ring-closure angle never precludes minimization on a structure produced by Monte Carlo variations. |
DISC — DIStance Check
This command causes the distance between atoms given in arg1 and arg2 to be monitored and structures which have such distances outside the allowable range to be eliminated from the output file. Used only with MULT, MCSM, and MCMM.
|
arg1-2 |
Atom numbers defining distance |
|
arg4 |
Control |
|
0 |
Check distance before minimization. |
|
1 |
Check distance after minimization. |
|
arg5 |
Minimum allowable distance (Å) |
|
arg6 |
Maximum allowable distance (Å) |
TORC — TORsional Check
This command causes the absolute value of the torsion angle defined by arg1-4 to be monitored and structures which have such torsions outside the allowable range to be eliminated from the output file. Used only with MULT, MCSM, and MCMM. The test is applied after minimization in each case, and acts as a filter, particularly during conformational searches.
This command is intended to reject unwanted cis or trans isomers created during a conformation search. Appropriate values for rejecting all trans isomers of a cis torsion (original torsion angle ~0) are arg5=0 and arg6=90. To reject all cis isomers of a trans torsion (original torsion angle ~180), use arg5=90, and arg6=180.
Note: Torsion checks do not limit the range of conformers searched or constrain the torsion during minimization. See FXTA for details on applying a torsional constraint during minimization.
|
arg1-4 |
Atom numbers specifying a dihedral angle |
|
arg5 |
Minimum allowable angle (|degrees|) |
|
arg6 |
Maximum allowable angle (|degrees|) |
MCMF — Monte Carlo Maximum constraint Failures
This command allows you to adjust the maximum number of constraint failures, e.g. ring closures (RCA4) or constrained torsion (TORC), that are allowed before accepting a faulty structure. A limit of some kind is appropriate since the user could supply constraints that make valid structure generation impossible. Thus the program tries a given starting geometry and a given number of varying torsions (or other coordinates) repeatedly until arg1 failures (or the default of 10000) have occurred. Then, the program allows a different number of varying torsions and makes arg1 new attempts to create a valid structure. This process is repeated arg2 times before a faulty structure is accepted. Thus when constraint tests have failed arg1 times, the program allows arg2 failures (of arg1 tries each) before accepting the structure anyway. Such faulty structures often fail to minimize properly.
|
arg1 |
Maximum number of constraint failures with constant torsion set (default: 10000) |
|
arg2 |
Maximum number of attempts with variable torsion sets (default: 10) |
SMPL — monte carlo SaMPLing
This command allows one to write sample structures to disk during a Monte Carlo Single Minimum (MCSM) run.
|
arg1 |
Sampling interval |
|
0 |
Write the last structure sampled and the global minimum; the latter appears last in the output file. |
|
n |
Write every nth structure sampled. |
SRNO— SeRial Number Ordering
|
arg1 |
Number to be added to serial number property in output structure file |