Program : MOLEMAN2
Version : 990504
Author : Gerard J. Kleywegt, Dept. of Cell and Molecular Biology,
Uppsala University, Biomedical Centre, Box 590,
SE-751 24 Uppsala, SWEDEN
E-mail : gerard@xray.bmc.uu.se
Purpose : manipulation and analysis of PDB files
Package : X-UTIL
Reference(s) for this program:
* 1 * G.J. Kleywegt (1995). Dictionaries for Heteros. CCP4/ESF-EACBM Newsletter on Protein Crystallography 31, June 1995, pp. 45-50. [http://alpha2.bmc.uu.se/usf/factory_5.html]
* 2 * G.J. Kleywegt (1996). Making the most of your search model. CCP4/ESF-EACBM Newsletter on Protein Crystallography 32, June 1996, pp. 32-36. [http://alpha2.bmc.uu.se/usf/factory_6.html]
* 3 * G.J. Kleywegt & T.A. Jones (1996). Phi/Psi-chology: Ramachandran revisited. Structure 4, 1395-1400. [http://www4.ncbi.nlm.nih.gov/htbin-post/Entrez/query?uid=8994966&form=6&db=m&Dopt=r]
* 4 * G.J. Kleywegt & T.A. Jones (1997). Model-building and refinement practice. Methods in Enzymology 277, 208-230. [http://alpha2.bmc.uu.se/~gerard/gmrp/gmrp.html]
* 5 * G.J. Kleywegt (1997). Validation of protein models from CA coordinates alone. J Mol Biol 273, 371-376. [http://www4.ncbi.nlm.nih.gov/htbin-post/Entrez/query?uid=9344745&form=6&db=m&Dopt=r]
* 6 * G.J. Kleywegt (1999 ?). Experimental assessment of differences between related protein crystal structures. Submitted.
* 7 * G.J. Kleywegt & T.A. Jones (1999 ?). Chapter 25.2.6. O and associated programs. Int. Tables for Crystallography, Volume F. To be published.
951107 - 0.1 - initial programming
951108 - 0.2 - more (REad and APpend commands)
960216 - 0.3 - more (some BF and OC commands; implemented main/side;
SElections; first documentation)
960217 - 0.4 - WRite command; PDb CRystal/HEtero; PRotein MC_analysis
960222 - 0.5 - PRotein SC_analysis
960223 - 0.6 - PRotein CA_analysis; COnstant; BF BOnded; BF SMooth;
STatistics; more SElect ANd/OR options; some XYz commands
960226 - 0.7 - more XYz commands; BF/OC PRod_plus; first CHain commands
960227 - 0.8 - SPlit command; more CHain commands; more PDb commands
960301 - 0.9 - SElect NUmeric; LIst_selected; ONo RSr/FIt/COnnect/TOrsion
960303 - 0.10 - implemented macro facility; split PDb REmark command
into three separate ones (LIst, DElete, REmark); changed
some parameters from optional to required to make them
useful for macros (e.g., XYz ROtate/TRanslate); ONo
DIsulfide and ONo WAter_fit
960304 - 0.11 - ONo OOps; DIstance PLot, DIstribution, SHort, SElect;
GEometry_selected; first useful version for Uppsala
960311 - 0.12 - PDb NAme/NUmber; ONo XPlor_hydrogens; SQuence LIst/PIr;
SQuence GLyco_sites/MOtif
960405 - 0.13 - minor bug fixes; MUlti_geom
960408 - 0.14 - XYz CEntre_origin and XYz ALign_inertia_axes; first
general release
960410 - 0.15 - add END card at end of PDB file (oops ...)
960411 - 0.16 - debug generate.inp file for X-PLOR with SPlit command;
debug write CCP4 format (include cryst1 etc. cards)
960412 - 0.17 - implemented LS_plane and ONo LS_plane_odl; SQuence
COunt, EXtinction_280; added optional parameter to
the "?" command which may be the name of any command
that has sub-commands; DELETE_molecule command;
AUto SPink, BOnes
960414 - 0.18 - AUto SSe; lot of debugging in the core AUto subroutine
to fix instabilities -> the new algorithm appears to
be stable (and no longer has a random component ;-)
960415 - 0.19 - minor bug fixes
960416 - 0.20 - minor bug fixes
960513 - 0.21 - correct naming of OT1/OT2 with SPlit command; includes
automatic generation of OT2 if necessary !
960517 - 0.22 - implemented simple SYMBOL mechanism (& command)
960520 - 0.23 - check for atoms which have X~Y~Z when reading a PDB
file (and in the BOok_keep command); new PDb command
CHemical+charge to add the symbol of the chemical
element and the charge to columns 77-80 of the ATOM and
HETATM records
960521 - 1.0 - added optional "use_masses" parameter to the XYz
CEntre_origin command; this version stable and useful
enough for goverment work
960629 - 1.0.1 - several small bug fixes made while at Yale
960801 - 1.0.2 - added code to PRotein CA_analysis command to look
for sequential stretches of poor residues; this
also seems to detect some register errors !
960802 - 1.0.3 - vastly improved ONo LS_plane command
960804 - 1.0.4 - make border around atoms a parameter for ONo LS_plane
960805 - 1.1 - MUlti_geometry now correctly averages dihedrals,
i.e., using RTODEG*ATAN2(AVESIN,AVECOS)
961101 - 1.1.1 - change X-PLOR "generate" input files from SPlit command
so as to delete hydrogens and atoms with unknown coordinates
970124 - 1.1.2 - minor bug fix (SE ? would crash on Alphas)
970211 - 1.1.3 - the SQ PIr command to create a PIR file now only writes
one-letter code for residues for which at least one atom
has been selected (so you can easily avoid getting
hundreds of '?' residues for your waters etc.)
970626 - 1.2 - support initialisation macro (setenv GKMOLEMAN2 macrofile)
970701 - 1.2.1 - check for weird B-factors and occupancies while reading
a new PDB file
970714 - 1.2.2 - implemented X-PLOR polars and X-PLOR/Lattmann Euler
angles in XYz ROtate
970723 - 2.0 - implemented VRML commands
970724 - 2.1 - added VRml CEll command; SElect NUmeric can now also
select on atomic Mass, Covalent bond radius and
chemical Element number; added SElect BUtnot,
SElect BY_residue and SElect Dist_to_sel; SElect
NUmeric can now have AND, OR or BUTNOT; VRml FAt_trace
970729 - 2.1.1 - fixed bug in calculation of radius-of-gyration
970807 - 2.1.2 - allow "?" wildcards in atom names in library file
(e.g., some people call their water oxygen " O ",
others " O1 ", " OHH", etc.; use " O??" in the library
to capture all of these; similarly for metal ions)
970924 - 2.1.3 - new PDb NO_atom_numbers command to remove O-style
atom numbers (indicating chemical element type)
980420 - 2.1.4 - fixed bug in SElect DIst command (wrong parameters
were passed to the subroutine)
981009 - 2.1.5 - improved macro generated by ONo OOps_macro command
981014 - 2.1.6 - correct ONo COnnect and TOrsion datablocks even if
hydrogen atoms are present
981021 - 2.1.7 - new ECho command to echo command-line input (useful
in scripts)
981022 - 2.2 - implemented command history (# command)
981216 - 2.2.1 - added some comments to output PostScript files
990223 - 2.2.2 - doubled max nr of atoms and residues; removed
"on_off" commands from O macros generated by MOLEMAN2
990301 - 2.2.3 - echo some PDB header lines when reading a PDB file
990504 - 2.3 - ANISOU cards are now read and written
- the SElect ANd, OR and BUtnot commands can now also
be used with the attributes ALtloc (alternative
location identifier, e.g. A, B, X, " ", etc.) and
ANisou (can be either T(rue) or F(alse))
- new BFactor NO_anisou command to delete all ANISOUs
- up to 20 least-squares planes (ONo LS_plane command)
can be stored, and their mutual angles calculated
with the new ONo ANgle_ls_planes command
From version 1.2 on, MOLEMAN2 can execute a macro at start-up (whether it is run interactively or in batch mode). This can be used to execute commands which you (almost) always want to have executed. To use this feature, set the environment variable GKMOLEMAN2 to point to a MOLEMAN2 macro file, e.g.:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- setenv GKMOLEMAN2 /home/gerard/moleman2.init ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 is a new version of the old MOLEMAN program. It can be used for all sorts of manipulation and analysis of PDB files. It is intended primarily for practicing crystallographers who need to do hundreds of little things to their PDB files when switching between different programs, etc. Users of O, CCP4 and X-PLOR will benefit most from the functionality of this program.
The user-interface is different from that of the old MOLEMAN, and more similar
to that of MAPMAN, DATAMAN, etc. I.e., instead of question-and-answer
game you can supply any or all parameters for a command on one line.
For example, to read a PDB file, "re file.pdb" is enough. The other
two parameters of the REad command (format and option to read hydrogens)
will be set to their default values. For all commands, the first
two characters are unique, so "re" is the same as "read", etc. Optional
parameters are enclosed in [square brackets] in the list of commands.
NOTE: the DELETE_molecule command is an exception and requires the
first six characters to be typed (i.e., the word "delete"); this
is to reduce the risk of accidental deletion of your molecule !
An important difference with the old MOLEMAN is the fact that you can select subsets of atoms which will be used by many commands. In this fashion, you can use the same command ("bfactor stats") to get statistics about all atoms, all protein atoms, all main-chain atoms in segment XYZ1, etc. This makes the program much more flexible and easier to maintain (since no special-purpose options are necessary for different possible subsets of atoms).
Another important difference is that a library file is used which contains information about residues, such as their constituent main-chain and side-chain atoms, their type (protein, metal, carbohydrate, etc.), aliases (e.g., waters may be called WAT, HOH, H2O, etc.), and so on. You can use residue types in your selections so that it is very easy to get B-factor statistics for all non-hydrogen carbohydrate atoms with segment id CRB1, for instance.
MOLEMAN2 also allows you to write and execute macros for series of commands that you execute often (e.g., when going from a new X-PLOR model to a PDB file suitable for O and CCP4).
Many commands have a built-in mini-help facility which explains what the parameters to the command are or what values they may have. For instance, if you type "write ?" the program will explain what the parameters are:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: WRite file [format] [which]
file = PDB file name
format = Pdb | Xplor | Ccp4
which = ALl | NO_hydro | SElected | PAla |
PGly | PSer | CAlpha
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Parameters in [square brackets] are optional and will default to the first value listed (e.g., in this example, the default is to write ALl atoms to a Pdb-formatted file).
For parameters which can have several values, the UPPERCASE characters show how many characters define a unique value. In the example above, you can enter P, X or C for the format, but for the "which" parameter you must supply (at least) *two* characters.
You will be prompted to supply values for all parameters that you do not type on the command line, except those for which default values exist. Usually, the values suggested by the program make sense (if not, let me know).
If you need to provide a text parameter which contains spaces (such as the spacegroup symbol for the PDB CRYST1 record), enclose the whole string in "double quotes". Otherwise, one or more blanks and/or tabs are used to delimit parameter values.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pd cr 84 84 111.8 90 90 90 8 "P 21 21 2" Unit-cell axes (A) : ( 84.000 84.000 111.800) Unit-cell angles (deg) : ( 90.000 90.000 90.000) Unit-cell volume (A3) : ( 7.889E+05) Nr of molecules in cell : ( 8) Spacegroup symbol : (P 21 21 2) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The dimensioning of the program (e.g., the maximum number of atoms, etc.) is shown at startup. If you need a bigger version. let me know.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- Array dimensioning:1) Library: Max nr of residue types : ( 200) Max nr of atom types : ( 2000) Max nr of residue aliases : ( 100) Nr of defined residue classes : ( 100)
2) Molecule: Max nr of atoms : ( 100000) Max nr of residues : ( 10000) Max nr of REMARK records : ( 1000) Max nr of other records : ( 1000)
3) Program: Max buffer size : ( 524288) Max nr of atoms per residue : ( 100) Max nr of residue torsions : ( 100) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
In MOLEMAN2 you can also use macros (as in MAMA). A macro is a small text file containing MOLEMAN2 commands (but usually few parameters; these are left to the user to enter on demand) and comments. A simple macro to convert an X-PLOR PDB file into one suitable for O and CCP4 may look as follows:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- ! xplor_to_ccp4.momac - gj kleywegt @ 960303 ! ! MOLEMAN2 macro to go from an X-PLOR PDB file with segment ids ! and hydrogen atoms, to a ccp4/o file with chain names and ! no hydrogens etc. ! ! Enter X-PLOR PDB file name: read ! ! Some information about the molecule(s) statistics ! ! Generate chain names from segment IDs chain from_segid auto ! ! Enter cell constants etc.: pdb crystal ! ! Enter * to delete all X-PLOR remarks: pdb delete_remark ! ! Enter a descriptive remark about the file: pdb remark ! ! Enter CCP4/O PDB file name: write ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
When executed (with the @ command), this will give the following:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > @xplor_to_ccp4.momac ... Opened macro file : (xplor_to_ccp4.momac) ... On unit : ( 61) > (!) > (! xplor_to_ccp4.momac - gj kleywegt @ 960303) > (!) > (! MOLEMAN2 macro to go from an X-PLOR PDB file with segment ids) > (! and hydrogen atoms, to a ccp4/o file with chain names and) > (! no hydrogens etc.) > (!) > (! Enter X-PLOR PDB file name:) > (read) PDB file ? (m1.pdb) hydro.pdb Reading from file : (hydro.pdb) ... > (! Some information about the molecule(s)) > (statistics) Nr of atoms : ( 5794) Nr of residues : ( 802) ... > (! Generate chain names from segment IDs) > (chain from_segid auto)RESIDUE ALA 86 AAAA New chain name : (A)
RESIDUE NAG 501 BBBB New chain name : (B) ... > (! Enter cell constants etc.:) > (pdb crystal) A axis (A) ? ( 1.00) 49.1 B axis (A) ? ( 1.00) 75.8 C axis (A) ? ( 1.00) 92.9 Alpha angle (deg) ? ( 90.00) Beta angle (deg) ? ( 90.00) 103.2 Gamma angle (deg) ? ( 90.00) Nr of molecules in cell ? ( 1) 4 Spacegroup symbol ? (P 1) P 21 ... > (! Enter * to delete all X-PLOR remarks:) > (pdb delete_remark) Which ? (-1) * Delete all REMARK records > (!) > (! Enter a descriptive remark about the file:) > (pdb remark) Text ? (???) Model M3 @ 960303 R=0.231 Rfree=0.273 Add REMARK record : (Model M3 @ 960303 R=0.231 Rfree=0.273) 1: REMARK Model M3 @ 960303 R=0.231 Rfree=0.273 > (!) > (! Enter CCP4/O PDB file name:) > (write) PDB file ? (hydro.pdb) m3.pdb Output PDB file : (m3.pdb) Format : (Pdb) Atoms : (ALl) ... ... End of macro file ... Control returned to terminal ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Generally useful macros will be made available via the public domain OMAC directory (/nfs/public/omac in Uppsala; pub/gerard/omac for downloading from other sites).
Note that macros may be nested (the level depends on how many files may be open at the same time on your paticular type of machine). For instance, a macro which converts a new X-PLOR model into a PDB for O/CCP4, and does some quality analysis, and lists B-factor statistics may look as simple as this (assuming your directory contains a soft link called "omac" to your local OMAC directory):
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- ! ! new_model.momac - gj kleywegt @ 960303 ! ! MOLEMAN2 macro to generate an O/CCP4 PDB file from an X-PLOR PDB ! file; analyse main-chain, side-chain and CA-geometry, and list ! B-factor statistics ! @omac/xplor_to_ccp4.momac @omac/prot_qual.momac @omac/bfac_stats.momac ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
A number of features and options from the old MOLEMAN have been dropped (but many more have been improved ;-). If you need any of these, use the old MOLEMAN program. Dropped features include:
- Balasubramanian plots
- HPGL and O2D files for Ramachadran plots
- occupancy plots (if someone really needs them, let me know)
- averaging temperature factors over different chains
- BAD files (i.e., internal coordinates, Bond-distances, Angles,
Dihedrals)
- flag-colour datablocks (e.g., to colour your molecule according
to the Dutch or Swedish flag)
Note that MOLEMAN2 versions below 1.0 are still development versions, so not all the functionality may have been implemented yet !
The following list shows the old MOLEMAN commands and their counterparts in the new MOLEMAN2 (up-to-date at 960304):
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- READ_pdb_file = REad NO_H_read = REad ALWYn_format_read = REad APPEnd_pdb_file = APpend WRITe_pdb_file = WRite DUMP_pdb_file = WRite SPLIt_pdb_file = SPlit EXPOrt_bad_file = NOT SUPPORTED IMPOrt_bad_file = NOT SUPPORTED SAME_export = NOT SUPPORTED HELIx_generate = AUto SPink/BOnes/SSe STRAnd_generate = AUto SPink/BOnes/SSe QUIT = QUitREMArk_etc_cards = PDb REmark/LIst_remark/DElete_remark CRYStal_PDB_card = PDb CRystal SSBOnd_records = PDb SSbond PIR_sequence_file = SQuence PIr GLYCo_sites = SQuence GLyco_sites EXTInction_280 = SQuence EXtinction_280 TALLy_residues = SQuence COunt COUNt_elements = not implemented yet WATEr_sort = not implemented yet
RSR_datablock = ONo RSr CONNect_file = ONo COnnect TORSion_datablock = ONo TOrs RSFIt_datablock = ONo FIt DISUlfide_ODL_file = ONo DIsulfide FIT_water_macro = ONo WAter_fit FLAG_colours = NOT SUPPORTED
STATistics = STatistics PLOT_Bs_or_Qs = BFactor PLot RADIal_B_plot = BFactor PLot RAMAchandran_plot = PRotein MC_analysis PLANar_peptides = PRotein MC_analysis BALAsubramanian_plot = NOT SUPPORTED CHI_list = PRotein SC_analysis CA_Ramachandran_plot = PRotein CA_analysis CACA_distances = PRotein CA_analysis CA_Distance_plot = DIstance PLot LIST_residue = LIst_selected GEOMetry_list = GEometry_selected SEQUence_list = LIst_selected/SQuence LIst BURIed_charges = not implemented yet DISTance_distribution = DIstance DIstribution SHORt_contacts = DIstance SHort
LIMIt_B_and_Q = BFactor/OCcupancy LImit AVERage_temp_factors = BFactor GRoup TEMP_factors_set = BFactor LImit/PRod_plus OCCUpancies_set = OCcupancy LImit/PRod_plus SMOOth_Bs = BFactor SMooth B_Q_statistics = BFactor/OCcupancy STats BONDed_Bs = BFactor BOnded NONBonded_Bs = BFactor BOnded
O2XHydrogens = ONo XPlor_hydrogens SUGGest_OT2 = CHain OT2_suggest RENUmber_atoms = NOT SUPPORTED ALTEr_residue_name = PDb NAme RESIdu_renumber = PDb NUmber ZONE_renumber = PDb NUmber CHECk_nomenclature = PRotein SC_analysis CORRect_nomenclature = not implemented yet CHAIn_name = CHain NAme_selection/REname XPLOr_ids = CHain NAme_selection/SEgid_rename FROM_chain_to_XID = CHain TO_segid XID_to_chain = CHain FRom_segid AUTO_chain_segid = CHain AUto ASK_auto_chain_segid = CHain ASk
FRACtional_to_cartesian = XYz ORthogonalise CARTesian_to_fractional = XYz FRactionalise ROTAte_molecule = XYz ROtate TRANslate_molecule = XYz TRanslate APPLy_random_rotation = XYz RAndom_rotation RANDom_shifts = XYz PErturb ORIGin_move = XYz CEntre_origin MIRRor_zone = XYz MIrror INVErt_zone = XYz INvert ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
On the whole, MOLEMAN2 provides a superset of the functionality of the old
MOLEMAN, with many additional benefits:
- removed bugs
- more use of (sensible) defaults to minimise typing (compare the new
WRite command to the one in the old MOLEMAN)
- use of atom selections to operate on only a subset of the atoms
- combined commands (e.g., the PRotein MC_analysis combines several old
MOLEMAN commands, plus new functionality)
- extended functionality (e.g., the XYz commands are much more flexible
than the combined set of old MOLEMAN commands they replace; also the
SPlit command will now auto-generate a GENERATE input file for X-PLOR
which usually requires little editing; the PDb HEtero command is new)
- the program is now easier to extend since I have actually thought a bit
about my data structures (not too much, of course, I'm still a Fortran
relic). The old MOLEMAN was initially written to do just two simple
things: add X-PLOR segment IDs and write an END card at the end of
PDB files ;-) Since then it has outgrown itself rapidly, making it
a major pain to implement or change functionality.
Not written yet.
The default library file in Uppsala can be found in /nfs/public/lib, with name "moleman2.lib". If you set the environment variable GKLIB in your .cshrc file to /nfs/public/lib, the program will always come up with the correct default name for this name. Outside Uppsala, the library file can be found in directory pub/gerard/xutil in the compressed tar file xutil_etc.dirtar.Z.
The library file must be read on start-up. The program will prompt you for the filename.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
...
Max nr of residue torsions : ( 100)
Name of library file ? (/nfs/public/lib/moleman2.lib)
Reading library ...
> ( = MOLEMAN2.LIB = VERSION 0.1 = 951106 = GJ KLEYWEGT = 118 ENTRIES =)
Lines read : ( 785)
Residue types : ( 118)
Atom types : ( 1314)
Aliases : ( 60)
First and last residue types:
Residue # 1 = GLY (PROT) = GLYCINE
Atoms | N | (T) | CA | (T) | C | (T) | O | (T)
Residue # 118 = TRS (ORGA) = TRIS TRIS(HYDROXYMETHYL)-AMINOMETHANE
Atoms | O1 | (F) | C2 | (F) | C3 | (F) | C4 | (F) | O5 | (F) | C6 | (F)
Atoms | O7 | (F) | N8 | (F)
Check integrity:
WARNING - name or alias conflict: 55 = MAL and 95 = MAL
WARNING - name or alias conflict: 60 = GLC and 61 = GLC
WARNING - name or alias conflict: 61 = GLC and 60 = GLC
WARNING - name or alias conflict: 63 = MAN and 64 = MAN
WARNING - name or alias conflict: 64 = MAN and 63 = MAN
WARNING - name or alias conflict: 95 = MAL and 55 = MAL
ERROR --- Non-unique residue names/aliases
Count types:
Nr of amino acid residue types : ( 22)
Nr of nucleic acid types : ( 4)
Nr of water types : ( 1)
Nr of metal types : ( 13)
Nr of inorganic types : ( 12)
Nr of carbohydrate types : ( 18)
Nr of organic compound types : ( 48)
Nr of other compound types : ( 0)
MOLEMAN2 commands :
...
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
As you can see, there are some duplicate names (not necessarily for identical compounds, e.g. alpha- and beta-glucose can both be called GLC). The program will use the first occurrence.
The format is as follows (in case you want to edit the file or add new residue type definitions):
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- For each residue type:RES XYZ description FORMAT: (A3,1x,A3,1x,A80) - XYZ = 3-letter residue type - description = free text TYP ABCDefghijkl FORMAT: (A3,1x,A4) - ABCD = one of the defined categories, e.g. protein AKA ABC DEF GHI FORMAT: (A3,n(1x,A3)) (multiple cards allowed) - 0 or more synonyms for the residue type MCH ... FORMAT: first card (A3,1X,*) subsequent cards (4X,*) - 0 or more atom names which constitute the main chain or backbone SCH ... FORMAT: first card (A3,1X,*) subsequent cards (4X,*) - 0 or more atom names which constitute the side chain END FORMAT: (A3) - signals end of residue definition
in MCH and SCH lines, use a "-" at the end of a line to signal continuation on the next one; start the next line with 5 spaces !
any line beginning with "!" is a comment which will not be printed
any line beginning with REM is a comment which will be printed ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The pre-defined residue types are: PROTein, NUCLeic acid, WATEr, METAl ions, INORganic ions and clusters, CARBohydrates, and ORGAnic ligands, ions, substrates, co-factors etc. Anything else will be classified as HETEro.
An example:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- RES CYS cysteine TYP protein AKA CSS CSH CYH CYX MCH ' N ' ' CA ' ' C ' ' O ' SCH ' CB ' ' SG ' END ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
If you want your special residue/ligand to be included in the standard distribution version of the library, or if you find any errors, E-mail me (gerard@xray.bmc.uu.se).
If you have compounds which are not in the library, it usually doesn't
matter too much, as long as you realise that:
- the compound will be assigned type HETEro; if it is in actual fact an
unusual amino acid you should add it to the library
- all atoms will be flagged as being side-chain atoms; again, if you
have unusual amino acids or nucleotides, this may not be what you want
- no check for missing/superfluous atoms can be carried out (BOok_keep
command)
Some commands produce plot files for O2D. On an SGI, you can view these plots interactively, and analyse them. On other machines, you can convert them to PostScript (or CricketGraph) files. Use the script OMAC/o2dps to do the conversion to PostScript automatically (and for lots of files at once, if you like).
Other commands directly produce PostScript files. Use your local viewer (e.g., ghostview or ghostscript) to look at these, and/or print them on a PostScript printer.
With the COnstants command a number of program parameters can be altered by the user. These include:
- BLIMLO/BLIMHI/QLIMLO/QLIMHI - used by the BFacor/OCcupancy LImit
commands
- MXCACA - maximum CA-CA distance for connected residues (used by
several commands)
- TORTOL - tolerance for certain impropers/dihedrals (e.g., used by
the PRotein SC_analysis and ONo TOrsion commands)
- MXBOND - maximum distance for two atoms to be considered bonded
(e.g., used by the BFactor SMooth command)
- MXNONB - maximum distance for non-bonded interactions (two atoms
are involved in a non-bonded interaction if their distance lies
in the range <MXBOND,MXNONB]); used by BFactor BOnded
- MXCYSS - maximum Cys-SG...SG-Cys distance for disulfide links
(e.g., used by the SPlit command to generate DISUlfide patches
for your X-PLOR GENERATE input file)
- ISEED - special (integer) number used to initialise the random
number generator (positive number: initialise with that number,
i.e. reproducibly; zero or negative number: initialise with the
current value of MCLOCK(), the system clock); used by the XYz
RAndom_rotation and PErturb commands
For an up-to-date list, use the COnstants LIst command. To revert to the "factory defaults", use the COnstants REset command.
Note that the values you enter are NOT checked at all. So if you want to set the maximum distance for connected CA atoms to -3.14 A, you may do so.
Syntax: ? [command]
command = name of any command which has sub-commands (e.g., XYz, ONo, etc.)
Typing a single question mark will provide you with a list of all available commands. If you supply an argument "?", you will only see the general commands. If the argument is the name of a command which has sub-commands (such as SElect, XYz, PRotein, etc.), you will only get the list of those commands.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > ? ?MOLEMAN2 commands :
? [command] (list (sub-)commands) ! (comment) QUit $ shell_command @ macro_file BOok_keeping COnstants REset COnstants SEt name value COnstants LIst STatistics GEometry_selected LIst_selected [which] MUlti_geometry which LS_plane
REad file [format] [hydro] WRite file [format] [which] APpend file [format] [hydro] SPlit file_prefix DElete_molecule
Commands with sub-commands: SElect BFactor OCcupancy CHain PDb PRotein XYz ONo DIstance SQuence AUto To see sub-commands, use for instance: ? xy ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > ? seleSElect All SElect NOne SElect HYdrogen SElect EXhydrogen SElect OR what which SElect ANd what which SElect NEgate SElect NUmeric and_or what lo hi SElect ? ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This command can be used to manipulate symbols. These are probably
only useful for advanced users who want to write fancier macros.
The command can be used in three ways:
(1) & ? -> lists currently defined symbols
(2) & symbol value -> sets "SYMBOL" to "value"
(3) & symbol -> prompts the user to supply a value for "SYMBOL"
(even if the program is executing a macro)
A few symbols are predefined:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > & ? Nr of defined symbols : ( 4) Symbol PROGRAM : (MOLEMAN2) Symbol VERSION : (960517/0.22) Symbol START_TIME : (Fri May 17 19:55:43 1996) Symbol USERNAME : (gerard) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The symbol mechanism is fairly simplistic and has some limitations:
- max length of a symbol name is 20 characters
- max length of a symbol value is 80 characters
- max number of symbols is 100
- symbols can not be deleted, but they can be redefined
- symbol values are accessed by supplying $SYMBOL_NAME as an
argument on the command line; the line that you type on the terminal
(or in a macro) is parsed once; if there are additional parameters
which the program prompts you for, you cannot use symbols for those
- only one substitution per argument (e.g., "$file1 $file2" will
lead to a substituion of the entire argument by the value of
symbol FILE1 only !)
- command names (first argument on any command line) cannot be
replaced by a symbol (e.g.: "$command $arg1 $arg2" is not valid)
- symbols may be equated to each other, e.g. "& file2 $file1" will
give FILE2 the same value as FILE1
- symbol substitution is not recursive (e.g., if you set the value
of FILE2 to be "$file1", any reference to $FILE2 will be replaced
by "$file1", not by the value of FILE1
- symbols on comment lines (starting with "!") are not expanded
- symbols on system command lines (starting with "$") are not expanded
Example of the use of symbols:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- ! Which rotation angle convention ? ! Enter CE for CCP4 Euler angles, or CP for CCP4 Polar angles: & conv ! ! Rotation angle 1 ? & alpha ! ! Rotation angle 2 ? & beta ! ! Rotation angle 3 ? & gamma ! ! Applying rotation function solution xyz rotate $conv $alpha $beta $gamma ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
If you run the program with scripts, it is sometimes useful to see input commands echoed. The parameter to the ECho command may be ON, OFf, or ? (to list the echo status).
This command calculates and prints some statistics for the currently selected set of atoms.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > stats Nr of atoms : ( 7038) Nr of residues : ( 1404)Nr of amino acid residues : ( 868) Nr of nucleic acids : ( 0) Nr of waters : ( 529) Nr of metals : ( 0) Nr of inorganics : ( 0) Nr of carbohydrates : ( 4) Nr of organic compounds : ( 0) Nr of other compounds : ( 3)
Nr of selected atoms : ( 7038) Ditto, hydrogen : ( 0)
Item Average St.Dev Min Max ---- ------- ------ --- --- X-coord 10.543 22.100 -33.949 54.913 Y-coord 42.379 23.814 -2.748 86.626 Z-coord 33.375 31.576 -29.766 91.680 B-factor 17.965 8.663 5.310 67.750 Occpncy 1.000 0.000 1.000 1.000
Radius of gyration (A) : ( 46.191) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This command will re-determine where residues start and end, if there
are duplicate atoms or missing atoms or extra atoms in residues
that occur in the library, or if there are residues not in the
library, and if all residue names are unique.
The program also checks if there are any atoms which have X ~ Y ~ Z;
often, this is an indication of unset or unknown coordinates (e.g.,
new atoms in O will be placed at (1500,1500,1500), sometimes waters
end up at (0,0,0) etc.) which may lead to all sorts of trouble
later on (e.g., in map extension around the molecule or the generation
of NCSRel-cards by XPAND). The tolerance used is 0.01 A.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > boTotal nr of residues : ( 239) Nr of amino acid residues : ( 137) Nr of nucleic acids : ( 0) Nr of waters : ( 101) Nr of metals : ( 0) Nr of inorganics : ( 0) Nr of carbohydrates : ( 0) Nr of organic compounds : ( 1) Nr of other compounds : ( 0)
Checking for missing/extra atoms ... ERROR --- Unknown atom in structure ATOM 1091 OXT GLU 137 26.188 -39.912 33.302 1.00 22.23 1CBS1313 ERROR --- Unknown atom in structure ATOM 1115 O HOH 300 15.524 -31.764 26.116 1.00 17.43 1CBS1337 ERROR --- Missing atom in structure RESIDUE HOH 300 1CBS Atom name : ( O1) ... ERROR --- Missing atom in structure RESIDUE HOH 399 1CBS Atom name : ( O1)
Checking uniqueness of residue names ... Non-unique residue names : # 224 = HOH 385 1CBS <-> # 230 = HOH 385 1CBS
Checking "special" positions (X~Y~Z) ... ATOM 1200 O HOH 385 0.000 0.000 0.000 1.00 32.12 1CBS1422 ATOM 1200 O HOH 385 1500.0001500.0001500.000 1.00 32.12 1CBS1422 ATOM 1210 O HOH 395 -36.656 -36.656 -36.656 1.00 35.59 1CBS1432 WARNING - Nr of atoms with X~Y~Z : ( 3) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: LIst_selected [which]
which = Residues | Atoms
You may either list all selected atoms, or the first selected atom (if any) of all residues.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se nu and bf 50 9999 Select Numeric : ( AND B-factor 50.00000 9999.000) Selection history : (NON-HYDROGEN | AND B-factor 50.00000 9999.000 |) Nr of selected atoms : ( 8) MOLEMAN2 > li r List first selected atom of every residue ATOM 807 CE LYS 101 4.479 34.527 19.248 1.00 51.31 1CBS1029 ATOM 819 CD GLU 103 -0.383 25.503 13.395 1.00 50.23 1CBS1041 ATOM 1193 O HOH 378 9.543 16.072 11.145 1.00 50.91 1CBS1415 ATOM 1194 O HOH 379 8.174 14.289 20.240 1.00 54.21 1CBS1416 ATOM 1196 O HOH 381 5.486 15.385 24.922 1.00 50.19 1CBS1418 Nr of residues listed : ( 5) MOLEMAN2 > li a List all selected atoms ATOM 807 CE LYS 101 4.479 34.527 19.248 1.00 51.31 1CBS1029 ATOM 808 NZ LYS 101 4.917 33.952 20.559 1.00 51.14 1CBS1030 ATOM 819 CD GLU 103 -0.383 25.503 13.395 1.00 50.23 1CBS1041 ATOM 820 OE1 GLU 103 -0.130 26.346 12.499 1.00 53.12 1CBS1042 ATOM 821 OE2 GLU 103 -1.464 25.500 14.036 1.00 52.16 1CBS1043 ATOM 1193 O HOH 378 9.543 16.072 11.145 1.00 50.91 1CBS1415 ATOM 1194 O HOH 379 8.174 14.289 20.240 1.00 54.21 1CBS1416 ATOM 1196 O HOH 381 5.486 15.385 24.922 1.00 50.19 1CBS1418 Nr of atoms listed : ( 8) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This command calculates the equation of the least-squares plane
through the currently selected atoms (unit weights), and the
RMSD of the atoms to the plane.
If you want to display the plane in O, use the ONo LS_plane_odl
command instead. Also, if you want to see the distance of
the individual atoms to the plane, use the ONo LS_plane command.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se and res rea AND atom selection With atoms for which : (RES) Equals : (REA) Selection history : (ALL | AND REsidu = REA |) Nr of selected atoms : ( 22) MOLEMAN2 > ls Nr of selected atoms : ( 22) Centre of Gravity : ( 22.065 26.283 20.209) Eigen value 1 = 515.0 Vector : 0.145303 -0.597192 0.788828 Eigen value 2 = 31.0 Vector : 0.847841 0.486099 0.211834 Eigen value 3 = 6.7 Vector : -0.509954 0.638020 0.576955 Determinant : ( 1.000) Eigenvector #3 defines the least-squares plane Equation: -0.509954 X + 0.638020 Y + 0.576955 Z = 17.176491 RMSD to plane : ( 0.552) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Note that the three eigenvalues tell you something about the shape of the selected set of atoms: in the example above (an all-trans- retinoic acid molecule), it is clearly very long, not very wide, and fairly planar (eigenvalue 1 >> 2 > 3).
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- ... Selection history : (ALL | AND REsidu = TRP | AND CLass = Side | AND Residue_nr 109 109 |) Nr of selected atoms : ( 10) MOLEMAN2 > ls Nr of selected atoms : ( 10) Centre of Gravity : ( 13.617 20.280 29.635) Eigen value 1 = 28.3 Vector : 0.044093 0.093234 0.994667 Eigen value 2 = 12.4 Vector : -0.373900 0.924815 -0.070112 Eigen value 3 = 0.0 Vector : 0.926420 0.368814 -0.075638 Determinant : ( -1.000) ERROR --- Negative determinant; change hand of inertia axes Eigen value 1 = 28.3 Vector : -0.044093 -0.093234 -0.994667 Eigen value 2 = 12.4 Vector : 0.373900 -0.924815 0.070112 Eigen value 3 = 0.0 Vector : -0.926420 -0.368814 0.075638 Determinant : ( 1.000) Eigenvector #3 defines the least-squares plane Equation: -0.926420 X + -0.368814 Y + 0.075638 Z = -17.853298 RMSD to plane : ( 0.029) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: MUlti_geometry which
which = residue_type (must be defined in the library)
This command will list statistics (number of observations, average, standard
deviation, minimum and maximum value) for the following for all currently
selected copies of a user-specified residue type:
- bonded distances
- bond angles and 1-3 distances
- torsion angles and 1-4 distances
Note that the (torsion) angles used to be averaged in degrees, and that periodicity is not taken into account ! This means that dihedrals which are around +/-180 degrees may give seemingly large ranges !!! This has been changed in version 1.1, so that the average of -178 and +178 is now (+ or -) 180, rather than zero.
This option may be useful when creating "ideal geometry" dictionaries for a refinement program when you have several examples of the residue. In addition it can be used to look for large outliers.
The program constants LARGEB and LARGEA are used to flag bond distances and angles if they show a large range (i.e., maximum minus minimum value exceeds LARGEB for bonds or LARGEA for angles).
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > mu gly Multiple copy geometry for : (GLY) Nr of atoms : ( 4) Atoms : ( N CA C O) Looking for selected residues ... RESIDUE GLY A 22 1CEL RESIDUE GLY A 23 1CEL ... RESIDUE GLY B 424 1CEL RESIDUE GLY B 427 1CEL ERROR --- Too many copies Maximum : ( 100) Nr of copies found : ( 100) Nr of bonds : ( 3) Bond distance range large if > ( 0.050) Bond angle range large if > ( 5.000)Bonded distances with cut-off : 2.000 A ========================================== N - CA # 100 Ave, Sdv, Min, Max 1.451 0.004 1.440 1.464 CA - C # 100 Ave, Sdv, Min, Max 1.517 0.006 1.501 1.529 C - O # 100 Ave, Sdv, Min, Max 1.232 0.004 1.223 1.244
Angles and 1-3 angle distances ============================== N - CA - C Angle : 100 113.89 4.23 101.49 122.24 Large range 1-3 Dist : 100 2.486 0.059 2.307 2.588 CA - C - O Angle : 100 120.44 0.96 116.96 122.29 Large range 1-3 Dist : 100 2.389 0.015 2.339 2.415
Dihedrals and 1-4 torsion distances =================================== N - CA - C - O Dihedral : 100 5.27 114.42 -178.61 179.19 1-4 Dist : 100 3.150 0.432 2.608 3.695 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > mu glc Multiple copy geometry for : (GLC) Nr of atoms : ( 12) Atoms : ( C1 C2 C3 C4 C5 C6 O1 O2 O3 O4 O5 O6) Looking for selected residues ... RESIDUE GLC 1 RESIDUE GLC 2 RESIDUE GLC 3 RESIDUE GLC 4 RESIDUE GLC 5 RESIDUE GLC 6 RESIDUE GLC 7 RESIDUE GLC 8 Nr of copies found : ( 8) Nr of bonds : ( 11) Bond distance range large if > ( 0.050) Bond angle range large if > ( 5.000)Bonded distances with cut-off : 2.000 A ========================================== C1 - C2 # 8 Ave, Sdv, Min, Max 1.533 0.013 1.513 1.548 C1 - O5 # 8 Ave, Sdv, Min, Max 1.435 0.014 1.414 1.456 C2 - C3 # 8 Ave, Sdv, Min, Max 1.519 0.013 1.503 1.543 C2 - O2 # 8 Ave, Sdv, Min, Max 1.409 0.016 1.385 1.426 C3 - C4 # 8 Ave, Sdv, Min, Max 2.528 0.413 1.497 2.875 Large range C3 - O3 # 8 Ave, Sdv, Min, Max 1.416 0.008 1.411 1.436 C4 - C5 # 8 Ave, Sdv, Min, Max 2.537 0.390 1.544 2.792 Large range C4 - O4 # 1 Ave, Sdv, Min, Max 1.413 0.000 1.413 1.413 ... C6 - O6 # 8 Ave, Sdv, Min, Max 1.410 0.016 1.374 1.426 Large range
Angles and 1-3 angle distances ============================== C2 - C1 - O5 Angle : 8 109.93 2.36 106.56 113.02 Large range 1-3 Dist : 8 2.430 0.027 2.383 2.463 C1 - C2 - C3 Angle : 8 111.72 1.48 110.48 114.24 1-3 Dist : 8 2.526 0.013 2.512 2.546 ... C1 - O5 - C5 Angle : 8 117.02 1.53 115.16 119.50 1-3 Dist : 8 2.454 0.045 2.414 2.529
Dihedrals and 1-4 torsion distances =================================== O5 - C1 - C2 - C3 Dihedral : 8 51.28 7.74 39.17 60.96 1-4 Dist : 8 2.853 0.019 2.839 2.902 O5 - C1 - C2 - O2 Dihedral : 8 37.03 167.50 -179.25 174.97 1-4 Dist : 8 3.662 0.036 3.592 3.694 ... O4 - C4 - C5 - O5 Dihedral : 1 -172.14 0.00 -172.14 -172.14 1-4 Dist : 1 3.718 0.000 3.718 3.718 ... C6 - C5 - O5 - C1 Dihedral : 8 -89.61 149.13 -176.72 168.72 1-4 Dist : 8 3.760 0.052 3.691 3.839 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This command will list the following for all currently selected atoms:
- bonded distances
- bond angles and 1-3 distances
- torsion angles and 1-4 distances
Make sure to use the SElect commands to isolate only those atoms that you are interested in ! Use the LIst_selected command before using this command to see exactly how many (and which) atoms/residues are selected.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se an re rea AND atom selection With atoms for which : (RE) Equals : (REA) Selection history : (ALL | AND REsidu = REA |) Nr of selected atoms : ( 22) MOLEMAN2 > ge Nr of selected atoms : ( 22)Bonded distances with cut-off : 2.000 A ========================================== C1 [B 200 ] - C2 [B 200 ] = 1.546 A C1 [B 200 ] - C6 [B 200 ] = 1.564 A ... C15 [B 200 ] - O2 [B 200 ] = 1.251 A Nr of bonded distances : ( 22)
Angles and 1-3 angle distances ============================== C2 [B 200 ] - C1 [B 200 ] - C6 [B 200 ] = 109.691 deg = 2.543 A C2 [B 200 ] - C1 [B 200 ] - C16 [B 200 ] = 108.110 deg = 2.494 A ... O1 [B 200 ] - C15 [B 200 ] - O2 [B 200 ] = 121.547 deg = 2.183 A Nr of angles : ( 30)
Dihedrals and 1-4 torsion distances =================================== C6 [B 200 ] - C1 [B 200 ] - C2 [B 200 ] - C3 [B 200 ] = -42.758 deg = 2.902 A C16 [B 200 ] - C1 [B 200 ] - C2 [B 200 ] - C3 [B 200 ] = -162.723 deg = 3.847 A ... C13 [B 200 ] - C14 [B 200 ] - C15 [B 200 ] - O2 [B 200 ] = 129.005 deg = 3.505 A Nr of dihedrals : ( 32) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se and ch m AND atom selection With atoms for which : (CH) Equals : (M) Selection history : (ALL | AND CHain = M |) Nr of selected atoms : ( 22) MOLEMAN2 > ge Nr of selected atoms : ( 22)Bonded distances with cut-off : 2.000 A ========================================== C1 [M 602 ] - O1 [M 602 ] = 1.410 A C1 [M 602 ] - C2 [M 602 ] = 1.515 A ... O5 [M 603 ] - C5 [M 603 ] - C6 [M 603 ] - O6 [M 603 ] = -51.669 deg = 2.729 A Nr of dihedrals : ( 43) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se all ... MOLEMAN2 > se and chain a ... MOLEMAN2 > se num and res 85 87 ... MOLEMAN2 > geom Nr of selected atoms : ( 12)Bonded distances with cut-off : 2.000 A ========================================== CB [A 86 ] - CA [A 86 ] = 1.521 A C [A 86 ] - O [A 86 ] = 1.229 A ... CB [A 87 ] - CA [A 87 ] - C [A 87 ] - O [A 87 ] = 86.999 deg = 3.216 A Nr of dihedrals : ( 14) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Any command beginning with an exclamation mark will prompt the program to ignore that line. This may be useful to annotate scripts which run MOLEMAN2.
This will end you current session.
Syntax: $ shell_command
Anything following the dollar sign is passed on to the operating system shell. This can be used to list the files in a directory, or even to run another program from within MOLEMAN2.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > $ ls *.pdb 1cbs.pdb 1cel.pdb hydro.pdb q.pdb twentyz.pdb ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This resets some of the program parameters to the values that I deemed reasonable.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > co re Reset program constants to defaults ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This lists the names of those parameters than can be modified by the user.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > co li BLIMLO = B-factor default minimum : ( 2.000) BLIMHI = B-factor default maximum : ( 50.000) QLIMLO = Occupancy default minimum : ( 0.000) QLIMHI = Occupancy default maximum : ( 1.000) MXCACA = Max connected CA-CA distance : ( 4.500) TORTOL = Torsion/improper tolerance : ( 5.000) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: COnstants SEt name value
name = parameter name
value = new value for this parameter
This enables you to set the value of a program parameter. Use the COnstants LIst option to see what parameters can be set in the current version of the program
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > co set tortol 10 Set TORTOL to 10.000 MOLEMAN2 > co set junk 103 Set JUNK to 103.000 ERROR --- Name not recognised ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: REad file [format] [hydro]
file = PDB file name
format = Pdb | Alwyn
hydro = No | Yes
The Alwyn format is only required when you try to read *very* old PDB
files created with older versions of O.
By default, hydrogen atoms are *stripped* when you read a file, unless
you set the hydro parameter to y(es) !
Once the file has been read, some book-keeping is done. By default,
*all* atoms will be selected (including hydrogen atoms, if they
were read) !
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > re 1cbs.pdb Reading from file : (1cbs.pdb) in normal PDB format ignoring hydrogen atoms>>>>> END card encountered <<<<<
Nr of lines read : ( 1459)
Total nr of residues : ( 238) Nr of amino acid residues : ( 137) Nr of nucleic acids : ( 0) Nr of waters : ( 100) Nr of metals : ( 0) Nr of inorganics : ( 0) Nr of carbohydrates : ( 0) Nr of organic compounds : ( 1) Nr of other compounds : ( 0)
Nr of atoms now : ( 1213) Nr of residues : ( 238) Select ALL atoms Selection history : (ALL |) Nr of selected atoms : ( 1213) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: APpend file [format] [hydro]
Same parameters etc. as for the REad command. Book-keeping is done again, and *all* atoms are selected.
Syntax: WRite file [format] [which]
file = PDB file name
format = Pdb | Xplor | Ccp4
which = ALl | NO_hydro | SElected | PAla |
PGly | PSer | CAlpha
Write (a subset of) the current atoms to a PDB file. You must provide the name of the output file. If the file already exists, an error message is generated and you are asked if you want to overwrite it ("Open file as OLD (Y/N) ?"). If you don't, you can subsequently supply a different file name.
If the format is Pdb, then all records (including any REMARKS etc.) will be written, and atoms which were on HETATM cards on input will be on HETATM cards on output. Cell constants etc. are also included.
If the format is Xplor, only ATOM records will be written (even for HETATMs), and nothing else (no CRYST1, etc.).
If the format is Ccp4, the same is written as for Xplor format, but in addition the CRYST1 etc. cards will added at the top of the file.
You can specify which atoms you want to write out:
- ALl writes all atoms currently in memory
- NO_hydro writes all non-hydrogen atoms
- SElected writes only the currently selected atoms (if any)
- PGly creates a poly-Glycine file (i.e., only main-chain atoms of
protein residues and all residues will be called GLY; the program
recognizes main-chain atoms by their atom names or by the fact
that they have been labelled as such in the library file)
- PAla generates a poly-Alanine file in a similar fashion
- PSer generates a poly-Serine file (in which atoms CG, OG, SG,
OG1 and CG1 will be called OG)
- CAlpha writes only the C-alpha atoms of protein residues
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > wr q.pdb p se Output PDB file : (q.pdb) Format : (Pdb) Atoms : (SE) Number of atoms to write : ( 3220) Nr of atoms written : ( 3220) Nr of lines written : ( 3705)CPU total/user/sys : 1.0 1.0 0.0 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > wr q.pdb ccp pser Output PDB file : (q.pdb) Format : (CCP) Atoms : (PSER) ERROR --- XOPXNA - error # 126 while opening NEW file : q.pdb OPEN : (UNIT= 10 STATUS=NEW CAR_CONTROL=LIST FORM=FORMATTED ACCESS=SEQUENTIAL) Error : (Connection timed out) Open file as OLD (Y/N) ? (N) y Number of atoms to write : ( 4952) Nr of atoms written : ( 4952) Nr of lines written : ( 4952)CPU total/user/sys : 1.5 1.5 0.0 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: SPlit file_prefix
This option is primarily intended for X-PLOR users. As such, it removes much of the dread previously associated with the preparation of files for the GENERATE step.
You provide the prefix for the PDB file names (e.g., /Usr/Billy/Xplor/m1 or ../../xplor/m5tom6/m5 or ./m1). Upper and lower case characters in the prefix will be maintained. For each segment id found in the set of *ALL* atoms, a separate PDB file is written which contains all atoms which have that segment id (even if they are separated in the structure). The name of this file will be your prefix plus an underscore (_) plus the segment id (converted to lowercase and without any spaces) plus the extension ".pdb". For instance, if your prefix is "../Xplor/m1", then segment "PROA" will be written to a file called "../Xplor/m1_proa.pdb".
In addition to all these PDB files, an X-PLOR input file is created for the GENERATE stage. It will need some editing if your structure contains things other than protein and water (i.e.: insert topology and parameter file names; insert any patches that are necessary). The program will also look for any disulfide links and create DISU patch statements for each of these (if any). Note that in the case of for instance iron-sulfur clusters you may have to remove (some of) the DISU patch statements !
IMPORTANT: the segment id *ALONE* is used to identify different segments. In other words: the chain names are ignored (this is in agreement with the current PDB convention in which segment identifiers designate separate entities; a chain may contain multiple segments, e.g. protein + glycosylation + ligand + water).
NOTE: make sure that the segment ids make sense !! If you just read in a PDB file from O or downloaded one from Brookhaven, you probably have to use one or more of the CHain commands (e.g., CHain ASk or CHain AUto) to get correct segmentation. You have yourself to blame for any segmentation faults ;-)
Since the program writes all atoms with a certain segment id to the same file (even if they are interspersed with other segments), you can use the CHain commands to "merge" two previously distinct segments into one. For instance, if you have two-fold NCS, you may initially have refined only waters which obey the NCS and used separate segments for them (e.g., WATA and WATB). When you start to add waters which are not conserved in both molecules, you could create a third segment but this tends to get very confusing as you add and delete waters while you rebuild. Instead, you may want to merge all waters. You can do this by simply giving all of them the same segment id with the CHain commands.
NOTE: from version 0.21 onward, the program tries its best to make sure that OT1 and OT2 are named correctly. Also, if an OT2 atom is missing it will be generated on the fly and written to the PDB file (but it will not be added to your structure in memory !).
NOTE: from version 1.1.1 onward, the default is to delete hydrogen atoms and atoms with unknown coordinates.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > re test.pdb ... MOLEMAN2 > ch au ... MOLEMAN2 > split qqq Split PDB files for X-PLOR File prefix : (qqq) X-PLOR generate input file : (qqq_generate.inp)File nr 1 Segment |AAAA| File name : qqq_aaaa.pdb ATOM 1 CB SER A 4 51.238 19.799 7.294 1.00 34.72 AAAA Found & written OT1 ... Adding OT2 at : ( 39.097 17.619 13.212) Nr of lines written : ( 446) Nr of atoms written : ( 443)
File nr 2 Segment |AAAB| File name : qqq_aaab.pdb ATOM 483 N ASN B 66 26.962 19.074 20.708 1.00 2.00 AAAB Found & written OT1 Found & written OT2 Nr of lines written : ( 3747) Nr of atoms written : ( 3744)
File nr 3 Segment |AAAC| File name : qqq_aaac.pdb ATOM 4227 CB THR C 1 -3.115 5.872 13.814 1.00 77.65 AAAC Found & written OT1 ... Adding OT2 at : ( 1.912 19.586 15.804) Nr of lines written : ( 467) Nr of atoms written : ( 464)
File nr 4 Segment |AAAD| File name : qqq_aaad.pdb ATOM 4691 O1 HOH D 701 -6.674 -2.991 11.360 1.00 15.26 AAAD Nr of lines written : ( 39) Nr of atoms written : ( 36)
File nr 5 Segment |AAAE| File name : qqq_aaae.pdb ATOM 4727 ZN+2 ZNC E 901 -15.390 36.554 20.084 1.00 19.82 AAAE Nr of lines written : ( 5) Nr of atoms written : ( 2)
File nr 6 Segment |AAAF| File name : qqq_aaaf.pdb ATOM 4729 C1 NAG F 990 39.707 11.948 13.548 1.00 77.90 AAAF Nr of lines written : ( 17) Nr of atoms written : ( 14)
Nr of PDB files generated : ( 6)
Looking for disulfides ... Looking for CYS- SG atoms ... ATOM 496 SG CYS B 67 26.853 23.275 24.788 1.00 18.16 AAAB ATOM 719 SG CYS B 94 27.958 21.920 25.822 1.00 2.00 AAAB ATOM 1765 SG CYS B 231 17.706 42.764 23.358 1.00 2.00 AAAB ATOM 1944 SG CYS B 254 15.900 18.643 43.219 1.00 14.44 AAAB ATOM 2032 SG CYS B 265 17.033 17.112 43.932 1.00 13.67 AAAB ATOM 3118 SG CYS B 402 11.202 50.463 20.218 1.00 17.06 AAAB ATOM 4105 SG CYS B 521 12.171 51.649 18.881 1.00 2.00 AAAB ATOM 4247 SG CYS C 3 -0.622 9.196 15.268 1.00 34.29 AAAC ATOM 4355 SG CYS C 17 1.031 6.187 17.911 1.00 53.34 AAAC ATOM 4388 SG CYS C 22 -1.328 8.466 17.020 1.00 31.20 AAAC ATOM 4529 SG CYS C 39 2.164 5.235 19.293 1.00 20.78 AAAC ATOM 4539 SG CYS C 41 -5.604 11.002 25.701 1.00 52.29 AAAC ATOM 4620 SG CYS C 52 -4.745 12.199 24.311 1.00 29.91 AAAC ATOM 4626 SG CYS C 53 -4.229 13.247 16.868 1.00 42.61 AAAC ATOM 4669 SG CYS C 59 -4.012 11.998 15.283 1.00 12.80 AAAC Nr of CYS SG atoms : ( 15) Max SG-SG distance for link : ( 2.200) Disulfide # 1 67 AAAB <-> 94 AAAB @ 2.03 A Disulfide # 2 254 AAAB <-> 265 AAAB @ 2.03 A Disulfide # 3 402 AAAB <-> 521 AAAB @ 2.03 A Disulfide # 4 3 AAAC <-> 22 AAAC @ 2.03 A Disulfide # 5 17 AAAC <-> 39 AAAC @ 2.02 A Disulfide # 6 41 AAAC <-> 52 AAAC @ 2.03 A Disulfide # 7 53 AAAC <-> 59 AAAC @ 2.03 A Nr of disulfides : ( 7)
X-PLOR generate input file written CPU total/user/sys : 1.5 1.4 0.1 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The PDB files may look as follows:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- REMARK Created by MOLEMAN2 V. 960227/0.8 at Tue Feb 27 15:29:35 1996 for user gerard REMARK File name : m1_aaaa.pdb ATOM 1 CB ALA 86 3.109 42.928 53.312 1.00 28.54 AAAA ATOM 2 C ALA 86 4.129 45.032 52.428 1.00 27.90 AAAA ... ATOM 3319 OT1 LEU 447 38.267 47.693 63.602 1.00 18.65 AAAA ATOM 3320 OT2 LEU 447 39.049 46.653 65.371 1.00 20.12 AAAA REMARK Nr of atoms in file : 2740 END ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The GENERATE input file may look as follows:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- remarks File m1_generate.inp - generate pdb/psf file remarks Created by MOLEMAN2 V. 960227/0.8 at Tue Feb 27 15:29:35 1996 for user gerardtopology @tophcsdx.pro @toph19.sol { add more topology files here } end
parameter @parhcsdx.pro @param19.sol { add more parameter files here }
nbonds atom cdie shift eps=8.0 e14fac=0.4 cutnb=7.5 ctonnb=6.0 ctofnb=6.5 nbxmod=5 vswitch wmin=0.5 end { dielectric constant set to 8.0 (EPS) } { close contacts printed only if dist < 0.5 A (WMIN) } end
{ protein } segment name="AAAA" chain @toph19.pep coordinates @m1_aaaa.pdb end end vector do (name="CD1") ( name CD and resname ile ) coordinates @m1_aaaa.pdb
{ CARB } segment name="BBBB" chain coordinates @m1_bbbb.pdb end end coordinates @m1_bbbb.pdb
{ HETE } segment name="CCCC" chain coordinates @m1_cccc.pdb end end coordinates @m1_cccc.pdb
{ protein } segment name="KKKK" chain @toph19.pep coordinates @m1_kkkk.pdb end end vector do (name="CD1") ( name CD and resname ile ) coordinates @m1_kkkk.pdb
{ CARB } segment name="LLLL" chain coordinates @m1_llll.pdb end end coordinates @m1_llll.pdb
{ HETE } segment name="MMMM" chain coordinates @m1_mmmm.pdb end end coordinates @m1_mmmm.pdb
{ META } segment name="XXXX" chain coordinates @m1_xxxx.pdb end end coordinates @m1_xxxx.pdb
{ WATE } segment name="WWWW" chain coordinates @m1_wwww.pdb end end coordinates @m1_wwww.pdb
{ the disulfides } patch DISU refer=1=(segid="AAAA" and resid 176) refer=2=(segid="AAAA" and resid 235) end
patch DISU refer=1=(segid="AAAA" and resid 368) refer=2=(segid="AAAA" and resid 415) end
patch DISU refer=1=(segid="KKKK" and resid 176) refer=2=(segid="KKKK" and resid 235) end
patch DISU refer=1=(segid="KKKK" and resid 368) refer=2=(segid="KKKK" and resid 415) end
flags exclude vdw elec end
hbuild selection=(hydrogen and not known) phistep=45 end
{ optimise hydrogens to get rid of clashes } constraints fix=( not hydrogen ) end flags include vdw elec end minimize powell nstep=50 end constraints fix=( not all ) end
write coordinates output=m1_gen.pdb end
write structure output=m1.psf end
stop ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This command (of which the first SIX letters should be typed - to prevent accidental deletion) removes all atoms from memory.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > dele ERROR --- Invalid command ==> (dele) MOLEMAN2 > delete ALL ATOMS AND RESIDUES DELETED !!! ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
You have the following options with this command:
- Atom_all; this will change all HETATMs to ATOMs (useful if you
want to use a model from the PDB for molecular replacement or
refinement with a program that chokes on or ignores HETATM cards);
- Deduce; the program will set all protein and nucleic acid atoms
to ATOM, and all others to HETATM (useful when you deposit your
model coordinates with the PDB).
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pd he a All atoms set to type ATOM MOLEMAN2 > pd he d Deducing ATOM/HETATM types ... Nr set to ATOM : ( 4952) Nr set to HETATM : ( 0) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
O likes to write the chemical element number of each atom/hetatm, but the PDB does not like them. Use this command to remove them.
Supply the six cell constants, the number of molecules in the unit cell (using the PDB definition), and the spacegroup symbol (using the PDB definition, i.e. with spaces). The cell constants *must* be known for CCP4-type PDB files !
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pd cr 84 84 111.8 90 90 90 8 "P 21 21 2" Unit-cell axes (A) : ( 84.000 84.000 111.800) Unit-cell angles (deg) : ( 90.000 90.000 90.000) Unit-cell volume (A3) : ( 7.889E+05) Nr of molecules in cell : ( 8) Spacegroup symbol : (P 21 21 2) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pdb crys 41.2 41.2 196.5 90 90 90 Nr of molecules in cell ? ( 8) Spacegroup symbol ? (P 21 21 2) P 41 21 2 Unit-cell axes (A) : ( 41.200 41.200 196.500) Unit-cell angles (deg) : ( 90.000 90.000 90.000) Unit-cell volume (A3) : ( 3.335E+05) Nr of molecules in cell : ( 8) Spacegroup symbol : (P 41 21 2) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: PDb SSbond what
what = List | Delete | Generate
Use the COnstants SEt command to change the cut-off S-S link distance if necessary.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pdb ssb lis List SSBOND records SSBOND 1 CYS A 4 CYS A 72 1CEL 340 SSBOND 2 CYS A 19 CYS A 25 1CEL 341 SSBOND 3 CYS A 50 CYS A 71 1CEL 342 SSBOND 4 CYS A 61 CYS A 67 1CEL 343 ... SSBOND 20 CYS B 261 CYS B 331 1CEL 359 Nr of SSBOND records listed : ( 20) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pdb ssb del Delete SSBOND records Nr of SSBOND records deleted : ( 20) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pdb ssb gen Generate SSBOND records Looking for CYS- SG atoms ... ATOM 25 SG CYS A 4 36.618 67.989 34.228 1.00 21.90 1CEL 398 ... ATOM 5990 SG CYS B 331 -7.298 9.647 20.243 1.00 10.03 1CEL6363 ATOM 6483 SG CYS B 397 -25.330 24.877 13.503 1.00 10.62 1CEL6856 Max SG-SG distance for link : ( 2.200) SSBOND 1 CYS A 4 CYS A 72 S-S = 2.03 A SSBOND 2 CYS A 19 CYS A 25 S-S = 2.02 A ... SSBOND 19 CYS B 238 CYS B 243 S-S = 2.03 A SSBOND 20 CYS B 261 CYS B 331 S-S = 2.03 A Nr of SSBOND records generated : ( 20) MOLEMAN2 > pdb ssb l List SSBOND records SSBOND 1 CYS A 4 CYS A 72 S-S = 2.03 A SSBOND 2 CYS A 19 CYS A 25 S-S = 2.02 A SSBOND 3 CYS A 50 CYS A 71 S-S = 2.02 A ... SSBOND 19 CYS B 238 CYS B 243 S-S = 2.03 A SSBOND 20 CYS B 261 CYS B 331 S-S = 2.03 A Nr of SSBOND records listed : ( 20) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: PDb REmark text
text = text of the remark
Add information about the molecule(s) as a REMARK record (e.g., model name, current R-factor and Rfree, etc.).
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 > pdb re "Model M3 @ 960303 R=0.231 Rfree=0.273"
Add REMARK record : (Model M3 @ 960303 R=0.231 Rfree=0.273)
4: REMARK Model M3 @ 960303 R=0.231 Rfree=0.273
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: PDb LIst_remark
List the current set of REMARK records (if any).
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 > pdb li
List REMARK records
Nr of REMARK records : ( 4)
1: REMARK FILENAME="/nfs/gerard/gerard/proteins/cbh2/ibgx/xplor/m1_final_2.pdb"
2: REMARK Uses *NO* sigma or amplitude cut-offs
3: REMARK DATE:15-Jan-96 23:31:00 created by user: gerard
4: REMARK Model M3 @ 960303 R=0.231 Rfree=0.273
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: PDb DElete_remark which
which = number of the REMARK record (* means ALL)
Delete one or all REMARK records (if any).
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pdb del 3 Delete REMARK record nr: ( 3) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: PDb NAme which old new
which = Atom | Residue
old = old name to replace
new = new name
Change the atom or residue name of selected atoms. A residue is considered to be selected if at least one of its atoms is selected.
This command can be used to rename all waters (e.g., from residue HOH with atom O1 to residue WAT and atom OW). You can also write macros, for instance to convert residue and atom names for your favourite ligand or cofactor from/to X-PLOR or CCP4 or TNT or PDB or ... conventions.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > sel and type water ... MOLEMAN2 > pdb name res hoh wat Replace residue name |HOH| by |WAT| Nr of residues changed : ( 100) MOLEMAN2 > pdb name atom " o1 " " ow " Replace atom name | O1 | by | OW | Nr of atom names changed : ( 100) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The following macro would convert X-PLOR waters into PDB waters:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- select all select and type water pdb name atom " o1 " " o " ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
In the new PDB format definition, column 77-78 of each ATOM and HETATM record contains the symbol of the chemical element type of the atom and column 79-80 the charge (don't ask me why the charge has to fit inside two columns ...). This command tries to deduce this information as best as it can, using information in the atom name (4 characters) field.
Chemical element: the following four methods are tried in turn:
(1) is it one of the special hydrogen names ?
(2) are the first two characters of the atom name a valid element symbol ?
(3) is a space plus the second character a valid element symbol ?
(4) is a space plus the first character a valid element symbol ?
(5) if none of the above, generate an error message, and use "??" as
the element symbol
Charge: the following are valid charge definitions for columns 3 and 4
of the atom name, which will be recognised by the program (default
charge otherwise is " 0"):
(1) "++" or "--"
(2) "+ " or " +" or "- " or " -"
(3) "n+" or "+n" or "n-" or "-n", where n is 0,1,2,...,9
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pdb che Deriving chemical name and charge ... Nr of atoms processed : ( 10) Unknown chemical element : ( 0) Nr of positive atoms : ( 7) Nr of negative atoms : ( 3) Nr uncharged or unknown : ( 0) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Example: the following bogus PDB file:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- HETATM 7039 CA++ CA K 440 40.443 57.977 83.935 1.00 16.86 HETATM 7039 CA2+ CA K 441 40.443 57.977 83.935 1.00 16.86 HETATM 7039 CA+2 CA K 442 40.443 57.977 83.935 1.00 16.86 HETATM 7039 S-- CA K 443 40.443 57.977 83.935 1.00 16.86 HETATM 7039 S-2 CA K 444 40.443 57.977 83.935 1.00 16.86 HETATM 7039 S2- CA K 445 40.443 57.977 83.935 1.00 16.86 HETATM 7039 AH1+ CA K 446 40.443 57.977 83.935 1.00 16.86 HETATM 7039 H+ CA K 447 40.443 57.977 83.935 1.00 16.86 HETATM 7039 'H1+ CA K 448 40.443 57.977 83.935 1.00 16.86 HETATM 7039 HX + CA K 449 40.443 57.977 83.935 1.00 16.86 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
gives the following result:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- HETATM 7039 CA++ CA A 440 40.443 57.977 83.935 1.00 16.86 AAAACA+2 HETATM 7039 CA2+ CA A 441 40.443 57.977 83.935 1.00 16.86 AAAACA+2 HETATM 7039 CA+2 CA A 442 40.443 57.977 83.935 1.00 16.86 AAAACA+2 HETATM 7039 S-- CA A 443 40.443 57.977 83.935 1.00 16.86 AAAA S-2 HETATM 7039 S-2 CA A 444 40.443 57.977 83.935 1.00 16.86 AAAA S-2 HETATM 7039 S2- CA A 445 40.443 57.977 83.935 1.00 16.86 AAAA S-2 HETATM 7039 AH1+ CA A 446 40.443 57.977 83.935 1.00 16.86 AAAA H+1 HETATM 7039 H+ CA A 447 40.443 57.977 83.935 1.00 16.86 AAAA H+1 HETATM 7039 'H1+ CA A 448 40.443 57.977 83.935 1.00 16.86 AAAA H+1 HETATM 7039 HX + CA A 449 40.443 57.977 83.935 1.00 16.86 AAAA H+1 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: PDb NUmber first_new
Renumber the selected residues, starting from the given number.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se and typ water AND atom selection With atoms for which : (TYP) Equals : (WATER) Selection history : (ALL | AND TYpe = WATE |) Nr of selected atoms : ( 100) MOLEMAN2 > pdb numb 501 Renumber selected residues starting at : ( 501) Nr of last changed residue : ( 600) Nr of residues changed : ( 100) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
These commands enable you to define a subset of atoms for which one or more operations have to be carried out. You can use this, for example, to get statistics about the temperature factors of all protein main-chain atoms in segment ABC3, or to set the occupancy of all calcium atoms to 0.5.
Some selection commands reset the selections (ALl, NOne, HYdrogen, and EXhydrogen). Others make it narrower (ANd), wider (OR) or invert the current selection (NEgate).
Examples:
- to select the non-hydrogen atoms of all segments except QQQQ, use:
SElect HYdrogen, SElect OR SEgid QQQQ, SElect NEgate
- to select all non-hydrogen main-chain protein atoms of chain A, use:
SElect EXhydro, SElect ANd TYpe PROT, SElect ANd CLass Main,
SElect ANd CHain A
- to select all non-hydrogen protein atoms, use: SElect EXhydro,
SElect ANd TYpe PROT
- to select all non-hydrogen protein and nucleic acid atoms, use:
SElect EXhydro, SElect ANd TYpe PROT, SElect OR TYpe NUCL
The program keeps a record of the SElection commands executed since the last time a resetting command was carried out. For instance, after selecting all non-hydrogen main-chain protein atoms, you will see:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se ? Selection history : (NON-HYDROGEN | AND TYpe = PROT | AND CLass = MAI |) Nr of selected atoms : ( 548) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The number of selected atoms is shown with a history of the recently executed SElect commands.
This selects all atoms, including any hydrogen atoms.
This de-selects all atoms.
All atoms which had previously been selected will be de-selected, and the other way around.
No non-hydrogen atoms will be selected.
No hydrogen atoms will be selected.
If at least one atom of a residue has been selected, all atoms of the residue will be selected.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se and res rea ... Nr of selected atoms : ( 22) MOLEMAN2 > se dist 0 3.5 ... Nr of selected atoms : ( 29) MOLEMAN2 > se by Select by residue Selection history : (ALL | AND REsidu = REA | DIstance 0.00 3.50 | BY_residue |) Nr of selected atoms : ( 55) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This can be used to select, e.g. a ligand, and subsequently all atoms within 3.5 or 4 Å from this compound.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se and res rea ... Nr of selected atoms : ( 22) MOLEMAN2 > se dist 0 3.5 ... Nr of selected atoms : ( 29) MOLEMAN2 > se by Select by residue Selection history : (ALL | AND REsidu = REA | DIstance 0.00 3.50 | BY_residue |) Nr of selected atoms : ( 55) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: SElect ANd what which
what = CHain | SEgid | TYpe | CLass |
REsidue | ATom
which = which chain, segid, etc. to use
type can prot, nucl, wate, etc.
class can be main or side chain
residue can be Ala, HOH, BGL etc.
atom can be " CA ", " O1 " etc.
This can be used to select by chain name (one character), segment name (4 characters), type (any of PROT, NUCL, WATE, META, INOR, CARB, ORGA or HETE; 4 characters), or class (main or side chain, one character). Alternatively, you can select by residue type (Ala, Asn, etc.) or by atom name (enclose in "double quotes"). Note that atom names are 4 characters (e.g., use " CA " to select C-alpha atoms).
The new selection will be those atoms that were already selected AND satisfy the new criterion. This usually reduces the selection.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se ex Select NON-HYDROGEN atoms Selection history : (NON-HYDROGEN |) Nr of selected atoms : ( 1213) MOLEMAN2 > se an ty prot AND atom selection With atoms for which : (TY) Equals : (PROT) Selection history : (NON-HYDROGEN | AND TYpe = PROT |) Nr of selected atoms : ( 1091) MOLEMAN2 > se and cl main AND atom selection With atoms for which : (CL) Equals : (MAIN) Selection history : (NON-HYDROGEN | AND TYpe = PROT | AND CLass = MAIN |) Nr of selected atoms : ( 548) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se ex Select NON-HYDROGEN atoms Selection history : (NON-HYDROGEN |) Nr of selected atoms : ( 5794) MOLEMAN2 > se an resi trp AND atom selection With atoms for which : (RESI) Equals : (TRP) Selection history : (NON-HYDROGEN | AND REsidu = TRP |) Nr of selected atoms : ( 280) MOLEMAN2 > se and at " CA " AND atom selection With atoms for which : (AT) Equals : ( CA) Selection history : (NON-HYDROGEN | AND REsidu = TRP | AND ATom = CA |) Nr of selected atoms : ( 20) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: SElect BUtnot what which
what = CHain | SEgid | TYpe | CLass |
REsidue | ATom
which = which chain, segid, etc. to use
type can prot, nucl, wate, etc.
class can be main or side chain
residue can be Ala, HOH, BGL etc.
atom can be " CA ", " O1 " etc.
Analogous to SElect ANd and SElect OR.
The new selection will be those atoms that were already selected AND do NOT satisfy the new criterion. This usually narrows the selection.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > sel but type prot BUtnot atom selection With atoms for which : (TYPE) Equals : (PROT) Selection history : (ALL | BUTNOT TYpe = PROT |) Nr of selected atoms : ( 906) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: SElect OR what which
what = CHain | SEgid | TYpe | CLass |
REsidue | ATom
which = which chain, segid, etc. to use
type can prot, nucl, wate, etc.
class can be main or side chain
residue can be Ala, HOH, BGL etc.
atom can be " CA ", " O1 " etc.
This can be used to select by chain name (one character), segment name (4 characters), type (any of PROT, NUCL, WATE, META, INOR, CARB, ORGA or HETE; 4 characters), or class (main or side chain, one character). Note that atom names are 4 characters (e.g., use " CA " to select C-alpha atoms).
The new selection will be those atoms that were already selected OR satisfy the new criterion. This usually increases the selection.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se ex Select NON-HYDROGEN atoms Selection history : (NON-HYDROGEN |) Nr of selected atoms : ( 1213) MOLEMAN2 > se and type prot AND atom selection With atoms for which : (TYPE) Equals : (PROT) Selection history : (NON-HYDROGEN | AND TYpe = PROT |) Nr of selected atoms : ( 1091) MOLEMAN2 > se or type water OR atom selection With atoms for which : (TYPE) Equals : (WATER) Selection history : (NON-HYDROGEN | AND TYpe = PROT | OR TYpe = WATER |) Nr of selected atoms : ( 1191) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: SElect NUmeric and_or what lo hi
and_or = And | Or | Butnot
what = Residue_nr | B-factor | Occupancy |
X-coord | Y-coord | Z-coord |
Mass | Element | Covalent_bond_radius
lo = minimum value to select
hi = maximum value to select
Make a selection wider or narrow using numerical properties. This can be used to select a zone of residues (e.g., SEl NUm And Res 23 38), all atoms with high B-factors (SEl NUm And Bfac 50 9999), non-zero occupancy (SEl NUm And Occ 0.0 0.9999), etc.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > sel num and res 50 50 Select Numeric : ( AND Residue_nr 50 50) Selection history : (ALL | AND Residue_nr 50 50 |) Nr of selected atoms : ( 11) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > sel all Select ALL atoms Selection history : (ALL |) Nr of selected atoms : ( 7038) MOLEMAN2 > sel num and mass 40 999 Select Numeric : ( AND Mass 40.00000 999.0000) Selection history : (ALL | AND Mass 40.00000 999.0000 |) Nr of selected atoms : ( 3) MOLEMAN2 > lis List first selected atom of every residue ATOM 3244 I IBZ A 436 19.199 55.708 63.367 1.00 38.03 1CEL3617 ATOM 6763 I IBZ B 436 -19.688 12.428 7.328 1.00 32.65 1CEL7136 ATOM 7039 CA CA 440 40.443 57.977 83.935 1.00 16.86 1CEL7412 Nr of residues listed : ( 3) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > sel all ... MOLEMAN2 > se num and bfac 50 9999 Select Numeric : ( AND B-factor 50.00000 9999.000) Selection history : (ALL | AND B-factor 50.00000 9999.000 |) Nr of selected atoms : ( 8) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: BFactor STats [how]
how = Chain | Type
Lists statistics for the B-factors of all selected atoms, either split up by chain or by type. For example, to get B-factor statistics for all non-hydrogen atoms by chain, use:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 > sel ex
Select NON-HYDROGEN atoms
Selection history : (NON-HYDROGEN |)
Nr of selected atoms : ( 7038)
MOLEMAN2 > bf st c
Chain name Atoms Bave Bsdv Bmin Bmax
A<->1CEL 3518 18.70 8.58 5.31 66.29
B<->1CEL 3518 17.23 8.68 5.91 67.75
<->1CEL 2 24.27 7.41 16.86 31.68
Nr of chains encountered : ( 3)
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Ditto, but by type:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 > bf st ty
Type Atoms Bave Bsdv Bmin Bmax
PROT 6440 17.14 8.06 5.31 67.75
WATE 529 27.65 9.99 7.05 55.30
CARB 50 20.91 5.42 12.52 34.04
HETE 19 21.67 6.61 8.66 38.03
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
To get information for only the main-chain protein atoms of chain A, use:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 > sel ex
MOLEMAN2 > se and ty prot
MOLEMAN2 > se and ch a
MOLEMAN2 > sel an cl mai
AND atom selection
With atoms for which : (CL)
Equals : (MAI)
Selection history : (NON-HYDROGEN | AND TYpe = PROT | AND CHain = A | AND
CLass = MAI |)
Nr of selected atoms : ( 1736)
MOLEMAN2 > bf st ty
Type Atoms Bave Bsdv Bmin Bmax
PROT 1736 17.02 6.61 6.97 53.21
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Don't forget to *explicitly* exclude any hydrogen atoms you may have !
Syntax: OCcupancy STats [how]
how = Chain | Type
Lists statistics for the occupancies of all selected atoms, either split up by chain or by type. See the BFactor STats command for examples.
Don't forget to *explicitly* exclude any hydrogen atoms !
This command will remove all ANISOUs (if any are present; otherwise nothing will happen).
Syntax: BFactor LImit lo hi
Reset all B-factors less than "lo" to a value of "lo", and all those greater
than "hi" to a value of "hi". This can be used to reset very low and/or
very high B-factors, or to reset all temperature factors.
To reset very low B-factors: use a very high number for "hi" (e.g., 1000).
To reset very high B-factors: use a negative number for "lo" (e.g., -1).
To reset all B-factors, make "lo" equal to "hi".
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 > se ex
Select NON-HYDROGEN atoms
Selection history : (NON-HYDROGEN |)
Nr of selected atoms : ( 7038)
MOLEMAN2 > bf st c
Chain name Atoms Bave Bsdv Bmin Bmax
A<->1CEL 3518 18.70 8.58 5.31 66.29
B<->1CEL 3518 17.23 8.68 5.91 67.75
<->1CEL 2 24.27 7.41 16.86 31.68
Nr of chains encountered : ( 3)
MOLEMAN2 > bf li 10 50
Reset B-factors to lie in range 10.00 to 50.00
MOLEMAN2 > bf st c
Chain name Atoms Bave Bsdv Bmin Bmax
A<->1CEL 3518 18.72 8.39 10.00 50.00
B<->1CEL 3518 17.32 8.44 10.00 50.00
<->1CEL 2 24.27 7.41 16.86 31.68
Nr of chains encountered : ( 3)
MOLEMAN2 > bf li 18 18
Reset B-factors to lie in range 18.00 to 18.00
MOLEMAN2 > bf st c
Chain name Atoms Bave Bsdv Bmin Bmax
A<->1CEL 3518 18.00 0.00 18.00 18.00
B<->1CEL 3518 18.00 0.00 18.00 18.00
<->1CEL 2 18.00 0.00 18.00 18.00
Nr of chains encountered : ( 3)
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: OCcupancy LImit lo hi
Reset all occupancies less than "lo" to a value of "lo", and all those greater
than "hi" to a value of "hi". This can be used to reset very low and/or
very high occupancies, or to reset all occupancies.
See the BFactor LImit command for further discussion and examples.
Syntax: BFactor PRod_plus [prod] [plus]
prod = multiply selected Bs by this number (default 1.0)
plus = add this number to the selected Bs (default 0.0)
This can be used to change the scale of temperature factors (e.g., prior to molecular replacement with a room-temperature search model using new cryo data).
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 > stat
...
Item Average St.Dev Min Max
---- ------- ------ --- ---
...
B-factor 16.622 9.305 3.920 54.210
...
MOLEMAN2 > bf pr 0.6 2
New B = 0.6000 * Old B + 2.0000
Nr of atoms updated : ( 1213)
MOLEMAN2 > st
...
B-factor 11.973 5.583 4.352 34.526
...
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: OCcupancy PRod_plus [prod] [plus]
prod = multiply selected occupancies by this number (default 1.0)
plus = add this number to the selected occupancies (default 0.0)
This can be used to change the occupancies of the selected atoms.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > oc pr 0.5 New Q = 0.5000 * Old Q + 0.0000 Nr of atoms updated : ( 1213) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: BFactor PLot file [what]
file = O2D plot file name
what = Average | Radial
Option Average creates a plot of average B-factor per residue, using only currently selected atoms. Option Radial produces a plot of average B-factor in 2.0 A shells from the centre-of-gravity of the currently selected atoms. The plot files are for use with O2D.
For example, to plot the average B-factors for all protein residues in chain A, use: SE EX, SE ANd CHain A, SE ANd TYpe PROT (only needed if chain A contains things other than protein residues), BF PLot bave.plt a.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > se ex MOLEMAN2 > se an ch a MOLEMAN2 > bf pl bave.plt a Residues with selected atoms : ( 701) MOLEMAN2 > bf pl brad.plt r Selected atoms : ( 3518) Centre-of-gravity : ( 30.052 63.904 61.356) Shell 0.0 - 2.0 A - 3 atoms; <B> = 13.80 A**2 Shell 2.0 - 4.0 A - 9 atoms; <B> = 17.73 A**2 Shell 4.0 - 6.0 A - 35 atoms; <B> = 12.22 A**2 Shell 6.0 - 8.0 A - 69 atoms; <B> = 12.69 A**2 ... Shell 32.0 - 34.0 A - 33 atoms; <B> = 25.13 A**2 Shell 34.0 - 36.0 A - 13 atoms; <B> = 35.35 A**2 Shell 36.0 - 38.0 A - 1 atoms; <B> = 43.24 A**2 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This commands calculates (for the currently selected set of atoms only) the RMS delta-B for all bonded atoms, bonded main-chain and side-chain atoms separately, and for atoms involved in non-bonded interactions (the distance cut-offs can be changed with the COnstants SEt command). Note that hydrogens are included if they are currently selected !
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
MOLEMAN2 > BF BO
Calculating RMS delta-B for (non-)bonded atoms
Max bonded distance : ( 2.000)
Max non-bonded distance : ( 3.600)
Nr of selected non-H atoms : ( 7038)
Nr of bonds (all atoms) : ( 6662)
RMS delta-B (A**2) : ( 2.099)
Nr of bonds (main chain) : ( 4278)
RMS delta-B (A**2) : ( 1.562)
Nr of bonds (side chain) : ( 3192)
RMS delta-B (A**2) : ( 2.613)
Nr of non-bonded contacts : ( 23812)
RMS delta-B (A**2) : ( 5.194)
CPU total/user/sys : 17.7 17.7 0.1
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
This command will operate on all selected atoms, and replace the B-factor of each atom by the average of its own B-factor and those of all its bonded neighbours (if any). This can be useful when you make the transition from refining grouped to individual temperature factors. Use the BFactor BOnded command before and after this operation to see the effect. Note that hydrogens are included if they are currently selected !
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > BF SM Smoothing Bs for bonded atoms Max bonded distance : ( 2.000) Nr of selected non-H atoms : ( 7038) CPU total/user/sys : 34.6 34.4 0.2 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: BFactor GRoup how
how = Mc_sc | Residue | Overall
This command enables you to average B-factors for certain groups of atoms, e.g. prior to refinement of a high-resolution model against low-resolution data (e.g., a complex or mutant). Note that hydrogens are included if they are currently selected !
Grouping is carried out for all currently selected atoms in one of
three ways:
- Overall: the B-factor of each selected atom is set to the average
B-factor of all selected atoms
- Residue: the B-factors are averaged for all selected atoms for
every residue
- Mc_sc: ditto, but grouped for main-chain and side-chain atoms
separately
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > bf gr mc Group Bs for main/side chain atoms Nr of main-chain groups : ( 137) Nr of side-chain groups : ( 230) MOLEMAN2 > bf gr re Group Bs per residue Nr of residues grouped : ( 238) MOLEMAN2 > bf gr ov Group Bs for all selected atoms Nr of selected atoms : ( 1213) Average B-factor : ( 16.622) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: PRotein MC_analysis [file] [what]
file = name of output file or blank
what = Rama_plot | Labelled_rama | Text_file
If you don't provide a filename, no output file is generated. If you do, the second parameter what type of file you will get. You can get a simple list of Phi, Psi, Omega and Planarity (pseudo-)torsion angles for every residue by selecting type T(ext_file). If you want a Ramachandran plot (PostScript format only; use the old MOLEMAN if you want other formats, or Balasubramanian plots, or polar Ramachandran plots), you can select R or L. With option L, the outliers will be marked with a text label in the plot; with option R they won't be. Note that the plot file is already in PostScript format, so you don't have to convert it with O2D.
The screen output is simply a list of "features" of your protein
main chain, including:
- start and end residues of chains (looking at CA-CA distances, *not*
chain or segment names; the maximum CA-CA distance allowed is 4.5 A);
- inability to calculate certain torsion angles (e.g., due to missing
atoms in a residue or one of its neighbours);
- type of peptide (cis, trans, or "unusual");
- planarity of the peptide bond;
- positive PHI angles (not for glycines; unusual outside the poly-Pro area);
- outliers in the Ramachandran plot (not for glycines).
Note that all protein residues will be included in the analysis, i.e. the current SElection is ignored. If you want to do the analysis for separate chains, use the SPlit command to write them to separate PDB files, and then analyse each in turn.
The Ramachandran outliers are identified using data obtained in an analysis of 403 PDB structurs (October 1995) which are mutually <= 95 % homologous (Hobohm & Sander list), and were refined at a resolution of 2.0 A or better (GJK & TAJ, to be published). For the 74,893 non-Gly residues included (with temperature factors less than two standard deviations above the average for the protein), a mega-Ramachandran plot was generated using 10 degree-square bins. The set of most-populated bins containing 98% of all residues was then defined as the core of the ramachandran plot. Any residue outside this core region is called an outlier. For your average 2.0 A or better structure, one would expect ~0-5% outliers. Outliers will be shown with an asterisks (and, optionally, a text label) in the Ramachandran plot.
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > prot mc 1cel_rama.ps lab => XPS_GRAF - GJK (2.3 @ 960209) Opened PostScript file : (1cel_rama.ps) Date : (Sun Feb 18 02:15:33 1996) User : (gerard) Program : (MOLEMAN2) Nr of residues to check : ( 868)RESIDUE PCA A 1 1CEL <<<<< Start of new chain
RESIDUE SER A 99 1CEL Warning - unusual PHI,PSI combination = -118.7 -112.1 ... RESIDUE MET A 374 1CEL Warning - positive PHI = 65.2
RESIDUE TYR A 381 1CEL Cis-peptide; omega = 0.4; next residue is PRO Warning - non-planar peptide; planarity = 5.5
RESIDUE ASN A 420 1CEL Warning - positive PHI = 47.3
RESIDUE GLY A 434 1CEL >>>>> End of chain
RESIDUE PCA B 1 1CEL <<<<< Start of new chain
RESIDUE SER B 99 1CEL Warning - unusual PHI,PSI combination = -134.0 -130.7 ... RESIDUE TYR B 381 1CEL Cis-peptide; omega = -1.1; next residue is PRO
RESIDUE ASN B 420 1CEL Warning - positive PHI = 49.6
RESIDUE GLY B 434 1CEL >>>>> End of chain
Total nr of residues checked : ( 868) Cis-peptide bonds : ( 2) Unusual OMEGA values : ( 0) Non-planar peptide bonds : ( 1) Start/end residues : ( 4) Problems (missing atoms ?) : ( 0) Glycine residues : ( 100) Remaining residues in Ramach : ( 764) In core regions : ( 759) Outliers : ( 5) Outlier percentage : ( 0.654) An average <= 2.0 A model has ~0-5% outliers ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pr mc cbh2.list t Nr of residues to check : ( 725)RESIDUE ALA 86 AAAA <<<<< Start of new chain
RESIDUE ASN 158 AAAA Warning - unusual PHI,PSI combination = -102.6 52.9 ... RESIDUE GLN 422 KKKK Cis-peptide; omega = 0.1; next residue is PRO
RESIDUE ASN 443 KKKK Cis-peptide; omega = 1.1; next residue is PRO
RESIDUE LEU 447 KKKK >>>>> End of chain
Total nr of residues checked : ( 725) Cis-peptide bonds : ( 6) Unusual OMEGA values : ( 0) Non-planar peptide bonds : ( 0) Start/end residues : ( 4) Problems (missing atoms ?) : ( 0) Glycine residues : ( 60) Remaining residues in Ramach : ( 661) In core regions : ( 637) Outliers : ( 24) Outlier percentage : ( 3.631) An average <= 2.0 A model has ~0-5% outliers ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The text file produced in the example above may look as follows:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- Created by MOLEMAN2 V. 960217/0.4 at Sun Feb 18 02:16:43 1996 for user gerardListing of Phi, Psi, Omega and Planarity (pseudo) torsion angles for PDB file hydro.pdb
PHI = C(i-1) - N(i) - CA(i) - C(i) - usually negative PSI = N(i) - CA(i) - C(i) - N(i+1) OMEGA = CA(i) - C(i) - N(i+1) - CA (i+1) - cis=0, trans=180 PLANARITY = C(i) - CA(i) - N(i+1) - O(i) - planar < 5
An entry of "-999.9" means that the torsion angle could not be calculated (for terminal residues and residues with missing atoms).
Residue Phi Psi Omega Planarity ------- --- --- ----- ---------
ALA- 86 -AAAA -999.9 -60.2 -179.0 -0.8 THR- 87 -AAAA -73.7 142.9 -178.7 0.2 TYR- 88 -AAAA -139.9 162.0 178.3 -0.7 SER- 89 -AAAA -124.5 117.2 179.7 1.7 GLY- 90 -AAAA 86.1 -131.6 -179.6 0.4 ASN- 91 -AAAA -65.5 116.2 -179.7 -1.9 ... SER- 445 -KKKK -70.6 146.1 178.6 0.7 PHE- 446 -KKKK -98.7 -32.7 179.2 0.4 LEU- 447 -KKKK -77.7 -999.9 -999.9 -999.9 ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
Syntax: PRotein SC_analysis [file]
file = name of output file or blank
If you don't provide a filename, no output file is generated. If you do, you'll get a plain text file with a listing for all protein residues except Gly of the CA-chirality improper and the side chain torsion angles Chi1 to Chi5 (the latter only occurs for Arg and should be zero).
On the screen you'll get a list of "features" of your sidechains.
Things that are checked (using a tolerance of 5 degrees) include:
- CA-chirality (D/L amino acids; improper CA - N - C - CB; +34 for
L-amino acids, -34 for D-amino acids; bad if not near an ideal
value; note that the ideal value for Pro is more like 38 degrees)
- naming of equivalent sidechain 1/2 atoms (Asp, Glu, Phe, Tyr, Arg;
tested by checking if the corresponding Chi torsion i in the range
[-90,+90]; if not, a message is printed; atom names are not swapped)
- chirality of Ile CB (CB - CG1 - CG2 - CA; should be +34)
- "tetrahedral-ness" of some other carbons (Leu CG, Val CB and Thr CB)
- flatness of several atom types (not of rings; Arg CZ, Asn/Asp CG,
Gln/Glu CD, His/Phe/Tyr/Trp CG)
Example of the output:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- MOLEMAN2 > pr scRESIDUE ASP A 173 1CEL Warning - poor CA chirality improper = 27.3
RESIDUE PRO A 194 1CEL Warning - poor CA chirality improper = 39.1
RESIDUE PRO A 229 1CEL Warning - poor CA chirality improper = 39.8
RESIDUE TYR A 274 1CEL Warning - sidechain 1/2 atom names incorrect; Chi = 91.1
...
RESIDUE ASP B 368 1CEL Warning - poor CA chirality improper = 27.8
Nr of residues checked : ( 766) Nr of D-amino acids : ( 0) Nr with poor CA chirality : ( 9) Nr wrong 1/2 atom names : ( 3) ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE -----
The output text file (optional) may look as follows:
----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- EXAMPLE ----- Created by MOLEMAN2 V. 960217/0.4 at Thu Feb 22 19:37:07 1996 for user gerardListing of Chi1-5 and Chirality (pseudo) torsion angles for PDB file 1cel.pdb
CHIRAL = CA - N - C - CB (+34=L-aa; -34=D-aa) CHI-1 = N - CA - CB - ?G(1) CHI-2 = CA - CB - ?G(1) - ?D(1) CHI-3 = CB - ?G(1) - ?D(1) - ?E(1) CHI-4 = ?G(1) - ?D(1) - ?E(1) - ?Z(1) CHI-5 = ?D(1) - ?E(1) - ?Z(1) - ?H(1)
An entry of "-999.9" means that the torsion angle could not be calculated (e.g., for residues with missing atoms).
Residue CA-chir Chi-1 Chi-2 Chi-3 Chi-4 Chi-5 ------- ------- ----- ----- ----- ----- ----- PCA-A 1 -1CEL 37.9 31.7 -30.2 -160.3 SER-A 2 -1CEL 34.3 67.1 ALA-A 3 -1CEL 35.0 CYS-A 4 -1CEL 33.1 -80.1 ... THR-B 429 -1CEL 34.5 69.0 ASN-B 431 -1CEL -1.0 -69.5 -37.0 PRO-B 432 -1CEL 36.6 -39.5