(Read more about HyperChem and Protein Databank Structures)

Protein Data Bank Files

Structures of proteins and nucleic acids are available in PDB files.1 These structures are derived from physical studies of molecules (for example, x-ray diffraction or nuclear magnetic resonance (NMR) analyses). HyperChem can interpret and display structures stored in all PDB files that contain atomic coordinates, and can also store new molecular structures in PDB-type files.

Obtaining PDB Files

The HyperChem distribution disks contain several sample PDB files. You can obtain the complete PDB collection, containing about 17000 structures, from the Research Collaboratory for Structural Bioinformatics (RCSB). Alternatively, you can download a specific PDB file. For information, contact:

www.rcsb.org/pdb/

Reading PDB Files

HyperChem can read all PDB files that contain atomic coordinates, up to the limit of the computer memory. HyperChem uses the PDB information on atomic coordinates, plus the information in its own TPL files (see ), to reconstruct standard amino acid and nucleic acid residues. For HETATM records and atoms that are not included in TPL files, HyperChem uses atomic coordinates and CONECT records from the PDB file to reconstruct a molecular system.

HyperChem expects that PDB files will follow the formatting standards described by the maintainers of the PDB archive. Some software creates "PDB" files that do not follow this format properly; for example, the data entries on a line of the file may be in the wrong columns, or shifted left or right within a section. If you use the script command " non-standard-pdb-names = yes ", HyperChem will attempt to make allowances for incorrect formatting. Use of this command may prevent correct reading of files which do conform to the standards.

Most PDB files contain all the information that is necessary for HyperChem to regenerate the structure. However, if a PDB file includes non-standard residues which are not in HyperChem's template files, but does not include information about bonding and/or hydrogen atoms, then HyperChem will not be able to process the structure properly. If the file does not have bonding information, you will probably end up with a number of disconnected atoms in the workspace which you will have to connect to form molecules. (See .) If the file does not include hydrogen atoms, you can add them with Build/Add Hydrogens .

  • HyperChem's internal storage of structures requires that residues within molecules be numbered sequentially starting from 1. Many PDB files include structures whose residue numbers start with values other than 1, and/or have gaps in the numbering. Therefore, the residue numbering in the original file may be different from the numbering in HyperChem when a PDB file is read in. If you save the structure in the PDB format, the residue numbering will correspond to HyperChem's internal storage numbering, and so may not be the same as the numbering in the original file.

Creating PDB-type Files

You can save any molecular system in the workspace as a PDB-type file. Choose Save As on the File menu and L-click on PDB in the Save File dialog box.

 

HyperChem provides the filename extension .ent , but you have to enter a new filename in the File Name text box. The PDB file can contain 13 types of records (each record has a title and takes up one line in the file):

HEADER
COMPND
SOURCE
AUTHOR
REVDAT
JRNL

These are all comment records from a PDB file that you can enter or edit in the Save File dialog box.

  • Comments must adhere to the format described in the PDB document, Atomic Coordinates and Bibliographic Entry Format Description , July, 1989 .
REMARK

Comments from a HIN file, entered in the Save File dialog box.

SSBOND

Residues involved in disulphide bridges.

ATOM

The identity and coordinates of each heavy atom in the molecular system. These atoms must be part of the standard set of residues that HyperChem recognizes (see and ).

HETATM

The identity and coordinates of all heavy atoms that are not part of standard residues. Standard residues belong to the set that HyperChem recognizes (see and ).

CONECT

The numbers of atoms directly bonded to each other. These numbers come from the ATOM or HETATM records. The Numbers label ( Choose Labels on the Display menu) also uses these numbers. CONECT records appear for HETATMs and for all atoms if you choose the Connectivity option (see below).

TER

A record marking the end of a chain.

END

The end of the file.

HyperChem has two options for storing PDB-type files. If you store files with both options off (no ), the file contains only REMARK, ATOM, HETATM, and END records for heavy atoms. These are the options:

Hydrogens

With this option on ( ), the file also contains ATOM records for all hydrogen atoms.

Connectivity

With this option on ( ), the file also contains CONECT records.

The FillPDBRecords setting in the Registry or in chem.ini can require HyperChem to fill each record with spaces, to a total of 70 characters (see ).

Sample PDB Files

HyperChem provides these sample PDB files2:

 

Filename

Molecule

PDB1BNA

DNA, B form, single stranded, 290K Daltons

PDB1CRN

Crambin

PDB3CYT

Cytochrome C, albacore, oxidized

PDB2EST

Elastase-TFAP complex, porcine

PDB1F19

IGG, FAB portion, mouse

PDB1FDX

Ferredoxin

PDB1GCN

Glucagon

PDB2PTC

Trypsin-trypsin inhibitor complex

PDB5PTI

Pancreatic trypsin inhibitor

PDB4RHV

Rhinovirus 14, human

PDB6TNA

Transfer RNA, yeast phenylalanine

PDB1XY1

Deamino-oxytocin, wet form


1. The Protein Data Bank is operated by the Research Collaboratory for Structural Bioinformatics (RCSB) and is supported by funds from the National Science Foundation, the Department of Energy, and two units of the National Institutes of Health: the National Institute of General Medical Sciences and the National Library of Medicine.

2. These files are for public use, courtesy of the Protein Data Bank. To obtain other files, see See Obtaining PDB Files.

 

 

Important: Read our Eligibility Requirements for Academic Pricing
CyberChem, Inc. © 2004, All Rights Reserved. 1115 NW 4th Street Suite 2, Gainesville, FL 32601 USA
Phone (352) 371-6854 Fax (352) 371-3662 email: sales@cyberchem.com
 
-.......
 
-.......