(-) Description

Title :  CRYSTAL STRUCTURE OF DEHALOGENASE HALOTAG2 WITH HALTS AT THE RESOLUTION 1.8A. NORTHEAST STRUCTURAL GENOMICS CONSORTIUM (NESG) TARGET OR150
 
Authors :  A. Kuzin, S. Lew, J. Seetharaman, M. Maglaqui, R. Xiao, E. Kohan, H. Wang, J. K. Everett, T. B. Acton, G. Kornhaber, G. T. Montelione, J. F. Hunt, Northeast Structural Genomics Consortium (NESG)
Date :  29 May 13  (Deposition) - 24 Jul 13  (Release) - 24 Jul 13  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.80
Chains :  Asym. Unit :  A,B
Biol. Unit 1:  A  (1x)
Biol. Unit 2:  B  (1x)
Keywords :  Structural Genomics, PSI-Biology, Protein Structure Initiative, Northeast Structural Genomics Consortium, NESG, Halts, Halogenase, De Novo Protein
 
Reference :  A. Kuzin, S. Lew, J. Seetharaman, M. Maglaqui, R. Xiao, E. Kohan, H. Wang, J. K. Everett, T. B. Acton, G. Kornhaber, G. T. Montelione, J. F. Hunt, L. Tong
Northeast Structural Genomics Consortium Target OR150
To Be Published

(-) Compounds

Molecule 1 - DEHALOGENASE HALOTAG2
    Chains :  A, B
    Engineered :  YES
    Expression System :  ESCHERICHIA COLI
    Expression System Plasmid :  PET15_NESG
    Expression System Strain :  BL21(DE3)+MAGIC
    Expression System Taxid :  469008
    Expression System Vector Type :  PLASMID
    Organism Scientific :  SYNTHETIC CONSTRUCT
    Organism Taxid :  32630

 Structural Features

(-) Chains, Units

                        1  2
Asymmetric Unit         A  B
Biological Unit 1 (1x)  A
Biological Unit 2 (1x)     B

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (3, 18)

Asymmetric Unit (3, 18)
No.  Name  Count  Type             Full Name
1    1Q9   2      Ligand/Ion       N-(2-ETHOXY-3,5-DIMETHYLBENZYL)-1H-TETRAZOL-5-AMINE
2    MSE   12     Mod. Amino Acid  SELENOMETHIONINE
3    NA    4      Ligand/Ion       SODIUM ION
Biological Unit 1 (2, 7)
No.  Name  Count  Type             Full Name
1    1Q9   1      Ligand/Ion       N-(2-ETHOXY-3,5-DIMETHYLBENZYL)-1H-TETRAZOL-5-AMINE
2    MSE   6      Mod. Amino Acid  SELENOMETHIONINE
3    NA    -1     Ligand/Ion       SODIUM ION
Biological Unit 2 (2, 7)
No.  Name  Count  Type             Full Name
1    1Q9   1      Ligand/Ion       N-(2-ETHOXY-3,5-DIMETHYLBENZYL)-1H-TETRAZOL-5-AMINE
2    MSE   6      Mod. Amino Acid  SELENOMETHIONINE
3    NA    -1     Ligand/Ion       SODIUM ION

(-) Sites  (6, 6)

Asymmetric Unit (6, 6)
No.  Name  Evidence  Description                           Residues
1    AC1   SOFTWARE  BINDING SITE FOR RESIDUE 1Q9 A 401    PRO A:53, ASP A:117, TRP A:118, TRP A:152, ALA A:156, PHE A:160, PHE A:179, ALA A:183, PRO A:217, LEU A:220, VAL A:256, PHE A:283
2    AC2   SOFTWARE  BINDING SITE FOR RESIDUE NA A 402     ASN A:61, HIS A:65, ASP A:291, HOH A:599
3    AC3   SOFTWARE  BINDING SITE FOR RESIDUE NA A 403     ASN A:52, PHE A:179, LEU A:213, PRO A:217
4    AC4   SOFTWARE  BINDING SITE FOR RESIDUE 1Q9 B 401    ASN B:52, PRO B:53, ASP B:117, TRP B:118, TRP B:152, PHE B:160, PHE B:179, ALA B:183, MSE B:186, PRO B:217, LEU B:220, VAL B:256, PHE B:283
5    AC5   SOFTWARE  BINDING SITE FOR RESIDUE NA B 402     ASN B:61, HIS B:65, ASP B:291, HOH B:623
6    AC6   SOFTWARE  BINDING SITE FOR RESIDUE NA B 403     ASN B:52, PHE B:179, LEU B:213, PRO B:217
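
The two 1Q9 binding sites (AC1 in chain A, AC4 in chain B) list nearly the same residues. The following is a minimal Python sketch for comparing them, assuming the residue lists are copied by hand from the table above. The names `AC1`, `AC4`, and `parse_site` are illustrative and not part of any database API.

```python
# Residue lists copied from the Sites table (AC1 = chain A, AC4 = chain B).
AC1 = ("PRO A:53, ASP A:117, TRP A:118, TRP A:152, ALA A:156, PHE A:160, "
       "PHE A:179, ALA A:183, PRO A:217, LEU A:220, VAL A:256, PHE A:283")
AC4 = ("ASN B:52, PRO B:53, ASP B:117, TRP B:118, TRP B:152, PHE B:160, "
       "PHE B:179, ALA B:183, MSE B:186, PRO B:217, LEU B:220, VAL B:256, "
       "PHE B:283")


def parse_site(text):
    """Return a set of (residue_name, residue_number) pairs, ignoring the chain ID."""
    residues = set()
    for item in text.split(","):
        name, location = item.split()          # e.g. "ASP", "A:117"
        _chain, number = location.split(":")   # chain ID is dropped on purpose
        residues.add((name, int(number)))
    return residues


site_a = parse_site(AC1)
site_b = parse_site(AC4)

shared = site_a & site_b
only_a = sorted(site_a - site_b, key=lambda r: r[1])
only_b = sorted(site_b - site_a, key=lambda r: r[1])

print(len(shared))  # 11
print(only_a)       # [('ALA', 156)]
print(only_b)       # [('ASN', 52), ('MSE', 186)]
```

Dropping the chain ID is what makes the two sets comparable. The site evidence is listed as SOFTWARE, so the differences may reflect distance cutoffs rather than a real difference in binding.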

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 4KYV)

(-) Cis Peptide Bonds  (6, 6)

Asymmetric Unit
No.  Residues
1    Asn A:52 - Pro A:53
2    Glu A:225 - Pro A:226
3    Thr A:253 - Pro A:254
4    Asn B:52 - Pro B:53
5    Glu B:225 - Pro B:226
6    Thr B:253 - Pro B:254

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 4KYV)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 4KYV)

(-) Exons   (0, 0)

(no "Exon" information available for 4KYV)

(-) Sequences/Alignments

Asymmetric Unit
Chain A from PDB  Type:PROTEIN  Length:298
                                                                                                                                                                                                                                                                                                                                          
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...............eeeee..eeeeeeee.......eeee.....hhhhhh.hhhhhh....eeee...............hhhhhhhhhhhhhhhh....eeeeeehhhhhhhhhhhhhh...eeeeeee.......hhhhhhhhhhhhhhhhh..hhhhhhhh..hhhhhhhhhhh.....hhhhhhhhhhhhhhhhhhhhhhhhhhhh.....hhhhhhhhhhhhhhhhhh...eeeeeeee....hhhhhhhhhhhh..eeeeeeeee..hhhhhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4kyv A   4 HHHHHEIGTGFPFDPHYVEVLGERmHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRCIAPDLIGmGKSDKPDLDYFFDDHVRYLDAFIEALGLEEVVLVIHDWGSALGFHWAKRNPERVKGIACmEFIRPIPTWDEWPEFARETFQAFRTADVGRELIIDQNAFIEGALPmGVVRPLTEVEmDHYREPFLKPVDREPLWRLPNELPIAGEPANIVALVEAYmNWLHQSPVPKLLFWGTPGVLIPPAEAARLAESLPNCKTVDIGPGLFLLQEDNPDLIGSEIARWLPGLAG 306
                                ||  18        28    |   38        48        58        68        78 |      88        98       108       118       128       138 |     148       158       168       178       188       198       208       218       228       238       248       258       268       278       288       298        
                                8|                 33-MSE                                         80-MSE                                                     140-MSE                                       186-MSE    197-MSE                                 237-MSE                                                                 
                                14                                                                                                                                                                                                                                                                                                    

Chain B from PDB  Type:PROTEIN  Length:300
                                                                                                                                                                                                                                                                                                                                            
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .................eeeee..eeeeeeee.......eeee.....hhhhhh.hhhhhh....eeee...............hhhhhhhhhhhhhhhhh...eeeeeehhhhhhhhhhhhhh...eeeeeee.......hhhhhhhhhhhhhhhhh..hhhhhhhh..hhhhhhhhhhh.....hhhhhhhhhhhhhhhhhhhhhhhhhhhh.....hhhhhhhhhhhhhhhhhh...eeeeeeee....hhhhhhhhhhhh..eeeeeeeee..hhhhhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 4kyv B   4 HHHHHSSEIGTGFPFDPHYVEVLGERmHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRCIAPDLIGmGKSDKPDLDYFFDDHVRYLDAFIEALGLEEVVLVIHDWGSALGFHWAKRNPERVKGIACmEFIRPIPTWDEWPEFARETFQAFRTADVGRELIIDQNAFIEGALPmGVVRPLTEVEmDHYREPFLKPVDREPLWRLPNELPIAGEPANIVALVEAYmNWLHQSPVPKLLFWGTPGVLIPPAEAARLAESLPNCKTVDIGPGLFLLQEDNPDLIGSEIARWLPGLAG 306
                                 || 16        26      | 36        46        56        66        76   |    86        96       106       116       126       136   |   146       156       166       176       186       196|      206       216       226       236|      246       256       266       276       286       296       306
                                 9|                  33-MSE                                         80-MSE                                                     140-MSE                                       186-MSE    197-MSE                                 237-MSE                                                                 
                                 13                                                                                                                                                                                                                                                                                                     

Legend:
  Mismatch: orange background
  '-': Gap (green background; border residues have a numbering label)
  Modified Residue: blue background, lower-case; 'x' indicates an undefined single-letter code; labelled with number + name
  'x': Chemical Group (purple background; labelled with number + name, e.g. ACE or NH2)
  Extra numbering lines below/above indicate numbering irregularities, modified residue names, etc.; the number ends below/above '|'
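
Because modified residues appear in lower case, their positions can be recovered from the sequence line alone. The Python sketch below illustrates this for a prefix of chain A. It assumes, from the numbering lines, that the display starts at residue 4 and jumps from residue 8 to residue 14. `SEQ_PREFIX`, `residue_numbers`, and `modified_positions` are hypothetical names, and the sequence is deliberately truncated after the second modified residue.

```python
# Prefix of the chain A sequence as displayed above (truncated for brevity).
SEQ_PREFIX = ("HHHHHEIGTGFPFDPHYVEVLGERmHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRC"
              "IAPDLIGmGKSDKPDL")


def residue_numbers(length, start=4, last_before_gap=8, first_after_gap=14):
    """Assign residue numbers, assuming one numbering jump (8 -> 14) as in chain A."""
    numbers = []
    current = start
    for _ in range(length):
        numbers.append(current)
        # After the last residue before the gap, resume at the first one after it.
        current = first_after_gap if current == last_before_gap else current + 1
    return numbers


def modified_positions(sequence):
    """Return residue numbers of lower-case (modified) residues in the sequence."""
    numbers = residue_numbers(len(sequence))
    return [num for aa, num in zip(sequence, numbers) if aa.islower()]


mse_positions = modified_positions(SEQ_PREFIX)
print(mse_positions)  # [33, 80]
```

The result agrees with the first two labels in the numbering lines (33-MSE and 80-MSE). Chain B has a different numbering irregularity near its N-terminus, so the defaults here would need adjusting for it.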

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 4KYV)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 4KYV)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 4KYV)

(-) Gene Ontology  (0, 0)

Asymmetric Unit
    (no "Gene Ontology" information available for 4KYV)

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  4kyv
    Family and Domain Information: ProDom | SYSTERS
    General Structural Information: GlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in Membranes: OPM
    Protein Surface: SURFACE
    Secondary Structure: DSSP (structure derived) | HSSP (homology derived)
    Structural Genomics: GeneCensus
    Structural Neighbours: CE | VAST
    Structure Classification: CATH | Dali | SCOP
    Validation and Original Data: BMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  (no 'UniProt ID/Accession number' available)
    Comparative Protein Structure Models: ModBase
    Genomic Information: Ensembl
    Protein-protein Interaction: DIP
    Sequence, Family and Domain Information: InterPro | Pfam | SMART | UniProt
 
Access by Enzyme Classification (EC Number)
  (no 'EC Number' available)
    General Enzyme Information: BRENDA | EC-PDB | Enzyme | IntEnz
    Pathway: KEGG | MetaCyc
 
Access by Disease Identifier (MIM ID)
  (no 'MIM ID' available)
    Disease Information: OMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related Information: GenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain Information: XDom
    Interatomic Contacts of Structural Units: CSU
    Ligand-protein Contacts: LPC
    Protein Cavities: CASTp
    Sequence and Secondary Structure: PDBCartoon
    Structure Alignment: STRAP (Java WebStart application, requires Java)
    Structure and Sequence Browser: STING
 
Access by UniProt ID/Accession number
  (no 'UniProt ID/Accession number' available)
    Protein Disorder Prediction: DisEMBL | FoldIndex | GLOBPLOT

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

(no "Entries Sharing at Least One Protein Chain" available for 4KYV)

(-) Related Entries Specified in the PDB File

4kac  100% IDENTITY
NESG-OR150  RELATED DB: TARGETTRACK