
(-) Description

Title :  ERGOTHIONEINE-BIOSYNTHETIC NTN HYDROLASE EGTC WITH GLUTAMINE
 
Authors :  A. Vit, F. P. Seebeck, W. Blankenfeldt
Date :  21 Apr 15  (Deposition) - 01 Jul 15  (Release) - 15 Jul 15  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.82 Å
Chains :  Asym./Biol. Unit :  A,B,C,D
Keywords :  Ntn Hydrolase, Ergothioneine Biosynthesis, Sulfur Chemistry, Mycobacteria, Hydrolase
 
Reference :  A. Vit, G. T. Mashabela, W. Blankenfeldt, F. P. Seebeck
Structure Of The Ergothioneine-Biosynthesis Amidohydrolase Egtc.
Chembiochem V. 16 1490 2015
PubMed-ID: 26079795  |  Reference-DOI: 10.1002/CBIC.201500168

(-) Compounds

Molecule 1 - AMIDOHYDROLASE EGTC
    Chains :  A, B, C, D
    EC Number :  3.5.1.-
    Engineered :  YES
    Expression System :  ESCHERICHIA COLI BL21(DE3)
    Expression System Plasmid :  PET28A
    Expression System Taxid :  469008
    Expression System Vector Type :  PLASMID
    Gene :  EGTC, MSMEG_6248, MSMEI_6087
    Mutation :  YES
    Organism Scientific :  MYCOBACTERIUM SMEGMATIS
    Organism Taxid :  246196

 Structural Features

(-) Chains, Units

                             1  2  3  4
Asymmetric/Biological Unit   A  B  C  D

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 31)

Asymmetric/Biological Unit (2, 31)
No. | Name | Count | Type | Full Name
1 | EDO | 27 | Ligand/Ion | 1,2-ETHANEDIOL
2 | GLN | 4 | Mod. Amino Acid | GLUTAMINE

(-) Sites  (31, 31)

Asymmetric Unit (31, 31)
No. | Name | Evidence | Residues | Description
01 | AC1 | SOFTWARE | LEU A:19, ASP A:20, PRO A:21, VAL A:27, HOH A:406, HOH A:509 | binding site for residue EDO A 301
02 | AC2 | SOFTWARE | GLY A:181, ASP A:182, THR A:183, ASP A:211, HOH A:448, HOH A:452, HOH A:492, HOH A:521 | binding site for residue EDO A 302
03 | AC3 | SOFTWARE | ARG A:58, ARG A:60, EDO A:304, HOH A:414, HOH A:562, SER B:70 | binding site for residue EDO A 303
04 | AC4 | SOFTWARE | ARG A:58, EDO A:303, HOH A:407, SER B:73 | binding site for residue EDO A 304
05 | AC5 | SOFTWARE | CYS A:2, ARG A:88, SER A:89, ALA A:90, THR A:91, ASN A:114, GLY A:115, VAL A:132, ASP A:133, SER A:134, HOH A:445, HOH A:486, HOH A:489, HOH A:581 | binding site for residue GLN A 305
06 | AC6 | SOFTWARE | GLY B:38, SER B:89, THR B:91, GLN B:311, HOH B:464, HOH B:472, HOH B:493 | binding site for residue EDO B 301
07 | AC7 | SOFTWARE | LEU B:19, ASP B:20, PRO B:21, VAL B:27, HOH B:416, HOH B:480, HOH B:562 | binding site for residue EDO B 302
08 | AC8 | SOFTWARE | MET B:40, HOH B:409, HOH B:439, TRP D:66 | binding site for residue EDO B 303
09 | AC9 | SOFTWARE | ALA A:69, SER A:70, ARG B:58, ARG B:60, HOH B:402, HOH B:506 | binding site for residue EDO B 304
10 | AD1 | SOFTWARE | TRP B:66, GLY B:67, ALA B:72, HOH B:444, HOH B:575, HOH B:619 | binding site for residue EDO B 305
11 | AD2 | SOFTWARE | PHE B:50, ASP B:51, ARG B:81, GLY B:172, HOH B:504 | binding site for residue EDO B 306
12 | AD3 | SOFTWARE | ALA B:151, LEU B:228, HIS B:230, HOH B:433, HOH B:551 | binding site for residue EDO B 307
13 | AD4 | SOFTWARE | GLY B:181, ASP B:182, THR B:183, ASP B:211, HOH B:422, HOH B:431, HOH B:435, HOH B:518 | binding site for residue EDO B 308
14 | AD5 | SOFTWARE | TRP B:109, GLY B:145, LEU B:146, ASP B:147, HOH B:414, HOH B:463 | binding site for residue EDO B 309
15 | AD6 | SOFTWARE | ARG B:34, TRP B:206, SER B:207, ASP B:208, HOH B:516 | binding site for residue EDO B 310
16 | AD7 | SOFTWARE | CYS B:2, ARG B:88, SER B:89, THR B:91, ASN B:114, GLY B:115, VAL B:132, ASP B:133, SER B:134, EDO B:301, HOH B:451, HOH B:492, HOH B:493 | binding site for residue GLN B 311
17 | AD8 | SOFTWARE | ARG C:58, ARG C:60, HOH C:407, HOH C:413, ALA D:69, SER D:70 | binding site for residue EDO C 301
18 | AD9 | SOFTWARE | HIS C:37, SER C:89, THR C:91, GLN C:309, HOH C:533, HOH C:559 | binding site for residue EDO C 302
19 | AE1 | SOFTWARE | LEU C:8, VAL C:217, ARG C:218, ASP C:219, ALA C:220 | binding site for residue EDO C 303
20 | AE2 | SOFTWARE | LEU C:19, ASP C:20, PRO C:21, VAL C:27, HOH C:415 | binding site for residue EDO C 304
21 | AE3 | SOFTWARE | ARG C:33, ARG C:34, HOH C:405 | binding site for residue EDO C 305
22 | AE4 | SOFTWARE | ALA C:56, ARG C:58, SER C:105, HOH C:418 | binding site for residue EDO C 306
23 | AE5 | SOFTWARE | ARG C:144, GLU C:155, LEU C:159, HOH C:409 | binding site for residue EDO C 307
24 | AE6 | SOFTWARE | ASP C:182, ASP C:211, HOH C:433, HOH C:487, HOH C:497 | binding site for residue EDO C 308
25 | AE7 | SOFTWARE | CYS C:2, ARG C:88, SER C:89, THR C:91, ASN C:114, GLY C:115, VAL C:132, ASP C:133, SER C:134, EDO C:302, HOH C:475, HOH C:510, HOH C:533 | binding site for residue GLN C 309
26 | AE8 | SOFTWARE | LEU D:19, VAL D:27, HOH D:421 | binding site for residue EDO D 301
27 | AE9 | SOFTWARE | ALA C:69, SER C:70, ARG D:58, ARG D:60, HOH D:404 | binding site for residue EDO D 302
28 | AF1 | SOFTWARE | VAL D:117, ASP D:118, ARG D:119, HOH D:475 | binding site for residue EDO D 303
29 | AF2 | SOFTWARE | TRP D:109, GLY D:145, LEU D:146, ASP D:147, HOH D:411, HOH D:483 | binding site for residue EDO D 304
30 | AF3 | SOFTWARE | ALA C:77, ALA D:77, LEU D:78, ARG D:79, HOH D:423 | binding site for residue EDO D 305
31 | AF4 | SOFTWARE | CYS D:2, ARG D:88, SER D:89, THR D:91, ASN D:114, GLY D:115, VAL D:132, ASP D:133, SER D:134, HOH D:453, HOH D:479, HOH D:513 | binding site for residue GLN D 306
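
Several sites in the table above include residues from a chain other than the one carrying the ligand (for example, site AC3 for EDO A 303 lists SER B:70). The following is a minimal Python sketch of how such a residue list could be parsed to pick out those cross-chain contacts. The names `SiteResidue`, `parse_site_residues` and `neighbour_chain_residues` are hypothetical helpers introduced for illustration only; they are not part of any PDB tool, and the regular expression assumes entries of the form "ARG A:58" with single-character chain identifiers.

```python
import re
from typing import List, NamedTuple


class SiteResidue(NamedTuple):
    name: str    # three-letter residue or ligand code, e.g. "ARG", "HOH"
    chain: str   # chain identifier, e.g. "A"
    number: int  # residue number within the chain


# Assumed entry format: "<code> <chain>:<number>", e.g. "ARG A:58".
_RESIDUE = re.compile(r"\b([A-Za-z0-9]{1,3})\s+([A-Za-z0-9]):(-?\d+)\b")


def parse_site_residues(text: str) -> List[SiteResidue]:
    """Parse a comma-separated residue list taken from one site row."""
    return [SiteResidue(n, c, int(i)) for n, c, i in _RESIDUE.findall(text)]


def neighbour_chain_residues(text: str, ligand_chain: str) -> List[SiteResidue]:
    """Return the residues that belong to a chain other than the ligand's."""
    return [r for r in parse_site_residues(text) if r.chain != ligand_chain]


# Residue list of site AC3 (binding site for residue EDO A 303).
ac3 = "ARG A:58 , ARG A:60 , EDO A:304 , HOH A:414 , HOH A:562 , SER B:70"
print(neighbour_chain_residues(ac3, "A"))
```

For site AC3 the only residue returned is SER B:70, which suggests that this ethanediol sits at the interface between chains A and B.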

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 4ZFK)

(-) Cis Peptide Bonds  (4, 4)

Asymmetric/Biological Unit
No. | Residues
1 | Ala A:102 - Pro A:103
2 | Ala B:102 - Pro B:103
3 | Ala C:102 - Pro C:103
4 | Ala D:102 - Pro D:103

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 4ZFK)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 4ZFK)

(-) Exons   (0, 0)

(no "Exon" information available for 4ZFK)

(-) Sequences/Alignments

Asymmetric/Biological Unit
Chain A from PDB  Type:PROTEIN  Length:227
                                                                                                                                                                                                                                                                   
               SCOP domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeeeeeeehhhhhh....hhhhhh...............eeeeee.....eeeeee..hhhhhhhhhhhh...eeeeeeeee.........hhhhh..ee....eeeeeee.hhhhh.........hhhhhhhhhhhhhh..hhhhhhhhhhhhh...eeeeeee....eeeee.....eeeee..eeeee..........ee....eeeeee..eeeeee.. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4zfk A   2 CRHVAWLGAPRSLADLVLDPPQGLLVQSYAPRRQKHGLMNADGWGAGFFDDDGVARRWRSDKPLWGDASFASVAPALRSRCVVAAVRSATIGMPIEPSASAPFSDGQWLLSHNGLVDRGVLPLTGAAESTVDSAILAALIFSRGLDALGATIAEVGELDPNARLNILAANGSRLLATTWGDTLSVLRRPDGVVLASEPYDDDPGWSDIPDRHLVDVRDAHVVVTPLL 228
                                    11        21        31        41        51        61        71        81        91       101       111       121       131       141       151       161       171       181       191       201       211       221       

Chain B from PDB  Type:PROTEIN  Length:231
                                                                                                                                                                                                                                                                       
               SCOP domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeeeeeeehhhhhh....hhhhhh...............eeeeee.....eeeeee..hhhhhhhhhhhh...eeeeeeeee.........hhhhh..ee....eeeeeee.hhhhh.........hhhhhhhhhhhhhh..hhhhhhhhhhhhh...eeeeeee....eeeee.....eeeee..eeeee..........ee....eeeeee..eeeeee...... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4zfk B   2 CRHVAWLGAPRSLADLVLDPPQGLLVQSYAPRRQKHGLMNADGWGAGFFDDDGVARRWRSDKPLWGDASFASVAPALRSRCVVAAVRSATIGMPIEPSASAPFSDGQWLLSHNGLVDRGVLPLTGAAESTVDSAILAALIFSRGLDALGATIAEVGELDPNARLNILAANGSRLLATTWGDTLSVLRRPDGVVLASEPYDDDPGWSDIPDRHLVDVRDAHVVVTPLLEHHH 232
                                    11        21        31        41        51        61        71        81        91       101       111       121       131       141       151       161       171       181       191       201       211       221       231 

Chain C from PDB  Type:PROTEIN  Length:230
                                                                                                                                                                                                                                                                      
               SCOP domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeeeeeeehhhhhh....hhhhhh...............eeeeee.....eeeeee..hhhhhhhhhhhh...eeeeeeeee.........hhhhh..ee....eeeeeee.hhhhh.........hhhhhhhhhhhhhh..hhhhhhhhhhhhh...eeeeeee....eeeee.....eeeee..eeeee..........ee....eeeeee..eeeeee..... Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4zfk C   2 CRHVAWLGAPRSLADLVLDPPQGLLVQSYAPRRQKHGLMNADGWGAGFFDDDGVARRWRSDKPLWGDASFASVAPALRSRCVVAAVRSATIGMPIEPSASAPFSDGQWLLSHNGLVDRGVLPLTGAAESTVDSAILAALIFSRGLDALGATIAEVGELDPNARLNILAANGSRLLATTWGDTLSVLRRPDGVVLASEPYDDDPGWSDIPDRHLVDVRDAHVVVTPLLEHH 231
                                    11        21        31        41        51        61        71        81        91       101       111       121       131       141       151       161       171       181       191       201       211       221       231

Chain D from PDB  Type:PROTEIN  Length:228
                                                                                                                                                                                                                                                                    
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .eeeeeeeeeeehhhhhh....hhhhhh...............eeeeee.....eeeeee..hhhhhhhhhhhh...eeeeeeeee.........hhhhh..ee....eeeeeee.hhhhh.........hhhhhhhhhhhhhh..hhhhhhhhhhhhh...eeeeeee....eeeee.....eeeee..eeeee..........ee....eeeeee..eeeeee... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 4zfk D   2 CRHVAWLGAPRSLADLVLDPPQGLLVQSYAPRRQKHGLMNADGWGAGFFDDDGVARRWRSDKPLWGDASFASVAPALRSRCVVAAVRSATIGMPIEPSASAPFSDGQWLLSHNGLVDRGVLPLTGAAESTVDSAILAALIFSRGLDALGATIAEVGELDPNARLNILAANGSRLLATTWGDTLSVLRRPDGVVLASEPYDDDPGWSDIPDRHLVDVRDAHVVVTPLLE 229
                                    11        21        31        41        51        61        71        81        91       101       111       121       131       141       151       161       171       181       191       201       211       221        
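
The stated chain lengths can be cross-checked against the residue numbers printed at either end of each sequence line. The sketch below is a small Python consistency check under the assumption that the numbering within each chain is continuous (no gaps or insertion codes); the names `CHAINS`, `numbered_span` and `check_lengths` are illustrative and not taken from the entry.

```python
# (first residue number, last residue number, stated length) per chain,
# copied from the chain headers and sequence lines of this entry.
CHAINS = {
    "A": (2, 228, 227),
    "B": (2, 232, 231),
    "C": (2, 231, 230),
    "D": (2, 229, 228),
}


def numbered_span(first: int, last: int) -> int:
    """Number of residues between first and last, assuming no gaps."""
    return last - first + 1


def check_lengths(chains: dict) -> dict:
    """Map each chain ID to True if its numbered span equals its stated length."""
    return {
        chain: numbered_span(first, last) == length
        for chain, (first, last, length) in chains.items()
    }


print(check_lengths(CHAINS))
```

All four chains start at residue 2 and differ only in how many C-terminal residues are present, which accounts for the different lengths.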


 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 4ZFK)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 4ZFK)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 4ZFK)

(-) Gene Ontology  (5, 5)

Asymmetric/Biological Unit

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  4zfk
    Family and Domain Information :  ProDom | SYSTERS
    General Structural Information :  GlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in Membranes :  OPM
    Protein Surface :  SURFACE
    Secondary Structure :  DSSP (structure derived) | HSSP (homology derived)
    Structural Genomics :  GeneCensus
    Structural Neighbours :  CE | VAST
    Structure Classification :  CATH | Dali | SCOP
    Validation and Original Data :  BMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  EGTC_MYCS2 | A0R5M9
    Comparative Protein Structure Models :  ModBase
    Genomic Information :  Ensembl
    Protein-protein Interaction :  DIP
    Sequence, Family and Domain Information :  InterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classification (EC Number)
  3.5.1.-
    General Enzyme Information :  BRENDA | EC-PDB | Enzyme | IntEnz
    Pathway :  KEGG | MetaCyc
 

(-) Analysis Tools

Access by PDB/NDB ID
    Domain Information :  XDom
    Interatomic Contacts of Structural Units :  CSU
    Ligand-protein Contacts :  LPC
    Protein Cavities :  castP
    Sequence and Secondary Structure :  PDBCartoon
    Structure Alignment :  STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence Browser :  STING
 
Access by UniProt ID/Accession number
  EGTC_MYCS2 | A0R5M9
    Protein Disorder Prediction :  DisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        EGTC_MYCS2 | A0R5M9 :  4zfj, 4zfl

(-) Related Entries Specified in the PDB File

4zfj :  4ZFJ CONTAINS THE EGTC PROTEIN IN ITS APO FORM