Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF A FUNGAL CHIMERIC CELLOBIOHYDROLASE CEL6A
 
Authors :  F. H. Arnold, I. Wu
Date :  28 Nov 12  (Deposition) - 03 Apr 13  (Release) - 12 Jun 13  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.50
Chains :  Asym./Biol. Unit :  A
Keywords :  Cellobiohydrolase, Chimera Protein, Glycoside Hydrolase, Hydrolase (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  I. Wu, F. H. Arnold
Engineered Thermostable Fungal Cel6A And Cel7A Cellobiohydrolases Hydrolyze Cellulose Efficiently At Elevated Temperatures.
Biotechnol. Bioeng. V. 110 1874 2013
PubMed-ID: 23404363  |  Reference-DOI: 10.1002/BIT.24864

(-) Compounds

Molecule 1 - CHIMERIC CEL6A
    ChainsA
    EngineeredYES
    Expression SystemSACCHAROMYCES CEREVISIAE
    Expression System Taxid4932
    GeneAVI2, CEL6A, CBH2
    Organism CommonTRICHODERMA REESEI, TRICHODERMA REESEI
    Organism ScientificHUMICOLA INSOLENS, TRICHODERMA REESEI, CHAETOMIUM THERMOPHILUM
    Organism Taxid34413, 51453, 209285

 Structural Features

(-) Chains, Units

  1
Asymmetric/Biological Unit A

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (4, 8)

Asymmetric/Biological Unit (4, 8)
No.NameCountTypeFull Name
1ACT1Ligand/IonACETATE ION
2EDO5Ligand/Ion1,2-ETHANEDIOL
3PG41Ligand/IonTETRAETHYLENE GLYCOL
4PGE1Ligand/IonTRIETHYLENE GLYCOL

(-) Sites  (8, 8)

Asymmetric Unit (8, 8)
No.NameEvidenceResiduesDescription
1AC1SOFTWARELYS A:195 , ASN A:198 , VAL A:208 , GLU A:209 , SER A:211BINDING SITE FOR RESIDUE PGE A 501
2AC2SOFTWAREASN A:102 , ARG A:105 , HIS A:109BINDING SITE FOR RESIDUE PG4 A 502
3AC3SOFTWARELYS A:395 , GLU A:399BINDING SITE FOR RESIDUE EDO A 503
4AC4SOFTWAREVAL A:193 , LYS A:197 , GLU A:245 , HOH A:767BINDING SITE FOR RESIDUE EDO A 504
5AC5SOFTWAREPRO A:321 , PRO A:323 , HOH A:819BINDING SITE FOR RESIDUE EDO A 505
6AC6SOFTWARETRP A:99 , ALA A:100 , ARG A:105 , ASN A:238BINDING SITE FOR RESIDUE EDO A 506
7AC7SOFTWARETHR A:229 , ASN A:276 , HIS A:450 , HIS A:452BINDING SITE FOR RESIDUE EDO A 507
8AC8SOFTWARELEU A:256 , PRO A:257 , VAL A:259 , ARG A:295 , ALA A:296 , HOH A:729 , HOH A:765BINDING SITE FOR RESIDUE ACT A 508

(-) SS Bonds  (2, 2)

Asymmetric/Biological Unit
No.Residues
1A:177 -A:236
2A:368 -A:415

(-) Cis Peptide Bonds  (4, 4)

Asymmetric/Biological Unit
No.Residues
1Ser A:320 -Pro A:321
2Gln A:357 -Pro A:358
3Gln A:422 -Pro A:423
4Asn A:443 -Pro A:444

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 4I5R)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 4I5R)

(-) Exons   (0, 0)

(no "Exon" information available for 4I5R)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:364
                                                                                                                                                                                                                                                                                                                                                                                                            
               SCOP domains d4i5ra_ A: automated matches                                                                                                                                                                                                                                                                                                                                                 SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .......ee..hhhhhhhhhhhhhhhh.hhhhhhhhhhhh.....eee.hhhhhhhhhhhhhhhhhhhhh...eeeeeee.................hhhhhhhhhhhhhhhhhhhhhhhh....eeeee...hhhhhhhh..hhhhhhhhhhhhhhhhhhhhhh....eeeeee........hhhhhhhhhhhhhhhhhhh.....eeeeee.............hhhhh.....hhhhhhhhhhhhhhhh....eeeee.....................................eeeee...................hhhhhh.............hhhhhhhhhhh............ Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4i5r A  90 GNPFEGVQLWANNYYRSEVHTLAIPQITDPALRAAASAVAEVPSFMWLDTLDKTPLMEQTLADIRTANKNGGNYAGQFVVYDLPDRDCAALASNGEYSIADGGVAKYKNYIDTIRQIVVEYSDIRTLLVIEPDSLANLVTNLGTPKCANAQSAYLECINYAVTQLNLPNVAMYLDAGHAGWLGWPANQDPAAQLFANVYKNASSPRALRGLATNVANYNAWSIASPPSYTSPNPNYDEKHYIEAFAPLLRNQGFDAKFIVDTGRNGKQPTGQLEWGHWCNVKGTGFGVRPTANTGHELVDAFVWVKPGGESDGTSDSSAPRFDSHCALPDALQPAPQAGAWFQAYFVQLLTNANPSFLHHHHHH 453
                                    99       109       119       129       139       149       159       169       179       189       199       209       219       229       239       249       259       269       279       289       299       309       319       329       339       349       359       369       379       389       399       409       419       429       439       449    

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 1)

Asymmetric/Biological Unit

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 4I5R)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 4I5R)

(-) Gene Ontology  (11, 30)

Asymmetric/Biological Unit(hide GO term definitions)

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    ACT  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    EDO  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    PG4  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    PGE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
    AC7  [ RasMol ]  +environment [ RasMol ]
    AC8  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Asn A:443 - Pro A:444   [ RasMol ]  
    Gln A:357 - Pro A:358   [ RasMol ]  
    Gln A:422 - Pro A:423   [ RasMol ]  
    Ser A:320 - Pro A:321   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  4i5r
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  GUX2_HYPJE | P07987
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  GUX6_HUMIN | Q9C1S9
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  Q5G2D4_9PEZI | Q5G2D4
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  GUX2_HYPJE | P07987
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  GUX6_HUMIN | Q9C1S9
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  Q5G2D4_9PEZI | Q5G2D4
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        GUX2_HYPJE | P079871cb2 1hgw 1hgy 1qjw 1qk0 1qk2 3cbh 4au0 4ax6 4ax7 4i5u
        GUX6_HUMIN | Q9C1S91bvw 1gz1 1oc5 1oc6 1oc7 1ocb 1ocj 1ocn 2bvw 4i5u
        Q5G2D4_9PEZI | Q5G2D44i5u

(-) Related Entries Specified in the PDB File

4i5u FUNGAL CHIMERIC CELLOBIOHYDROLASE CEL6A