Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
(-)Biological Unit 2
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)
Image Biological Unit 2
Biological Unit 2  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF THE THERMUS THERMOPHILUS HYPOTHETICAL PROTEIN TTHA0967, A THIOESTERASE SUPERFAMILY MEMBER
 
Authors :  A. A. Pioszak, K. Murayama, M. Shirouzu, S. Yokoyama, Riken Structura Genomics/Proteomics Initiative (Rsgi)
Date :  27 Jun 05  (Deposition) - 27 Dec 05  (Release) - 13 Jul 11  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.85
Chains :  Asym. Unit :  A,B,C,D
Biol. Unit 1:  A,B  (1x)
Biol. Unit 2:  C,D  (1x)
Keywords :  Structural Genomics, Thioesterase, Nppsfa, National Project On Protein Structural And Functional Analyses, Riken Structural Genomics/Proteomics Initiative, Rsgi, Hydrolase (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  A. A. Pioszak, K. Murayama, M. Shirouzu, S. Yokoyama
Crystal Structure Of The Thermus Thermophilus Hypothetical Protein Ttha0967, A Thioesterase Superfamily Member
To Be Published
PubMed: search

(-) Compounds

Molecule 1 - THIOESTERASE FAMILY PROTEIN
    ChainsA, B, C, D
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET11
    Expression System StrainB834 (DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneTTHA0967
    Organism ScientificTHERMUS THERMOPHILUS
    Organism Taxid300852
    StrainHB8
    SynonymHYPOTHETICAL PROTEIN TTHA0967

 Structural Features

(-) Chains, Units

  1234
Asymmetric Unit ABCD
Biological Unit 1 (1x)AB  
Biological Unit 2 (1x)  CD

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 26)

Asymmetric Unit (2, 26)
No.NameCountTypeFull Name
1EDO6Ligand/Ion1,2-ETHANEDIOL
2MSE20Mod. Amino AcidSELENOMETHIONINE
Biological Unit 1 (2, 13)
No.NameCountTypeFull Name
1EDO3Ligand/Ion1,2-ETHANEDIOL
2MSE10Mod. Amino AcidSELENOMETHIONINE
Biological Unit 2 (2, 13)
No.NameCountTypeFull Name
1EDO3Ligand/Ion1,2-ETHANEDIOL
2MSE10Mod. Amino AcidSELENOMETHIONINE

(-) Sites  (6, 6)

Asymmetric Unit (6, 6)
No.NameEvidenceResiduesDescription
1AC1SOFTWARETRP A:38 , LYS A:41 , TYR B:37 , TRP B:38BINDING SITE FOR RESIDUE EDO A 501
2AC2SOFTWAREARG A:79 , ARG A:81 , ASN A:102 , HOH A:598BINDING SITE FOR RESIDUE EDO A 502
3AC3SOFTWAREPHE A:127 , HOH A:544 , HOH A:573 , HOH B:172BINDING SITE FOR RESIDUE EDO A 503
4AC4SOFTWARETRP C:38 , LYS C:41 , TRP D:38BINDING SITE FOR RESIDUE EDO C 504
5AC5SOFTWAREGLU C:44 , ARG C:48 , ILE C:62 , GLY C:63 , GLN C:115 , PHE D:24BINDING SITE FOR RESIDUE EDO C 505
6AC6SOFTWAREARG C:48 , GLU C:60 , LYS C:120 , HOH C:574BINDING SITE FOR RESIDUE EDO C 506

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 2CWZ)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 2CWZ)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2CWZ)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 2CWZ)

(-) Exons   (0, 0)

(no "Exon" information available for 2CWZ)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:138
 aligned with Q5SJP1_THET8 | Q5SJP1 from UniProtKB/TrEMBL  Length:141

    Alignment length:138
                                    10        20        30        40        50        60        70        80        90       100       110       120       130        
         Q5SJP1_THET8     1 MRPIPEGYEAVFETVVTPEMTVRFEELGPVHPVYATYWMVKHMELAGRKIILPFLEEGEEGIGSYVEARHLASALPGMRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEALFRRLKERWEAER 138
               SCOP domains d2cwza1 A:1-138 Hypothetical protein TTHA0967                                                                                              SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author ........eeeeeee.hhh.eeee...eeeeeeehhhhhhhhhhhhhhhhhh......eeeeeeeeeeee........eeeeeeeeeeee..eeeeeeeeee....eeeeeeeeeeeeehhhhhhhhhhhhhhhhh.. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------ PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 2cwz A   1 mRPIPEGYEAVFETVVTPEmTVRFEELGPVHPVYATYWmVKHmELAGRKIILPFLEEGEEGIGSYVEARHLASALPGmRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEALFRRLKERWEAER 138
                            |       10        20        30        40  |     50        60        70       |80        90       100       110       120       130        
                            |                 20-MSE             39-MSE                                 78-MSE                                                        
                            1-MSE                                    43-MSE                                                                                           

Chain B from PDB  Type:PROTEIN  Length:137
 aligned with Q5SJP1_THET8 | Q5SJP1 from UniProtKB/TrEMBL  Length:141

    Alignment length:137
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       
         Q5SJP1_THET8     1 MRPIPEGYEAVFETVVTPEMTVRFEELGPVHPVYATYWMVKHMELAGRKIILPFLEEGEEGIGSYVEARHLASALPGMRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEALFRRLKERWEAE 137
               SCOP domains d2cwzb_ B: Hypothetical protein TTHA0967                                                                                                  SCOP domains
               CATH domains ----------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ........eeeeeee.hhh.eeee...eeeeeeehhhhhhhhhhhhhhhhhhhhh...eeeeeeeeeeee........eeeeeeeeeeee..eeeeeeeeee....eeeeeeeeeeeeehhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2cwz B   1 mRPIPEGYEAVFETVVTPEmTVRFEELGPVHPVYATYWmVKHmELAGRKIILPFLEEGEEGIGSYVEARHLASALPGmRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEALFRRLKERWEAE 137
                            |       10        20        30        40  |     50        60        70       |80        90       100       110       120       130       
                            1-MSE             20-MSE             39-MSE                                 78-MSE                                                       
                                                                     43-MSE                                                                                          

Chain C from PDB  Type:PROTEIN  Length:137
 aligned with Q5SJP1_THET8 | Q5SJP1 from UniProtKB/TrEMBL  Length:141

    Alignment length:137
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       
         Q5SJP1_THET8     1 MRPIPEGYEAVFETVVTPEMTVRFEELGPVHPVYATYWMVKHMELAGRKIILPFLEEGEEGIGSYVEARHLASALPGMRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEALFRRLKERWEAE 137
               SCOP domains d2cwzc_ C: Hypothetical protein TTHA0967                                                                                                  SCOP domains
               CATH domains ----------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ........eeeeeee.hhh.eeee...eeeeeeehhhhhhhhhhhhhhhhhh......eeeeeeeeeeee........eeeeeeeeeeee..eeeeeeeeee....eeeeeeeeeeeeehhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2cwz C   1 mRPIPEGYEAVFETVVTPEmTVRFEELGPVHPVYATYWmVKHmELAGRKIILPFLEEGEEGIGSYVEARHLASALPGmRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEALFRRLKERWEAE 137
                            |       10        20        30        40  |     50        60        70       |80        90       100       110       120       130       
                            1-MSE             20-MSE             39-MSE                                 78-MSE                                                       
                                                                     43-MSE                                                                                          

Chain D from PDB  Type:PROTEIN  Length:137
 aligned with Q5SJP1_THET8 | Q5SJP1 from UniProtKB/TrEMBL  Length:141

    Alignment length:137
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       
         Q5SJP1_THET8     1 MRPIPEGYEAVFETVVTPEMTVRFEELGPVHPVYATYWMVKHMELAGRKIILPFLEEGEEGIGSYVEARHLASALPGMRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEALFRRLKERWEAE 137
               SCOP domains d2cwzd_ D: Hypothetical protein TTHA0967                                                                                                  SCOP domains
               CATH domains ----------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ........eeeeeee.hhh.eeee...eeeeeeehhhhhhhhhhhhhhhhhh......eeeeeeeeeeee........eeeeeeeeeeee..eeeeeeeeee....eeeeeeeeeeeeehhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2cwz D   1 mRPIPEGYEAVFETVVTPEmTVRFEELGPVHPVYATYWmVKHmELAGRKIILPFLEEGEEGIGSYVEARHLASALPGmRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEALFRRLKERWEAE 137
                            |       10        20        30        40  |     50        60        70       |80        90       100       110       120       130       
                            1-MSE             20-MSE             39-MSE                                 78-MSE                                                       
                                                                     43-MSE                                                                                          

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 4)

Asymmetric Unit

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 2CWZ)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2CWZ)

(-) Gene Ontology  (0, 0)

Asymmetric Unit(hide GO term definitions)
    (no "Gene Ontology" information available for 2CWZ)

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    EDO  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    MSE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2cwz)
 
Biological Units
  Complete Structure
    Biological Unit 1  [ Jena3D ]
    Biological Unit 2  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2cwz
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  Q5SJP1_THET8 | Q5SJP1
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/TrEMBL
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  Q5SJP1_THET8 | Q5SJP1
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

(no "Entries Sharing at Least One Protein Chain" available for 2CWZ)

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2CWZ)