
(-) Description

Title :  STRUCTURE OF THAUMATIN AT 2.0 A WAVELENGTH
 
Authors :  C. Mueller-Dieckmann, M. S. Weiss
Date :  22 Feb 06  (Deposition) - 20 Feb 07  (Release) - 24 Feb 09  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.98 Å
Chains :  Asym./Biol. Unit :  A
Keywords :  Thaumatin Structure At A Wavelength Of 2 A, Plant Protein
 
Reference :  C. Mueller-Dieckmann, S. Panjikar, A. Schmidt, S. Mueller, J. Kuper, A. Geerlof, M. Wilmanns, R. K. Singh, P. A. Tucker, M. S. Weiss
On The Routine Use Of Soft X-Rays In Macromolecular Crystallography. Part IV. Efficient Determination Of Anomalous Substructures In Biomacromolecules Using Longer X-Ray Wavelengths.
Acta Crystallogr., Sect. D, V. 63, 366, 2007
PubMed-ID: 17327674  |  Reference-DOI: 10.1107/S0907444906055624
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - THAUMATIN-1
    Chains: A
    Organism Common: MIRACLE FRUIT
    Organism Scientific: THAUMATOCOCCUS DANIELLII
    Organism Taxid: 4621
    Synonym: THAUMATIN I

 Structural Features

(-) Chains, Units

  1
Asymmetric/Biological Unit A

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 1)

Asymmetric/Biological Unit (1, 1)
No.  Name  Count  Type        Full Name
1    TAR   1      Ligand/Ion  D(-)-TARTARIC ACID

(-) Sites  (1, 1)

Asymmetric Unit (1, 1)
No.  Name  Evidence  Residues                                                                   Description
1    AC1   SOFTWARE  ARG A:29, GLU A:35, SER A:36, PHE A:152, TYR A:157, HOH A:278, HOH A:279  BINDING SITE FOR RESIDUE TAR A 208

(-) SS Bonds  (8, 8)

Asymmetric/Biological Unit
No.  Residues
1    A:9 - A:204
2    A:56 - A:66
3    A:71 - A:77
4    A:121 - A:193
5    A:126 - A:177
6    A:134 - A:145
7    A:149 - A:158
8    A:159 - A:164
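
As an illustration of how the SS bond list above could be consumed programmatically, the following minimal sketch parses the eight residue pairs and prints the sequence separation of each bond. The names `SS_BONDS` and `parse_ss_bonds` are hypothetical helpers introduced here and are not part of any PDB tool; the pair strings are copied from the list above.

```python
import re

# Disulfide pairs copied from the "SS Bonds" list of entry 2G4Y (chain:residue).
SS_BONDS = """\
A:9 - A:204
A:56 - A:66
A:71 - A:77
A:121 - A:193
A:126 - A:177
A:134 - A:145
A:149 - A:158
A:159 - A:164
"""

def parse_ss_bonds(text):
    """Return a list of ((chain, resnum), (chain, resnum)) tuples.

    Hypothetical helper: accepts 'A:9 - A:204' as well as 'A:9 -A:204'.
    """
    pattern = re.compile(r"([A-Za-z]):(\d+)\s*-\s*([A-Za-z]):(\d+)")
    return [((c1, int(r1)), (c2, int(r2)))
            for c1, r1, c2, r2 in pattern.findall(text)]

bonds = parse_ss_bonds(SS_BONDS)

# Print each bond with the number of residues separating the two cysteines.
for (c1, r1), (c2, r2) in bonds:
    print(f"{c1}:{r1} - {c2}:{r2}  separation = {r2 - r1}")

# Collect every residue number that takes part in a disulfide bond.
bonded_positions = sorted({r for pair in bonds for (_, r) in pair})
print(len(bonded_positions), "residues in disulfide bonds")
```

The separation is simply the difference in residue numbers along chain A; it says nothing about spatial distance, which would require the atomic coordinates.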

(-) Cis Peptide Bonds  (1, 1)

Asymmetric/Biological Unit
No.  Residues
1    Pro A:83 - Pro A:84

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2G4Y)

(-) PROSITE Motifs  (2, 2)

Asymmetric/Biological Unit (2, 2)
No.  PROSITE ID   PROSITE AC  Description                  UniProtKB ID  UniProtKB Location  PDB Count  PDB Location
1    THAUMATIN_2  PS51367     Thaumatin family profile.    THM1_THADA    1-207               1          A:1-207
2    THAUMATIN_1  PS00316     Thaumatin family signature.  THM1_THADA    62-77               1          A:62-77

(-) Exons   (0, 0)

(no "Exon" information available for 2G4Y)

(-) Sequences/Alignments

Asymmetric/Biological Unit
Chain A from PDB  Type:PROTEIN  Length:207
 aligned with Q8RVT0_THADA | Q8RVT0 from UniProtKB/TrEMBL  Length:207

    Alignment length:207
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       
         Q8RVT0_THADA     1 ATFEIVNRCSYTVWAAASKGDAALDAGGRQLNSGESWTINVEPGTNGGKIWARTDCYFDDSGSGICKTGDCGGLLRCKRFGRPPTTLAEFSLNQYGKDYIDISNIKGFNVPMDFSPTTRGCRGVRCAADIVGQCPAKLKAPGGGCNDACTVFQTSEYCCTTGKCGPTEYSRFFKRLCPDAFSYVLDKPTTVTCPGSSNYRVTFCPTA 207
               SCOP domains d2g4ya_ A: automated matches                                                                                                                                                                                    SCOP domains
               CATH domains 2g4yA00 A:1-207 Thaumatin                                                                                                                                                                                       CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeee.....eeeeee....eeeeeeeee....eeeee.......eeeeeeeeeee.....eeeee..................eeeeeeee..eeeeeee........eeeee.......eee..hhhhhhhhhhh.......hhhhhhhhhhhhh.......hhhhhhhhhhh.............eeee....eeeee.... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2g4y A   1 ATFEIVNRCSYTVWAAASKGDAALDAGGRQLNSGESWTINVEPGTKGGKIWARTDCYFDDSGSGICKTGDCGGLLRCKRFGRPPTTLAEFSLNQYGKDYIDISNIKGFNVPMDFSPTTRGCRGVRCAADIVGQCPAKLKAPGGGCNDACTVFQTSEYCCTTGKCGPTEYSRFFKRLCPDAFSYVLDKPTTVTCPGSSNYRVTFCPTA 207
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       

Chain A from PDB  Type:PROTEIN  Length:207
 aligned with THM1_THADA | P02883 from UniProtKB/Swiss-Prot  Length:207

    Alignment length:207
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       
           THM1_THADA     1 ATFEIVNRCSYTVWAAASKGDAALDAGGRQLNSGESWTINVEPGTNGGKIWARTDCYFDDSGSGICKTGDCGGLLRCKRFGRPPTTLAEFSLNQYGKDYIDISNIKGFNVPMNFSPTTRGCRGVRCAADIVGQCPAKLKAPGGGCNDACTVFQTSEYCCTTGKCGPTEYSRFFKRLCPDAFSYVLDKPTTVTCPGSSNYRVTFCPTA 207
               SCOP domains d2g4ya_ A: automated matches                                                                                                                                                                                    SCOP domains
               CATH domains 2g4yA00 A:1-207 Thaumatin                                                                                                                                                                                       CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeee.....eeeeee....eeeeeeeee....eeeee.......eeeeeeeeeee.....eeeee..................eeeeeeee..eeeeeee........eeeee.......eee..hhhhhhhhhhh.......hhhhhhhhhhhhh.......hhhhhhhhhhh.............eeee....eeeee.... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                PROSITE (1) THAUMATIN_2  PDB: A:1-207 UniProt: 1-207                                                                                                                                                                        PROSITE (1)
                PROSITE (2) -------------------------------------------------------------THAUMATIN_1     ---------------------------------------------------------------------------------------------------------------------------------- PROSITE (2)
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2g4y A   1 ATFEIVNRCSYTVWAAASKGDAALDAGGRQLNSGESWTINVEPGTKGGKIWARTDCYFDDSGSGICKTGDCGGLLRCKRFGRPPTTLAEFSLNQYGKDYIDISNIKGFNVPMDFSPTTRGCRGVRCAADIVGQCPAKLKAPGGGCNDACTVFQTSEYCCTTGKCGPTEYSRFFKRLCPDAFSYVLDKPTTVTCPGSSNYRVTFCPTA 207
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       
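
The two alignments are ungapped and 207 residues long, so differences between the UniProt sequence and the deposited chain can be found by a column-by-column comparison. The sketch below does this for THM1_THADA (P02883) against chain A; the sequences are copied from the alignment above and split into 10-residue blocks to match the numbering ruler. The function name `mismatches` and the variable names are hypothetical and chosen only for this illustration.

```python
# Sequences copied from the alignment of chain A with THM1_THADA (P02883).
# Blocks of 10 residues mirror the numbering ruler; spaces are stripped below.
UNIPROT_P02883 = (
    "ATFEIVNRCS YTVWAAASKG DAALDAGGRQ LNSGESWTIN VEPGTNGGKI "
    "WARTDCYFDD SGSGICKTGD CGGLLRCKRF GRPPTTLAEF SLNQYGKDYI "
    "DISNIKGFNV PMNFSPTTRG CRGVRCAADI VGQCPAKLKA PGGGCNDACT "
    "VFQTSEYCCT TGKCGPTEYS RFFKRLCPDA FSYVLDKPTT VTCPGSSNYR "
    "VTFCPTA"
).replace(" ", "")

PDB_2G4Y_A = (
    "ATFEIVNRCS YTVWAAASKG DAALDAGGRQ LNSGESWTIN VEPGTKGGKI "
    "WARTDCYFDD SGSGICKTGD CGGLLRCKRF GRPPTTLAEF SLNQYGKDYI "
    "DISNIKGFNV PMDFSPTTRG CRGVRCAADI VGQCPAKLKA PGGGCNDACT "
    "VFQTSEYCCT TGKCGPTEYS RFFKRLCPDA FSYVLDKPTT VTCPGSSNYR "
    "VTFCPTA"
).replace(" ", "")

def mismatches(ref, obs):
    """Return (position, ref_residue, obs_residue) for each differing column.

    Positions are 1-based. Only valid for ungapped, equal-length sequences.
    """
    if len(ref) != len(obs):
        raise ValueError("ungapped comparison needs equal-length sequences")
    return [(i, a, b)
            for i, (a, b) in enumerate(zip(ref, obs), start=1)
            if a != b]

for pos, ref_aa, pdb_aa in mismatches(UNIPROT_P02883, PDB_2G4Y_A):
    print(f"position {pos}: UniProt {ref_aa} -> PDB {pdb_aa}")
```

For the sequences as listed here, the comparison reports two differences, at positions 46 (N in UniProt, K in the PDB chain) and 113 (N in UniProt, D in the PDB chain). An alignment containing gaps would need a proper alignment routine rather than this simple `zip`.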


 Classification and Annotation

(-) SCOP Domains  (1, 1)

Asymmetric/Biological Unit

(-) CATH Domains  (1, 1)

Asymmetric/Biological Unit
Class: Mainly Beta (13760)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2G4Y)

(-) Gene Ontology  (1, 1)

Asymmetric/Biological Unit
Chain A   (Q8RVT0_THADA | Q8RVT0)

Chain A   (THM1_THADA | P02883)
cellular component
    GO:0031410    cytoplasmic vesicle    A vesicle found in the cytoplasm of a cell.


 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2g4y
    Family and Domain Information: ProDom | SYSTERS
    General Structural Information: GlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in Membranes: OPM
    Protein Surface: SURFACE
    Secondary Structure: DSSP (structure derived) | HSSP (homology derived)
    Structural Genomics: GeneCensus
    Structural Neighbours: CE | VAST
    Structure Classification: CATH | Dali | SCOP
    Validation and Original Data: BMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  Q8RVT0_THADA | Q8RVT0
    Comparative Protein Structure Models: ModBase
    Genomic Information: Ensembl
    Protein-protein Interaction: DIP
    Sequence, Family and Domain Information: InterPro | Pfam | SMART | UniProtKB/TrEMBL
  THM1_THADA | P02883
    Comparative Protein Structure Models: ModBase
    Genomic Information: Ensembl
    Protein-protein Interaction: DIP
    Sequence, Family and Domain Information: InterPro | Pfam | SMART | UniProtKB/SwissProt
 

(-) Analysis Tools

Access by PDB/NDB ID
    Domain Information: XDom
    Interatomic Contacts of Structural Units: CSU
    Ligand-protein Contacts: LPC
    Protein Cavities: castP
    Sequence and Secondary Structure: PDBCartoon
    Structure Alignment: STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence Browser: STING
 
Access by UniProt ID/Accession number
  Q8RVT0_THADA | Q8RVT0
    Protein Disorder Prediction: DisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  THM1_THADA | P02883
    Protein Disorder Prediction: DisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        Q8RVT0_THADA | Q8RVT0: 3al7 3ald 3e3s 3v7v 3v82 3v84 3v87 3v88 3v8a 3vce 3vcg 3vch 3vci 3vcj 3vck 3vhf 3vhg 3vjq 4dc5 4dc6 4diy 4diz 4dj0 4dj1 4xvb
        THM1_THADA | P02883: 1kwn 1lr2 1lr3 1lxz 1ly0 1pp3 1rqw 1thi 1thu 1thv 1thw 2a7i 2blr 2blu 2d8o 2d8p 2oqn 2pe7 2vhk 2vhr 2vi1 2vi2 2vi3 2vi4 2vu6 2vu7 2wbz 3al7 3ald 3dzn 3dzp 3dzr 3e0a 3e3s 3n02 3n03 3qy5 3v7v 3v82 3v84 3v87 3v88 3v8a 3vce 3vcg 3vch 3vci 3vcj 3vck 3vhf 3vhg 3vjq 3x3o 3x3p 3x3q 3x3r 3x3s 3x3t 3zej 4axr 4axu 4bal 4bar 4c3c 4dc5 4dc6 4diy 4diz 4dj0 4dj1 4ek0 4eka 4ekb 4ekh 4eko 4ekt 4el2 4el3 4el7 4ela 4tvt 4xvb 4zg3 4zxr 5a47 5amz 5avg 5fgt 5fgx 5jvx 5k7q 5kvw 5kvx 5kvz 5kw0 5kw3 5kw4 5kw5 5kw7 5kw8 5l4r 5lh0 5lh1 5lh3 5lh5 5lh6 5lh7 5lmh 5ln0 5t3g

(-) Related Entries Specified in the PDB File

2g4h 2g4i 2g4j 2g4k 2g4l 2g4m 2g4n 2g4o 2g4p 2g4q 2g4r 2g4s 2g4t 2g4u 2g4v 2g4w 2g4x 2g4z 2g51 2g52 2g55