Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
(-)Biological Unit 2
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)
Image Biological Unit 2
Biological Unit 2  (Jmol Viewer)

(-) Description

Title :  X-RAY STRUCTURE OF GENE PRODUCT FROM ARABIDOPSIS THALIANA AT3G16990
 
Authors :  G. W. Wesenberg, D. W. Smith, G. N. Phillips Jr. , K. A. Johnson, E. Bitto C. A. Bingman, Center For Eukaryotic Structural Genomics (Cesg
Date :  16 Nov 05  (Deposition) - 13 Dec 05  (Release) - 13 Jul 11  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.10
Chains :  Asym. Unit :  A,B
Biol. Unit 1:  A  (1x)
Biol. Unit 2:  B  (1x)
Keywords :  Tena_thi-4 Domain, Tena/Thi-4/Pqqc Family, At3G16990, Structural Genomics, Protein Structure Initiative, Psi, Cesg, Center Foreukaryotic Structural Genomics, Center For Eukaryotic Structural Genomics, Plant Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  P. G. Blommel, D. W. Smith, C. A. Bingman, D. H. Dyer, I. Rayment, H. M. Holden, B. G. Fox, G. N. Phillips Jr.
Crystal Structure Of Gene Locus At3G16990 From Arabidopsis Thaliana
Proteins V. 57 221 2004
PubMed-ID: 15326608  |  Reference-DOI: 10.1002/PROT.20213

(-) Compounds

Molecule 1 - SEED MATURATION PROTEIN PM36 HOMOLOG
    ChainsA, B
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPVP13
    Expression System StrainROSETTA
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneAT3G16990
    Organism CommonTHALE CRESS
    Organism ScientificARABIDOPSIS THALIANA
    Organism Taxid3702
    SynonymPROTEIN AT3G16990

 Structural Features

(-) Chains, Units

  12
Asymmetric Unit AB
Biological Unit 1 (1x)A 
Biological Unit 2 (1x) B

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (3, 14)

Asymmetric Unit (3, 14)
No.NameCountTypeFull Name
1HMH2Ligand/Ion4-AMINO-5-HYDROXYMETHYL-2-METHYLPYRIMIDINE
2MSE8Mod. Amino AcidSELENOMETHIONINE
3SO44Ligand/IonSULFATE ION
Biological Unit 1 (3, 6)
No.NameCountTypeFull Name
1HMH1Ligand/Ion4-AMINO-5-HYDROXYMETHYL-2-METHYLPYRIMIDINE
2MSE4Mod. Amino AcidSELENOMETHIONINE
3SO41Ligand/IonSULFATE ION
Biological Unit 2 (3, 8)
No.NameCountTypeFull Name
1HMH1Ligand/Ion4-AMINO-5-HYDROXYMETHYL-2-METHYLPYRIMIDINE
2MSE4Mod. Amino AcidSELENOMETHIONINE
3SO43Ligand/IonSULFATE ION

(-) Sites  (6, 6)

Asymmetric Unit (6, 6)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREARG A:23 , HIS A:24BINDING SITE FOR RESIDUE SO4 A 1403
2AC2SOFTWAREARG B:64 , ASP B:68 , ARG B:110 , VAL B:127 , LYS B:128 , TYR B:129 , HOH B:1466 , HOH B:1505BINDING SITE FOR RESIDUE SO4 B 1401
3AC3SOFTWAREARG A:117 , GLU A:120 , HOH A:1597 , PHE B:118 , ASP B:121 , ARG B:205 , HOH B:1506BINDING SITE FOR RESIDUE SO4 B 1402
4AC4SOFTWAREARG A:110 , VAL B:107 , SER B:192 , VAL B:195 , HOH B:1430 , HOH B:1563BINDING SITE FOR RESIDUE SO4 B 1404
5AC5SOFTWAREASP A:47 , PHE A:50 , ILE A:139 , TYR A:143 , GLU A:210 , HOH A:1437BINDING SITE FOR RESIDUE HMH A 1300
6AC6SOFTWAREASP B:47 , PHE B:50 , VAL B:51 , ILE B:139 , GLU B:140 , TYR B:143 , GLU B:210 , PHE B:213 , HOH B:1481BINDING SITE FOR RESIDUE HMH B 1301

(-) SS Bonds  (2, 2)

Asymmetric Unit
No.Residues
1A:150 -A:165
2B:150 -B:165

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 2F2G)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2F2G)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 2F2G)

(-) Exons   (0, 0)

(no "Exon" information available for 2F2G)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:215
 aligned with TENAE_ARATH | Q9ASY9 from UniProtKB/Swiss-Prot  Length:221

    Alignment length:215
                                    14        24        34        44        54        64        74        84        94       104       114       124       134       144       154       164       174       184       194       204       214     
          TENAE_ARATH     5 GVIDTWIDKHRSIYTAATRHAFVVSIRDGSVDLSSFRTWLGQDYLFVRRFVPFVASVLIRACKDSGESSDMEVVLGGIASLNDEIEWFKREGSKWDVDFSTVVPQRANQEYGRFLEDLMSSEVKYPVIMTAFWAIEAVYQESFAHCLEDGNKTPVELTGACHRWGNDGFKQYCSSVKNIAERCLENASGEVLGEAEDVLVRVLELEVAFWEMSRG 219
               SCOP domains d2f2ga_ A: Seed maturation protein-related At3g16990                                                                                                                                                                    SCOP domains
               CATH domains 2f2gA00 A:5-219 Heme Oxygenase; Chain A                                                                                                                                                                                 CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author hhhhhhhhhhhhhhhhhh......eee..eeehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh....hhhhhhhhhhhhhhhhhhhhhhhhhhh..hhhhh..hhhhhhhhhhhhhh.....hhhhhhhhhhhhhhhhhhhh.hhhhhhhh.....hhhhhhhhhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhhhhh... Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2f2g A   5 GVIDTWIDKHRSIYTAATRHAFVVSIRDGSVDLSSFRTWLGQDYLFVRRFVPFVASVLIRACKDSGESSDmEVVLGGIASLNDEIEWFKREGSKWDVDFSTVVPQRANQEYGRFLEDLmSSEVKYPVImTAFWAIEAVYQESFAHCLEDGNKTPVELTGACHRWGNDGFKQYCSSVKNIAERCLENASGEVLGEAEDVLVRVLELEVAFWEmSRG 219
                                    14        24        34        44        54        64        74|       84        94       104       114       124       134       144       154       164       174       184       194       204       214 |   
                                                                                                 75-MSE                                         123-MSE   133-MSE                                                                            216-MSE

Chain B from PDB  Type:PROTEIN  Length:211
 aligned with TENAE_ARATH | Q9ASY9 from UniProtKB/Swiss-Prot  Length:221

    Alignment length:216
                                    13        23        33        43        53        63        73        83        93       103       113       123       133       143       153       163       173       183       193       203       213      
          TENAE_ARATH     4 RGVIDTWIDKHRSIYTAATRHAFVVSIRDGSVDLSSFRTWLGQDYLFVRRFVPFVASVLIRACKDSGESSDMEVVLGGIASLNDEIEWFKREGSKWDVDFSTVVPQRANQEYGRFLEDLMSSEVKYPVIMTAFWAIEAVYQESFAHCLEDGNKTPVELTGACHRWGNDGFKQYCSSVKNIAERCLENASGEVLGEAEDVLVRVLELEVAFWEMSRG 219
               SCOP domains d2f2gb_ B: Seed maturation protein-related At3g16990                                                                                                                                                                     SCOP domains
               CATH domains 2f2gB00 B:4-219 Heme Oxygenase; Chain A                                                                                                                                                                                  CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .hhhhhhhhhhhhhhhhhhhhhhhhhhhh...hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh....hhhhhhhhhhhhhhhhhhhhhhhhhhh..hhhhh..hhhhhhhhhhhhhhhh...hhhhhhhhhhhhhhhhhhhhh..-----.....hhhhhhhhhhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhhhhhhhh Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 2f2g B   4 RGVIDTWIDKHRSIYTAATRHAFVVSIRDGSVDLSSFRTWLGQDYLFVRRFVPFVASVLIRACKDSGESSDmEVVLGGIASLNDEIEWFKREGSKWDVDFSTVVPQRANQEYGRFLEDLmSSEVKYPVImTAFWAIEAVYQESFAHC-----KTPVELTGACHRWGNDGFKQYCSSVKNIAERCLENASGEVLGEAEDVLVRVLELEVAFWEmSRG 219
                                    13        23        33        43        53        63        73 |      83        93       103       113       123       133       143      |  -  |    163       173       183       193       203       213  |   
                                                                                                  75-MSE                                         123-MSE   133-MSE          150   156                                                         216-MSE

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 2)

Asymmetric Unit

(-) CATH Domains  (1, 2)

Asymmetric Unit

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2F2G)

(-) Gene Ontology  (5, 5)

Asymmetric Unit(hide GO term definitions)
Chain A,B   (TENAE_ARATH | Q9ASY9)
molecular function
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0050334    thiaminase activity    Catalysis of the reaction: H(2)O + thiamine = 4-amino-5-hydroxymethyl-2-methylpyrimidine + 5-(2-hydroxyethyl)-4-methylthiazole + H(+).
biological process
    GO:0009228    thiamine biosynthetic process    The chemical reactions and pathways resulting in the formation of thiamine (vitamin B1), a water soluble vitamin present in fresh vegetables and meats, especially liver.
    GO:0009229    thiamine diphosphate biosynthetic process    The chemical reactions and pathways resulting in the formation of thiamine diphosphate, a derivative of thiamine (vitamin B1) which acts as a coenzyme in a range of processes including the Krebs cycle.
cellular component
    GO:0005575    cellular_component    The part of a cell, extracellular environment or virus in which a gene product is located. A gene product may be located in one or more parts of a cell and its location may be as specific as a particular macromolecular complex, that is, a stable, persistent association of macromolecules that function together.

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    HMH  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    MSE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    SO4  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2f2g)
 
Biological Units
  Complete Structure
    Biological Unit 1  [ Jena3D ]
    Biological Unit 2  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2f2g
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  TENAE_ARATH | Q9ASY9
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  TENAE_ARATH | Q9ASY9
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        TENAE_ARATH | Q9ASY92q4x

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2F2G)