Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF A PROTEIN OF UNKNOWN FUNCTION AQ1966 FROM AQUIFEX AEOLICUS VF5
 
Authors :  Y. Qiu, Y. Kim, X. Yang, F. Collart, A. Joachimiak, A. Kossiakoff, Midwe For Structural Genomics (Mcsg)
Date :  19 Aug 05  (Deposition) - 04 Oct 05  (Release) - 13 Jul 11  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.46
Chains :  Asym./Biol. Unit :  A,B,C
Keywords :  Hypothetical Protein, Structural Genomics, Psi, Protein Structure Initiative, Midwest Center For Structural Genomics, Mcsg, Unknown Function (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  Y. Qiu, Y. Kim, X. Yang, F. Collart, A. Joachimiak, A. Kossiakoff
Crystal Structure Of A Hypothetical Protein Aq_1966 From Aquifex Aeolicus Vf5
To Be Published
PubMed: search

(-) Compounds

Molecule 1 - HYPOTHETICAL PROTEIN AQ_1966
    ChainsA, B, C
    EngineeredYES
    Expression SystemESCHERICHIA COLI BL21
    Expression System PlasmidPMCSG7
    Expression System StrainBL21
    Expression System Taxid511693
    Expression System Vector TypePLASMID
    GeneAQ_1966
    Organism ScientificAQUIFEX AEOLICUS
    Organism Taxid63363

 Structural Features

(-) Chains, Units

  123
Asymmetric/Biological Unit ABC

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (4, 9)

Asymmetric/Biological Unit (4, 9)
No.NameCountTypeFull Name
1CA1Ligand/IonCALCIUM ION
2MSE5Mod. Amino AcidSELENOMETHIONINE
3SE2Ligand/IonSELENIUM ATOM
4SO41Ligand/IonSULFATE ION

(-) Sites  (4, 4)

Asymmetric Unit (4, 4)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREPHE A:32 , LYS A:33 , GLU A:34 , PHE A:53 , ARG A:56 , ARG B:74 , PHE B:102 , GLU B:103BINDING SITE FOR RESIDUE SO4 A 202
2AC2SOFTWARELYS A:3 , GLU A:6 , HIS A:93BINDING SITE FOR RESIDUE CA A 203
3AC3SOFTWAREGLU A:182 , TYR C:4 , PHE C:36 , SE C:203BINDING SITE FOR RESIDUE SE C 202
4AC4SOFTWAREGLU A:182 , TYR C:4 , SE C:202BINDING SITE FOR RESIDUE SE C 203

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 2ARH)

(-) Cis Peptide Bonds  (1, 1)

Asymmetric/Biological Unit
No.Residues
1Lys A:57 -Pro A:58

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2ARH)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 2ARH)

(-) Exons   (0, 0)

(no "Exon" information available for 2ARH)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:198
 aligned with O67778_AQUAE | O67778 from UniProtKB/TrEMBL  Length:201

    Alignment length:198
                              1                                                                                                                                                                                                   
                              |      8        18        28        38        48        58        68        78        88        98       108       118       128       138       148       158       168       178       188        
         O67778_AQUAE     - --MVKYEELLKTLENGINSEEGEIRLVRKSQGRFKEEFNFDLSLGSKPLLTLKVFLGRKPYWQPWVEVFGVNPNLRNVFFGSEAERKLYEFLSEHFGRIFVEYFEDKETTYELQKGVPPALSRLGFELLKLGYTYFRDWYIPEGLMEGGHKIQAEKPKTEEAKKRHLENLKKEFEEFIGKCEDEGLIKKVKERYNFLE 196
               SCOP domains --d2arha1 A:1-196 Hypothetical protein Aq_1966                                                                                                                                                         SCOP domains
               CATH domains 2arhA01 A:-1-156  [code=3.40.630.30, no name defined]                                                                                                         2arhA02 A:157-196                        CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author hhhhhhhhhhhhhhhh.......eeeeeeee.....eeeeeeee..eeeeeeeee.........eeeee..hhhhhhhhh.hhhhhhhhhhhhh...eeeee...hhhhhhhhhh..hhhhhhhhhhhhh....eeee............eeeee...hhhhhhhhhhhhhhhhhhhhh...hhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 2arh A  -1 DAmVKYEELLKTLENGINSEEGEIRLVRKSQGRFKEEFNFDLSLGSKPLLTLKVFLGRKPYWQPWVEVFGVNPNLRNVFFGSEAERKLYEFLSEHFGRIFVEYFEDKETTYELQKGVPPALSRLGFELLKLGYTYFRDWFIPEGLmEGGHKIQAEKPKTEEAKKRHLENLKKEFEEFIGKCEDEGLIKKVKERYNFLE 196
                              |      8        18        28        38        48        58        68        78        88        98       108       118       128       138     | 148       158       168       178       188        
                              |                                                                                                                                            144-MSE                                                
                              1-MSE                                                                                                                                                                                               

Chain B from PDB  Type:PROTEIN  Length:197
 aligned with O67778_AQUAE | O67778 from UniProtKB/TrEMBL  Length:201

    Alignment length:197
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       
         O67778_AQUAE     1 MVKYEELLKTLENGINSEEGEIRLVRKSQGRFKEEFNFDLSLGSKPLLTLKVFLGRKPYWQPWVEVFGVNPNLRNVFFGSEAERKLYEFLSEHFGRIFVEYFEDKETTYELQKGVPPALSRLGFELLKLGYTYFRDWYIPEGLMEGGHKIQAEKPKTEEAKKRHLENLKKEFEEFIGKCEDEGLIKKVKERYNFLEH 197
               SCOP domains d2arhb_ B: Hypothetical protein Aq_1966                                                                                                                                                               SCOP domains
               CATH domains -2arhB01 B:2-156  [code=3.40.630.30, no name defined]                                                                                                       2arhB02 B:157-197                         CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author hhhhhhhhhhhhhheee..eeeeeeeeee.....eeeeeeee..eeeeeeeee.........eeeee..hhhhhhhhh.hhhhhhhhhhhhh...eeeee...hhhhhhhhhh..hhhhhhhhhhhhhh...eeeeee.........eeeeee...hhhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2arh B   1 mVKYEELLKTLENGINSEEGEIRLVRKSQGRFKEEFNFDLSLGSKPLLTLKVFLGRKPYWQPWVEVFGVNPNLRNVFFGSEAERKLYEFLSEHFGRIFVEYFEDKETTYELQKGVPPALSRLGFELLKLGYTYFRDWFIPEGLmEGGHKIQAEKPKTEEAKKRHLENLKKEFEEFIGKCEDEGLIKKVKERYNFLEH 197
                            |       10        20        30        40        50        60        70        80        90       100       110       120       130       140   |   150       160       170       180       190       
                            1-MSE                                                                                                                                        144-MSE                                                 

Chain C from PDB  Type:PROTEIN  Length:189
 aligned with O67778_AQUAE | O67778 from UniProtKB/TrEMBL  Length:201

    Alignment length:193
                                    11        21        31        41        51        61        71        81        91       101       111       121       131       141       151       161       171       181       191   
         O67778_AQUAE     2 VKYEELLKTLENGINSEEGEIRLVRKSQGRFKEEFNFDLSLGSKPLLTLKVFLGRKPYWQPWVEVFGVNPNLRNVFFGSEAERKLYEFLSEHFGRIFVEYFEDKETTYELQKGVPPALSRLGFELLKLGYTYFRDWYIPEGLMEGGHKIQAEKPKTEEAKKRHLENLKKEFEEFIGKCEDEGLIKKVKERYNF 194
               SCOP domains d2arhc_ C: Hypothetical protein Aq_1966                                                                                                                                                           SCOP domains
               CATH domains 2arhC01 C:2-156  [code=3.40.630.30, no name defined]                                                                                                       2arhC02 C:157-194                      CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...hhhhhhhh.........eeeeee.......eeeeeeee..eeeeeeeee.........eeeee.....hhhhhh.hhhhhhhhhhhhh...eeeee...hhhhhhhhhh..hhhhhhhhhhhhh....eeeeee.........eeeeee...hhhhhhhhhhhhhhhhhh...----..hhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2arh C   2 VKYEELLKTLENGINSEEGEIRLVRKSQGRFKEEFNFDLSLGSKPLLTLKVFLGRKPYWQPWVEVFGVNPNLRNVFFGSEAERKLYEFLSEHFGRIFVEYFEDKETTYELQKGVPPALSRLGFELLKLGYTYFRDWFIPEGLmEGGHKIQAEKPKTEEAKKRHLENLKKEFEEFIG----EGLIKKVKERYNF 194
                                    11        21        31        41        51        61        71        81        91       101       111       121       131       141  |    151       161       171     |   -|      191   
                                                                                                                                                                        144-MSE                          177  182            

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 3)

Asymmetric/Biological Unit

(-) CATH Domains  (2, 6)

Asymmetric/Biological Unit
(-)
Class: Alpha Beta (26913)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2ARH)

(-) Gene Ontology  (0, 0)

Asymmetric/Biological Unit(hide GO term definitions)
    (no "Gene Ontology" information available for 2ARH)

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    CA  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    MSE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    SE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    SO4  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Lys A:57 - Pro A:58   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2arh
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  O67778_AQUAE | O67778
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/TrEMBL
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  O67778_AQUAE | O67778
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/TrEMBL
        O67778_AQUAE | O677784zsv 4zsx 4zsz

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2ARH)