Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit - manually
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit - manually
Asym./Biol. Unit - manually  (Jmol Viewer)
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF A A MARINE METAGENOME PROTEIN (JCVI_PEP_1096685590403) FROM UNCULTURED MARINE ORGANISM AT 2.53 A RESOLUTION
 
Authors :  Joint Center For Structural Genomics (Jcsg)
Date :  09 Apr 07  (Deposition) - 24 Apr 07  (Release) - 13 Jul 11  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.53
Chains :  Asym./Biol. Unit :  A,B,C,D,E
Keywords :  Metagenomics Target, Structural Genomics, Joint Center For Structural Genomics, Jcsg, Protein Structure Initiative, Psi-2, Unknown Function (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  Joint Center For Structural Genomics (Jcsg)
Crystal Structure Of Uncharacterized Protein (Jcvi_pep_1096685590403) From An Environmental Metagenome (Unidentified Marine Microbe), Sorcerer Ii Global Ocean Sampling Experiment At 2. 53 A Resolution
To Be Published
PubMed: search

(-) Compounds

Molecule 1 - UNCHARACTERIZED PROTEIN
    ChainsA, B, C, D, E
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidSPEEDET
    Expression System StrainHK100
    Expression System Taxid562
    Expression System Vector TypePLASMID
    Organism ScientificUNCULTURED MARINE ORGANISM
    Organism Taxid360281

 Structural Features

(-) Chains, Units

  12345
Asymmetric/Biological Unit ABCDE

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 42)

Asymmetric/Biological Unit (2, 42)
No.NameCountTypeFull Name
1CL6Ligand/IonCHLORIDE ION
2MSE36Mod. Amino AcidSELENOMETHIONINE

(-) Sites  (6, 6)

Asymmetric Unit (6, 6)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREARG D:19 , SER D:46BINDING SITE FOR RESIDUE CL D 207
2AC2SOFTWARETHR A:118 , TYR A:179 , TYR A:188 , ARG A:200BINDING SITE FOR RESIDUE CL A 207
3AC3SOFTWARETHR B:118 , TYR B:179 , ARG B:200BINDING SITE FOR RESIDUE CL B 207
4AC4SOFTWARETHR C:118 , TYR C:179 , ARG C:200BINDING SITE FOR RESIDUE CL C 207
5AC5SOFTWARETHR D:118 , TYR D:179 , ARG D:200BINDING SITE FOR RESIDUE CL D 208
6AC6SOFTWARETHR E:118 , TYR E:179 , ARG E:200BINDING SITE FOR RESIDUE CL E 207

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 2PGC)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 2PGC)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2PGC)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 2PGC)

(-) Exons   (0, 0)

(no "Exon" information available for 2PGC)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:207
                                                                                                                                                                                                                                               
               SCOP domains -d2pgca1 A:1-206 Uncharacterized protein GOS_2596953                                                                                                                                                            SCOP domains
               CATH domains 2pgcA01 A:0-100  [code=3.30.70.900, no name defined]                                                 2pgcA02 A:101-206  [code=3.30.70.900, no name defined]                                                     CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .....eeeeeeeeee...hhhhhhhhhhhhhhhhhhhh....eeeeee........eeeeeee.hhhhhhhhhhhhhhhhhhhhhhhhh.eeeeeeeeeee.............eeeeeeee.hhhhhhhhhhhhhhhhhhhhhh...eeeeee.........eeeeeee.hhhhhhhhhhhhhhhhhhhhhh...eeeeeeeeee. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2pgc A   0 GmSNINYVILTVASVDFSYRETmARLmSSYSKDLIDNAGAKGTRFGSIGTGDHAGSLIFIQFYDDLTGYQKALEIQSKSSVFKEImDSGKANIYLRNISTSLPTKFEQSYEHPKYIVLTRAEAAmSDKDKFLNCINDTASCFKDNGALTLRFGNLLTGSNVGNYLLGVGYPSmEAIEKTYDELLAHSSYKELmTFAKVNmRNIIKIL 206
                             |       9        19  |   | 29        39        49        59        69        79     |  89        99       109       119    |  129       139       149       159       169  |    179       189  |    199       
                             |                   22-MSE                                                         85-MSE                                124-MSE                                         172-MSE             192-MSE  |       
                             1-MSE                   26-MSE                                                                                                                                                                      199-MSE   

Chain B from PDB  Type:PROTEIN  Length:205
                                                                                                                                                                                                                                             
               SCOP domains d2pgcb_ B: Uncharacterized protein GOS_2596953                                                                                                                                                                SCOP domains
               CATH domains 2pgcB01 B:2-100  [code=3.30.70.900, no name defined]                                               2pgcB02 B:101-206  [code=3.30.70.900, no name defined]                                                     CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...eeeeeeeeee...hhhhhhhhhhhhhhhhhhhh....eeeeee........eeeeeee.hhhhhhhhhhhhhhhhhhhhhhhh..eeeeeeeeeee.............eeeeeeee.hhhhhhhhhhhhhhhhhhhhhh...eeeeee.........eeeeeee.hhhhhhhhhhhhh.hhhhhhhh...eeeeeeeeee. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2pgc B   2 SNINYVILTVASVDFSYRETmARLmSSYSKDLIDNAGAKGTRFGSIGTGDHAGSLIFIQFYDDLTGYQKALEIQSKSSVFKEImDSGKANIYLRNISTSLPTKFEQSYEHPKYIVLTRAEAAmSDKDKFLNCINDTASCFKDNGALTLRFGNLLTGSNVGNYLLGVGYPSmEAIEKTYDELLAHSSYKELmTFAKVNmRNIIKIL 206
                                    11        21|   |   31        41        51        61        71        81   |    91       101       111       121  |    131       141       151       161       171|      181       191|      201     
                                               22-MSE                                                         85-MSE                                124-MSE                                         172-MSE             192-MSE  |       
                                                   26-MSE                                                                                                                                                                      199-MSE   

Chain C from PDB  Type:PROTEIN  Length:203
                                                                                                                                                                                                                                           
               SCOP domains d2pgcc_ C: Uncharacterized protein GOS_2596953                                                                                                                                                              SCOP domains
               CATH domains 2pgcC01 C:4-100  [code=3.30.70.900, no name defined]                                             2pgcC02 C:101-206  [code=3.30.70.900, no name defined]                                                     CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeeeeee...hhhhhhhhhhhhhhhhhhhh....eeeeee........eeeeeee.hhhhhhhhhhhhhhhhhhhhhhhhh.eeeeeeeeeee.............eeeeeeee.hhhhhhhhhhhhhhhhhhhhhh...eeeeee.........eeeeeee.hhhhhhhhhhhhhhhhhhhhhh...eeeeeeeeee. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2pgc C   4 INYVILTVASVDFSYRETmARLmSSYSKDLIDNAGAKGTRFGSIGTGDHAGSLIFIQFYDDLTGYQKALEIQSKSSVFKEImDSGKANIYLRNISTSLPTKFEQSYEHPKYIVLTRAEAAmSDKDKFLNCINDTASCFKDNGALTLRFGNLLTGSNVGNYLLGVGYPSmEAIEKTYDELLAHSSYKELmTFAKVNmRNIIKIL 206
                                    13        23  |     33        43        53        63        73        83 |      93       103       113       123|      133       143       153       163       173       183       193     | 203   
                                             22-MSE                                                         85-MSE                                124-MSE                                         172-MSE             192-MSE  |       
                                                 26-MSE                                                                                                                                                                      199-MSE   

Chain D from PDB  Type:PROTEIN  Length:205
                                                                                                                                                                                                                                             
               SCOP domains d2pgcd_ D: Uncharacterized protein GOS_2596953                                                                                                                                                                SCOP domains
               CATH domains 2pgcD01 D:2-100  [code=3.30.70.900, no name defined]                                               2pgcD02 D:101-206  [code=3.30.70.900, no name defined]                                                     CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...eeeeeeeeee...hhhhhhhhhhhhhhhhhhhh....eeeeee........eeeeeee.hhhhhhhhhhhhhhhhhhhhhhhhh.eeeeeeeeeee.............eeeeeeee.hhhhhhhhhhhhhhhhhhhhhh...eeeeee.........eeeeeee.hhhhhhhhhhhhhhhhhhhhhh...eeeeeeeeee. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2pgc D   2 SNINYVILTVASVDFSYRETmARLmSSYSKDLIDNAGAKGTRFGSIGTGDHAGSLIFIQFYDDLTGYQKALEIQSKSSVFKEImDSGKANIYLRNISTSLPTKFEQSYEHPKYIVLTRAEAAmSDKDKFLNCINDTASCFKDNGALTLRFGNLLTGSNVGNYLLGVGYPSmEAIEKTYDELLAHSSYKELmTFAKVNmRNIIKIL 206
                                    11        21|   |   31        41        51        61        71        81   |    91       101       111       121  |    131       141       151       161       171|      181       191|      201     
                                               22-MSE                                                         85-MSE                                124-MSE                                         172-MSE             192-MSE  |       
                                                   26-MSE                                                                                                                                                                      199-MSE   

Chain E from PDB  Type:PROTEIN  Length:205
                                                                                                                                                                                                                                             
               SCOP domains d2pgce_ E: Uncharacterized protein GOS_2596953                                                                                                                                                                SCOP domains
               CATH domains 2pgcE01 E:2-100  [code=3.30.70.900, no name defined]                                               2pgcE02 E:101-206  [code=3.30.70.900, no name defined]                                                     CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...eeeeeeeeee...hhhhhhhhhhhhhhhhhhhh....eeeeee........eeeeeee.hhhhhhhhhhhhhhhhhhhhhhhh..eeeeeeeeeee.............eeeeeeee.hhhhhhhhhhhhhhhhhhhhhh...eeeeee.........eeeeeee.hhhhhhhhhhhhh.hhhhhhhh...eeeeeeeeee. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2pgc E   2 SNINYVILTVASVDFSYRETmARLmSSYSKDLIDNAGAKGTRFGSIGTGDHAGSLIFIQFYDDLTGYQKALEIQSKSSVFKEImDSGKANIYLRNISTSLPTKFEQSYEHPKYIVLTRAEAAmSDKDKFLNCINDTASCFKDNGALTLRFGNLLTGSNVGNYLLGVGYPSmEAIEKTYDELLAHSSYKELmTFAKVNmRNIIKIL 206
                                    11        21|   |   31        41        51        61        71        81   |    91       101       111       121  |    131       141       151       161       171|      181       191|      201     
                                               22-MSE                                                         85-MSE                                124-MSE                                         172-MSE             192-MSE  |       
                                                   26-MSE                                                                                                                                                                      199-MSE   

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 5)

Asymmetric/Biological Unit

(-) CATH Domains  (1, 10)

Asymmetric/Biological Unit
(-)
Class: Alpha Beta (26913)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2PGC)

(-) Gene Ontology  (0, 0)

Asymmetric/Biological Unit(hide GO term definitions)
    (no "Gene Ontology" information available for 2PGC)

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    CL  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    MSE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2pgc)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2pgc
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  (no 'UniProt ID/Accession number' available) |
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  (no 'UniProt ID/Accession number' available) |
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

(no "Entries Sharing at Least One Protein Chain" available for 2PGC)

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2PGC)