(-) Description

Title :  STRUCTURE OF GFCC (YMCB), PROTEIN ENCODED BY THE E. COLI GROUP 4 CAPSULE OPERON
 
Authors :  M. A. Saper, K. Sathiyamoorthy
Date :  05 Oct 10  (Deposition) - 06 Apr 11  (Release) - 29 Jun 11  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.91 Å
Chains :  Asym. Unit :  A,B,C,D
Biol. Unit 1:  A  (1x)
Biol. Unit 2:  B  (1x)
Biol. Unit 3:  C  (1x)
Biol. Unit 4:  D  (1x)
Biol. Unit 5:  A,B  (1x)
Biol. Unit 6:  C,D  (1x)
Keywords :  Beta-Grasp, Unknown Function
 
Reference :  K. Sathiyamoorthy, E. Mills, T. M. Franzmann, I. Rosenshine, M. A. Saper
The Crystal Structure Of Escherichia Coli Group 4 Capsule Protein Gfcc Reveals A Domain Organization Resembling That Of Wza.
Biochemistry V. 50 5465 2011
PubMed-ID: 21449614  |  Reference-DOI: 10.1021/BI101869H

(-) Compounds

Molecule 1 - PREDICTED PROTEIN
    Chains :  A, B, C, D
    Engineered :  YES
    Expression System :  ESCHERICHIA COLI
    Expression System Plasmid :  PETBLUE2
    Expression System Strain :  TUNER(DE3) PLACI
    Expression System Taxid :  469008
    Expression System Vector Type :  PLASMID
    Fragment :  UNP RESIDUES 22-248
    Gene :  E2348C_0970, E2348_C_0970, GFCC
    Organism Taxid :  574521
    Strain :  E2348/69

 Structural Features

(-) Chains, Units

                        1  2  3  4
Asymmetric Unit         A  B  C  D
Biological Unit 1 (1x)  A
Biological Unit 2 (1x)     B
Biological Unit 3 (1x)        C
Biological Unit 4 (1x)           D
Biological Unit 5 (1x)  A  B
Biological Unit 6 (1x)        C  D

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 15)

Asymmetric Unit (2, 15)
No.  Name  Count  Type             Full Name
1    MSE   12     Mod. Amino Acid  SELENOMETHIONINE
2    SO4   3      Ligand/Ion       SULFATE ION

Biological Unit 1 (1, 3)
No.  Name  Count  Type             Full Name
1    MSE   3      Mod. Amino Acid  SELENOMETHIONINE
2    SO4   -1     Ligand/Ion       SULFATE ION

Biological Unit 2 (2, 5)
No.  Name  Count  Type             Full Name
1    MSE   3      Mod. Amino Acid  SELENOMETHIONINE
2    SO4   2      Ligand/Ion       SULFATE ION

Biological Unit 3 (1, 3)
No.  Name  Count  Type             Full Name
1    MSE   3      Mod. Amino Acid  SELENOMETHIONINE
2    SO4   -1     Ligand/Ion       SULFATE ION

Biological Unit 4 (2, 4)
No.  Name  Count  Type             Full Name
1    MSE   3      Mod. Amino Acid  SELENOMETHIONINE
2    SO4   1      Ligand/Ion       SULFATE ION

Biological Unit 5 (2, 8)
No.  Name  Count  Type             Full Name
1    MSE   6      Mod. Amino Acid  SELENOMETHIONINE
2    SO4   2      Ligand/Ion       SULFATE ION

Biological Unit 6 (2, 7)
No.  Name  Count  Type             Full Name
1    MSE   6      Mod. Amino Acid  SELENOMETHIONINE
2    SO4   1      Ligand/Ion       SULFATE ION
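
The pair in parentheses after each unit name appears to be (number of component types present, total number of instances). The following Python sketch recomputes that pair from the per-component counts listed above. It assumes that a count of -1 marks a component that is absent from the unit; this reading is inferred from the numbers and is not stated on the page. The names LIGAND_COUNTS and summarize are illustrative only.

```python
# Counts copied from the ligand tables for entry 3P42.
# Assumption: a count of -1 means "not present in this unit".
LIGAND_COUNTS = {
    "Asymmetric Unit": {"MSE": 12, "SO4": 3},
    "Biological Unit 1": {"MSE": 3, "SO4": -1},
    "Biological Unit 2": {"MSE": 3, "SO4": 2},
    "Biological Unit 3": {"MSE": 3, "SO4": -1},
    "Biological Unit 4": {"MSE": 3, "SO4": 1},
    "Biological Unit 5": {"MSE": 6, "SO4": 2},
    "Biological Unit 6": {"MSE": 6, "SO4": 1},
}


def summarize(counts):
    """Return (component types present, total instances) for one unit."""
    present = [n for n in counts.values() if n > 0]
    return len(present), sum(present)


for unit, counts in LIGAND_COUNTS.items():
    types, total = summarize(counts)
    print(f"{unit} ({types}, {total})")
```

Under that assumption, the recomputed pairs agree with the seven headers above, for example (2, 15) for the asymmetric unit and (1, 3) for Biological Unit 1.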

(-) Sites  (3, 3)

Asymmetric Unit (3, 3)
No.  Name  Evidence  Residues                                                          Description
1    AC1   SOFTWARE  GLY B:137, ASP B:138, HOH B:260, HOH B:271                        BINDING SITE FOR RESIDUE SO4 B 1
2    AC2   SOFTWARE  GLY A:182, LYS A:185, HOH A:315, LYS B:185, HOH B:290, HOH B:675  BINDING SITE FOR RESIDUE SO4 B 3
3    AC3   SOFTWARE  ALA C:181, GLY C:182, HOH C:625, GLY D:182, HOH D:614             BINDING SITE FOR RESIDUE SO4 D 2
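
Each site residue is written as a three-character residue name, a chain identifier, and a residue number (for example GLY B:137). The following Python sketch splits such lists into tuples and reports which chains contribute to each site. The regular expression and the names SITES, parse_site, and chains_in_site are assumptions made for illustration; they are not part of any PDB tooling.

```python
import re

# Residue lists copied from the Sites table for entry 3P42.
SITES = {
    "AC1": "GLY B:137 , ASP B:138 , HOH B:260 , HOH B:271",
    "AC2": "GLY A:182 , LYS A:185 , HOH A:315 , LYS B:185 , HOH B:290 , HOH B:675",
    "AC3": "ALA C:181 , GLY C:182 , HOH C:625 , GLY D:182 , HOH D:614",
}

# Assumed token layout: 3-character residue name, chain ID, ':', residue number.
TOKEN = re.compile(r"([A-Z0-9]{3})\s+([A-Za-z0-9]):(-?\d+)")


def parse_site(text):
    """Return a list of (residue name, chain, residue number) tuples."""
    return [(name, chain, int(num)) for name, chain, num in TOKEN.findall(text)]


def chains_in_site(text):
    """Return the sorted chain identifiers that contribute residues to a site."""
    return sorted({chain for _, chain, _ in parse_site(text)})


for site, residues in SITES.items():
    print(site, chains_in_site(residues))
```

Parsed this way, AC1 involves only chain B, whereas AC2 and AC3 each draw residues from two chains (A and B, and C and D respectively), matching the chain pairs of Biological Units 5 and 6.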

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3P42)

(-) Cis Peptide Bonds  (4, 4)

Asymmetric Unit
No.  Residues
1    Gly A:40 - Pro A:41
2    Gly B:40 - Pro B:41
3    Gly C:40 - Pro C:41
4    Gly D:40 - Pro D:41

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 3P42)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 3P42)

(-) Exons   (0, 0)

(no "Exon" information available for 3P42)

(-) Sequences/Alignments

Asymmetric Unit
Chain A from PDB  Type:PROTEIN  Length:234
 aligned with B7UN63_ECO27 | B7UN63 from UniProtKB/TrEMBL  Length:248

    Alignment length:234
                                                                                                                                                                                                                                                           248        
                                    32        42        52        62        72        82        92       102       112       122       132       142       152       162       172       182       192       202       212       222       232       242     |   -    
         B7UN63_ECO27    23 QGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPE--------   -
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .eeeeee......eeeeeee.hhhhhhh........hhhhheeehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh.................ee.hhhh....eeeeee......eeeeee.....eeee.....hhhhhhh...........eeeee.....eeeee........ee.....eeee.......hhhhhhhhhhhhhhhhhhhhhhh...... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 3p42 A  23 QGmVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVmAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVmVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPELEHHHHHH 256
                              |     32        42        52        62        72        82        92       102       112       122       132       142       152       162       172       182      |192       202       212       222       232       242       252    
                              |                                                       82-MSE                                                                                                    189-MSE                                                               
                             25-MSE                                                                                                                                                                                                                                   

Chain B from PDB  Type:PROTEIN  Length:225
 aligned with B7UN63_ECO27 | B7UN63 from UniProtKB/TrEMBL  Length:248

    Alignment length:225
                                    31        41        51        61        71        81        91       101       111       121       131       141       151       161       171       181       191       201       211       221       231       241     
         B7UN63_ECO27    22 AQGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRV 246
               SCOP domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeee.......eeeeee.hhhhhhh........hhhhheeehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh.................ee........eeeeeee......eeeeee.....eeee.....hhhhhhh...........eeeee.....eeeee........ee.....eeee.......hhhhhhhhhhhhhhhh... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 3p42 B  22 AQGmVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVmAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVmVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRV 246
                               |    31        41        51        61        71        81|       91       101       111       121       131       141       151       161       171       181       191       201       211       221       231       241     
                              25-MSE                                                   82-MSE                                                                                                    189-MSE                                                     

Chain C from PDB  Type:PROTEIN  Length:226
 aligned with B7UN63_ECO27 | B7UN63 from UniProtKB/TrEMBL  Length:248

    Alignment length:226
                                    32        42        52        62        72        82        92       102       112       122       132       142       152       162       172       182       192       202       212       222       232       242      
         B7UN63_ECO27    23 QGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPE 248
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeee......eeeeeee.hhhhhhh........hhhhheeehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh.................eee.........eeeee......eeeeee.....eeee.....hhhhhh............eeeee.....eeeee........ee.....eeee.......hhhhhhhhhhhhhhhh..... Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 3p42 C  23 QGmVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVmAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVmVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPE 248
                              |     32        42        52        62        72        82        92       102       112       122       132       142       152       162       172       182      |192       202       212       222       232       242      
                             25-MSE                                                   82-MSE                                                                                                    189-MSE                                                       

Chain D from PDB  Type:PROTEIN  Length:229
 aligned with B7UN63_ECO27 | B7UN63 from UniProtKB/TrEMBL  Length:248

    Alignment length:229
                                                                                                                                                                                                                                                           248   
                                    32        42        52        62        72        82        92       102       112       122       132       142       152       162       172       182       192       202       212       222       232       242     |   
         B7UN63_ECO27    23 QGMVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPE---   -
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
           Pfam domains (1) Caps_synth_GfcC-3p42D01 D:23-248                                                                                                                                                                                                  --- Pfam domains (1)
           Pfam domains (2) Caps_synth_GfcC-3p42D02 D:23-248                                                                                                                                                                                                  --- Pfam domains (2)
           Pfam domains (3) Caps_synth_GfcC-3p42D03 D:23-248                                                                                                                                                                                                  --- Pfam domains (3)
           Pfam domains (4) Caps_synth_GfcC-3p42D04 D:23-248                                                                                                                                                                                                  --- Pfam domains (4)
         Sec.struct. author .eeeeee.......eeeeee.hhhhhhhh.......hhhhheeehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh..................ee..........eeeee......eeeeee.....eeee.....hhhhhhh...........eeeee.....eeeee.......eee.....eeee.......hhhhhhhhhhhhhhhhh....... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 3p42 D  23 QGmVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVmAQLASWEAEADDDVAATIKSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTITLLGAVSGAGQLPWLAGRSVTDYLQDHPRLAGADKNNVmVITPEGETVVAPVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPELEH 251
                              |     32        42        52        62        72        82        92       102       112       122       132       142       152       162       172       182      |192       202       212       222       232       242         
                             25-MSE                                                   82-MSE                                                                                                    189-MSE                                                          

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'
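
As the legend notes, modified residues appear as lower-case letters in the PDB sequence rows, with their number and name given on the extra numbering lines. The following Python sketch recovers those residue numbers from a sequence string and the number of its first residue. The function name modified_residues is hypothetical, and the sketch assumes contiguous numbering with no gaps or insertion codes, which holds for the fragment used here (the first 66 residues of chain A, starting at residue 23).

```python
def modified_residues(sequence, first_number):
    """Return (residue number, letter) for every lower-case letter.

    Assumes residues are numbered consecutively from first_number.
    """
    return [
        (first_number + offset, letter)
        for offset, letter in enumerate(sequence)
        if letter.islower()
    ]


# Start of the "3p42 A" row in the alignment above (residues 23 onward).
CHAIN_A_PREFIX = (
    "QGmVTIYLPGEQQTLSVGPVENVAQLVTQPQLRDRLWWPGALLTDSAAKAKALKDYQHVmAQLASW"
)

print(modified_residues(CHAIN_A_PREFIX, 23))
# → [(25, 'm'), (82, 'm')]
```

The two positions found in this fragment correspond to the 25-MSE and 82-MSE labels under the chain A row.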

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 3P42)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3P42)

(-) Pfam Domains  (1, 4)

Asymmetric Unit
Clan: Ubiquitin (279)

(-) Gene Ontology  (0, 0)

Asymmetric Unit
    (no "Gene Ontology" information available for 3P42)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

(no "Entries Sharing at Least One Protein Chain" available for 3P42)

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 3P42)