Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF THE MINIMAL PHO-SFMBT COMPLEX (P6122 SPACEGROUP)
 
Authors :  C. Alfieri, S. Glatt, C. W. Mueller
Date :  11 Sep 13  (Deposition) - 13 Nov 13  (Release) - 13 Nov 13  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.10
Chains :  Asym./Biol. Unit :  A,B
Keywords :  Transcription (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  C. Alfieri, M. C. Gambetta, R. Matos, S. Glatt, P. Sehr, S. Fraterman, M. Wilm, J. Mueller, C. W. Mueller
Structural Basis For Targeting The Chromatin Repressor Sfmbt To Polycomb Response Elements
Genes Dev. V. 27 2367 2013
PubMed-ID: 24186981  |  Reference-DOI: 10.1101/GAD.226621.113

(-) Compounds

Molecule 1 - POLYCOMB PROTEIN SFMBT
    ChainsA
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System StrainBL21(DE3)
    Expression System Taxid469008
    Expression System VariantGOLD
    Fragment4MBT, RESIDUES 531-980
    Organism CommonFRUIT FLY
    Organism ScientificDROSOPHILA MELANOGASTER
    Organism Taxid7227
    SynonymSCM-LIKE WITH FOUR MBT DOMAIN-CONTAINING PROTEIN 1, DSFMBT
 
Molecule 2 - POLYCOMB PROTEIN PHO
    ChainsB
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System StrainBL21(DE3)
    Expression System Taxid469008
    Expression System VariantGOLD
    FragmentSPACER, RESIDUES 145-172
    Organism CommonFRUIT FLY
    Organism ScientificDROSOPHILA MELANOGASTER
    Organism Taxid7227
    SynonymPROTEIN PLEIOHOMEOTIC, TRANSCRIPTION FACTOR YY1 HOMOLOG

 Structural Features

(-) Chains, Units

  12
Asymmetric/Biological Unit AB

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (0, 0)

(no "Ligand,Modified Residues,Ions" information available for 4C5G)

(-) Sites  (0, 0)

(no "Site" information available for 4C5G)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 4C5G)

(-) Cis Peptide Bonds  (1, 1)

Asymmetric/Biological Unit
No.Residues
1Gly B:142 -Met B:143

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 4C5G)

(-) PROSITE Motifs  (1, 4)

Asymmetric/Biological Unit (1, 4)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1MBTPS51079 MBT repeat profile.SMBT_DROME536-647
655-753
761-871
879-975
  4A:536-647
A:655-753
A:761-871
A:879-975

(-) Exons   (0, 0)

(no "Exon" information available for 4C5G)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:441
 aligned with SMBT_DROME | Q9VK33 from UniProtKB/Swiss-Prot  Length:1220

    Alignment length:445
                                   542       552       562       572       582       592       602       612       622       632       642       652       662       672       682       692       702       712       722       732       742       752       762       772       782       792       802       812       822       832       842       852       862       872       882       892       902       912       922       932       942       952       962       972     
           SMBT_DROME   533 THSYDWLPRLSKENFNAAPVTCFPHAPGCEVWDNLGVGMKVEVENTDCDSIEVIQPGQTPTSFWVATILEIKGYKALMSYEGFDTDSHDFWVNLCNAEVHSVGWCATRGKPLIPPRTIEHKYKDWKDFLVGRLSGARTLPSNFYNKINDSLQSRFRLGLNLECVDKDRISQVRLATVTKIVGKRLFLRYFDSDDGFWCHEDSPIIHPVGWATTVGHNLAAPQDYLERMLAGREAMIEVHEDDATIELFKMNFTFDEYYSDGKTNSFVEGMKLEAVDPLNLSSICPATVMAVLKFGYMMIRIDSYQPDASGSDWFCYHEKSPCIFPAGFCSVNNISVTPPNGYDSRTFTWEGYLRDTGAVAAGQHLFHRIIPDHGFEVGMSLECADLMDPRLVCVATVARVVGRLLKVHFDGWTDEYDQWLDCESADIYPVGWCVLVNHKLEGPPR 977
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .....hhhhhh.......hhhhh.......hhhhh....eeeee..................eeeeeeeeee..eeeeee.........eeee.....ee..hhhhhh.........hhhhh..hhhhhhhhhh.......hhhhhhhh..........eeeeee..eeeeeeeeeeeeee..eeeeee.....eeeee.....ee..hhhhhhh.eee.hhhhhhhhhh.----........hhhhh......................eeeeee..eeeeeeeeeeeee....eeeeee............eeee.....ee..hhhhhh...............hhhhhhhhhh....hhhhh.............eeeee.......eeeeeeeeee..eeeeee...hhhhheeee.....ee..hhhhhhh..ee.... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---MBT  PDB: A:536-647 UniProt: 536-647                                                                            -------MBT  PDB: A:655-753 UniProt: 655-753                                                               -------MBT  PDB: A:761-871 UniProt: 761-871                                                                           -------MBT  PDB: A:879-975 UniProt: 879-975                                                             -- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4c5g A 533 THSYDWLPRLSKENFNAAPVTCFPHAPGCEVWDNLGVGMKVEVENTDCDSIEVIQPGQTPTSFWVATILEIKGYKALMSYEGFDTDSHDFWVNLCNAEVHSVGWCATRGKPLIPPRTIEHKYKDWKDFLVGRLSGARTLPSNFYNKINDSLQSRFRLGLNLECVDKDRISQVRLATVTKIVGKRLFLRYFDSDDGFWCHEDSPIIHPVGWATTVGHNLAAPQDYLERMLAG----IEVHEDDATIELFKMNFTFDEYYSDGKTNSFVEGMKLEAVDPLNLSSICPATVMAVLKFGYMMIRIDSYQPDASGSDWFCYHEKSPCIFPAGFCSVNNISVTPPNGYDSRTFTWEGYLRDTGAVAAGQHLFHRIIPDHGFEVGMSLECADLMDPRLVCVATVARVVGRLLKVHFDGWTDEYDQWLDCESADIYPVGWCVLVNHKLEGPPR 977
                                   542       552       562       572       582       592       602       612       622       632       642       652       662       672       682       692       702       712       722       732       742       752       762|    | 772       782       792       802       812       822       832       842       852       862       872       882       892       902       912       922       932       942       952       962       972     
                                                                                                                                                                                                                                                                763  768                                                                                                                                                                                                                 

Chain B from PDB  Type:PROTEIN  Length:30
 aligned with PHO_DROME | Q8ST83 from UniProtKB/Swiss-Prot  Length:520

    Alignment length:49
                                   131       141       151       161         
            PHO_DROME   122 SGVVDKNSPFLTLGTTILNSNGKSRRWEQKLVHIKTMEGEFSVTMWASG 170
               SCOP domains ------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------- Pfam domains
         Sec.struct. author ....-------------------...eeeeeeeeee..eeeeeeeeee. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------- Transcript
                 4c5g B 141 AGMA-------------------SRRWEQKLVHIKTMEGEFSVTMWASG 170
                               |     -         -   |   151       161         
                             144                 145                         

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 4C5G)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 4C5G)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 4C5G)

(-) Gene Ontology  (34, 44)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A   (SMBT_DROME | Q9VK33)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0003682    chromatin binding    Interacting selectively and non-covalently with chromatin, the network of fibers of DNA, protein, and sometimes RNA, that make up the chromosomes of the eukaryotic nucleus during interphase.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0035064    methylated histone binding    Interacting selectively and non-covalently with a histone in which a residue has been modified by methylation. Histones are any of a group of water-soluble proteins found in association with the DNA of eukaroytic chromosomes.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0006342    chromatin silencing    Repression of transcription by altering the structure of chromatin, e.g. by conversion of large regions of DNA into an inaccessible state often called heterochromatin.
    GO:0007446    imaginal disc growth    The increase in mass of imaginal discs by cell proliferation prior to metamorphosis. Imaginal discs are epithelial infoldings in the larvae of holometabolous insects that develop into adult structures (legs, antennae, wings, etc.) during metamorphosis from larval to adult form.
    GO:0010629    negative regulation of gene expression    Any process that decreases the frequency, rate or extent of gene expression. Gene expression is the process in which a gene's coding sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA or circRNA (for protein-coding genes) and the translation of that mRNA or circRNA into protein. Protein maturation is included when required to form an active form of a product from an inactive precursor form.
    GO:0048477    oogenesis    The complete process of formation and maturation of an ovum or female gamete from a primordial female germ cell. Examples of this process are found in Mus musculus and Drosophila melanogaster.
    GO:0006355    regulation of transcription, DNA-templated    Any process that modulates the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0006351    transcription, DNA-templated    The cellular synthesis of RNA on a template of DNA.
cellular component
    GO:0031519    PcG protein complex    A chromatin-associated multiprotein complex containing Polycomb Group proteins. In Drosophila, Polycomb group proteins are involved in the long-term maintenance of gene repression, and PcG protein complexes associate with Polycomb group response elements (PREs) in target genes to regulate higher-order chromatin structure.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

Chain B   (PHO_DROME | Q8ST83)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0000980    RNA polymerase II distal enhancer sequence-specific DNA binding    Interacting selectively and non-covalently with a RNA polymerase II (Pol II) distal enhancer. In mammalian cells, enhancers are distal sequences that increase the utilization of some promoters, and can function in either orientation and in any location (upstream or downstream) relative to the core promoter.
    GO:0031490    chromatin DNA binding    Interacting selectively and non-covalently with DNA that is assembled into chromatin.
    GO:0003682    chromatin binding    Interacting selectively and non-covalently with chromatin, the network of fibers of DNA, protein, and sometimes RNA, that make up the chromosomes of the eukaryotic nucleus during interphase.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0043565    sequence-specific DNA binding    Interacting selectively and non-covalently with DNA of a specific nucleotide composition, e.g. GC-rich DNA binding, or with a specific sequence motif or type of DNA e.g. promotor binding or rDNA binding.
    GO:0003700    transcription factor activity, sequence-specific DNA binding    Interacting selectively and non-covalently with a specific DNA sequence in order to modulate transcription. The transcription factor may or may not also interact selectively with a protein or macromolecular complex.
biological process
    GO:0006310    DNA recombination    Any process in which a new genotype is formed by reassortment of genes resulting in gene combinations different from those that were present in the parents. In eukaryotes genetic recombination can occur by chromosome assortment, intrachromosomal recombination, or nonreciprocal interchromosomal recombination. Intrachromosomal recombination occurs by crossing over. In bacteria it may occur by genetic transformation, conjugation, transduction, or F-duction.
    GO:0006281    DNA repair    The process of restoring DNA after damage. Genomes are subject to damage by chemical and physical agents in the environment (e.g. UV and ionizing radiations, chemical mutagens, fungal and bacterial toxins, etc.) and by free radicals or alkylating agents endogenously generated in metabolism. DNA is also damaged because of errors during its replication. A variety of different DNA repair pathways have been reported that include direct reversal, base excision repair, nucleotide excision repair, photoreactivation, bypass, double-strand break repair pathway, and mismatch repair pathway.
    GO:0006265    DNA topological change    The process in which a transformation is induced in the topological structure of a double-stranded DNA helix, resulting in a change in linking number.
    GO:0006974    cellular response to DNA damage stimulus    Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a stimulus indicating damage to its DNA from environmental insults or errors during metabolism.
    GO:0006338    chromatin remodeling    Dynamic structural changes to eukaryotic chromatin occurring throughout the cell division cycle. These changes range from the local changes necessary for transcriptional regulation to global changes necessary for chromosome segregation.
    GO:0006342    chromatin silencing    Repression of transcription by altering the structure of chromatin, e.g. by conversion of large regions of DNA into an inaccessible state often called heterochromatin.
    GO:0048096    chromatin-mediated maintenance of transcription    Maintenance of transcription by remodelling of chromatin into an 'open configuration'. Once established, this regulation is mitotically stable and is maintained over many cell divisions. It is also heritable.
    GO:0048813    dendrite morphogenesis    The process in which the anatomical structures of a dendrite are generated and organized. A dendrite is a freely branching protoplasmic process of a nerve cell.
    GO:0016458    gene silencing    Any process carried out at the cellular level that results in either long-term transcriptional repression via action on chromatin structure or RNA mediated, post-transcriptional repression of gene expression.
    GO:0007275    multicellular organism development    The biological process whose specific outcome is the progression of a multicellular organism over time from an initial condition (e.g. a zygote or a young adult) to a later condition (e.g. a multicellular animal or an aged adult).
    GO:0010629    negative regulation of gene expression    Any process that decreases the frequency, rate or extent of gene expression. Gene expression is the process in which a gene's coding sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA or circRNA (for protein-coding genes) and the translation of that mRNA or circRNA into protein. Protein maturation is included when required to form an active form of a product from an inactive precursor form.
    GO:0045892    negative regulation of transcription, DNA-templated    Any process that stops, prevents, or reduces the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0006355    regulation of transcription, DNA-templated    Any process that modulates the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0006351    transcription, DNA-templated    The cellular synthesis of RNA on a template of DNA.
cellular component
    GO:0031011    Ino80 complex    A multisubunit protein complex that contains the Ino80p ATPase; exhibits chromatin remodeling activity and 3' to 5' DNA helicase activity.
    GO:0031519    PcG protein complex    A chromatin-associated multiprotein complex containing Polycomb Group proteins. In Drosophila, Polycomb group proteins are involved in the long-term maintenance of gene repression, and PcG protein complexes associate with Polycomb group response elements (PREs) in target genes to regulate higher-order chromatin structure.
    GO:0000790    nuclear chromatin    The ordered and organized complex of DNA, protein, and sometimes RNA, that forms the chromosome in the nucleus.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0005700    polytene chromosome    A type of chromosome in a polyploid cell, formed when multiple copies of homologous chromosomes are aligned side by side to give a giant chromosome in which distinct chromosome bands are readily visible.
    GO:0005704    polytene chromosome band    A stretch of densely packed chromatin along the polytene chromosome, visible as a morphologically distinct band.
    GO:0005703    polytene chromosome puff    A swelling at a site along the length of a polytene chromosome, thought to be the site of active transcription.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
(no "Ligands, Modified Residues, Ions" information available for 4c5g)
 
  Sites
(no "Sites" information available for 4c5g)
 
  Cis Peptide Bonds
    Gly B:142 - Met B:143   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  4c5g
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  PHO_DROME | Q8ST83
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  SMBT_DROME | Q9VK33
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  PHO_DROME | Q8ST83
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  SMBT_DROME | Q9VK33
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        PHO_DROME | Q8ST834c5e 4c5h
        SMBT_DROME | Q9VK333h6z 4c5e 4c5h 5j8y

(-) Related Entries Specified in the PDB File

4c5e CRYSTAL STRUCTURE OF THE MINIMAL PHO-SFMBT COMPLEX (P21 SPACEGROUP)
4c5h CRYSTAL STRUCTURE OF THE MINIMAL PHO-SFMBT COMPLEX (P3121 SPACEGROUP)
4c5i CRYSTAL STRUCTURE OF MBTD1 YY1 COMPLEX