Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  THE CRYSTAL STRUCTURE OF MLL1 (N3861I/Q3867L) IN COMPLEX WITH RBBP5 AND ASH2L
 
Authors :  Y. Li, M. Lei, Y. Chen
Date :  06 Dec 15  (Deposition) - 24 Feb 16  (Release) - 20 Apr 16  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.90
Chains :  Asym./Biol. Unit :  A,B,J
Keywords :  Histone Methyltransferase, Histone Methylation, Set Domain, Protein Complex, Protein Binding-Transferase Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  Y. Li, J. Han, Y. Zhang, F. Cao, Z. Liu, S. Li, J. Wu, C. Hu, Y. Wang, J. Shuai, J. Chen, L. Cao, D. Li, P. Shi, C. Tian, J. Zhang, Y. Dou, G. Li, Y. Chen, M. Lei
Structural Basis For Activity Regulation Of Mll Family Methyltransferases.
Nature V. 530 447 2016
PubMed-ID: 26886794  |  Reference-DOI: 10.1038/NATURE16952

(-) Compounds

Molecule 1 - RETINOBLASTOMA-BINDING PROTEIN 5
    ChainsJ
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET28B-SUMO
    Expression System StrainROSETTA
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentUNP RESIDUES 330-356
    GeneRBBP5, RBQ3
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymRBBP-5,RETINOBLASTOMA-BINDING PROTEIN RBQ-3
 
Molecule 2 - SET1/ASH2 HISTONE METHYLTRANSFERASE COMPLEX SUBUNIT ASH2
    ChainsB
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET28B-SUMO
    Expression System StrainROSETTA
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentUNP RESIDUES 380-496, 539-598
    GeneASH2L, ASH2L1
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymASH2-LIKE PROTEIN
 
Molecule 3 - HISTONE-LYSINE N-METHYLTRANSFERASE 2A
    ChainsA
    EC Number2.1.1.43
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET28B-SUMO
    Expression System StrainROSETTA
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentUNP RESIDUES 3813-3969
    GeneKMT2A, ALL1, CXXC7, HRX, HTRX, MLL, MLL1, TRX1
    MutationYES
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymLYSINE N-METHYLTRANSFERASE 2A,ALL-1,CXXC-TYPE ZINC FINGER PROTEIN 7,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA,MYELOID/LYMPHOID OR MIXED-LINEAGE LEUKEMIA PROTEIN 1,TRITHORAX-LIKE PROTEIN,ZINC FINGER PROTEIN HRX

 Structural Features

(-) Chains, Units

  123
Asymmetric/Biological Unit ABJ

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 2)

Asymmetric/Biological Unit (2, 2)
No.NameCountTypeFull Name
1SAH1Ligand/IonS-ADENOSYL-L-HOMOCYSTEINE
2ZN1Ligand/IonZINC ION

(-) Sites  (2, 2)

Asymmetric Unit (2, 2)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREILE A:3838 , HIS A:3839 , GLY A:3840 , ARG A:3841 , ARG A:3903 , PHE A:3904 , ASN A:3906 , HIS A:3907 , TYR A:3944 , PRO A:3956 , CYS A:3957 , ASN A:3958 , LEU A:3968 , HOH A:4108 , HOH A:4156binding site for residue SAH A 4001
2AC2SOFTWARECYS A:3909 , CYS A:3957 , CYS A:3959 , CYS A:3964binding site for residue ZN A 4002

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 5F6L)

(-) Cis Peptide Bonds  (1, 1)

Asymmetric/Biological Unit
No.Residues
1Gly B:485 -Pro B:486

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 5F6L)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 5F6L)

(-) Exons   (0, 0)

(no "Exon" information available for 5F6L)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:153
                                                                                                                                                                                          
               SCOP domains --------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .hhhhhhhhhhhhhhhheeeee.....eeeee........eeee...eeee..hhhhhhhhhhhh.....eee....eeee...eehhhhhhee.....eeeeeeee..eeeeeeee..........ee........................ Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                5f6l A 3814 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGIVIRSILTDKREKYYDSKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDKLPCNCGAKKCRKFLN 3969
                                  3823      3833      3843      3853      3863      3873      3883      3893      3903      3913      3923      3933      3943      3956      3966   
                                                                                                                                                                 3950|               
                                                                                                                                                                  3954               

Chain B from PDB  Type:PROTEIN  Length:178
                                                                                                                                                                                                                   
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....eeeeee....ee.....eee.....eeeee......eeeeeeeeeee.....eeeeeee...............eeeee.....eee..eee..........eeeeeeee.....eeeeee..eeeeeeee.......eeeeeee...eeeee..............ee.hhhh Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                5f6l B  285 SRVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGAWYFEITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGDVLGFYINLPEGSSEIIFYKNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDLTYRPMSDMG  504
                                   294       304       314       324       334       344       354       364       374       384       394     ||446       456       466       476       486       496        
                                                                                                                                             400|                                                             
                                                                                                                                              443                                                             

Chain J from PDB  Type:PROTEIN  Length:19
                                                    
               SCOP domains ------------------- SCOP domains
               CATH domains ------------------- CATH domains
               Pfam domains ------------------- Pfam domains
         Sec.struct. author .ee....ee.......... Sec.struct. author
                 SAPs(SNPs) ------------------- SAPs(SNPs)
                    PROSITE ------------------- PROSITE
                 Transcript ------------------- Transcript
                5f6l J  336 FKELDENVEYEERESEFDI  354
                                   345         

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 5F6L)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 5F6L)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 5F6L)

(-) Gene Ontology  (72, 103)

Asymmetric/Biological Unit(hide GO term definitions)

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    SAH  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    ZN  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Gly B:485 - Pro B:486   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  5f6l
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  ASH2L_HUMAN | Q9UBL3
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  KMT2A_HUMAN | Q03164
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  RBBP5_HUMAN | Q15291
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  2.1.1.43
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  ASH2L_HUMAN | Q9UBL3
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  KMT2A_HUMAN | Q03164
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  RBBP5_HUMAN | Q15291
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        ASH2L_HUMAN | Q9UBL33rsn 3s32 3toj 4riq 4x8n 4x8p 5f6k
        KMT2A_HUMAN | Q031642agh 2j2s 2jyi 2kkf 2ku7 2kyu 2lxs 2lxt 2msr 2mtn 2w5y 2w5z 3eg6 3emh 3lqh 3lqi 3lqj 3p4f 3u85 3u88 4esg 4gq6 4nw3 5f5e
        RBBP5_HUMAN | Q152913p4f 4x8n 4x8p 5f6k

(-) Related Entries Specified in the PDB File

5f59
5f5e THE SAME MLL1 PROTEIN IN APO FORM
5f6k