Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Biological Unit 1
(-)Biological Unit 2
(-)Biological Unit 3
(-)Biological Unit 4
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biological Unit 2
Biological Unit 2  (Jmol Viewer)
Image Biological Unit 3
Biological Unit 3  (Jmol Viewer)
Image Biological Unit 4
Biological Unit 4  (Jmol Viewer)

(-) Description

Title :  STRUCTURE OF HOXB13 COMPLEX WITH METHYLATED DNA
 
Authors :  E. Morgunova, Y. Yin, A. Jolma, A. Popov, J. Taipale
Date :  23 Oct 15  (Deposition) - 08 Feb 17  (Release) - 17 May 17  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  3.00
Chains :  Asym. Unit :  A,B,C,D,E,F,G,H,I,J,K,L
Biol. Unit 1:  A,C,F  (1x)
Biol. Unit 2:  B,D,E  (1x)
Biol. Unit 3:  G,H,I  (1x)
Biol. Unit 4:  J,K,L  (1x)
Keywords :  Transcription Factor, Methylated Dna, Complex, Transcription (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  Y. Yin, E. Morgunova, A. Jolma, E. Kaasinen, B. Sahu, S. Khund-Sayeed, P. K. Das, T. Kivioja, K. Dave, F. Zhong, K. R. Nitta, M. Taipale, A. Popov, P. A. Ginno, S. Domcke, J. Yan, D. Schubeler, C. Vinson, J. Taipale
Impact Of Cytosine Methylation On Dna Binding Specificities Of Human Transcription Factors.
Science V. 356 2017
PubMed-ID: 28473536  |  Reference-DOI: 10.1126/SCIENCE.AAJ2239

(-) Compounds

Molecule 1 - HOMEOBOX PROTEIN HOX-B13
    ChainsA, B, G, J
    EngineeredYES
    Expression SystemESCHERICHIA COLI BL21(DE3)
    Expression System PlasmidPETG20A
    Expression System Taxid469008
    Expression System VariantROSETTA
    FragmentUNP RESIDUES 217-278
    GeneHOXB13
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
 
Molecule 2 - DNA (5'-D(P*TP*TP*GP*TP*GP*TP*TP*TP*TP*AP*(5CM) P*GP*AP*GP*GP*TP*CP*C)-3')
    ChainsC, D, H, K
    EngineeredYES
    Organism ScientificSYNTHETIC CONSTRUCT
    Organism Taxid32630
    SyntheticYES
 
Molecule 3 - DNA (5'-D(P*GP*GP*AP*CP*CP*TP*(5CM) P*GP*TP*AP*AP*AP*AP*CP*AP*CP*AP*A)-3')
    ChainsF, E, I, L
    EngineeredYES
    Organism ScientificSYNTHETIC CONSTRUCT
    Organism Taxid32630
    SyntheticYES

 Structural Features

(-) Chains, Units

  123456789101112
Asymmetric Unit ABCDEFGHIJKL
Biological Unit 1 (1x)A C  F      
Biological Unit 2 (1x) B DE       
Biological Unit 3 (1x)      GHI   
Biological Unit 4 (1x)         JKL

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 8)

Asymmetric Unit (1, 8)
No.NameCountTypeFull Name
15CM8Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE
Biological Unit 1 (1, 2)
No.NameCountTypeFull Name
15CM2Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE
Biological Unit 2 (1, 2)
No.NameCountTypeFull Name
15CM2Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE
Biological Unit 3 (1, 2)
No.NameCountTypeFull Name
15CM2Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE
Biological Unit 4 (1, 2)
No.NameCountTypeFull Name
15CM2Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE

(-) Sites  (0, 0)

(no "Site" information available for 5EF6)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 5EF6)

(-) Cis Peptide Bonds  (14, 14)

Asymmetric Unit
No.Residues
1Arg A:217 -Lys A:218
2Lys A:218 -Lys A:219
3Ala A:276 -Lys A:277
4Arg B:217 -Lys B:218
5Lys B:218 -Lys B:219
6Ala B:276 -Lys B:277
7Arg G:217 -Lys G:218
8Lys G:218 -Lys G:219
9Ala G:276 -Lys G:277
10Lys G:277 -Val G:278
11Arg J:217 -Lys J:218
12Lys J:218 -Lys J:219
13Ala J:276 -Lys J:277
14Lys J:277 -Val J:278

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 5EF6)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 5EF6)

(-) Exons   (0, 0)

(no "Exon" information available for 5EF6)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:61
                                                                                             
               SCOP domains ------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------- Pfam domains
         Sec.struct. author .......hhhhhhhhhhhhhhh...hhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhh... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------- Transcript
                 5ef6 A 217 RKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAK 277
                                   226       236       246       256       266       276 

Chain B from PDB  Type:PROTEIN  Length:61
                                                                                             
               SCOP domains ------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------- Pfam domains
         Sec.struct. author .......hhhhhhhhhhhhhhh...hhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhh... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------- Transcript
                 5ef6 B 217 RKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAK 277
                                   226       236       246       256       266       276 

Chain C from PDB  Type:DNA  Length:18
                                                  
                 5ef6 C   1 TTGTGTTTTAxGAGGTCC  18
                                    10|       
                                     11-5CM   

Chain D from PDB  Type:DNA  Length:18
                                                  
                 5ef6 D   1 TTGTGTTTTAxGAGGTCC  18
                                    10|       
                                     11-5CM   

Chain E from PDB  Type:DNA  Length:18
                                                  
                 5ef6 E   1 GGACCTxGTAAAACACAA  18
                                  | 10        
                                  7-5CM       

Chain F from PDB  Type:DNA  Length:18
                                                  
                 5ef6 F   1 GGACCTxGTAAAACACAA  18
                                  | 10        
                                  7-5CM       

Chain G from PDB  Type:PROTEIN  Length:62
                                                                                              
               SCOP domains -------------------------------------------------------------- SCOP domains
               CATH domains -------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------- Pfam domains
         Sec.struct. author .......hhhhhhhhhhhhhhh...hhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhh..... Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -------------------------------------------------------------- PROSITE
                 Transcript -------------------------------------------------------------- Transcript
                 5ef6 G 217 RKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAKV 278
                                   226       236       246       256       266       276  

Chain H from PDB  Type:DNA  Length:18
                                                  
                 5ef6 H   1 TTGTGTTTTAxGAGGTCC  18
                                    10|       
                                     11-5CM   

Chain I from PDB  Type:DNA  Length:18
                                                  
                 5ef6 I   1 GGACCTxGTAAAACACAA  18
                                  | 10        
                                  7-5CM       

Chain J from PDB  Type:PROTEIN  Length:62
                                                                                              
               SCOP domains -------------------------------------------------------------- SCOP domains
               CATH domains -------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------- Pfam domains
         Sec.struct. author .......hhhhhhhhhhhhhhh...hhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhh.... Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -------------------------------------------------------------- PROSITE
                 Transcript -------------------------------------------------------------- Transcript
                 5ef6 J 217 RKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAKV 278
                                   226       236       246       256       266       276  

Chain K from PDB  Type:DNA  Length:18
                                                  
                 5ef6 K   1 TTGTGTTTTAxGAGGTCC  18
                                    10|       
                                     11-5CM   

Chain L from PDB  Type:DNA  Length:18
                                                  
                 5ef6 L   1 GGACCTxGTAAAACACAA  18
                                  | 10        
                                  7-5CM       

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 5EF6)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 5EF6)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 5EF6)

(-) Gene Ontology  (16, 16)

Asymmetric Unit(hide GO term definitions)

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    5CM  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
(no "Sites" information available for 5ef6)
 
  Cis Peptide Bonds
    Ala A:276 - Lys A:277   [ RasMol ]  
    Ala B:276 - Lys B:277   [ RasMol ]  
    Ala G:276 - Lys G:277   [ RasMol ]  
    Ala J:276 - Lys J:277   [ RasMol ]  
    Arg A:217 - Lys A:218   [ RasMol ]  
    Arg B:217 - Lys B:218   [ RasMol ]  
    Arg G:217 - Lys G:218   [ RasMol ]  
    Arg J:217 - Lys J:218   [ RasMol ]  
    Lys A:218 - Lys A:219   [ RasMol ]  
    Lys B:218 - Lys B:219   [ RasMol ]  
    Lys G:218 - Lys G:219   [ RasMol ]  
    Lys G:277 - Val G:278   [ RasMol ]  
    Lys J:218 - Lys J:219   [ RasMol ]  
    Lys J:277 - Val J:278   [ RasMol ]  
 
Biological Units
  Complete Structure
    Biological Unit 1  [ Jena3D ]
    Biological Unit 2  [ Jena3D ]
    Biological Unit 3  [ Jena3D ]
    Biological Unit 4  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  5ef6
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  HXB13_HUMAN | Q92826
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  HXB13_HUMAN | Q92826
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        HXB13_HUMAN | Q928262cra 5edn 5eea 5eg0 5ego

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 5EF6)