Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  THE NUCLEOSOME CONTAINING HUMAN TSH2B
 
Authors :  T. Urahama, N. Horikoshi, A. Osakabe, H. Tachiwana, H. Kurumizaka
Date :  22 Oct 13  (Deposition) - 09 Apr 14  (Release) - 09 Apr 14  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.80
Chains :  Asym./Biol. Unit :  A,B,C,D,E,F,G,H,I,J
Keywords :  Histone Variant, Histone-Fold, Dna Binding Protein, Structural Protein-Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  T. Urahama, N. Horikoshi, A. Osakabe, H. Tachiwana, H. Kurumizaka
Structure Of Human Nucleosome Containing The Testis-Specifi Histone Variant Tsh2B
Acta Crystallogr. , Sect. F V. 70 444 2014
PubMed: search  |  Reference-DOI: 10.1107/S2053230X14004695

(-) Compounds

Molecule 1 - HISTONE H3.1
    ChainsA, E
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPHCE
    Expression System StrainBL21(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneH3.1
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymHISTONE H3/A, HISTONE H3/B, HISTONE H3/C, HISTONE H3/D, HISTONE H3/F, HISTONE H3/H, HISTONE H3/I, HISTONE H3/J, HISTONE H3/K, HISTONE H3/L
 
Molecule 2 - HISTONE H4
    ChainsB, F
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET15B
    Expression System StrainJM109(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneH4
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
 
Molecule 3 - HISTONE H2A TYPE 1-B/E
    ChainsC, G
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPHCE
    Expression System StrainBL21(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneH2A
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymHISTONE H2A.2, HISTONE H2A/A, HISTONE H2A/M
 
Molecule 4 - HISTONE H2B TYPE 1-A
    ChainsD, H
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPHCE
    Expression System StrainBL21(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneHIST1H2BA, TSH2B
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymHISTONE H2B, TESTIS, TSH2B.1, TESTIS-SPECIFIC HISTONE H2B
 
Molecule 5 - DNA (145-MER)
    ChainsI, J
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPGEM-T EASY
    Expression System StrainDH5-ALPHA
    Expression System Taxid562
    Expression System Vector TypePLASMID
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606

 Structural Features

(-) Chains, Units

  12345678910
Asymmetric/Biological Unit ABCDEFGHIJ

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 9)

Asymmetric/Biological Unit (2, 9)
No.NameCountTypeFull Name
1CL4Ligand/IonCHLORIDE ION
2MN5Ligand/IonMANGANESE (II) ION

(-) Sites  (8, 8)

Asymmetric Unit (8, 8)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREPRO A:121 , LYS A:122BINDING SITE FOR RESIDUE CL A 201
2AC2SOFTWAREGLY C:46 , ALA C:47 , SER D:91 , SER D:92BINDING SITE FOR RESIDUE CL C 201
3AC3SOFTWAREPRO E:121 , LYS E:122BINDING SITE FOR RESIDUE CL E 201
4AC4SOFTWAREGLY G:44 , ALA G:45 , GLY G:46 , SER H:91 , SER H:92BINDING SITE FOR RESIDUE CL G 201
5AC5SOFTWAREDG I:68 , DC J:225BINDING SITE FOR RESIDUE MN I 401
6AC6SOFTWAREDG I:121 , DG I:122 , DC J:171BINDING SITE FOR RESIDUE MN I 402
7AC7SOFTWAREDG J:267BINDING SITE FOR RESIDUE MN J 301
8AC8SOFTWAREDG J:217BINDING SITE FOR RESIDUE MN J 302

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3WKJ)

(-) Cis Peptide Bonds  (1, 1)

Asymmetric/Biological Unit
No.Residues
1Gly D:105 -Glu D:106

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (1, 2)

Asymmetric/Biological Unit (1, 2)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_036206E64QH4_HUMANUnclassified747622981B/FE63Q

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)

(-) PROSITE Motifs  (3, 6)

Asymmetric/Biological Unit (3, 6)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1HISTONE_H2APS00046 Histone H2A signature.H2A1B_HUMAN22-28
 
  2C:21-27
G:21-27
2HISTONE_H3_2PS00959 Histone H3 signature 2.H31_HUMAN67-75
 
  2A:66-74
E:66-74
3HISTONE_H2BPS00357 Histone H2B signature.H2B1A_HUMAN94-116
 
  2D:93-115
H:93-115

(-) Exons   (2, 4)

Asymmetric/Biological Unit (2, 4)
 ENSEMBLUniProtKBPDB
No.Transcript IDExonExon IDGenome LocationLengthIDLocationLengthCountLocationLength
1.2bENST000003593032bENSE00001974165chr6:27858570-27858160411H31_HUMAN1-1361362A:38-134
E:38-135
97
98

2.1ENST000003778031ENSE00001475159chr6:26104104-26104518415H4_HUMAN1-1271272B:25-102
F:19-102
78
84

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:97
 aligned with H31_HUMAN | P68431 from UniProtKB/Swiss-Prot  Length:136

    Alignment length:97
                                    48        58        68        78        88        98       108       118       128       
            H31_HUMAN    39 PHRYRPGTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEACEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGER 135
               SCOP domains d3wkja_ A: Histone H3                                                                             SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ......hhhhhhhhhhhh.......hhhhhhhhhhhhhhh.....eehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhh... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------HISTONE_H------------------------------------------------------------ PROSITE
               Transcript 1 Exon 1.2b  PDB: A:38-134 UniProt: 1-136 [INCOMPLETE]                                              Transcript 1
                 3wkj A  38 PHRYRPGTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEACEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGER 134
                                    47        57        67        77        87        97       107       117       127       

Chain B from PDB  Type:PROTEIN  Length:78
 aligned with H4_HUMAN | P62805 from UniProtKB/Swiss-Prot  Length:103

    Alignment length:78
                                    35        45        55        65        75        85        95        
             H4_HUMAN    26 NIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG 103
               SCOP domains d3wkjb_ B: automated matches                                                   SCOP domains
               CATH domains ------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author hhhhhhhhhhhhhhhhhh..ee..hhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhh...ee.... Sec.struct. author
                 SAPs(SNPs) --------------------------------------Q--------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------ PROSITE
               Transcript 2 Exon 2.1  PDB: B:25-102 UniProt: 1-127 [INCOMPLETE]                            Transcript 2
                 3wkj B  25 NIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG 102
                                    34        44        54        64        74        84        94        

Chain C from PDB  Type:PROTEIN  Length:107
 aligned with H2A1B_HUMAN | P04908 from UniProtKB/Swiss-Prot  Length:130

    Alignment length:107
                                    22        32        42        52        62        72        82        92       102       112       
          H2A1B_HUMAN    13 AKAKTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLLPK 119
               SCOP domains d3wkjc_ C: automated matches                                                                                SCOP domains
               CATH domains ----------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....hhhhhhh...hhhhhhhhhhh.....ee.hhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhh...ee.........hhhhh.. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------HISTONE------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------- Transcript
                 3wkj C  12 AKAKTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLLPK 118
                                    21        31        41        51        61        71        81        91       101       111       

Chain D from PDB  Type:PROTEIN  Length:93
 aligned with H2B1A_HUMAN | Q96A08 from UniProtKB/Swiss-Prot  Length:127

    Alignment length:93
                                    43        53        63        73        83        93       103       113       123   
          H2B1A_HUMAN    34 TRKESYSIYIYKVLKQVHPDTGISSKAMSIMNSFVTDIFERIASEASRLAHYSKRSTISSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSS 126
               SCOP domains d3wkjd_ D: automated matches                                                                  SCOP domains
               CATH domains --------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .....hhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhh..hhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------HISTONE_H2B            ---------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------- Transcript
                 3wkj D  33 TRKESYSIYIYKVLKQVHPDTGISSKAMSIMNSFVTDIFERIASEASRLAHYSKRSTISSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSS 125
                                    42        52        62        72        82        92       102       112       122   

Chain E from PDB  Type:PROTEIN  Length:98
 aligned with H31_HUMAN | P68431 from UniProtKB/Swiss-Prot  Length:136

    Alignment length:98
                                    48        58        68        78        88        98       108       118       128        
            H31_HUMAN    39 PHRYRPGTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEACEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA 136
               SCOP domains d3wkje_ E: Histone H3                                                                              SCOP domains
               CATH domains -------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ......hhhhhhhhhhhhhh.....hhhhhhhhhhhhhhh.....eehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhh.... Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------HISTONE_H------------------------------------------------------------- PROSITE
               Transcript 1 Exon 1.2b  PDB: E:38-135 UniProt: 1-136 [INCOMPLETE]                                               Transcript 1
                 3wkj E  38 PHRYRPGTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEACEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA 135
                                    47        57        67        77        87        97       107       117       127        

Chain F from PDB  Type:PROTEIN  Length:84
 aligned with H4_HUMAN | P62805 from UniProtKB/Swiss-Prot  Length:103

    Alignment length:84
                                    29        39        49        59        69        79        89        99    
             H4_HUMAN    20 RKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG 103
               SCOP domains d3wkjf_ F: automated matches                                                         SCOP domains
               CATH domains ------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .....hhhhhhhhhhhhhhhhhh...ee..hhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhh...ee.... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------Q--------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------ PROSITE
               Transcript 2 Exon 2.1  PDB: F:19-102 UniProt: 1-127 [INCOMPLETE]                                  Transcript 2
                 3wkj F  19 RKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG 102
                                    28        38        48        58        68        78        88        98    

Chain G from PDB  Type:PROTEIN  Length:104
 aligned with H2A1B_HUMAN | P04908 from UniProtKB/Swiss-Prot  Length:130

    Alignment length:104
                                    25        35        45        55        65        75        85        95       105       115    
          H2A1B_HUMAN    16 KTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLLPK 119
               SCOP domains d3wkjg_ G: automated matches                                                                             SCOP domains
               CATH domains -------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .hhhhhhh...hhhhhhhhhhh.....ee..hhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhh...ee.........hhhhh.. Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------HISTONE------------------------------------------------------------------------------------------- PROSITE
                 Transcript -------------------------------------------------------------------------------------------------------- Transcript
                 3wkj G  15 KTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLLPK 118
                                    24        34        44        54        64        74        84        94       104       114    

Chain H from PDB  Type:PROTEIN  Length:92
 aligned with H2B1A_HUMAN | Q96A08 from UniProtKB/Swiss-Prot  Length:127

    Alignment length:92
                                    44        54        64        74        84        94       104       114       124  
          H2B1A_HUMAN    35 RKESYSIYIYKVLKQVHPDTGISSKAMSIMNSFVTDIFERIASEASRLAHYSKRSTISSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSS 126
               SCOP domains d3wkjh_ H: automated matches                                                                 SCOP domains
               CATH domains -------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....hhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -----------------------------------------------------------HISTONE_H2B            ---------- PROSITE
                 Transcript -------------------------------------------------------------------------------------------- Transcript
                 3wkj H  34 RKESYSIYIYKVLKQVHPDTGISSKAMSIMNSFVTDIFERIASEASRLAHYSKRSTISSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSS 125
                                    43        53        63        73        83        93       103       113       123  

Chain I from PDB  Type:DNA  Length:145
                                                                                                                                                                                 
                 3wkj I   1 ATCAATATCCACCTGCAGATTCTACCAAAAGTGTATTTGGAAACTGCTCCATCAAAAGGCATGTTCAGCTGAATTCAGCTGAACATGCCTTTTGATGGAGCAGTTTCCAAATACACTTTTGGTAGAATCTGCAGGTGGATATTGA 145
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140     

Chain J from PDB  Type:DNA  Length:145
                                                                                                                                                                                 
                 3wkj J 148 TCAATATCCACCTGCAGATTCTACCAAAAGTGTATTTGGAAACTGCTCCATCAAAAGGCATGTTCAGCTGAATTCAGCTGAACATGCCTTTTGATGGAGCAGTTTCCAAATACACTTTTGGTAGAATCTGCAGGTGGATATTGAT 292
                                   157       167       177       187       197       207       217       227       237       247       257       267       277       287     

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (2, 8)

Asymmetric/Biological Unit

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3WKJ)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 3WKJ)

(-) Gene Ontology  (48, 86)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A,E   (H31_HUMAN | P68431)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0042393    histone binding    Interacting selectively and non-covalently with a histone, any of a group of water-soluble proteins found in association with the DNA of eukaroytic chromosomes. They are involved in the condensation and coiling of chromosomes during cell division and have also been implicated in nonspecific suppression of gene activity.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0006335    DNA replication-dependent nucleosome assembly    The formation of nucleosomes on newly replicated DNA, coupled to strand elongation.
    GO:0007596    blood coagulation    The sequential process in which the multiple coagulation factors of the blood interact, ultimately resulting in the formation of an insoluble fibrin clot; it may be divided into three stages: stage 1, the formation of intrinsic and extrinsic prothrombin converting principle; stage 2, the formation of thrombin; stage 3, the formation of stable fibrin polymers.
    GO:0044267    cellular protein metabolic process    The chemical reactions and pathways involving a specific protein, rather than of proteins in general, occurring at the level of an individual cell. Includes cellular protein modification.
    GO:0000183    chromatin silencing at rDNA    Repression of transcription of ribosomal DNA by altering the structure of chromatin.
    GO:0031047    gene silencing by RNA    Any process in which RNA molecules inactivate expression of target genes.
    GO:0045814    negative regulation of gene expression, epigenetic    Any epigenetic process that stops, prevents or reduces the rate of gene expression.
    GO:0006334    nucleosome assembly    The aggregation, arrangement and bonding together of a nucleosome, the beadlike structural units of eukaryotic chromatin composed of histones and DNA.
    GO:0002230    positive regulation of defense response to virus by host    Any host process that results in the promotion of antiviral immune response mechanisms, thereby limiting viral replication.
    GO:0045815    positive regulation of gene expression, epigenetic    Any epigenetic process that activates or increases the rate of gene expression.
    GO:0051290    protein heterotetramerization    The formation of a protein heterotetramer, a macromolecular structure consisting of four noncovalently associated subunits, of which not all are identical.
    GO:0060968    regulation of gene silencing    Any process that modulates the rate, frequency, or extent of gene silencing, the transcriptional or post-transcriptional process carried out at the cellular level that results in long-term gene inactivation.
    GO:0032200    telomere organization    A process that is carried out at the cellular level which results in the assembly, arrangement of constituent parts, or disassembly of telomeres, terminal regions of a linear chromosome that include the telomeric DNA repeats and associated proteins.
    GO:0098792    xenophagy    The macroautophagy process in which a region of cytoplasm containing an intracellular pathogen or some part of an intracellular pathogen (e.g. viral capsid) is enclosed in a double membrane bound autophagosome, which then fuses with the lysosome leading to degradation of the contents.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0070062    extracellular exosome    A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.
    GO:0000228    nuclear chromosome    A chromosome that encodes the nuclear genome and is found in the nucleus of a eukaryotic cell during the cell cycle phases when the nucleus is intact.
    GO:0000784    nuclear chromosome, telomeric region    The terminal region of a linear nuclear chromosome that includes the telomeric DNA repeats and associated proteins.
    GO:0000788    nuclear nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA in the nucleus into higher order structures.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0043234    protein complex    A stable macromolecular complex composed (only) of two or more polypeptide subunits along with any covalently attached molecules (such as lipid anchors or oligosaccharide) or non-protein prosthetic groups (such as nucleotides or metal ions). Prosthetic group in this context refers to a tightly bound cofactor. The component polypeptide subunits may be identical.

Chain B,F   (H4_HUMAN | P62805)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0042393    histone binding    Interacting selectively and non-covalently with a histone, any of a group of water-soluble proteins found in association with the DNA of eukaroytic chromosomes. They are involved in the condensation and coiling of chromosomes during cell division and have also been implicated in nonspecific suppression of gene activity.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0019904    protein domain specific binding    Interacting selectively and non-covalently with a specific domain of a protein.
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0034080    CENP-A containing nucleosome assembly    The formation of nucleosomes containing the histone H3 variant CENP-A to form centromeric chromatin. This specialised chromatin occurs at centromeric region in point centromeres, and the central core in modular centromeres.
    GO:0006335    DNA replication-dependent nucleosome assembly    The formation of nucleosomes on newly replicated DNA, coupled to strand elongation.
    GO:0006336    DNA replication-independent nucleosome assembly    The formation of nucleosomes outside the context of DNA replication.
    GO:0006352    DNA-templated transcription, initiation    Any process involved in the assembly of the RNA polymerase preinitiation complex (PIC) at the core promoter region of a DNA template, resulting in the subsequent synthesis of RNA from that promoter. The initiation phase includes PIC assembly and the formation of the first few bonds in the RNA chain, including abortive initiation, which occurs when the first few nucleotides are repeatedly synthesized and then released. The initiation phase ends just before and does not include promoter clearance, or release, which is the transition between the initiation and elongation phases of transcription.
    GO:1904837    beta-catenin-TCF complex assembly    The aggregation, arrangement and bonding together of a set of components to form a beta-catenin-TCF complex.
    GO:0044267    cellular protein metabolic process    The chemical reactions and pathways involving a specific protein, rather than of proteins in general, occurring at the level of an individual cell. Includes cellular protein modification.
    GO:0000183    chromatin silencing at rDNA    Repression of transcription of ribosomal DNA by altering the structure of chromatin.
    GO:0006303    double-strand break repair via nonhomologous end joining    The repair of a double-strand break in DNA in which the two broken ends are rejoined with little or no sequence complementarity. Information at the DNA ends may be lost due to the modification of broken DNA ends. This term covers instances of separate pathways, called classical (or canonical) and alternative nonhomologous end joining (C-NHEJ and A-NHEJ). These in turn may further branch into sub-pathways, but evidence is still unclear.
    GO:0031047    gene silencing by RNA    Any process in which RNA molecules inactivate expression of target genes.
    GO:0045814    negative regulation of gene expression, epigenetic    Any epigenetic process that stops, prevents or reduces the rate of gene expression.
    GO:0045653    negative regulation of megakaryocyte differentiation    Any process that stops, prevents, or reduces the frequency, rate or extent of megakaryocyte differentiation.
    GO:0006334    nucleosome assembly    The aggregation, arrangement and bonding together of a nucleosome, the beadlike structural units of eukaryotic chromatin composed of histones and DNA.
    GO:0045815    positive regulation of gene expression, epigenetic    Any epigenetic process that activates or increases the rate of gene expression.
    GO:0051290    protein heterotetramerization    The formation of a protein heterotetramer, a macromolecular structure consisting of four noncovalently associated subunits, of which not all are identical.
    GO:0016233    telomere capping    A process in which telomeres are protected from degradation and fusion, thereby ensuring chromosome stability by protecting the ends from both degradation and from being recognized as damaged DNA. May be mediated by specific single- or double-stranded telomeric DNA binding proteins.
    GO:0032200    telomere organization    A process that is carried out at the cellular level which results in the assembly, arrangement of constituent parts, or disassembly of telomeres, terminal regions of a linear chromosome that include the telomeric DNA repeats and associated proteins.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0070062    extracellular exosome    A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.
    GO:0000228    nuclear chromosome    A chromosome that encodes the nuclear genome and is found in the nucleus of a eukaryotic cell during the cell cycle phases when the nucleus is intact.
    GO:0000784    nuclear chromosome, telomeric region    The terminal region of a linear nuclear chromosome that includes the telomeric DNA repeats and associated proteins.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0043234    protein complex    A stable macromolecular complex composed (only) of two or more polypeptide subunits along with any covalently attached molecules (such as lipid anchors or oligosaccharide) or non-protein prosthetic groups (such as nucleotides or metal ions). Prosthetic group in this context refers to a tightly bound cofactor. The component polypeptide subunits may be identical.

Chain C,G   (H2A1B_HUMAN | P04908)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0006342    chromatin silencing    Repression of transcription by altering the structure of chromatin, e.g. by conversion of large regions of DNA into an inaccessible state often called heterochromatin.
    GO:0008285    negative regulation of cell proliferation    Any process that stops, prevents or reduces the rate or extent of cell proliferation.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0070062    extracellular exosome    A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
    GO:0000790    nuclear chromatin    The ordered and organized complex of DNA, protein, and sometimes RNA, that forms the chromosome in the nucleus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

Chain D,H   (H2B1A_HUMAN | Q96A08)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0003674    molecular_function    Elemental activities, such as catalysis or binding, describing the actions of a gene product at the molecular level. A given gene product may exhibit one or more molecular functions.
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0006325    chromatin organization    Any process that results in the specification, formation or maintenance of the physical structure of eukaryotic chromatin.
    GO:0006954    inflammatory response    The immediate defensive reaction (by vertebrate tissue) to infection or injury caused by chemical or physical agents. The process is characterized by local vasodilation, extravasation of plasma into intercellular spaces and accumulation of white blood cells and macrophages.
    GO:0071674    mononuclear cell migration    The movement of a mononuclear cell within or between different tissues and organs of the body.
    GO:0006334    nucleosome assembly    The aggregation, arrangement and bonding together of a nucleosome, the beadlike structural units of eukaryotic chromatin composed of histones and DNA.
    GO:0006337    nucleosome disassembly    The controlled breakdown of nucleosomes, the beadlike structural units of eukaryotic chromatin composed of histones and DNA.
    GO:0031639    plasminogen activation    The process in which inactive plasminogen is processed to active plasmin. This process includes cleavage at an internal Arg-Val site to form an N-terminal A-chain and C-terminal B-chain held together by a disulfide bond, and can include further proteolytic cleavage events to remove the preactivation peptide.
    GO:0051099    positive regulation of binding    Any process that activates or increases the rate or extent of binding, the selective interaction of a molecule with one or more specific sites on another molecule.
    GO:0035093    spermatogenesis, exchange of chromosomal proteins    The replacement of somatic histones within sperm chromatin with sperm-specific histones or protamines with unique DNA-binding properties, resulting in condensation of the sperm chromatin.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0019897    extrinsic component of plasma membrane    The component of a plasma membrane consisting of gene products and protein complexes that are loosely bound to one of its surfaces, but not integrated into the hydrophobic region.
    GO:0000784    nuclear chromosome, telomeric region    The terminal region of a linear nuclear chromosome that includes the telomeric DNA repeats and associated proteins.
    GO:0000788    nuclear nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA in the nucleus into higher order structures.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    CL  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    MN  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
    AC7  [ RasMol ]  +environment [ RasMol ]
    AC8  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Gly D:105 - Glu D:106   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3wkj
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  H2A1B_HUMAN | P04908
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  H2B1A_HUMAN | Q96A08
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  H31_HUMAN | P68431
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  H4_HUMAN | P62805
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  H2A1B_HUMAN | P04908
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  H2B1A_HUMAN | Q96A08
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  H31_HUMAN | P68431
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  H4_HUMAN | P62805
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        H2A1B_HUMAN | P049082cv5 2rvq 3a6n 3afa 3an2 3av1 3av2 3ayw 3aze 3azf 3azg 3azh 3azi 3azj 3azk 3azl 3azm 3azn 3w96 3w97 3w98 3w99 3wtp 3x1s 3x1v 4ym5 4ym6 4z5t 5av5 5av6 5av8 5av9 5avb 5avc 5ay8 5b0y 5b0z 5b24 5b2i 5b2j 5b31 5b32 5b40 5cpi 5cpj 5cpk 5gse 5gtc 5gxq 5jrg 5vey 5x7x
        H2B1A_HUMAN | Q96A085gsu 5gt3
        H31_HUMAN | P684311cs9 1ct6 1guw 1o9s 1q3l 2b2t 2b2u 2b2v 2b2w 2c1j 2c1n 2cv5 2kwj 2kwk 2l75 2lbm 2m0o 2ndf 2ndg 2oq6 2ot7 2ox0 2ri7 2uxn 2v89 2vpg 2x0l 3a1b 3afa 3avr 3ayw 3aze 3azf 3azg 3azh 3azi 3azj 3azk 3azl 3azm 3azn 3b95 3kmt 3kqi 3lqi 3lqj 3o34 3o35 3o37 3qj6 3rig 3riy 3sou 3sow 3u31 3u3d 3u4s 3u5n 3u5o 3u5p 3uee 3uef 3uig 3uii 3uik 3v43 3w96 3w97 3w98 3w99 3wa9 3waa 3x1s 3x1t 3x1u 3x1v 3zg6 3zvy 4a0j 4a0n 4a7j 4bd3 4c1q 4f4u 4f56 4ft2 4ft4 4fwf 4hon 4i51 4l7x 4lk9 4lka 4llb 4lxl 4n4h 4qbq 4qbr 4qbs 4tn7 4u68 4up0 4uy4 4x3k 4y6l 4yhp 4yhz 4ym5 4ym6 4z0r 4z2m 5av5 5av6 5av8 5av9 5avb 5avc 5b24 5b2i 5b2j 5b31 5c11 5c13 5c3i 5cpi 5cpj 5cpk 5d6y 5dah 5fb0 5fb1 5ffv 5gse 5gsu 5gt0 5gt3 5gtc 5hjb 5hjc 5hjd 5hyn 5iql 5j3v 5j9s 5jhn 5jin 5jiy 5jj0 5jrg 5kjh 5kji 5kkl 5m5g 5svx 5svy 5t0k 5t0m 5t1g 5t1i 5t8r 5tbn 5tdr 5tdw 5v21 5v22 5va6
        H4_HUMAN | P628051kx4 1kx5 1m18 1m19 1m1a 1s32 1zkk 2bqz 2cv5 2ig0 2kwn 2kwo 2lvm 2qqs 2rje 2rny 2rs9 3a6n 3afa 3an2 3av1 3av2 3ayw 3aze 3azf 3azg 3azh 3azi 3azj 3azk 3azl 3azm 3azn 3cfs 3cfv 3f9w 3f9x 3f9y 3f9z 3ij1 3jpx 3nqj 3nqu 3o36 3qby 3qzs 3qzt 3qzv 3r45 3uvw 3uvx 3uvy 3uw9 3w96 3w97 3w98 3w99 3wa9 3waa 3wtp 3x1s 3x1t 3x1u 3x1v 4gqb 4h9n 4h9o 4h9p 4h9q 4h9r 4h9s 4hga 4m38 4n3w 4n4f 4qut 4quu 4qyd 4u9w 4ym5 4ym6 4yy6 4yyd 4yyg 4yyh 4yyi 4yyj 4yyk 4yym 4yyn 4z2m 4z5t 5av5 5av6 5av8 5av9 5avb 5avc 5ay8 5b0y 5b0z 5b24 5b2i 5b2j 5b31 5b32 5b33 5b40 5bnv 5bnx 5bo0 5c3i 5cpi 5cpj 5cpk 5fa5 5ffw 5fwe 5gse 5gsu 5gt0 5gt3 5gtc 5gxq 5ja4 5jrg 5kdm 5teg 5x7x

(-) Related Entries Specified in the PDB File

3wkk