Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF THE HETEROTYPIC NUCLEOSOME CONTAINING HUMAN CENP-A AND H3.3
 
Authors :  Y. Arimura, K. Shirayama, N. Horikoshi, R. Fujita, W. Kagawa, T. Fukaga G. Almouzni, H. Kurumizaka
Date :  14 Apr 14  (Deposition) - 03 Dec 14  (Release) - 03 Dec 14  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.67
Chains :  Asym./Biol. Unit :  A,B,C,D,E,F,G,H,I,J
Keywords :  Histone Fold, Dna Binding, Chromatin Formation, Dna Binding Protein- Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  Y. Arimura, K. Shirayama, N. Horikoshi, R. Fujita, H. Taguchi, W. Kagawa, T. Fukagawa, G. Almouzni, H. Kurumizaka
Crystal Structure And Stable Property Of The Cancer-Associated Heterotypic Nucleosome Containing Cenp-A And H3. 3
Sci Rep V. 4 7115 2014
PubMed-ID: 25408271  |  Reference-DOI: 10.1038/SREP07115

(-) Compounds

Molecule 1 - HISTONE H3-LIKE CENTROMERIC PROTEIN A
    ChainsA
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPUC19
    Expression System StrainDH5A
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneCENPA
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymCENTROMERE AUTOANTIGEN A, CENTROMERE PROTEIN A, CENP-A
 
Molecule 2 - HISTONE H4
    ChainsB, F
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPUC19
    Expression System StrainBL21(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneH3.3, H4/A, H4/B, H4/C, H4/D, H4/E, H4/G, H4/H, H4/I, H4/J, H4/K, H4/M, H4/N, H4/O, H4F2, H4FA, H4FB, H4FC, H4FD, H4FE, H4FG, H4FH, H4FI, H4FJ, H4FK, H4FM, H4FN, H4FO, HIST1H4A, HIST1H4B, HIST1H4C, HIST1H4D, HIST1H4E, HIST1H4F, HIST1H4H, HIST1H4I, HIST1H4J, HIST1H4K, HIST1H4L, HIST2H4, HIST2H4A, HIST2H4B, HIST4H4
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
 
Molecule 3 - HISTONE H2A TYPE 1-B/E
    ChainsC, G
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET15B
    Expression System StrainJM109(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneH2AFA, H2AFM, H4, HIST1H2AB, HIST1H2AE
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymHISTONE H2A.2, HISTONE H2A/A, HISTONE H2A/M
 
Molecule 4 - HISTONE H2B TYPE 1-J
    ChainsD, H
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPUC19
    Expression System StrainBL21(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneH2A, H2BFR, HIST1H2BJ
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymHISTONE H2B.1, HISTONE H2B.R, H2B/R
 
Molecule 5 - HISTONE H3.3
    ChainsE
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPUC19
    Expression System StrainBL21(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneH2B, H3.3A, H3.3B, H3F3, H3F3A, H3F3B, PP781
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
 
Molecule 6 - DNA (146-MER)
    ChainsI, J
    EngineeredYES
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SyntheticYES

 Structural Features

(-) Chains, Units

  12345678910
Asymmetric/Biological Unit ABCDEFGHIJ

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (0, 0)

(no "Ligand,Modified Residues,Ions" information available for 3WTP)

(-) Sites  (0, 0)

(no "Site" information available for 3WTP)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3WTP)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 3WTP)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (1, 2)

Asymmetric/Biological Unit (1, 2)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_036206E64QH4_HUMANUnclassified747622981B/FE63Q

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)

(-) PROSITE Motifs  (3, 6)

Asymmetric/Biological Unit (3, 6)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1HISTONE_H2APS00046 Histone H2A signature.H2A1B_HUMAN22-28
 
  2C:21-27
G:21-27
2HISTONE_H3_2PS00959 Histone H3 signature 2.CENPA_HUMAN66-74  1A:66-74
H33_HUMAN67-75  1E:66-74
3HISTONE_H2BPS00357 Histone H2B signature.H2B1J_HUMAN93-115
 
  2D:92-114
H:92-114

(-) Exons   (4, 5)

Asymmetric/Biological Unit (4, 5)
 ENSEMBLUniProtKBPDB
No.Transcript IDExonExon IDGenome LocationLengthIDLocationLengthCountLocationLength
1.1ENST000003668161ENSE00001442675chr1:226249552-226249795244H33_HUMAN-00--
1.2cENST000003668162cENSE00001442674chr1:226250436-22625051277H33_HUMAN-00--
1.3bENST000003668163bENSE00001428176chr1:226252030-226252180151H33_HUMAN1-43431E:36-427
1.4aENST000003668164aENSE00001692850chr1:226253357-226253510154H33_HUMAN43-94521E:42-9352
1.5aENST000003668165aENSE00001381318chr1:226259052-226259224173H33_HUMAN95-136421E:94-13542

2.1ENST000003778031ENSE00001475159chr6:26104104-26104518415H4_HUMAN1-1271272B:25-102
F:18-102
78
85

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:89
 aligned with CENPA_HUMAN | P49450 from UniProtKB/Swiss-Prot  Length:140

    Alignment length:89
                                    55        65        75        85        95       105       115       125         
          CENPA_HUMAN    46 GWLKEIRKLQKSTHLLIRKLPFSRLAREICVKFTRGVDFNWQAQALLALQEAAEAFLVHLFEDAYLLTLHAGRVTLFPKDVQLARRIRG 134
               SCOP domains ----------------------------------------------------------------------------------------- SCOP domains
               CATH domains ----------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author hhhhhhhhhhhh.....hhhhhhhhhhhhhhhhh.....eehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------HISTONE_H------------------------------------------------------------ PROSITE
                 Transcript ----------------------------------------------------------------------------------------- Transcript
                 3wtp A  46 GWLKEIRKLQKSTHLLIRKLPFSRLAREICVKFTRGVDFNWQAQALLALQEAAEAFLVHLFEDAYLLTLHAGRVTLFPKDVQLARRIRG 134
                                    55        65        75        85        95       105       115       125         

Chain B from PDB  Type:PROTEIN  Length:78
 aligned with H4_HUMAN | P62805 from UniProtKB/Swiss-Prot  Length:103

    Alignment length:78
                                    35        45        55        65        75        85        95        
             H4_HUMAN    26 NIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG 103
               SCOP domains ------------------------------------------------------------------------------ SCOP domains
               CATH domains ------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author hhhhhhhhhhhhhhhhh...ee..hhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhh..eee.... Sec.struct. author
                 SAPs(SNPs) --------------------------------------Q--------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------ PROSITE
               Transcript 2 Exon 2.1  PDB: B:25-102 UniProt: 1-127 [INCOMPLETE]                            Transcript 2
                 3wtp B  25 NIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG 102
                                    34        44        54        64        74        84        94        

Chain C from PDB  Type:PROTEIN  Length:107
 aligned with H2A1B_HUMAN | P04908 from UniProtKB/Swiss-Prot  Length:130

    Alignment length:107
                                    22        32        42        52        62        72        82        92       102       112       
          H2A1B_HUMAN    13 AKAKTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLLPK 119
               SCOP domains ----------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ----------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....hhhhhhh...hhhhhhhhhhhh....ee.hhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhh..eee.........hhhhh.. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------HISTONE------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------- Transcript
                 3wtp C  12 AKAKTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLLPK 118
                                    21        31        41        51        61        71        81        91       101       111       

Chain D from PDB  Type:PROTEIN  Length:94
 aligned with H2B1J_HUMAN | P06899 from UniProtKB/Swiss-Prot  Length:126

    Alignment length:94
                                    40        50        60        70        80        90       100       110       120    
          H2B1J_HUMAN    31 KRSRKESYSIYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTS 124
               SCOP domains ---------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .......hhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhh..hhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------HISTONE_H2B            --------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------- Transcript
                 3wtp D  30 KRSRKESYSIYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTS 123
                                    39        49        59        69        79        89        99       109       119    

Chain E from PDB  Type:PROTEIN  Length:100
 aligned with H33_HUMAN | P84243 from UniProtKB/Swiss-Prot  Length:136

    Alignment length:100
                                    46        56        66        76        86        96       106       116       126       136
            H33_HUMAN    37 KKPHRYRPGTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA 136
               SCOP domains ---------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ........hhhhhhhhhhhhhh.....hhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhh.... Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------- SAPs(SNPs)
                PROSITE (2) ------------------------------HISTONE_H------------------------------------------------------------- PROSITE (2)
           Transcript 1 (1) 1.3b   ---------------------------------------------------Exon 1.5a  PDB: E:94-135 UniProt: 95-136   Transcript 1 (1)
           Transcript 1 (2) ------Exon 1.4a  PDB: E:42-93 UniProt: 43-94              ------------------------------------------ Transcript 1 (2)
                 3wtp E  36 KKPHRYRPGTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA 135
                                    45        55        65        75        85        95       105       115       125       135

Chain F from PDB  Type:PROTEIN  Length:85
 aligned with H4_HUMAN | P62805 from UniProtKB/Swiss-Prot  Length:103

    Alignment length:85
                                    28        38        48        58        68        78        88        98     
             H4_HUMAN    19 HRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG 103
               SCOP domains ------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ......hhhhhhhhhhhhhhhhhh...ee..hhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhh..eee.... Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------Q--------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------- PROSITE
               Transcript 2 Exon 2.1  PDB: F:18-102 UniProt: 1-127 [INCOMPLETE]                                   Transcript 2
                 3wtp F  18 HRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGFGG 102
                                    27        37        47        57        67        77        87        97     

Chain G from PDB  Type:PROTEIN  Length:103
 aligned with H2A1B_HUMAN | P04908 from UniProtKB/Swiss-Prot  Length:130

    Alignment length:103
                                    24        34        44        54        64        74        84        94       104       114   
          H2A1B_HUMAN    15 AKTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLL 117
               SCOP domains ------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ..hhhhhhh...hhhhhhhhhhhh....ee.hhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhh..eee.........hhhhh Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -------HISTONE----------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------- Transcript
                 3wtp G  14 AKTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLL 116
                                    23        33        43        53        63        73        83        93       103       113   

Chain H from PDB  Type:PROTEIN  Length:93
 aligned with H2B1J_HUMAN | P06899 from UniProtKB/Swiss-Prot  Length:126

    Alignment length:93
                                    42        52        62        72        82        92       102       112       122   
          H2B1J_HUMAN    33 SRKESYSIYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSA 125
               SCOP domains --------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .....hhhhhhhhhhhhh...eehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...eehhhhhhhhhhhhh..hhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------HISTONE_H2B            ---------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------- Transcript
                 3wtp H  32 SRKESYSIYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSA 124
                                    41        51        61        71        81        91       101       111       121   

Chain I from PDB  Type:DNA  Length:146
                                                                                                                                                                                  
                 3wtp I   1 ATCAATATCCACCTGCAGATTCTACCAAAAGTGTATTTGGAAACTGCTCCATCAAAAGGCATGTTCAGCTGAATTCAGCTGAACATGCCTTTTGATGGAGCAGTTTCCAAATACACTTTTGGTAGAATCTGCAGGTGGATATTGAT 146
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140      

Chain J from PDB  Type:DNA  Length:146
                                                                                                                                                                                  
                 3wtp J 147 ATCAATATCCACCTGCAGATTCTACCAAAAGTGTATTTGGAAACTGCTCCATCAAAAGGCATGTTCAGCTGAATTCAGCTGAACATGCCTTTTGATGGAGCAGTTTCCAAATACACTTTTGGTAGAATCTGCAGGTGGATATTGAT 292
                                   156       166       176       186       196       206       216       226       236       246       256       266       276       286      

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 3WTP)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3WTP)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 3WTP)

(-) Gene Ontology  (61, 104)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A   (CENPA_HUMAN | P49450)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0003682    chromatin binding    Interacting selectively and non-covalently with chromatin, the network of fibers of DNA, protein, and sometimes RNA, that make up the chromosomes of the eukaryotic nucleus during interphase.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0034080    CENP-A containing nucleosome assembly    The formation of nucleosomes containing the histone H3 variant CENP-A to form centromeric chromatin. This specialised chromatin occurs at centromeric region in point centromeres, and the central core in modular centromeres.
    GO:0000132    establishment of mitotic spindle orientation    A cell cycle process that sets the alignment of mitotic spindle relative to other cellular structures.
    GO:0051382    kinetochore assembly    The aggregation, arrangement and bonding together of a set of components to form the kinetochore, a multisubunit complex that is located at the centromeric region of DNA and provides an attachment point for the spindle microtubules.
    GO:0071459    protein localization to chromosome, centromeric region    Any process in which a protein is transported to, or maintained at, the centromeric region of a chromosome.
    GO:0007062    sister chromatid cohesion    The cell cycle process in which the sister chromatids of a replicated chromosome become tethered to each other.
    GO:0016032    viral process    A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0000775    chromosome, centromeric region    The region of a chromosome that includes the centromeric DNA and associated proteins. In monocentric chromosomes, this region corresponds to a single area of the chromosome, whereas in holocentric chromosomes, it is evenly distributed along the chromosome.
    GO:0000939    condensed chromosome inner kinetochore    The region of a condensed chromosome kinetochore closest to centromeric DNA; in mammals the CREST antigens (CENP proteins) are found in this layer; this layer may help define underlying centromeric chromatin structure and position of the kinetochore on the chromosome.
    GO:0000777    condensed chromosome kinetochore    A multisubunit complex that is located at the centromeric region of a condensed chromosome and provides an attachment point for the spindle microtubules.
    GO:0000778    condensed nuclear chromosome kinetochore    A multisubunit complex that is located at the centromeric region of a condensed nuclear chromosome and provides an attachment point for the spindle microtubules.
    GO:0000780    condensed nuclear chromosome, centromeric region    The region of a condensed nuclear chromosome that includes the centromere and associated proteins, including the kinetochore. In monocentric chromosomes, this region corresponds to a single area of the chromosome, whereas in holocentric chromosomes, it is evenly distributed along the chromosome.
    GO:0005829    cytosol    The part of the cytoplasm that does not contain organelles but which does contain other particulate matter, such as protein complexes.
    GO:0000776    kinetochore    A multisubunit complex that is located at the centromeric region of DNA and provides an attachment point for the spindle microtubules.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

Chain B,F   (H4_HUMAN | P62805)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0042393    histone binding    Interacting selectively and non-covalently with a histone, any of a group of water-soluble proteins found in association with the DNA of eukaroytic chromosomes. They are involved in the condensation and coiling of chromosomes during cell division and have also been implicated in nonspecific suppression of gene activity.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0019904    protein domain specific binding    Interacting selectively and non-covalently with a specific domain of a protein.
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0034080    CENP-A containing nucleosome assembly    The formation of nucleosomes containing the histone H3 variant CENP-A to form centromeric chromatin. This specialised chromatin occurs at centromeric region in point centromeres, and the central core in modular centromeres.
    GO:0006335    DNA replication-dependent nucleosome assembly    The formation of nucleosomes on newly replicated DNA, coupled to strand elongation.
    GO:0006336    DNA replication-independent nucleosome assembly    The formation of nucleosomes outside the context of DNA replication.
    GO:0006352    DNA-templated transcription, initiation    Any process involved in the assembly of the RNA polymerase preinitiation complex (PIC) at the core promoter region of a DNA template, resulting in the subsequent synthesis of RNA from that promoter. The initiation phase includes PIC assembly and the formation of the first few bonds in the RNA chain, including abortive initiation, which occurs when the first few nucleotides are repeatedly synthesized and then released. The initiation phase ends just before and does not include promoter clearance, or release, which is the transition between the initiation and elongation phases of transcription.
    GO:1904837    beta-catenin-TCF complex assembly    The aggregation, arrangement and bonding together of a set of components to form a beta-catenin-TCF complex.
    GO:0044267    cellular protein metabolic process    The chemical reactions and pathways involving a specific protein, rather than of proteins in general, occurring at the level of an individual cell. Includes cellular protein modification.
    GO:0000183    chromatin silencing at rDNA    Repression of transcription of ribosomal DNA by altering the structure of chromatin.
    GO:0006303    double-strand break repair via nonhomologous end joining    The repair of a double-strand break in DNA in which the two broken ends are rejoined with little or no sequence complementarity. Information at the DNA ends may be lost due to the modification of broken DNA ends. This term covers instances of separate pathways, called classical (or canonical) and alternative nonhomologous end joining (C-NHEJ and A-NHEJ). These in turn may further branch into sub-pathways, but evidence is still unclear.
    GO:0031047    gene silencing by RNA    Any process in which RNA molecules inactivate expression of target genes.
    GO:0045814    negative regulation of gene expression, epigenetic    Any epigenetic process that stops, prevents or reduces the rate of gene expression.
    GO:0045653    negative regulation of megakaryocyte differentiation    Any process that stops, prevents, or reduces the frequency, rate or extent of megakaryocyte differentiation.
    GO:0006334    nucleosome assembly    The aggregation, arrangement and bonding together of a nucleosome, the beadlike structural units of eukaryotic chromatin composed of histones and DNA.
    GO:0045815    positive regulation of gene expression, epigenetic    Any epigenetic process that activates or increases the rate of gene expression.
    GO:0051290    protein heterotetramerization    The formation of a protein heterotetramer, a macromolecular structure consisting of four noncovalently associated subunits, of which not all are identical.
    GO:0016233    telomere capping    A process in which telomeres are protected from degradation and fusion, thereby ensuring chromosome stability by protecting the ends from both degradation and from being recognized as damaged DNA. May be mediated by specific single- or double-stranded telomeric DNA binding proteins.
    GO:0032200    telomere organization    A process that is carried out at the cellular level which results in the assembly, arrangement of constituent parts, or disassembly of telomeres, terminal regions of a linear chromosome that include the telomeric DNA repeats and associated proteins.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0070062    extracellular exosome    A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.
    GO:0000228    nuclear chromosome    A chromosome that encodes the nuclear genome and is found in the nucleus of a eukaryotic cell during the cell cycle phases when the nucleus is intact.
    GO:0000784    nuclear chromosome, telomeric region    The terminal region of a linear nuclear chromosome that includes the telomeric DNA repeats and associated proteins.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0043234    protein complex    A stable macromolecular complex composed (only) of two or more polypeptide subunits along with any covalently attached molecules (such as lipid anchors or oligosaccharide) or non-protein prosthetic groups (such as nucleotides or metal ions). Prosthetic group in this context refers to a tightly bound cofactor. The component polypeptide subunits may be identical.

Chain C,G   (H2A1B_HUMAN | P04908)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0006342    chromatin silencing    Repression of transcription by altering the structure of chromatin, e.g. by conversion of large regions of DNA into an inaccessible state often called heterochromatin.
    GO:0008285    negative regulation of cell proliferation    Any process that stops, prevents or reduces the rate or extent of cell proliferation.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0070062    extracellular exosome    A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
    GO:0000790    nuclear chromatin    The ordered and organized complex of DNA, protein, and sometimes RNA, that forms the chromosome in the nucleus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

Chain D,H   (H2B1J_HUMAN | P06899)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0019731    antibacterial humoral response    An immune response against bacteria mediated through a body fluid. Examples of this process are the antibacterial humoral responses in Mus musculus and Drosophila melanogaster.
    GO:0050830    defense response to Gram-positive bacterium    Reactions triggered in response to the presence of a Gram-positive bacterium that act to protect the cell or organism.
    GO:0042742    defense response to bacterium    Reactions triggered in response to the presence of a bacterium that act to protect the cell or organism.
    GO:0002227    innate immune response in mucosa    Any process of the innate immune response that takes place in the mucosal tissues.
    GO:0006334    nucleosome assembly    The aggregation, arrangement and bonding together of a nucleosome, the beadlike structural units of eukaryotic chromatin composed of histones and DNA.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0005615    extracellular space    That part of a multicellular organism outside the cells proper, usually taken to be outside the plasma membranes, and occupied by fluid.
    GO:0000788    nuclear nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA in the nucleus into higher order structures.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

Chain E   (H33_HUMAN | P84243)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0000979    RNA polymerase II core promoter sequence-specific DNA binding    Interacting selectively and non-covalently with the regulatory region composed of the transcription start site and binding sites for transcription factors of the RNA polymerase II basal transcription machinery.
    GO:0000980    RNA polymerase II distal enhancer sequence-specific DNA binding    Interacting selectively and non-covalently with a RNA polymerase II (Pol II) distal enhancer. In mammalian cells, enhancers are distal sequences that increase the utilization of some promoters, and can function in either orientation and in any location (upstream or downstream) relative to the core promoter.
    GO:0042393    histone binding    Interacting selectively and non-covalently with a histone, any of a group of water-soluble proteins found in association with the DNA of eukaroytic chromosomes. They are involved in the condensation and coiling of chromosomes during cell division and have also been implicated in nonspecific suppression of gene activity.
    GO:0031492    nucleosomal DNA binding    Interacting selectively and non-covalently with the DNA portion of a nucleosome.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
biological process
    GO:0006336    DNA replication-independent nucleosome assembly    The formation of nucleosomes outside the context of DNA replication.
    GO:0007596    blood coagulation    The sequential process in which the multiple coagulation factors of the blood interact, ultimately resulting in the formation of an insoluble fibrin clot; it may be divided into three stages: stage 1, the formation of intrinsic and extrinsic prothrombin converting principle; stage 2, the formation of thrombin; stage 3, the formation of stable fibrin polymers.
    GO:0007420    brain development    The process whose specific outcome is the progression of the brain over time, from its formation to the mature structure. Brain development begins with patterning events in the neural tube and ends with the mature structure that is the center of thought and emotion. The brain is responsible for the coordination and control of bodily activities and the interpretation of information from the senses (sight, hearing, smell, etc.).
    GO:0044267    cellular protein metabolic process    The chemical reactions and pathways involving a specific protein, rather than of proteins in general, occurring at the level of an individual cell. Includes cellular protein modification.
    GO:0000183    chromatin silencing at rDNA    Repression of transcription of ribosomal DNA by altering the structure of chromatin.
    GO:0031047    gene silencing by RNA    Any process in which RNA molecules inactivate expression of target genes.
    GO:0045814    negative regulation of gene expression, epigenetic    Any epigenetic process that stops, prevents or reduces the rate of gene expression.
    GO:0006334    nucleosome assembly    The aggregation, arrangement and bonding together of a nucleosome, the beadlike structural units of eukaryotic chromatin composed of histones and DNA.
    GO:0030307    positive regulation of cell growth    Any process that activates or increases the frequency, rate, extent or direction of cell growth.
    GO:0045815    positive regulation of gene expression, epigenetic    Any epigenetic process that activates or increases the rate of gene expression.
    GO:0009725    response to hormone    Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a hormone stimulus.
    GO:0032200    telomere organization    A process that is carried out at the cellular level which results in the assembly, arrangement of constituent parts, or disassembly of telomeres, terminal regions of a linear chromosome that include the telomeric DNA repeats and associated proteins.
cellular component
    GO:0005694    chromosome    A structure composed of a very long molecule of DNA and associated proteins (e.g. histones) that carries hereditary information.
    GO:0070062    extracellular exosome    A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0000228    nuclear chromosome    A chromosome that encodes the nuclear genome and is found in the nucleus of a eukaryotic cell during the cell cycle phases when the nucleus is intact.
    GO:0000784    nuclear chromosome, telomeric region    The terminal region of a linear nuclear chromosome that includes the telomeric DNA repeats and associated proteins.
    GO:0000788    nuclear nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA in the nucleus into higher order structures.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0000786    nucleosome    A complex comprised of DNA wound around a multisubunit core and associated proteins, which forms the primary packing unit of DNA into higher order structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0043234    protein complex    A stable macromolecular complex composed (only) of two or more polypeptide subunits along with any covalently attached molecules (such as lipid anchors or oligosaccharide) or non-protein prosthetic groups (such as nucleotides or metal ions). Prosthetic group in this context refers to a tightly bound cofactor. The component polypeptide subunits may be identical.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
(no "Ligands, Modified Residues, Ions" information available for 3wtp)
 
  Sites
(no "Sites" information available for 3wtp)
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 3wtp)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3wtp
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  CENPA_HUMAN | P49450
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  H2A1B_HUMAN | P04908
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  H2B1J_HUMAN | P06899
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  H33_HUMAN | P84243
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  H4_HUMAN | P62805
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  CENPA_HUMAN | P49450
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  H2A1B_HUMAN | P04908
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  H2B1J_HUMAN | P06899
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  H33_HUMAN | P84243
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  H4_HUMAN | P62805
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        CENPA_HUMAN | P494503an2 3nqj 3nqu 3r45 5cvd
        H2A1B_HUMAN | P049082cv5 2rvq 3a6n 3afa 3an2 3av1 3av2 3ayw 3aze 3azf 3azg 3azh 3azi 3azj 3azk 3azl 3azm 3azn 3w96 3w97 3w98 3w99 3wkj 3x1s 3x1v 4ym5 4ym6 4z5t 5av5 5av6 5av8 5av9 5avb 5avc 5ay8 5b0y 5b0z 5b24 5b2i 5b2j 5b31 5b32 5b40 5cpi 5cpj 5cpk 5gse 5gtc 5gxq 5jrg 5vey 5x7x
        H2B1J_HUMAN | P068992rvq 3a6n 3afa 3an2 3av1 3av2 3ayw 3aze 3azf 3azg 3azh 3azi 3azj 3azk 3azl 3azm 3azn 3w96 3w97 3w98 3w99 3wa9 3waa 4cay 4ym5 4ym6 4z5t 5av5 5av6 5av8 5av9 5avb 5avc 5ay8 5b0y 5b0z 5b24 5b2i 5b2j 5b31 5b32 5b33 5b40 5cpi 5cpj 5cpk 5fug 5gse 5gtc 5gxq 5jrg 5vey 5x7x
        H33_HUMAN | P842431pdq 2l43 3ask 3asl 3av2 3jvk 3muk 3mul 3ql9 3qla 3qlc 4gne 4gnf 4gng 4gu0 4gur 4gus 4gy5 4h9n 4h9o 4h9p 4h9q 4h9r 4h9s 4hga 4l58 4n4i 4o62 4qq4 4tmp 4u7t 4w5a 5ay8 5b32 5b33 5bnv 5bnx 5dwq 5dx0 5ja4 5jjy 5jlb 5kdm 5x7x
        H4_HUMAN | P628051kx4 1kx5 1m18 1m19 1m1a 1s32 1zkk 2bqz 2cv5 2ig0 2kwn 2kwo 2lvm 2qqs 2rje 2rny 2rs9 3a6n 3afa 3an2 3av1 3av2 3ayw 3aze 3azf 3azg 3azh 3azi 3azj 3azk 3azl 3azm 3azn 3cfs 3cfv 3f9w 3f9x 3f9y 3f9z 3ij1 3jpx 3nqj 3nqu 3o36 3qby 3qzs 3qzt 3qzv 3r45 3uvw 3uvx 3uvy 3uw9 3w96 3w97 3w98 3w99 3wa9 3waa 3wkj 3x1s 3x1t 3x1u 3x1v 4gqb 4h9n 4h9o 4h9p 4h9q 4h9r 4h9s 4hga 4m38 4n3w 4n4f 4qut 4quu 4qyd 4u9w 4ym5 4ym6 4yy6 4yyd 4yyg 4yyh 4yyi 4yyj 4yyk 4yym 4yyn 4z2m 4z5t 5av5 5av6 5av8 5av9 5avb 5avc 5ay8 5b0y 5b0z 5b24 5b2i 5b2j 5b31 5b32 5b33 5b40 5bnv 5bnx 5bo0 5c3i 5cpi 5cpj 5cpk 5fa5 5ffw 5fwe 5gse 5gsu 5gt0 5gt3 5gtc 5gxq 5ja4 5jrg 5kdm 5teg 5x7x

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 3WTP)