Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit - manually
(-)Asym./Biol. Unit
collapse expand < >
Image Asym./Biol. Unit - manually
Asym./Biol. Unit - manually  (Jmol Viewer)
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF HOXA9 AND PBX1 HOMEODOMAINS BOUND TO DNA
 
Authors :  N. A. Laronde-Leblanc, C. Wolberger
Date :  24 Jun 03  (Deposition) - 02 Sep 03  (Release) - 24 Feb 09  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.90
Chains :  Asym./Biol. Unit :  A,B,D,E
Keywords :  Homeodomian, Protein-Dna Complex, Hox Hexapeptide, Tale Homeodomain, Homeodomain Interaction, Transcription/Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  N. A. Laronde-Leblanc, C. Wolberger
Structure Of Hoxa9 And Pbx1 Bound To Dna: Hox Hexapeptide And Dna Recognition Anterior To Posterior
Genes Dev. V. 17 2060 2003
PubMed-ID: 12923056  |  Reference-DOI: 10.1101/GAD.1103303
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - 5'- D(*AP*CP*TP*CP*TP*AP*TP*GP*AP*TP*TP*TP*AP*CP*GP*AP*CP*GP*CP *T)-3'
    ChainsD
    EngineeredYES
    SyntheticYES
 
Molecule 2 - 5'- D(*TP*AP*GP*CP*GP*TP*CP*GP*TP*AP*AP*AP*TP*CP*AP*TP*AP*GP*AP *G)-3'
    ChainsE
    EngineeredYES
    SyntheticYES
 
Molecule 3 - HOMEOBOX PROTEIN HOX-A9
    ChainsA
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET28A+
    Expression System StrainBL21(DE3) CODON + -RIL
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentRESIDUES 193-269
    GeneHOXA9
    Organism CommonHOUSE MOUSE
    Organism ScientificMUS MUSCULUS
    Organism Taxid10090
    Other DetailsHOMEODOMAIN
    SynonymHOX-1.7
 
Molecule 4 - PRE-B-CELL LEUKEMIA TRANSCRIPTION FACTOR-1
    ChainsB
    EngineeredYES
    Expression SystemESCHERICHIA COLI BL21(DE3)
    Expression System PlasmidPET11D
    Expression System StrainBL21(DE3)
    Expression System Taxid469008
    Expression System Vector TypePLASMID
    FragmentRESIDUES 233-305
    GenePBX1
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    Other DetailsHOMEODOMAIN
    SynonymHOMEOBOX PROTEIN PBX1, HOMEOBOX PROTEIN PRL

 Structural Features

(-) Chains, Units

  1234
Asymmetric/Biological Unit ABDE

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (0, 0)

(no "Ligand,Modified Residues,Ions" information available for 1PUF)

(-) Sites  (0, 0)

(no "Site" information available for 1PUF)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 1PUF)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 1PUF)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 1PUF)

(-) PROSITE Motifs  (2, 3)

Asymmetric/Biological Unit (2, 3)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1HOMEOBOX_2PS50071 'Homeobox' domain profile.HXA9_MOUSE203-263  1A:203-263
2HOMEOBOX_1PS00027 'Homeobox' domain signature.HXA9_MOUSE238-261  1A:238-261
PBX1_HUMAN269-292  1B:269-292

(-) Exons   (0, 0)

(no "Exon" information available for 1PUF)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:77
 aligned with HXA9_MOUSE | P09631 from UniProtKB/Swiss-Prot  Length:271

    Alignment length:77
                                   202       212       222       232       242       252       262       
           HXA9_MOUSE   193 NNPAANWLHARSTRKKRCPYTKHQTLELEKEFLFNMYLTRDRRYEVARLLNLTERQVKIWFQNRRMKMKKINKDRAK 269
               SCOP domains d1pufa_ A: Homeobox protein hox-a9                                            SCOP domains
               CATH domains 1pufA00 A:193-269 Homeodomain-like                                            CATH domains
               Pfam domains -------------Homeobox-1pufA01 A:206-262                               ------- Pfam domains
         Sec.struct. author ....................hhhhhhhhhhhhhhh...hhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------- SAPs(SNPs)
                PROSITE (1) ----------HOMEOBOX_2  PDB: A:203-263 UniProt: 203-263                  ------ PROSITE (1)
                PROSITE (2) ---------------------------------------------HOMEOBOX_1              -------- PROSITE (2)
                 Transcript ----------------------------------------------------------------------------- Transcript
                 1puf A 193 NNPAANWLHARSTRKKRCPYTKHQTLELEKEFLFNMYLTRDRRYEVARLLNLTERQVKIWFQNRRMKMKKINKDRAK 269
                                   202       212       222       232       242       252       262       

Chain B from PDB  Type:PROTEIN  Length:73
 aligned with PBX1_HUMAN | P40424 from UniProtKB/Swiss-Prot  Length:430

    Alignment length:73
                                   242       252       262       272       282       292       302   
           PBX1_HUMAN   233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRYKKNIGKFQEEANIY 305
               SCOP domains d1pufb_ B: pbx1                                                           SCOP domains
               CATH domains 1pufB00 B:233-305 Homeodomain-like                                        CATH domains
               Pfam domains -Homeobox-1pufB01 B:234-293                                  ------------ Pfam domains
         Sec.struct. author ........hhhhhhhhhhhhhhh......hhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhh.....hhhhhh Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------HOMEOBOX_1              ------------- PROSITE
                 Transcript ------------------------------------------------------------------------- Transcript
                 1puf B 233 ARRKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRYKKNIGKFQEEANIY 305
                                   242       252       262       272       282       292       302   

Chain D from PDB  Type:DNA  Length:20
                                                    
                 1puf D   1 ACTCTATGATTTACGACGCT  20
                                    10        20

Chain E from PDB  Type:DNA  Length:20
                                                    
                 1puf E  21 TAGCGTCGTAAATCATAGAG  40
                                    30        40

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (2, 2)

Asymmetric/Biological Unit

(-) CATH Domains  (1, 2)

Asymmetric/Biological Unit

(-) Pfam Domains  (2, 2)

Asymmetric/Biological Unit
(-)
Clan: HTH (544)

(-) Gene Ontology  (50, 62)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A   (HXA9_MOUSE | P09631)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0019899    enzyme binding    Interacting selectively and non-covalently with any enzyme.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0043565    sequence-specific DNA binding    Interacting selectively and non-covalently with DNA of a specific nucleotide composition, e.g. GC-rich DNA binding, or with a specific sequence motif or type of DNA e.g. promotor binding or rDNA binding.
biological process
    GO:0009952    anterior/posterior pattern specification    The regionalization process in which specific areas of cell differentiation are determined along the anterior-posterior axis. The anterior-posterior axis is defined by a line that runs from the head or mouth of an organism to the tail or opposite end of the organism.
    GO:0060216    definitive hemopoiesis    A second wave of blood cell production that, in vertebrates, generates long-term hemopoietic stem cells that continously provide erythroid, myeloid and lymphoid lineages throughout adulthood.
    GO:0035115    embryonic forelimb morphogenesis    The process, occurring in the embryo, by which the anatomical structures of the forelimb are generated and organized. The forelimbs are the front limbs of an animal, e.g. the arms of a human.
    GO:0048706    embryonic skeletal system development    The process, occurring during the embryonic phase, whose specific outcome is the progression of the skeleton over time, from its formation to the mature structure.
    GO:0042118    endothelial cell activation    The change in morphology and behavior of an endothelial cell resulting from exposure to a cytokine, chemokine, cellular ligand, or soluble factor.
    GO:0008584    male gonad development    The process whose specific outcome is the progression of the male gonad over time, from its formation to the mature structure.
    GO:0030879    mammary gland development    The process whose specific outcome is the progression of the mammary gland over time, from its formation to the mature structure. The mammary gland is a large compound sebaceous gland that in female mammals is modified to secrete milk. Its development starts with the formation of the mammary line and ends as the mature gland cycles between nursing and weaning stages.
    GO:0007275    multicellular organism development    The biological process whose specific outcome is the progression of a multicellular organism over time from an initial condition (e.g. a zygote or a young adult) to a later condition (e.g. a multicellular animal or an aged adult).
    GO:0045638    negative regulation of myeloid cell differentiation    Any process that stops, prevents, or reduces the frequency, rate or extent of myeloid cell differentiation.
    GO:0009954    proximal/distal pattern formation    The regionalization process in which specific areas of cell differentiation are determined along a proximal/distal axis. The proximal/distal axis is defined by a line that runs from main body (proximal end) of an organism outward (distal end).
    GO:0010468    regulation of gene expression    Any process that modulates the frequency, rate or extent of gene expression. Gene expression is the process in which a gene's coding sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA or circRNA (for protein-coding genes) and the translation of that mRNA or circRNA into protein. Protein maturation is included when required to form an active form of a product from an inactive precursor form.
    GO:0006355    regulation of transcription, DNA-templated    Any process that modulates the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0007338    single fertilization    The union of male and female gametes to form a zygote.
    GO:0007283    spermatogenesis    The process of formation of spermatozoa, including spermatocytogenesis and spermiogenesis.
    GO:0006351    transcription, DNA-templated    The cellular synthesis of RNA on a template of DNA.
    GO:0060065    uterus development    The reproductive developmental process whose specific outcome is the progression of the uterus over time, from its formation to the mature structure.
cellular component
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0005667    transcription factor complex    A protein complex that is capable of associating with DNA by direct binding, or via other DNA-binding proteins or complexes, and regulating transcription.

Chain B   (PBX1_HUMAN | P40424)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0000978    RNA polymerase II core promoter proximal region sequence-specific DNA binding    Interacting selectively and non-covalently with a sequence of DNA that is in cis with and relatively close to a core promoter for RNA polymerase II.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0046982    protein heterodimerization activity    Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
    GO:0043565    sequence-specific DNA binding    Interacting selectively and non-covalently with DNA of a specific nucleotide composition, e.g. GC-rich DNA binding, or with a specific sequence motif or type of DNA e.g. promotor binding or rDNA binding.
    GO:0003700    transcription factor activity, sequence-specific DNA binding    Interacting selectively and non-covalently with a specific DNA sequence in order to modulate transcription. The transcription factor may or may not also interact selectively with a protein or macromolecular complex.
    GO:0008134    transcription factor binding    Interacting selectively and non-covalently with a transcription factor, any protein required to initiate or regulate transcription.
    GO:0001077    transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding    Interacting selectively and non-covalently with a sequence of DNA that is in cis with and relatively close to a core promoter for RNA polymerase II (RNAP II) in order to activate or increase the frequency, rate or extent of transcription from the RNAP II promoter.
biological process
    GO:0030325    adrenal gland development    The process whose specific outcome is the progression of the adrenal gland over time, from its formation to the mature structure. This gland can either be a discrete structure located bilaterally above each kidney, or a cluster of cells in the head kidney that perform the functions of the adrenal gland. In either case, this organ consists of two cells types, aminergic chromaffin cells and steroidogenic cortical cells.
    GO:0009887    animal organ morphogenesis    Morphogenesis of an animal organ. An organ is defined as a tissue or set of tissues that work together to perform a specific function or functions. Morphogenesis is the process in which anatomical structures are generated and organized. Organs are commonly observed as visibly distinct structures, but may also exist as loosely associated clusters of cells that work together to perform a specific function or functions.
    GO:0009952    anterior/posterior pattern specification    The regionalization process in which specific areas of cell differentiation are determined along the anterior-posterior axis. The anterior-posterior axis is defined by a line that runs from the head or mouth of an organism to the tail or opposite end of the organism.
    GO:0001658    branching involved in ureteric bud morphogenesis    The process in which the branching structure of the ureteric bud is generated and organized. The ureteric bud is an epithelial tube that grows out from the metanephric duct. The bud elongates and branches to give rise to the ureter and kidney collecting tubules.
    GO:0030154    cell differentiation    The process in which relatively unspecialized cells, e.g. embryonic or regenerative cells, acquire specialized structural and/or functional features that characterize the cells, tissues, or organs of the mature organism or some other relatively stable phase of the organism's life history. Differentiation includes the processes involved in commitment of a cell to a specific fate and its subsequent development to the mature state.
    GO:0035162    embryonic hemopoiesis    The stages of blood cell formation that take place within the embryo.
    GO:0030326    embryonic limb morphogenesis    The process, occurring in the embryo, by which the anatomical structures of the limb are generated and organized. A limb is an appendage of an animal used for locomotion or grasping.
    GO:0048568    embryonic organ development    Development, taking place during the embryonic phase, of a tissue or tissues that work together to perform a specific function or functions. Development pertains to the process whose specific outcome is the progression of a structure over time, from its formation to the mature structure. Organs are commonly observed as visibly distinct structures, but may also exist as loosely associated clusters of cells that work together to perform a specific function or functions.
    GO:0048706    embryonic skeletal system development    The process, occurring during the embryonic phase, whose specific outcome is the progression of the skeleton over time, from its formation to the mature structure.
    GO:0007275    multicellular organism development    The biological process whose specific outcome is the progression of a multicellular organism over time from an initial condition (e.g. a zygote or a young adult) to a later condition (e.g. a multicellular animal or an aged adult).
    GO:0045665    negative regulation of neuron differentiation    Any process that stops, prevents, or reduces the frequency, rate or extent of neuron differentiation.
    GO:0043433    negative regulation of sequence-specific DNA binding transcription factor activity    Any process that stops, prevents, or reduces the frequency, rate or extent of the activity of a transcription factor, any factor involved in the initiation or regulation of transcription.
    GO:0010971    positive regulation of G2/M transition of mitotic cell cycle    Any process that increases the rate or extent of progression from G2 phase to M phase of the mitotic cell cycle.
    GO:0008284    positive regulation of cell proliferation    Any process that activates or increases the rate or extent of cell proliferation.
    GO:0045944    positive regulation of transcription from RNA polymerase II promoter    Any process that activates or increases the frequency, rate or extent of transcription from an RNA polymerase II promoter.
    GO:0009954    proximal/distal pattern formation    The regionalization process in which specific areas of cell differentiation are determined along a proximal/distal axis. The proximal/distal axis is defined by a line that runs from main body (proximal end) of an organism outward (distal end).
    GO:0042127    regulation of cell proliferation    Any process that modulates the frequency, rate or extent of cell proliferation.
    GO:0030278    regulation of ossification    Any process that modulates the frequency, rate or extent of bone formation.
    GO:0006355    regulation of transcription, DNA-templated    Any process that modulates the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0007548    sex differentiation    The establishment of the sex of an organism by physical differentiation.
    GO:0035019    somatic stem cell population maintenance    Any process by which an organism retains a population of somatic stem cells, undifferentiated cells in the embryo or adult which can undergo unlimited division and give rise to cell types of the body other than those of the germ-line.
    GO:0048536    spleen development    The process whose specific outcome is the progression of the spleen over time, from its formation to the mature structure. The spleen is a large vascular lymphatic organ composed of white and red pulp, involved both in hemopoietic and immune system functions.
    GO:0006694    steroid biosynthetic process    The chemical reactions and pathways resulting in the formation of steroids, compounds with a 1,2,cyclopentanoperhydrophenanthrene nucleus; includes de novo formation and steroid interconversion by modification.
    GO:0048538    thymus development    The process whose specific outcome is the progression of the thymus over time, from its formation to the mature structure. The thymus is a symmetric bi-lobed organ involved primarily in the differentiation of immature to mature T cells, with unique vascular, nervous, epithelial, and lymphoid cell components.
    GO:0006366    transcription from RNA polymerase II promoter    The synthesis of RNA from a DNA template by RNA polymerase II, originating at an RNA polymerase II promoter. Includes transcription of messenger RNA (mRNA) and certain small nuclear RNAs (snRNAs).
    GO:0006351    transcription, DNA-templated    The cellular synthesis of RNA on a template of DNA.
    GO:0001655    urogenital system development    The process whose specific outcome is the progression of the urogenital system over time, from its formation to the mature structure.
cellular component
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0005667    transcription factor complex    A protein complex that is capable of associating with DNA by direct binding, or via other DNA-binding proteins or complexes, and regulating transcription.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
(no "Ligands, Modified Residues, Ions" information available for 1puf)
 
  Sites
(no "Sites" information available for 1puf)
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 1puf)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  1puf
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  HXA9_MOUSE | P09631
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  PBX1_HUMAN | P40424
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  HXA9_MOUSE | P09631
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  PBX1_HUMAN | P40424
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        PBX1_HUMAN | P404241b72

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 1PUF)