Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
(-)Biological Unit 2
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)
Image Biological Unit 2
Biological Unit 2  (Jmol Viewer)

(-) Description

Title :  STRUCTURE OF A CYS2HIS2 ZINC FINGER/TATA BOX COMPLEX (TATAZF;CLONE #6)
 
Authors :  S. A. Wolfe, R. A. Grant, M. Elrod-Erickson, C. O. Pabo
Date :  18 Oct 00  (Deposition) - 07 Sep 01  (Release) - 24 Feb 09  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.00
Chains :  Asym. Unit :  A,B,C,D,E,F
Biol. Unit 1:  A,B,C  (1x)
Biol. Unit 2:  D,E,F  (1x)
Keywords :  Phage Display, Zinc Finger-Dna Complex, Tata Box, Cys2His2, Transcription/Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  S. A. Wolfe, R. A. Grant, M. Elrod-Erickson, C. O. Pabo
Beyond The "Recognition Code": Structures Of Two Cys2His2 Zinc Finger/Tata Box Complexes.
Structure V. 9 717 2001
PubMed-ID: 11587646  |  Reference-DOI: 10.1016/S0969-2126(01)00632-3
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - 5'- D(*GP*AP*CP*GP*CP*TP*AP*TP*AP*AP*AP*AP*GP*GP*AP*G)-3'
    ChainsA, D
    EngineeredYES
    SyntheticYES
 
Molecule 2 - 5'- D(*TP*CP*CP*TP*TP*TP*TP*AP*TP*AP*GP*CP*GP*TP*CP*C)-3'
    ChainsB, E
    EngineeredYES
    SyntheticYES
 
Molecule 3 - TATA BOX ZINC FINGER PROTEIN
    ChainsC, F
    EngineeredYES
    Expression SystemESCHERICHIA COLI BL21(DE3)
    Expression System PlasmidPET21D
    Expression System StrainBL21DE3
    Expression System Taxid469008
    Expression System Vector TypePLASMID
    Organism CommonHOUSE MOUSE
    Organism ScientificMUS MUSCULUS
    Organism Taxid10090
    Other DetailsPROTEIN SELECTED BY PHAGE DISPLAY

 Structural Features

(-) Chains, Units

  123456
Asymmetric Unit ABCDEF
Biological Unit 1 (1x)ABC   
Biological Unit 2 (1x)   DEF

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 6)

Asymmetric Unit (1, 6)
No.NameCountTypeFull Name
1ZN6Ligand/IonZINC ION
Biological Unit 1 (0, 0)
No.NameCountTypeFull Name
1ZN-1Ligand/IonZINC ION
Biological Unit 2 (0, 0)
No.NameCountTypeFull Name
1ZN-1Ligand/IonZINC ION

(-) Sites  (6, 6)

Asymmetric Unit (6, 6)
No.NameEvidenceResiduesDescription
1AC1SOFTWARECYS C:107 , CYS C:112 , HIS C:125 , HIS C:129BINDING SITE FOR RESIDUE ZN C 301
2AC2SOFTWARECYS C:137 , CYS C:140 , HIS C:153 , HIS C:157BINDING SITE FOR RESIDUE ZN C 302
3AC3SOFTWARECYS C:165 , CYS C:168 , HIS C:181 , HIS C:185BINDING SITE FOR RESIDUE ZN C 303
4AC4SOFTWARECYS F:207 , CYS F:212 , HIS F:225 , HIS F:229BINDING SITE FOR RESIDUE ZN F 304
5AC5SOFTWARECYS F:237 , CYS F:240 , HIS F:253 , HIS F:257BINDING SITE FOR RESIDUE ZN F 305
6AC6SOFTWARECYS F:265 , CYS F:268 , HIS F:281 , HIS F:285BINDING SITE FOR RESIDUE ZN F 306

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 1G2F)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 1G2F)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 1G2F)

(-) PROSITE Motifs  (1, 6)

Asymmetric Unit (1, 6)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1ZINC_FINGER_C2H2_1PS00028 Zinc finger C2H2 type domain signature.EGR1_MOUSE338-360
 
368-388
 
396-416
 
  6C:107-129
F:207-229
C:137-157
F:237-257
C:165-185
F:265-285
Biological Unit 1 (1, 3)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1ZINC_FINGER_C2H2_1PS00028 Zinc finger C2H2 type domain signature.EGR1_MOUSE338-360
 
368-388
 
396-416
 
  3C:107-129
-
C:137-157
-
C:165-185
-
Biological Unit 2 (1, 3)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1ZINC_FINGER_C2H2_1PS00028 Zinc finger C2H2 type domain signature.EGR1_MOUSE338-360
 
368-388
 
396-416
 
  3-
F:207-229
-
F:237-257
-
F:265-285

(-) Exons   (0, 0)

(no "Exon" information available for 1G2F)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:DNA  Length:16
                                                
                 1g2f A   2 GACGCTATAAAAGGAG  17
                                    11      

Chain B from PDB  Type:DNA  Length:16
                                                
                 1g2f B  18 TCCTTTTATAGCGTCC  33
                                    27      

Chain C from PDB  Type:PROTEIN  Length:89
 aligned with EGR1_MOUSE | P08046 from UniProtKB/Swiss-Prot  Length:533

    Alignment length:89
                                   341       351       361       371       381       391       401       411         
           EGR1_MOUSE   332 HERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTGEKPFACDICGRKFARSDERKRHTKIHLRQK 420
               SCOP domains d1g2fc_ C: TATA box-binding zinc finger, TATAZF                                           SCOP domains
               CATH domains --1g2fC01 C:103-132             1g2fC02 C:133-159          1g2fC03 C:160-186          --- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....ee........ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhhh.. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------ZINC_FINGER_C2H2_1     -------ZINC_FINGER_C2H2_1   -------ZINC_FINGER_C2H2_1   ---- PROSITE
                 Transcript ----------------------------------------------------------------------------------------- Transcript
                 1g2f C 101 MERPYACPVESCDRRFSQKTNLDTHIRIHTGQKPFQCRICMRNFSQQASLNAHIRTHTGEKPFACDICGRKFATLHTRTRHTKIHLRQK 189
                                   110       120       130       140       150       160       170       180         

Chain D from PDB  Type:DNA  Length:16
                                                
                 1g2f D  52 GACGCTATAAAAGGAG  67
                                    61      

Chain E from PDB  Type:DNA  Length:16
                                                
                 1g2f E  68 TCCTTTTATAGCGTCC  83
                                    77      

Chain F from PDB  Type:PROTEIN  Length:87
 aligned with EGR1_MOUSE | P08046 from UniProtKB/Swiss-Prot  Length:533

    Alignment length:87
                                   342       352       362       372       382       392       402       412       
           EGR1_MOUSE   333 ERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTGEKPFACDICGRKFARSDERKRHTKIHLRQ 419
               SCOP domains d1g2ff_ F: TATA box-binding zinc finger, TATAZF                                         SCOP domains
               CATH domains -1g2fF01 F:203-232             1g2fF02 F:233-259          1g2fF03 F:260-286          -- CATH domains
               Pfam domains --------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...ee........ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhh.. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -----ZINC_FINGER_C2H2_1     -------ZINC_FINGER_C2H2_1   -------ZINC_FINGER_C2H2_1   --- PROSITE
                 Transcript --------------------------------------------------------------------------------------- Transcript
                 1g2f F 202 ERPYACPVESCDRRFSQKTNLDTHIRIHTGQKPFQCRICMRNFSQQASLNAHIRTHTGEKPFACDICGRKFATLHTRTRHTKIHLRQ 288
                                   211       221       231       241       251       261       271       281       

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 2)

Asymmetric Unit

(-) CATH Domains  (1, 6)

Asymmetric Unit
(-)
Class: Alpha Beta (26913)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 1G2F)

(-) Gene Ontology  (36, 36)

Asymmetric Unit(hide GO term definitions)
Chain C,F   (EGR1_MOUSE | P08046)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0000977    RNA polymerase II regulatory region sequence-specific DNA binding    Interacting selectively and non-covalently with a specific sequence of DNA that is part of a regulatory region that controls the transcription of a gene or cistron by RNA polymerase II.
    GO:0035035    histone acetyltransferase binding    Interacting selectively and non-covalently with the enzyme histone acetyltransferase.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0043565    sequence-specific DNA binding    Interacting selectively and non-covalently with DNA of a specific nucleotide composition, e.g. GC-rich DNA binding, or with a specific sequence motif or type of DNA e.g. promotor binding or rDNA binding.
    GO:0000982    transcription factor activity, RNA polymerase II core promoter proximal region sequence-specific binding    Interacting selectively and non-covalently with a sequence of DNA that is in cis with and relatively close to a core promoter for RNA polymerase II (RNAP II) in order to modulate transcription by RNAP II.
    GO:0003700    transcription factor activity, sequence-specific DNA binding    Interacting selectively and non-covalently with a specific DNA sequence in order to modulate transcription. The transcription factor may or may not also interact selectively with a protein or macromolecular complex.
    GO:0044212    transcription regulatory region DNA binding    Interacting selectively and non-covalently with a DNA region that regulates the transcription of a region of DNA, which may be a gene, cistron, or operon. Binding may occur as a sequence specific interaction or as an interaction observed only once a factor has been recruited to the DNA by other factors.
    GO:0000976    transcription regulatory region sequence-specific DNA binding    Interacting selectively and non-covalently with a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon.
    GO:0001077    transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding    Interacting selectively and non-covalently with a sequence of DNA that is in cis with and relatively close to a core promoter for RNA polymerase II (RNAP II) in order to activate or increase the frequency, rate or extent of transcription from the RNAP II promoter.
biological process
    GO:0030509    BMP signaling pathway    A series of molecular signals initiated by the binding of a member of the BMP (bone morphogenetic protein) family to a receptor on the surface of a target cell, and ending with regulation of a downstream cellular process, e.g. transcription.
    GO:0030217    T cell differentiation    The process in which a precursor cell type acquires characteristics of a more mature T-cell. A T cell is a type of lymphocyte whose definin characteristic is the expression of a T cell receptor complex.
    GO:0071480    cellular response to gamma radiation    Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a gamma radiation stimulus. Gamma radiation is a form of electromagnetic radiation (EMR) or light emission of a specific frequency produced from sub-atomic particle interaction, such as electron-positron annihilation and radioactive decay. Gamma rays are generally characterized as EMR having the highest frequency and energy, and also the shortest wavelength, within the electromagnetic radiation spectrum.
    GO:0071504    cellular response to heparin    Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a heparin stimulus.
    GO:0071506    cellular response to mycophenolic acid    Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a mycophenolic acid stimulus.
    GO:0071310    cellular response to organic substance    Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of an organic substance stimulus.
    GO:0072110    glomerular mesangial cell proliferation    The multiplication or reproduction of glomerular mesangial cells, resulting in the expansion of the population.
    GO:0070498    interleukin-1-mediated signaling pathway    A series of molecular signals initiated by the binding of interleukin-1 to a receptor on the surface of a cell, and ending with regulation of a downstream cellular process, e.g. transcription.
    GO:0090090    negative regulation of canonical Wnt signaling pathway    Any process that decreases the rate, frequency, or extent of the Wnt signaling pathway through beta-catenin, the series of molecular signals initiated by binding of a Wnt protein to a frizzled family receptor on the surface of the target cell, followed by propagation of the signal via beta-catenin, and ending with a change in transcription of target genes.
    GO:0000122    negative regulation of transcription from RNA polymerase II promoter    Any process that stops, prevents, or reduces the frequency, rate or extent of transcription from an RNA polymerase II promoter.
    GO:0072303    positive regulation of glomerular metanephric mesangial cell proliferation    Any process that increases the frequency, rate or extent of metanephric glomerular mesangial cell proliferation.
    GO:0045944    positive regulation of transcription from RNA polymerase II promoter    Any process that activates or increases the frequency, rate or extent of transcription from an RNA polymerase II promoter.
    GO:0045893    positive regulation of transcription, DNA-templated    Any process that activates or increases the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0042981    regulation of apoptotic process    Any process that modulates the occurrence or rate of cell death by apoptotic process.
    GO:0033233    regulation of protein sumoylation    Any process that modulates the frequency, rate or extent of the addition of SUMO groups to a protein.
    GO:0061418    regulation of transcription from RNA polymerase II promoter in response to hypoxia    Any process that modulates the frequency, rate or extent of transcription from an RNA polymerase II promoter as a result of a hypoxia stimulus.
    GO:0006355    regulation of transcription, DNA-templated    Any process that modulates the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0009749    response to glucose    Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a glucose stimulus.
    GO:0032868    response to insulin    Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of an insulin stimulus. Insulin is a polypeptide hormone produced by the islets of Langerhans of the pancreas in mammals, and by the homologous organs of other organisms.
    GO:0035914    skeletal muscle cell differentiation    The process in which a relatively unspecialized cell acquires specialized features of a skeletal muscle cell, a somatic cell located in skeletal muscle.
    GO:0006366    transcription from RNA polymerase II promoter    The synthesis of RNA from a DNA template by RNA polymerase II, originating at an RNA polymerase II promoter. Includes transcription of messenger RNA (mRNA) and certain small nuclear RNAs (snRNAs).
    GO:0006351    transcription, DNA-templated    The cellular synthesis of RNA on a template of DNA.
cellular component
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    ZN  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 1g2f)
 
Biological Units
  Complete Structure
    Biological Unit 1  [ Jena3D ]
    Biological Unit 2  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  1g2f
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  EGR1_MOUSE | P08046
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  EGR1_MOUSE | P08046
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        EGR1_MOUSE | P080461a1f 1a1g 1a1h 1a1i 1a1j 1a1k 1a1l 1aay 1f2i 1g2d 1jk1 1jk2 1llm 1p47 1zaa

(-) Related Entries Specified in the PDB File

1aay 1AAY CONTAINS THE 1.6 A STRUCTURE OF ZIF268
1g2d STRUCTURE OF A CYS2HIS2 ZINC FINGER/TATA BOX COMPLEX (CLONE #2)