1G2F | PD0187 - JenaLib

Asymmetric Unit (Jmol Viewer)

Asym. Unit - sites (Jmol Viewer)

Biological Unit 1 (Jmol Viewer)

Biol. Unit 1 - sites (Jmol Viewer)

Biological Unit 2 (Jmol Viewer)

Title :   STRUCTURE OF A CYS2HIS2 ZINC FINGER/TATA BOX COMPLEX (TATAZF;CLONE #6)

Authors :   S. A. Wolfe, R. A. Grant, M. Elrod-Erickson, C. O. Pabo

Date :   18 Oct 00 (Deposition) - 07 Sep 01 (Release) - 24 Feb 09 (Revision)

Method :   X-RAY DIFFRACTION

Resolution :   2.00

Chains :   Asym. Unit : A,B,C,D,E,F
Biol. Unit 1: A,B,C  (1x)
Biol. Unit 2: D,E,F  (1x)

Keywords :   Phage Display, Zinc Finger-Dna Complex, Tata Box, Cys2His2, Transcription/Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google)] )

Reference :   S. A. Wolfe, R. A. Grant, M. Elrod-Erickson, C. O. Pabo
Beyond The "Recognition Code": Structures Of Two Cys2His2 Zinc Finger/Tata Box Complexes.
Structure V. 9 717 2001
PubMed-ID: 11587646 | Reference-DOI: 10.1016/S0969-2126(01)00632-3
(for further references see the PDB file header)

*Molecule 1 - 5'- D(GPAPCPGPCPTPAPTPAPAPAPAPGPGPAPG)-3'*
	Chains	:	A, D
	Engineered	:	YES
	Synthetic	:	YES

*Molecule 2 - 5'- D(TPCPCPTPTPTPTPAPTPAPGPCPGPTPCPC)-3'*
	Chains	:	B, E
	Engineered	:	YES
	Synthetic	:	YES

Molecule 3 - TATA BOX ZINC FINGER PROTEIN
	Chains	:	C, F
	Engineered	:	YES
	Expression System	:	ESCHERICHIA COLI BL21(DE3)
	Expression System Plasmid	:	PET21D
	Expression System Strain	:	BL21DE3
	Expression System Taxid	:	469008
	Expression System Vector Type	:	PLASMID
	Organism Common	:	HOUSE MOUSE
	Organism Scientific	:	MUS MUSCULUS
	Organism Taxid	:	10090
	Other Details	:	PROTEIN SELECTED BY PHAGE DISPLAY

		1	2	3	4	5	6
Asymmetric Unit	:	A	B	C	D	E	F
Biological Unit 1 (1x)	:	A	B	C
Biological Unit 2 (1x)	:				D	E	F

Summary Information (see also Sequences/Alignments below)

Asymmetric Unit (1, 6)

No.	Name	Count	Type	Full Name
1	ZN	6	Ligand/Ion	ZINC ION

Biological Unit 1 (0, 0)

No.	Name	Count	Type	Full Name
1	ZN	-1	Ligand/Ion	ZINC ION

Biological Unit 2 (0, 0)

No.	Name	Count	Type	Full Name
1	ZN	-1	Ligand/Ion	ZINC ION

Asymmetric Unit (6, 6)

No.	Name	Evidence	Residues	Description
1	AC1	SOFTWARE	CYS C:107 , CYS C:112 , HIS C:125 , HIS C:129	BINDING SITE FOR RESIDUE ZN C 301
2	AC2	SOFTWARE	CYS C:137 , CYS C:140 , HIS C:153 , HIS C:157	BINDING SITE FOR RESIDUE ZN C 302
3	AC3	SOFTWARE	CYS C:165 , CYS C:168 , HIS C:181 , HIS C:185	BINDING SITE FOR RESIDUE ZN C 303
4	AC4	SOFTWARE	CYS F:207 , CYS F:212 , HIS F:225 , HIS F:229	BINDING SITE FOR RESIDUE ZN F 304
5	AC5	SOFTWARE	CYS F:237 , CYS F:240 , HIS F:253 , HIS F:257	BINDING SITE FOR RESIDUE ZN F 305
6	AC6	SOFTWARE	CYS F:265 , CYS F:268 , HIS F:281 , HIS F:285	BINDING SITE FOR RESIDUE ZN F 306

(no "SS Bond" information available for 1G2F)

(no "Cis Peptide Bond" information available for 1G2F)

(no "SAP(SNP)/Variant" information available for 1G2F)

Asymmetric Unit (1, 6)

	PROSITE			UniProtKB		PDB
No.	ID	AC	Description	ID	Location	Count	Location
1	ZINC_FINGER_C2H2_1	PS00028	Zinc finger C2H2 type domain signature.	EGR1_MOUSE	338-360 368-388 396-416	6	C:107-129 F:207-229 C:137-157 F:237-257 C:165-185 F:265-285

Biological Unit 1 (1, 3)

	PROSITE			UniProtKB		PDB
No.	ID	AC	Description	ID	Location	Count	Location
1	ZINC_FINGER_C2H2_1	PS00028	Zinc finger C2H2 type domain signature.	EGR1_MOUSE	338-360 368-388 396-416	3	C:107-129 - C:137-157 - C:165-185 -

Biological Unit 2 (1, 3)

	PROSITE			UniProtKB		PDB
No.	ID	AC	Description	ID	Location	Count	Location
1	ZINC_FINGER_C2H2_1	PS00028	Zinc finger C2H2 type domain signature.	EGR1_MOUSE	338-360 368-388 396-416	3	- F:207-229 - F:237-257 - F:265-285

(no "Exon" information available for 1G2F)

Asymmetric Unit

Reformat:	Number of residues per line = ('0' or empty: single-line sequence representation)
	Number of residues per labelling interval =
	UniProt sequence: complete aligned part

Show mapping:	SCOP domains	CATH domains	Pfam domains	Secondary structure (by author)
	SAPs(SNPs)	PROSITE motifs	Exons
	(details for a mapped element are shown in a popup box when the mouse pointer rests over it)

Chain A from PDB  Type:DNA  Length:16
                                                
                 1g2f A   2 GACGCTATAAAAGGAG  17
                                    11

Chain B from PDB  Type:DNA  Length:16
                                                
                 1g2f B  18 TCCTTTTATAGCGTCC  33
                                    27

Chain C from PDB  Type:PROTEIN  Length:89
 aligned with EGR1_MOUSE | P08046 from UniProtKB/Swiss-Prot  Length:533

    Alignment length:89
                                   341       351       361       371       381       391       401       411         
           EGR1_MOUSE   332 HERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTGEKPFACDICGRKFARSDERKRHTKIHLRQK 420
               SCOP domains d1g2fc_ C: TATA box-binding zinc finger, TATAZF                                           SCOP domains
               CATH domains --1g2fC01 C:103-132             1g2fC02 C:133-159          1g2fC03 C:160-186          --- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....ee........ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhhh.. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------ZINC_FINGER_C2H2_1     -------ZINC_FINGER_C2H2_1   -------ZINC_FINGER_C2H2_1   ---- PROSITE
                 Transcript ----------------------------------------------------------------------------------------- Transcript
                 1g2f C 101 MERPYACPVESCDRRFSQKTNLDTHIRIHTGQKPFQCRICMRNFSQQASLNAHIRTHTGEKPFACDICGRKFATLHTRTRHTKIHLRQK 189
                                   110       120       130       140       150       160       170       180

Chain D from PDB  Type:DNA  Length:16
                                                
                 1g2f D  52 GACGCTATAAAAGGAG  67
                                    61

Chain E from PDB  Type:DNA  Length:16
                                                
                 1g2f E  68 TCCTTTTATAGCGTCC  83
                                    77

Chain F from PDB  Type:PROTEIN  Length:87
 aligned with EGR1_MOUSE | P08046 from UniProtKB/Swiss-Prot  Length:533

    Alignment length:87
                                   342       352       362       372       382       392       402       412       
           EGR1_MOUSE   333 ERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTGEKPFACDICGRKFARSDERKRHTKIHLRQ 419
               SCOP domains d1g2ff_ F: TATA box-binding zinc finger, TATAZF                                         SCOP domains
               CATH domains -1g2fF01 F:203-232             1g2fF02 F:233-259          1g2fF03 F:260-286          -- CATH domains
               Pfam domains --------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...ee........ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhh.. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -----ZINC_FINGER_C2H2_1     -------ZINC_FINGER_C2H2_1   -------ZINC_FINGER_C2H2_1   --- PROSITE
                 Transcript --------------------------------------------------------------------------------------- Transcript
                 1g2f F 202 ERPYACPVESCDRRFSQKTNLDTHIRIHTGQKPFQCRICMRNFSQQASLNAHIRTHTGEKPFACDICGRKFATLHTRTRHTKIHLRQ 288
                                   211       221       231       241       251       261       271       281

Legend:		→ Mismatch	(orange background)
	-	→ Gap	(green background, '-', border residues have a numbering label)
		→ Modified Residue	(blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
	x	→ Chemical Group	(purple background, 'x', labelled with number + name, e.g. ACE or NH2)
	extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '\|'

Asymmetric Unit

Classes(

)

Folds(

)

Superfamilies( (-)

)

Families(

)

Protein Domains( (-)

)

Organisms( (-)

)

Class: Designed proteins (167)

Fold: Zinc finger design (10)

Superfamily: Zinc finger design (10)

Family: Zinc finger design (10)

Protein domain: TATA box-binding zinc finger, TATAZF (2)

Mouse (Mus musculus), Zif268 based [TaxId: 10090] (2)

1a	d1g2fc_	C:
1b	d1g2ff_	F:

Asymmetric Unit

Classes(

)

Architectures( (-)

)

Topologies( (-)

)

Homologous Superfamilies( (-)

)

Organisms( (-)

)

Class: Alpha Beta (26913)

Architecture: 2-Layer Sandwich (8480)

Topology: Double Stranded RNA Binding Domain (204)

Homologous Superfamily: Classic Zinc Finger (45)

[unclassified] (29)

1a	1g2fC02	C:133-159
1b	1g2fF02	F:233-259
1c	1g2fC03	C:160-186
1d	1g2fF03	F:260-286
1e	1g2fC01	C:103-132
1f	1g2fF01	F:203-232

(no "Pfam Domain" information available for 1G2F)

Asymmetric Unit(hide GO term definitions)

Chain C,F (EGR1_MOUSE | P08046)

molecular function
	GO:0003677	DNA binding	Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
	GO:0000977	RNA polymerase II regulatory region sequence-specific DNA binding	Interacting selectively and non-covalently with a specific sequence of DNA that is part of a regulatory region that controls the transcription of a gene or cistron by RNA polymerase II.
	GO:0035035	histone acetyltransferase binding	Interacting selectively and non-covalently with the enzyme histone acetyltransferase.
	GO:0046872	metal ion binding	Interacting selectively and non-covalently with any metal ion.
	GO:0003676	nucleic acid binding	Interacting selectively and non-covalently with any nucleic acid.
	GO:0043565	sequence-specific DNA binding	Interacting selectively and non-covalently with DNA of a specific nucleotide composition, e.g. GC-rich DNA binding, or with a specific sequence motif or type of DNA e.g. promotor binding or rDNA binding.
	GO:0000982	transcription factor activity, RNA polymerase II core promoter proximal region sequence-specific binding	Interacting selectively and non-covalently with a sequence of DNA that is in cis with and relatively close to a core promoter for RNA polymerase II (RNAP II) in order to modulate transcription by RNAP II.
	GO:0003700	transcription factor activity, sequence-specific DNA binding	Interacting selectively and non-covalently with a specific DNA sequence in order to modulate transcription. The transcription factor may or may not also interact selectively with a protein or macromolecular complex.
	GO:0044212	transcription regulatory region DNA binding	Interacting selectively and non-covalently with a DNA region that regulates the transcription of a region of DNA, which may be a gene, cistron, or operon. Binding may occur as a sequence specific interaction or as an interaction observed only once a factor has been recruited to the DNA by other factors.
	GO:0000976	transcription regulatory region sequence-specific DNA binding	Interacting selectively and non-covalently with a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon.
	GO:0001077	transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding	Interacting selectively and non-covalently with a sequence of DNA that is in cis with and relatively close to a core promoter for RNA polymerase II (RNAP II) in order to activate or increase the frequency, rate or extent of transcription from the RNAP II promoter.
biological process
	GO:0030509	BMP signaling pathway	A series of molecular signals initiated by the binding of a member of the BMP (bone morphogenetic protein) family to a receptor on the surface of a target cell, and ending with regulation of a downstream cellular process, e.g. transcription.
	GO:0030217	T cell differentiation	The process in which a precursor cell type acquires characteristics of a more mature T-cell. A T cell is a type of lymphocyte whose definin characteristic is the expression of a T cell receptor complex.
	GO:0071480	cellular response to gamma radiation	Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a gamma radiation stimulus. Gamma radiation is a form of electromagnetic radiation (EMR) or light emission of a specific frequency produced from sub-atomic particle interaction, such as electron-positron annihilation and radioactive decay. Gamma rays are generally characterized as EMR having the highest frequency and energy, and also the shortest wavelength, within the electromagnetic radiation spectrum.
	GO:0071504	cellular response to heparin	Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a heparin stimulus.
	GO:0071506	cellular response to mycophenolic acid	Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a mycophenolic acid stimulus.
	GO:0071310	cellular response to organic substance	Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of an organic substance stimulus.
	GO:0072110	glomerular mesangial cell proliferation	The multiplication or reproduction of glomerular mesangial cells, resulting in the expansion of the population.
	GO:0070498	interleukin-1-mediated signaling pathway	A series of molecular signals initiated by the binding of interleukin-1 to a receptor on the surface of a cell, and ending with regulation of a downstream cellular process, e.g. transcription.
	GO:0090090	negative regulation of canonical Wnt signaling pathway	Any process that decreases the rate, frequency, or extent of the Wnt signaling pathway through beta-catenin, the series of molecular signals initiated by binding of a Wnt protein to a frizzled family receptor on the surface of the target cell, followed by propagation of the signal via beta-catenin, and ending with a change in transcription of target genes.
	GO:0000122	negative regulation of transcription from RNA polymerase II promoter	Any process that stops, prevents, or reduces the frequency, rate or extent of transcription from an RNA polymerase II promoter.
	GO:0072303	positive regulation of glomerular metanephric mesangial cell proliferation	Any process that increases the frequency, rate or extent of metanephric glomerular mesangial cell proliferation.
	GO:0045944	positive regulation of transcription from RNA polymerase II promoter	Any process that activates or increases the frequency, rate or extent of transcription from an RNA polymerase II promoter.
	GO:0045893	positive regulation of transcription, DNA-templated	Any process that activates or increases the frequency, rate or extent of cellular DNA-templated transcription.
	GO:0042981	regulation of apoptotic process	Any process that modulates the occurrence or rate of cell death by apoptotic process.
	GO:0033233	regulation of protein sumoylation	Any process that modulates the frequency, rate or extent of the addition of SUMO groups to a protein.
	GO:0061418	regulation of transcription from RNA polymerase II promoter in response to hypoxia	Any process that modulates the frequency, rate or extent of transcription from an RNA polymerase II promoter as a result of a hypoxia stimulus.
	GO:0006355	regulation of transcription, DNA-templated	Any process that modulates the frequency, rate or extent of cellular DNA-templated transcription.
	GO:0009749	response to glucose	Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a glucose stimulus.
	GO:0032868	response to insulin	Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of an insulin stimulus. Insulin is a polypeptide hormone produced by the islets of Langerhans of the pancreas in mammals, and by the homologous organs of other organisms.
	GO:0035914	skeletal muscle cell differentiation	The process in which a relatively unspecialized cell acquires specialized features of a skeletal muscle cell, a somatic cell located in skeletal muscle.
	GO:0006366	transcription from RNA polymerase II promoter	The synthesis of RNA from a DNA template by RNA polymerase II, originating at an RNA polymerase II promoter. Includes transcription of messenger RNA (mRNA) and certain small nuclear RNAs (snRNAs).
	GO:0006351	transcription, DNA-templated	The cellular synthesis of RNA on a template of DNA.
cellular component
	GO:0005737	cytoplasm	All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
	GO:0005654	nucleoplasm	That part of the nuclear content other than the chromosomes or the nucleolus.
	GO:0005634	nucleus	A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

Asymmetric Unit
	Complete Structure
		Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
		WebMol \| AstexViewer[tm]@PDBe (Java Applets, require no local installation except for Java; loading may be slow)
		STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
		RasMol (require local installation)
		Molscript (VRML) (requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)

	Ligands, Modified Residues, Ions
		ZN	[ RasMol \| Jena3D ]	+environment [ RasMol \| Jena3D ]

	Sites
		AC1	[ RasMol ]	+environment [ RasMol ]
		AC2	[ RasMol ]	+environment [ RasMol ]
		AC3	[ RasMol ]	+environment [ RasMol ]
		AC4	[ RasMol ]	+environment [ RasMol ]
		AC5	[ RasMol ]	+environment [ RasMol ]
		AC6	[ RasMol ]	+environment [ RasMol ]

	Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 1g2f)

Biological Units
	Complete Structure
		Biological Unit 1	[ Jena3D ]
		Biological Unit 2	[ Jena3D ]

Jmol
	protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick [ Asym. Unit PNG format \| Asym. Unit - sites PNG format \| Biol. Unit 1 PNG format \| Biol. Unit 1 - sites PNG format \| Biol. Unit 2 PNG format ](automatic orientation, automatically generated)
Molscript
	protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick [ mono PDF format \| stereo PDF format ](default orientation, automatically generated)

Access by PDB/NDB ID
	1g2f
		Family and Domain Information	:	ProDom \| SYSTERS
		General Structural Information	:	GlycoscienceDB \| MMDB \| NDB \| OCA \| PDB \| PDBe \| PDBj \| PDBsum \| PDBWiki \| PQS \| PROTEOPEDIA
		Orientation in Membranes	:	OPM
		Protein Surface	:	SURFACE
		Secondary Structure	:	DSSP (structure derived) \| HSSP (homology derived)
		Structural Genomics	:	GeneCensus
		Structural Neighbours	:	CE \| VAST
		Structure Classification	:	CATH \| Dali \| SCOP
		Validation and Original Data	:	BMRB Data View \| BMRB Restraints Grid \| EDS \| PROCHECK \| RECOORD \| WHAT_CHECK

Access by UniProt ID/Accession number
	EGR1_MOUSE \| P08046
		Comparative Protein Structure Models	:	ModBase
		Genomic Information	:	Ensembl
		Protein-protein Interaction	:	DIP
		Sequence, Family and Domain Information	:	InterPro \| Pfam \| SMART \| UniProtKB/SwissProt

Access by Enzyme Classificator (EC Number)
	(no 'Enzyme Classificator' available)
		General Enzyme Information	:	BRENDA \| EC-PDB \| Enzyme \| IntEnz
		Pathway	:	KEGG \| MetaCyc

Access by Disease Identifier (MIM ID)
	(no 'MIM ID' available)
		Disease Information	:	OMIM

Access by GenAge ID
	(no 'GenAge ID' available)
		Age Related Information	:	GenAge

Access by PDB/NDB ID
		Domain Information	:	XDom
		Interatomic Contacts of Structural Units	:	CSU
		Ligand-protein Contacts	:	LPC
		Protein Cavities	:	castP
		Sequence and Secondary Structure	:	PDBCartoon
		Structure Alignment	:	STRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
		Structure and Sequence Browser	:	STING

Access by UniProt ID/Accession number
	EGR1_MOUSE \| P08046
		Protein Disorder Prediction	:	DisEMBL \| FoldIndex \| GLOBPLOT (for more information see DisProt)

1aay	1AAY CONTAINS THE 1.6 A STRUCTURE OF ZIF268
1g2d	STRUCTURE OF A CYS2HIS2 ZINC FINGER/TATA BOX COMPLEX (CLONE #2)

Description

Compounds

Structural Features

Chains, Units

Ligands, Modified Residues, Ions (1, 6)

Sites (6, 6)

SS Bonds (0, 0)

Cis Peptide Bonds (0, 0)

Sequence-Structure Mapping

SAPs(SNPs)/Variants (0, 0)

PROSITE Motifs (1, 6)

Exons (0, 0)

Sequences/Alignments

Classification and Annotation

SCOP Domains (1, 2)

CATH Domains (1, 6)

Pfam Domains (0, 0)

Gene Ontology (36, 36)

Visualization

Interactive Views

Still Images

Databases and Analysis Tools

Databases

Analysis Tools

Related Entries

Entries Sharing at Least One Protein Chain (UniProt ID)

Related Entries Specified in the PDB File