Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  DOUBLE STRANDED DNA BINDING PROTEIN
 
Authors :  S. Cui, P. Wollmann, M. Moldt, K. -P. Hopfner
Date :  16 Sep 08  (Deposition) - 22 Sep 09  (Release) - 22 Sep 09  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.90
Chains :  Asym./Biol. Unit :  A,B
Keywords :  Tata Box Binding Protein, Dna-Binding, Initiation Factor, Nucleus, Transcription (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  S. Cui, P. Wollmann, M. Moldt, K. -P. Hopfner
Structural Studies Of Ectbp
To Be Published
PubMed: search
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - TATA-BOX-BINDING PROTEIN
    ChainsA, B
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET-28A
    Expression System StrainROSETTA
    Expression System Taxid562
    Expression System Vector TypePLASMID
    Organism ScientificENCEPHALITOZOON CUNICULI
    Organism Taxid6035
    StrainGB-M1

 Structural Features

(-) Chains, Units

  12
Asymmetric/Biological Unit AB

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 6)

Asymmetric/Biological Unit (1, 6)
No.NameCountTypeFull Name
1EDO6Ligand/Ion1,2-ETHANEDIOL

(-) Sites  (6, 6)

Asymmetric Unit (6, 6)
No.NameEvidenceResiduesDescription
1AC1SOFTWARETYR A:140 , TYR A:153 , MET A:155 , VAL A:156 , LYS A:157 , HOH A:3350BINDING SITE FOR RESIDUE EDO A3318
2AC2SOFTWAREVAL A:161 , THR A:173 , GLY A:174 , HOH A:3332 , TYR B:140 , CYS B:141 , SER B:142 , ILE B:152 , HOH B:3327BINDING SITE FOR RESIDUE EDO A3319
3AC3SOFTWAREALA A:134 , SER A:138 , CYS A:141 , SER A:142 , TYR A:143 , GLU A:144 , ASN B:117BINDING SITE FOR RESIDUE EDO A3320
4AC4SOFTWARETYR A:140 , CYS A:141 , SER A:142 , ILE A:152 , HOH A:3333 , VAL B:161 , THR B:173 , GLY B:174BINDING SITE FOR RESIDUE EDO A3321
5AC5SOFTWAREASN A:117 , ALA B:134 , SER B:138 , CYS B:141 , SER B:142 , TYR B:143 , GLU B:144BINDING SITE FOR RESIDUE EDO B3318
6AC6SOFTWAREPHE A:148 , PRO A:149 , PHE B:148 , PRO B:149BINDING SITE FOR RESIDUE EDO B3319

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3EIK)

(-) Cis Peptide Bonds  (4, 4)

Asymmetric/Biological Unit
No.Residues
1Glu A:66 -Pro A:67
2Lys A:157 -Pro A:158
3Glu B:66 -Pro B:67
4Lys B:157 -Pro B:158

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 3EIK)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 3EIK)

(-) Exons   (0, 0)

(no "Exon" information available for 3EIK)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:178
 aligned with Q8ST28_ENCCU | Q8ST28 from UniProtKB/TrEMBL  Length:198

    Alignment length:178
                                    29        39        49        59        69        79        89        99       109       119       129       139       149       159       169       179       189        
         Q8ST28_ENCCU    20 GIIPTLQNVVATVNLSCKLDLKNIALRARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVITGAKSEKSSRMAAQRYAKIIHKLGFNATFDDFKIQNIVSSCDIKFSIRLEGLAYAHSNYCSYEPELFPGLIYRMVKPKIVLLIFVSGKIVLTGAKVRDDIYQAFNNIYPVLIQHRK 197
               SCOP domains d3eika1 A:20-113 automated matches                                                            d3eika2 A:114-197 automated matches                                                  SCOP domains
               CATH domains 3eikA01   3eikA02 A:30-115 TATA-Binding Protein                                                 3eikA01 A:20-29,A:116-197 TATA-Binding Protein                                     CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....eeeeeeeeee.....hhhhhhhhh..ee........eeeee....eeeee....eeeeeee.hhhhhhhhhhhhhhhhhhh......eeeeeeeeeeeee...eehhhhhhhhh...ee........eeeee....eeeee....eeeeeee.hhhhhhhhhhhhhhhhhhhee Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 3eik A  20 GIIPTLQNVVATVNLSCKLDLKNIALRARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVITGAKSEKSSRMAAQRYAKIIHKLGFNATFDDFKIQNIVSSCDIKFSIRLEGLAYAHSNYCSYEPELFPGLIYRMVKPKIVLLIFVSGKIVLTGAKVRDDIYQAFNNIYPVLIQHRK 197
                                    29        39        49        59        69        79        89        99       109       119       129       139       149       159       169       179       189        

Chain B from PDB  Type:PROTEIN  Length:179
 aligned with Q8ST28_ENCCU | Q8ST28 from UniProtKB/TrEMBL  Length:198

    Alignment length:179
                                    29        39        49        59        69        79        89        99       109       119       129       139       149       159       169       179       189         
         Q8ST28_ENCCU    20 GIIPTLQNVVATVNLSCKLDLKNIALRARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVITGAKSEKSSRMAAQRYAKIIHKLGFNATFDDFKIQNIVSSCDIKFSIRLEGLAYAHSNYCSYEPELFPGLIYRMVKPKIVLLIFVSGKIVLTGAKVRDDIYQAFNNIYPVLIQHRKA 198
               SCOP domains d3eikb1 B:20-113 automated matches                                                            d3eikb2 B:114-198 automated matches                                                   SCOP domains
               CATH domains 3eikB01   3eikB02 B:30-115 TATA-Binding Protein                                                 3eikB01 B:20-29,B:116-198 TATA-Binding Protein                                      CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....eeeeeeeeee.....hhhhhhhhh..ee........eeeee....eeeee....eeeeeee.hhhhhhhhhhhhhhhhhhh......eeeeeeeeeeeee...eehhhhhhhhh...ee........eeeee....eeeee....eeeeeee.hhhhhhhhhhhhhhhhhh.ee. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 3eik B  20 GIIPTLQNVVATVNLSCKLDLKNIALRARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVITGAKSEKSSRMAAQRYAKIIHKLGFNATFDDFKIQNIVSSCDIKFSIRLEGLAYAHSNYCSYEPELFPGLIYRMVKPKIVLLIFVSGKIVLTGAKVRDDIYQAFNNIYPVLIQHRKA 198
                                    29        39        49        59        69        79        89        99       109       119       129       139       149       159       169       179       189         

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 4)

Asymmetric/Biological Unit

(-) CATH Domains  (1, 4)

Asymmetric/Biological Unit
(-)
Class: Alpha Beta (26913)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 3EIK)

(-) Gene Ontology  (23, 23)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A,B   (Q8ST28_ENCCU | Q8ST28)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0008301    DNA binding, bending    The activity of binding selectively and non-covalently to and distorting the original structure of DNA, typically a straight helix, into a bend, or increasing the bend if the original structure was intrinsically bent due to its sequence.
    GO:0001179    RNA polymerase I transcription factor binding    Interacting selectively and non-covalently with an RNA polymerase I transcription factor, any protein required to initiate or regulate transcription by RNA polymerase I.
    GO:0001102    RNA polymerase II activating transcription factor binding    Interacting selectively and non-covalently with an RNA polymerase II transcription activating factor, a protein involved in positive regulation of transcription.
    GO:0000979    RNA polymerase II core promoter sequence-specific DNA binding    Interacting selectively and non-covalently with the regulatory region composed of the transcription start site and binding sites for transcription factors of the RNA polymerase II basal transcription machinery.
    GO:0001016    RNA polymerase III regulatory region DNA binding    Interacting selectively and non-covalently with a DNA region that controls the transcription of a gene by RNA polymerase III. Binding may occur as a sequence specific interaction or as an interaction observed only once a factor has been recruited to the DNA by other factors.
    GO:0003682    chromatin binding    Interacting selectively and non-covalently with chromatin, the network of fibers of DNA, protein, and sometimes RNA, that make up the chromosomes of the eukaryotic nucleus during interphase.
    GO:0001186    transcription factor activity, RNA polymerase I transcription factor recruiting    The function of binding to an RNA polymerase I (RNAP I) transcription factor and recruiting it to the transcription machinery complex in order to modulate transcription by RNAP I.
    GO:0000983    transcription factor activity, RNA polymerase II core promoter sequence-specific    Interacting selectively and non-covalently with a specific DNA sequence in an RNA polymerase II (Pol II) core promoter, the region composed of the transcription start site and binding sites for transcription factors of the Pol II basal transcription machinery, in order to modulate transcription by Pol II. The transcription factor may or may not also interact selectively with a protein or macromolecular complex.
    GO:0003743    translation initiation factor activity    Functions in the initiation of ribosome-mediated translation of mRNA into a polypeptide.
biological process
    GO:0006352    DNA-templated transcription, initiation    Any process involved in the assembly of the RNA polymerase preinitiation complex (PIC) at the core promoter region of a DNA template, resulting in the subsequent synthesis of RNA from that promoter. The initiation phase includes PIC assembly and the formation of the first few bonds in the RNA chain, including abortive initiation, which occurs when the first few nucleotides are repeatedly synthesized and then released. The initiation phase ends just before and does not include promoter clearance, or release, which is the transition between the initiation and elongation phases of transcription.
    GO:0051123    RNA polymerase II transcriptional preinitiation complex assembly    The aggregation, arrangement and bonding together of proteins on an RNA polymerase II promoter DNA to form the transcriptional preinitiation complex (PIC), the formation of which is a prerequisite for transcription by RNA polymerase.
    GO:0070898    RNA polymerase III transcriptional preinitiation complex assembly    The aggregation, arrangement and bonding together of proteins on promoter DNA to form the transcriptional preinitiation complex (PIC), the formation of which is a prerequisite for transcription from an RNA polymerase III promoter.
    GO:0006356    regulation of transcription from RNA polymerase I promoter    Any process that modulates the frequency, rate or extent of transcription from an RNA polymerase I promoter.
    GO:0006357    regulation of transcription from RNA polymerase II promoter    Any process that modulates the frequency, rate or extent of transcription from an RNA polymerase II promoter.
    GO:1903357    regulation of transcription initiation from RNA polymerase I promoter for nuclear large rRNA transcript    Any process that modulates the frequency, rate or extent of transcription initiation from RNA polymerase I promoter for nuclear large rRNA transcript.
    GO:0042790    transcription of nuclear large rRNA transcript from RNA polymerase I promoter    The synthesis of the large ribosomal RNA (rRNA) transcript which encodes several rRNAs, e.g. in mammals 28S, 18S and 5.8S, from a nuclear DNA template transcribed by RNA polymerase I.
    GO:0006413    translational initiation    The process preceding formation of the peptide bond between the first two amino acids of a protein. This includes the formation of a complex of the ribosome, mRNA or circRNA, and an initiation complex that contains the first aminoacyl-tRNA.
    GO:0070893    transposon integration    Any process in which a transposable element is incorporated into another DNA molecule such as a chromosome.
cellular component
    GO:0005668    RNA polymerase transcription factor SL1 complex    A RNA polymerase I-specific transcription factor complex that contains the TATA-box-binding protein (TBP) and at least three TBP-associated factors including proteins known in mammals as TAFI110, TAFI63 and TAFI48.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0005669    transcription factor TFIID complex    A complex composed of TATA binding protein (TBP) and TBP associated factors (TAFs); the total mass is typically about 800 kDa. Most of the TAFs are conserved across species. In TATA-containing promoters for RNA polymerase II (Pol II), TFIID is believed to recognize at least two distinct elements, the TATA element and a downstream promoter element. TFIID is also involved in recognition of TATA-less Pol II promoters. Binding of TFIID to DNA is necessary but not sufficient for transcription initiation from most RNA polymerase II promoters.
    GO:0000126    transcription factor TFIIIB complex    A transcription factor complex that is involved in regulating transcription from RNA polymerase III (Pol III) promoters. TFIIIB contains the TATA-binding protein (TBP) and two Pol III-specific proteins, B'' and BRF.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    EDO  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Glu A:66 - Pro A:67   [ RasMol ]  
    Glu B:66 - Pro B:67   [ RasMol ]  
    Lys A:157 - Pro A:158   [ RasMol ]  
    Lys B:157 - Pro B:158   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3eik
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  Q8ST28_ENCCU | Q8ST28
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/TrEMBL
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  Q8ST28_ENCCU | Q8ST28
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/TrEMBL
        Q8ST28_ENCCU | Q8ST283oc3 3oci 4wzs

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 3EIK)