Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  RNA POLYMERASE ALPHA C-TERMINAL DOMAIN (E. COLI) AND SIGMA REGION 4 (T. AQ. MUTANT) BOUND TO (UP,-35 ELEMENT) DNA
 
Authors :  S. Lara-Gonzalez, J. J. Birktoft, C. L. Lawson
Date :  28 May 10  (Deposition) - 08 Jun 11  (Release) - 08 Jun 11  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  3.25
Chains :  Asym./Biol. Unit :  A,B,C,D,M,N
Keywords :  Protein-Protein Interactions, Protein-Dna Interactions, Transcription Initiation, Gene Regulation-Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  S. Lara-Gonzalez, J. J. Birktoft, C. L. Lawson
Combined Structural Model For A Transcription Activation Assembly
To Be Published
PubMed: search

(-) Compounds

Molecule 1 - RNA POLYMERASE SIGMA FACTOR
    ChainsA, D
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET
    Expression System StrainBL21 DE3
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentSIGMA SUBUNIT REGION 4, RESIDUES 366-438
    GeneRPOS
    MutationYES
    Organism ScientificTHERMUS AQUATICUS
    Organism Taxid271
 
Molecule 2 - DNA-DIRECTED RNA POLYMERASE SUBUNIT ALPHA
    ChainsB, C
    EC Number2.7.7.6
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET
    Expression System StrainBL21 DE3
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentALPHA SUBUNIT C-TERMINAL DOMAIN, RESIDUES 246-329
    GeneB3295, JW3257, PEZ, PHS, RPOA, SEZ
    Organism ScientificESCHERICHIA COLI
    Organism Taxid83333
    StrainK12
    SynonymRNAP SUBUNIT ALPHA, TRANSCRIPTASE SUBUNIT ALPHA, RNA POLYMERASE SUBUNIT ALPHA
 
Molecule 3 - DNA (5'- D(*TP*GP*GP*AP*AP*AP*AP*AP*AP*GP*TP*AP*CP*TP*TP*GP*AP*CP*AP*TP*GP*G)- 3')
    ChainsM
    EngineeredYES
    SyntheticYES
 
Molecule 4 - DNA (5'- D(*CP*CP*AP*TP*GP*TP*CP*AP*AP*GP*TP*AP*CP*TP*TP*TP*TP*TP*TP*CP*C)- 3')
    ChainsN
    EngineeredYES
    SyntheticYES

 Structural Features

(-) Chains, Units

  123456
Asymmetric/Biological Unit ABCDMN

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 1)

Asymmetric/Biological Unit (1, 1)
No.NameCountTypeFull Name
1ACT1Ligand/IonACETATE ION

(-) Sites  (1, 1)

Asymmetric Unit (1, 1)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREARG C:265 , ASN C:294 , DA M:6 , DT N:19 , DC N:20BINDING SITE FOR RESIDUE ACT C 1

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3N97)

(-) Cis Peptide Bonds  (2, 2)

Asymmetric/Biological Unit
No.Residues
1Trp B:321 -Pro B:322
2Trp C:321 -Pro C:322

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 3N97)

(-) PROSITE Motifs  (1, 2)

Asymmetric/Biological Unit (1, 2)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1SIGMA70_2PS00716 Sigma-70 factors family signature 2.SIGA_THEAQ397-423
 
  2A:397-423
D:397-423

(-) Exons   (0, 0)

(no "Exon" information available for 3N97)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:53
 aligned with SIGA_THEAQ | Q9EZJ8 from UniProtKB/Swiss-Prot  Length:438

    Alignment length:53
                                   383       393       403       413       423   
           SIGA_THEAQ   374 SKLSEREAMVLKLRKGLIDGREHTLEEVGAYFGVTRERIRQIENKALRKLKYH 426
               SCOP domains ----------------------------------------------------- SCOP domains
               CATH domains ----------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------- Pfam domains
         Sec.struct. author ...hhhhhhhhhhhhh.......hhhhhhhhhh.hhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------- SAPs(SNPs)
                    PROSITE -----------------------SIGMA70_2  PDB: A:397-423  --- PROSITE
                 Transcript ----------------------------------------------------- Transcript
                 3n97 A 374 SKLSEREAMVLKMRKGLIDGREHTLEEVGAYFGVTRERIRQIENKALRKLRHP 426
                                   383       393       403       413       423   

Chain B from PDB  Type:PROTEIN  Length:75
 aligned with RPOA_ECOLI | P0A7Z4 from UniProtKB/Swiss-Prot  Length:329

    Alignment length:75
                                   259       269       279       289       299       309       319     
           RPOA_ECOLI   250 DPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLENWPPA 324
               SCOP domains --------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ......hhhhhh.hhhhhhhhhh....hhhhhhh.hhhhhhhh...hhhhhhhhhhhhhh............... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------- Transcript
                 3n97 B 250 DPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLENWPPA 324
                                   259       269       279       289       299       309       319     

Chain C from PDB  Type:PROTEIN  Length:74
 aligned with RPOA_ECOLI | P0A7Z4 from UniProtKB/Swiss-Prot  Length:329

    Alignment length:74
                                   259       269       279       289       299       309       319    
           RPOA_ECOLI   250 DPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLENWPP 323
               SCOP domains -------------------------------------------------------------------------- SCOP domains
               CATH domains -------------------------------------------------------------------------- CATH domains
           Pfam domains (1) RNA_pol_A_CTD-3n97C01 C:250-309                             -------------- Pfam domains (1)
           Pfam domains (2) RNA_pol_A_CTD-3n97C02 C:250-309                             -------------- Pfam domains (2)
         Sec.struct. author ......hhhhhh.hhhhhhhhhh....hhhhhhh.hhhhhhhh...hhhhhhhhhhhhhh.............. Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -------------------------------------------------------------------------- PROSITE
                 Transcript -------------------------------------------------------------------------- Transcript
                 3n97 C 250 DPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLENWPP 323
                                   259       269       279       289       299       309       319    

Chain D from PDB  Type:PROTEIN  Length:50
 aligned with SIGA_THEAQ | Q9EZJ8 from UniProtKB/Swiss-Prot  Length:438

    Alignment length:50
                                   384       394       404       414       424
           SIGA_THEAQ   375 KLSEREAMVLKLRKGLIDGREHTLEEVGAYFGVTRERIRQIENKALRKLK 424
               SCOP domains -------------------------------------------------- SCOP domains
               CATH domains -------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------- Pfam domains
         Sec.struct. author ..hhhhhhhhhhhhh.......hhhhhhhhhh.hhhhhhhhhhhhhhhhh Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------SIGMA70_2  PDB: D:397-423  - PROSITE
                 Transcript -------------------------------------------------- Transcript
                 3n97 D 375 KLSEREAMVLKMRKGLIDGREHTLEEVGAYFGVTRERIRQIENKALRKLR 424
                                   384       394       404       414       424

Chain M from PDB  Type:DNA  Length:22
                                                      
                 3n97 M   1 TGGAAAAAAGTACTTGACATGG  22
                                    10        20  

Chain N from PDB  Type:DNA  Length:21
                                                     
                 3n97 N   1 CCATGTCAAGTACTTTTTTCC  21
                                    10        20 

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 3N97)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3N97)

(-) Pfam Domains  (1, 2)

Asymmetric/Biological Unit

(-) Gene Ontology  (17, 20)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A,D   (SIGA_THEAQ | Q9EZJ8)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0016987    sigma factor activity    A sigma factor is the promoter specificity subunit of eubacterial-type multisubunit RNA polymerases, those whose core subunit composition is often described as alpha(2)-beta-beta-prime. (This type of multisubunit RNA polymerase complex is known to be found in eubacteria and plant plastids). Although sigma does not bind DNA on its own, when combined with the core to form the holoenzyme, this binds specifically to promoter sequences, with the sigma factor making sequence specific contacts with the promoter elements. The sigma subunit is released from the elongating form of the polymerase and is thus free to act catalytically for multiple RNA polymerase core enzymes.
    GO:0003700    transcription factor activity, sequence-specific DNA binding    Interacting selectively and non-covalently with a specific DNA sequence in order to modulate transcription. The transcription factor may or may not also interact selectively with a protein or macromolecular complex.
biological process
    GO:0006352    DNA-templated transcription, initiation    Any process involved in the assembly of the RNA polymerase preinitiation complex (PIC) at the core promoter region of a DNA template, resulting in the subsequent synthesis of RNA from that promoter. The initiation phase includes PIC assembly and the formation of the first few bonds in the RNA chain, including abortive initiation, which occurs when the first few nucleotides are repeatedly synthesized and then released. The initiation phase ends just before and does not include promoter clearance, or release, which is the transition between the initiation and elongation phases of transcription.
    GO:0010468    regulation of gene expression    Any process that modulates the frequency, rate or extent of gene expression. Gene expression is the process in which a gene's coding sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA or circRNA (for protein-coding genes) and the translation of that mRNA or circRNA into protein. Protein maturation is included when required to form an active form of a product from an inactive precursor form.
    GO:0006355    regulation of transcription, DNA-templated    Any process that modulates the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0001123    transcription initiation from bacterial-type RNA polymerase promoter    Any process involved in the assembly of bacterial-type RNA polymerase preinitiation complex (PIC) at the core promoter region of a DNA template, resulting in the subsequent synthesis of RNA from that promoter. The initiation phase includes PIC assembly and the formation of the first few bonds in the RNA chain, including abortive initiation, which occurs when the first few nucleotides are repeatedly synthesized and then released. The initiation phase ends just before and does not include promoter clearance, or release, which is the transition between the initiation and elongation phases of transcription.
    GO:0006351    transcription, DNA-templated    The cellular synthesis of RNA on a template of DNA.
cellular component
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.

Chain B,C   (RPOA_ECOLI | P0A7Z4)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0003899    DNA-directed 5'-3' RNA polymerase activity    Catalysis of the reaction: nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1). Utilizes a DNA template, i.e. the catalysis of DNA-template-directed extension of the 3'-end of an RNA strand by one nucleotide at a time. Can initiate a chain 'de novo'.
    GO:0016779    nucleotidyltransferase activity    Catalysis of the transfer of a nucleotidyl group to a reactant.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0046983    protein dimerization activity    The formation of a protein dimer, a macromolecular structure consists of two noncovalently associated identical or nonidentical subunits.
    GO:0016740    transferase activity    Catalysis of the transfer of a group, e.g. a methyl group, glycosyl group, acyl group, phosphorus-containing, or other groups, from one compound (generally regarded as the donor) to another compound (generally regarded as the acceptor). Transferase is the systematic name for any enzyme of EC class 2.
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0006351    transcription, DNA-templated    The cellular synthesis of RNA on a template of DNA.
cellular component
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0005829    cytosol    The part of the cytoplasm that does not contain organelles but which does contain other particulate matter, such as protein complexes.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    ACT  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Trp B:321 - Pro B:322   [ RasMol ]  
    Trp C:321 - Pro C:322   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3n97
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  RPOA_ECOLI | P0A7Z4
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  SIGA_THEAQ | Q9EZJ8
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  2.7.7.6
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  RPOA_ECOLI | P0A7Z4
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  SIGA_THEAQ | Q9EZJ8
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        RPOA_ECOLI | P0A7Z41bdf 1coo 1lb2 1xs9 2jzb 3iyd 3k4g 3lu0 3n4m 4jk1 4jk2 4kmu 4kn4 4kn7 4mex 4mey 4s20 4xsx 4xsy 4xsz 4yg2 4yln 4ylo 4ylp 4zh2 4zh3 4zh4 5byh 5ciz 5ezk 5ipl 5ipm 5ipn 5ms0 5my1 5nsr 5nss 5uac 5uag 5uah 5uaj 5ual 5uaq 5up6 5upc 5vsw 5w1s 5w1t
        SIGA_THEAQ | Q9EZJ81ku2 1ku3 1ku7 1l9u 1l9z 1rio 3les 3lev 3ugo 3ugp 4ki2 4xln 4xlp 4xlq 4xlr 4xls 5tjg

(-) Related Entries Specified in the PDB File

3k4g CRYSTAL STRUCTURE OF E. COLI RNA POLYMERASE ALPHA SUBUNIT C -TERMINAL DOMAIN
3n4m E. COLI RNA POLYMERASE ALPHA SUBUNIT C-TERMINAL DOMAIN IN COMPLEX WITH CAP AND DNA