Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  EGR1/ZIF268 ZINC FINGERS IN COMPLEX WITH HYDROXYMETHYLATED DNA
 
Authors :  H. Hashimoto, Y. O. Olanrewaju, Y. Zheng, G. G. Wilson, X. Zhang, X. Chen
Date :  11 Aug 14  (Deposition) - 08 Oct 14  (Release) - 19 Nov 14  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.89
Chains :  Asym./Biol. Unit :  A,B,C
Keywords :  Zinc Finger, Transcription, Dna Binding Protein-Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  H. Hashimoto, Y. O. Olanrewaju, Y. Zheng, G. G. Wilson, X. Zhang, X. Cheng
Wilms Tumor Protein Recognizes 5-Carboxylcytosine Within A Specific Dna Sequence.
Genes Dev. V. 28 2304 2014
PubMed-ID: 25258363  |  Reference-DOI: 10.1101/GAD.250746.114

(-) Compounds

Molecule 1 - EARLY GROWTH RESPONSE PROTEIN 1
    ChainsA
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPGEX6P-1, PXC1272
    Expression System StrainBL21(DE3) CODON PLUS
    Expression System Taxid469008
    Expression System Vector TypePLASMID
    FragmentZINC FINGER 1-3
    GeneEGR1, KROX24, ZNF225
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymEGR-1, AT225, NERVE GROWTH FACTOR-INDUCED PROTEIN A, NGFI-A, TRANSCRIPTION FACTOR ETR103, TRANSCRIPTION FACTOR ZIF268, ZINC FINGER PROTEIN 225, ZINC FINGER PROTEIN KROX-24
 
Molecule 2 - DNA (5'-D(*AP*GP*CP*GP*TP*GP*GP*GP*(5HC)P*GP*T)-3')
    ChainsB
    EngineeredYES
    Organism ScientificSYNTHETIC CONSTRUCT
    Organism Taxid32630
    Other DetailsCHEMICAL SYNTHESIS
    SyntheticYES
 
Molecule 3 - DNA (5'-D(*TP*AP*(5HC)P*GP*CP*CP*CP*AP*CP*GP*C)-3')
    ChainsC
    EngineeredYES
    Organism ScientificSYNTHETIC CONSTRUCT
    Organism Taxid32630
    Other DetailsCHEMICAL SYNTHESIS
    SyntheticYES

 Structural Features

(-) Chains, Units

  123
Asymmetric/Biological Unit ABC

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (3, 6)

Asymmetric/Biological Unit (3, 6)
No.NameCountTypeFull Name
15HC2Mod. Nucleotide2'-DEOXY-5-(HYDROXYMETHYL)CYTIDINE 5'-(DIHYDROGENPHOSPHATE)
2K1Ligand/IonPOTASSIUM ION
3ZN3Ligand/IonZINC ION

(-) Sites  (4, 4)

Asymmetric Unit (4, 4)
No.NameEvidenceResiduesDescription
1AC1SOFTWARECYS A:340 , CYS A:345 , HIS A:358 , HIS A:362BINDING SITE FOR RESIDUE ZN A 501
2AC2SOFTWARECYS A:370 , CYS A:373 , HIS A:386 , HIS A:390BINDING SITE FOR RESIDUE ZN A 502
3AC3SOFTWARECYS A:398 , CYS A:401 , HIS A:414 , HIS A:418BINDING SITE FOR RESIDUE ZN A 503
4AC4SOFTWARECYS A:340 , VAL A:342 , CYS A:345BINDING SITE FOR RESIDUE K A 504

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 4R2C)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 4R2C)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 4R2C)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 4R2C)

(-) Exons   (0, 0)

(no "Exon" information available for 4R2C)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:86
                                                                                                                      
               SCOP domains -------------------------------------------------------------------------------------- SCOP domains
               CATH domains -------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...ee........ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhhh...ee......ee.hhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -------------------------------------------------------------------------------------- PROSITE
                 Transcript -------------------------------------------------------------------------------------- Transcript
                 4r2c A 335 ERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTGEKPFACDICGRKFARSDERKRHTKIHLR 420
                                   344       354       364       374       384       394       404       414      

Chain B from PDB  Type:DNA  Length:11
                                           
                 4r2c B   1 AGCGTGGGcGT  11
                                    10 
                                    9-5HC

Chain C from PDB  Type:DNA  Length:11
                                           
                 4r2c C   1 TAcGCCCACGC  11
                              |     10 
                              |        
                              3-5HC    

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 4R2C)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 4R2C)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 4R2C)

(-) Gene Ontology  (38, 38)

Asymmetric/Biological Unit(hide GO term definitions)

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    5HC  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    K  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    ZN  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 4r2c)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  4r2c
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  EGR1_HUMAN | P18146
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  EGR1_HUMAN | P18146
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        EGR1_HUMAN | P181464r2a 4r2d 4x9j

(-) Related Entries Specified in the PDB File

4r2a 4r2d 4r2e 4r2p 4r2q 4r2r 4r2s