Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF THE METHYL CPG BINDING DOMAIN OF MBD4 IN COMPLEX WITH THE 5MCG/TG SEQUENCE IN SPACE GROUP P1
 
Authors :  J. Otani, K. Arita, T. Kato, M. Kinoshita, M. Ariyoshi, M. Shirakawa
Date :  02 Oct 12  (Deposition) - 16 Jan 13  (Release) - 14 Aug 13  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.52
Chains :  Asym./Biol. Unit :  A,B,C,D
Keywords :  Methyl Cpg Binding Domain, Protein-Dna Complex, Versatile Base Recognition, Hydrolase-Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  J. Otani, K. Arita, T. Kato, M. Kinoshita, H. Kimura, I. Suetake, S. Tajima, M. Ariyoshi, M. Shirakawa
Structural Basis Of The Versatile Dna Recognition Ability O The Methyl-Cpg Binding Domain Of Methyl-Cpg Binding Domain Protein 4
J. Biol. Chem. V. 288 6351 2013
PubMed-ID: 23316048  |  Reference-DOI: 10.1074/JBC.M112.431098

(-) Compounds

Molecule 1 - METHYL-CPG-BINDING DOMAIN PROTEIN 4
    ChainsA, D
    EC Number3.2.2.-
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System StrainBL21(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentMETHYL CPG BINDING DOMAIN, UNP RESIDUES 63-136
    GeneMBD4
    Organism CommonMOUSE
    Organism ScientificMUS MUSCULUS
    Organism Taxid10090
    SynonymMETHYL-CPG-BINDING PROTEIN MBD4, MISMATCH-SPECIFIC DNA N- GLYCOSYLASE
 
Molecule 2 - DNA (5'-D(*AP*CP*AP*TP*CP*(5CM)P*GP*GP*TP*GP*A)-3')
    ChainsB
    EngineeredYES
    SyntheticYES
 
Molecule 3 - DNA (5'-D(*TP*CP*AP*CP*TP*GP*GP*AP*TP*GP*T)-3')
    ChainsC
    EngineeredYES
    SyntheticYES

 Structural Features

(-) Chains, Units

  1234
Asymmetric/Biological Unit ABCD

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (3, 5)

Asymmetric/Biological Unit (3, 5)
No.NameCountTypeFull Name
15CM1Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE
2EDO1Ligand/Ion1,2-ETHANEDIOL
3ZN3Ligand/IonZINC ION

(-) Sites  (4, 4)

Asymmetric Unit (4, 4)
No.NameEvidenceResiduesDescription
1AC1SOFTWARECYS A:75 , HOH A:311 , GLU D:120 , HOH D:303BINDING SITE FOR RESIDUE ZN A 201
2AC2SOFTWAREDG B:10 , ARG D:106BINDING SITE FOR RESIDUE EDO B 101
3AC3SOFTWAREGLU A:120 , HOH A:320 , CYS D:75BINDING SITE FOR RESIDUE ZN D 201
4AC4SOFTWAREHIS A:70 , HIS D:70BINDING SITE FOR RESIDUE ZN D 202

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3VYQ)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 3VYQ)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 3VYQ)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 3VYQ)

(-) Exons   (0, 0)

(no "Exon" information available for 3VYQ)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:63
 aligned with MBD4_MOUSE | Q9Z2D7 from UniProtKB/Swiss-Prot  Length:554

    Alignment length:63
                                    79        89        99       109       119       129   
           MBD4_MOUSE    70 HKPVPCGWERVVKQRLSGKTAGKFDVYFISPQGLKFRSKRSLANYLLKNGETFLKPEDFNFTV 132
               SCOP domains --------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------- Pfam domains
         Sec.struct. author ........eeeeee.........eeeeee.....ee.hhhhhhhhhhhh.....hhhhh.... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------- Transcript
                 3vyq A  70 HKPVPCGWERVVKQRLSGKTAGKFDVYFISPQGLKFRSKRSLANYLLKNGETFLKPEDFNFTV 132
                                    79        89        99       109       119       129   

Chain B from PDB  Type:DNA  Length:10
                                          
                 3vyq B   2 CATCcGGTGA  11
                                |   11
                                6-5CM 

Chain C from PDB  Type:DNA  Length:11
                                           
                 3vyq C   1 TCACTGGATGT  11
                                    10 

Chain D from PDB  Type:PROTEIN  Length:63
 aligned with MBD4_MOUSE | Q9Z2D7 from UniProtKB/Swiss-Prot  Length:554

    Alignment length:63
                                    79        89        99       109       119       129   
           MBD4_MOUSE    70 HKPVPCGWERVVKQRLSGKTAGKFDVYFISPQGLKFRSKRSLANYLLKNGETFLKPEDFNFTV 132
               SCOP domains --------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------- Pfam domains
         Sec.struct. author ........eeeeee.........eeeeee.....ee.hhhhhhhhhhhh.....hhhhh.... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------- Transcript
                 3vyq D  70 HKPVPCGWERVVKQRLSGKTAGKFDVYFISPQGLKFRSKRSLANYLLKNGETFLKPEDFNFTV 132
                                    79        89        99       109       119       129   

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 3VYQ)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3VYQ)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 3VYQ)

(-) Gene Ontology  (14, 14)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A,D   (MBD4_MOUSE | Q9Z2D7)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0003824    catalytic activity    Catalysis of a biochemical reaction at physiological temperatures. In biologically catalyzed reactions, the reactants are known as substrates, and the catalysts are naturally occurring macromolecular substances known as enzymes. Enzymes possess specific binding sites for substrates, and are usually composed wholly or largely of protein, but RNA that has catalytic activity (ribozyme) is often also regarded as enzymatic.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0008263    pyrimidine-specific mismatch base pair DNA N-glycosylase activity    Catalysis of the removal of mismatched pyrimidine bases in DNA. Enzymes with this activity recognize and remove pyrimidines present in mismatches by cleaving the N-C1' glycosidic bond between the target damaged DNA base and the deoxyribose sugar. The reaction releases a free base and leaves an apyrimidinic (AP) site.
biological process
    GO:0006306    DNA methylation    The covalent transfer of a methyl group to either N-6 of adenine or C-5 or N-4 of cytosine.
    GO:0006281    DNA repair    The process of restoring DNA after damage. Genomes are subject to damage by chemical and physical agents in the environment (e.g. UV and ionizing radiations, chemical mutagens, fungal and bacterial toxins, etc.) and by free radicals or alkylating agents endogenously generated in metabolism. DNA is also damaged because of errors during its replication. A variety of different DNA repair pathways have been reported that include direct reversal, base excision repair, nucleotide excision repair, photoreactivation, bypass, double-strand break repair pathway, and mismatch repair pathway.
    GO:0006974    cellular response to DNA damage stimulus    Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a stimulus indicating damage to its DNA from environmental insults or errors during metabolism.
    GO:0008630    intrinsic apoptotic signaling pathway in response to DNA damage    A series of molecular signals in which an intracellular signal is conveyed to trigger the apoptotic death of a cell. The pathway is induced by the detection of DNA damage, and ends when the execution phase of apoptosis is triggered.
    GO:0007095    mitotic G2 DNA damage checkpoint    A mitotic cell cycle checkpoint that detects and negatively regulates progression through the G2/M transition of the cell cycle in response to DNA damage.
    GO:0009314    response to radiation    Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of an electromagnetic radiation stimulus. Electromagnetic radiation is a propagating wave in space with electric and magnetic components. These components oscillate at right angles to each other and to the direction of propagation.
cellular component
    GO:0000785    chromatin    The ordered and organized complex of DNA, protein, and sometimes RNA, that forms the chromosome.
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    5CM  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    EDO  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    ZN  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 3vyq)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3vyq
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  MBD4_MOUSE | Q9Z2D7
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  3.2.2.-
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  MBD4_MOUSE | Q9Z2D7
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        MBD4_MOUSE | Q9Z2D71ngn 3vxv 3vxx 3vyb 4evv 4ew0 4ew4

(-) Related Entries Specified in the PDB File

3vxv 3vxx 3vyb