Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Biological Unit 1
(-)Biological Unit 2
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biological Unit 2
Biological Unit 2  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF HUMAN THYN1 PROTEIN IN COMPLEX WITH 5-METHYLCYTOSINE CONTAINING DNA
 
Authors :  L. Halabelian, W. Tempel, Y. Li, C. Bountra, A. M. Edwards, C. H. Arrowsm Structural Genomics Consortium (Sgc)
Date :  30 Mar 16  (Deposition) - 20 Apr 16  (Release) - 20 Apr 16  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.60
Chains :  Asym. Unit :  A,B,C,D,E,F
Biol. Unit 1:  A,E,F  (1x)
Biol. Unit 2:  B,C,D  (1x)
Keywords :  Protein-Dna Complex, Modified Dna, 5-Methylcytosine Containing Dna, Structural Genomics Consortium, Sgc, Nuclear Protein-Dna Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  L. Halabelian, W. Tempel, Y. Li, C. Bountra, A. M. Edwards, C. H. Arrowsmith
Crystal Structure Of Human Thyn1 Protein In Complex With 5-Methylcytosine Containing Dna
To Be Published
PubMed: search

(-) Compounds

Molecule 1 - THYMOCYTE NUCLEAR PROTEIN 1
    ChainsA, B
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET28-MHL
    Expression System StrainBL21(DE3)-V3R-PRARE2
    Expression System Taxid562
    Expression System Vector TypePLASMID
    GeneTHYN1, THY28, HSPC144, MDS012, MY0054
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymTHYMOCYTE PROTEIN THY28
 
Molecule 2 - 5-METHYLCYTOSINE CONTAINING DNA
    ChainsC, D, E, F
    EngineeredYES
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    Other DetailsSYNTHESIZED BY IDTDNA
    Synonym5-METHYLCYTOSINE CONTAINING DNA
    SyntheticYES

 Structural Features

(-) Chains, Units

  123456
Asymmetric Unit ABCDEF
Biological Unit 1 (1x)A   EF
Biological Unit 2 (1x) BCD  

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 12)

Asymmetric Unit (2, 12)
No.NameCountTypeFull Name
15CM4Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE
2UNX8Ligand/IonUNKNOWN ATOM OR ION
Biological Unit 1 (2, 5)
No.NameCountTypeFull Name
15CM2Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE
2UNX3Ligand/IonUNKNOWN ATOM OR ION
Biological Unit 2 (2, 7)
No.NameCountTypeFull Name
15CM2Mod. Nucleotide5-METHYL-2'-DEOXY-CYTIDINE-5'-MONOPHOSPHATE
2UNX5Ligand/IonUNKNOWN ATOM OR ION

(-) Sites  (0, 0)

(no "Site" information available for 5J3E)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 5J3E)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 5J3E)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 5J3E)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 5J3E)

(-) Exons   (0, 0)

(no "Exon" information available for 5J3E)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:172
                                                                                                                                                                                                            
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...eeeeee.....ee..ee...hhhhhhhh...eee.....hhhhhhhhhhh....eeeeee......eeeeeeeeeeeeee.hhhhh.................eeeeeeeeeeeeeeeehhhhhhhhhhhhhhh.....hhhhhh....eeeehhhhhhhhhhhh.... Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5j3e A  53 LSSHWLMKSEPESRLEKGVDVKFSIEDLKAQPKQTTCWDGVRNYQARNFLRAMKLGEEAFFYHSNCKEPGIAGLMKIVKEAYPDHTQFEKNNPHYDPSSKEDNPKWSMVDVQFVRMMKRFIPLAELKSYHQAHKATGGPLKNMVLFTRQRLSIQPLTQEEFDFVLSLEEKEP 224
                                    62        72        82        92       102       112       122       132       142       152       162       172       182       192       202       212       222  

Chain B from PDB  Type:PROTEIN  Length:171
                                                                                                                                                                                                           
               SCOP domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ..eeeeee.....ee..ee...hhhhhhhhhh.eee.....hhhhhhhhhhh....eeeeee......eeeeeeeeeeeeee.hhhhh.................eeeeeeeeeeeeeeeehhhhhhhhhhhhhhhhhhhhhhhhhh....eeeehhhhhhhhhhhh.... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5j3e B  54 SSHWLMKSEPESRLEKGVDVKFSIEDLKAQPKQTTCWDGVRNYQARNFLRAMKLGEEAFFYHSNCKEPGIAGLMKIVKEAYPDHTQFEKNNPHYDPSSKEDNPKWSMVDVQFVRMMKRFIPLAELKSYHQAHKATGGPLKNMVLFTRQRLSIQPLTQEEFDFVLSLEEKEP 224
                                    63        73        83        93       103       113       123       133       143       153       163       173       183       193       203       213       223 

Chain C from PDB  Type:DNA  Length:12
                                            
                 5j3e C   1 GCCAAxGTTGGC  12
                                 |  10  
                                 6-5CM  

Chain D from PDB  Type:DNA  Length:12
                                            
                 5j3e D   1 GCCAAxGTTGGC  12
                                 |  10  
                                 6-5CM  

Chain E from PDB  Type:DNA  Length:12
                                            
                 5j3e E   1 GCCAAxGTTGGC  12
                                 |  10  
                                 6-5CM  

Chain F from PDB  Type:DNA  Length:12
                                            
                 5j3e F   1 GCCAAxGTTGGC  12
                                 |  10  
                                 6-5CM  

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 5J3E)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 5J3E)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 5J3E)

(-) Gene Ontology  (3, 3)

Asymmetric Unit(hide GO term definitions)

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    5CM  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    UNX  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
(no "Sites" information available for 5j3e)
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 5j3e)
 
Biological Units
  Complete Structure
    Biological Unit 1  [ Jena3D ]
    Biological Unit 2  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  5j3e
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  THYN1_HUMAN | Q9P016
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  THYN1_HUMAN | Q9P016
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        THYN1_HUMAN | Q9P0163eop

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 5J3E)