Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)NMR Structure - model 1
(-)NMR Structure - model 1, sites
(-)NMR Structure - all models
collapse expand < >
Image NMR Structure - model 1
NMR Structure - model 1  (Jmol Viewer)
Image NMR Structure - model 1, sites
NMR Structure - model 1, sites  (Jmol Viewer)
Image NMR Structure - all models
NMR Structure - all models  (Jmol Viewer)

(-) Description

Title :  SOLUTION STRUCTURE OF THE THAP-ZINC FINGER OF THAP1
 
Authors :  D. Bessiere, S. Campagne, A. Milon, V. Gervais
Date :  31 Jul 07  (Deposition) - 11 Dec 07  (Release) - 24 Feb 09  (Revision)
Method :  SOLUTION NMR
Resolution :  NOT APPLICABLE
Chains :  NMR Structure  :  A  (20x)
Keywords :  Thap, Zinc Finger, Ccch, Dna-Binding, Metal-Binding, Zinc- Finger, Metal Binding Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  D. Bessiere, C. Lacroix, S. Campagne, V. Ecochard, V. Guillet, L. Mourey, F. Lopez, J. Czaplicki, P. Demange, A. Milon, J. P. Girard, V. Gervais
Structure-Function Analysis Of The Thap Zinc Finger Of Thap1, A Large C2Ch Dna-Binding Module Linked To Rb/E2F Pathways
J. Biol. Chem. V. 283 4352 2008
PubMed-ID: 18073205  |  Reference-DOI: 10.1074/JBC.M707537200
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - THAP DOMAIN-CONTAINING PROTEIN 1
    ChainsA
    EngineeredYES
    Expression SystemESCHERICHIA COLI BL21(DE3)
    Expression System PlasmidPET-26
    Expression System StrainBL21(DE3)
    Expression System Taxid469008
    Expression System Vector TypePLASMID
    FragmentTHAP DOMAIN, RESIDUES 1-82
    GeneTHAP1
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606

 Structural Features

(-) Chains, Units

  
NMR Structure (20x)

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 1)

NMR Structure (1, 1)
No.NameCountTypeFull Name
1ZN1Ligand/IonZINC ION

(-) Sites  (1, 1)

NMR Structure (1, 1)
No.NameEvidenceResiduesDescription
1AC1SOFTWARECYS A:5 , ALA A:7 , CYS A:10BINDING SITE FOR RESIDUE ZN A 88

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 2JTG)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 2JTG)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (29, 29)

NMR Structure (29, 29)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
01UniProtVAR_066677S6FTHAP1_HUMANDisease (DYT6)  ---AS6F
02UniProtVAR_066678S6PTHAP1_HUMANDisease (DYT6)  ---AS6P
03UniProtVAR_067356A7DTHAP1_HUMANDisease (DYT6)  ---AA7D
04UniProtVAR_066679Y8CTHAP1_HUMANDisease (DYT6)  ---AY8C
05UniProtVAR_066680G9CTHAP1_HUMANDisease (DYT6)267607112AG9C
06UniProtVAR_065880N12KTHAP1_HUMANDisease (DYT6)  ---AN12K
07UniProtVAR_066681R13HTHAP1_HUMANDisease (DYT6)  ---AR13H
08UniProtVAR_066682K16ETHAP1_HUMANDisease (DYT6)  ---AK16E
09UniProtVAR_066683D17GTHAP1_HUMANDisease (DYT6)766483829AD17G
10UniProtVAR_067357S21CTHAP1_HUMANDisease (DYT6)  ---AS21C
11UniProtVAR_065881S21TTHAP1_HUMANDisease (DYT6)  ---AS21T
12UniProtVAR_066684H23PTHAP1_HUMANDisease (DYT6)387907177AH23P
13UniProtVAR_066685K24ETHAP1_HUMANDisease (DYT6)387907176AK24E
14UniProtVAR_066686P26LTHAP1_HUMANDisease (DYT6)  ---AP26L
15UniProtVAR_066687P26RTHAP1_HUMANDisease (DYT6)  ---AP26R
16UniProtVAR_065882R29PTHAP1_HUMANDisease (DYT6)  ---AR29P
17UniProtVAR_066688R29QTHAP1_HUMANDisease (DYT6)767952378AR29Q
18UniProtVAR_066689P30RTHAP1_HUMANDisease (DYT6)  ---AP30R
19UniProtVAR_066690L32HTHAP1_HUMANDisease (DYT6)  ---AL32H
20UniProtVAR_065883A39TTHAP1_HUMANDisease (DYT6)  ---AA39T
21UniProtVAR_066691C54FTHAP1_HUMANDisease (DYT6)  ---AC54F
22UniProtVAR_072272E56GTHAP1_HUMANDisease (DYT6)  ---AE56G
23UniProtVAR_066692H57NTHAP1_HUMANDisease (DYT6)  ---AH57N
24UniProtVAR_065884T59ITHAP1_HUMANDisease (DYT6)  ---AT59I
25UniProtVAR_066694L72RTHAP1_HUMANDisease (DYT6)  ---AL72R
26UniProtVAR_066695N75ITHAP1_HUMANDisease (DYT6)  ---AN75I
27UniProtVAR_066696I80VTHAP1_HUMANDisease (DYT6)372080941AI80V
28UniProtVAR_054788F81LTHAP1_HUMANDisease (DYT6)118204013AF81L
29UniProtVAR_066697C83RTHAP1_HUMANDisease (DYT6)768017019AE83R

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)

(-) PROSITE Motifs  (1, 1)

NMR Structure (1, 1)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1ZF_THAPPS50950 Zinc finger THAP-type profileTHAP1_HUMAN1-81  1A:1-81

(-) Exons   (2, 2)

NMR Structure (2, 2)
 ENSEMBLUniProtKBPDB
No.Transcript IDExonExon IDGenome LocationLengthIDLocationLengthCountLocationLength
1.1aENST000002542501aENSE00000901799chr8:42698468-42698167302THAP1_HUMAN1-24241A:1-2424
1.3bENST000002542503bENSE00001090345chr8:42694524-42694329196THAP1_HUMAN24-89661A:24-8764
1.4bENST000002542504bENSE00002195192chr8:42693479-426918171663THAP1_HUMAN90-2131240--

(-) Sequences/Alignments

NMR Structure
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:87
 aligned with THAP1_HUMAN | Q9NVV9 from UniProtKB/Swiss-Prot  Length:213

    Alignment length:87
                                    10        20        30        40        50        60        70        80       
           THAP1_HUMAN    1 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTPDCFKRECNNKLLKENAVPTIFLCTEPH 87
               SCOP domains d2jtga_ A: automated matches                                                            SCOP domains
               CATH domains --------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .....................eee.......hhhhhhhhhh..........eee.hhhhhhhhh.............hhhhh..... Sec.struct. author
             SAPs(SNPs) (1) -----FDCC--KH--EG---C-PE-L--PR-H------T--------------F-GN-I------------R--I----VL-R---- SAPs(SNPs) (1)
             SAPs(SNPs) (2) -----P--------------T----R--Q---------------------------------------------------------- SAPs(SNPs) (2)
                    PROSITE ZF_THAP  PDB: A:1-81 UniProt: 1-81                                               ------ PROSITE
           Transcript 1 (1) Exon 1.1a  PDB: A:1-24  --------------------------------------------------------------- Transcript 1 (1)
           Transcript 1 (2) -----------------------Exon 1.3b  PDB: A:24-87 UniProt: 24-89 [INCOMPLETE]              Transcript 1 (2)
                  2jtg A  1 MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTPDCFKRECNNKLLKENAVPTIFLELVPR 87
                                    10        20        30        40        50        60        70        80       

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 1)

NMR Structure

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 2JTG)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2JTG)

(-) Gene Ontology  (17, 17)

NMR Structure(hide GO term definitions)
Chain A   (THAP1_HUMAN | Q9NVV9)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0042802    identical protein binding    Interacting selectively and non-covalently with an identical protein or proteins.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0043565    sequence-specific DNA binding    Interacting selectively and non-covalently with DNA of a specific nucleotide composition, e.g. GC-rich DNA binding, or with a specific sequence motif or type of DNA e.g. promotor binding or rDNA binding.
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0007049    cell cycle    The progression of biochemical and morphological phases and events that occur in a cell during successive cell replication or nuclear replication events. Canonically, the cell cycle comprises the replication and segregation of genetic material followed by the division of the cell, but in endocycles or syncytial cells nuclear replication or nuclear division may not be followed by cell division.
    GO:0001935    endothelial cell proliferation    The multiplication or reproduction of endothelial cells, resulting in the expansion of a cell population. Endothelial cells are thin flattened cells which line the inside surfaces of body cavities, blood vessels, and lymph vessels, making up the endothelium.
    GO:0007346    regulation of mitotic cell cycle    Any process that modulates the rate or extent of progress through the mitotic cell cycle.
    GO:0006355    regulation of transcription, DNA-templated    Any process that modulates the frequency, rate or extent of cellular DNA-templated transcription.
    GO:0006351    transcription, DNA-templated    The cellular synthesis of RNA on a template of DNA.
cellular component
    GO:0016605    PML body    A class of nuclear body; they react against SP100 auto-antibodies (PML, promyelocytic leukemia); cells typically contain 10-30 PML bodies per nucleus; alterations in the localization of PML bodies occurs after viral infection.
    GO:0043231    intracellular membrane-bounded organelle    Organized structure of distinctive morphology and function, bounded by a single or double lipid bilayer membrane and occurring within the cell. Includes the nucleus, mitochondria, plastids, vacuoles, and vesicles. Excludes the plasma membrane.
    GO:0005730    nucleolus    A small, dense body one or more of which are present in the nucleus of eukaryotic cells. It is rich in RNA and protein, is not bounded by a limiting membrane, and is not seen during mitosis. Its prime function is the transcription of the nucleolar DNA into 45S ribosomal-precursor RNA, the processing of this RNA into 5.8S, 18S, and 28S components of ribosomal RNA, and the association of these components with 5S RNA and proteins synthesized outside the nucleolus. This association results in the formation of ribonucleoprotein precursors; these pass into the cytoplasm and mature into the 40S and 60S subunits of the ribosome.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.

 Visualization

(-) Interactive Views

NMR Structure
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    ZN  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2jtg)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2jtg
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  THAP1_HUMAN | Q9NVV9
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  602629
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  THAP1_HUMAN | Q9NVV9
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        THAP1_HUMAN | Q9NVV92ko0 2l1g

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2JTG)