Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)NMR Structure - model 1
(-)NMR Structure - all models
collapse expand < >
Image NMR Structure - model 1
NMR Structure - model 1  (Jmol Viewer)
Image NMR Structure - all models
NMR Structure - all models  (Jmol Viewer)

(-) Description

Title :  SOLUTION STRUCTURE OF THE CORE SMN-GEMIN2 COMPLEX
 
Authors :  K. L. Sarachan, K. Valentine, K. Gupta, V. Moorman, J. Gledhill, M. Bern C. Tommos, A. J. Wand, G. Van Duyne
Date :  15 Jun 11  (Deposition) - 27 Jun 12  (Release) - 11 Jul 12  (Revision)
Method :  SOLUTION NMR
Resolution :  NOT APPLICABLE
Chains :  NMR Structure  :  A,B  (32x)
NMR Structure *:  A,B  (1x)
Keywords :  Spinal Muscular Atrophy, Snrnp Assembly, Protein Binding (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  K. L. Sarachan, K. G. Valentine, K. Gupta, V. R. Moorman, J. M. Gledhill Jr, M. Bernens, C. Tommos, A. J. Wand, G. D. Van Duyne
Solution Structure Of The Core Smn-Gemin2 Complex.
Biochem. J. V. 445 361 2012
PubMed-ID: 22607171  |  Reference-DOI: 10.1042/BJ20120241

(-) Compounds

Molecule 1 - SURVIVAL OF MOTOR NEURON PROTEIN-INTERACTING PROTEIN 1
    ChainsA
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System StrainBL21(DE3)
    Expression System Taxid469008
    Expression System VectorPKS987
    FragmentUNP RESIDUES 95-280
    GeneSIP1, GEMIN2
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymSMN-INTERACTING PROTEIN 1, COMPONENT OF GEMS 2, GEMIN-2
 
Molecule 2 - SURVIVAL MOTOR NEURON PROTEIN
    ChainsB
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System StrainBL21(DE3)
    Expression System Taxid469008
    Expression System VectorPKS1449
    FragmentUNP RESIDUES 26-51
    GeneSMN1, SMN, SMNT, SMN2, SMNC
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymCOMPONENT OF GEMS 1, GEMIN-1

 Structural Features

(-) Chains, Units

  12
NMR Structure (32x)AB
NMR Structure * (1x)AB

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (0, 0)

(no "Ligand,Modified Residues,Ions" information available for 2LEH)

(-) Sites  (0, 0)

(no "Site" information available for 2LEH)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 2LEH)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 2LEH)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (2, 2)

NMR Structure (2, 2)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_034803D30NSMN_HUMANDisease (SMA2)104893930BD30N
2UniProtVAR_034804D44VSMN_HUMANDisease (SMA3)104893931BD44V

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)
NMR Structure * (2, 2)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_034803D30NSMN_HUMANDisease (SMA2)104893930BD30N
2UniProtVAR_034804D44VSMN_HUMANDisease (SMA3)104893931BD44V

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 2LEH)

(-) Exons   (2, 2)

NMR Structure (2, 2)
 ENSEMBLUniProtKBPDB
No.Transcript IDExonExon IDGenome LocationLengthIDLocationLengthCountLocationLength
1.1bENST000003807431bENSE00001904512chr5:69345439-69345593155SMN_HUMAN1-27271B:26-272
1.2bENST000003807432bENSE00001725187chr5:69359242-6935931372SMN_HUMAN28-51241B:28-5124
1.3ENST000003807433ENSE00001607984chr5:69361792-69361911120SMN_HUMAN52-91400--
1.4aENST000003807434aENSE00001656721chr5:69362761-69362961201SMN_HUMAN92-158670--
1.5bENST000003807435bENSE00001749658chr5:69363121-69363273153SMN_HUMAN159-209510--
1.7bENST000003807437bENSE00001651294chr5:69365062-6936515796SMN_HUMAN210-241320--
1.8aENST000003807438aENSE00001730323chr5:69366468-69366578111SMN_HUMAN242-278370--
1.10aENST0000038074310aENSE00001755872chr5:69372348-6937240154SMN_HUMAN279-294160--
1.11cENST0000038074311cENSE00001812360chr5:69372846-69373419574SMN_HUMAN-00--

(-) Sequences/Alignments

NMR Structure
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:189
 aligned with GEMI2_HUMAN | O14893 from UniProtKB/Swiss-Prot  Length:280

    Alignment length:189
                                                                                                                                                                                                                   280   
                                   104       114       124       134       144       154       164       174       184       194       204       214       224       234       244       254       264       274     |   
          GEMI2_HUMAN    95 GYSPTLQWQQQQVAQFSTVRQNVNKHRSHWKSQQLDSNVTMPKSEDEEGWKKFCLGEKLCADGAVGPATNESPGIDYVQIGFPPLLSIVSRMNQATVTSVLEYLSNWFGERDFTPELGRWLYALLACLEKPLLPEAHSLIRQLARRCSEVRLLVDSKDDERVPALNLLICLVSRYFDQRDLADEPS---   -
               SCOP domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....hhhhhhhhhhhhhhhhhhhhhhhhhhhh.............hhhhhhhhhhhhhhhhh......................hhhhhhhhhhhhhhhhhhhhhhhh......hhhhhhhhhhhhhh....hhhhhhhhhhhhhhhhhhhhhh......hhhhhhhhhhhhhhhh............. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 2leh A  95 GYSPTLQWQQQQVAQFSTVRQNVNKHRSHWKSQQLDSNVTMPKSEDEEGWKKFCLGEKLCADGAVGPATNESPGIDYVQIGFPPLLSIVSRMNQATVTSVLEYLSNWFGERDFTPELGRWLYALLACLEKPLLPEAHSLIRQLARRCSEVRLLVDSKDDERVPALNLLICLVSRYFDQRDLADEPSLEY 283
                                   104       114       124       134       144       154       164       174       184       194       204       214       224       234       244       254       264       274         

Chain B from PDB  Type:PROTEIN  Length:26
 aligned with SMN_HUMAN | Q16637 from UniProtKB/Swiss-Prot  Length:294

    Alignment length:26
                                    35        45      
            SMN_HUMAN    26 GQSDDSDIWDDTALIKAYDKAVASFK  51
               SCOP domains -------------------------- SCOP domains
               CATH domains -------------------------- CATH domains
               Pfam domains -------------------------- Pfam domains
         Sec.struct. author ...........hhhhhhhhhhhhh.. Sec.struct. author
                 SAPs(SNPs) ----N-------------V------- SAPs(SNPs)
                    PROSITE -------------------------- PROSITE
               Transcript 1 1.Exon 1.2b  PDB: B:28-51  Transcript 1
                 2leh B  26 GQSDDSDIWDDTALIKAYDKAVASFK  51
                                    35        45      

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 2LEH)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 2LEH)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2LEH)

(-) Gene Ontology  (26, 39)

NMR Structure(hide GO term definitions)
Chain A   (GEMI2_HUMAN | O14893)
molecular function
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
biological process
    GO:0008380    RNA splicing    The process of removing sections of the primary RNA transcript to remove sequences not present in the mature form of the RNA and joining the remaining sections to form the mature form of the RNA.
    GO:0000375    RNA splicing, via transesterification reactions    Splicing of RNA via a series of two transesterification reactions.
    GO:0006397    mRNA processing    Any process involved in the conversion of a primary mRNA transcript into one or more mature mRNA(s) prior to translation into polypeptide.
    GO:0051170    nuclear import    The directed movement of substances into the nucleus.
    GO:0000245    spliceosomal complex assembly    The aggregation, arrangement and bonding together of a spliceosomal complex, a ribonucleoprotein apparatus that catalyzes nuclear mRNA splicing via transesterification reactions.
    GO:0000387    spliceosomal snRNP assembly    The aggregation, arrangement and bonding together of one or more snRNA and multiple protein components to form a ribonucleoprotein complex that is involved in formation of the spliceosome.
cellular component
    GO:0097504    Gemini of coiled bodies    Nuclear bodies frequently found near or associated with Cajal bodies (also called coiled bodies or CBs). Gemini of coiled bodies, or 'gems', are similar in size and shape to CBs, and often indistinguishable under the microscope. Unlike CBs, gems do not contain small nuclear ribonucleoproteins (snRNPs); they contain a protein called survivor of motor neurons (SMN) whose function relates to snRNP biogenesis. Gems are believed to assist CBs in snRNP biogenesis, and to play a role in the etiology of spinal muscular atrophy (SMA).
    GO:0032797    SMN complex    A protein complex that contains the survival motor neuron (SMN) protein and at least eight additional integral components, including the Gemin2-8 and Unrip proteins; the complex is found in the cytoplasm and in nuclear Gems, and is involved in spliceosomal snRNP assembly in the cytoplasm and in pre-mRNA splicing in the nucleus.
    GO:0034719    SMN-Sm protein complex    A protein complex formed by the association of several methylated Sm proteins with the SMN complex; the latter contains the survival motor neuron (SMN) protein and at least eight additional integral components, including the Gemin2-8 and unrip proteins; additional proteins, including galectin-1 and galectin-3, are also found in the SMN-SM complex. The SMN-Sm complex is involved in spliceosomal snRNP assembly in the cytoplasm.
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0005829    cytosol    The part of the cytoplasm that does not contain organelles but which does contain other particulate matter, such as protein complexes.
    GO:0005622    intracellular    The living contents of a cell; the matter contained within (but not including) the plasma membrane, usually taken to exclude large vacuoles and masses of secretory or ingested material. In eukaryotes it includes the nucleus and cytoplasm.
    GO:0005730    nucleolus    A small, dense body one or more of which are present in the nucleus of eukaryotic cells. It is rich in RNA and protein, is not bounded by a limiting membrane, and is not seen during mitosis. Its prime function is the transcription of the nucleolar DNA into 45S ribosomal-precursor RNA, the processing of this RNA into 5.8S, 18S, and 28S components of ribosomal RNA, and the association of these components with 5S RNA and proteins synthesized outside the nucleolus. This association results in the formation of ribonucleoprotein precursors; these pass into the cytoplasm and mature into the 40S and 60S subunits of the ribosome.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0005681    spliceosomal complex    Any of a series of ribonucleoprotein complexes that contain snRNA(s) and small nuclear ribonucleoproteins (snRNPs), and are formed sequentially during the spliceosomal splicing of one or more substrate RNAs, and which also contain the RNA substrate(s) from the initial target RNAs of splicing, the splicing intermediate RNA(s), to the final RNA products. During cis-splicing, the initial target RNA is a single, contiguous RNA transcript, whether mRNA, snoRNA, etc., and the released products are a spliced RNA and an excised intron, generally as a lariat structure. During trans-splicing, there are two initial substrate RNAs, the spliced leader RNA and a pre-mRNA.

Chain B   (SMN_HUMAN | Q16637)
molecular function
    GO:0003723    RNA binding    Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
    GO:0042802    identical protein binding    Interacting selectively and non-covalently with an identical protein or proteins.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
biological process
    GO:0006353    DNA-templated transcription, termination    The cellular process that completes DNA-templated transcription; the formation of phosphodiester bonds ceases, the RNA-DNA hybrid dissociates, and RNA polymerase releases the DNA.
    GO:0008380    RNA splicing    The process of removing sections of the primary RNA transcript to remove sequences not present in the mature form of the RNA and joining the remaining sections to form the mature form of the RNA.
    GO:0006397    mRNA processing    Any process involved in the conversion of a primary mRNA transcript into one or more mature mRNA(s) prior to translation into polypeptide.
    GO:0007399    nervous system development    The process whose specific outcome is the progression of nervous tissue over time, from its formation to its mature state.
    GO:0051170    nuclear import    The directed movement of substances into the nucleus.
    GO:0000245    spliceosomal complex assembly    The aggregation, arrangement and bonding together of a spliceosomal complex, a ribonucleoprotein apparatus that catalyzes nuclear mRNA splicing via transesterification reactions.
    GO:0000387    spliceosomal snRNP assembly    The aggregation, arrangement and bonding together of one or more snRNA and multiple protein components to form a ribonucleoprotein complex that is involved in formation of the spliceosome.
cellular component
    GO:0015030    Cajal body    A class of nuclear body, first seen after silver staining by Ramon y Cajal in 1903, enriched in small nuclear ribonucleoproteins, and certain general RNA polymerase II transcription factors; ultrastructurally, they appear as a tangle of coiled, electron-dense threads roughly 0.5 micrometers in diameter; involved in aspects of snRNP biogenesis; the protein coilin serves as a marker for Cajal bodies. Some argue that Cajal bodies are the sites for preassembly of transcriptosomes, unitary particles involved in transcription and processing of RNA.
    GO:0097504    Gemini of coiled bodies    Nuclear bodies frequently found near or associated with Cajal bodies (also called coiled bodies or CBs). Gemini of coiled bodies, or 'gems', are similar in size and shape to CBs, and often indistinguishable under the microscope. Unlike CBs, gems do not contain small nuclear ribonucleoproteins (snRNPs); they contain a protein called survivor of motor neurons (SMN) whose function relates to snRNP biogenesis. Gems are believed to assist CBs in snRNP biogenesis, and to play a role in the etiology of spinal muscular atrophy (SMA).
    GO:0032797    SMN complex    A protein complex that contains the survival motor neuron (SMN) protein and at least eight additional integral components, including the Gemin2-8 and Unrip proteins; the complex is found in the cytoplasm and in nuclear Gems, and is involved in spliceosomal snRNP assembly in the cytoplasm and in pre-mRNA splicing in the nucleus.
    GO:0034719    SMN-Sm protein complex    A protein complex formed by the association of several methylated Sm proteins with the SMN complex; the latter contains the survival motor neuron (SMN) protein and at least eight additional integral components, including the Gemin2-8 and unrip proteins; additional proteins, including galectin-1 and galectin-3, are also found in the SMN-SM complex. The SMN-Sm complex is involved in spliceosomal snRNP assembly in the cytoplasm.
    GO:0030018    Z disc    Platelike region of a muscle sarcomere to which the plus ends of actin filaments are attached.
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0036464    cytoplasmic ribonucleoprotein granule    A ribonucleoprotein granule located in the cytoplasm.
    GO:0005829    cytosol    The part of the cytoplasm that does not contain organelles but which does contain other particulate matter, such as protein complexes.
    GO:0043005    neuron projection    A prolongation or process extending from a nerve cell, e.g. an axon or dendrite.
    GO:0005654    nucleoplasm    That part of the nuclear content other than the chromosomes or the nucleolus.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0043204    perikaryon    The portion of the cell soma (neuronal cell body) that excludes the nucleus.

 Visualization

(-) Interactive Views

NMR Structure
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
(no "Ligands, Modified Residues, Ions" information available for 2leh)
 
  Sites
(no "Sites" information available for 2leh)
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2leh)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2leh
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  GEMI2_HUMAN | O14893
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  SMN_HUMAN | Q16637
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  253400
    Disease InformationOMIM
  253550
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  GEMI2_HUMAN | O14893
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  SMN_HUMAN | Q16637
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        GEMI2_HUMAN | O148933s6n
        SMN_HUMAN | Q166371g5v 1mhn 3s6n 4a4e 4a4g 4gli 4qq6 5mf9

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2LEH)