2FHO - JenaLib

NMR Structure - model 1 (Jmol Viewer)

NMR Structure - all models (Jmol Viewer)

Title :   NMR SOLUTION STRUCTURE OF THE HUMAN SPLICEOSOMAL PROTEIN COMPLEX P14-SF3B155

Authors :   K. Kuwasako, N. Dohmae, M. Inoue, M. Shirouzu, P. Guntert, B. Seraphin, Y. Muto, S. Yokoyama, Riken Structural Genomics/Proteomics Initiative (Rsgi)

Date :   26 Dec 05 (Deposition) - 26 Dec 06 (Release) - 24 Feb 09 (Revision)

Method :   SOLUTION NMR

Resolution :   NOT APPLICABLE

Chains :   NMR Structure  : A,B  (20x)

Keywords :   Rrm Domain, Structural Genomics, Nppsfa, National Project On Protein Structural And Functional Analyses, Riken Structural Genomics/Proteomics Initiative, Rsgi, Rna Binding Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google)] )

Reference :   K. Kuwasako, N. Dohmae, M. Inoue, M. Shirouzu, P. Guntert, B. Seraphin, Y. Muto, S. Yokoyama
Nmr Solution Structure Of The Human Spliceosomal Protein Complex P14-Sf3B155
To Be Published
PubMed: search
(for further references see the PDB file header)

Molecule 1 - SPLICEOSOMAL PROTEIN SF3B155
	Chains	:	A
	Engineered	:	YES
	Expression System	:	ESCHERICHIA COLI
	Expression System Plasmid	:	PGEX6P-1-SF3B155(379-424)-HIS6- P14(8-93)
	Expression System Taxid	:	562
	Expression System Vector Type	:	PLASMID
	Fragment	:	RESIDUES IN DATABASE 379-424
	Gene	:	SF3B155
	Organism Common	:	HUMAN
	Organism Scientific	:	HOMO SAPIENS
	Organism Taxid	:	9606
	Synonym	:	SPLICEOSOMAL PROTEIN SF3B155, SPLICEOSOME ASSOCIATED PROTEIN 155, SAP 155, SF3B155, PRE-MRNA SPLICING FACTOR SF3B 155 KDA SUBUNIT

Molecule 2 - SPLICEOSOMAL PROTEIN P14
	Chains	:	B
	Engineered	:	YES
	Expression System	:	ESCHERICHIA COLI
	Expression System Plasmid	:	PGEX6P-1-SF3B155(379-424)-HIS6- P14(8-93)
	Expression System Taxid	:	562
	Expression System Vector Type	:	PLASMID
	Fragment	:	RNA RECOGNITION MOTIF
	Gene	:	P14
	Organism Common	:	HUMAN
	Organism Scientific	:	HOMO SAPIENS
	Organism Taxid	:	9606
	Synonym	:	SPLICEOSOMAL PROTEIN P14, SF3B 14 KDA SUBUNIT


NMR Structure (20x)	:

Summary Information (see also Sequences/Alignments below)

(no "Ligand,Modified Residues,Ions" information available for 2FHO)

(no "Site" information available for 2FHO)

(no "SS Bond" information available for 2FHO)

(no "Cis Peptide Bond" information available for 2FHO)

(no "SAP(SNP)/Variant" information available for 2FHO)

NMR Structure (1, 1)

	PROSITE			UniProtKB		PDB
No.	ID	AC	Description	ID	Location	Count	Location
1	RRM	PS50102	Eukaryotic RNA Recognition Motif (RRM) profile.	SF3B6_HUMAN	19-94	1	B:19-93

NMR Structure (5, 5)

	ENSEMBL					UniProtKB			PDB
No.	Transcript ID	Exon	Exon ID	Genome Location	Length	ID	Location	Length	Count	Location	Length
1.1a	ENST00000233468	1a	ENSE00001034726	chr2:24299313-24299070	244	SF3B6_HUMAN	1-10	10	1	B:7-10	4
1.2a	ENST00000233468	2a	ENSE00000721455	chr2:24297064-24296946	119	SF3B6_HUMAN	11-50	40	1	B:11-50	40
1.3	ENST00000233468	3	ENSE00000721450	chr2:24291329-24291191	139	SF3B6_HUMAN	50-96	47	1	B:50-93	44
1.4	ENST00000233468	4	ENSE00000808827	chr2:24290721-24290454	268	SF3B6_HUMAN	97-125	29	0	-	-

2.1a	ENST00000335508	1a	ENSE00001858642	chr2:198299815-198299696	120	SF3B1_HUMAN	1-10	10	0	-	-
2.2b	ENST00000335508	2b	ENSE00001005134	chr2:198288698-198288532	167	SF3B1_HUMAN	10-65	56	0	-	-
2.3	ENST00000335508	3	ENSE00001779244	chr2:198285857-198285753	105	SF3B1_HUMAN	66-100	35	0	-	-
2.4a	ENST00000335508	4a	ENSE00000784199	chr2:198285266-198285152	115	SF3B1_HUMAN	101-139	39	0	-	-
2.4h	ENST00000335508	4h	ENSE00000784198	chr2:198283312-198283233	80	SF3B1_HUMAN	139-165	27	0	-	-
2.5b	ENST00000335508	5b	ENSE00000784196	chr2:198281635-198281465	171	SF3B1_HUMAN	166-222	57	0	-	-
2.6b	ENST00000335508	6b	ENSE00000964860	chr2:198274731-198274494	238	SF3B1_HUMAN	223-302	80	0	-	-
2.7b	ENST00000335508	7b	ENSE00000964861	chr2:198273305-198273093	213	SF3B1_HUMAN	302-373	72	0	-	-
2.8a	ENST00000335508	8a	ENSE00000964862	chr2:198272843-198272722	122	SF3B1_HUMAN	373-413	41	1	A:378-413 (gaps)	41
2.9	ENST00000335508	9	ENSE00000964863	chr2:198270196-198269999	198	SF3B1_HUMAN	414-479	66	1	A:414-424	11
2.10	ENST00000335508	10	ENSE00000964864	chr2:198269901-198269800	102	SF3B1_HUMAN	480-513	34	0	-	-
2.11	ENST00000335508	11	ENSE00000964865	chr2:198268488-198268309	180	SF3B1_HUMAN	514-573	60	0	-	-
2.12	ENST00000335508	12	ENSE00000964866	chr2:198267759-198267673	87	SF3B1_HUMAN	574-602	29	0	-	-
2.13	ENST00000335508	13	ENSE00000964867	chr2:198267550-198267280	271	SF3B1_HUMAN	603-693	91	0	-	-
2.15b	ENST00000335508	15b	ENSE00000964868	chr2:198266854-198266709	146	SF3B1_HUMAN	693-741	49	0	-	-
2.16	ENST00000335508	16	ENSE00000964869	chr2:198266612-198266466	147	SF3B1_HUMAN	742-790	49	0	-	-
2.17	ENST00000335508	17	ENSE00000964870	chr2:198266249-198266124	126	SF3B1_HUMAN	791-832	42	0	-	-
2.18	ENST00000335508	18	ENSE00000964871	chr2:198265660-198265439	222	SF3B1_HUMAN	833-906	74	0	-	-
2.19	ENST00000335508	19	ENSE00000964872	chr2:198265158-198264976	183	SF3B1_HUMAN	907-967	61	0	-	-
2.20	ENST00000335508	20	ENSE00000964873	chr2:198264890-198264779	112	SF3B1_HUMAN	968-1005	38	0	-	-
2.22b	ENST00000335508	22b	ENSE00000964874	chr2:198263305-198263185	121	SF3B1_HUMAN	1005-1045	41	0	-	-
2.23a	ENST00000335508	23a	ENSE00000964875	chr2:198262840-198262709	132	SF3B1_HUMAN	1045-1089	45	0	-	-
2.24	ENST00000335508	24	ENSE00000784174	chr2:198261052-198260780	273	SF3B1_HUMAN	1089-1180	92	0	-	-
2.25c	ENST00000335508	25c	ENSE00000964876	chr2:198257912-198257696	217	SF3B1_HUMAN	1180-1252	73	0	-	-
2.26b	ENST00000335508	26b	ENSE00001337127	chr2:198257185-198256698	488	SF3B1_HUMAN	1253-1304	52	0	-	-

NMR Structure

Reformat:	Number of residues per line = ('0' or empty: single-line sequence representation)
	Number of residues per labelling interval =
	UniProt sequence: complete aligned part

Show mapping:	SCOP domains	CATH domains	Pfam domains	Secondary structure (by author)
	SAPs(SNPs)	PROSITE motifs	Exons
	(details for a mapped element are shown in a popup box when the mouse pointer rests over it)

Chain A from PDB  Type:PROTEIN  Length:47
 aligned with SF3B1_HUMAN | O75533 from UniProtKB/Swiss-Prot  Length:1304

    Alignment length:52
                                   382       392       402       412       422  
          SF3B1_HUMAN   373 GHIMSMTPEQLQAWRWEREIDERNRPLSDEELDAMFPEGYKVLPPPAGYVPI 424
               SCOP domains ---------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------- Pfam domains
         Sec.struct. author .-----.......................hhhhhhh...eee.......... Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------- PROSITE
           Transcript 2 (1) 2----------------------------------------Exon 2.9    Transcript 2 (1)
           Transcript 2 (2) Exon 2.8a  PDB: A:378-413 (gaps)         ----------- Transcript 2 (2)
                 2fho A 378 G-----TPEQLQAWRWEREIDERNRPLSDEELDAMFPEGYKVLPPPAGYVPI 424
                            |     |382       392       402       412       422  
                            |   379                                             
                          378

Chain B from PDB  Type:PROTEIN  Length:87
 aligned with SF3B6_HUMAN | Q9Y3B4 from UniProtKB/Swiss-Prot  Length:125

    Alignment length:87
                                    16        26        36        46        56        66        76        86       
          SF3B6_HUMAN     7 KRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVGNTPETRGTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYN  93
               SCOP domains d2fhob_ B: automated matches                                                            SCOP domains
               CATH domains --------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .............eeeee......hhhhhhhhhh....eeeeeee.......eeeeee..hhhhhhhhhhh...ee..ee.eeee.. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------RRM  PDB: B:19-93 UniProt: 19-94                                            PROSITE
           Transcript 1 (1) 1.1aExon 1.2a  PDB: B:11-50 UniProt: 11-50  ------------------------------------------- Transcript 1 (1)
           Transcript 1 (2) -------------------------------------------Exon 1.3  PDB: B:50-93 UniProt: 50-96        Transcript 1 (2)
                 2fho B   7 GRANIRLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVGNTPETRGTAYVVYEDIFDAKNACDHLSGFNVCNRYLVVLYYN  93
                                    16        26        36        46        56        66        76        86

Legend:		→ Mismatch	(orange background)
	-	→ Gap	(green background, '-', border residues have a numbering label)
		→ Modified Residue	(blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
	x	→ Chemical Group	(purple background, 'x', labelled with number + name, e.g. ACE or NH2)
	extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '\|'

NMR Structure

Classes(

)

Folds(

)

Superfamilies( (-)

)

Families(

)

Protein Domains( (-)

)

Organisms( (-)

)

Class: Alpha and beta proteins (a+b) (23004)

Fold: Ferredoxin-like (1795)

Superfamily: RNA-binding domain, RBD (289)

Family: automated matches (67)

Protein domain: automated matches (67)

Human (Homo sapiens) [TaxId: 9606] (56)

d2fhob_

(no "CATH Domain" information available for 2FHO)

(no "Pfam Domain" information available for 2FHO)

NMR Structure(hide GO term definitions)

Chain A (SF3B1_HUMAN | O75533)

molecular function
	GO:0003729	mRNA binding	Interacting selectively and non-covalently with messenger RNA (mRNA), an intermediate molecule between DNA and protein. mRNA includes UTR and coding sequences, but does not contain introns.
	GO:0005515	protein binding	Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
biological process
	GO:0008380	RNA splicing	The process of removing sections of the primary RNA transcript to remove sequences not present in the mature form of the RNA and joining the remaining sections to form the mature form of the RNA.
	GO:0000375	RNA splicing, via transesterification reactions	Splicing of RNA via a series of two transesterification reactions.
	GO:0006397	mRNA processing	Any process involved in the conversion of a primary mRNA transcript into one or more mature mRNA(s) prior to translation into polypeptide.
	GO:0000398	mRNA splicing, via spliceosome	The joining together of exons from one or more primary transcripts of messenger RNA (mRNA) and the excision of intron sequences, via a spliceosomal mechanism, so that mRNA consisting only of the joined exons is produced.
	GO:0045815	positive regulation of gene expression, epigenetic	Any epigenetic process that activates or increases the rate of gene expression.
	GO:0000245	spliceosomal complex assembly	The aggregation, arrangement and bonding together of a spliceosomal complex, a ribonucleoprotein apparatus that catalyzes nuclear mRNA splicing via transesterification reactions.
cellular component
	GO:0034693	U11/U12 snRNP	A ribonucleoprotein complex formed by the association of the U11 and U12 small nuclear ribonucleoproteins.
	GO:0005689	U12-type spliceosomal complex	Any spliceosomal complex that forms during the splicing of a messenger RNA primary transcript to excise an intron; the series of U12-type spliceosomal complexes is involved in the splicing of the majority of introns that contain atypical AT-AC terminal dinucleotides, as well as other non-canonical introns. The entire splice site signal, not just the terminal dinucleotides, is involved in determining which spliceosome utilizes the site.
	GO:0005686	U2 snRNP	A ribonucleoprotein complex that contains small nuclear RNA U2.
	GO:0071004	U2-type prespliceosome	A spliceosomal complex that is formed by association of the 5' splice site with the U1 snRNP, while the branch point sequence is recognized by the U2 snRNP. The prespliceosome includes many proteins in addition to those found in the U1 and U2 snRNPs. Commitment to a given pair of 5' and 3' splice sites occurs at the time of prespliceosome formation.
	GO:0071013	catalytic step 2 spliceosome	A spliceosomal complex that contains three snRNPs, including U5, bound to a splicing intermediate in which the first catalytic cleavage of the 5' splice site has occurred. The precise subunit composition differs significantly from that of the catalytic step 1, or activated, spliceosome, and includes many proteins in addition to those found in the associated snRNPs.
	GO:0016607	nuclear speck	A discrete extra-nucleolar subnuclear domain, 20-50 in number, in which splicing factors are seen to be localized by immunofluorescence microscopy.
	GO:0005654	nucleoplasm	That part of the nuclear content other than the chromosomes or the nucleolus.
	GO:0005634	nucleus	A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
	GO:0005681	spliceosomal complex	Any of a series of ribonucleoprotein complexes that contain snRNA(s) and small nuclear ribonucleoproteins (snRNPs), and are formed sequentially during the spliceosomal splicing of one or more substrate RNAs, and which also contain the RNA substrate(s) from the initial target RNAs of splicing, the splicing intermediate RNA(s), to the final RNA products. During cis-splicing, the initial target RNA is a single, contiguous RNA transcript, whether mRNA, snoRNA, etc., and the released products are a spliced RNA and an excised intron, generally as a lariat structure. During trans-splicing, there are two initial substrate RNAs, the spliced leader RNA and a pre-mRNA.

Chain B (SF3B6_HUMAN | Q9Y3B4)

molecular function
	GO:0003723	RNA binding	Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
	GO:0003676	nucleic acid binding	Interacting selectively and non-covalently with any nucleic acid.
	GO:0000166	nucleotide binding	Interacting selectively and non-covalently with a nucleotide, any compound consisting of a nucleoside that is esterified with (ortho)phosphate or an oligophosphate at any hydroxyl group on the ribose or deoxyribose.
biological process
	GO:0008380	RNA splicing	The process of removing sections of the primary RNA transcript to remove sequences not present in the mature form of the RNA and joining the remaining sections to form the mature form of the RNA.
	GO:0001825	blastocyst formation	The initial formation of a blastocyst from a solid ball of cells known as a morula.
	GO:0006397	mRNA processing	Any process involved in the conversion of a primary mRNA transcript into one or more mature mRNA(s) prior to translation into polypeptide.
	GO:0000398	mRNA splicing, via spliceosome	The joining together of exons from one or more primary transcripts of messenger RNA (mRNA) and the excision of intron sequences, via a spliceosomal mechanism, so that mRNA consisting only of the joined exons is produced.
cellular component
	GO:0005689	U12-type spliceosomal complex	Any spliceosomal complex that forms during the splicing of a messenger RNA primary transcript to excise an intron; the series of U12-type spliceosomal complexes is involved in the splicing of the majority of introns that contain atypical AT-AC terminal dinucleotides, as well as other non-canonical introns. The entire splice site signal, not just the terminal dinucleotides, is involved in determining which spliceosome utilizes the site.
	GO:0005686	U2 snRNP	A ribonucleoprotein complex that contains small nuclear RNA U2.
	GO:0005684	U2-type spliceosomal complex	Any spliceosomal complex that forms during the splicing of a messenger RNA primary transcript to excise an intron that has canonical consensus sequences near the 5' and 3' ends.
	GO:0071013	catalytic step 2 spliceosome	A spliceosomal complex that contains three snRNPs, including U5, bound to a splicing intermediate in which the first catalytic cleavage of the 5' splice site has occurred. The precise subunit composition differs significantly from that of the catalytic step 1, or activated, spliceosome, and includes many proteins in addition to those found in the associated snRNPs.
	GO:0005654	nucleoplasm	That part of the nuclear content other than the chromosomes or the nucleolus.
	GO:0005634	nucleus	A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
	GO:0071011	precatalytic spliceosome	A spliceosomal complex that is formed by the recruitment of a preassembled U5-containing tri-snRNP to the prespliceosome. Although all 5 snRNPs are present, the precatalytic spliceosome is catalytically inactive. The precatalytic spliceosome includes many proteins in addition to those found in the associated snRNPs.

NMR Structure
	Complete Structure
		Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
		WebMol \| AstexViewer[tm]@PDBe (Java Applets, require no local installation except for Java; loading may be slow)
		STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
		RasMol (require local installation)
		Molscript (VRML) (requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)

	Ligands, Modified Residues, Ions
(no "Ligands, Modified Residues, Ions" information available for 2fho)

	Sites
(no "Sites" information available for 2fho)

	Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2fho)

Jmol
	protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick [ NMR Structure - model 1 PNG format \| NMR Structure - all models PNG format ](automatic orientation, automatically generated)
Molscript
	protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick [ mono PDF format \| stereo PDF format ](default orientation, automatically generated)

Access by PDB/NDB ID
	2fho
		Family and Domain Information	:	ProDom \| SYSTERS
		General Structural Information	:	GlycoscienceDB \| MMDB \| NDB \| OCA \| PDB \| PDBe \| PDBj \| PDBsum \| PDBWiki \| PQS \| PROTEOPEDIA
		Orientation in Membranes	:	OPM
		Protein Surface	:	SURFACE
		Secondary Structure	:	DSSP (structure derived) \| HSSP (homology derived)
		Structural Genomics	:	GeneCensus
		Structural Neighbours	:	CE \| VAST
		Structure Classification	:	CATH \| Dali \| SCOP
		Validation and Original Data	:	BMRB Data View \| BMRB Restraints Grid \| EDS \| PROCHECK \| RECOORD \| WHAT_CHECK

Access by UniProt ID/Accession number
	SF3B1_HUMAN \| O75533
		Comparative Protein Structure Models	:	ModBase
		Genomic Information	:	Ensembl
		Protein-protein Interaction	:	DIP
		Sequence, Family and Domain Information	:	InterPro \| Pfam \| SMART \| UniProtKB/SwissProt
	SF3B6_HUMAN \| Q9Y3B4
		Comparative Protein Structure Models	:	ModBase
		Genomic Information	:	Ensembl
		Protein-protein Interaction	:	DIP
		Sequence, Family and Domain Information	:	InterPro \| Pfam \| SMART \| UniProtKB/SwissProt

Access by Enzyme Classificator (EC Number)
	(no 'Enzyme Classificator' available)
		General Enzyme Information	:	BRENDA \| EC-PDB \| Enzyme \| IntEnz
		Pathway	:	KEGG \| MetaCyc

Access by Disease Identifier (MIM ID)
	(no 'MIM ID' available)
		Disease Information	:	OMIM

Access by GenAge ID
	(no 'GenAge ID' available)
		Age Related Information	:	GenAge

Access by PDB/NDB ID
		Domain Information	:	XDom
		Interatomic Contacts of Structural Units	:	CSU
		Ligand-protein Contacts	:	LPC
		Protein Cavities	:	castP
		Sequence and Secondary Structure	:	PDBCartoon
		Structure Alignment	:	STRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
		Structure and Sequence Browser	:	STING

Access by UniProt ID/Accession number
	SF3B1_HUMAN \| O75533
		Protein Disorder Prediction	:	DisEMBL \| FoldIndex \| GLOBPLOT (for more information see DisProt)
	SF3B6_HUMAN \| Q9Y3B4
		Protein Disorder Prediction	:	DisEMBL \| FoldIndex \| GLOBPLOT (for more information see DisProt)

UniProtKB/Swiss-Prot
	SF3B1_HUMAN \| O75533	:	2f9d 2f9j 2peh 3lqv 4oz1 5ife
	SF3B6_HUMAN \| Q9Y3B4	:	2f9d 2f9j 3lqv

(no "Related Entries Specified in the PDB File" available for 2FHO)

Description

Compounds

Structural Features

Chains, Units

Ligands, Modified Residues, Ions (0, 0)

Sites (0, 0)

SS Bonds (0, 0)

Cis Peptide Bonds (0, 0)

Sequence-Structure Mapping

SAPs(SNPs)/Variants (0, 0)

PROSITE Motifs (1, 1)

Exons (5, 5)

Sequences/Alignments

Classification and Annotation

SCOP Domains (1, 1)

CATH Domains (0, 0)

Pfam Domains (0, 0)

Gene Ontology (23, 31)

Visualization

Interactive Views

Still Images

Databases and Analysis Tools

Databases

Analysis Tools

Related Entries

Entries Sharing at Least One Protein Chain (UniProt ID)

Related Entries Specified in the PDB File