3BHO - JenaLib

Title :   CRYSTAL STRUCTURE OF THE 25KDA SUBUNIT OF HUMAN CLEAVAGE FACTOR IM WITH AP4A

Authors :   M. Coseno, S. Doublie

Date :   28 Nov 07 (Deposition) - 19 Feb 08 (Release) - 24 Feb 09 (Revision)

Method :   X-RAY DIFFRACTION

Resolution :   1.80

Chains :   Asym. Unit : A
Biol. Unit 1: A  (2x)

Keywords :   Cpsf5, Rna Processing, Cleavage Factor, Diadenosine Tetraphosphate, Mrna Processing, Nucleus, Phosphoprotein, Rna-Binding, Nuclear Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google)] )

Reference :   M. Coseno, G. Martin, C. Berger, G. Gilmartin, W. Keller, S. Doublie
Crystal Structure Of The 25 Kda Subunit Of Human Cleavage Factor Im.
Nucleic Acids Res. V. 36 3474 2008
PubMed-ID: 18445629 | Reference-DOI: 10.1093/NAR/GKN079
(for further references see the PDB file header)

Compounds

Structural Features

Chains, Units

Ligands, Modified Residues, Ions (1, 1)

Sites (1, 1)

SS Bonds (0, 0)

Cis Peptide Bonds (0, 0)

Sequence-Structure Mapping

SAPs(SNPs)/Variants (0, 0)

PROSITE Motifs (1, 1)

Exons (7, 7)

Asymmetric Unit (7, 7)

	ENSEMBL					UniProtKB			PDB
No.	Transcript ID	Exon	Exon ID	Genome Location	Length	ID	Location	Length	Count	Location	Length
1.1	ENST00000300291	1	ENSE00001482691	chr16:56485287-56484999	289	CPSF5_HUMAN	1-39	39	1	A:20-39	20
1.2	ENST00000300291	2	ENSE00001108683	chr16:56481901-56481701	201	CPSF5_HUMAN	39-106	68	1	A:39-106	68
1.3	ENST00000300291	3	ENSE00001108681	chr16:56480601-56480538	64	CPSF5_HUMAN	106-127	22	1	A:106-127	22
1.4	ENST00000300291	4	ENSE00001108678	chr16:56473658-56473569	90	CPSF5_HUMAN	128-157	30	1	A:128-157 (gaps)	30
1.5	ENST00000300291	5	ENSE00001108686	chr16:56468741-56468666	76	CPSF5_HUMAN	158-183	26	1	A:158-183	26
1.6	ENST00000300291	6	ENSE00001108677	chr16:56468357-56468243	115	CPSF5_HUMAN	183-221	39	1	A:183-221	39
1.7	ENST00000300291	7	ENSE00001108679	chr16:56466645-56463045	3601	CPSF5_HUMAN	221-227	7	1	A:221-227	7

Sequences/Alignments

Asymmetric Unit

Reformat:	Number of residues per line = ('0' or empty: single-line sequence representation)
	Number of residues per labelling interval =
	UniProt sequence: complete aligned part

Show mapping:	SCOP domains	CATH domains	Pfam domains	Secondary structure (by author)
	SAPs(SNPs)	PROSITE motifs	Exons
	(details for a mapped element are shown in a popup box when the mouse pointer rests over it)

Chain A from PDB  Type:PROTEIN  Length:203
 aligned with CPSF5_HUMAN | O43809 from UniProtKB/Swiss-Prot  Length:227

    Alignment length:208
                                    29        39        49        59        69        79        89        99       109       119       129       139       149       159       169       179       189       199       209       219        
          CPSF5_HUMAN    20 FGNKYIQQTKPLTLERTINLYPLTNYTFGTKEPLYEKDSSVAARFQRMREEFDKIGMRRTVEGVLIVHEHRLPHVLLLQLGTTFFKLPGGELNPGEDEVEGLKRLMTEILGRQDGVLQDWVIDDCIGNWWRPNFEPPQYPYIPAHITKPKEHKKLFLVQLQEKALFAVPKNYKLVAAPLFELYDNAPGYGPIISSLPQLLSRFNFIYN 227
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ................eeee.hhh.eeeeee........hhhhhhhhhhhhhhhhh.eeeeeeeeeeee..eeeeeeeeee..eee..eee.....hhhhhhhhhhhhhhh.-----...eeeeeeeeeee...................eeeeeeeee....eeeeee...eeeeeehhhhh.hhhhhhhhhhhhhhhhh..eeee. Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------NUDIX  PDB: A:76-201 UniProt: 76-201                                                                                          -------------------------- PROSITE
           Transcript 1 (1) Exon 1.1            ------------------------------------------------------------------Exon 1.3              Exon 1.4  PDB: A:128-157 (gapsExon 1.5  PDB: A:158-183  -------------------------------------1.7     Transcript 1 (1)
           Transcript 1 (2) -------------------Exon 1.2  PDB: A:39-106 UniProt: 39-106                             ----------------------------------------------------------------------------Exon 1.6  PDB: A:183-221               ------ Transcript 1 (2)
                 3bho A  20 FGNKYIQQTKPLTLERTINLYPLTNYTFGTKEPLYEKDSSVAARFQRMREEFDKIGMRRTVEGVLIVHEHRLPHVLLLQLGTTFFKLPGGELNPGEDEVEGLKRLMTEILGR-----QDWVIDDCIGNWWRPNFEPPQYPYIPAHITKPKEHKKLFLVQLQEKALFAVPKNYKLVAAPLFELYDNAPGYGPIISSLPQLLSRFNFIYN 227
                                    29        39        49        59        69        79        89        99       109       119       129 |     139       149       159       169       179       189       199       209       219        
                                                                                                                                         131   137

Legend:		→ Mismatch	(orange background)
	-	→ Gap	(green background, '-', border residues have a numbering label)
		→ Modified Residue	(blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
	x	→ Chemical Group	(purple background, 'x', labelled with number + name, e.g. ACE or NH2)
	extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '\|'

Classification and Annotation

SCOP Domains (0, 0)

CATH Domains (0, 0)

Pfam Domains (0, 0)

Gene Ontology (19, 19)

Asymmetric Unit(hide GO term definitions)

Chain A (CPSF5_HUMAN | O43809)

molecular function
	GO:0017091	AU-rich element binding	Interacting selectively and non-covalently with a region of RNA containing frequent adenine and uridine bases.
	GO:0003723	RNA binding	Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
	GO:0042826	histone deacetylase binding	Interacting selectively and non-covalently with the enzyme histone deacetylase.
	GO:0016787	hydrolase activity	Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
	GO:0003729	mRNA binding	Interacting selectively and non-covalently with messenger RNA (mRNA), an intermediate molecule between DNA and protein. mRNA includes UTR and coding sequences, but does not contain introns.
	GO:0005515	protein binding	Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
	GO:0042803	protein homodimerization activity	Interacting selectively and non-covalently with an identical protein to form a homodimer.
biological process
	GO:0031124	mRNA 3'-end processing	Any process involved in forming the mature 3' end of an mRNA molecule.
	GO:0006378	mRNA polyadenylation	The enzymatic addition of a sequence of 40-200 adenylyl residues at the 3' end of a eukaryotic mRNA primary transcript.
	GO:0006397	mRNA processing	Any process involved in the conversion of a primary mRNA transcript into one or more mature mRNA(s) prior to translation into polypeptide.
	GO:0000398	mRNA splicing, via spliceosome	The joining together of exons from one or more primary transcripts of messenger RNA (mRNA) and the excision of intron sequences, via a spliceosomal mechanism, so that mRNA consisting only of the joined exons is produced.
	GO:0051262	protein tetramerization	The formation of a protein tetramer, a macromolecular structure consisting of four noncovalently associated identical or nonidentical subunits.
	GO:0006369	termination of RNA polymerase II transcription	The process in which the synthesis of an RNA molecule by RNA polymerase II using a DNA template is completed.
cellular component
	GO:0005813	centrosome	A structure comprised of a core structure (in most organisms, a pair of centrioles) and peripheral material from which a microtubule-based structure, such as a spindle apparatus, is organized. Centrosomes occur close to the nucleus during interphase in many eukaryotic cells, though in animal cells it changes continually during the cell-division cycle.
	GO:0005849	mRNA cleavage factor complex	Any macromolecular complex involved in cleavage or polyadenylation of mRNA molecules.
	GO:0005815	microtubule organizing center	An intracellular structure that can catalyze gamma-tubulin-dependent microtubule nucleation and that can anchor microtubules by interacting with their minus ends, plus ends or sides.
	GO:0005654	nucleoplasm	That part of the nuclear content other than the chromosomes or the nucleolus.
	GO:0005634	nucleus	A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
	GO:0042382	paraspeckles	Discrete subnuclear bodies in the interchromatin nucleoplasmic space, often located adjacent to nuclear specks. 10-20 paraspeckles are typically found in human cell nuclei.

Visualization

Interactive Views

Asymmetric Unit
	Complete Structure
		Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
		WebMol \| AstexViewer[tm]@PDBe (Java Applets, require no local installation except for Java; loading may be slow)
		STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
		RasMol (require local installation)
		Molscript (VRML) (requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)

	Ligands, Modified Residues, Ions
		B4P	[ RasMol \| Jena3D ]	+environment [ RasMol \| Jena3D ]

	Sites
		AC1	[ RasMol ]	+environment [ RasMol ]

	Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 3bho)

Biological Unit
	Complete Structure
		Biological Unit 1	[ Jena3D ]

Still Images

Databases and Analysis Tools

Databases

Access by PDB/NDB ID
	3bho
		Family and Domain Information	:	ProDom \| SYSTERS
		General Structural Information	:	GlycoscienceDB \| MMDB \| NDB \| OCA \| PDB \| PDBe \| PDBj \| PDBsum \| PDBWiki \| PQS \| PROTEOPEDIA
		Orientation in Membranes	:	OPM
		Protein Surface	:	SURFACE
		Secondary Structure	:	DSSP (structure derived) \| HSSP (homology derived)
		Structural Genomics	:	GeneCensus
		Structural Neighbours	:	CE \| VAST
		Structure Classification	:	CATH \| Dali \| SCOP
		Validation and Original Data	:	BMRB Data View \| BMRB Restraints Grid \| EDS \| PROCHECK \| RECOORD \| WHAT_CHECK

Access by UniProt ID/Accession number
	CPSF5_HUMAN \| O43809
		Comparative Protein Structure Models	:	ModBase
		Genomic Information	:	Ensembl
		Protein-protein Interaction	:	DIP
		Sequence, Family and Domain Information	:	InterPro \| Pfam \| SMART \| UniProtKB/SwissProt

Access by Enzyme Classificator (EC Number)
	(no 'Enzyme Classificator' available)
		General Enzyme Information	:	BRENDA \| EC-PDB \| Enzyme \| IntEnz
		Pathway	:	KEGG \| MetaCyc

Access by Disease Identifier (MIM ID)
	(no 'MIM ID' available)
		Disease Information	:	OMIM

Access by GenAge ID
	(no 'GenAge ID' available)
		Age Related Information	:	GenAge

Description