3ULH - JenaLib

Title :   CRYSTAL STRUCTURE OF A RNA BINDING DOMAIN OF THO COMPLEX SUBUNIT 4 PROTEIN (THOC4) FROM HOMO SAPIENS AT 2.54 A RESOLUTION

Authors :   Joint Center For Structural Genomics (Jcsg), Partnership For T-Cell Biology (Tcell)

Date :   10 Nov 11 (Deposition) - 07 Dec 11 (Release) - 21 Oct 15 (Revision)

Method :   X-RAY DIFFRACTION

Resolution :   2.54 Å

Chains :   Asym. Unit : A
Biol. Unit 1: A  (2x)

Keywords :   Nuclear Protein, Rna Binding, Tho Complex, Structural Genomics, Joint Center For Structural Genomics, Jcsg, Protein Structure Initiative, Psi-Biology, Rna Binding Protein, Partnership For T-Cell Biology

Reference :   Joint Center For Structural Genomics (Jcsg), Partnership For T-Cell Biology (Tcell)
Crystal Structure Of A Rna Binding Domain Of Tho Complex Subunit 4 Protein (Thoc4) From Homo Sapiens At 2.54 A Resolution
To Be Published

Molecule 1 - THO COMPLEX SUBUNIT 4
	Chains	:	A
	Engineered	:	YES
	Expression System	:	ESCHERICHIA COLI
	Expression System Plasmid	:	SPEEDET
	Expression System Strain	:	HK100
	Expression System Taxid	:	562
	Expression System Vector Type	:	PLASMID
	Fragment	:	RNA BINDING DOMAIN
	Gene	:	ALY, BC052302, BEF, THOC4
	Organism Common	:	HUMAN
	Organism Scientific	:	HOMO SAPIENS
	Organism Taxid	:	9606
	Synonym	:	THO4, ALLY OF AML-1 AND LEF-1, TRANSCRIPTIONAL COACTIVATOR ALY/REF, BZIP-ENHANCING FACTOR BEF

		Molecule 1
Asymmetric Unit	:	A
Biological Unit 1 (2x)	:	A

Summary Information (see also Sequences/Alignments below)

Asymmetric Unit (2, 4)

No.	Name	Count	Type	Full Name
1	MSE	2	Mod. Amino Acid	SELENOMETHIONINE
2	PO4	2	Ligand/Ion	PHOSPHATE ION

Biological Unit 1 (2, 8)

No.	Name	Count	Type	Full Name
1	MSE	4	Mod. Amino Acid	SELENOMETHIONINE
2	PO4	4	Ligand/Ion	PHOSPHATE ION
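
The biological unit counts follow from the asymmetric unit counts and the "(2x)" chain multiplicity listed under Chains. The short Python sketch below illustrates that arithmetic. It assumes that the pair in each heading, such as "(2, 8)", reads as (number of distinct entries, total count); the names `ASYM_UNIT_COUNTS`, `CHAIN_COPIES`, and `scale_counts` are hypothetical and chosen only for this illustration.

```python
# Counts copied from the asymmetric unit table above.
ASYM_UNIT_COUNTS = {"MSE": 2, "PO4": 2}

# "Biol. Unit 1: A (2x)": chain A occurs twice in the biological unit.
CHAIN_COPIES = 2


def scale_counts(counts: dict[str, int], copies: int) -> dict[str, int]:
    """Multiply every per-entry count by the number of chain copies."""
    return {name: n * copies for name, n in counts.items()}


biol_unit_counts = scale_counts(ASYM_UNIT_COUNTS, CHAIN_COPIES)

# Summary pair in the assumed (distinct entries, total count) reading.
summary = (len(biol_unit_counts), sum(biol_unit_counts.values()))
print(biol_unit_counts, summary)
```

Under that reading, the result agrees with the "Biological Unit 1 (2, 8)" heading and its table rows.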

Asymmetric Unit (2, 2)

No.	Name	Evidence	Residues	Description
1	AC1	SOFTWARE	ASN A:112 , GLY A:173 , ARG A:174 , PRO A:175 , PO4 A:201	BINDING SITE FOR RESIDUE PO4 A 200
2	AC2	SOFTWARE	ARG A:174 , PO4 A:200	BINDING SITE FOR RESIDUE PO4 A 201
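
The Residues column above packs each site member as "NAME CHAIN:NUMBER", separated by commas. A minimal Python sketch for splitting such a field into tuples is given below. The regular expression and the function name `parse_site_residues` are assumptions made for illustration, not part of JenaLib or the PDB format tooling, and the pattern does not handle insertion codes.

```python
import re

# Three-character residue name, whitespace, one-character chain ID, ':', residue number.
RESIDUE_PATTERN = re.compile(r"([A-Z0-9]{3})\s+([A-Za-z0-9]):(-?\d+)")


def parse_site_residues(field: str) -> list[tuple[str, str, int]]:
    """Return (residue name, chain ID, residue number) for each site member."""
    return [
        (name, chain, int(number))
        for name, chain, number in RESIDUE_PATTERN.findall(field)
    ]


# Residues field of site AC1, copied from the table above.
AC1 = "ASN A:112 , GLY A:173 , ARG A:174 , PRO A:175 , PO4 A:201"

if __name__ == "__main__":
    for residue in parse_site_residues(AC1):
        print(residue)
```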

(no "SS Bond" information available for 3ULH)

(no "Cis Peptide Bond" information available for 3ULH)

(no "SAP(SNP)/Variant" information available for 3ULH)

Asymmetric Unit (1, 1)

	PROSITE			UniProtKB		PDB
No.	ID	AC	Description	ID	Location	Count	Location
1	RRM	PS50102	Eukaryotic RNA Recognition Motif (RRM) profile.	THOC4_HUMAN	106-183	1	A:106-183

Biological Unit 1 (1, 2)

	PROSITE			UniProtKB		PDB
No.	ID	AC	Description	ID	Location	Count	Location
1	RRM	PS50102	Eukaryotic RNA Recognition Motif (RRM) profile.	THOC4_HUMAN	106-183	2	A:106-183

(no "Exon" information available for 3ULH)

Asymmetric Unit

Chain A from PDB  Type:PROTEIN  Length:84
 aligned with THOC4_HUMAN | Q86V81 from UniProtKB/Swiss-Prot  Length:257

    Alignment length:84
                                   109       119       129       139       149       159       169       179    
          THOC4_HUMAN   100 AGVETGGKLLVSNLDFGVSDADIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKADALKAMKQYNGVPLDGRPMNIQLVTS 183
               SCOP domains d3ulha_ A: automated matches                                                         SCOP domains
               CATH domains ------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author ......eeeeee......hhhhhhhhhhh...eeeeeeee.....eeeeeeeee.hhhhhhhhhhhhh..ee..ee.eeeee.. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE ------RRM  PDB: A:106-183 UniProt: 106-183                                           PROSITE
                 Transcript ------------------------------------------------------------------------------------ Transcript
                 3ulh A 100 AGVETGGKLLVSNLDFGVSDADIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKADALKAmKQYNGVPLDGRPmNIQLVTS 183
                                   109       119       129       139       149       159   |   169      |179    
                                                                                         163-MSE      176-MSE

Legend:		→ Mismatch	(orange background)
	-	→ Gap	(green background, '-', border residues have a numbering label)
		→ Modified Residue	(blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
	x	→ Chemical Group	(purple background, 'x', labelled with number + name, e.g. ACE or NH2)
	extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'
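
Because modified residues appear in lower case in the chain line, their residue numbers can be recovered from the sequence string and the number of its first residue. The Python sketch below shows one way to do that. It assumes contiguous numbering with no gaps or insertion codes, which holds for this chain (84 residues, 100 to 183); the function name `modified_residues` is hypothetical. Note that an 'x' marking a chemical group or an undefined code would also be reported.

```python
def modified_residues(sequence: str, first_number: int) -> list[tuple[int, str]]:
    """Return (residue number, letter) for every lower-case letter in the sequence."""
    return [
        (first_number + offset, letter)
        for offset, letter in enumerate(sequence)
        if letter.islower()
    ]


# Chain A sequence line as printed in the alignment above (residues 100-183).
CHAIN_A = (
    "AGVETGGKLLVSNLDFGVSDADIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKADALKA"
    "mKQYNGVPLDGRPmNIQLVTS"
)

if __name__ == "__main__":
    print(modified_residues(CHAIN_A, 100))
```

For this entry the two reported positions match the 163-MSE and 176-MSE labels under the alignment.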

Asymmetric Unit

Class: Alpha and beta proteins (a+b) (23004)

Fold: Ferredoxin-like (1795)

Superfamily: RNA-binding domain, RBD (289)

Family: Canonical RBD (214)

Protein domain: automated matches (24)

Human (Homo sapiens) [TaxId: 9606] (21)

d3ulha_

(no "CATH Domain" information available for 3ULH)

(no "Pfam Domain" information available for 3ULH)

Asymmetric Unit

Chain A (THOC4_HUMAN | Q86V81)

molecular function
	GO:0003723	RNA binding	Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
	GO:0003676	nucleic acid binding	Interacting selectively and non-covalently with any nucleic acid.
	GO:0000166	nucleotide binding	Interacting selectively and non-covalently with a nucleotide, any compound consisting of a nucleoside that is esterified with (ortho)phosphate or an oligophosphate at any hydroxyl group on the ribose or deoxyribose.
	GO:0005515	protein binding	Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
biological process
	GO:0006405	RNA export from nucleus	The directed movement of RNA from the nucleus to the cytoplasm.
	GO:0008380	RNA splicing	The process of removing sections of the primary RNA transcript to remove sequences not present in the mature form of the RNA and joining the remaining sections to form the mature form of the RNA.
	GO:0031124	mRNA 3'-end processing	Any process involved in forming the mature 3' end of an mRNA molecule.
	GO:0006406	mRNA export from nucleus	The directed movement of mRNA from the nucleus to the cytoplasm.
	GO:0006397	mRNA processing	Any process involved in the conversion of a primary mRNA transcript into one or more mature mRNA(s) prior to translation into polypeptide.
	GO:0000398	mRNA splicing, via spliceosome	The joining together of exons from one or more primary transcripts of messenger RNA (mRNA) and the excision of intron sequences, via a spliceosomal mechanism, so that mRNA consisting only of the joined exons is produced.
	GO:0051028	mRNA transport	The directed movement of mRNA, messenger ribonucleic acid, into, out of or within a cell, or between cells, by means of some agent such as a transporter or pore.
	GO:0001649	osteoblast differentiation	The process whereby a relatively unspecialized cell acquires the specialized features of an osteoblast, a mesodermal or neural crest cell that gives rise to bone.
	GO:0032786	positive regulation of DNA-templated transcription, elongation	Any process that activates or increases the frequency, rate or extent of transcription elongation, the extension of an RNA molecule after transcription initiation and promoter clearance by the addition of ribonucleotides catalyzed by a DNA-dependent RNA polymerase.
	GO:0000018	regulation of DNA recombination	Any process that modulates the frequency, rate or extent of DNA recombination, a DNA metabolic process in which a new genotype is formed by reassortment of genes resulting in gene combinations different from those that were present in the parents.
	GO:0031297	replication fork processing	The process in which a DNA replication fork that has stalled is restored to a functional state and replication is restarted. The stalling may be due to DNA damage, DNA secondary structure, bound proteins, dNTP shortage, or other causes.
	GO:0006369	termination of RNA polymerase II transcription	The process in which the synthesis of an RNA molecule by RNA polymerase II using a DNA template is completed.
	GO:0006810	transport	The directed movement of substances (such as macromolecules, small molecules, ions) or cellular components (such as complexes and organelles) into, out of or within a cell, or between cells, or within a multicellular organism by means of some agent such as a transporter, pore or motor protein.
	GO:0046784	viral mRNA export from host cell nucleus	The directed movement of intronless viral mRNA from the host nucleus to the cytoplasm for translation.
	GO:0016032	viral process	A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle.
cellular component
	GO:0071013	catalytic step 2 spliceosome	A spliceosomal complex that contains three snRNPs, including U5, bound to a splicing intermediate in which the first catalytic cleavage of the 5' splice site has occurred. The precise subunit composition differs significantly from that of the catalytic step 1, or activated, spliceosome, and includes many proteins in addition to those found in the associated snRNPs.
	GO:0005737	cytoplasm	All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
	GO:0005829	cytosol	The part of the cytoplasm that does not contain organelles but which does contain other particulate matter, such as protein complexes.
	GO:0035145	exon-exon junction complex	A multi-subunit complex deposited by the spliceosome upstream of messenger RNA exon-exon junctions. The exon-exon junction complex provides a binding platform for factors involved in mRNA export and nonsense-mediated mRNA decay.
	GO:0070062	extracellular exosome	A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
	GO:0043231	intracellular membrane-bounded organelle	Organized structure of distinctive morphology and function, bounded by a single or double lipid bilayer membrane and occurring within the cell. Includes the nucleus, mitochondria, plastids, vacuoles, and vesicles. Excludes the plasma membrane.
	GO:0016020	membrane	A lipid bilayer along with all the proteins and protein complexes embedded in it and attached to it.
	GO:0000784	nuclear chromosome, telomeric region	The terminal region of a linear nuclear chromosome that includes the telomeric DNA repeats and associated proteins.
	GO:0016607	nuclear speck	A discrete extra-nucleolar subnuclear domain, 20-50 in number, in which splicing factors are seen to be localized by immunofluorescence microscopy.
	GO:0005654	nucleoplasm	That part of the nuclear content other than the chromosomes or the nucleolus.
	GO:0005634	nucleus	A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
	GO:0005681	spliceosomal complex	Any of a series of ribonucleoprotein complexes that contain snRNA(s) and small nuclear ribonucleoproteins (snRNPs), and are formed sequentially during the spliceosomal splicing of one or more substrate RNAs, and which also contain the RNA substrate(s) from the initial target RNAs of splicing, the splicing intermediate RNA(s), to the final RNA products. During cis-splicing, the initial target RNA is a single, contiguous RNA transcript, whether mRNA, snoRNA, etc., and the released products are a spliced RNA and an excised intron, generally as a lariat structure. During trans-splicing, there are two initial substrate RNAs, the spliced leader RNA and a pre-mRNA.
	GO:0000346	transcription export complex	The transcription export (TREX) complex couples transcription elongation by RNA polymerase II to mRNA export. The complex associates with the polymerase and travels with it along the length of the transcribed gene. TREX is composed of the THO transcription elongation complex as well as other proteins that couple THO to mRNA export proteins. The TREX complex is known to be found in a wide range of eukaryotes, including S. cerevisiae and metazoans.

Access by PDB/NDB ID
	3ulh
		Family and Domain Information	:	ProDom | SYSTERS
		General Structural Information	:	GlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
		Orientation in Membranes	:	OPM
		Protein Surface	:	SURFACE
		Secondary Structure	:	DSSP (structure derived) | HSSP (homology derived)
		Structural Genomics	:	GeneCensus
		Structural Neighbours	:	CE | VAST
		Structure Classification	:	CATH | Dali | SCOP
		Validation and Original Data	:	BMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK

Access by UniProt ID/Accession number
	THOC4_HUMAN | Q86V81
		Comparative Protein Structure Models	:	ModBase
		Genomic Information	:	Ensembl
		Protein-protein Interaction	:	DIP
		Sequence, Family and Domain Information	:	InterPro | Pfam | SMART | UniProtKB/SwissProt

Access by Enzyme Classificator (EC Number)
	(no 'Enzyme Classificator' available)
		General Enzyme Information	:	BRENDA | EC-PDB | Enzyme | IntEnz
		Pathway	:	KEGG | MetaCyc

Access by Disease Identifier (MIM ID)
	(no 'MIM ID' available)
		Disease Information	:	OMIM

Access by GenAge ID
	(no 'GenAge ID' available)
		Age Related Information	:	GenAge

Access by PDB/NDB ID
		Domain Information	:	XDom
		Interatomic Contacts of Structural Units	:	CSU
		Ligand-protein Contacts	:	LPC
		Protein Cavities	:	castP
		Sequence and Secondary Structure	:	PDBCartoon
		Structure Alignment	:	STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
		Structure and Sequence Browser	:	STING

Access by UniProt ID/Accession number
	THOC4_HUMAN | Q86V81
		Protein Disorder Prediction	:	DisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

(no "Entries Sharing at Least One Protein Chain" available for 3ULH)

(no "Related Entries Specified in the PDB File" available for 3ULH)

Description

Compounds

Structural Features

Chains, Units

Ligands, Modified Residues, Ions (2, 4)

Sites (2, 2)

SS Bonds (0, 0)

Cis Peptide Bonds (0, 0)

Sequence-Structure Mapping

SAPs(SNPs)/Variants (0, 0)

PROSITE Motifs (1, 1)

Exons (0, 0)

Sequences/Alignments

Classification and Annotation

SCOP Domains (1, 1)

CATH Domains (0, 0)

Pfam Domains (0, 0)

Gene Ontology (32, 32)

Visualization

Interactive Views

Still Images

Databases and Analysis Tools

Databases

Analysis Tools

Related Entries

Entries Sharing at Least One Protein Chain (UniProt ID)

Related Entries Specified in the PDB File