2CPQ - JenaLib

Title :   SOLUTION STRUCTURE OF THE N-TERMINAL KH DOMAIN OF HUMAN FXR1

Authors :   T. Nagata, Y. Muto, M. Inoue, T. Kigawa, T. Terada, M. Shirouzu, S. Yokoyama, Riken Structural Genomics/Proteomics Initiative (Rsgi)

Date :   19 May 05 (Deposition) - 19 Nov 05 (Release) - 24 Feb 09 (Revision)

Method :   SOLUTION NMR

Resolution :   NOT APPLICABLE

Chains :   NMR Structure  : A  (20x)

Keywords :   Kh Domain, Structural Genomics, Nppsfa, National Project On Protein Structural And Functional Analyses, Riken Structural Genomics/Proteomics Initiative, Rsgi, Rna Binding Protein

Reference :   T. Nagata, Y. Muto, M. Inoue, T. Kigawa, T. Terada, M. Shirouzu, S. Yokoyama
Solution Structure Of The N-Terminal Kh Domain Of Human Fxr1
To Be Published
(for further references see the PDB file header)

Molecule 1 - FRAGILE X MENTAL RETARDATION SYNDROME RELATED PROTEIN 1, ISOFORM B'
	Chains	:	A
	Engineered	:	YES
	Expression System Plasmid	:	P040621-02
	Expression System Vector Type	:	PLASMID
	Fragment	:	KH DOMAIN
	Gene	:	FXR1
	Organism Common	:	HUMAN
	Organism Scientific	:	HOMO SAPIENS
	Organism Taxid	:	9606
	Other Details	:	CELL-FREE PROTEIN SYNTHESIS


NMR Structure (20x)	:

Summary Information (see also Sequences/Alignments below)

(no "Ligand,Modified Residues,Ions" information available for 2CPQ)

(no "Site" information available for 2CPQ)

(no "SS Bond" information available for 2CPQ)

(no "Cis Peptide Bond" information available for 2CPQ)

SAPs(SNPs)/Variants - NMR Structure (2, 2)

No.	Source	Variant ID	Variant			UniProt ID	Status	dbSNP ID	PDB Chain	PDB Variant
1	UniProt	VAR_036050	A	233	T	FXR1_HUMAN	Unclassified	---	A	A	233	T
2	CancerSNP	VAR_FXR1_HUMAN_CCDS3238_1_01 *	A	233	T	FXR1_HUMAN	Disease (Breast cancer)	---	A	A	233	T
	* ID not provided by source

PROSITE Motifs - NMR Structure (1, 2)

No.	PROSITE ID	PROSITE AC	Description	UniProtKB ID	UniProtKB Location	PDB Count	PDB Location
1	KH_TYPE_1	PS50084	Type-1 KH domain profile.	FXR1_HUMAN	218-279 281-351	2	A:218-279 A:281-295

Exons - NMR Structure (5, 5)

No.	Transcript ID	Exon	Exon ID	Genome Location	Length (bp)	UniProtKB ID	UniProtKB Location	UniProtKB Length	PDB Count	PDB Location	PDB Length
1.3a	ENST00000357559	3a	ENSE00001862123	chr3:180630090-180630524	435	FXR1_HUMAN	1-17	17	0	-	-
1.8b	ENST00000357559	8b	ENSE00001730554	chr3:180651122-180651174	53	FXR1_HUMAN	18-35	18	0	-	-
1.9b	ENST00000357559	9b	ENSE00001708989	chr3:180652926-180653019	94	FXR1_HUMAN	35-66	32	0	-	-
1.10a	ENST00000357559	10a	ENSE00001639975	chr3:180665653-180665724	72	FXR1_HUMAN	67-90	24	0	-	-
1.11	ENST00000357559	11	ENSE00000780635	chr3:180666135-180666283	149	FXR1_HUMAN	91-140	50	0	-	-
1.12c	ENST00000357559	12c	ENSE00000780636	chr3:180666509-180666602	94	FXR1_HUMAN	140-171	32	1	A:205-206	2
1.13d	ENST00000357559	13d	ENSE00000780637	chr3:180667015-180667131	117	FXR1_HUMAN	172-210	39	1	A:207-211	5
1.14a	ENST00000357559	14a	ENSE00000780638	chr3:180669086-180669256	171	FXR1_HUMAN	211-267	57	1	A:212-267	56
1.15a	ENST00000357559	15a	ENSE00000780639	chr3:180671550-180671628	79	FXR1_HUMAN	268-294	27	1	A:268-290	23
1.16c	ENST00000357559	16c	ENSE00000780640	chr3:180675607-180675716	110	FXR1_HUMAN	294-330	37	1	A:291-295 (gaps)	15
1.17	ENST00000357559	17	ENSE00000780641	chr3:180679256-180679342	87	FXR1_HUMAN	331-359	29	0	-	-
1.18a	ENST00000357559	18a	ENSE00000795164	chr3:180680671-180680728	58	FXR1_HUMAN	360-379	20	0	-	-
1.18d	ENST00000357559	18d	ENSE00001127668	chr3:180680816-180680878	63	FXR1_HUMAN	379-400	22	0	-	-
1.19a	ENST00000357559	19a	ENSE00000780643	chr3:180685839-180686042	204	FXR1_HUMAN	400-468	69	0	-	-
1.20a	ENST00000357559	20a	ENSE00000780644	chr3:180687946-180688146	201	FXR1_HUMAN	468-535	68	0	-	-
1.22	ENST00000357559	22	ENSE00000780645	chr3:180693101-180693192	92	FXR1_HUMAN	535-565	31	0	-	-
1.23i	ENST00000357559	23i	ENSE00001920434	chr3:180693910-180694950	1041	FXR1_HUMAN	566-621	56	0	-	-
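
The exon lengths listed in the Ensembl columns above appear to be inclusive spans of the genome coordinates (end - start + 1, in base pairs). A minimal Python sketch of that check follows. The function name `exon_length` and the `chrN:start-end` pattern are assumptions made for illustration; they are not part of JenaLib or Ensembl.

```python
import re


def exon_length(location: str) -> int:
    """Return the inclusive length in bp of a 'chrN:start-end' genome location."""
    match = re.fullmatch(r"chr\w+:(\d+)-(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised genome location: {location!r}")
    start, end = (int(group) for group in match.groups())
    # Both coordinates are counted, hence the +1.
    return end - start + 1


# A few rows copied from the exon table: (genome location, listed length).
rows = {
    "1.3a": ("chr3:180630090-180630524", 435),
    "1.8b": ("chr3:180651122-180651174", 53),
    "1.14a": ("chr3:180669086-180669256", 171),
}

for exon, (location, listed) in rows.items():
    # Each computed span should agree with the Length column.
    assert exon_length(location) == listed, exon
```

The same arithmetic applies to the UniProtKB ranges, for example 211-267 gives 57 residues.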

Sequences/Alignments - NMR Structure

Chain A from PDB  Type:PROTEIN  Length:91
 aligned with FXR1_HUMAN | P51114 from UniProtKB/Swiss-Prot  Length:621

    Alignment length:158
                                   164       174       184       194       204       214       224       234       244       254       264       274       284       294       304        
           FXR1_HUMAN   155 GACRIFYHPETTQLMILSASEATVKRVNILSDMHLRSIRTKLMLMSRNEEATKHLECTKQLAAAFHEEFVVREDLMGLAIGTHGSNIQQARKVPGVTAIELDEDTGTFRIYGESADAVKKARGFLEFVEDFIQVPRNLVGKVIGKNGKVIQEIVDKSG 312
               SCOP domains ---------------------------------------------------------d2cpqa1 A:212-289                                                             ----------------------- SCOP domains
               CATH domains -------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ..---------------.....-----------------------------------.......eeeeee....hhhhhh....hhhhhhh....eeeeeee....eeeeee..hhhhhhhh..............-------...----------.. Sec.struct. author
             SAPs(SNPs) (1) ------------------------------------------------------------------------------T------------------------------------------------------------------------------- SAPs(SNPs) (1)
             SAPs(SNPs) (2) ------------------------------------------------------------------------------T------------------------------------------------------------------------------- SAPs(SNPs) (2)
                    PROSITE ---------------------------------------------------------------KH_TYPE_1  PDB: A:218-279 UniProt: 218-279                    -KH_TYPE_1  PDB: A:281-295        PROSITE
           Transcript 1 (1) Exon 1.12c       Exon 1.13d  PDB: A:207-211 [INCOMPLETE]Exon 1.14a  PDB: A:212-267 UniProt: 211-267 [INCOMPLETE] Exon 1.15a  PDB: A:268-290 ------------------ Transcript 1 (1)
           Transcript 1 (2) -------------------------------------------------------------------------------------------------------------------------------------------Exon 1.16c          Transcript 1 (2)
                 2cpq A 205 GS---------------SGSSG-----------------------------------TKQLAAAFHEEFVVREDLMGLAIGTHGSNIQQARKVPGVTAIELDEDTGTFRIYGESADAVKKARGFLEFVEDFIQVPS-------GPS----------SG 295
                             |       -       209 |       -         -         -       214       224       234       244       254       264       274       284     |   -   | |   -      | 
                             |             207 211                                 212                                                                           290     291 |        294 
                           206                                                                                                                                             293

Legend:		→ Mismatch	(orange background)
	-	→ Gap	(green background, '-', border residues have a numbering label)
		→ Modified Residue	(blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
	x	→ Chemical Group	(purple background, 'x', labelled with number + name, e.g. ACE or NH2)
	extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

(no "CATH Domain" information available for 2CPQ)

(no "Pfam Domain" information available for 2CPQ)

Gene Ontology - NMR Structure

Chain A (FXR1_HUMAN | P51114)

molecular function
	GO:0002151	G-quadruplex RNA binding	Interacting selectively and non-covalently with G-quadruplex RNA structures, in which groups of four guanines adopt a flat, cyclic hydrogen-bonding arrangement known as a guanine tetrad.
	GO:0003723	RNA binding	Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
	GO:0033592	RNA strand annealing activity	Facilitates the base-pairing of complementary single-stranded RNA.
	GO:0003730	mRNA 3'-UTR binding	Interacting selectively and non-covalently with the 3' untranslated region of an mRNA molecule.
	GO:0003729	mRNA binding	Interacting selectively and non-covalently with messenger RNA (mRNA), an intermediate molecule between DNA and protein. mRNA includes UTR and coding sequences, but does not contain introns.
	GO:0003676	nucleic acid binding	Interacting selectively and non-covalently with any nucleic acid.
	GO:0005515	protein binding	Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
	GO:0046982	protein heterodimerization activity	Interacting selectively and non-covalently with a nonidentical protein to form a heterodimer.
	GO:0042803	protein homodimerization activity	Interacting selectively and non-covalently with an identical protein to form a homodimer.
biological process
	GO:0006915	apoptotic process	A programmed cell death process which begins when a cell receives an internal (e.g. DNA damage) or external signal (e.g. an extracellular death ligand), and proceeds through a series of biochemical events (signaling pathway phase) which trigger an execution phase. The execution phase is the last step of an apoptotic process, and is typically characterized by rounding-up of the cell, retraction of pseudopodes, reduction of cellular volume (pyknosis), chromatin condensation, nuclear fragmentation (karyorrhexis), plasma membrane blebbing and fragmentation of the cell into apoptotic bodies. When the execution phase is completed, the cell has died.
	GO:0030154	cell differentiation	The process in which relatively unspecialized cells, e.g. embryonic or regenerative cells, acquire specialized structural and/or functional features that characterize the cells, tissues, or organs of the mature organism or some other relatively stable phase of the organism's life history. Differentiation includes the processes involved in commitment of a cell to a specific fate and its subsequent development to the mature state.
	GO:0007275	multicellular organism development	The biological process whose specific outcome is the progression of a multicellular organism over time from an initial condition (e.g. a zygote or a young adult) to a later condition (e.g. a multicellular animal or an aged adult).
	GO:0007517	muscle organ development	The process whose specific outcome is the progression of the muscle over time, from its formation to the mature structure. The muscle is an organ consisting of a tissue made up of various elongated cells that are specialized to contract and thus to produce movement and mechanical work.
	GO:0017148	negative regulation of translation	Any process that stops, prevents, or reduces the frequency, rate or extent of the chemical reactions and pathways resulting in the formation of proteins by the translation of mRNA or circRNA.
	GO:2000637	positive regulation of gene silencing by miRNA	Any process that activates or increases the frequency, rate or extent of gene silencing by miRNA.
cellular component
	GO:0030424	axon	The long process of a neuron that conducts nerve impulses, usually away from the cell body to the terminals and varicosities, which are sites of storage and release of neurotransmitter.
	GO:0043034	costamere	Regular periodic submembranous arrays of vinculin in skeletal and cardiac muscle cells; these arrays link Z-discs to the sarcolemma and are associated with links to the extracellular matrix.
	GO:0005737	cytoplasm	All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
	GO:0030425	dendrite	A neuron projection that has a short, tapering, often branched, morphology, receives and integrates signals from other neurons or from sensory stimuli, and conducts a nerve impulse towards the axon or the cell body. In most neurons, the impulse is conveyed from dendrites to axon via the cell body, but in some types of unipolar neuron, the impulse does not travel via the cell body.
	GO:0043197	dendritic spine	A small, membranous protrusion from a dendrite that forms a postsynaptic compartment, typically receiving input from a single presynapse. They function as partially isolated biochemical and electrical compartments. Spine morphology is variable, including "thin", "stubby", "mushroom", and "branched", with a continuum of intermediate morphologies. They typically terminate in a bulb shape, linked to the dendritic shaft by a restriction. Spine remodeling is thought to be involved in synaptic plasticity.
	GO:0016020	membrane	A lipid bilayer along with all the proteins and protein complexes embedded in it and attached to it.
	GO:0005730	nucleolus	A small, dense body one or more of which are present in the nucleus of eukaryotic cells. It is rich in RNA and protein, is not bounded by a limiting membrane, and is not seen during mitosis. Its prime function is the transcription of the nucleolar DNA into 45S ribosomal-precursor RNA, the processing of this RNA into 5.8S, 18S, and 28S components of ribosomal RNA, and the association of these components with 5S RNA and proteins synthesized outside the nucleolus. This association results in the formation of ribonucleoprotein precursors; these pass into the cytoplasm and mature into the 40S and 60S subunits of the ribosome.
	GO:0005634	nucleus	A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
	GO:0048471	perinuclear region of cytoplasm	Cytoplasm situated near, or occurring around, the nucleus.
	GO:0005844	polysome	A multiribosomal structure representing a linear array of ribosomes held together by messenger RNA. They represent the active complexes in cellular protein synthesis and are able to incorporate amino acids into polypeptides both in vivo and in vitro.
	GO:0035770	ribonucleoprotein granule	A non-membranous macromolecular complex containing proteins and translationally silenced mRNAs. RNA granules contain proteins that control the localization, stability, and translation of their RNA cargo. Different types of RNA granules (RGs) exist, depending on the cell type and cellular conditions.

Access by PDB/NDB ID
	2cpq
		Family and Domain Information	:	ProDom | SYSTERS
		General Structural Information	:	GlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
		Orientation in Membranes	:	OPM
		Protein Surface	:	SURFACE
		Secondary Structure	:	DSSP (structure derived) | HSSP (homology derived)
		Structural Genomics	:	GeneCensus
		Structural Neighbours	:	CE | VAST
		Structure Classification	:	CATH | Dali | SCOP
		Validation and Original Data	:	BMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK

Access by UniProt ID/Accession number
	FXR1_HUMAN | P51114
		Comparative Protein Structure Models	:	ModBase
		Genomic Information	:	Ensembl
		Protein-protein Interaction	:	DIP
		Sequence, Family and Domain Information	:	InterPro | Pfam | SMART | UniProtKB/SwissProt

Access by Enzyme Classificator (EC Number)
	(no 'Enzyme Classificator' available)
		General Enzyme Information	:	BRENDA | EC-PDB | Enzyme | IntEnz
		Pathway	:	KEGG | MetaCyc

Access by Disease Identifier (MIM ID)
	(no 'MIM ID' available)
		Disease Information	:	OMIM

Access by GenAge ID
	(no 'GenAge ID' available)
		Age Related Information	:	GenAge

Access by PDB/NDB ID
		Domain Information	:	XDom
		Interatomic Contacts of Structural Units	:	CSU
		Ligand-protein Contacts	:	LPC
		Protein Cavities	:	castP
		Sequence and Secondary Structure	:	PDBCartoon
		Structure Alignment	:	STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
		Structure and Sequence Browser	:	STING

Access by UniProt ID/Accession number
	FXR1_HUMAN | P51114
		Protein Disorder Prediction	:	DisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

UniProtKB/Swiss-Prot
	FXR1_HUMAN | P51114	:	3kuf 3o8v

(no "Related Entries Specified in the PDB File" available for 2CPQ)
