(-) Description

Title :  SOLUTION STRUCTURES OF THE FN3 DOMAIN OF HUMAN COLLAGEN ALPHA-1(XX) CHAIN
 
Authors :  M. Sato, N. Tochio, S. Koshiba, S. Watanabe, T. Harada, T. Kigawa, S. Yokoyama, Riken Structural Genomics/Proteomics Initiative (Rsgi)
Date :  15 Feb 07  (Deposition) - 21 Aug 07  (Release) - 24 Feb 09  (Revision)
Method :  SOLUTION NMR
Resolution :  NOT APPLICABLE
Chains :  NMR Structure  :  A  (20x)
Keywords :  Kiaa1510, Structural Genomics, Nppsfa, National Project On Protein Structural And Functional Analyses, Riken Structural Genomics/Proteomics Initiative, Rsgi, Signaling Protein
 
Reference :  M. Sato, N. Tochio, S. Koshiba, S. Watanabe, T. Harada, T. Kigawa, S. Yokoyama
Solution Structures Of The Fn3 Domain Of Human Collagen Alpha-1(Xx) Chain
To Be Published
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - COLLAGEN ALPHA-1(XX) CHAIN
    Chains :  A
    Engineered :  YES
    Expression System Plasmid :  P051216-01
    Expression System Vector Type :  PLASMID
    Fragment :  FIBRONECTIN TYPE III DOMAIN
    Gene :  COL20A1
    Organism Common :  HUMAN
    Organism Scientific :  HOMO SAPIENS
    Organism Taxid :  9606
    Other Details :  CELL-FREE PROTEIN SYNTHESIS

 Structural Features

(-) Chains, Units

  
NMR Structure (20x)

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (0, 0)

(no "Ligand,Modified Residues,Ions" information available for 2EE3)

(-) Sites  (0, 0)

(no "Site" information available for 2EE3)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 2EE3)

(-) Cis Peptide Bonds  (1, 20)

NMR Structure
No. | Model | Residues
1 | 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 | Gly A:65 - Pro A:66
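
A cis peptide bond is one whose omega torsion angle (CA, C, N, CA across two consecutive residues) lies near 0 degrees rather than near 180 degrees. The sketch below shows how such a bond could be flagged from atomic coordinates. The `dihedral` and `is_cis` helpers, the 30 degree cutoff, and the toy coordinates are illustrative assumptions, not the procedure used to build the table above.

```python
import math

def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees defined by four 3D points."""
    def sub(a, b):
        return [a[i] - b[i] for i in range(3)]

    def dot(a, b):
        return sum(a[i] * b[i] for i in range(3))

    def cross(a, b):
        return [a[1] * b[2] - a[2] * b[1],
                a[2] * b[0] - a[0] * b[2],
                a[0] * b[1] - a[1] * b[0]]

    b0, b1, b2 = sub(p1, p0), sub(p2, p1), sub(p3, p2)
    n1, n2 = cross(b0, b1), cross(b1, b2)      # normals of the two planes
    b1_len = math.sqrt(dot(b1, b1))
    b1_hat = [c / b1_len for c in b1]
    m1 = cross(n1, b1_hat)                     # completes a frame with n1 and b1
    return math.degrees(math.atan2(dot(m1, n2), dot(n1, n2)))

def is_cis(ca_prev, c_prev, n_next, ca_next, cutoff=30.0):
    """True if omega is within `cutoff` degrees of 0 (assumed cutoff)."""
    return abs(dihedral(ca_prev, c_prev, n_next, ca_next)) < cutoff

# Toy planar geometries (hypothetical coordinates, not taken from 2EE3)
cis_atoms = [(0, 1, 0), (0, 0, 0), (1, 0, 0), (1, 1, 0)]
trans_atoms = [(0, 1, 0), (0, 0, 0), (1, 0, 0), (1, -1, 0)]
print(is_cis(*cis_atoms), is_cis(*trans_atoms))
```

With real data, the four points would be the CA and C atoms of Gly A:65 and the N and CA atoms of Pro A:66, read from each of the 20 models.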

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2EE3)

(-) PROSITE Motifs  (1, 2)

NMR Structure (1, 2)
No. | PROSITE ID | PROSITE AC | Description | UniProtKB ID | UniProtKB Location | PDB Count | PDB Location
1 | FN3 | PS50853 | Fibronectin type-III domain profile. | COKA1_HUMAN | 28-119; 469-559; 379-468; 560-647; 649-738; 743-833 | 2 | -; -; -; A:11-98; A:100-108; -

(-) Exons   (4, 4)

NMR Structure (4, 4)
No. | Ensembl Transcript ID | Exon | Exon ID | Genome Location | Length (bp) | UniProtKB ID | UniProtKB Location | UniProtKB Length | PDB Count | PDB Location | PDB Length
1.1a | ENST00000358894 | 1a | ENSE00001410478 | chr20:61924538-61924627 | 90 | COKA1_HUMAN | - | 0 | 0 | - | -
1.2b | ENST00000358894 | 2b | ENSE00002172896 | chr20:61926450-61926541 | 92 | COKA1_HUMAN | 1-28 | 28 | 0 | - | -
1.3 | ENST00000358894 | 3 | ENSE00000991860 | chr20:61929262-61929372 | 111 | COKA1_HUMAN | 28-65 | 38 | 0 | - | -
1.4 | ENST00000358894 | 4 | ENSE00000991861 | chr20:61936769-61936912 | 144 | COKA1_HUMAN | 65-113 | 49 | 0 | - | -
1.5a | ENST00000358894 | 5a | ENSE00001548158 | chr20:61937233-61937391 | 159 | COKA1_HUMAN | 113-166 | 54 | 0 | - | -
1.6 | ENST00000358894 | 6 | ENSE00000856567 | chr20:61938842-61939000 | 159 | COKA1_HUMAN | 166-219 | 54 | 0 | - | -
1.7 | ENST00000358894 | 7 | ENSE00000663386 | chr20:61939323-61939442 | 120 | COKA1_HUMAN | 219-259 | 41 | 0 | - | -
1.8 | ENST00000358894 | 8 | ENSE00000663387 | chr20:61939894-61940058 | 165 | COKA1_HUMAN | 259-314 | 56 | 0 | - | -
1.9 | ENST00000358894 | 9 | ENSE00000856568 | chr20:61940685-61940849 | 165 | COKA1_HUMAN | 314-369 | 56 | 0 | - | -
1.10b | ENST00000358894 | 10b | ENSE00000856569 | chr20:61941110-61941267 | 158 | COKA1_HUMAN | 369-421 | 53 | 0 | - | -
1.11 | ENST00000358894 | 11 | ENSE00000663389 | chr20:61941733-61941862 | 130 | COKA1_HUMAN | 422-465 | 44 | 0 | - | -
1.12 | ENST00000358894 | 12 | ENSE00000663390 | chr20:61942746-61942891 | 146 | COKA1_HUMAN | 465-513 | 49 | 0 | - | -
1.13 | ENST00000358894 | 13 | ENSE00000663391 | chr20:61942977-61943100 | 124 | COKA1_HUMAN | 514-555 | 42 | 1 | A:1-7 | 7
1.14 | ENST00000358894 | 14 | ENSE00000663392 | chr20:61943268-61943407 | 140 | COKA1_HUMAN | 555-601 | 47 | 1 | A:8-52 | 45
1.15a | ENST00000358894 | 15a | ENSE00000663393 | chr20:61943772-61943901 | 130 | COKA1_HUMAN | 602-645 | 44 | 1 | A:53-96 | 44
1.16 | ENST00000358894 | 16 | ENSE00000663394 | chr20:61944144-61944286 | 143 | COKA1_HUMAN | 645-692 | 48 | 1 | A:96-108 (gaps) | 30
1.17 | ENST00000358894 | 17 | ENSE00000663395 | chr20:61944469-61944601 | 133 | COKA1_HUMAN | 693-737 | 45 | 0 | - | -
1.18 | ENST00000358894 | 18 | ENSE00000663396 | chr20:61945095-61945243 | 149 | COKA1_HUMAN | 737-786 | 50 | 0 | - | -
1.19 | ENST00000358894 | 19 | ENSE00000663397 | chr20:61945424-61945553 | 130 | COKA1_HUMAN | 787-830 | 44 | 0 | - | -
1.20b | ENST00000358894 | 20b | ENSE00000856570 | chr20:61946756-61946791 | 36 | COKA1_HUMAN | 830-842 | 13 | 0 | - | -
1.21 | ENST00000358894 | 21 | ENSE00000409384 | chr20:61947905-61948043 | 139 | COKA1_HUMAN | 842-888 | 47 | 0 | - | -
1.22a | ENST00000358894 | 22a | ENSE00000663399 | chr20:61950410-61950552 | 143 | COKA1_HUMAN | 888-936 | 49 | 0 | - | -
1.22d | ENST00000358894 | 22d | ENSE00000663400 | chr20:61950839-61950948 | 110 | COKA1_HUMAN | 936-972 | 37 | 0 | - | -
1.23a | ENST00000358894 | 23a | ENSE00000663401 | chr20:61951391-61951549 | 159 | COKA1_HUMAN | 973-1025 | 53 | 0 | - | -
1.24a | ENST00000358894 | 24a | ENSE00000663402 | chr20:61951643-61951720 | 78 | COKA1_HUMAN | 1026-1051 | 26 | 0 | - | -
1.25 | ENST00000358894 | 25 | ENSE00000663403 | chr20:61952365-61952451 | 87 | COKA1_HUMAN | 1052-1080 | 29 | 0 | - | -
1.27 | ENST00000358894 | 27 | ENSE00000663404 | chr20:61953410-61953463 | 54 | COKA1_HUMAN | 1081-1098 | 18 | 0 | - | -
1.28c | ENST00000358894 | 28c | ENSE00000663405 | chr20:61956793-61956846 | 54 | COKA1_HUMAN | 1099-1116 | 18 | 0 | - | -
1.29 | ENST00000358894 | 29 | ENSE00000663406 | chr20:61957020-61957073 | 54 | COKA1_HUMAN | 1117-1134 | 18 | 0 | - | -
1.30 | ENST00000358894 | 30 | ENSE00000856571 | chr20:61957448-61957501 | 54 | COKA1_HUMAN | 1135-1152 | 18 | 0 | - | -
1.31 | ENST00000358894 | 31 | ENSE00000991863 | chr20:61958104-61958175 | 72 | COKA1_HUMAN | 1153-1176 | 24 | 0 | - | -
1.32b | ENST00000358894 | 32b | ENSE00001173757 | chr20:61959304-61959339 | 36 | COKA1_HUMAN | 1177-1188 | 12 | 0 | - | -
1.33a | ENST00000358894 | 33a | ENSE00001563363 | chr20:61959431-61959479 | 49 | COKA1_HUMAN | 1189-1205 | 17 | 0 | - | -
1.33e | ENST00000358894 | 33e | ENSE00002200481 | chr20:61959683-61959850 | 168 | COKA1_HUMAN | 1205-1261 | 57 | 0 | - | -
1.34b | ENST00000358894 | 34b | ENSE00001385784 | chr20:61960937-61961013 | 77 | COKA1_HUMAN | 1261-1284 | 24 | 0 | - | -
1.35b | ENST00000358894 | 35b | ENSE00001819106 | chr20:61962072-61962285 | 214 | COKA1_HUMAN | - | 0 | 0 | - | -
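
The exon lengths listed above appear to follow from the genome coordinates as end minus start plus one, with both ends inclusive. The following sketch checks that reading for a single location string; `exon_length` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def exon_length(location):
    """Length in base pairs of a 'chrN:start-end' genome location (inclusive ends)."""
    match = re.fullmatch(r"(chr\w+):(\d+)-(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised genome location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1

# Exon 1.13 (ENSE00000663391); the listed length is 124
print(exon_length("chr20:61942977-61943100"))
```

The same check can be run over the other rows to catch digits that were split at the wrong place when the genome location and length columns were fused.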

(-) Sequences/Alignments

NMR Structure
Chain A from PDB  Type:PROTEIN  Length:108
 aligned with COKA1_HUMAN | Q9P218 from UniProtKB/Swiss-Prot  Length:1284

    Alignment length:134
                                   550       560       570       580       590       600       610       620       630       640       650       660       670    
          COKA1_HUMAN   541 GPEGSEARGIRARTPTLAPPRHLGFSDVSHDAARVFWEGAPRPVRLVRVTYVSSEGGHSGQTEAPGNATSATLGPLSSSTTYTVRVTCLYPGGGSSTLTGRVTTKKAPSPSQLSMTELPGDAVQLAWVAAAPSG 674
               SCOP domains d2ee3a_          A: automated matches                                                                                                  SCOP domains
               CATH domains -------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .......---------......eeee......eeeee........eeeeeeee........eeee....eeee.......eeeeeeeee.....eeeeeeeee........-----------------...... Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE FN3  PDB: -        FN3  PDB: A:11-98 UniProt: 560-647                                                      -FN3  PDB: A:100-108        PROSITE
           Transcript 1 (1) Exon 1.13      ----------------------------------------------Exon 1.15a  PDB: A:53-96 UniProt: 602-645   ----------------------------- Transcript 1 (1)
           Transcript 1 (2) --------------Exon 1.14  PDB: A:8-52 UniProt: 555-601        -------------------------------------------Exon 1.16  PDB: A:96-108 (gaps Transcript 1 (2)
                 2ee3 A   1 GSSGSSG---------LAPPRHLGFSDVSHDAARVFWEGAPRPVRLVRVTYVSSEGGHSGQTEAPGNATSAMLGPLSSSTTYTVRVTCLYPGGGSSTLTGRVTTKKAPSPS-----------------SGPSSG 108
                                  |  -      | 11        21        31        41        51        61        71        81        91       101|        -       104    
                                  7         8                                                                                           102               103     

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'
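
In the alignment above, PDB residues A:8-102 form one contiguous block against COKA1_HUMAN, and the FN3 annotation (A:11-98 against UniProt 560-647) implies a constant numbering offset of 549 within it. The sketch below turns that observation into a small lookup. `pdb_to_uniprot` and the constants are illustrative assumptions read off this page; residues outside the block sit next to gaps in the alignment and are deliberately not handled.

```python
# Offset between chain A numbering and UniProt Q9P218 numbering,
# taken from the FN3 annotation: 560 - 11 = 549 (assumed constant over the block).
OFFSET = 549

# First and last PDB residues of the contiguous aligned block (read from the alignment).
FIRST_ALIGNED, LAST_ALIGNED = 8, 102

def pdb_to_uniprot(resnum):
    """Map a chain A residue number to UniProt numbering, or None outside the block."""
    if FIRST_ALIGNED <= resnum <= LAST_ALIGNED:
        return resnum + OFFSET
    return None

for resnum in (11, 65, 66, 98, 105):
    print(resnum, pdb_to_uniprot(resnum))
```

Under this mapping, the cis peptide bond Gly A:65 - Pro A:66 would fall at UniProt positions 614 and 615.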

 Classification and Annotation

(-) SCOP Domains  (1, 1)

NMR Structure

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 2EE3)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2EE3)

(-) Gene Ontology  (6, 6)

NMR Structure
Chain A   (COKA1_HUMAN | Q9P218)
molecular function
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
cellular component
    GO:0005581    collagen trimer    A protein complex consisting of three collagen chains assembled into a left-handed triple helix. These trimers typically assemble into higher order structures.
    GO:0005788    endoplasmic reticulum lumen    The volume enclosed by the membranes of the endoplasmic reticulum.
    GO:0031012    extracellular matrix    A structure lying external to one or more cells, which provides structural support for cells or tissues.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0005615    extracellular space    That part of a multicellular organism outside the cells proper, usually taken to be outside the plasma membranes, and occupied by fluid.

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2ee3
    Family and Domain Information :  ProDom | SYSTERS
    General Structural Information :  GlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in Membranes :  OPM
    Protein Surface :  SURFACE
    Secondary Structure :  DSSP (structure derived) | HSSP (homology derived)
    Structural Genomics :  GeneCensus
    Structural Neighbours :  CE | VAST
    Structure Classification :  CATH | Dali | SCOP
    Validation and Original Data :  BMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK

Access by UniProt ID/Accession number
  COKA1_HUMAN | Q9P218
    Comparative Protein Structure Models :  ModBase
    Genomic Information :  Ensembl
    Protein-protein Interaction :  DIP
    Sequence, Family and Domain Information :  InterPro | Pfam | SMART | UniProtKB/SwissProt

Access by Enzyme Classification   (EC Number)
  (no 'Enzyme Classification' available)
    General Enzyme Information :  BRENDA | EC-PDB | Enzyme | IntEnz
    Pathway :  KEGG | MetaCyc

Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease Information :  OMIM

Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related Information :  GenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain Information :  XDom
    Interatomic Contacts of Structural Units :  CSU
    Ligand-protein Contacts :  LPC
    Protein Cavities :  castP
    Sequence and Secondary Structure :  PDBCartoon
    Structure Alignment :  STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence Browser :  STING

Access by UniProt ID/Accession number
  COKA1_HUMAN | Q9P218
    Protein Disorder Prediction :  DisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        COKA1_HUMAN | Q9P218 :  2dkm 2ekj 5kf4

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2EE3)