Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF HUMAN FIBRILLAR PROCOLLAGEN TYPE III C-PROPEPTIDE TRIMER
 
Authors :  J. M. Bourhis, N. Mariano, Y. Zhao, K. Harlos, E. Y. Jones, C. Moali, N. Ag D. J. S. Hulmes
Date :  21 Feb 12  (Deposition) - 12 Sep 12  (Release) - 31 Oct 12  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  3.50
Chains :  Asym. Unit :  A
Biol. Unit 1:  A  (3x)
Keywords :  Structural Protein, Fibrillar Collagen, Extacellular Matrix, Fibrosis (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  J. M. Bourhis, N. Mariano, Y. Zhao, K. Harlos, J. Exposito, E. Y. Jones, C. Moali, N. Aghajari, D. J. S. Hulmes
Structural Basis Of Fibrillar Collagen Trimerization And Related Genetic Disorders.
Nat. Struct. Mol. Biol. V. 19 1031 2012
PubMed-ID: 23001006  |  Reference-DOI: 10.1038/NSMB.2389
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - COLLAGEN ALPHA-1(III) CHAIN
    ChainsA
    EngineeredYES
    Expression SystemHOMO SAPIENS
    Expression System Cell LineHEK293T
    Expression System CommonHUMAN
    Expression System PlasmidPHLSEC
    Expression System Taxid9606
    FragmentCPROPEPTIDE, RESIDUES 1222-1466
    MutationYES
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymPROCOLLAGEN III

 Structural Features

(-) Chains, Units

  1
Asymmetric Unit A
Biological Unit 1 (3x)A

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 1)

Asymmetric Unit (1, 1)
No.NameCountTypeFull Name
1CA1Ligand/IonCALCIUM ION
Biological Unit 1 (0, 0)
No.NameCountTypeFull Name
1CA-1Ligand/IonCALCIUM ION

(-) Sites  (1, 1)

Asymmetric Unit (1, 1)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREASP A:59 , ASN A:61 , GLN A:62 , CYS A:64 , ASP A:67 , HOH A:2001BINDING SITE FOR RESIDUE CA A 246

(-) SS Bonds  (4, 4)

Asymmetric Unit
No.Residues
1A:41 -A:73
2A:47 -A:64
3A:81 -A:243
4A:151 -A:196

(-) Cis Peptide Bonds  (2, 2)

Asymmetric Unit
No.Residues
1Asn A:85 -Pro A:86
2Leu A:218 -Pro A:219

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (3, 3)

Asymmetric Unit (3, 3)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_030115H1353QCO3A1_HUMANPolymorphism1516446AQ132Q
2UniProtVAR_035739R1434CCO3A1_HUMANUnclassified747324731AR213C
3CancerSNPVAR_CO3A1_HUMAN_CCDS2297_1_02 *R1434CCO3A1_HUMANDisease (Colorectal cancer)  ---AR213C
   * ID not provided by source

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)
Biological Unit 1 (3, 9)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_030115H1353QCO3A1_HUMANPolymorphism1516446AQ132Q
2UniProtVAR_035739R1434CCO3A1_HUMANUnclassified747324731AR213C
3CancerSNPVAR_CO3A1_HUMAN_CCDS2297_1_02 *R1434CCO3A1_HUMANDisease (Colorectal cancer)  ---AR213C
   * ID not provided by source

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 4AK3)

(-) Exons   (0, 0)

(no "Exon" information available for 4AK3)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:227
 aligned with CO3A1_HUMAN | P02461 from UniProtKB/Swiss-Prot  Length:1466

    Alignment length:232
                                  1244      1254      1264      1274      1284      1294      1304      1314      1324      1334      1344      1354      1364      1374      1384      1394      1404      1414      1424      1434      1444      1454      1464  
         CO3A1_HUMAN   1235 MTSLKSVNGQIESLISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSSAEKKHVWFGESMDGGFQFSYGNPELPEDVLDVHLAFLRLLSSRASQNITYHCKNSIAYMDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRKAVRLPIVDIAPYDIGGPDQEFGVDVGPVCFL 1466
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author hhhhhhhhhhhhhhhh..........hhhhhhhhh.....eeeee..........eeeee......eee.ee..eeeee...................--....ee.....hhhhhhhhhhhhhhhh.eeeeeeeeeee.................eee..--.eee...hhhh..eeeee.......-.eeeeeeeee.hhhhh...eee.........eeeeeeeeeeee Sec.struct. author
             SAPs(SNPs) (1) ----------------------------------------------------------------------------------------------------------------------Q--------------------------------------------------------------------------------C-------------------------------- SAPs(SNPs) (1)
             SAPs(SNPs) (2) -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------C-------------------------------- SAPs(SNPs) (2)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                4ak3 A   14 MTSLKSVNGQIESLISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDSSAEKKHVWFGESM--GFQFSYGNPELPEDVLDVQLAFLRLLSSRASQQITYHCKNSIAYMDQASGNVKKALKLMGS--GEFKAEGNSKFTYTVLEDGCTKHTG-WSKTVFEYRTRKAVRLPIVDIAPYDIGGPDQEFGVDVGPVCFL  245
                                    23        33        43        53        63        73        83        93       103       | -|      123       133       143       153       163       173|  |   183       193       203       213       223       233       243  
                                                                                                                           111  |                                                         174  |                     201 |                                          
                                                                                                                              114                                                            177                       203                                          

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 4AK3)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 4AK3)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 4AK3)

(-) Gene Ontology  (38, 38)

Asymmetric Unit(hide GO term definitions)
Chain A   (CO3A1_HUMAN | P02461)
molecular function
    GO:0046332    SMAD binding    Interacting selectively and non-covalently with a SMAD signaling protein.
    GO:0005201    extracellular matrix structural constituent    The action of a molecule that contributes to the structural integrity of the extracellular matrix.
    GO:0005178    integrin binding    Interacting selectively and non-covalently with an integrin.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0048407    platelet-derived growth factor binding    Interacting selectively and non-covalently with platelet-derived growth factor.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
biological process
    GO:0007568    aging    A developmental process that is a deterioration and loss of function over time. Aging includes loss of functions such as resistance to disease, homeostasis, and fertility, as well as wear and tear. Aging includes cellular senescence, but is more inclusive. May precede death and may succeed developmental maturation (GO:0021700).
    GO:0060414    aorta smooth muscle tissue morphogenesis    The process in which the structure of the smooth muscle tissue surrounding the aorta is generated and organized. An aorta is an artery that carries blood from the heart to other parts of the body.
    GO:0001568    blood vessel development    The process whose specific outcome is the progression of a blood vessel over time, from its formation to the mature structure. The blood vessel is the vasculature carrying blood.
    GO:0007160    cell-matrix adhesion    The binding of a cell to the extracellular matrix via adhesion molecules.
    GO:0071230    cellular response to amino acid stimulus    Any process that results in a change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of an amino acid stimulus. An amino acid is a carboxylic acids containing one or more amino groups.
    GO:0021987    cerebral cortex development    The progression of the cerebral cortex over time from its initial formation until its mature state. The cerebral cortex is the outer layered region of the telencephalon.
    GO:0030574    collagen catabolic process    The proteolytic chemical reactions and pathways resulting in the breakdown of collagen in the extracellular matrix, usually carried out by proteases secreted by nearby cells.
    GO:0030199    collagen fibril organization    Any process that determines the size and arrangement of collagen fibrils within an extracellular matrix.
    GO:0048565    digestive tract development    The process whose specific outcome is the progression of the digestive tract over time, from its formation to the mature structure. The digestive tract is the anatomical structure through which food passes and is processed.
    GO:0030198    extracellular matrix organization    A process that is carried out at the cellular level which results in the assembly, arrangement of constituent parts, or disassembly of an extracellular matrix.
    GO:0007507    heart development    The process whose specific outcome is the progression of the heart over time, from its formation to the mature structure. The heart is a hollow, muscular organ, which, by contracting rhythmically, keeps up the circulation of the blood.
    GO:0007229    integrin-mediated signaling pathway    A series of molecular signals initiated by the binding of extracellular ligand to an integrin on the surface of a target cell, and ending with regulation of a downstream cellular process, e.g. transcription.
    GO:0050777    negative regulation of immune response    Any process that stops, prevents, or reduces the frequency, rate or extent of the immune response, the immunological reaction of an organism to an immunogenic stimulus.
    GO:2001223    negative regulation of neuron migration    Any process that stops, prevents or reduces the frequency, rate or extent of neuron migration.
    GO:0018149    peptide cross-linking    The formation of a covalent cross-link between or within protein chains.
    GO:0030168    platelet activation    A series of progressive, overlapping events triggered by exposure of the platelets to subendothelial tissue. These events include shape change, adhesiveness, aggregation, and release reactions. When carried through to completion, these events lead to the formation of a stable hemostatic plug.
    GO:0035025    positive regulation of Rho protein signal transduction    Any process that activates or increases the frequency, rate or extent of Rho protein signal transduction.
    GO:0050776    regulation of immune response    Any process that modulates the frequency, rate or extent of the immune response, the immunological reaction of an organism to an immunogenic stimulus.
    GO:0034097    response to cytokine    Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a cytokine stimulus.
    GO:0009612    response to mechanical stimulus    Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a mechanical stimulus.
    GO:0009314    response to radiation    Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of an electromagnetic radiation stimulus. Electromagnetic radiation is a propagating wave in space with electric and magnetic components. These components oscillate at right angles to each other and to the direction of propagation.
    GO:0001501    skeletal system development    The process whose specific outcome is the progression of the skeleton over time, from its formation to the mature structure. The skeleton is the bony framework of the body in vertebrates (endoskeleton) or the hard outer envelope of insects (exoskeleton or dermoskeleton).
    GO:0043588    skin development    The process whose specific outcome is the progression of the skin over time, from its formation to the mature structure. The skin is the external membranous integument of an animal. In vertebrates the skin generally consists of two layers, an outer nonsensitive and nonvascular epidermis (cuticle or skarfskin) composed of cells which are constantly growing and multiplying in the deeper, and being thrown off in the superficial layers, as well as an inner vascular dermis (cutis, corium or true skin) composed mostly of connective tissue.
    GO:0007179    transforming growth factor beta receptor signaling pathway    A series of molecular signals initiated by the binding of an extracellular ligand to a transforming growth factor beta receptor on the surface of a target cell, and ending with regulation of a downstream cellular process, e.g. transcription.
    GO:0042060    wound healing    The series of events that restore integrity to a damaged tissue, following an injury.
cellular component
    GO:0005581    collagen trimer    A protein complex consisting of three collagen chains assembled into a left-handed triple helix. These trimers typically assemble into higher order structures.
    GO:0005586    collagen type III trimer    A collagen homotrimer of alpha1(III) chains; type III collagen triple helices associate to form fibrils.
    GO:0005788    endoplasmic reticulum lumen    The volume enclosed by the membranes of the endoplasmic reticulum.
    GO:0031012    extracellular matrix    A structure lying external to one or more cells, which provides structural support for cells or tissues.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0005615    extracellular space    That part of a multicellular organism outside the cells proper, usually taken to be outside the plasma membranes, and occupied by fluid.
    GO:0005578    proteinaceous extracellular matrix    A layer consisting mainly of proteins (especially collagen) and glycosaminoglycans (mostly as proteoglycans) that forms a sheet underlying or overlying cells such as endothelial and epithelial cells. The proteins are secreted by cells in the vicinity. An example of this component is found in Mus musculus.

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    CA  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Asn A:85 - Pro A:86   [ RasMol ]  
    Leu A:218 - Pro A:219   [ RasMol ]  
 
Biological Unit
  Complete Structure
    Biological Unit 1  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  4ak3
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  CO3A1_HUMAN | P02461
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  CO3A1_HUMAN | P02461
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        CO3A1_HUMAN | P024612v53 3dmw 4ae2 4aej 4gyx

(-) Related Entries Specified in the PDB File

2v53 CRYSTAL STRUCTURE OF A SPARC-COLLAGEN COMPLEX
4ae2 CRYSTAL STRUCTURE OF HUMAN FIBRILLAR PROCOLLAGEN TYPE III C-PROPEPTIDE TRIMER
4aej CRYSTAL STRUCTURE OF HUMAN FIBRILLAR PROCOLLAGEN TYPE III C-PROPEPTIDE TRIMER