Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)NMR Structure - model 1
(-)NMR Structure - all models
collapse expand < >
Image NMR Structure - model 1
NMR Structure - model 1  (Jmol Viewer)
Image NMR Structure - all models
NMR Structure - all models  (Jmol Viewer)

(-) Description

Title :  COMBINED HIGH- AND LOW-RESOLUTION TECHNIQUES REVEAL COMPACT STRUCTURE IN CENTRAL PORTION OF FACTOR H DESPITE LONG INTER-MODULAR LINKERS
 
Authors :  C. Q. Schmidt, A. P. Herbert, M. Guariento, H. D. T. Mertens, D. C. Soares, D. Uhrin, A. J. Rowe, D. I. Svergun, P. N. Barlow
Date :  04 Aug 09  (Deposition) - 03 Nov 09  (Release) - 08 Dec 09  (Revision)
Method :  SOLUTION NMR
Resolution :  NOT APPLICABLE
Chains :  NMR Structure  :  A  (20x)
Keywords :  Compact Structure, Limited Interdomain Flexibility, Age- Related Macular Degeneration, Alternative Splicing, Complement Alternate Pathway, Disease Mutation, Disulfide Bond, Glycoprotein, Immune Response, Innate Immunity, Polymorphism, Secreted, Sushi, Immune System (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  C. Q. Schmidt, A. P. Herbert, H. D. T. Mertens, M. Guariento, D. C. Soares, D. Uhrin, A. J. Rowe, D. I. Svergun, P. N. Barlow
The Central Portion Of Factor H (Modules 10-15) Is Compact And Contains A Structurally Deviant Ccp Module
J. Mol. Biol. V. 395 105 2010
PubMed-ID: 19835885  |  Reference-DOI: 10.1016/J.JMB.2009.10.010
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - COMPLEMENT FACTOR H
    ChainsA
    EngineeredYES
    Expression SystemPICHIA PASTORIS
    Expression System StrainKM71H
    Expression System Taxid4922
    Expression System VectorPPICZALPHAB
    Expression System Vector TypeVECTOR
    FragmentSUSHI DOMAIN, RESIDUES 690-804
    GeneCFH, HF, HF1, HF2
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymH FACTOR 1

 Structural Features

(-) Chains, Units

  
NMR Structure (20x)

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (0, 0)

(no "Ligand,Modified Residues,Ions" information available for 2KMS)

(-) Sites  (0, 0)

(no "Site" information available for 2KMS)

(-) SS Bonds  (4, 4)

NMR Structure
No.Residues
1A:691 -A:733
2A:719 -A:744
3A:753 -A:792
4A:781 -A:803

(-) Cis Peptide Bonds  (2, 40)

NMR Structure
No.ModelResidues
11, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20Pro A:707 -Pro A:708
21, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20Asp A:798 -Pro A:799

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2KMS)

(-) PROSITE Motifs  (1, 1)

NMR Structure (1, 1)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1SUSHIPS50923 Sushi/CCP/SCR domain profile.CFAH_HUMAN19-82
83-143
144-207
208-264
265-322
387-444
446-507
515-566
567-625
628-686
689-746
751-805
809-866
868-928
929-986
987-1045
1046-1104
1107-1165
1194-1230
  1-
-
-
-
-
-
-
-
-
-
-
A:751-804
-
-
-
-
-
-
-

(-) Exons   (2, 2)

NMR Structure (2, 2)
 ENSEMBLUniProtKBPDB
No.Transcript IDExonExon IDGenome LocationLengthIDLocationLengthCountLocationLength
1.1aENST000003674291aENSE00001444484chr1:196621008-196621305298CFAH_HUMAN1-20200--
1.2aENST000003674292aENSE00001134426chr1:196642108-196642293186CFAH_HUMAN20-82630--
1.3ENST000003674293ENSE00000791013chr1:196642987-196643092106CFAH_HUMAN82-117360--
1.4ENST000003674294ENSE00000791014chr1:196645119-19664519577CFAH_HUMAN117-143270--
1.5bENST000003674295bENSE00000469039chr1:196646606-196646797192CFAH_HUMAN143-207650--
1.5cENST000003674295cENSE00000791016chr1:196648753-196648923171CFAH_HUMAN207-264580--
1.6ENST000003674296ENSE00000469041chr1:196654194-196654367174CFAH_HUMAN264-322590--
1.7ENST000003674297ENSE00001769047chr1:196658550-196658744195CFAH_HUMAN322-387660--
1.8aENST000003674298aENSE00001597868chr1:196659193-196659369177CFAH_HUMAN387-446600--
1.12ENST0000036742912ENSE00000469044chr1:196682865-196683047183CFAH_HUMAN446-507620--
1.13ENST0000036742913ENSE00000469045chr1:196684723-196684899177CFAH_HUMAN507-566600--
1.14ENST0000036742914ENSE00000791022chr1:196694251-196694427177CFAH_HUMAN566-625600--
1.15ENST0000036742915ENSE00000791023chr1:196695600-196695782183CFAH_HUMAN625-686620--
1.16ENST0000036742916ENSE00000469048chr1:196695891-196696070180CFAH_HUMAN686-746611A:690-74657
1.17ENST0000036742917ENSE00000791025chr1:196697476-196697652177CFAH_HUMAN746-805601A:746-80459
1.18aENST0000036742918aENSE00000791026chr1:196705954-196706136183CFAH_HUMAN805-866620--
1.19ENST0000036742919ENSE00000791027chr1:196706605-196706790186CFAH_HUMAN866-928630--
1.20aENST0000036742920aENSE00000791028chr1:196709749-196709922174CFAH_HUMAN928-986590--
1.20dENST0000036742920dENSE00000469053chr1:196711005-196711181177CFAH_HUMAN986-1045600--
1.21ENST0000036742921ENSE00001622338chr1:196712582-196712758177CFAH_HUMAN1045-1104600--
1.22ENST0000036742922ENSE00001719380chr1:196714947-196715129183CFAH_HUMAN1104-1165620--
1.23ENST0000036742923ENSE00001931000chr1:196716241-196716634394CFAH_HUMAN1165-1231670--

(-) Sequences/Alignments

NMR Structure
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:115
 aligned with CFAH_HUMAN | P08603 from UniProtKB/Swiss-Prot  Length:1231

    Alignment length:115
                                   699       709       719       729       739       749       759       769       779       789       799     
           CFAH_HUMAN   690 TCGDIPELEHGWAQLSSPPYYYGDSVEFNCSESFTMIGHRSITCIHGVWTQLPQCVAIDKLKKCKSSNLIILEEHLKNKKEFDHNSNIRYRCRGKEGWIHTVCINGRWDPEVNCS 804
               SCOP domains ------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains 2kmsA01 A:690-746 Complement Module, domain 1            ---------------------------------------------------------- CATH domains
               Pfam domains -Sushi-2kmsA01 A:691-744                               ------------------------------------------------------------ Pfam domains
         Sec.struct. author ...........ee...........eeeee...........eee....................ee.........hhhhh.ee....ee............eee....ee...... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE SUSHI  PDB: - UniProt: 689-746                           ----SUSHI  PDB: A:751-804 UniProt: 751-805                 PROSITE
           Transcript 1 (1) Exon 1.16  PDB: A:690-746 UniProt: 686-746 [INCOMPLETE]  ---------------------------------------------------------- Transcript 1 (1)
           Transcript 1 (2) --------------------------------------------------------Exon 1.17  PDB: A:746-804 UniProt: 746-805 [INCOMPLETE]     Transcript 1 (2)
                 2kms A 690 TCGDIPELEHGWAQLSSPPYYYGDSVEFNCSESFTMIGHRSITCIHGVWTQLPQCVAIDKLKKCKSSNLIILEEHLKNKKEFDHNSNIRYRCRGKEGWIHTVCINGRWDPEVNCS 804
                                   699       709       719       729       739       749       759       769       779       789       799     

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 2KMS)

(-) CATH Domains  (1, 1)

NMR Structure
(-)
Class: Mainly Beta (13760)

(-) Pfam Domains  (1, 1)

NMR Structure

(-) Gene Ontology  (12, 12)

NMR Structure(hide GO term definitions)
Chain A   (CFAH_HUMAN | P08603)
molecular function
    GO:0043395    heparan sulfate proteoglycan binding    Interacting selectively and non-covalently with a heparan sulfate proteoglycan, any proteoglycan containing heparan sulfate as the glycosaminoglycan carbohydrate unit.
    GO:0008201    heparin binding    Interacting selectively and non-covalently with heparin, any member of a group of glycosaminoglycans found mainly as an intracellular component of mast cells and which consist predominantly of alternating alpha-(1->4)-linked D-galactose and N-acetyl-D-glucosamine-6-sulfate residues.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
biological process
    GO:0006956    complement activation    Any process involved in the activation of any of the steps of the complement cascade, which allows for the direct killing of microbes, the disposal of immune complexes, and the regulation of other immune processes; the initial steps of complement activation involve one of three pathways, the classical pathway, the alternative pathway, and the lectin pathway, all of which lead to the terminal complement pathway.
    GO:0006957    complement activation, alternative pathway    Any process involved in the activation of any of the steps of the alternative pathway of the complement cascade which allows for the direct killing of microbes and the regulation of other immune processes.
    GO:0002376    immune system process    Any process involved in the development or functioning of the immune system, an organismal system for calibrated responses to potential internal or invasive threats.
    GO:0045087    innate immune response    Innate immune responses are defense responses mediated by germline encoded components that directly recognize components of potential pathogens.
    GO:0030449    regulation of complement activation    Any process that modulates the frequency, rate or extent of complement activation.
cellular component
    GO:0072562    blood microparticle    A phospholipid microvesicle that is derived from any of several cell types, such as platelets, blood cells, endothelial cells, or others, and contains membrane receptors as well as other proteins characteristic of the parental cell. Microparticles are heterogeneous in size, and are characterized as microvesicles free of nucleic acids.
    GO:0070062    extracellular exosome    A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0005615    extracellular space    That part of a multicellular organism outside the cells proper, usually taken to be outside the plasma membranes, and occupied by fluid.

 Visualization

(-) Interactive Views

NMR Structure
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
(no "Ligands, Modified Residues, Ions" information available for 2kms)
 
  Sites
(no "Sites" information available for 2kms)
 
  Cis Peptide Bonds
    Asp A:798 - Pro A:799   [ RasMol ]  
    Pro A:707 - Pro A:708   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2kms
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  CFAH_HUMAN | P08603
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  CFAH_HUMAN | P08603
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        CFAH_HUMAN | P086031fhc 1haq 1hcc 1hfh 1hfi 1kov 2bzm 2g7i 2ic4 2jgw 2jgx 2qfg 2qfh 2rlp 2rlq 2uwn 2v8e 2w80 2w81 2wii 2xqw 3gau 3gav 3gaw 3kxv 3kzj 3oxu 3r62 3rj3 3sw0 4ayd 4aye 4ayi 4aym 4b2r 4b2s 4j38 4k12 4ont 4zh1 5o32 5o35

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2KMS)