Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Theoretical Model
(-)Theor. Model - sites
collapse expand < >
Image Theoretical Model
Theoretical Model  (Jmol Viewer)
Image Theor. Model - sites
Theor. Model - sites  (Jmol Viewer)

(-) Description

Title :  THREE-DIMENSIONAL STRUCTURES OF THE CYSTEINE-PROTEASES CATHEPSINS K AND S DEDUCED BY KNOWLEDGE-BASED MODELLING AND ACTIVE-SITE CHARACTERISTICS
 
Authors :  A. Fengler, W. Brandt
Date :  02 Oct 98  (Deposition) - 07 Oct 98  (Release) - 01 Apr 03  (Revision)
Method :  THEORETICAL MODEL
Resolution :  NOT APPLICABLE
Chains :  Theor. Model :  A
Keywords :  Cysteine Protease, Cathepsin S, Papain-Like, Theoretical Model, Hydrolase (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  A. Fengler, W. Brandt
Three-Dimensional Structures Of The Cysteine Proteases Cathepsins K And S Deduced By Knowledge-Based Modelling And Active Site Characteristics.
Protein Eng. V. 11 1007 1998
PubMed: search
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - PROTEIN (CATHEPSIN S CYSTEINE PROTEASE)
    ChainsA
    EC Number3.4.22.27
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    TissueTESTIS

 Structural Features

(-) Chains, Units

  
Theoretical Model 

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (0, 0)

(no "Ligand,Modified Residues,Ions" information available for 1BXF)

(-) Sites  (1, 1)

Theoretical Model (1, 1)
No.NameEvidenceResiduesDescription
1CTnot defined [no residue information available]

(-) SS Bonds  (4, 4)

Theoretical Model
No.Residues
1A:12 -A:110
2A:22 -A:66
3A:56 -A:99
4A:158 -A:206

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 1BXF)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (1, 1)

Theoretical Model (1, 1)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_025386S161TCATS_HUMANPolymorphism1059604AT47T

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)

(-) PROSITE Motifs  (3, 3)

Theoretical Model (3, 3)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1THIOL_PROTEASE_CYSPS00139 Eukaryotic thiol (cysteine) proteases cysteine active site.CATS_HUMAN133-144  1A:19-30
2THIOL_PROTEASE_HISPS00639 Eukaryotic thiol (cysteine) proteases histidine active site.CATS_HUMAN276-286  1A:162-172
3THIOL_PROTEASE_ASNPS00640 Eukaryotic thiol (cysteine) proteases asparagine active site.CATS_HUMAN293-312  1A:179-198

(-) Exons   (5, 5)

Theoretical Model (5, 5)
 ENSEMBLUniProtKBPDB
No.Transcript IDExonExon IDGenome LocationLengthIDLocationLengthCountLocationLength
1.1aENST000003689851aENSE00001922877chr1:150738433-150738174260CATS_HUMAN-00--
1.2aENST000003689852aENSE00001448523chr1:150737240-150737114127CATS_HUMAN1-42420--
1.3ENST000003689853ENSE00001044461chr1:150730456-150730334123CATS_HUMAN43-83410--
1.4ENST000003689854ENSE00001044451chr1:150727626-150727477150CATS_HUMAN84-133501A:1-1919
1.5aENST000003689855aENSE00001044460chr1:150724484-150724257228CATS_HUMAN134-209761A:20-9576
1.6bENST000003689856bENSE00001044464chr1:150722647-150722482166CATS_HUMAN210-265561A:96-15156
1.8bENST000003689858bENSE00001044458chr1:150720355-150720253103CATS_HUMAN265-299351A:151-18535
1.9cENST000003689859cENSE00001874248chr1:150705621-1507026722950CATS_HUMAN299-331331A:185-21733

(-) Sequences/Alignments

Theoretical Model
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:217
 aligned with CATS_HUMAN | P25774 from UniProtKB/Swiss-Prot  Length:331

    Alignment length:217
                                   124       134       144       154       164       174       184       194       204       214       224       234       244       254       264       274       284       294       304       314       324       
           CATS_HUMAN   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI 331
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....ee.hhh.............hhhhhhhhhhhhhh...........hhhhhhhhh............hhhhhhhhhhhhh...............................ee....hhhhhhhhhhhh..eeee....hhhhhh.................eeeeeeeee....eeeeee...........eeeee....hhhhh....ee... Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------T-------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------THIOL_PROTEA-----------------------------------------------------------------------------------------------------------------------------------THIOL_PROTE------THIOL_PROTEASE_ASN  ------------------- PROSITE
           Transcript 1 (1) Exon 1.4           Exon 1.5a  PDB: A:20-95 UniProt: 134-209                                    Exon 1.6b  PDB: A:96-151 UniProt: 210-265               ---------------------------------Exon 1.9c  PDB: A:185-217         Transcript 1 (1)
           Transcript 1 (2) ------------------------------------------------------------------------------------------------------------------------------------------------------Exon 1.8b  PDB: A:151-185          -------------------------------- Transcript 1 (2)
                 1bxf A   1 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI 217
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       210       

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 1BXF)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 1BXF)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 1BXF)

(-) Gene Ontology  (30, 30)

Theoretical Model(hide GO term definitions)
Chain A   (CATS_HUMAN | P25774)
molecular function
    GO:0005518    collagen binding    Interacting selectively and non-covalently with collagen, a group of fibrous proteins of very high tensile strength that form the main component of connective tissue in animals. Collagen is highly enriched in glycine (some regions are 33% glycine) and proline, occurring predominantly as 3-hydroxyproline (about 20%).
    GO:0004197    cysteine-type endopeptidase activity    Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0008234    cysteine-type peptidase activity    Catalysis of the hydrolysis of peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0001968    fibronectin binding    Interacting selectively and non-covalently with a fibronectin, a group of related adhesive glycoproteins of high molecular weight found on the surface of animal cells, connective tissue matrices, and in extracellular fluids.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0043236    laminin binding    Interacting selectively and non-covalently with laminins, glycoproteins that are major constituents of the basement membrane of cells.
    GO:0008233    peptidase activity    Catalysis of the hydrolysis of a peptide bond. A peptide bond is a covalent bond formed when the carbon atom from the carboxyl group of one amino acid shares electrons with the nitrogen atom from the amino group of a second amino acid.
    GO:0043394    proteoglycan binding    Interacting selectively and non-covalently with a proteoglycan, any glycoprotein in which the carbohydrate units are glycosaminoglycans.
    GO:0004252    serine-type endopeptidase activity    Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a catalytic mechanism that involves a catalytic triad consisting of a serine nucleophile that is activated by a proton relay involving an acidic residue (e.g. aspartate or glutamate) and a basic residue (usually histidine).
biological process
    GO:0002250    adaptive immune response    An immune response mediated by cells expressing specific receptors for antigen produced through a somatic diversification process, and allowing for an enhanced secondary response to subsequent exposures to the same antigen (immunological memory).
    GO:0019882    antigen processing and presentation    The process in which an antigen-presenting cell expresses antigen (peptide or lipid) on its cell surface in association with an MHC protein complex.
    GO:0019886    antigen processing and presentation of exogenous peptide antigen via MHC class II    The process in which an antigen-presenting cell expresses a peptide antigen of exogenous origin on its cell surface in association with an MHC class II protein complex. The peptide antigen is typically, but not always, processed from a whole protein.
    GO:0048002    antigen processing and presentation of peptide antigen    The process in which an antigen-presenting cell expresses peptide antigen in association with an MHC protein complex on its cell surface, including proteolysis and transport steps for the peptide antigen both prior to and following assembly with the MHC protein complex. The peptide antigen is typically, but not always, processed from an endogenous or exogenous protein.
    GO:0034769    basement membrane disassembly    The controlled breakdown of the basement membrane in the context of a normal process such as imaginal disc eversion.
    GO:0097067    cellular response to thyroid hormone stimulus    A change in state or activity of a cell (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a thyroid hormone stimulus.
    GO:0030574    collagen catabolic process    The proteolytic chemical reactions and pathways resulting in the breakdown of collagen in the extracellular matrix, usually carried out by proteases secreted by nearby cells.
    GO:0022617    extracellular matrix disassembly    A process that results in the breakdown of the extracellular matrix.
    GO:0006955    immune response    Any immune system process that functions in the calibrated response of an organism to a potential internal or invasive threat.
    GO:2001259    positive regulation of cation channel activity    Any process that activates or increases the frequency, rate or extent of cation channel activity.
    GO:0016485    protein processing    Any protein maturation process achieved by the cleavage of a peptide bond or bonds within a protein. Protein maturation is the process leading to the attainment of the full functional capacity of a protein.
    GO:0006508    proteolysis    The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.
    GO:0051603    proteolysis involved in cellular protein catabolic process    The hydrolysis of a peptide bond or bonds within a protein as part of the chemical reactions and pathways resulting in the breakdown of a protein by individual cells.
    GO:0010447    response to acidic pH    Any process that results in a change in state or activity of a cell or an organism (in terms of movement, secretion, enzyme production, gene expression, etc.) as a result of a pH stimulus with pH < 7. pH is a measure of the acidity or basicity of an aqueous solution.
    GO:0002224    toll-like receptor signaling pathway    Any series of molecular signals generated as a consequence of binding to a toll-like receptor. Toll-like receptors directly bind pattern motifs from a variety of microbial sources to initiate innate immune response.
cellular component
    GO:0036021    endolysosome lumen    The volume enclosed by the membrane of an endolysosome. An endolysosome is a transient hybrid organelle formed by fusion of a late endosome with a lysosome.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0005615    extracellular space    That part of a multicellular organism outside the cells proper, usually taken to be outside the plasma membranes, and occupied by fluid.
    GO:0043231    intracellular membrane-bounded organelle    Organized structure of distinctive morphology and function, bounded by a single or double lipid bilayer membrane and occurring within the cell. Includes the nucleus, mitochondria, plastids, vacuoles, and vesicles. Excludes the plasma membrane.
    GO:0043202    lysosomal lumen    The volume enclosed within the lysosomal membrane.
    GO:0005764    lysosome    A small lytic vacuole that has cell cycle-independent morphology and is found in most animal cells and that contains a variety of hydrolases, most of which have their maximal activities in the pH range 5-6. The contained enzymes display latency if properly isolated. About 40 different lysosomal hydrolases are known and lysosomes have a great variety of morphologies and functions.

 Visualization

(-) Interactive Views

Theoretical Model
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
(no "Ligands, Modified Residues, Ions" information available for 1bxf)
 
  Sites
    CT  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 1bxf)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  1bxf
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  CATS_HUMAN | P25774
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  3.4.22.27
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  CATS_HUMAN | P25774
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        CATS_HUMAN | P257741glo 1ms6 1npz 1nqc 2c0y 2f1g 2fq9 2fra 2frq 2ft2 2fud 2fye 2g6d 2g7y 2h7j 2hh5 2hhn 2hxz 2op3 2r9m 2r9n 2r9o 3iej 3kwn 3mpe 3mpf 3n3g 3n4c 3ovx 4p6e 4p6g

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 1BXF)