Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
(-)Biological Unit 2
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)
Image Biological Unit 2
Biological Unit 2  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF CATHEPSIN K COMPLEXED WITH 7-METHYL-SUBSTITUTED AZEPAN-3-ONE COMPOUND
 
Authors :  D. S. Yamashita, Z. Baoguang
Date :  24 Jan 06  (Deposition) - 30 Jan 07  (Release) - 28 Apr 09  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.55
Chains :  Asym. Unit :  A,B
Biol. Unit 1:  A  (1x)
Biol. Unit 2:  B  (1x)
Keywords :  Hydrolase, Sulfhydryl Proteinase (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  D. S. Yamashita, R. W. Marquis, R. Xie, S. D. Nidamarthy, H. J. Oh, J. U. Jeong, K. F. Erhard, K. W. Ward, T. J. Roethke, B. R. Smith, H. Y. Cheng, X. Geng, F. Lin, P. H. Offen, B. Wang, N. Nevins, M. S. Head, R. C. Haltiwanger, A. A. Narducci Sarjeant, L. M. Liable-Sands, B. Zhao, W. W. Smith, C. A. Janson, E. Gao, T. Tomaszek, M. Mcqueney, I. E. James, C. J. Gress, D. L. Zembryki, M. W. Lark, D. F. Veber
Structure Activity Relationships Of 5-, 6-, And 7-Methyl-Substituted Azepan-3-One Cathepsin K Inhibitors.
J. Med. Chem. V. 49 1597 2006
PubMed-ID: 16509577  |  Reference-DOI: 10.1021/JM050915U
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - CATHEPSIN K
    ChainsA, B
    EC Number3.4.22.38
    EngineeredYES
    Expression SystemUNIDENTIFIED BACULOVIRUS
    Expression System StrainSF9
    Expression System Taxid10469
    GeneCTSK
    Organism CommonRHESUS MONKEY
    Organism ScientificMACACA MULATTA
    Organism Taxid9544
    StrainOSTEOCLAST

 Structural Features

(-) Chains, Units

  12
Asymmetric Unit AB
Biological Unit 1 (1x)A 
Biological Unit 2 (1x) B

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 2)

Asymmetric Unit (1, 2)
No.NameCountTypeFull Name
1ILI2Ligand/IonN-[(1S)-1-({[(3S,4S,7R)-3-HYDROXY-7-METHYL-1-(PYRIDIN-2-YLSULFONYL)-2,3,4,7-TETRAHYDRO-1H-AZEPIN-4-YL]AMINO}CARBONYL)-3-METHYLBUTYL]-1-BENZOFURAN-2-CARBOXAMIDE
Biological Unit 1 (1, 1)
No.NameCountTypeFull Name
1ILI1Ligand/IonN-[(1S)-1-({[(3S,4S,7R)-3-HYDROXY-7-METHYL-1-(PYRIDIN-2-YLSULFONYL)-2,3,4,7-TETRAHYDRO-1H-AZEPIN-4-YL]AMINO}CARBONYL)-3-METHYLBUTYL]-1-BENZOFURAN-2-CARBOXAMIDE
Biological Unit 2 (1, 1)
No.NameCountTypeFull Name
1ILI1Ligand/IonN-[(1S)-1-({[(3S,4S,7R)-3-HYDROXY-7-METHYL-1-(PYRIDIN-2-YLSULFONYL)-2,3,4,7-TETRAHYDRO-1H-AZEPIN-4-YL]AMINO}CARBONYL)-3-METHYLBUTYL]-1-BENZOFURAN-2-CARBOXAMIDE

(-) Sites  (2, 2)

Asymmetric Unit (2, 2)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREGLN A:19 , GLN A:21 , CYS A:22 , GLY A:23 , SER A:24 , CYS A:25 , TRP A:26 , GLU A:59 , ASP A:61 , GLY A:65 , GLY A:66 , TYR A:67 , ALA A:134 , ASN A:161 , HIS A:162 , ALA A:163 , TRP A:184 , HOH A:392 , TYR B:67BINDING SITE FOR RESIDUE ILI A 300
2AC2SOFTWAREGLN B:19 , GLN B:21 , CYS B:22 , GLY B:23 , CYS B:25 , TRP B:26 , GLU B:59 , GLY B:65 , GLY B:66 , TYR B:67 , ASN B:161 , HIS B:162 , TRP B:184BINDING SITE FOR RESIDUE ILI B 301

(-) SS Bonds  (6, 6)

Asymmetric Unit
No.Residues
1A:22 -A:63
2A:56 -A:96
3A:155 -A:204
4B:22 -B:63
5B:56 -B:96
6B:155 -B:204

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 2FTD)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2FTD)

(-) PROSITE Motifs  (3, 6)

Asymmetric Unit (3, 6)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1THIOL_PROTEASE_CYSPS00139 Eukaryotic thiol (cysteine) proteases cysteine active site.CATK_MACMU133-144
 
  2A:19-30
B:19-30
2THIOL_PROTEASE_HISPS00639 Eukaryotic thiol (cysteine) proteases histidine active site.CATK_MACMU274-284
 
  2A:160-170
B:160-170
3THIOL_PROTEASE_ASNPS00640 Eukaryotic thiol (cysteine) proteases asparagine active site.CATK_MACMU291-310
 
  2A:177-196
B:177-196
Biological Unit 1 (3, 3)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1THIOL_PROTEASE_CYSPS00139 Eukaryotic thiol (cysteine) proteases cysteine active site.CATK_MACMU133-144
 
  1A:19-30
-
2THIOL_PROTEASE_HISPS00639 Eukaryotic thiol (cysteine) proteases histidine active site.CATK_MACMU274-284
 
  1A:160-170
-
3THIOL_PROTEASE_ASNPS00640 Eukaryotic thiol (cysteine) proteases asparagine active site.CATK_MACMU291-310
 
  1A:177-196
-
Biological Unit 2 (3, 3)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1THIOL_PROTEASE_CYSPS00139 Eukaryotic thiol (cysteine) proteases cysteine active site.CATK_MACMU133-144
 
  1-
B:19-30
2THIOL_PROTEASE_HISPS00639 Eukaryotic thiol (cysteine) proteases histidine active site.CATK_MACMU274-284
 
  1-
B:160-170
3THIOL_PROTEASE_ASNPS00640 Eukaryotic thiol (cysteine) proteases asparagine active site.CATK_MACMU291-310
 
  1-
B:177-196

(-) Exons   (5, 10)

Asymmetric Unit (5, 10)
 ENSEMBLUniProtKBPDB
No.Transcript IDExonExon IDGenome LocationLengthIDLocationLengthCountLocationLength
1.1ENSMMUT000000007861ENSMMUE00000299516chr1:129211826-129211722105CATK_MACMU-00--
1.3aENSMMUT000000007863aENSMMUE00000005071chr1:129210363-129210240124CATK_MACMU1-40400--
1.4ENSMMUT000000007864ENSMMUE00000005074chr1:129209777-129209655123CATK_MACMU41-81410--
1.5ENSMMUT000000007865ENSMMUE00000005078chr1:129209569-129209414156CATK_MACMU82-133522A:1-19
B:1-19
19
19
1.6ENSMMUT000000007866ENSMMUE00000005083chr1:129207797-129207579219CATK_MACMU134-206732A:20-92
B:20-92
73
73
1.7ENSMMUT000000007867ENSMMUE00000005086chr1:129203234-129203069166CATK_MACMU207-262562A:93-148
B:93-148
56
56
1.8ENSMMUT000000007868ENSMMUE00000005089chr1:129202799-129202694106CATK_MACMU262-297362A:148-183
B:148-183
36
36
1.9aENSMMUT000000007869aENSMMUE00000005093chr1:129200410-129199747664CATK_MACMU297-329332A:183-215
B:183-215
33
33

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:215
 aligned with CATK_MACMU | P61277 from UniProtKB/Swiss-Prot  Length:329

    Alignment length:215
                                   124       134       144       154       164       174       184       194       204       214       224       234       244       254       264       274       284       294       304       314       324     
           CATK_MACMU   115 APDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
               SCOP domains d2ftda_ A: (Pro)cathepsin K                                                                                                                                                                                             SCOP domains
               CATH domains 2ftdA00 A:1-215 Cysteine proteinases                                                                                                                                                                                    CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....eehhhhh............hhhhhhhhhhhhhhhhhhhh.....hhhhhhhhh...hhhhh.hhhhhhhhhhhhh.ee................hhh.eee...eeee....hhhhhhhhhhhh..eeeee...hhhhhh....ee...........eeeeeeeeee....eeeeee...........eeeee....hhhhh....eeee. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------THIOL_PROTEA---------------------------------------------------------------------------------------------------------------------------------THIOL_PROTE------THIOL_PROTEASE_ASN  ------------------- PROSITE
           Transcript 1 (1) Exon 1.5           Exon 1.6  PDB: A:20-92 UniProt: 134-206                                  Exon 1.7  PDB: A:93-148 UniProt: 207-262                ----------------------------------Exon 1.9a  PDB: A:183-215         Transcript 1 (1)
           Transcript 1 (2) ---------------------------------------------------------------------------------------------------------------------------------------------------Exon 1.8  PDB: A:148-183            -------------------------------- Transcript 1 (2)
                 2ftd A   1 APDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 215
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       210     

Chain B from PDB  Type:PROTEIN  Length:215
 aligned with CATK_MACMU | P61277 from UniProtKB/Swiss-Prot  Length:329

    Alignment length:215
                                   124       134       144       154       164       174       184       194       204       214       224       234       244       254       264       274       284       294       304       314       324     
           CATK_MACMU   115 APDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
               SCOP domains d2ftdb_ B: (Pro)cathepsin K                                                                                                                                                                                             SCOP domains
               CATH domains 2ftdB00 B:1-215 Cysteine proteinases                                                                                                                                                                                    CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....ee.hhh.............hhhhhhhhhhhhhhhhhhhh.....hhhhhhhhh...hhhhh.hhhhhhhhhhhhh.ee................hhh.eee....eee....hhhhhhhhhhhh..eeeee...hhhhhh....ee...........eeeeeeeeee....eeeeee...........eeeee....hhhhh....eee.. Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------THIOL_PROTEA---------------------------------------------------------------------------------------------------------------------------------THIOL_PROTE------THIOL_PROTEASE_ASN  ------------------- PROSITE
           Transcript 1 (1) Exon 1.5           Exon 1.6  PDB: B:20-92 UniProt: 134-206                                  Exon 1.7  PDB: B:93-148 UniProt: 207-262                ----------------------------------Exon 1.9a  PDB: B:183-215         Transcript 1 (1)
           Transcript 1 (2) ---------------------------------------------------------------------------------------------------------------------------------------------------Exon 1.8  PDB: B:148-183            -------------------------------- Transcript 1 (2)
                 2ftd B   1 APDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 215
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       210     

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 2)

Asymmetric Unit

(-) CATH Domains  (1, 2)

Asymmetric Unit
(-)
Class: Alpha Beta (26913)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2FTD)

(-) Gene Ontology  (14, 14)

Asymmetric Unit(hide GO term definitions)
Chain A,B   (CATK_MACMU | P61277)
molecular function
    GO:0005518    collagen binding    Interacting selectively and non-covalently with collagen, a group of fibrous proteins of very high tensile strength that form the main component of connective tissue in animals. Collagen is highly enriched in glycine (some regions are 33% glycine) and proline, occurring predominantly as 3-hydroxyproline (about 20%).
    GO:0004197    cysteine-type endopeptidase activity    Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0008234    cysteine-type peptidase activity    Catalysis of the hydrolysis of peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0001968    fibronectin binding    Interacting selectively and non-covalently with a fibronectin, a group of related adhesive glycoproteins of high molecular weight found on the surface of animal cells, connective tissue matrices, and in extracellular fluids.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0008233    peptidase activity    Catalysis of the hydrolysis of a peptide bond. A peptide bond is a covalent bond formed when the carbon atom from the carboxyl group of one amino acid shares electrons with the nitrogen atom from the amino group of a second amino acid.
    GO:0043394    proteoglycan binding    Interacting selectively and non-covalently with a proteoglycan, any glycoprotein in which the carbohydrate units are glycosaminoglycans.
biological process
    GO:0030574    collagen catabolic process    The proteolytic chemical reactions and pathways resulting in the breakdown of collagen in the extracellular matrix, usually carried out by proteases secreted by nearby cells.
    GO:1903955    positive regulation of protein targeting to mitochondrion    Any process that activates or increases the frequency, rate or extent of protein targeting to mitochondrion.
    GO:0006508    proteolysis    The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.
    GO:0051603    proteolysis involved in cellular protein catabolic process    The hydrolysis of a peptide bond or bonds within a protein as part of the chemical reactions and pathways resulting in the breakdown of a protein by individual cells.
    GO:1903146    regulation of mitophagy    Any process that modulates the frequency, rate or extent of mitochondrion degradation (mitophagy).
cellular component
    GO:0005615    extracellular space    That part of a multicellular organism outside the cells proper, usually taken to be outside the plasma membranes, and occupied by fluid.
    GO:0005764    lysosome    A small lytic vacuole that has cell cycle-independent morphology and is found in most animal cells and that contains a variety of hydrolases, most of which have their maximal activities in the pH range 5-6. The contained enzymes display latency if properly isolated. About 40 different lysosomal hydrolases are known and lysosomes have a great variety of morphologies and functions.

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    ILI  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2ftd)
 
Biological Units
  Complete Structure
    Biological Unit 1  [ Jena3D ]
    Biological Unit 2  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2ftd
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  CATK_MACMU | P61277
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  3.4.22.38
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  CATK_MACMU | P61277
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

(no "Entries Sharing at Least One Protein Chain" available for 2FTD)

(-) Related Entries Specified in the PDB File

1au0 1au2 1au3 1au4