Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
(-)Biological Unit 2
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)
Image Biological Unit 2
Biological Unit 2  (Jmol Viewer)

(-) Description

Title :  STRUCTURE OF THE PRECURSOR OF A THERMOSTABLE VARIANT OF PAPAIN AT 2.6 ANGSTROEM RESOLUTION
 
Authors :  S. Roy, D. Choudhury, J. K. Dattagupta, S. Biswas
Date :  02 Sep 11  (Deposition) - 12 Sep 12  (Release) - 28 Nov 12  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.62
Chains :  Asym. Unit :  A,C
Biol. Unit 1:  A  (1x)
Biol. Unit 2:  C  (1x)
Keywords :  Hydrolase, Cytoplasm For Recombinant Expression (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  S. Roy, D. Choudhury, P. Aich, J. K. Dattagupta, S. Biswas
The Structure Of A Thermostable Mutant Of Pro-Papain Reveal Its Activation Mechanism
Acta Crystallogr. , Sect. D V. 68 1591 2012
PubMed-ID: 23151624  |  Reference-DOI: 10.1107/S0907444912038607
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - PAPAIN
    ChainsA, C
    EC Number3.4.22.2
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET30 EK/LIC
    Expression System StrainBL21(DE3)
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentUNP RESIDUES 27-345
    MutationYES
    Organism CommonMAMON
    Organism ScientificCARICA PAPAYA
    Organism Taxid3649
    SynonymPAPAYA PROTEINASE I, PPI

 Structural Features

(-) Chains, Units

  12
Asymmetric Unit AC
Biological Unit 1 (1x)A 
Biological Unit 2 (1x) C

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 1)

Asymmetric Unit (1, 1)
No.NameCountTypeFull Name
1CL1Ligand/IonCHLORIDE ION
Biological Unit 1 (0, 0)
No.NameCountTypeFull Name
1CL-1Ligand/IonCHLORIDE ION
Biological Unit 2 (0, 0)
No.NameCountTypeFull Name
1CL-1Ligand/IonCHLORIDE ION

(-) Sites  (1, 1)

Asymmetric Unit (1, 1)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREASN A:191 , THR A:192 , CYS A:260BINDING SITE FOR RESIDUE CL A 320

(-) SS Bonds  (6, 6)

Asymmetric Unit
No.Residues
1A:129 -A:170
2A:163 -A:202
3A:260 -A:307
4C:129 -C:170
5C:163 -C:202
6C:260 -C:307

(-) Cis Peptide Bonds  (2, 2)

Asymmetric Unit
No.Residues
1Gly A:258 -Pro A:259
2Gly C:258 -Pro C:259

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 3TNX)

(-) PROSITE Motifs  (3, 6)

Asymmetric Unit (3, 6)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1THIOL_PROTEASE_CYSPS00139 Eukaryotic thiol (cysteine) proteases cysteine active site.PAPA1_CARPA152-163
 
  2A:126-137
C:126-137
2THIOL_PROTEASE_HISPS00639 Eukaryotic thiol (cysteine) proteases histidine active site.PAPA1_CARPA290-300
 
  2A:264-274
C:264-274
3THIOL_PROTEASE_ASNPS00640 Eukaryotic thiol (cysteine) proteases asparagine active site.PAPA1_CARPA303-322
 
  2A:277-296
C:277-296
Biological Unit 1 (3, 3)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1THIOL_PROTEASE_CYSPS00139 Eukaryotic thiol (cysteine) proteases cysteine active site.PAPA1_CARPA152-163
 
  1A:126-137
-
2THIOL_PROTEASE_HISPS00639 Eukaryotic thiol (cysteine) proteases histidine active site.PAPA1_CARPA290-300
 
  1A:264-274
-
3THIOL_PROTEASE_ASNPS00640 Eukaryotic thiol (cysteine) proteases asparagine active site.PAPA1_CARPA303-322
 
  1A:277-296
-
Biological Unit 2 (3, 3)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1THIOL_PROTEASE_CYSPS00139 Eukaryotic thiol (cysteine) proteases cysteine active site.PAPA1_CARPA152-163
 
  1-
C:126-137
2THIOL_PROTEASE_HISPS00639 Eukaryotic thiol (cysteine) proteases histidine active site.PAPA1_CARPA290-300
 
  1-
C:264-274
3THIOL_PROTEASE_ASNPS00640 Eukaryotic thiol (cysteine) proteases asparagine active site.PAPA1_CARPA303-322
 
  1-
C:277-296

(-) Exons   (0, 0)

(no "Exon" information available for 3TNX)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:310
 aligned with PAPA1_CARPA | P00784 from UniProtKB/Swiss-Prot  Length:345

    Alignment length:310
                                    45        55        65        75        85        95       105       115       125       135       145       155       165       175       185       195       205       215       225       235       245       255       265       275       285       295       305       315       325       335       345
          PAPA1_CARPA    36 NDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN 345
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....hhhhhhhhhhhhhhhh.....hhhhhhhhhhhhhhhhhhhhhhh.....eee........hhhhhhhhhh............................ee.hhhh............hhhhhhhhhhhhhhhhhhhh.....hhhhhhhhh...hhhhh.hhhhhhhhhhhh.................hhhhhh.......eeeee...hhhhhhhhhh...eeeee...hhhhhheeeeee.........eeeeeeeee..eeeee...........eeeee.......hhhhh...eeeee.. Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------THIOL_PROTEA------------------------------------------------------------------------------------------------------------------------------THIOL_PROTE--THIOL_PROTEASE_ASN  ----------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 3tnx A  10 NDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPNYILIRNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN 319
                                    19        29        39        49        59        69        79        89        99       109       119       129       139       149       159       169       179       189       199       209       219       229       239       249       259       269       279       289       299       309       319

Chain C from PDB  Type:PROTEIN  Length:310
 aligned with PAPA1_CARPA | P00784 from UniProtKB/Swiss-Prot  Length:345

    Alignment length:310
                                    45        55        65        75        85        95       105       115       125       135       145       155       165       175       185       195       205       215       225       235       245       255       265       275       285       295       305       315       325       335       345
          PAPA1_CARPA    36 NDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN 345
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .....hhhhhhhhhhhhhhh.....hhhhhhhhhhhhhhhhhhhhhhh.....eee........hhhhhhhhhh............................ee.hhhh............hhhhhhhhhhhhhhhhhhhh.....hhhhhhhhh...hhhhh.hhhhhhhhhhhh...hhhhh........hhhhhhh.......eeee....hhhhhhhhhh...eeeee...hhhhhheeeeee.........eeeeeeeee..eeeee...........eeeee.......hhhhh...eeeee.. Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------THIOL_PROTEA------------------------------------------------------------------------------------------------------------------------------THIOL_PROTE--THIOL_PROTEASE_ASN  ----------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 3tnx C  10 NDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPNYILIRNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN 319
                                    19        29        39        49        59        69        79        89        99       109       119       129       139       149       159       169       179       189       199       209       219       229       239       249       259       269       279       289       299       309       319

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 3TNX)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3TNX)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 3TNX)

(-) Gene Ontology  (5, 5)

Asymmetric Unit(hide GO term definitions)
Chain A,C   (PAPA1_CARPA | P00784)
molecular function
    GO:0008234    cysteine-type peptidase activity    Catalysis of the hydrolysis of peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0008233    peptidase activity    Catalysis of the hydrolysis of a peptide bond. A peptide bond is a covalent bond formed when the carbon atom from the carboxyl group of one amino acid shares electrons with the nitrogen atom from the amino group of a second amino acid.
    GO:0097655    serpin family protein binding    Interacting selectively and non-covalently with any member of the serpin protein family (serine protease inhibitors or classified inhibitor family I4). Serpins are a broadly distributed family of protease inhibitors that use a conformational change to inhibit target enzymes. They are central in controlling many important proteolytic cascades. The majority of serpins inhibit serine proteases, but serpins that inhibit caspases and papain-like cysteine proteases have also been identified. Rarely, serpins perform a non-inhibitory function; for example, several human serpins function as hormone transporters and certain serpins function as molecular chaperones or tumor suppressors.
biological process
    GO:0006508    proteolysis    The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    CL  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Gly A:258 - Pro A:259   [ RasMol ]  
    Gly C:258 - Pro C:259   [ RasMol ]  
 
Biological Units
  Complete Structure
    Biological Unit 1  [ Jena3D ]
    Biological Unit 2  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3tnx
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  PAPA1_CARPA | P00784
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  3.4.22.2
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  PAPA1_CARPA | P00784
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        PAPA1_CARPA | P007841bp4 1bqi 1cvz 1eff 1khp 1khq 1pad 1pe6 1pip 1pop 1ppd 1ppn 1ppp 1stf 2cio 2pad 3e1z 3ima 3lfy 3usv 4kp9 4pad 4qrg 4qrv 4qrx 5pad 6pad 9pap

(-) Related Entries Specified in the PDB File

9pap MATURE DOMAIN OF PAPAIN