Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF THE MAIN PROTEASE OF CORONAVIRUS HKU4
 
Authors :  Q. Ma, Y. Xiao, R. Hilgenfeld
Date :  13 Oct 12  (Deposition) - 23 Oct 13  (Release) - 23 Oct 13  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.50
Chains :  Asym./Biol. Unit :  A,B
Keywords :  Hydrolase, Sars (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  Q. Ma, Y. Xiao, R. Hilgenfeld
Inhibitor For The Main Protease Of Coronavirus Hku4
To Be Published
PubMed: search

(-) Compounds

Molecule 1 - 3C-LIKE PROTEINASE
    ChainsA, B
    EC Number3.4.22.69, 3.4.22.-
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System StrainBL21(DE3)
    Expression System Taxid469008
    Expression System VariantGOLD
    Organism ScientificTYLONYCTERIS BAT CORONAVIRUS HKU4
    Organism Taxid694007
    StrainHKU4-1L
    Synonym3CL-PRO, 3CLP, NSP5

 Structural Features

(-) Chains, Units

  12
Asymmetric/Biological Unit AB

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (3, 7)

Asymmetric/Biological Unit (3, 7)
No.NameCountTypeFull Name
1GOL4Ligand/IonGLYCEROL
2IMD1Ligand/IonIMIDAZOLE
3NI2Ligand/IonNICKEL (II) ION

(-) Sites  (7, 7)

Asymmetric Unit (7, 7)
No.NameEvidenceResiduesDescription
1AC1SOFTWARELEU A:27 , HIS A:41 , CYS A:148 , GLN A:306 , IMD A:1308BINDING SITE FOR RESIDUE NI A1307
2AC2SOFTWAREHIS B:41 , CYS B:148 , HOH B:2075BINDING SITE FOR RESIDUE NI B1303
3AC3SOFTWARELEU A:27 , HIS A:41 , GLN A:306 , NI A:1307BINDING SITE FOR RESIDUE IMD A1308
4AC4SOFTWARETHR A:126 , HOH A:2275 , ALA B:8 , ASP B:295 , HOH B:2014 , HOH B:2213 , HOH B:2214 , HOH B:2344 , HOH B:2345BINDING SITE FOR RESIDUE GOL B1304
5AC5SOFTWAREMET A:6 , ALA A:8 , GLU A:155 , MET A:298 , HOH A:2014 , HOH A:2307 , HOH A:2308 , HOH A:2311 , HOH A:2466 , HOH A:2467 , THR B:126BINDING SITE FOR RESIDUE GOL A1309
6AC6SOFTWARESER B:107 , THR B:108 , SER B:179 , ASP B:181 , HOH B:2169 , HOH B:2171 , HOH B:2349 , HOH B:2351BINDING SITE FOR RESIDUE GOL B1305
7AC7SOFTWAREASN A:15 , HOH A:2023 , GLY B:11 , GLU B:14 , ASN B:15BINDING SITE FOR RESIDUE GOL B1306

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 2YNA)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 2YNA)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 2YNA)

(-) PROSITE Motifs  (1, 2)

Asymmetric/Biological Unit (1, 2)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1M_PROPS51442 Coronavirus main protease (M-pro) domain profile.R1A_BCHK43292-3597
 
  2A:1-306
B:1-302

(-) Exons   (0, 0)

(no "Exon" information available for 2YNA)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:306
 aligned with R1A_BCHK4 | P0C6T4 from UniProtKB/Swiss-Prot  Length:4434

    Alignment length:306
                                  3301      3311      3321      3331      3341      3351      3361      3371      3381      3391      3401      3411      3421      3431      3441      3451      3461      3471      3481      3491      3501      3511      3521      3531      3541      3551      3561      3571      3581      3591      
           R1A_BCHK4   3292 SGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLTDPNYDALLISKTNHSFIVQKHIGAQANLRVVAHSMVGVLLKLTVDVANPSTPAYTFSTVKPGASFSVLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGSVGYTENGGVINFVYMHQMELSNGTHTGSSFDGVMYGAFEDKQTHQLQLTDKYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEWALSNQFTEFVGTQSIDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDVNMQVMGVVMQ 3597
               SCOP domains d2ynaa_ A: automated matches                                                                                                                                                                                                                                                                                       SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .........hhhhhh.eeeeee..eeeeeeee..eeeee.hhhhhhhhh...hhhhhhhh.hhhhheee..........eeeeeee..eeeeee.........eee.......eeeeeeee..eeeeeeeee..................eeeeee..eeeeeeeeeeeee..eeeee......hhhh..............hhhhhhhhhhhhhhh...........hhhhhhhhhhhh.......hhhhhhhhhhhh.hhhhhhhhhhhhhhh................hhhhhhhhhh..... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE M_PRO  PDB: A:1-306 UniProt: 3292-3597                                                                                                                                                                                                                                                                             PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                2yna A    1 SGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLTDPNYDALLISKTNHSFIVQKHIGAQANLRVVAHSMVGVLLKLTVDVANPSTPAYTFSTVKPGASFSVLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGSVGYTENGGVINFVYMHQMELSNGTHTGSSFDGVMYGAFEDKQTHQLQLTDKYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEWALSNQFTEFVGTQSIDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDVNMQVMGVVMQ  306
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       210       220       230       240       250       260       270       280       290       300      

Chain B from PDB  Type:PROTEIN  Length:302
 aligned with R1A_BCHK4 | P0C6T4 from UniProtKB/Swiss-Prot  Length:4434

    Alignment length:302
                                  3301      3311      3321      3331      3341      3351      3361      3371      3381      3391      3401      3411      3421      3431      3441      3451      3461      3471      3481      3491      3501      3511      3521      3531      3541      3551      3561      3571      3581      3591  
           R1A_BCHK4   3292 SGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLTDPNYDALLISKTNHSFIVQKHIGAQANLRVVAHSMVGVLLKLTVDVANPSTPAYTFSTVKPGASFSVLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGSVGYTENGGVINFVYMHQMELSNGTHTGSSFDGVMYGAFEDKQTHQLQLTDKYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEWALSNQFTEFVGTQSIDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDVNMQVMG 3593
               SCOP domains d2ynab_ B: automated matches                                                                                                                                                                                                                                                                                   SCOP domains
               CATH domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .........hhhhhh.eeeeee..eeeeeeee..eeeee.hhhhhhhhh...hhhhhhhh.hhh.eeeee.....eee.eeeeeee..eeeeee.........eee.......eeeeeeee..eeeeeeeee..................eeeeee..eeeeeeeeeeeee..eeeee......hhhh..............hhhhhhhhhhhhhhh...........hhhhhhhhhhhh.......hhhhhhhhhhhh.hhhhhhhhhhhhhhh................hhhhhhhh... Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE M_PRO  PDB: B:1-302 UniProt: 3292-3597                                                                                                                                                                                                                                                                         PROSITE
                 Transcript -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                2yna B    1 SGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLTDPNYDALLISKTNHSFIVQKHIGAQANLRVVAHSMVGVLLKLTVDVANPSTPAYTFSTVKPGASFSVLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGSVGYTENGGVINFVYMHQMELSNGTHTGSSFDGVMYGAFEDKQTHQLQLTDKYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEWALSNQFTEFVGTQSIDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDVNMQVMG  302
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       210       220       230       240       250       260       270       280       290       300  

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 2)

Asymmetric/Biological Unit

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 2YNA)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2YNA)

(-) Gene Ontology  (31, 31)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A,B   (R1A_BCHK4 | P0C6T4)
molecular function
    GO:0003723    RNA binding    Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
    GO:0003968    RNA-directed 5'-3' RNA polymerase activity    Catalysis of the reaction: nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1); uses an RNA template, i.e. the catalysis of RNA-template-directed extension of the 3'-end of an RNA strand by one nucleotide at a time.
    GO:0004197    cysteine-type endopeptidase activity    Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0008234    cysteine-type peptidase activity    Catalysis of the hydrolysis of peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0008242    omega peptidase activity    Catalysis of the removal of terminal peptide residues that are substituted, cyclized or linked by isopeptide bonds (peptide linkages other than those of alpha-carboxyl to alpha-amino groups).
    GO:0008233    peptidase activity    Catalysis of the hydrolysis of a peptide bond. A peptide bond is a covalent bond formed when the carbon atom from the carboxyl group of one amino acid shares electrons with the nitrogen atom from the amino group of a second amino acid.
    GO:0036459    thiol-dependent ubiquitinyl hydrolase activity    Catalysis of the thiol-dependent hydrolysis of an ester, thioester, amide, peptide or isopeptide bond formed by the C-terminal glycine of ubiquitin.
    GO:0016740    transferase activity    Catalysis of the transfer of a group, e.g. a methyl group, glycosyl group, acyl group, phosphorus-containing, or other groups, from one compound (generally regarded as the donor) to another compound (generally regarded as the acceptor). Transferase is the systematic name for any enzyme of EC class 2.
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0030683    evasion or tolerance by virus of host immune response    Any process, either active or passive, by which a virus avoids the effects of the host organism's immune response. The host is defined as the larger of the organisms involved in a symbiotic interaction.
    GO:0039595    induction by virus of catabolism of host mRNA    The process in which a virus increases the frequency, rate or extent of the breakdown of host messenger RNA (mRNA).
    GO:0039520    induction by virus of host autophagy    Any process in which a virus activates or increases the frequency, rate or extent of autophagy in the host.
    GO:0039648    modulation by virus of host protein ubiquitination    Any process in which a virus modulates the frequency, rate or extent of protein ubiquitination in the host organism. Ubiquitination is the process in which one or more ubiquitin groups are added to a protein.
    GO:0006508    proteolysis    The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.
    GO:0039548    suppression by virus of host IRF3 activity    Any process in which a virus stops, prevents, or reduces the activity of host IRF3 (interferon regulatory factor-3), a transcription factor in the RIG-I/MDA-5 signaling pathway. Viral infection triggers phosphorylation of cytoplasmic IRF3, which allows IRF3 to form a homodimer, migrate to the nucleus, and activate transcription of IFN-alpha and IFN-beta genes.
    GO:0039579    suppression by virus of host ISG15 activity    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of host ubiquitin-like protein ISG15 activity. ISG15 is a ubiquitin-like protein that is conjugated to lysine residues on various target proteins. Viruses escape from the antiviral activity of ISG15 by using different mechanisms; the influenza B virus NS1 protein for instance blocks the covalent linkage of ISG15 to its target proteins by directly interacting with ISG15. The papain-like protease from the coronavirus cleaves ISG15 derivatives.
    GO:0039657    suppression by virus of host gene expression    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of gene expression in the host organism. Gene expression is the process in which a gene's coding sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA (for protein-coding genes) and the translation of that mRNA into protein. Some protein processing events may be included when they are required to form an active form of a product from an inactive precursor form.
    GO:0039503    suppression by virus of host innate immune response    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of the innate immune response of the host organism, the host's first line of defense.
    GO:0039502    suppression by virus of host type I interferon-mediated signaling pathway    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of type I interferon-mediated signaling in the host organism. Type I interferons include the interferon-alpha, beta, delta, episilon, zeta, kappa, tau, and omega gene families.
    GO:0001172    transcription, RNA-templated    The cellular synthesis of RNA on a template of RNA.
    GO:0019079    viral genome replication    Any process involved directly in viral genome replication, including viral nucleotide metabolism.
    GO:0016032    viral process    A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle.
    GO:0019082    viral protein processing    Any protein maturation process achieved by the cleavage of a peptide bond or bonds within a viral protein.
cellular component
    GO:0030430    host cell cytoplasm    The cytoplasm of a host cell.
    GO:0033644    host cell membrane    Double layer of lipid molecules as it encloses host cells, and, in eukaryotes, many organelles; may be a single or double lipid bilayer; also includes associated proteins. The host is defined as the larger of the organisms involved in a symbiotic interaction.
    GO:0044220    host cell perinuclear region of cytoplasm    The host cell cytoplasm situated near, or occurring around, the host nucleus.
    GO:0016021    integral component of membrane    The component of a membrane consisting of the gene products and protein complexes having at least some part of their peptide sequence embedded in the hydrophobic region of the membrane.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    GOL  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    IMD  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    NI  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
    AC7  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2yna)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2yna
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  R1A_BCHK4 | P0C6T4
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  3.4.22.-
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
  3.4.22.69
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  R1A_BCHK4 | P0C6T4
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        R1A_BCHK4 | P0C6T42ynb

(-) Related Entries Specified in the PDB File

2ynb CRYSTAL STRUCTURE OF THE MAIN PROTEASE OF CORONAVIRUS HKU4 IN COMPLEX WITH A MICHAEL ACCEPTOR SG85