Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Biological Unit 1
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF SARS CORONAVIRUS MAIN PROTEASE COMPLEXED WITH AC-DSFDQ-H (SOAKING)
 
Authors :  L. Zhu, R. Hilgenfeld
Date :  29 Jun 11  (Deposition) - 07 Sep 11  (Release) - 09 Nov 11  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.40
Chains :  Asym. Unit :  A,H
Biol. Unit 1:  A,H  (2x)
Keywords :  3C-Like Proteinase, Protease, Ac-Dsfdq-H, Covalent Bound, Hydrolase- Hydrolase Inhibitor Complex (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  L. Zhu, S. George, M. F. Schmidt, S. I. Al-Gharabli, J. Rademann, R. Hilgenfeld
Peptide Aldehyde Inhibitors Challenge The Substrate Specificity Of The Sars-Coronavirus Main Protease.
Antiviral Res. V. 92 204 2011
PubMed-ID: 21854807  |  Reference-DOI: 10.1016/J.ANTIVIRAL.2011.08.001

(-) Compounds

Molecule 1 - 3C-LIKE PROTEINASE
    ChainsA
    EC Number3.4.22.-
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System Taxid562
    Organism CommonSARS-COV
    Organism ScientificSARS CORONAVIRUS
    Organism Taxid227859
    Synonym3CL-PRO, 3CLP, NSP5
 
Molecule 2 - PEPTIDE ALDEHYDE INHIBITOR AC-DSFDQ-H
    ChainsH
    EngineeredYES
    SyntheticYES

 Structural Features

(-) Chains, Units

  12
Asymmetric Unit AH
Biological Unit 1 (2x)AH

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 2)

Asymmetric Unit (2, 2)
No.NameCountTypeFull Name
1ACE1Mod. Amino AcidACETYL GROUP
2ECC1Mod. Amino Acid(4S)-4-AMINO-5-HYDROXYPENTANAMIDE
Biological Unit 1 (2, 4)
No.NameCountTypeFull Name
1ACE2Mod. Amino AcidACETYL GROUP
2ECC2Mod. Amino Acid(4S)-4-AMINO-5-HYDROXYPENTANAMIDE

(-) Sites  (0, 0)

(no "Site" information available for 3SNB)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3SNB)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 3SNB)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (2, 2)

Asymmetric Unit (2, 2)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_R1A_CVHSA_047 *Q3429PR1A_CVHSA  ---  ---AQ189P
2UniProtVAR_R1A_CVHSA_048 *D3488ER1A_CVHSA  ---  ---AD248E
   * ID not provided by source

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)
Biological Unit 1 (2, 4)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_R1A_CVHSA_047 *Q3429PR1A_CVHSA  ---  ---AQ189P
2UniProtVAR_R1A_CVHSA_048 *D3488ER1A_CVHSA  ---  ---AD248E
   * ID not provided by source

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)

(-) PROSITE Motifs  (1, 1)

Asymmetric Unit (1, 1)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1M_PROPS51442 Coronavirus main protease (M-pro) domain profile.R1A_CVHSA3241-3546  1A:1-306
Biological Unit 1 (1, 2)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1M_PROPS51442 Coronavirus main protease (M-pro) domain profile.R1A_CVHSA3241-3546  2A:1-306

(-) Exons   (0, 0)

(no "Exon" information available for 3SNB)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:306
 aligned with R1A_CVHSA | P0C6U8 from UniProtKB/Swiss-Prot  Length:4382

    Alignment length:306
                                  3250      3260      3270      3280      3290      3300      3310      3320      3330      3340      3350      3360      3370      3380      3390      3400      3410      3420      3430      3440      3450      3460      3470      3480      3490      3500      3510      3520      3530      3540      
           R1A_CVHSA   3241 SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDVVRQCSGVTFQ 3546
               SCOP domains d3snba_ A: Coronavirus main proteinase (3Cl-pro, putative coronavirus nsp2)                                                                                                                                                                                                                                        SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .........hhhhhh.eeeeee..eeeeeeee..eeeee.hhhhhhhhhh..hhhhhhhh.hhh.eeeee..eee.eeeeeee..eeeeee.........eee.......eeeeeeee..eeeeeeeee..................eeeeee..eeeeeeeeeee.....eeee........................hhhhhhhhhhhhhhh...........hhhhhhhhhhhh.....hhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhh................hhhhhhhhh...... Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------P----------------------------------------------------------E---------------------------------------------------------- SAPs(SNPs)
                    PROSITE M_PRO  PDB: A:1-306 UniProt: 3241-3546                                                                                                                                                                                                                                                                             PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                3snb A    1 SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDVVRQCSGVTFQ  306
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200       210       220       230       240       250       260       270       280       290       300      

Chain H from PDB  Type:PROTEIN  Length:6
                                       
               SCOP domains ------ SCOP domains
               CATH domains ------ CATH domains
               Pfam domains ------ Pfam domains
         Sec.struct. author ...... Sec.struct. author
                 SAPs(SNPs) ------ SAPs(SNPs)
                    PROSITE ------ PROSITE
                 Transcript ------ Transcript
                3snb H    0 xDSFDq    5
                            |    |
                            0-ACE|
                                 5-ECC

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 1)

Asymmetric Unit

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3SNB)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 3SNB)

(-) Gene Ontology  (36, 36)

Asymmetric Unit(hide GO term definitions)
Chain A   (R1A_CVHSA | P0C6U8)
molecular function
    GO:0003723    RNA binding    Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
    GO:0003968    RNA-directed 5'-3' RNA polymerase activity    Catalysis of the reaction: nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1); uses an RNA template, i.e. the catalysis of RNA-template-directed extension of the 3'-end of an RNA strand by one nucleotide at a time.
    GO:0004197    cysteine-type endopeptidase activity    Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0008234    cysteine-type peptidase activity    Catalysis of the hydrolysis of peptide bonds in a polypeptide chain by a mechanism in which the sulfhydryl group of a cysteine residue at the active center acts as a nucleophile.
    GO:0004519    endonuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids by creating internal breaks.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0016817    hydrolase activity, acting on acid anhydrides    Catalysis of the hydrolysis of any acid anhydride.
    GO:0016788    hydrolase activity, acting on ester bonds    Catalysis of the hydrolysis of any ester bond.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0004518    nuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0008242    omega peptidase activity    Catalysis of the removal of terminal peptide residues that are substituted, cyclized or linked by isopeptide bonds (peptide linkages other than those of alpha-carboxyl to alpha-amino groups).
    GO:0008233    peptidase activity    Catalysis of the hydrolysis of a peptide bond. A peptide bond is a covalent bond formed when the carbon atom from the carboxyl group of one amino acid shares electrons with the nitrogen atom from the amino group of a second amino acid.
    GO:0036459    thiol-dependent ubiquitinyl hydrolase activity    Catalysis of the thiol-dependent hydrolysis of an ester, thioester, amide, peptide or isopeptide bond formed by the C-terminal glycine of ubiquitin.
    GO:0016740    transferase activity    Catalysis of the transfer of a group, e.g. a methyl group, glycosyl group, acyl group, phosphorus-containing, or other groups, from one compound (generally regarded as the donor) to another compound (generally regarded as the acceptor). Transferase is the systematic name for any enzyme of EC class 2.
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0030683    evasion or tolerance by virus of host immune response    Any process, either active or passive, by which a virus avoids the effects of the host organism's immune response. The host is defined as the larger of the organisms involved in a symbiotic interaction.
    GO:0039595    induction by virus of catabolism of host mRNA    The process in which a virus increases the frequency, rate or extent of the breakdown of host messenger RNA (mRNA).
    GO:0039520    induction by virus of host autophagy    Any process in which a virus activates or increases the frequency, rate or extent of autophagy in the host.
    GO:0039648    modulation by virus of host protein ubiquitination    Any process in which a virus modulates the frequency, rate or extent of protein ubiquitination in the host organism. Ubiquitination is the process in which one or more ubiquitin groups are added to a protein.
    GO:0090305    nucleic acid phosphodiester bond hydrolysis    The nucleic acid metabolic process in which the phosphodiester bonds between nucleotides are cleaved by hydrolysis.
    GO:0006508    proteolysis    The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.
    GO:0039548    suppression by virus of host IRF3 activity    Any process in which a virus stops, prevents, or reduces the activity of host IRF3 (interferon regulatory factor-3), a transcription factor in the RIG-I/MDA-5 signaling pathway. Viral infection triggers phosphorylation of cytoplasmic IRF3, which allows IRF3 to form a homodimer, migrate to the nucleus, and activate transcription of IFN-alpha and IFN-beta genes.
    GO:0039579    suppression by virus of host ISG15 activity    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of host ubiquitin-like protein ISG15 activity. ISG15 is a ubiquitin-like protein that is conjugated to lysine residues on various target proteins. Viruses escape from the antiviral activity of ISG15 by using different mechanisms; the influenza B virus NS1 protein for instance blocks the covalent linkage of ISG15 to its target proteins by directly interacting with ISG15. The papain-like protease from the coronavirus cleaves ISG15 derivatives.
    GO:0039657    suppression by virus of host gene expression    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of gene expression in the host organism. Gene expression is the process in which a gene's coding sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA (for protein-coding genes) and the translation of that mRNA into protein. Some protein processing events may be included when they are required to form an active form of a product from an inactive precursor form.
    GO:0039503    suppression by virus of host innate immune response    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of the innate immune response of the host organism, the host's first line of defense.
    GO:0039502    suppression by virus of host type I interferon-mediated signaling pathway    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of type I interferon-mediated signaling in the host organism. Type I interferons include the interferon-alpha, beta, delta, episilon, zeta, kappa, tau, and omega gene families.
    GO:0001172    transcription, RNA-templated    The cellular synthesis of RNA on a template of RNA.
    GO:0019079    viral genome replication    Any process involved directly in viral genome replication, including viral nucleotide metabolism.
    GO:0016032    viral process    A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle.
    GO:0019082    viral protein processing    Any protein maturation process achieved by the cleavage of a peptide bond or bonds within a viral protein.
cellular component
    GO:0030430    host cell cytoplasm    The cytoplasm of a host cell.
    GO:0033644    host cell membrane    Double layer of lipid molecules as it encloses host cells, and, in eukaryotes, many organelles; may be a single or double lipid bilayer; also includes associated proteins. The host is defined as the larger of the organisms involved in a symbiotic interaction.
    GO:0044220    host cell perinuclear region of cytoplasm    The host cell cytoplasm situated near, or occurring around, the host nucleus.
    GO:0016021    integral component of membrane    The component of a membrane consisting of the gene products and protein complexes having at least some part of their peptide sequence embedded in the hydrophobic region of the membrane.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    ACE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    ECC  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
(no "Sites" information available for 3snb)
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 3snb)
 
Biological Unit
  Complete Structure
    Biological Unit 1  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3snb
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  R1A_CVHSA | P0C6U8
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  3.4.22.-
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  R1A_CVHSA | P0C6U8
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        R1A_CVHSA | P0C6U81o5s 1p76 1p9t 1pa5 1puk 1q1x 1q2w 1qz8 1uj1 1uk2 1uk3 1uk4 1uw7 1wof 1ysy 1z1i 1z1j 2a5a 2a5i 2a5k 2acf 2ahm 2aj5 2alv 2amd 2amq 2bx3 2bx4 2c3s 2d2d 2duc 2fav 2fe8 2fyg 2g1f 2g9t 2ga6 2gdt 2gri 2gt7 2gt8 2gtb 2gx4 2gz7 2gz8 2gz9 2h2z 2hob 2hsx 2idy 2jzd 2jze 2jzf 2k87 2kaf 2kqv 2kqw 2kys 2liz 2op9 2ozk 2pwx 2q6g 2qc2 2qcy 2qiq 2rhb 2rnk 2w2g 2wct 2z3c 2z3d 2z3e 2z94 2z9g 2z9j 2z9k 2z9l 2zu4 2zu5 3atw 3avz 3aw0 3aw1 3d62 3e91 3e9s 3ea7 3ea8 3ea9 3eaj 3ee7 3f9e 3f9f 3f9g 3f9h 3fzd 3iwm 3m3s 3m3t 3m3v 3mj5 3r24 3sn8 3sna 3snc 3snd 3sne 3szn 3tit 3tiu 3tns 3tnt 3v3m 3vb3 3vb4 3vb5 3vb6 3vb7 4hi3 4m0w 4mds 4mm3 4ovz 4ow0 5f22

(-) Related Entries Specified in the PDB File

2h2z THE SAME PROTEIN WITHOUT INHIBITOR
3sn8
3sna
3snc
3snd
3sne