Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Theor.Model - manually
(-)Theoretical Model
(-)Theor. Model - sites
collapse expand < >
Image Theor.Model - manually
Theor.Model - manually  (Jmol Viewer)
Image Theoretical Model
Theoretical Model  (Jmol Viewer)
Image Theor. Model - sites
Theor. Model - sites  (Jmol Viewer)

(-) Description

Title :  COMPARATIVE ANALYSIS OF THE SEQUENCES AND STRUCTURES OF HIV-1 AND HIV-2 PROTEASES
 
Authors :  A. Gustchina, I. T. Weber
Date :  28 Feb 91  (Deposition) - 15 Apr 92  (Release) - 15 Oct 94  (Revision)
Method :  THEORETICAL MODEL
Resolution :  NOT APPLICABLE
Chains :  Theor. Model :  A,B,I,_#
#:   chains that contain no standard or modified protein/DNA/RNA residue)
 
Keywords :  Hydrolase (Acid Protease) (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  A. Gustchina, I. T. Weber
Comparative Analysis Of The Sequences And Structures Of Hiv-1 And Hiv-2 Proteases.
Proteins V. 10 325 1991
PubMed: search
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - HIV-2 PROTEASE
    ChainsA, B
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Organism ScientificHIV-2 ROD ISOLATE
 
Molecule 2 - 
    ChainsI
    EngineeredYES

 Structural Features

(-) Chains, Units

  
Theoretical Model 
#:   chains that contain no standard or modified protein/DNA/RNA residue)
 

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (3, 4)

Theoretical Model (3, 4)
No.NameCountTypeFull Name
1ACE1Mod. Amino AcidACETYL GROUP
2ALA1Mod. Amino AcidALANINE
3STA2Mod. Amino AcidSTATINE

(-) Sites  (2, 2)

Theoretical Model (2, 2)
No.NameEvidenceResiduesDescription
1DTAnot definedASP A:25 , THR A:26 , GLY A:27
2DTBnot definedASP B:25 , THR B:26 , GLY B:27

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 1PHV)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 1PHV)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 1PHV)

(-) PROSITE Motifs  (2, 2)

Theoretical Model (2, 2)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1ASP_PROT_RETROVPS50175 Aspartyl protease, retroviral-type family profile.POL_HV2RO533-602  1A:20-89
2ASP_PROTEASEPS00141 Eukaryotic and viral aspartyl proteases active site.POL_HV2RO535-546  1A:22-33

(-) Exons   (0, 0)

(no "Exon" information available for 1PHV)

(-) Sequences/Alignments

Theoretical Model
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:99
 aligned with POL_HV2RO | P04584 from UniProtKB/Swiss-Prot  Length:1464

    Alignment length:99
                                   523       533       543       553       563       573       583       593       603         
            POL_HV2RO   514 PQFSLWKRPVVTAYIEGQPVEVLLDTGADDSIVAGIELGNNYSPKIVGGIGGFINTKEYKNVEIEVLNKKVRATIMTGDTPINIFGRNILTALGMSLNL 612
               SCOP domains --------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------- Pfam domains
     Sec.struct. author (1) eeee....eeeeeee--eeeeeeee.....eeee........eeeeeee--eeeeeeeeeeeeeee--eeeeeeeeee....eeehhhhhhhhh.eeee Sec.struct. author (1)
     Sec.struct. author (2) ---------------ttt-------------------------------ttt-------------tttt------------------------------ Sec.struct. author (2)
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------- SAPs(SNPs)
                PROSITE (1) -------------------ASP_PROT_RETROV  PDB: A:20-89 UniProt: 533-602                        ---------- PROSITE (1)
                PROSITE (2) ---------------------ASP_PROTEASE------------------------------------------------------------------ PROSITE (2)
                 Transcript --------------------------------------------------------------------------------------------------- Transcript
                 1phv A   1 PQFSLWKRPVVTAYIEGQPVEVLLDTGADDSIVAGIELGNNYSPKIVGGIGGFINTKEYKNVEIEVLNKKVRATIMTGDTPINIFGRNILTALGMSLNL  99
                                    10        20        30        40        50        60        70        80        90         

Chain B from PDB  Type:PROTEIN  Length:99
                                                                                                                                   
               SCOP domains --------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------- Pfam domains
     Sec.struct. author (1) eeee....eeeeeee--eeeeeeee.....eeee........eeeeeee--eeeeeeeeeeeeeee--eeeeeeeeee....eeehhhhhhhhh.eeee Sec.struct. author (1)
     Sec.struct. author (2) ---------------ttt-------------------------------ttt-------------tttt------------------------------ Sec.struct. author (2)
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------- Transcript
                 1phv B   1 PQFSLWKRPVVTAYIEGQPVEVLLDTGADDSIVAGIELGNNYSPKIVGGIGGFINTKEYKNVEIEVLNKKVRATIMTGDTPINIFGRNILTALGMSLNL  99
                                    10        20        30        40        50        60        70        80        90         

Chain I from PDB  Type:PROTEIN  Length:6
                                      
               SCOP domains ------ SCOP domains
               CATH domains ------ CATH domains
               Pfam domains ------ Pfam domains
         Sec.struct. author ...... Sec.struct. author
                 SAPs(SNPs) ------ SAPs(SNPs)
                    PROSITE ------ PROSITE
                 Transcript ------ Transcript
                 1phv I   1 xVVxAx   6
                            |  | |
                            1-ACE|
                               4-STA
                                 6-STA

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 1PHV)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 1PHV)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 1PHV)

(-) Gene Ontology  (44, 44)

Theoretical Model(hide GO term definitions)
Chain A   (POL_HV2RO | P04584)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0003887    DNA-directed DNA polymerase activity    Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1); the synthesis of DNA from deoxyribonucleotide triphosphates in the presence of a DNA template and a 3'hydroxyl group.
    GO:0003723    RNA binding    Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
    GO:0004523    RNA-DNA hybrid ribonuclease activity    Catalysis of the endonucleolytic cleavage of RNA in RNA-DNA hybrids to 5'-phosphomonoesters.
    GO:0003964    RNA-directed DNA polymerase activity    Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). Catalyzes RNA-template-directed extension of the 3'- end of a DNA strand by one deoxynucleotide at a time.
    GO:0004190    aspartic-type endopeptidase activity    Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which a water molecule bound by the side chains of aspartic residues at the active center acts as a nucleophile.
    GO:0003824    catalytic activity    Catalysis of a biochemical reaction at physiological temperatures. In biologically catalyzed reactions, the reactants are known as substrates, and the catalysts are naturally occurring macromolecular substances known as enzymes. Enzymes possess specific binding sites for substrates, and are usually composed wholly or largely of protein, but RNA that has catalytic activity (ribozyme) is often also regarded as enzymatic.
    GO:0004519    endonuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids by creating internal breaks.
    GO:0004533    exoribonuclease H activity    Catalysis of the exonucleolytic cleavage of RNA to 5'-phosphomonoester oligonucleotides in both 5' to 3' and 3' to 5' directions.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0008289    lipid binding    Interacting selectively and non-covalently with a lipid.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0004518    nuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0016779    nucleotidyltransferase activity    Catalysis of the transfer of a nucleotidyl group to a reactant.
    GO:0008233    peptidase activity    Catalysis of the hydrolysis of a peptide bond. A peptide bond is a covalent bond formed when the carbon atom from the carboxyl group of one amino acid shares electrons with the nitrogen atom from the amino group of a second amino acid.
    GO:0005198    structural molecule activity    The action of a molecule that contributes to the structural integrity of a complex or its assembly within or outside a cell.
    GO:0016740    transferase activity    Catalysis of the transfer of a group, e.g. a methyl group, glycosyl group, acyl group, phosphorus-containing, or other groups, from one compound (generally regarded as the donor) to another compound (generally regarded as the acceptor). Transferase is the systematic name for any enzyme of EC class 2.
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0015074    DNA integration    The process in which a segment of DNA is incorporated into another, usually larger, DNA molecule such as a chromosome.
    GO:0006310    DNA recombination    Any process in which a new genotype is formed by reassortment of genes resulting in gene combinations different from those that were present in the parents. In eukaryotes genetic recombination can occur by chromosome assortment, intrachromosomal recombination, or nonreciprocal interchromosomal recombination. Intrachromosomal recombination occurs by crossing over. In bacteria it may occur by genetic transformation, conjugation, transduction, or F-duction.
    GO:0090502    RNA phosphodiester bond hydrolysis, endonucleolytic    The chemical reactions and pathways involving the hydrolysis of internal 3',5'-phosphodiester bonds in one or two strands of ribonucleotides.
    GO:0090503    RNA phosphodiester bond hydrolysis, exonucleolytic    The chemical reactions and pathways involving the hydrolysis of terminal 3',5'-phosphodiester bonds in one or two strands of ribonucleotides.
    GO:0006278    RNA-dependent DNA biosynthetic process    A DNA biosynthetic process that uses RNA as a template for RNA-dependent DNA polymerases (e.g. reverse transcriptase) that synthesize the new strand.
    GO:0075713    establishment of integrated proviral latency    A process by which the virus integrates into the host genome and establishes as a stable provirus or prophage.
    GO:0008152    metabolic process    The chemical reactions and pathways, including anabolism and catabolism, by which living organisms transform chemical substances. Metabolic processes typically transform small molecules, but also include macromolecular processes such as DNA repair and replication, and protein synthesis and degradation.
    GO:0090305    nucleic acid phosphodiester bond hydrolysis    The nucleic acid metabolic process in which the phosphodiester bonds between nucleotides are cleaved by hydrolysis.
    GO:0006508    proteolysis    The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.
    GO:0039657    suppression by virus of host gene expression    Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of gene expression in the host organism. Gene expression is the process in which a gene's coding sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA (for protein-coding genes) and the translation of that mRNA into protein. Some protein processing events may be included when they are required to form an active form of a product from an inactive precursor form.
    GO:0046718    viral entry into host cell    The process that occurs after viral attachment by which a virus, or viral nucleic acid, breaches the plasma membrane or cell envelope and enters the host cell. The process ends when the viral nucleic acid is released into the host cell cytoplasm.
    GO:0075732    viral penetration into host nucleus    The crossing by the virus of the host nuclear membrane, either as naked viral genome or for small viruses as an intact capsid.
    GO:0016032    viral process    A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle.
    GO:0019076    viral release from host cell    The dissemination of mature viral particles from the host cell, e.g. by cell lysis or the budding of virus particles from the cell membrane.
cellular component
    GO:0030430    host cell cytoplasm    The cytoplasm of a host cell.
    GO:0044174    host cell endosome    A membrane-bounded organelle that carries materials newly ingested by endocytosis. It passes many of the materials to host cell lysosomes for degradation.
    GO:0033644    host cell membrane    Double layer of lipid molecules as it encloses host cells, and, in eukaryotes, many organelles; may be a single or double lipid bilayer; also includes associated proteins. The host is defined as the larger of the organisms involved in a symbiotic interaction.
    GO:0042025    host cell nucleus    A membrane-bounded organelle as it is found in the host cell in which chromosomes are housed and replicated. The host is defined as the larger of the organisms involved in a symbiotic interaction.
    GO:0020002    host cell plasma membrane    The plasma membrane surrounding a host cell.
    GO:0072494    host multivesicular body    A late endosome in which regions of the limiting host cell endosomal membrane invaginate to form internal vesicles; host membrane proteins that enter the internal vesicles are sequestered from the host cytoplasm.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.
    GO:0019028    viral capsid    The protein coat that surrounds the infective nucleic acid in some virus particles. It comprises numerous regularly arranged subunits, or capsomeres.
    GO:0019013    viral nucleocapsid    The complete protein-nucleic acid complex that is the packaged form of the genome in a virus particle.
    GO:0019012    virion    The complete fully infectious extracellular virus particle.
    GO:0055036    virion membrane    The lipid bilayer surrounding a virion.

 Visualization

(-) Interactive Views

Theoretical Model
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    ACE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    ALA  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    STA  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    DTA  [ RasMol ]  +environment [ RasMol ]
    DTB  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 1phv)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick
Weblab
  protein: ribbon, secondary structure with active site amino acids shown as sticks, chain-specific coloring; inhibitor: spacefill
  protein: spacefill, chain-specific coloring; inhibitor: balls and sticks; water molecule: red sphere

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  1phv
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  POL_HV2RO | P04584
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  POL_HV2RO | P04584
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        POL_HV2RO | P045841e0e 1hii 1hsh 1hsi 1ida 1idb 1ivp 1ivq 1jld 1mu2 2hpe 2hpf 2k4e 2k4h 2k4i 2mip 2phv 3ebz 3ec0 3ecg 3f9k 3upj 4upj 5upj 6upj

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 1PHV)