1PHV - JenaLib

Theor.Model - manually (Jmol Viewer)

Theoretical Model (Jmol Viewer)

Theor. Model - sites (Jmol Viewer)

Title :   COMPARATIVE ANALYSIS OF THE SEQUENCES AND STRUCTURES OF HIV-1 AND HIV-2 PROTEASES

Authors :   A. Gustchina, I. T. Weber

Date :   28 Feb 91 (Deposition) - 15 Apr 92 (Release) - 15 Oct 94 (Revision)

Method :   THEORETICAL MODEL

Resolution :   NOT APPLICABLE

Chains :   Theor. Model : A,B,I,_^#
( ^#: chains that contain no standard or modified protein/DNA/RNA residue)

Keywords :   Hydrolase (Acid Protease) (Keyword Search: [Gene Ontology, PubMed, Web (Google)] )

Reference :   A. Gustchina, I. T. Weber
Comparative Analysis Of The Sequences And Structures Of Hiv-1 And Hiv-2 Proteases.
Proteins V. 10 325 1991
PubMed: search
(for further references see the PDB file header)

Molecule 1 - HIV-2 PROTEASE
	Chains	:	A, B
	Engineered	:	YES
	Expression System	:	ESCHERICHIA COLI
	Organism Scientific	:	HIV-2 ROD ISOLATE

Molecule 2 -
	Chains	:	I
	Engineered	:	YES


Theoretical Model	:

( ^#: chains that contain no standard or modified protein/DNA/RNA residue)

Summary Information (see also Sequences/Alignments below)

Theoretical Model (3, 4)

No.	Name	Count	Type	Full Name
1	ACE	1	Mod. Amino Acid	ACETYL GROUP
2	ALA	1	Mod. Amino Acid	ALANINE
3	STA	2	Mod. Amino Acid	STATINE

Theoretical Model (2, 2)

No.	Name	Evidence	Residues	Description
1	DTA	not defined	ASP A:25 , THR A:26 , GLY A:27
2	DTB	not defined	ASP B:25 , THR B:26 , GLY B:27

(no "SS Bond" information available for 1PHV)

(no "Cis Peptide Bond" information available for 1PHV)

(no "SAP(SNP)/Variant" information available for 1PHV)

Theoretical Model (2, 2)

	PROSITE			UniProtKB		PDB
No.	ID	AC	Description	ID	Location	Count	Location
1	ASP_PROT_RETROV	PS50175	Aspartyl protease, retroviral-type family profile.	POL_HV2RO	533-602	1	A:20-89
2	ASP_PROTEASE	PS00141	Eukaryotic and viral aspartyl proteases active site.	POL_HV2RO	535-546	1	A:22-33

(no "Exon" information available for 1PHV)

Theoretical Model

Reformat:	Number of residues per line = ('0' or empty: single-line sequence representation)
	Number of residues per labelling interval =
	UniProt sequence: complete aligned part

Show mapping:	SCOP domains	CATH domains	Pfam domains	Secondary structure (by author)
	SAPs(SNPs)	PROSITE motifs	Exons
	(details for a mapped element are shown in a popup box when the mouse pointer rests over it)

Chain A from PDB  Type:PROTEIN  Length:99
 aligned with POL_HV2RO | P04584 from UniProtKB/Swiss-Prot  Length:1464

    Alignment length:99
                                   523       533       543       553       563       573       583       593       603         
            POL_HV2RO   514 PQFSLWKRPVVTAYIEGQPVEVLLDTGADDSIVAGIELGNNYSPKIVGGIGGFINTKEYKNVEIEVLNKKVRATIMTGDTPINIFGRNILTALGMSLNL 612
               SCOP domains --------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------- Pfam domains
     Sec.struct. author (1) eeee....eeeeeee--eeeeeeee.....eeee........eeeeeee--eeeeeeeeeeeeeee--eeeeeeeeee....eeehhhhhhhhh.eeee Sec.struct. author (1)
     Sec.struct. author (2) ---------------ttt-------------------------------ttt-------------tttt------------------------------ Sec.struct. author (2)
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------- SAPs(SNPs)
                PROSITE (1) -------------------ASP_PROT_RETROV  PDB: A:20-89 UniProt: 533-602                        ---------- PROSITE (1)
                PROSITE (2) ---------------------ASP_PROTEASE------------------------------------------------------------------ PROSITE (2)
                 Transcript --------------------------------------------------------------------------------------------------- Transcript
                 1phv A   1 PQFSLWKRPVVTAYIEGQPVEVLLDTGADDSIVAGIELGNNYSPKIVGGIGGFINTKEYKNVEIEVLNKKVRATIMTGDTPINIFGRNILTALGMSLNL  99
                                    10        20        30        40        50        60        70        80        90

Chain B from PDB  Type:PROTEIN  Length:99
                                                                                                                                   
               SCOP domains --------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------- Pfam domains
     Sec.struct. author (1) eeee....eeeeeee--eeeeeeee.....eeee........eeeeeee--eeeeeeeeeeeeeee--eeeeeeeeee....eeehhhhhhhhh.eeee Sec.struct. author (1)
     Sec.struct. author (2) ---------------ttt-------------------------------ttt-------------tttt------------------------------ Sec.struct. author (2)
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------- Transcript
                 1phv B   1 PQFSLWKRPVVTAYIEGQPVEVLLDTGADDSIVAGIELGNNYSPKIVGGIGGFINTKEYKNVEIEVLNKKVRATIMTGDTPINIFGRNILTALGMSLNL  99
                                    10        20        30        40        50        60        70        80        90

Chain I from PDB  Type:PROTEIN  Length:6
                                      
               SCOP domains ------ SCOP domains
               CATH domains ------ CATH domains
               Pfam domains ------ Pfam domains
         Sec.struct. author ...... Sec.struct. author
                 SAPs(SNPs) ------ SAPs(SNPs)
                    PROSITE ------ PROSITE
                 Transcript ------ Transcript
                 1phv I   1 xVVxAx   6
                            |  | |
                            1-ACE|
                               4-STA
                                 6-STA

Legend:		→ Mismatch	(orange background)
	-	→ Gap	(green background, '-', border residues have a numbering label)
		→ Modified Residue	(blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
	x	→ Chemical Group	(purple background, 'x', labelled with number + name, e.g. ACE or NH2)
	extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '\|'

(no "SCOP Domain" information available for 1PHV)

(no "CATH Domain" information available for 1PHV)

(no "Pfam Domain" information available for 1PHV)

Theoretical Model(hide GO term definitions)

Chain A (POL_HV2RO | P04584)

molecular function
	GO:0003677	DNA binding	Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
	GO:0003887	DNA-directed DNA polymerase activity	Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1); the synthesis of DNA from deoxyribonucleotide triphosphates in the presence of a DNA template and a 3'hydroxyl group.
	GO:0003723	RNA binding	Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
	GO:0004523	RNA-DNA hybrid ribonuclease activity	Catalysis of the endonucleolytic cleavage of RNA in RNA-DNA hybrids to 5'-phosphomonoesters.
	GO:0003964	RNA-directed DNA polymerase activity	Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). Catalyzes RNA-template-directed extension of the 3'- end of a DNA strand by one deoxynucleotide at a time.
	GO:0004190	aspartic-type endopeptidase activity	Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which a water molecule bound by the side chains of aspartic residues at the active center acts as a nucleophile.
	GO:0003824	catalytic activity	Catalysis of a biochemical reaction at physiological temperatures. In biologically catalyzed reactions, the reactants are known as substrates, and the catalysts are naturally occurring macromolecular substances known as enzymes. Enzymes possess specific binding sites for substrates, and are usually composed wholly or largely of protein, but RNA that has catalytic activity (ribozyme) is often also regarded as enzymatic.
	GO:0004519	endonuclease activity	Catalysis of the hydrolysis of ester linkages within nucleic acids by creating internal breaks.
	GO:0004533	exoribonuclease H activity	Catalysis of the exonucleolytic cleavage of RNA to 5'-phosphomonoester oligonucleotides in both 5' to 3' and 3' to 5' directions.
	GO:0016787	hydrolase activity	Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
	GO:0008289	lipid binding	Interacting selectively and non-covalently with a lipid.
	GO:0046872	metal ion binding	Interacting selectively and non-covalently with any metal ion.
	GO:0004518	nuclease activity	Catalysis of the hydrolysis of ester linkages within nucleic acids.
	GO:0003676	nucleic acid binding	Interacting selectively and non-covalently with any nucleic acid.
	GO:0016779	nucleotidyltransferase activity	Catalysis of the transfer of a nucleotidyl group to a reactant.
	GO:0008233	peptidase activity	Catalysis of the hydrolysis of a peptide bond. A peptide bond is a covalent bond formed when the carbon atom from the carboxyl group of one amino acid shares electrons with the nitrogen atom from the amino group of a second amino acid.
	GO:0005198	structural molecule activity	The action of a molecule that contributes to the structural integrity of a complex or its assembly within or outside a cell.
	GO:0016740	transferase activity	Catalysis of the transfer of a group, e.g. a methyl group, glycosyl group, acyl group, phosphorus-containing, or other groups, from one compound (generally regarded as the donor) to another compound (generally regarded as the acceptor). Transferase is the systematic name for any enzyme of EC class 2.
	GO:0008270	zinc ion binding	Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
	GO:0015074	DNA integration	The process in which a segment of DNA is incorporated into another, usually larger, DNA molecule such as a chromosome.
	GO:0006310	DNA recombination	Any process in which a new genotype is formed by reassortment of genes resulting in gene combinations different from those that were present in the parents. In eukaryotes genetic recombination can occur by chromosome assortment, intrachromosomal recombination, or nonreciprocal interchromosomal recombination. Intrachromosomal recombination occurs by crossing over. In bacteria it may occur by genetic transformation, conjugation, transduction, or F-duction.
	GO:0090502	RNA phosphodiester bond hydrolysis, endonucleolytic	The chemical reactions and pathways involving the hydrolysis of internal 3',5'-phosphodiester bonds in one or two strands of ribonucleotides.
	GO:0090503	RNA phosphodiester bond hydrolysis, exonucleolytic	The chemical reactions and pathways involving the hydrolysis of terminal 3',5'-phosphodiester bonds in one or two strands of ribonucleotides.
	GO:0006278	RNA-dependent DNA biosynthetic process	A DNA biosynthetic process that uses RNA as a template for RNA-dependent DNA polymerases (e.g. reverse transcriptase) that synthesize the new strand.
	GO:0075713	establishment of integrated proviral latency	A process by which the virus integrates into the host genome and establishes as a stable provirus or prophage.
	GO:0008152	metabolic process	The chemical reactions and pathways, including anabolism and catabolism, by which living organisms transform chemical substances. Metabolic processes typically transform small molecules, but also include macromolecular processes such as DNA repair and replication, and protein synthesis and degradation.
	GO:0090305	nucleic acid phosphodiester bond hydrolysis	The nucleic acid metabolic process in which the phosphodiester bonds between nucleotides are cleaved by hydrolysis.
	GO:0006508	proteolysis	The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.
	GO:0039657	suppression by virus of host gene expression	Any process in which a virus stops, prevents, or reduces the frequency, rate or extent of gene expression in the host organism. Gene expression is the process in which a gene's coding sequence is converted into a mature gene product or products (proteins or RNA). This includes the production of an RNA transcript as well as any processing to produce a mature RNA product or an mRNA (for protein-coding genes) and the translation of that mRNA into protein. Some protein processing events may be included when they are required to form an active form of a product from an inactive precursor form.
	GO:0046718	viral entry into host cell	The process that occurs after viral attachment by which a virus, or viral nucleic acid, breaches the plasma membrane or cell envelope and enters the host cell. The process ends when the viral nucleic acid is released into the host cell cytoplasm.
	GO:0075732	viral penetration into host nucleus	The crossing by the virus of the host nuclear membrane, either as naked viral genome or for small viruses as an intact capsid.
	GO:0016032	viral process	A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle.
	GO:0019076	viral release from host cell	The dissemination of mature viral particles from the host cell, e.g. by cell lysis or the budding of virus particles from the cell membrane.
cellular component
	GO:0030430	host cell cytoplasm	The cytoplasm of a host cell.
	GO:0044174	host cell endosome	A membrane-bounded organelle that carries materials newly ingested by endocytosis. It passes many of the materials to host cell lysosomes for degradation.
	GO:0033644	host cell membrane	Double layer of lipid molecules as it encloses host cells, and, in eukaryotes, many organelles; may be a single or double lipid bilayer; also includes associated proteins. The host is defined as the larger of the organisms involved in a symbiotic interaction.
	GO:0042025	host cell nucleus	A membrane-bounded organelle as it is found in the host cell in which chromosomes are housed and replicated. The host is defined as the larger of the organisms involved in a symbiotic interaction.
	GO:0020002	host cell plasma membrane	The plasma membrane surrounding a host cell.
	GO:0072494	host multivesicular body	A late endosome in which regions of the limiting host cell endosomal membrane invaginate to form internal vesicles; host membrane proteins that enter the internal vesicles are sequestered from the host cytoplasm.
	GO:0016020	membrane	A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.
	GO:0019028	viral capsid	The protein coat that surrounds the infective nucleic acid in some virus particles. It comprises numerous regularly arranged subunits, or capsomeres.
	GO:0019013	viral nucleocapsid	The complete protein-nucleic acid complex that is the packaged form of the genome in a virus particle.
	GO:0019012	virion	The complete fully infectious extracellular virus particle.
	GO:0055036	virion membrane	The lipid bilayer surrounding a virion.

Theoretical Model
	Complete Structure
		Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
		WebMol \| AstexViewer[tm]@PDBe (Java Applets, require no local installation except for Java; loading may be slow)
		STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
		RasMol (require local installation)
		Molscript (VRML) (requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)

	Ligands, Modified Residues, Ions
		ACE	[ RasMol \| Jena3D ]	+environment [ RasMol \| Jena3D ]
		ALA	[ RasMol \| Jena3D ]	+environment [ RasMol \| Jena3D ]
		STA	[ RasMol \| Jena3D ]	+environment [ RasMol \| Jena3D ]

	Sites
		DTA	[ RasMol ]	+environment [ RasMol ]
		DTB	[ RasMol ]	+environment [ RasMol ]

	Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 1phv)

Jmol
	protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick [ Theoretical Model PNG format \| Theor. Model - sites PNG format ](automatic orientation, automatically generated)
Molscript
	protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick [ mono PDF format \| stereo PDF format ](default orientation, automatically generated)
Weblab
	protein: ribbon, secondary structure with active site amino acids shown as sticks, chain-specific coloring; inhibitor: spacefill [ mono GIF format \| stereo GIF format ]
	protein: spacefill, chain-specific coloring; inhibitor: balls and sticks; water molecule: red sphere [ mono GIF format \| stereo GIF format ]

Access by PDB/NDB ID
	1phv
		Family and Domain Information	:	ProDom \| SYSTERS
		General Structural Information	:	GlycoscienceDB \| MMDB \| NDB \| OCA \| PDB \| PDBe \| PDBj \| PDBsum \| PDBWiki \| PQS \| PROTEOPEDIA
		Orientation in Membranes	:	OPM
		Protein Surface	:	SURFACE
		Secondary Structure	:	DSSP (structure derived) \| HSSP (homology derived)
		Structural Genomics	:	GeneCensus
		Structural Neighbours	:	CE \| VAST
		Structure Classification	:	CATH \| Dali \| SCOP
		Validation and Original Data	:	BMRB Data View \| BMRB Restraints Grid \| EDS \| PROCHECK \| RECOORD \| WHAT_CHECK

Access by UniProt ID/Accession number
	POL_HV2RO \| P04584
		Comparative Protein Structure Models	:	ModBase
		Genomic Information	:	Ensembl
		Protein-protein Interaction	:	DIP
		Sequence, Family and Domain Information	:	InterPro \| Pfam \| SMART \| UniProtKB/SwissProt

Access by Enzyme Classificator (EC Number)
	(no 'Enzyme Classificator' available)
		General Enzyme Information	:	BRENDA \| EC-PDB \| Enzyme \| IntEnz
		Pathway	:	KEGG \| MetaCyc

Access by Disease Identifier (MIM ID)
	(no 'MIM ID' available)
		Disease Information	:	OMIM

Access by GenAge ID
	(no 'GenAge ID' available)
		Age Related Information	:	GenAge

Access by PDB/NDB ID
		Domain Information	:	XDom
		Interatomic Contacts of Structural Units	:	CSU
		Ligand-protein Contacts	:	LPC
		Protein Cavities	:	castP
		Sequence and Secondary Structure	:	PDBCartoon
		Structure Alignment	:	STRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
		Structure and Sequence Browser	:	STING

Access by UniProt ID/Accession number
	POL_HV2RO \| P04584
		Protein Disorder Prediction	:	DisEMBL \| FoldIndex \| GLOBPLOT (for more information see DisProt)

(no "Related Entries Specified in the PDB File" available for 1PHV)

Description

Compounds

Structural Features

Chains, Units

Ligands, Modified Residues, Ions (3, 4)

Sites (2, 2)

SS Bonds (0, 0)

Cis Peptide Bonds (0, 0)

Sequence-Structure Mapping

SAPs(SNPs)/Variants (0, 0)

PROSITE Motifs (2, 2)

Exons (0, 0)

Sequences/Alignments

Classification and Annotation

SCOP Domains (0, 0)

CATH Domains (0, 0)

Pfam Domains (0, 0)

Gene Ontology (44, 44)

Visualization

Interactive Views

Still Images

Databases and Analysis Tools

Databases

Analysis Tools

Related Entries

Entries Sharing at Least One Protein Chain (UniProt ID)

Related Entries Specified in the PDB File