Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF THE ROUS ASSOCIATED VIRUS INTEGRASE CATALYTIC DOMAIN A182T IN CITRATE BUFFER PH 6.2
 
Authors :  A. Ballandras, X. Robert, P. Gouet
Date :  27 Jul 10  (Deposition) - 31 Aug 11  (Release) - 11 Jan 12  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.55
Chains :  Asym. Unit :  A
Biol. Unit 1:  A  (2x)
Keywords :  Dna Integration Process, Dna Binding Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  A. Ballandras, K. Moreau, X. Robert, M. P. Confort, R. Merceron, R. Haser, C. Ronfort, P. Gouet
A Crystal Structure Of The Catalytic Core Domain Of An Avia Sarcoma And Leukemia Virus Integrase Suggests An Alternate Dimeric Assembly.
Plos One V. 6 23032 2011
PubMed-ID: 21857987  |  Reference-DOI: 10.1371/JOURNAL.PONE.0023032

(-) Compounds

Molecule 1 - INTEGRASE
    ChainsA
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPETG10A
    Expression System StrainBL21(DE3) PLYSS
    Expression System Taxid469008
    Expression System Vector TypePLASMID
    FragmentINRAV1 CCD, RESIDUES 625-771
    GenePOL
    MutationYES
    Organism ScientificROUS SARCOMA VIRUS
    Organism Taxid11886

 Structural Features

(-) Chains, Units

  1
Asymmetric Unit A
Biological Unit 1 (2x)A

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 2)

Asymmetric Unit (2, 2)
No.NameCountTypeFull Name
1FLC1Ligand/IonCITRATE ANION
2OCY1Mod. Amino AcidHYDROXYETHYLCYSTEINE
Biological Unit 1 (2, 4)
No.NameCountTypeFull Name
1FLC2Ligand/IonCITRATE ANION
2OCY2Mod. Amino AcidHYDROXYETHYLCYSTEINE

(-) Sites  (1, 1)

Asymmetric Unit (1, 1)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREARG A:132 , ARG A:179 , PRO A:181 , THR A:182 , SER A:183 , HOH A:398 , HOH A:400 , HOH A:427 , HOH A:428BINDING SITE FOR RESIDUE FLC A 2000

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3O4Q)

(-) Cis Peptide Bonds  (1, 1)

Asymmetric Unit
No.Residues
1Ala A:72 -Pro A:73

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 3O4Q)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 3O4Q)

(-) Exons   (0, 0)

(no "Exon" information available for 3O4Q)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:137
 aligned with POL_RSVSB | O92956 from UniProtKB/Swiss-Prot  Length:1603

    Alignment length:145
                                  1344      1354      1364      1374      1384      1394      1404      1414      1424      1434      1444      1454      1464      1474     
           POL_RSVSB   1335 LGPLQIWQTDFTLEPRMAPRSWLAVTVDTASSAIVVTQHGRVTSVAAQHHWATAIAVLGRPKAIKTDNGSCFTSKSTREWLARWGIAHTTGIPGNSQGQAMVERANRLLKDKIRVLAEGDGFMKRIPASKQGELLAKAMYALNHF 1479
               SCOP domains d3o4qa_ A: Retroviral integrase, catalytic domain                                                                                                 SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .....eeeeeeee.hhhh...eeeeeee.....eeeeee...hhhhhhhhhhhhhhhhh...eee...hhhhhhhhhhhhhhhh..eee.--------hhhhhhhhhhhhhhhhhhhhhh......hhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                3o4q A   55 LGPLQIWQTDFTLEPRMAPRSWLAVTVDTASSAIVVTQHGRVTSVAAQHHWATAIAVLGRPKAIKTDNGScFTSKSTREWLARWGIAHTT--------QAMVERANRLLKDKIRVLAEGDGFMKRIPTSKQGELLAKAMYALNHF  199
                                    64        74        84        94       104       114       124|      134       144       154       164       174       184       194     
                                                                                                125-OCY            144      153                                              

Chain A from PDB  Type:PROTEIN  Length:137
 aligned with Q4ZJZ6_RSVSR | Q4ZJZ6 from UniProtKB/TrEMBL  Length:895

    Alignment length:145
                                   636       646       656       666       676       686       696       706       716       726       736       746       756       766     
        Q4ZJZ6_RSVSR    627 LGPLQIWQTDFTLEPRMAPRSWLAVTVDTASSAIVVTQHGRVTSVAAQHHWATAIAVLGRPKAIKTDNGSCFTSKSTREWLARWGIAHTTGIPGNSQGQAMVERANRLLKDKIRVLAEGDGFMKRIPASKQGELLAKAMYALNHF  771
               SCOP domains d3o4qa_ A: Retroviral integrase, catalytic domain                                                                                                 SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .....eeeeeeee.hhhh...eeeeeee.....eeeeee...hhhhhhhhhhhhhhhhh...eee...hhhhhhhhhhhhhhhh..eee.--------hhhhhhhhhhhhhhhhhhhhhh......hhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                3o4q A   55 LGPLQIWQTDFTLEPRMAPRSWLAVTVDTASSAIVVTQHGRVTSVAAQHHWATAIAVLGRPKAIKTDNGScFTSKSTREWLARWGIAHTT--------QAMVERANRLLKDKIRVLAEGDGFMKRIPTSKQGELLAKAMYALNHF  199
                                    64        74        84        94       104       114       124|      134       144       154       164       174       184       194     
                                                                                                125-OCY            144      153                                              

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 1)

Asymmetric Unit

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3O4Q)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 3O4Q)

(-) Gene Ontology  (27, 42)

Asymmetric Unit(hide GO term definitions)
Chain A   (POL_RSVSB | O92956)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0003887    DNA-directed DNA polymerase activity    Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1); the synthesis of DNA from deoxyribonucleotide triphosphates in the presence of a DNA template and a 3'hydroxyl group.
    GO:0004523    RNA-DNA hybrid ribonuclease activity    Catalysis of the endonucleolytic cleavage of RNA in RNA-DNA hybrids to 5'-phosphomonoesters.
    GO:0003964    RNA-directed DNA polymerase activity    Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). Catalyzes RNA-template-directed extension of the 3'- end of a DNA strand by one deoxynucleotide at a time.
    GO:0004190    aspartic-type endopeptidase activity    Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which a water molecule bound by the side chains of aspartic residues at the active center acts as a nucleophile.
    GO:0003824    catalytic activity    Catalysis of a biochemical reaction at physiological temperatures. In biologically catalyzed reactions, the reactants are known as substrates, and the catalysts are naturally occurring macromolecular substances known as enzymes. Enzymes possess specific binding sites for substrates, and are usually composed wholly or largely of protein, but RNA that has catalytic activity (ribozyme) is often also regarded as enzymatic.
    GO:0004519    endonuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids by creating internal breaks.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0004518    nuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0016779    nucleotidyltransferase activity    Catalysis of the transfer of a nucleotidyl group to a reactant.
    GO:0005198    structural molecule activity    The action of a molecule that contributes to the structural integrity of a complex or its assembly within or outside a cell.
    GO:0016740    transferase activity    Catalysis of the transfer of a group, e.g. a methyl group, glycosyl group, acyl group, phosphorus-containing, or other groups, from one compound (generally regarded as the donor) to another compound (generally regarded as the acceptor). Transferase is the systematic name for any enzyme of EC class 2.
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0015074    DNA integration    The process in which a segment of DNA is incorporated into another, usually larger, DNA molecule such as a chromosome.
    GO:0006310    DNA recombination    Any process in which a new genotype is formed by reassortment of genes resulting in gene combinations different from those that were present in the parents. In eukaryotes genetic recombination can occur by chromosome assortment, intrachromosomal recombination, or nonreciprocal interchromosomal recombination. Intrachromosomal recombination occurs by crossing over. In bacteria it may occur by genetic transformation, conjugation, transduction, or F-duction.
    GO:0090502    RNA phosphodiester bond hydrolysis, endonucleolytic    The chemical reactions and pathways involving the hydrolysis of internal 3',5'-phosphodiester bonds in one or two strands of ribonucleotides.
    GO:0006278    RNA-dependent DNA biosynthetic process    A DNA biosynthetic process that uses RNA as a template for RNA-dependent DNA polymerases (e.g. reverse transcriptase) that synthesize the new strand.
    GO:0075713    establishment of integrated proviral latency    A process by which the virus integrates into the host genome and establishes as a stable provirus or prophage.
    GO:0008152    metabolic process    The chemical reactions and pathways, including anabolism and catabolism, by which living organisms transform chemical substances. Metabolic processes typically transform small molecules, but also include macromolecular processes such as DNA repair and replication, and protein synthesis and degradation.
    GO:0090305    nucleic acid phosphodiester bond hydrolysis    The nucleic acid metabolic process in which the phosphodiester bonds between nucleotides are cleaved by hydrolysis.
    GO:0006508    proteolysis    The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.
    GO:0046718    viral entry into host cell    The process that occurs after viral attachment by which a virus, or viral nucleic acid, breaches the plasma membrane or cell envelope and enters the host cell. The process ends when the viral nucleic acid is released into the host cell cytoplasm.
    GO:0016032    viral process    A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle.
cellular component
    GO:0019028    viral capsid    The protein coat that surrounds the infective nucleic acid in some virus particles. It comprises numerous regularly arranged subunits, or capsomeres.
    GO:0019012    virion    The complete fully infectious extracellular virus particle.

Chain A   (Q4ZJZ6_RSVSR | Q4ZJZ6)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0004523    RNA-DNA hybrid ribonuclease activity    Catalysis of the endonucleolytic cleavage of RNA in RNA-DNA hybrids to 5'-phosphomonoesters.
    GO:0003964    RNA-directed DNA polymerase activity    Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). Catalyzes RNA-template-directed extension of the 3'- end of a DNA strand by one deoxynucleotide at a time.
    GO:0004519    endonuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids by creating internal breaks.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0004518    nuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0016779    nucleotidyltransferase activity    Catalysis of the transfer of a nucleotidyl group to a reactant.
    GO:0016740    transferase activity    Catalysis of the transfer of a group, e.g. a methyl group, glycosyl group, acyl group, phosphorus-containing, or other groups, from one compound (generally regarded as the donor) to another compound (generally regarded as the acceptor). Transferase is the systematic name for any enzyme of EC class 2.
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0015074    DNA integration    The process in which a segment of DNA is incorporated into another, usually larger, DNA molecule such as a chromosome.
    GO:0090502    RNA phosphodiester bond hydrolysis, endonucleolytic    The chemical reactions and pathways involving the hydrolysis of internal 3',5'-phosphodiester bonds in one or two strands of ribonucleotides.
    GO:0006278    RNA-dependent DNA biosynthetic process    A DNA biosynthetic process that uses RNA as a template for RNA-dependent DNA polymerases (e.g. reverse transcriptase) that synthesize the new strand.
    GO:0090305    nucleic acid phosphodiester bond hydrolysis    The nucleic acid metabolic process in which the phosphodiester bonds between nucleotides are cleaved by hydrolysis.

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    FLC  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    OCY  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Ala A:72 - Pro A:73   [ RasMol ]  
 
Biological Unit
  Complete Structure
    Biological Unit 1  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3o4q
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  POL_RSVSB | O92956
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
  Q4ZJZ6_RSVSR | Q4ZJZ6
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  POL_RSVSB | O92956
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)
  Q4ZJZ6_RSVSR | Q4ZJZ6
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        POL_RSVSB | O929561a5v 1a5w 1a5x 1asu 1asv 1asw 1cxq 1cxu 1cz9 1czb 1vsd 1vse 1vsf 1vsh 1vsi 1vsj 1vsk 1vsl 1vsm 3o4n
        Q4ZJZ6_RSVSR | Q4ZJZ63o4n

(-) Related Entries Specified in the PDB File

3o4n