Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF THE N-TERMINAL DOMAIN OF MOLONEY MURINE LEUKEMIA VIRUS INTEGRASE, NORTHEAST STRUCTURAL GENOMICS CONSORTIUM TARGET OR3
 
Authors :  R. Guan, R. Xiao, T. Acton, M. Jiang, M. Roth, G. T. Montelione, Northeas Structural Genomics Consortium (Nesg)
Date :  24 Jun 10  (Deposition) - 14 Jul 10  (Release) - 22 Feb 17  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.69
Chains :  Asym. Unit :  A,B
Biol. Unit 1:  A,B  (2x)
Keywords :  Retroviral Integrase, Zn Finger, Moloney Murine Leukemia Virus, Structural Genomics, Psi-2, Protein Structure Initiative, Northeast Structural Genomics Consortium, Nesg, Viral Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  R. Guan, S. Aiyer, M. L. Cote, R. Xiao, M. Jiang, T. B. Acton, M. J. Roth, G. T. Montelione
X-Ray Crystal Structure Of The N-Terminal Region Of Moloney Murine Leukemia Virus Integrase And Its Implications For Viral Dna Recognition.
Proteins 2017
PubMed-ID: 28066922  |  Reference-DOI: 10.1002/PROT.25245

(-) Compounds

Molecule 1 - N-TERMINAL DOMAIN OF MOLONEY MURINE LEUKEMIA VIRUS INTEGRASE
    ChainsA, B
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET21
    Expression System StrainBL21(DE3)
    Expression System Taxid469008
    Expression System Vector TypePLASMID
    FragmentN-TERMINAL DOMAIN
    GeneGAG-POL
    Organism CommonMOMLV
    Organism ScientificMOLONEY MURINE LEUKEMIA VIRUS
    Organism Taxid11801
    SynonymINTEGRASE P46

 Structural Features

(-) Chains, Units

  12
Asymmetric Unit AB
Biological Unit 1 (2x)AB

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (3, 10)

Asymmetric Unit (3, 10)
No.NameCountTypeFull Name
1ACT2Ligand/IonACETATE ION
2MSE6Mod. Amino AcidSELENOMETHIONINE
3ZN2Ligand/IonZINC ION
Biological Unit 1 (2, 16)
No.NameCountTypeFull Name
1ACT4Ligand/IonACETATE ION
2MSE12Mod. Amino AcidSELENOMETHIONINE
3ZN-1Ligand/IonZINC ION

(-) Sites  (4, 4)

Asymmetric Unit (4, 4)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREHIS A:58 , HIS A:62 , CYS A:95 , CYS A:98BINDING SITE FOR RESIDUE ZN A 201
2AC2SOFTWARELEU A:71 , ARG A:74 , HOH A:124BINDING SITE FOR RESIDUE ACT A 115
3AC3SOFTWAREHIS B:58 , HIS B:62 , CYS B:95 , CYS B:98BINDING SITE FOR RESIDUE ZN B 201
4AC4SOFTWARELEU B:71 , HOH B:116 , HOH B:124BINDING SITE FOR RESIDUE ACT B 115

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 3NNQ)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 3NNQ)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 3NNQ)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 3NNQ)

(-) Exons   (0, 0)

(no "Exon" information available for 3NNQ)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:95
 aligned with POL_MLVMS | P03355 from UniProtKB/Swiss-Prot  Length:1738

    Alignment length:95
                                  1350      1360      1370      1380      1390      1400      1410      1420      1430     
           POL_MLVMS   1341 HFHYTVTDIKDLTKLGAIYDKTKKYWVYQGKPVMPDQFTFELLDFLHQLTHLSFSKMKALLERSHSPYYMLNRDRTLKNITETCKACAQVNASKS 1435
               SCOP domains ----------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ----------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....hhhhhhhhhhhh.eee....eeee..eeeehhhhhhhhhhhhhhhhh.hhhhhhhhhhhhh...ee.hhhhhhhhhhhhhhhhhhhhh... Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------- Transcript
                3nnq A   12 HFHYTVTDIKDLTKLGAIYDKTKKYWVYQGKPVmPDQFTFELLDFLHQLTHLSFSKmKALLERSHSPYYmLNRDRTLKNITETCKACAQVNASKS  106
                                    21        31        41   |    51        61      | 71        81        91       101     
                                                            45-MSE                 68-MSE       81-MSE                     

Chain B from PDB  Type:PROTEIN  Length:95
 aligned with POL_MLVMS | P03355 from UniProtKB/Swiss-Prot  Length:1738

    Alignment length:95
                                  1350      1360      1370      1380      1390      1400      1410      1420      1430     
           POL_MLVMS   1341 HFHYTVTDIKDLTKLGAIYDKTKKYWVYQGKPVMPDQFTFELLDFLHQLTHLSFSKMKALLERSHSPYYMLNRDRTLKNITETCKACAQVNASKS 1435
               SCOP domains ----------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ----------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....hhhhhhhhhhhh.eee....eeee..eeeehhhhhhhhhhhhhhhhh.hhhhhhhhhhhhh...ee.hhhhhhhhhhhhhhhhhhhhh... Sec.struct. author
                 SAPs(SNPs) ----------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ----------------------------------------------------------------------------------------------- PROSITE
                 Transcript ----------------------------------------------------------------------------------------------- Transcript
                3nnq B   12 HFHYTVTDIKDLTKLGAIYDKTKKYWVYQGKPVmPDQFTFELLDFLHQLTHLSFSKmKALLERSHSPYYmLNRDRTLKNITETCKACAQVNASKS  106
                                    21        31        41   |    51        61      | 71        81        91       101     
                                                            45-MSE                 68-MSE       81-MSE                     

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 3NNQ)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 3NNQ)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 3NNQ)

(-) Gene Ontology  (36, 36)

Asymmetric Unit(hide GO term definitions)
Chain A,B   (POL_MLVMS | P03355)
molecular function
    GO:0003677    DNA binding    Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
    GO:0003887    DNA-directed DNA polymerase activity    Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1); the synthesis of DNA from deoxyribonucleotide triphosphates in the presence of a DNA template and a 3'hydroxyl group.
    GO:0003723    RNA binding    Interacting selectively and non-covalently with an RNA molecule or a portion thereof.
    GO:0004523    RNA-DNA hybrid ribonuclease activity    Catalysis of the endonucleolytic cleavage of RNA in RNA-DNA hybrids to 5'-phosphomonoesters.
    GO:0003964    RNA-directed DNA polymerase activity    Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). Catalyzes RNA-template-directed extension of the 3'- end of a DNA strand by one deoxynucleotide at a time.
    GO:0004190    aspartic-type endopeptidase activity    Catalysis of the hydrolysis of internal, alpha-peptide bonds in a polypeptide chain by a mechanism in which a water molecule bound by the side chains of aspartic residues at the active center acts as a nucleophile.
    GO:0003824    catalytic activity    Catalysis of a biochemical reaction at physiological temperatures. In biologically catalyzed reactions, the reactants are known as substrates, and the catalysts are naturally occurring macromolecular substances known as enzymes. Enzymes possess specific binding sites for substrates, and are usually composed wholly or largely of protein, but RNA that has catalytic activity (ribozyme) is often also regarded as enzymatic.
    GO:0004519    endonuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids by creating internal breaks.
    GO:0016787    hydrolase activity    Catalysis of the hydrolysis of various bonds, e.g. C-O, C-N, C-C, phosphoric anhydride bonds, etc. Hydrolase is the systematic name for any enzyme of EC class 3.
    GO:0046872    metal ion binding    Interacting selectively and non-covalently with any metal ion.
    GO:0004518    nuclease activity    Catalysis of the hydrolysis of ester linkages within nucleic acids.
    GO:0003676    nucleic acid binding    Interacting selectively and non-covalently with any nucleic acid.
    GO:0016779    nucleotidyltransferase activity    Catalysis of the transfer of a nucleotidyl group to a reactant.
    GO:0008233    peptidase activity    Catalysis of the hydrolysis of a peptide bond. A peptide bond is a covalent bond formed when the carbon atom from the carboxyl group of one amino acid shares electrons with the nitrogen atom from the amino group of a second amino acid.
    GO:0039660    structural constituent of virion    The action of a molecule that contributes to the structural integrity of a virion.
    GO:0005198    structural molecule activity    The action of a molecule that contributes to the structural integrity of a complex or its assembly within or outside a cell.
    GO:0016740    transferase activity    Catalysis of the transfer of a group, e.g. a methyl group, glycosyl group, acyl group, phosphorus-containing, or other groups, from one compound (generally regarded as the donor) to another compound (generally regarded as the acceptor). Transferase is the systematic name for any enzyme of EC class 2.
    GO:0008270    zinc ion binding    Interacting selectively and non-covalently with zinc (Zn) ions.
biological process
    GO:0071897    DNA biosynthetic process    The cellular DNA metabolic process resulting in the formation of DNA, deoxyribonucleic acid, one of the two main types of nucleic acid, consisting of a long unbranched macromolecule formed from one or two strands of linked deoxyribonucleotides, the 3'-phosphate group of each constituent deoxyribonucleotide being joined in 3',5'-phosphodiester linkage to the 5'-hydroxyl group of the deoxyribose moiety of the next one.
    GO:0015074    DNA integration    The process in which a segment of DNA is incorporated into another, usually larger, DNA molecule such as a chromosome.
    GO:0006310    DNA recombination    Any process in which a new genotype is formed by reassortment of genes resulting in gene combinations different from those that were present in the parents. In eukaryotes genetic recombination can occur by chromosome assortment, intrachromosomal recombination, or nonreciprocal interchromosomal recombination. Intrachromosomal recombination occurs by crossing over. In bacteria it may occur by genetic transformation, conjugation, transduction, or F-duction.
    GO:0090502    RNA phosphodiester bond hydrolysis, endonucleolytic    The chemical reactions and pathways involving the hydrolysis of internal 3',5'-phosphodiester bonds in one or two strands of ribonucleotides.
    GO:0006278    RNA-dependent DNA biosynthetic process    A DNA biosynthetic process that uses RNA as a template for RNA-dependent DNA polymerases (e.g. reverse transcriptase) that synthesize the new strand.
    GO:0075713    establishment of integrated proviral latency    A process by which the virus integrates into the host genome and establishes as a stable provirus or prophage.
    GO:0008152    metabolic process    The chemical reactions and pathways, including anabolism and catabolism, by which living organisms transform chemical substances. Metabolic processes typically transform small molecules, but also include macromolecular processes such as DNA repair and replication, and protein synthesis and degradation.
    GO:0090305    nucleic acid phosphodiester bond hydrolysis    The nucleic acid metabolic process in which the phosphodiester bonds between nucleotides are cleaved by hydrolysis.
    GO:0006508    proteolysis    The hydrolysis of proteins into smaller polypeptides and/or amino acids by cleavage of their peptide bonds.
    GO:0046718    viral entry into host cell    The process that occurs after viral attachment by which a virus, or viral nucleic acid, breaches the plasma membrane or cell envelope and enters the host cell. The process ends when the viral nucleic acid is released into the host cell cytoplasm.
    GO:0016032    viral process    A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle.
    GO:0019068    virion assembly    A late phase of the viral life cycle during which all the components necessary for the formation of a mature virion collect at a particular site in the cell and the basic structure of the virus particle is formed.
cellular component
    GO:0033644    host cell membrane    Double layer of lipid molecules as it encloses host cells, and, in eukaryotes, many organelles; may be a single or double lipid bilayer; also includes associated proteins. The host is defined as the larger of the organisms involved in a symbiotic interaction.
    GO:0020002    host cell plasma membrane    The plasma membrane surrounding a host cell.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.
    GO:0019028    viral capsid    The protein coat that surrounds the infective nucleic acid in some virus particles. It comprises numerous regularly arranged subunits, or capsomeres.
    GO:0019013    viral nucleocapsid    The complete protein-nucleic acid complex that is the packaged form of the genome in a virus particle.
    GO:0019012    virion    The complete fully infectious extracellular virus particle.

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    ACT  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    MSE  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    ZN  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 3nnq)
 
Biological Unit
  Complete Structure
    Biological Unit 1  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  3nnq
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  POL_MLVMS | P03355
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  POL_MLVMS | P03355
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        POL_MLVMS | P033551d0e 1d1u 1i6j 1mml 1n4l 1nnd 1qai 1qaj 1ztt 1ztw 2fjv 2fjw 2fjx 2fvp 2fvq 2fvr 2fvs 2hb5 2m9u 2mqv 2ms0 2ms1 2r2r 2r2s 2r2t 2r2u 3fsi 4m94 4m95 4mh8 4nzg 4xo0 4xpc 4xpe 5dmq 5dmr

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 3NNQ)