(-) Description

Title :  X-RAY STRUCTURE OF AAV2 OBD-AAVS1 COMPLEX 2:1
 
Authors :  F. N. Musayev, C. R. Escalante
Date :  10 Jun 15  (Deposition) - 23 Sep 15  (Release) - 25 Nov 15  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.50 Å
Chains :  Asym. Unit :  A,B,C,D,E,F,G,H
Biol. Unit 1:  A,B,E,F  (1x)
Biol. Unit 2:  C,D,G,H  (1x)
Keywords :  Protein-DNA Complex, DNA Binding Protein-DNA Complex
 
Reference :  F. N. Musayev, F. Zarate-Perez, C. Bishop, J. W. Burgner, C. R. Escalante
Structural Insights Into The Assembly Of The Adeno-Associated Virus Type 2 Rep68 Protein On The Integration Site Aavs1.
J. Biol. Chem. V. 290 27487 2015
PubMed-ID: 26370092  |  Reference-DOI: 10.1074/JBC.M115.669960

(-) Compounds

Molecule 1 - PROTEIN REP78
    Chains :  A, B, C, D
    Engineered :  YES
    Expression System :  ESCHERICHIA COLI
    Expression System Plasmid :  PET15B
    Expression System Strain :  BL21(DE3)PLYSS
    Expression System Taxid :  469008
    Expression System Vector Type :  PLASMID
    Fragment :  UNP RESIDUES 1-210
    Gene :  REP78
    Organism Common :  AAV-2
    Organism Scientific :  ADENO-ASSOCIATED VIRUS 2
    Organism Taxid :  648242
    Strain :  ISOLATE SRIVASTAVA/1982
 
Molecule 2 - DNA (5'- D(*CP*TP*CP*GP*GP*CP*GP*CP*TP*CP*GP*CP*TP*CP*GP*CP*TP*CP*GP*CP*T)- 3')
    Chains :  E, G
    Engineered :  YES
    Organism Scientific :  HOMO SAPIENS
    Organism Taxid :  9606
    Synthetic :  YES
 
Molecule 3 - DNA (5'- D(*GP*AP*GP*CP*GP*AP*GP*CP*GP*AP*GP*CP*GP*AP*GP*CP*GP*CP*CP*GP*A)- 3')
    Chains :  F, H
    Engineered :  YES
    Organism Scientific :  HOMO SAPIENS
    Organism Taxid :  9606
    Synthetic :  YES

 Structural Features

(-) Chains, Units

                         12345678
Asymmetric Unit          ABCDEFGH
Biological Unit 1 (1x)   AB  EF
Biological Unit 2 (1x)     CD  GH

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 5)

Asymmetric Unit (2, 5)
No.  Name  Count  Type        Full Name
1    CIT   2      Ligand/Ion  CITRIC ACID
2    MG    3      Ligand/Ion  MAGNESIUM ION
Biological Unit 1 (1, 1)
No.  Name  Count  Type        Full Name
1    CIT   1      Ligand/Ion  CITRIC ACID
2    MG    -1     Ligand/Ion  MAGNESIUM ION
Biological Unit 2 (1, 1)
No.  Name  Count  Type        Full Name
1    CIT   1      Ligand/Ion  CITRIC ACID
2    MG    -1     Ligand/Ion  MAGNESIUM ION

(-) Sites  (5, 5)

Asymmetric Unit (5, 5)
No.  Name  Evidence  Residues                                                                     Description
1    AC1   SOFTWARE  HIS A:92                                                                     binding site for residue MG A 301
2    AC2   SOFTWARE  GLU A:150, GLN A:174, TYR A:175, LEU A:188, GLN A:191, HIS A:192, HIS A:195  binding site for residue CIT A 302
3    AC3   SOFTWARE  GLU B:83, HIS B:90, HIS B:92, HOH B:406                                      binding site for residue MG B 301
4    AC4   SOFTWARE  GLU C:83, HIS C:92, LYS C:160                                                binding site for residue MG C 301
5    AC5   SOFTWARE  GLU C:150, PRO C:154, TYR C:175, LEU C:188, GLN C:191, HIS C:192, HIS C:195  binding site for residue CIT C 302

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 5BYG)

(-) Cis Peptide Bonds  (4, 4)

Asymmetric Unit
No.  Residues
1    Gln A:162 - Pro A:163
2    Gln B:162 - Pro B:163
3    Gln C:162 - Pro C:163
4    Gln D:162 - Pro D:163

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 5BYG)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 5BYG)

(-) Exons   (0, 0)

(no "Exon" information available for 5BYG)

(-) Sequences/Alignments

Asymmetric Unit
Chain A from PDB  Type:PROTEIN  Length:181
                                                                                                                                                                                                                     
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...eeeeeeee............hhhhhhhhhhhhhhhhhhhhhhhhhhhhh.....eeeeeee....eeeeeeee....hhhhhhhhhhhhhhhhhhhh...........eee................hhhhhhh.......eeeeee.hhhhh....hhhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5byg A   1 MPGFYEIVIKVPSEWELPPDSDMDLNLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMVLGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDESYIPNFLLPKTQPELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQE 201
                                    10  ||    40        50        60        70        80        90       100       110       120       130       140       150       160       170       180       190       200 
                                       13|                                                                                                                                                                       
                                        34                                                                                                                                                                       

Chain B from PDB  Type:PROTEIN  Length:201
                                                                                                                                                                                                                                         
               SCOP domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...eeeeeeee..........................hhhhhhhhhhhhhhhhhhhhhhhhhhhh......eeeeeee....eeeeeeee....hhhhhhhhhhhhhhhhhhhh...........eee................hhhhhhh.......eeeeee.hhhhh....hhhhhhhhhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5byg B   1 MPGFYEIVIKVPSDGISDSFVNWVAEKEWELPPDSDMDLNLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMVLGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDESYIPNFLLPKTQPELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQ 207
                                    10   ||   26        36        46        56        66        76        86        96       106       116       126       136       146       156       166       176       186       196       206 
                                        14|                                                                                                                                                                                          
                                         21                                                                                                                                                                                          

Chain C from PDB  Type:PROTEIN  Length:193
                                                                                                                                                                                                                                 
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ...eeeeeeee.........hhhhhh..........hhhhhhhhhhhhhhhhhhhhhhhhhhhhh.....eeeeeee....eeeeeeee....hhhhhhhhhhhhhhhhhhhh...........eee................hhhhhhh.......eeeeee.hhhhh....hhhhhhhhhhhhhhhhhh.. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5byg C   1 MPGFYEIVIKVPSGISDSFVNWVAEKEWELPPDSDMDLNLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMVLGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDESYIPNFLLPKTQPELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQ 200
                                    10  ||    27        37        47        57        67        77        87        97       107       117       127       137       147       157       167       177       187       197   
                                       13|                                                                                                                                                                                   
                                        21                                                                                                                                                                                   

Chain D from PDB  Type:PROTEIN  Length:180
                                                                                                                                                                                                                    
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author ...eeeee........hhhhhhhhhhhhhhhhhhhhhhhhhhhhh.....eeeeeee....eeeeeeee....hhhhhhhhhhhhhhhhhhhh............ee............ee..hhhhhhh.......eeeeee.hhhhh....hhhhhhhhhhhhhhhhhhhhhhhhh.. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 5byg D   1 MPGFYEIVIKPPDSDMDLNLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMVLGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDESYIPNFLLPKTQPELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQ 207
                                    10|       47        57        67        77        87        97       107       117       127       137       147       157       167       177       187       197       207
                                    10|                                                                                                                                                                         
                                     38                                                                                                                                                                         
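
The extra numbering lines under each protein chain mark a break in the residue numbering (for example 13|34 in chain A). As a rough consistency check, and assuming each break simply corresponds to residues absent from the modelled chain, the listed chain length should equal the numbering span minus the skipped residues. The function name and the tuples below are illustrative only; the numbers are read off the listing above.

```python
def observed_length(first, last, gap_before, gap_after):
    """Count residues present when numbering runs first..last with a single
    break between gap_before and gap_after (both of which are present)."""
    missing = gap_after - gap_before - 1
    return (last - first + 1) - missing

# chain: (first number, last number, last number before the break,
#         first number after the break, length listed in the entry)
chains = {
    "A": (1, 201, 13, 34, 181),
    "B": (1, 207, 14, 21, 201),
    "C": (1, 200, 13, 21, 193),
    "D": (1, 207, 10, 38, 180),
}

for name, (first, last, before, after, listed) in chains.items():
    computed = observed_length(first, last, before, after)
    print(name, computed, listed, computed == listed)
```

Under that assumption the computed and listed lengths agree for all four protein chains.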

Chain E from PDB  Type:DNA  Length:21
                                                     
                 5byg E   1 CTCGGCGCTCGCTCGCTCGCT  21
                                    10        20 

Chain F from PDB  Type:DNA  Length:21
                                                     
                 5byg F  22 GAGCGAGCGAGCGAGCGCCGA  42
                                    31        41 

Chain G from PDB  Type:DNA  Length:21
                                                     
                 5byg G   1 CTCGGCGCTCGCTCGCTCGCT  21
                                    10        20 

Chain H from PDB  Type:DNA  Length:21
                                                     
                 5byg H  22 GAGCGAGCGAGCGAGCGCCGA  42
                                    31        41 
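
Chains E/F and G/H carry the two DNA strands of each complex. One quick way to see how the strands pair, assuming standard antiparallel Watson-Crick pairing, is to compare one strand with the reverse complement of the other. The helper below is an illustrative sketch and not part of the database entry; the sequences are copied from the listing above.

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

chain_e = "CTCGGCGCTCGCTCGCTCGCT"  # chain E (identical to chain G)
chain_f = "GAGCGAGCGAGCGAGCGCCGA"  # chain F (identical to chain H)

rc_e = reverse_complement(chain_e)

# Blunt-ended duplex: the strands would be exact reverse complements.
full_match = rc_e == chain_f

# Duplex offset by one nucleotide: drop the last base of rc_e and the
# first base of chain F before comparing.
shifted_match = rc_e[:-1] == chain_f[1:]

print(full_match, shifted_match)
```

With the sequences as listed, the full-length comparison fails and the shifted comparison succeeds, which is consistent with a 20-base-pair duplex carrying one unpaired nucleotide at each 5' end. This is inferred from the sequences alone, not from the deposited coordinates.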

   Legend:
  Mismatch: orange background
  Gap: '-' on green background; border residues have a numbering label
  Modified residue: lower-case on blue background; 'x' indicates an undefined single-letter code; labelled with number + name
  Chemical group: 'x' on purple background; labelled with number + name, e.g. ACE or NH2
  Extra numbering lines below/above the sequence indicate numbering irregularities, modified residue names etc.; the number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 5BYG)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 5BYG)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 5BYG)

(-) Gene Ontology  (10, 10)

Asymmetric Unit

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  5byg
    Family and Domain Information :  ProDom | SYSTERS
    General Structural Information :  GlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in Membranes :  OPM
    Protein Surface :  SURFACE
    Secondary Structure :  DSSP (structure derived) | HSSP (homology derived)
    Structural Genomics :  GeneCensus
    Structural Neighbours :  CE | VAST
    Structure Classification :  CATH | Dali | SCOP
    Validation and Original Data :  BMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  REP78_AAV2S | Q89268
    Comparative Protein Structure Models :  ModBase
    Genomic Information :  Ensembl
    Protein-protein Interaction :  DIP
    Sequence, Family and Domain Information :  InterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme Information :  BRENDA | EC-PDB | Enzyme | IntEnz
    Pathway :  KEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease Information :  OMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related Information :  GenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain Information :  XDom
    Interatomic Contacts of Structural Units :  CSU
    Ligand-protein Contacts :  LPC
    Protein Cavities :  castP
    Sequence and Secondary Structure :  PDBCartoon
    Structure Alignment :  STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access)
    Structure and Sequence Browser :  STING
 
Access by UniProt ID/Accession number
  REP78_AAV2S | Q89268
    Protein Disorder Prediction :  DisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

(no "Entries Sharing at Least One Protein Chain" available for 5BYG)

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 5BYG)