Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asymmetric Unit
(-)Asym. Unit - sites
(-)Biological Unit 1
(-)Biol. Unit 1 - sites
(-)Biological Unit 2
collapse expand < >
Image Asymmetric Unit
Asymmetric Unit  (Jmol Viewer)
Image Asym. Unit - sites
Asym. Unit - sites  (Jmol Viewer)
Image Biological Unit 1
Biological Unit 1  (Jmol Viewer)
Image Biol. Unit 1 - sites
Biol. Unit 1 - sites  (Jmol Viewer)
Image Biological Unit 2
Biological Unit 2  (Jmol Viewer)

(-) Description

Title :  CRYSTAL STRUCTURE OF RECOMBINANT WT BANANA LECTIN
 
Authors :  J. L. Meagher, J. A. Stuckey
Date :  08 May 14  (Deposition) - 04 Nov 15  (Release) - 04 Nov 15  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.70
Chains :  Asym. Unit :  A,B,C,D
Biol. Unit 1:  A,B  (1x)
Biol. Unit 2:  C,D  (1x)
Keywords :  Lectin, Sugar Binding Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  M. D. Swanson, D. M. Boudreaux, L. Salmon, J. Chugh, H. C. Winter, J. L. Meagher, S. Andre, P. V. Murphy, S. Oscarson, R. Roy, S. King, M. H. Kaplan, I. J. Goldstein, E. B. Tarbet, B. L. Hurst, D. F. Smee, C. De La Fuente, H. H. Hoffmann, Y. Xue, C. M. Rice, D. Schols, J. V. Garcia, J. A. Stuckey, H. J. Gabius, H. M. Al-Hashimi, D. M. Markovitz
Engineering A Therapeutic Lectin By Uncoupling Mitogenicity From Antiviral Activity.
Cell V. 163 746 2015
PubMed-ID: 26496612  |  Reference-DOI: 10.1016/J.CELL.2015.09.056

(-) Compounds

Molecule 1 - RIPENING-ASSOCIATED PROTEIN
    ChainsA, B, C, D
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System Taxid562
    Expression System Vector TypePLASMID
    Organism CommonBANANA
    Organism ScientificMUSA ACUMINATA
    Organism Taxid4641
    SynonymLECTIN

 Structural Features

(-) Chains, Units

  1234
Asymmetric Unit ABCD
Biological Unit 1 (1x)AB  
Biological Unit 2 (1x)  CD

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 13)

Asymmetric Unit (1, 13)
No.NameCountTypeFull Name
1GOL13Ligand/IonGLYCEROL
Biological Unit 1 (1, 7)
No.NameCountTypeFull Name
1GOL7Ligand/IonGLYCEROL
Biological Unit 2 (1, 6)
No.NameCountTypeFull Name
1GOL6Ligand/IonGLYCEROL

(-) Sites  (13, 13)

Asymmetric Unit (13, 13)
No.NameEvidenceResiduesDescription
01AC1SOFTWAREGLY A:14 , GLY A:15 , GLY A:129 , LYS A:130 , PHE A:131 , ASP A:133binding site for residue GOL A 201
02AC2SOFTWAREGLY A:34 , ASP A:35 , VAL A:36 , ASP A:38 , GLY A:60binding site for residue GOL A 202
03AC3SOFTWAREGLY B:34 , ASP B:35 , VAL B:36 , ASP B:38 , GLY B:60binding site for residue GOL B 201
04AC4SOFTWAREHIS B:54 , GLY B:56 , GLY B:57 , HOH B:315 , HOH B:316 , HOH B:331binding site for residue GOL B 202
05AC5SOFTWARETYR B:24 , ILE B:26 , TYR B:72 , LEU B:73 , LYS B:120 , ILE B:121 , HOH B:394 , HOH C:431binding site for residue GOL B 203
06AC6SOFTWAREGLY B:14 , GLY B:15 , GLY B:129 , LYS B:130 , PHE B:131 , ASP B:133 , HOH B:302 , HOH B:328binding site for residue GOL B 204
07AC7SOFTWARESER B:33 , GLY B:34 , HIS B:63 , PHE B:104 , GLY B:105 , HOH B:397binding site for residue GOL B 205
08AC8SOFTWAREGLY C:34 , ASP C:35 , VAL C:36 , ASP C:38 , GLY C:60binding site for residue GOL C 201
09AC9SOFTWAREGLY C:14 , GLY C:15 , VAL C:88 , LYS C:130 , PHE C:131 , ASP C:133binding site for residue GOL C 202
10AD1SOFTWAREGLU B:140 , TYR C:46 , GLU C:51 , ARG C:53binding site for residue GOL C 203
11AD2SOFTWAREGLY D:34 , ASP D:35 , VAL D:36 , ASP D:38 , GLY D:59 , GLY D:60binding site for residue GOL D 201
12AD3SOFTWAREGLY D:14 , GLY D:15 , GLY D:129 , LYS D:130 , PHE D:131 , ASP D:133 , HOH D:304 , HOH D:309 , HOH D:329binding site for residue GOL D 202
13AD4SOFTWAREHIS D:54 , GLY D:56 , GLY D:57 , SER D:58 , HOH D:319 , HOH D:320 , HOH D:321binding site for residue GOL D 203

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 4PIF)

(-) Cis Peptide Bonds  (8, 8)

Asymmetric Unit
No.Residues
1Gly A:8 -Ala A:9
2Gly A:102 -Pro A:103
3Gly B:8 -Ala B:9
4Gly B:102 -Pro B:103
5Gly C:8 -Ala C:9
6Gly C:102 -Pro C:103
7Gly D:8 -Ala D:9
8Gly D:102 -Pro D:103

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 4PIF)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 4PIF)

(-) Exons   (0, 0)

(no "Exon" information available for 4PIF)

(-) Sequences/Alignments

Asymmetric Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:139
                                                                                                                                                                           
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeee....eeeeeee..eeeeeeeee...eeeeeeeeee..eeeeeeee....eeeeee......eeeeeeeeeee..eeeeeeeeeee...eeeee.....eeeeeeeee.eeeeeeeee...eeeeeeeee.. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4pif A   4 AIKVGAWGGNGGSAFDMGPAYRIISVKIFSGDVVDGVDVTFTYYGKTETRHYGGSGGTPHEIVLQEGEYLVGMAGEVANYHGAVVLGKLGFSTNKKAYGPFGNTGGTPFSLPIAAGKISGFFGRGGKFLDAIGVYLEPL 142
                                    13        23        33        43        53        63        73        83        93       103       113       123       133         

Chain B from PDB  Type:PROTEIN  Length:142
                                                                                                                                                                              
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....eeeeeee....eeee.....eeeeeeeee...eeeeeeeeee..eeeeeeee....eeeeee......eeeeeeeeeee..eeeeeeeeeee...eeeee.....eeeeeeeee.eeeeeeeee...eeeeeeeee.. Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4pif B   1 MNGAIKVGAWGGNGGSAFDMGPAYRIISVKIFSGDVVDGVDVTFTYYGKTETRHYGGSGGTPHEIVLQEGEYLVGMAGEVANYHGAVVLGKLGFSTNKKAYGPFGNTGGTPFSLPIAAGKISGFFGRGGKFLDAIGVYLEPL 142
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140  

Chain C from PDB  Type:PROTEIN  Length:139
                                                                                                                                                                           
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeee....eeeeeee..eeeeeeeee...eeeeeeeeee..eeeeeeee....eeeeee......eeeeeeeeeee..eeeeeeeeeee...eeeee.....eeeeeeeee.eeeeeeeee...eeeeeeeee.. Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4pif C   4 AIKVGAWGGNGGSAFDMGPAYRIISVKIFSGDVVDGVDVTFTYYGKTETRHYGGSGGTPHEIVLQEGEYLVGMAGEVANYHGAVVLGKLGFSTNKKAYGPFGNTGGTPFSLPIAAGKISGFFGRGGKFLDAIGVYLEPL 142
                                    13        23        33        43        53        63        73        83        93       103       113       123       133         

Chain D from PDB  Type:PROTEIN  Length:142
                                                                                                                                                                              
               SCOP domains ---------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ---------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ---------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author ....eeeeeee....eeee.....eeeeeeeee...eeeeeeeeee..eeeeeeee....eeeeee......eeeeeeeeeee..eeeeeeeeeee...eeeee.....eeeeeeeee.eeeeeeeee...eeeeeeeee.. Sec.struct. author
                 SAPs(SNPs) ---------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ---------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ---------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 4pif D   1 MNGAIKVGAWGGNGGSAFDMGPAYRIISVKIFSGDVVDGVDVTFTYYGKTETRHYGGSGGTPHEIVLQEGEYLVGMAGEVANYHGAVVLGKLGFSTNKKAYGPFGNTGGTPFSLPIAAGKISGFFGRGGKFLDAIGVYLEPL 142
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140  

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 4PIF)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 4PIF)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 4PIF)

(-) Gene Ontology  (0, 0)

Asymmetric Unit(hide GO term definitions)
    (no "Gene Ontology" information available for 4PIF)

 Visualization

(-) Interactive Views

Asymmetric Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    GOL  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
    AC7  [ RasMol ]  +environment [ RasMol ]
    AC8  [ RasMol ]  +environment [ RasMol ]
    AC9  [ RasMol ]  +environment [ RasMol ]
    AD1  [ RasMol ]  +environment [ RasMol ]
    AD2  [ RasMol ]  +environment [ RasMol ]
    AD3  [ RasMol ]  +environment [ RasMol ]
    AD4  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Gly A:102 - Pro A:103   [ RasMol ]  
    Gly A:8 - Ala A:9   [ RasMol ]  
    Gly B:102 - Pro B:103   [ RasMol ]  
    Gly B:8 - Ala B:9   [ RasMol ]  
    Gly C:102 - Pro C:103   [ RasMol ]  
    Gly C:8 - Ala C:9   [ RasMol ]  
    Gly D:102 - Pro D:103   [ RasMol ]  
    Gly D:8 - Ala D:9   [ RasMol ]  
 
Biological Units
  Complete Structure
    Biological Unit 1  [ Jena3D ]
    Biological Unit 2  [ Jena3D ]

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  4pif
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  O22321_MUSAC | O22321
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/TrEMBL
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  O22321_MUSAC | O22321
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/TrEMBL
        O22321_MUSAC | O223212bmy 2bmz 2bn0 4pik 4pit 4piu 5exg

(-) Related Entries Specified in the PDB File

4pik 4pit 4piu