Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)Asym./Biol. Unit
(-)Asym./Biol. Unit - sites
collapse expand < >
Image Asym./Biol. Unit
Asym./Biol. Unit  (Jmol Viewer)
Image Asym./Biol. Unit - sites
Asym./Biol. Unit - sites  (Jmol Viewer)

(-) Description

Title :  CGL2 IN COMPLEX WITH THOMSEN-FRIEDENREICH ANTIGEN
 
Authors :  P. J. Walser, P. W. Haebel, M. Kuenzler, U. Kues, M. Aebi, N. Ban
Date :  12 Sep 03  (Deposition) - 20 Apr 04  (Release) - 24 Feb 09  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  2.20
Chains :  Asym./Biol. Unit :  A,B,C,D
Keywords :  Galectin, Lectin, Beta-Galactoside Binding Lectin, Sugar Binding, Sugar Binding Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  P. J. Walser, P. W. Haebel, M. Kuenzler, D. Sargent, U. Kues, M. Aebi, N. Ban
Structure And Functional Analysis Of The Fungal Galectin Cgl2
Structure V. 12 689 2004
PubMed-ID: 15062091  |  Reference-DOI: 10.1016/J.STR.2004.03.002
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - GALECTIN-2
    ChainsA, B, C, D
    EngineeredYES
    Expression SystemSACCHAROMYCES CEREVISIAE
    Expression System CommonBAKER'S YEAST
    Expression System PlasmidPYADE4
    Expression System StrainSEY 6210
    Expression System Taxid4932
    Expression System Vector TypePLASMID
    GeneCGL2
    Organism ScientificCOPRINOPSIS CINEREA
    Organism Taxid5346
    SynonymCGL2

 Structural Features

(-) Chains, Units

  1234
Asymmetric/Biological Unit ABCD

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (2, 8)

Asymmetric/Biological Unit (2, 8)
No.NameCountTypeFull Name
1GAL4Ligand/IonBETA-D-GALACTOSE
2NGA4Ligand/IonN-ACETYL-D-GALACTOSAMINE

(-) Sites  (8, 8)

Asymmetric Unit (8, 8)
No.NameEvidenceResiduesDescription
1AC1SOFTWAREHIS A:51 , SER A:53 , ARG A:55 , ASN A:64 , TRP A:72 , GLU A:75 , NGA A:152BINDING SITE FOR RESIDUE GAL A 151
2AC2SOFTWAREARG A:55 , GLU A:75 , GAL A:151 , HOH A:201 , ARG C:115BINDING SITE FOR RESIDUE NGA A 152
3AC3SOFTWAREHIS B:51 , ARG B:55BINDING SITE FOR RESIDUE GAL B 151
4AC4SOFTWAREHOH B:194 , HOH B:207BINDING SITE FOR RESIDUE NGA B 152
5AC5SOFTWAREHIS C:51 , SER C:53 , ARG C:55 , ASN C:64 , TRP C:72 , GLU C:75 , NGA C:152 , HOH C:174 , HOH C:185BINDING SITE FOR RESIDUE GAL C 151
6AC6SOFTWAREARG C:55 , GLU C:75 , GAL C:151 , HOH C:220 , HOH C:222BINDING SITE FOR RESIDUE NGA C 152
7AC7SOFTWAREHIS D:51 , ARG D:55 , ASN D:64 , TRP D:72 , GLU D:75 , NGA D:152 , HOH D:210 , HOH D:212 , HOH D:214BINDING SITE FOR RESIDUE GAL D 151
8AC8SOFTWAREARG D:55 , GLU D:58 , GLU D:75 , GAL D:151 , HOH D:230BINDING SITE FOR RESIDUE NGA D 152

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 1ULG)

(-) Cis Peptide Bonds  (8, 8)

Asymmetric/Biological Unit
No.Residues
1Pro A:86 -Pro A:87
2Ser A:134 -Pro A:135
3Pro B:86 -Pro B:87
4Ser B:134 -Pro B:135
5Pro C:86 -Pro C:87
6Ser C:134 -Pro C:135
7Pro D:86 -Pro D:87
8Ser D:134 -Pro D:135

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 1ULG)

(-) PROSITE Motifs  (1, 4)

Asymmetric/Biological Unit (1, 4)
 PROSITEUniProtKBPDB
No.IDACDescriptionIDLocationCountLocation
1GALECTINPS51304 Galactoside-binding lectin (galectin) domain profile.CGL2_COPCI9-141
 
 
 
  4A:9-141
B:9-141
C:9-141
D:9-141

(-) Exons   (0, 0)

(no "Exon" information available for 1ULG)

(-) Sequences/Alignments

Asymmetric/Biological Unit
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:150
 aligned with CGL2_COPCI | Q9P4R8 from UniProtKB/Swiss-Prot  Length:150

    Alignment length:150
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150
           CGL2_COPCI     1 MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENVIVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENAAAIAYNAENSLFSSPVTVDVHGLLPPLPPA 150
               SCOP domains d1ulga_ A: Galectin-2                                                                                                                                  SCOP domains
               CATH domains 1ulgA00 A:1-150  [code=2.60.120.200, no name defined]                                                                                                  CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .eeee...eeeeeeeee....eeeee..........eeeeee.....eeeeeeee....eeeeeee.........eeee..........eeeeee...eeeee......eeee......eeeeeeee........eeeeee......... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE --------GALECTIN  PDB: A:9-141 UniProt: 9-141                                                                                                --------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 1ulg A   1 MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENVIVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENAAAIAYNAENSLFSSPVTVDVHGLLPPLPPA 150
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150

Chain B from PDB  Type:PROTEIN  Length:150
 aligned with CGL2_COPCI | Q9P4R8 from UniProtKB/Swiss-Prot  Length:150

    Alignment length:150
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150
           CGL2_COPCI     1 MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENVIVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENAAAIAYNAENSLFSSPVTVDVHGLLPPLPPA 150
               SCOP domains d1ulgb_ B: Galectin-2                                                                                                                                  SCOP domains
               CATH domains 1ulgB00 B:1-150  [code=2.60.120.200, no name defined]                                                                                                  CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .eeee...eeeeeeeee....eeeee..........eeeeee.....eeeeeeee....eeeeeee.........eeee..........eeeeee...eeeee......eeee......eeeeeeee........eeeeee......... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE --------GALECTIN  PDB: B:9-141 UniProt: 9-141                                                                                                --------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 1ulg B   1 MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENVIVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENAAAIAYNAENSLFSSPVTVDVHGLLPPLPPA 150
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150

Chain C from PDB  Type:PROTEIN  Length:150
 aligned with CGL2_COPCI | Q9P4R8 from UniProtKB/Swiss-Prot  Length:150

    Alignment length:150
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150
           CGL2_COPCI     1 MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENVIVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENAAAIAYNAENSLFSSPVTVDVHGLLPPLPPA 150
               SCOP domains d1ulgc_ C: Galectin-2                                                                                                                                  SCOP domains
               CATH domains 1ulgC00 C:1-150  [code=2.60.120.200, no name defined]                                                                                                  CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------ Pfam domains
         Sec.struct. author .eeee...eeeeeeeee....eeeee..........eeeeee.....eeeeeeeehhh.eeeeeee.........eeee..........eeeeee...eeeee......eeee......eeeeeee.........eeeeee......... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE --------GALECTIN  PDB: C:9-141 UniProt: 9-141                                                                                                --------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 1ulg C   1 MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENVIVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENAAAIAYNAENSLFSSPVTVDVHGLLPPLPPA 150
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150

Chain D from PDB  Type:PROTEIN  Length:150
 aligned with CGL2_COPCI | Q9P4R8 from UniProtKB/Swiss-Prot  Length:150

    Alignment length:150
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150
           CGL2_COPCI     1 MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENVIVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENAAAIAYNAENSLFSSPVTVDVHGLLPPLPPA 150
               SCOP domains d1ulgd_ D: Galectin-2                                                                                                                                  SCOP domains
               CATH domains 1ulgD00 D:1-150  [code=2.60.120.200, no name defined]                                                                                                  CATH domains
           Pfam domains (1) -------Gal-bind_lectin-1ulgD01 D:8-134                                                                                                ---------------- Pfam domains (1)
           Pfam domains (2) -------Gal-bind_lectin-1ulgD02 D:8-134                                                                                                ---------------- Pfam domains (2)
           Pfam domains (3) -------Gal-bind_lectin-1ulgD03 D:8-134                                                                                                ---------------- Pfam domains (3)
           Pfam domains (4) -------Gal-bind_lectin-1ulgD04 D:8-134                                                                                                ---------------- Pfam domains (4)
         Sec.struct. author .eeee...eeeeeeeee....eeeee..........eeeeee.....eeeeeeee....eeeeeee.........eeee..........eeeeee...eeeee......eeee......eeeeeeee........eeeeee......... Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------ SAPs(SNPs)
                    PROSITE --------GALECTIN  PDB: D:9-141 UniProt: 9-141                                                                                                --------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------ Transcript
                 1ulg D   1 MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENVIVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENAAAIAYNAENSLFSSPVTVDVHGLLPPLPPA 150
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (1, 4)

Asymmetric/Biological Unit

(-) CATH Domains  (1, 4)

Asymmetric/Biological Unit
(-)
Class: Mainly Beta (13760)

(-) Pfam Domains  (1, 4)

Asymmetric/Biological Unit

(-) Gene Ontology  (6, 6)

Asymmetric/Biological Unit(hide GO term definitions)
Chain A,B,C,D   (CGL2_COPCI | Q9P4R8)
molecular function
    GO:0030246    carbohydrate binding    Interacting selectively and non-covalently with any carbohydrate, which includes monosaccharides, oligosaccharides and polysaccharides as well as substances derived from monosaccharides by reduction of the carbonyl group (alditols), by oxidation of one or more hydroxy groups to afford the corresponding aldehydes, ketones, or carboxylic acids, or by replacement of one or more hydroxy group(s) by a hydrogen atom. Cyclitols are generally not regarded as carbohydrates.
cellular component
    GO:0005618    cell wall    The rigid or semi-rigid envelope lying outside the cell membrane of plant, fungal, most prokaryotic cells and some protozoan parasites, maintaining their shape and protecting them from osmotic lysis. In plants it is made of cellulose and, often, lignin; in fungi it is composed largely of polysaccharides; in bacteria it is composed of peptidoglycan; in protozoan parasites such as Giardia species, it's made of carbohydrates and proteins.
    GO:0012505    endomembrane system    A collection of membranous structures involved in transport within the cell. The main components of the endomembrane system are endoplasmic reticulum, Golgi bodies, vesicles, cell membrane and nuclear envelope. Members of the endomembrane system pass materials through each other or though the use of vesicles.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.
    GO:0005578    proteinaceous extracellular matrix    A layer consisting mainly of proteins (especially collagen) and glycosaminoglycans (mostly as proteoglycans) that forms a sheet underlying or overlying cells such as endothelial and epithelial cells. The proteins are secreted by cells in the vicinity. An example of this component is found in Mus musculus.

 Visualization

(-) Interactive Views

Asymmetric/Biological Unit
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
    GAL  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
    NGA  [ RasMol | Jena3D ]  +environment [ RasMol | Jena3D ]
 
  Sites
    AC1  [ RasMol ]  +environment [ RasMol ]
    AC2  [ RasMol ]  +environment [ RasMol ]
    AC3  [ RasMol ]  +environment [ RasMol ]
    AC4  [ RasMol ]  +environment [ RasMol ]
    AC5  [ RasMol ]  +environment [ RasMol ]
    AC6  [ RasMol ]  +environment [ RasMol ]
    AC7  [ RasMol ]  +environment [ RasMol ]
    AC8  [ RasMol ]  +environment [ RasMol ]
 
  Cis Peptide Bonds
    Pro A:86 - Pro A:87   [ RasMol ]  
    Pro B:86 - Pro B:87   [ RasMol ]  
    Pro C:86 - Pro C:87   [ RasMol ]  
    Pro D:86 - Pro D:87   [ RasMol ]  
    Ser A:134 - Pro A:135   [ RasMol ]  
    Ser B:134 - Pro B:135   [ RasMol ]  
    Ser C:134 - Pro C:135   [ RasMol ]  
    Ser D:134 - Pro D:135   [ RasMol ]  
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  1ulg
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  CGL2_COPCI | Q9P4R8
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  CGL2_COPCI | Q9P4R8
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        CGL2_COPCI | Q9P4R81ul9 1ulc 1uld 1ule 1ulf 2wkk

(-) Related Entries Specified in the PDB File

1a3k MAMMALIAN HOMOLOGUE GALECTIN-3 CARBOHYDRATE BINDING DOMAIN
1bkz MAMMALIAN HOMOLOGUE GALECTIN-7
1c1f FISH HOMOLOGUE CONGERIN I
1gan AMPHIBIAN HOMOLOGUE GALECTIN-1 WITH N-ACETYLLACTOSAMINE
1is5 FISH HOMOLOGUE CONGERIN II
1lcl MAMMALIAN CHARCOT-LEYDEN PROTEIN
1qmj AVIAN HOMOLOGUE CG-16
1sla MAMMALIAN HOMOLOGUE GALECTIN-1 WITH BIANTENNARY OLIGOSACCHARIDE
1ul9 1UL9 CONTAINS THE SAME PROTEIN WITHOUT LIGAND
1ulc 1ULC CONTAINS THE SAME PROTEIN COMPLEXED WITH LACTOSE
1uld 1ULD CONTAINS THE SAME PROTEIN COMPLEXED WITH BLOOD GROUP H TYPE II
1ule 1ULE CONTAINS THE SAME PROTEIN COMPLEXED WITH LINEAR B2 TRISACCHARIDE
1ulf 1ULF CONTAINS THE SAME PROTEIN COMPLEXED WITH BLOOD GROUP A TETRASACCHARIDE