Show PDB file:   
         Plain Text   HTML   (compressed file size)
QuickSearch:   
by PDB,NDB,UniProt,PROSITE Code or Search Term(s)  
(-)NMR Structure - manually
(-)NMR Structure - model 1
(-)NMR Structure - all models
collapse expand < >
Image NMR Structure - manually
NMR Structure - manually  (Jmol Viewer)
Image NMR Structure - model 1
NMR Structure - model 1  (Jmol Viewer)
Image NMR Structure - all models
NMR Structure - all models  (Jmol Viewer)

(-) Description

Title :  SOLUTION STRUCTURE OF THE SEA DOMAIN OF HUMAN MUCIN 1 (MUC1)
 
Authors :  B. Macao, D. G. A. Johansson, G. C. Hansson, T. Hard
Date :  19 Jul 05  (Deposition) - 17 Jan 06  (Release) - 24 Feb 09  (Revision)
Method :  SOLUTION NMR
Resolution :  NOT APPLICABLE
Chains :  NMR Structure  :  A,B  (15x)
Keywords :  Auto-Catalytic Proteolysis, Structural Protein (Keyword Search: [Gene Ontology, PubMed, Web (Google))
 
Reference :  B. Macao, D. G. A. Johansson, G. C. Hansson, T. Hard
Autoproteolysis Coupled To Protein Folding In The Sea Domain Of The Membrane-Bound Muc1 Mucin
Nat. Struct. Mol. Biol. V. 13 71 2006
PubMed-ID: 16369486  |  Reference-DOI: 10.1038/NSMB1035
(for further references see the PDB file header)

(-) Compounds

Molecule 1 - MUCIN-1
    ChainsA
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET3A
    Expression System StrainROSETTA PLYSS
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentSEA DOMAIN (RESIDUES 1041-1097)
    GeneMUC1
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymMUC1 MUCUS PROTEIN, MUC-1, POLYMORPHIC EPITHELIAL MUCIN, PEM, PEMT, EPISIALIN, TUMOR-ASSOCIATED MUCIN, CARCINOMA-ASSOCIATED MUCIN, TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN, EMA, H23AG, PEANUT- REACTIVE URINARY MUCIN, PUM, BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3, CD227 ANTIGEN
 
Molecule 2 - MUCIN-1
    ChainsB
    EngineeredYES
    Expression SystemESCHERICHIA COLI
    Expression System PlasmidPET3A
    Expression System StrainROSETTA PLYSS
    Expression System Taxid562
    Expression System Vector TypePLASMID
    FragmentSEA DOMAIN (RESIDUES 1098-1152)
    GeneMUC1
    Organism CommonHUMAN
    Organism ScientificHOMO SAPIENS
    Organism Taxid9606
    SynonymMUC1 MUCUS PROTEIN, MUC-1, POLYMORPHIC EPITHELIAL MUCIN, PEM, PEMT, EPISIALIN, TUMOR-ASSOCIATED MUCIN, CARCINOMA-ASSOCIATED MUCIN, TUMOR-ASSOCIATED EPITHELIAL MEMBRANE ANTIGEN, EMA, H23AG, PEANUT- REACTIVE URINARY MUCIN, PUM, BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3, CD227 ANTIGEN

 Structural Features

(-) Chains, Units

  
NMR Structure (15x)

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (0, 0)

(no "Ligand,Modified Residues,Ions" information available for 2ACM)

(-) Sites  (0, 0)

(no "Site" information available for 2ACM)

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 2ACM)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 2ACM)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (2, 2)

NMR Structure (2, 2)
  dbSNPPDB
No.SourceVariant IDVariantUniProt IDStatusIDChainVariant
1UniProtVAR_019390V1117MMUC1_HUMANPolymorphism1611770BV1117M
2UniProtVAR_019391S1142NMUC1_HUMANPolymorphism11465207BS1142N

  SNP/SAP Summary Statistics (UniProtKB/Swiss-Prot)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 2ACM)

(-) Exons   (0, 0)

(no "Exon" information available for 2ACM)

(-) Sequences/Alignments

NMR Structure
   Reformat: Number of residues per line =  ('0' or empty: single-line sequence representation)
  Number of residues per labelling interval =   
  UniProt sequence: complete  aligned part    
   Show mapping: SCOP domains CATH domains Pfam domains Secondary structure (by author)
SAPs(SNPs) PROSITE motifs Exons
(details for a mapped element are shown in a popup box when the mouse pointer rests over it)
Chain A from PDB  Type:PROTEIN  Length:57
 aligned with MUC1_HUMAN | P15941 from UniProtKB/Swiss-Prot  Length:1255

    Alignment length:57
                                  1050      1060      1070      1080      1090       
          MUC1_HUMAN   1041 SFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPG 1097
               SCOP domains --------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeeee.....hhhhhh..hhhhhhhhhhhhhhhhhhh....eeeeeeeeeee. Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------- Transcript
                2acm A 1041 SFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPG 1097
                                  1050      1060      1070      1080      1090       

Chain B from PDB  Type:PROTEIN  Length:47
 aligned with MUC1_HUMAN | P15941 from UniProtKB/Swiss-Prot  Length:1255

    Alignment length:47
                                  1107      1117      1127      1137       
          MUC1_HUMAN   1098 SVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDV 1144
               SCOP domains ----------------------------------------------- SCOP domains
               CATH domains ----------------------------------------------- CATH domains
               Pfam domains ----------------------------------------------- Pfam domains
         Sec.struct. author .eeeeeeeee.....hhhhhhhhhhhhhhhhhhhhh..eeeeeee.. Sec.struct. author
                 SAPs(SNPs) -------------------M------------------------N-- SAPs(SNPs)
                    PROSITE ----------------------------------------------- PROSITE
                 Transcript ----------------------------------------------- Transcript
                2acm B 1098 SVVVQLTLAFREGTINVHDVETQFNQYKTEAASRYNLTISDVSVSDV 1144
                                  1107      1117      1127      1137       

   Legend:   → Mismatch (orange background)
  - → Gap (green background, '-', border residues have a numbering label)
    → Modified Residue (blue background, lower-case, 'x' indicates undefined single-letter code, labelled with number + name)
  x → Chemical Group (purple background, 'x', labelled with number + name, e.g. ACE or NH2)
  extra numbering lines below/above indicate numbering irregularities and modified residue names etc., number ends below/above '|'

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 2ACM)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 2ACM)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 2ACM)

(-) Gene Ontology  (26, 26)

NMR Structure(hide GO term definitions)
Chain A,B   (MUC1_HUMAN | P15941)
molecular function
    GO:0000978    RNA polymerase II core promoter proximal region sequence-specific DNA binding    Interacting selectively and non-covalently with a sequence of DNA that is in cis with and relatively close to a core promoter for RNA polymerase II.
    GO:0002039    p53 binding    Interacting selectively and non-covalently with one of the p53 family of proteins.
    GO:0005515    protein binding    Interacting selectively and non-covalently with any protein or protein complex (a complex of two or more proteins that may include other nonprotein molecules).
    GO:0003712    transcription cofactor activity    Interacting selectively and non-covalently with a regulatory transcription factor and also with the basal transcription machinery in order to modulate transcription. Cofactors generally do not bind the template nucleic acid, but rather mediate protein-protein interactions between regulatory transcription factors and the basal transcription machinery.
biological process
    GO:0006977    DNA damage response, signal transduction by p53 class mediator resulting in cell cycle arrest    A cascade of processes induced by the cell cycle regulator phosphoprotein p53, or an equivalent protein, in response to the detection of DNA damage and resulting in the stopping or reduction in rate of the cell cycle.
    GO:0006978    DNA damage response, signal transduction by p53 class mediator resulting in transcription of p21 class mediator    A cascade of processes induced by the cell cycle regulator phosphoprotein p53, or an equivalent protein, resulting in the induction of the transcription of p21 (also known as WAF1, CIP1 and SDI1) or any equivalent protein, in response to the detection of DNA damage.
    GO:0016266    O-glycan processing    The stepwise addition of carbohydrate or carbohydrate derivative residues to the initially added O-linked residue (usually GalNAc) to form a core O-glycan structure.
    GO:0033629    negative regulation of cell adhesion mediated by integrin    Any process that stops, prevents, or reduces the frequency, rate, or extent of cell adhesion mediated by integrin.
    GO:1902166    negative regulation of intrinsic apoptotic signaling pathway in response to DNA damage by p53 class mediator    Any process that stops, prevents or reduces the frequency, rate or extent of intrinsic apoptotic signaling pathway in response to DNA damage by p53 class mediator.
    GO:0010944    negative regulation of transcription by competitive promoter binding    Any process that stops, prevents, or reduces the frequency, rate or extent of DNA-dependent transcription using a mechanism that involves direct competition for interaction with a promoter binding site.
    GO:0090240    positive regulation of histone H4 acetylation    Any process that increases the rate, frequency, or extent of histone H4 acetylation, the modification of histone H4 by the addition of an acetyl group.
    GO:0036003    positive regulation of transcription from RNA polymerase II promoter in response to stress    Any process that increases the frequency, rate or extent of transcription from an RNA polymerase II promoter as a result of a stimulus indicating the organism is under stress. The stress is usually, but not necessarily, exogenous (e.g. temperature, humidity, ionizing radiation).
    GO:0043618    regulation of transcription from RNA polymerase II promoter in response to stress    Any process that modulates the frequency, rate or extent of transcription from an RNA polymerase II promoter as a result of a stimulus indicating the organism is under stress. The stress is usually, but not necessarily, exogenous (e.g. temperature, humidity, ionizing radiation).
cellular component
    GO:0005796    Golgi lumen    The volume enclosed by the membranes of any cisterna or subcompartment of the Golgi apparatus, including the cis- and trans-Golgi networks.
    GO:0016324    apical plasma membrane    The region of the plasma membrane located at the apical end of the cell.
    GO:0005737    cytoplasm    All of the contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
    GO:0070062    extracellular exosome    A vesicle that is released into the extracellular region by fusion of the limiting endosomal membrane of a multivesicular body with the plasma membrane. Extracellular exosomes, also simply called exosomes, have a diameter of about 40-100 nm.
    GO:0005576    extracellular region    The space external to the outermost structure of a cell. For cells without external protective or external encapsulating structures this refers to space outside of the plasma membrane. This term covers the host cell environment outside an intracellular parasite.
    GO:0005615    extracellular space    That part of a multicellular organism outside the cells proper, usually taken to be outside the plasma membranes, and occupied by fluid.
    GO:0016021    integral component of membrane    The component of a membrane consisting of the gene products and protein complexes having at least some part of their peptide sequence embedded in the hydrophobic region of the membrane.
    GO:0005887    integral component of plasma membrane    The component of the plasma membrane consisting of the gene products and protein complexes having at least some part of their peptide sequence embedded in the hydrophobic region of the membrane.
    GO:0016020    membrane    A lipid bilayer along with all the proteins and protein complexes embedded in it an attached to it.
    GO:0000790    nuclear chromatin    The ordered and organized complex of DNA, protein, and sometimes RNA, that forms the chromosome in the nucleus.
    GO:0005634    nucleus    A membrane-bounded organelle of eukaryotic cells in which chromosomes are housed and replicated. In most cells, the nucleus contains all of the cell's chromosomes except the organellar chromosomes, and is the site of RNA synthesis and processing. In some species, or in specialized cell types, RNA metabolism or DNA replication may be absent.
    GO:0005886    plasma membrane    The membrane surrounding a cell that separates the cell from its external environment. It consists of a phospholipid bilayer and associated proteins.
    GO:0031982    vesicle    Any small, fluid-filled, spherical organelle enclosed by membrane.

 Visualization

(-) Interactive Views

NMR Structure
  Complete Structure
    Jena3D(integrated viewing of ligand, site, SAP, PROSITE, SCOP information)
    WebMol | AstexViewer[tm]@PDBe
(Java Applets, require no local installation except for Java; loading may be slow)
    STRAP
(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    RasMol
(require local installation)
    Molscript (VRML)
(requires installation of a VRML viewer; select preferred view via VRML and generate a mono or stereo PDF format file)
 
  Ligands, Modified Residues, Ions
(no "Ligands, Modified Residues, Ions" information available for 2acm)
 
  Sites
(no "Sites" information available for 2acm)
 
  Cis Peptide Bonds
(no "Cis Peptide Bonds" information available for 2acm)
 

(-) Still Images

Jmol
  protein: cartoon or spacefill or dots and stick; nucleic acid: cartoon and stick; ligands: spacefill; active site: stick
Molscript
  protein, nucleic acid: cartoon; ligands: spacefill; active site: ball and stick

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  2acm
    Family and Domain InformationProDom | SYSTERS
    General Structural InformationGlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in MembranesOPM
    Protein SurfaceSURFACE
    Secondary StructureDSSP (structure derived) | HSSP (homology derived)
    Structural GenomicsGeneCensus
    Structural NeighboursCE | VAST
    Structure ClassificationCATH | Dali | SCOP
    Validation and Original DataBMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  MUC1_HUMAN | P15941
    Comparative Protein Structure ModelsModBase
    Genomic InformationEnsembl
    Protein-protein InteractionDIP
    Sequence, Family and Domain InformationInterPro | Pfam | SMART | UniProtKB/SwissProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme InformationBRENDA | EC-PDB | Enzyme | IntEnz
    PathwayKEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease InformationOMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related InformationGenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain InformationXDom
    Interatomic Contacts of Structural UnitsCSU
    Ligand-protein ContactsLPC
    Protein CavitiescastP
    Sequence and Secondary StructurePDBCartoon
    Structure AlignmentSTRAP(Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence BrowserSTING
 
Access by UniProt ID/Accession number
  MUC1_HUMAN | P15941
    Protein Disorder PredictionDisEMBL | FoldIndex | GLOBPLOT (for more information see DisProt)

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

UniProtKB/Swiss-Prot
        MUC1_HUMAN | P159411sm3 2fo4 5t6p 5t78

(-) Related Entries Specified in the PDB File

(no "Related Entries Specified in the PDB File" available for 2ACM)