(-) Description

Title :  CRYSTAL STRUCTURE OF DE NOVO DESIGNED HELICAL REPEAT PROTEIN DHR8
 
Authors :  G. Bhabha, D. C. Ekiert
Date :  28 Jul 15  (Deposition) - 16 Dec 15  (Release) - 06 Jan 16  (Revision)
Method :  X-RAY DIFFRACTION
Resolution :  1.80 Å
Chains :  Asym. Unit :  A,B,C,D
Biol. Unit 1:  A  (1x)
Biol. Unit 2:  B  (1x)
Biol. Unit 3:  C  (1x)
Biol. Unit 4:  D  (1x)
Keywords :  Helical Repeat Protein, De Novo Protein
 
Reference :  T. J. Brunette, F. Parmeggiani, P. S. Huang, G. Bhabha, D. C. Ekiert, S. E. Tsutakawa, G. L. Hura, J. A. Tainer, D. Baker
Exploring The Repeat Protein Universe Through Computational Protein Design.
Nature, vol. 528, p. 580 (2015)
PubMed-ID: 26675729  |  Reference-DOI: 10.1038/NATURE16162

(-) Compounds

Molecule 1 - DESIGNED HELICAL REPEAT PROTEIN
    Chains: A, B, C, D
    Engineered: YES
    Expression System: ESCHERICHIA COLI
    Expression System Plasmid: PET21_NESG
    Expression System Strain: BL21(DE3)
    Expression System Taxid: 469008
    Expression System Vector Type: PLASMID
    Organism Scientific: SYNTHETIC CONSTRUCT
    Organism Taxid: 32630

 Structural Features

(-) Chains, Units

Unit | 1 | 2 | 3 | 4
Asymmetric Unit | A | B | C | D
Biological Unit 1 (1x) | A | - | - | -
Biological Unit 2 (1x) | - | B | - | -
Biological Unit 3 (1x) | - | - | C | -
Biological Unit 4 (1x) | - | - | - | D

Summary Information (see also Sequences/Alignments below)

(-) Ligands, Modified Residues, Ions  (1, 32)

Asymmetric Unit (1, 32)
No. | Name | Count | Type | Full Name
1 | CA | 32 | Ligand/Ion | CALCIUM ION
Biological Unit 1 (0, 0)
No. | Name | Count | Type | Full Name
1 | CA | -1 | Ligand/Ion | CALCIUM ION
Biological Unit 2 (0, 0)
No. | Name | Count | Type | Full Name
1 | CA | -1 | Ligand/Ion | CALCIUM ION
Biological Unit 3 (0, 0)
No. | Name | Count | Type | Full Name
1 | CA | -1 | Ligand/Ion | CALCIUM ION
Biological Unit 4 (0, 0)
No. | Name | Count | Type | Full Name
1 | CA | -1 | Ligand/Ion | CALCIUM ION

(-) Sites  (32, 32)

Asymmetric Unit (32, 32)
No. | Name | Evidence | Residues | Description
01 | AC1 | SOFTWARE | ASP A:25, GLU A:26, GLU B:161, GLU B:165, HOH B:348, HOH B:355 | binding site for residue CA A 201
02 | AC2 | SOFTWARE | HOH A:302, HOH A:357, ASP D:24, ASP D:70, HOH D:370 | binding site for residue CA A 202
03 | AC3 | SOFTWARE | ARG A:88, HOH A:337, HOH A:340, HOH A:360, ARG B:88, HOH B:321, HOH B:326 | binding site for residue CA A 203
04 | AC4 | SOFTWARE | ASP A:24, ASP A:70, HOH A:377, HOH A:386, ASN D:22, HOH D:303, HOH D:325 | binding site for residue CA A 204
05 | AC5 | SOFTWARE | GLU A:172, GLU A:175, GLU B:82, GLU B:85 | binding site for residue CA A 205
06 | AC6 | SOFTWARE | ASP A:160, GLU A:161, HOH A:391, ASN C:113, HOH C:321 | binding site for residue CA A 206
07 | AC7 | SOFTWARE | GLU A:49, HOH A:365, ASP B:138 | binding site for residue CA A 207
08 | AC8 | SOFTWARE | ASN A:67, ASN A:113, HOH A:381 | binding site for residue CA A 208
09 | AC9 | SOFTWARE | GLU A:82, GLU A:85, HOH A:318, GLU B:172, GLU B:175 | binding site for residue CA B 201
10 | AD1 | SOFTWARE | ASN A:23, ASP A:25, HOH A:328, HOH A:345, HOH A:361, HOH A:373, GLU B:161 | binding site for residue CA B 202
11 | AD2 | SOFTWARE | GLU A:127, GLU A:130, HOH A:317, GLU B:127, GLU B:130 | binding site for residue CA B 203
12 | AD3 | SOFTWARE | GLU B:71, CA B:207, CA B:208, CA B:209, ASP C:115, HOH C:311 | binding site for residue CA B 204
13 | AD4 | SOFTWARE | ASP B:115, CA B:211, HOH B:314, GLU C:71, CA C:206, CA C:208, HOH D:313 | binding site for residue CA B 205
14 | AD5 | SOFTWARE | GLU B:175, GLU B:179 | binding site for residue CA B 206
15 | AD6 | SOFTWARE | ASP B:69, GLU B:71, CA B:204, CA B:208, CA B:210, HOH B:330, ASP C:69, ASP C:115, CA C:205 | binding site for residue CA B 207
16 | AD7 | SOFTWARE | GLU A:116, GLU B:71, CA B:204, CA B:207, HOH B:330, ASP C:115, CA C:205 | binding site for residue CA B 208
17 | AD8 | SOFTWARE | GLU A:161, HOH A:355, GLU B:71, CA B:204, ASN C:68, HOH C:311 | binding site for residue CA B 209
18 | AD9 | SOFTWARE | ASP B:69, ASP B:115, CA B:207, CA B:211, ASP C:69, CA C:205, CA C:206, HOH C:353 | binding site for residue CA B 210
19 | AE1 | SOFTWARE | ASP B:115, CA B:205, CA B:210, GLU C:71, CA C:206, HOH C:353, GLU D:116, HOH D:352 | binding site for residue CA B 211
20 | AE2 | SOFTWARE | GLU C:161, ASN D:23, ASP D:25, HOH D:306, HOH D:326, HOH D:338, HOH D:368 | binding site for residue CA C 201
21 | AE3 | SOFTWARE | GLU C:172, GLU C:175, GLU D:82, GLU D:85, ARG D:133, HOH D:321 | binding site for residue CA C 202
22 | AE4 | SOFTWARE | GLU C:82, GLU C:85, GLU D:172, GLU D:175 | binding site for residue CA C 203
23 | AE5 | SOFTWARE | ARG C:88, HOH C:317, ARG D:88, HOH D:323 | binding site for residue CA C 204
24 | AE6 | SOFTWARE | ASP B:69, CA B:207, CA B:208, CA B:210, HOH B:330, ASP C:69, ASP C:115 | binding site for residue CA C 205
25 | AE7 | SOFTWARE | ASP B:69, ASP B:115, CA B:205, CA B:210, CA B:211, ASP C:69, GLU C:71, HOH C:353 | binding site for residue CA C 206
26 | AE8 | SOFTWARE | GLU C:175, GLU C:179, HOH C:335, HOH C:356, ARG D:133 | binding site for residue CA C 207
27 | AE9 | SOFTWARE | ASN B:68, CA B:205, HOH B:314, HOH B:319, GLU C:71, GLU D:161, HOH D:313, HOH D:382 | binding site for residue CA C 208
28 | AF1 | SOFTWARE | GLU C:161, GLU C:165, HOH C:340, HOH C:348, ASP D:25, GLU D:26 | binding site for residue CA D 201
29 | AF2 | SOFTWARE | GLU C:127, GLU C:130, GLU D:127, GLU D:130, HOH D:324 | binding site for residue CA D 202
30 | AF3 | SOFTWARE | GLU C:130, GLU D:130, HOH D:312 | binding site for residue CA D 203
31 | AF4 | SOFTWARE | ASN B:113, HOH B:334, ASP D:160, GLU D:161, HOH D:373, HOH D:382 | binding site for residue CA D 204
32 | AF5 | SOFTWARE | ASN D:67, ASN D:113 | binding site for residue CA D 205
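
The residue lists in the Sites listing above follow a regular "NAME CHAIN:NUMBER" pattern, so they can be split into structured records. The following is a minimal Python sketch, not part of the original entry: the names ROWS, parse_residues and summarize are hypothetical, and only three rows are transcribed here for illustration.

```python
import re
from collections import Counter

# Three rows transcribed from the Sites listing: (site name, residues, ligand).
# Illustrative subset only; the full listing has 32 sites.
ROWS = [
    ("AC1", "ASP A:25, GLU A:26, GLU B:161, GLU B:165, HOH B:348, HOH B:355", "CA A 201"),
    ("AC5", "GLU A:172, GLU A:175, GLU B:82, GLU B:85", "CA A 205"),
    ("AF5", "ASN D:67, ASN D:113", "CA D 205"),
]

# One residue token: residue name, chain identifier, residue number.
RESIDUE = re.compile(r"^([A-Z0-9]{1,3}) ([A-Za-z0-9]):(-?\d+)$")


def parse_residues(field):
    """Split 'ASP A:25, GLU A:26' into (name, chain, number) tuples."""
    residues = []
    for token in field.split(","):
        token = token.strip()
        match = RESIDUE.match(token)
        if match is None:
            raise ValueError(f"unrecognised residue token: {token!r}")
        residues.append((match.group(1), match.group(2), int(match.group(3))))
    return residues


def summarize(rows):
    """For each site, count residues by name and list the chains involved."""
    summary = {}
    for name, field, ligand in rows:
        residues = parse_residues(field)
        summary[name] = {
            "ligand": ligand,
            "by_name": Counter(r[0] for r in residues),
            "chains": sorted({r[1] for r in residues}),
        }
    return summary


if __name__ == "__main__":
    for site, info in summarize(ROWS).items():
        print(site, info["ligand"], dict(info["by_name"]), info["chains"])
```

Sites whose "chains" list has more than one entry are those where the calcium ion is contacted by residues from two or more chains of the asymmetric unit, which is the case for AC1 and AC5 in this subset.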

(-) SS Bonds  (0, 0)

(no "SS Bond" information available for 5CWF)

(-) Cis Peptide Bonds  (0, 0)

(no "Cis Peptide Bond" information available for 5CWF)

 Sequence-Structure Mapping

(-) SAPs(SNPs)/Variants  (0, 0)

(no "SAP(SNP)/Variant" information available for 5CWF)

(-) PROSITE Motifs  (0, 0)

(no "PROSITE Motif" information available for 5CWF)

(-) Exons   (0, 0)

(no "Exon" information available for 5CWF)

(-) Sequences/Alignments

Asymmetric Unit
Chain A from PDB  Type:PROTEIN  Length:177
                                                                                                                                                                                                                 
               SCOP domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhh Sec.struct. author
                 SAPs(SNPs) --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5cwf A   1 MSDEMKKVMEALKKAVELAKKNNDDEVAREIERAAKEIVEALRENNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRENNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRENNSDEMAKKMLELAKRVLDAAKNNDDETAREIARQAAEEVEAD 177
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170       
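
The repeat architecture named in the title can be checked directly on the chain A sequence above. The sketch below is an illustration only, not a method from the entry or the cited paper: it measures self-identity of the sequence at a given offset, and the function names identity_at_offset and best_period are hypothetical.

```python
# Chain A sequence, copied from the "5cwf A" line above (177 residues).
CHAIN_A = (
    "MSDEMKKVMEALKKAVELAKKNNDDEVAREIERAAKEIVEALRE"
    "NNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRE"
    "NNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRE"
    "NNSDEMAKKMLELAKRVLDAAKNNDDETAREIARQAAEEVEAD"
)


def identity_at_offset(seq, offset):
    """Fraction of positions i for which seq[i] equals seq[i + offset]."""
    pairs = len(seq) - offset
    if offset <= 0 or pairs <= 0:
        raise ValueError("offset must be positive and shorter than the sequence")
    matches = sum(seq[i] == seq[i + offset] for i in range(pairs))
    return matches / pairs


def best_period(seq, lo, hi):
    """Offset in [lo, hi] with the highest self-identity."""
    return max(range(lo, hi + 1), key=lambda k: identity_at_offset(seq, k))


if __name__ == "__main__":
    period = best_period(CHAIN_A, 30, 60)
    print(period, round(identity_at_offset(CHAIN_A, period), 2))
    # Residues 45-89 and 90-134 (1-based) are the two internal repeat copies.
    print(CHAIN_A[44:89] == CHAIN_A[89:134])
```

In this sequence the two internal copies, residues 45-89 and 90-134, are identical, while the first and last copies differ from them at a handful of positions.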

Chain B from PDB  Type:PROTEIN  Length:176
                                                                                                                                                                                                                
               SCOP domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .hhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhh..hhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh. Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5cwf B   3 DEMKKVMEALKKAVELAKKDDEVAREIERAAKEIVEALRENNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRENNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRENNSDEMAKKMLELAKRVLDAAKNNDDETAREIARQAAEEVEADREN 180
                                    12        24        34        44        54        64        74        84        94       104       114       124       134       144       154       164       174      
                                             21|                                                                                                                                                            
                                              24                                                                                                                                                            

Chain C from PDB  Type:PROTEIN  Length:175
                                                                                                                                                                                                               
               SCOP domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .hhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhh..hhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh Sec.struct. author
                 SAPs(SNPs) ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5cwf C   3 DEMKKVMEALKKAVELAKKDDEVAREIERAAKEIVEALRENNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRENNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRENNSDEMAKKMLELAKRVLDAAKNNDDETAREIARQAAEEVEADRE 179
                                    12        24        34        44        54        64        74        84        94       104       114       124       134       144       154       164       174     
                                             21|                                                                                                                                                           
                                              24                                                                                                                                                           

Chain D from PDB  Type:PROTEIN  Length:176
                                                                                                                                                                                                                
               SCOP domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SCOP domains
               CATH domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CATH domains
               Pfam domains -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Pfam domains
         Sec.struct. author .hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhhhhh.hhhhhhhhhhhhhhhhhh Sec.struct. author
                 SAPs(SNPs) -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SAPs(SNPs)
                    PROSITE -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PROSITE
                 Transcript -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Transcript
                 5cwf D   1 MSDEMKKVMEALKKAVELAKKNNDDEVAREIERAAKEIVEALRENNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRENNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRENNSDEMAKKMLELAKRVLDAAKNNDDETAREIARQAAEEVEA 176
                                    10        20        30        40        50        60        70        80        90       100       110       120       130       140       150       160       170      

   Legend:
  Mismatch: orange background
  Gap: '-' on green background; border residues have a numbering label
  Modified Residue: lower-case letter on blue background; 'x' indicates an undefined single-letter code; labelled with number + name
  Chemical Group: 'x' on purple background; labelled with number + name, e.g. ACE or NH2
  Extra numbering lines below/above the sequence indicate numbering irregularities, modified residue names etc.; the number ends below/above '|'
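
The extra numbering lines under chains B and C mark a jump from residue 21 to residue 24, so a position in the sequence string is not the same as the residue number used in the Sites listing. The following Python sketch shows one way to rebuild the numbering; number_residues is a hypothetical helper, and the start value 3 and the skipped numbers 22 and 23 are read off the "5cwf B   3" label and the "21|" / "24" lines above.

```python
# Chain B sequence, copied from the "5cwf B" line above (176 residues).
CHAIN_B = (
    "DEMKKVMEALKKAVELAKK"
    "DDEVAREIERAAKEIVEALRE"
    "NNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRE"
    "NNSDEMAKVMLALAKAVLLAAKNNDDEVAREIARAAAEIVEALRE"
    "NNSDEMAKKMLELAKRVLDAAKNNDDETAREIARQAAEEVEADREN"
)


def number_residues(sequence, start, skipped=()):
    """Map residue numbers to one-letter codes.

    start   -- number of the first residue in the listing
    skipped -- residue numbers that do not appear in the chain
    """
    skipped = set(skipped)
    numbering = {}
    number = start
    for letter in sequence:
        while number in skipped:
            number += 1  # step over numbers missing from the listing
        numbering[number] = letter
        number += 1
    return numbering


if __name__ == "__main__":
    chain_b = number_residues(CHAIN_B, start=3, skipped=(22, 23))
    # Cross-check against residues named in the Sites listing.
    for name, number in [("E", 161), ("D", 138), ("E", 179), ("R", 88)]:
        print(number, chain_b[number], chain_b[number] == name)
```

With this mapping the last residue of chain B is numbered 180, matching the label at the end of the "5cwf B" line, and residues such as GLU B:161 and ASP B:138 from the Sites listing resolve to E and D respectively.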

 Classification and Annotation

(-) SCOP Domains  (0, 0)

(no "SCOP Domain" information available for 5CWF)

(-) CATH Domains  (0, 0)

(no "CATH Domain" information available for 5CWF)

(-) Pfam Domains  (0, 0)

(no "Pfam Domain" information available for 5CWF)

(-) Gene Ontology  (0, 0)

Asymmetric Unit
    (no "Gene Ontology" information available for 5CWF)

 Databases and Analysis Tools

(-) Databases

Access by PDB/NDB ID
  5cwf
    Family and Domain Information: ProDom | SYSTERS
    General Structural Information: GlycoscienceDB | MMDB | NDB | OCA | PDB | PDBe | PDBj | PDBsum | PDBWiki | PQS | PROTEOPEDIA
    Orientation in Membranes: OPM
    Protein Surface: SURFACE
    Secondary Structure: DSSP (structure derived) | HSSP (homology derived)
    Structural Genomics: GeneCensus
    Structural Neighbours: CE | VAST
    Structure Classification: CATH | Dali | SCOP
    Validation and Original Data: BMRB Data View | BMRB Restraints Grid | EDS | PROCHECK | RECOORD | WHAT_CHECK
 
Access by UniProt ID/Accession number
  (no 'UniProt ID/Accession number' available)
    Comparative Protein Structure Models: ModBase
    Genomic Information: Ensembl
    Protein-protein Interaction: DIP
    Sequence, Family and Domain Information: InterPro | Pfam | SMART | UniProt
 
Access by Enzyme Classificator   (EC Number)
  (no 'Enzyme Classificator' available)
    General Enzyme Information: BRENDA | EC-PDB | Enzyme | IntEnz
    Pathway: KEGG | MetaCyc
 
Access by Disease Identifier   (MIM ID)
  (no 'MIM ID' available)
    Disease Information: OMIM
 
Access by GenAge ID
  (no 'GenAge ID' available)
    Age Related Information: GenAge

(-) Analysis Tools

Access by PDB/NDB ID
    Domain Information: XDom
    Interatomic Contacts of Structural Units: CSU
    Ligand-protein Contacts: LPC
    Protein Cavities: castP
    Sequence and Secondary Structure: PDBCartoon
    Structure Alignment: STRAP (Java WebStart application, automatic local installation, requires Java; full application with system access!)
    Structure and Sequence Browser: STING
 
Access by UniProt ID/Accession number
  (no 'UniProt ID/Accession number' available)
    Protein Disorder Prediction: DisEMBL | FoldIndex | GLOBPLOT

 Related Entries

(-) Entries Sharing at Least One Protein Chain (UniProt ID)

(no "Entries Sharing at Least One Protein Chain" available for 5CWF)

(-) Related Entries Specified in the PDB File

5cwb 5cwc 5cwd 5cwg 5cwh 5cwi 5cwj 5cwk 5cwl 5cwm 5cwn 5cwo 5cwp 5cwq