AU752168B2 - Peptides causing formation of compact structures - Google Patents
Peptides causing formation of compact structures
- Publication number
- AU752168B2 (also listed as AU34693/99A, AU3469399A)
- Authority
- AU
- Australia
- Prior art keywords
- protein
- peptide
- proteins
- sequence
- cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims description 348
- 102000004196 processed proteins & peptides Human genes 0.000 title description 144
- 230000015572 biosynthetic process Effects 0.000 title description 17
- 108090000623 proteins and genes Proteins 0.000 claims description 384
- 102000004169 proteins and genes Human genes 0.000 claims description 349
- 150000007523 nucleic acids Chemical class 0.000 claims description 88
- 102000039446 nucleic acids Human genes 0.000 claims description 80
- 108020004707 nucleic acids Proteins 0.000 claims description 80
- 238000000034 method Methods 0.000 claims description 73
- 238000006471 dimerization reaction Methods 0.000 claims description 72
- 150000001413 amino acids Chemical class 0.000 claims description 62
- 230000004927 fusion Effects 0.000 claims description 38
- 239000000203 mixture Substances 0.000 claims description 35
- 102000037865 fusion proteins Human genes 0.000 claims description 33
- 108020001507 fusion proteins Proteins 0.000 claims description 33
- 238000012216 screening Methods 0.000 claims description 21
- 239000013604 expression vector Substances 0.000 claims description 16
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 claims description 14
- 239000003814 drug Substances 0.000 claims description 5
- 229910052740 iodine Inorganic materials 0.000 claims description 5
- 229910052700 potassium Inorganic materials 0.000 claims description 5
- 230000001747 exhibiting effect Effects 0.000 claims description 4
- 101150066002 GFP gene Proteins 0.000 claims 1
- 235000018102 proteins Nutrition 0.000 description 319
- 210000004027 cell Anatomy 0.000 description 160
- 238000012360 testing method Methods 0.000 description 70
- 235000001014 amino acid Nutrition 0.000 description 60
- 229940024606 amino acid Drugs 0.000 description 60
- 230000027455 binding Effects 0.000 description 55
- 239000013598 vector Substances 0.000 description 41
- 230000014509 gene expression Effects 0.000 description 32
- 239000000539 dimer Substances 0.000 description 30
- 230000000694 effects Effects 0.000 description 30
- 108010067372 Pancreatic elastase Proteins 0.000 description 26
- 102000016387 Pancreatic elastase Human genes 0.000 description 26
- 229920001184 polypeptide Polymers 0.000 description 24
- 238000001142 circular dichroism spectrum Methods 0.000 description 21
- 230000001413 cellular effect Effects 0.000 description 20
- 239000012528 membrane Substances 0.000 description 20
- 210000004379 membrane Anatomy 0.000 description 20
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 19
- 230000006870 function Effects 0.000 description 19
- 230000001177 retroviral effect Effects 0.000 description 18
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 17
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 17
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 17
- 239000002299 complementary DNA Substances 0.000 description 17
- 239000003112 inhibitor Substances 0.000 description 16
- 239000000758 substrate Substances 0.000 description 16
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 15
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 15
- 108010076504 Protein Sorting Signals Proteins 0.000 description 15
- 230000001965 increasing effect Effects 0.000 description 15
- 108020004999 messenger RNA Proteins 0.000 description 15
- 238000001228 spectrum Methods 0.000 description 15
- YZCKVEUIGOORGS-OUBTZVSYSA-N Deuterium Chemical group [2H] YZCKVEUIGOORGS-OUBTZVSYSA-N 0.000 description 14
- 125000004122 cyclic group Chemical group 0.000 description 14
- 229910052805 deuterium Inorganic materials 0.000 description 14
- 239000012634 fragment Substances 0.000 description 14
- 125000000539 amino acid group Chemical group 0.000 description 13
- 238000003776 cleavage reaction Methods 0.000 description 13
- 238000004949 mass spectrometry Methods 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 230000007017 scission Effects 0.000 description 13
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 13
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 12
- 239000004471 Glycine Substances 0.000 description 12
- 241000700605 Viruses Species 0.000 description 12
- 238000007792 addition Methods 0.000 description 12
- 238000003556 assay Methods 0.000 description 12
- 150000002500 ions Chemical class 0.000 description 12
- 238000002955 isolation Methods 0.000 description 12
- 230000004048 modification Effects 0.000 description 12
- 238000012986 modification Methods 0.000 description 12
- 230000017854 proteolysis Effects 0.000 description 12
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 11
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 11
- 238000005859 coupling reaction Methods 0.000 description 11
- 235000018417 cysteine Nutrition 0.000 description 11
- 230000002209 hydrophobic effect Effects 0.000 description 11
- 230000005764 inhibitory process Effects 0.000 description 11
- 238000005259 measurement Methods 0.000 description 11
- 230000001105 regulatory effect Effects 0.000 description 11
- 108010069514 Cyclic Peptides Proteins 0.000 description 10
- 102000001189 Cyclic Peptides Human genes 0.000 description 10
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 10
- 150000001875 compounds Chemical class 0.000 description 10
- 238000010168 coupling process Methods 0.000 description 10
- 238000002474 experimental method Methods 0.000 description 10
- 238000000338 in vitro Methods 0.000 description 10
- 230000003834 intracellular effect Effects 0.000 description 10
- 230000004807 localization Effects 0.000 description 10
- 235000013930 proline Nutrition 0.000 description 10
- 230000003248 secreting effect Effects 0.000 description 10
- 230000008685 targeting Effects 0.000 description 10
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 9
- 239000012867 bioactive agent Substances 0.000 description 9
- 210000004899 c-terminal region Anatomy 0.000 description 9
- 230000008878 coupling Effects 0.000 description 9
- 230000003993 interaction Effects 0.000 description 9
- 235000018977 lysine Nutrition 0.000 description 9
- 210000004962 mammalian cell Anatomy 0.000 description 9
- 125000001500 prolyl group Chemical class [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 9
- 230000019491 signal transduction Effects 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- 230000002103 transcriptional effect Effects 0.000 description 9
- OBMZMSLWNNWEJA-XNCRXQDQSA-N C1=CC=2C(C[C@@H]3NC(=O)[C@@H](NC(=O)[C@H](NC(=O)N(CC#CCN(CCCC[C@H](NC(=O)[C@@H](CC4=CC=CC=C4)NC3=O)C(=O)N)CC=C)NC(=O)[C@@H](N)C)CC3=CNC4=C3C=CC=C4)C)=CNC=2C=C1 Chemical compound C1=CC=2C(C[C@@H]3NC(=O)[C@@H](NC(=O)[C@H](NC(=O)N(CC#CCN(CCCC[C@H](NC(=O)[C@@H](CC4=CC=CC=C4)NC3=O)C(=O)N)CC=C)NC(=O)[C@@H](N)C)CC3=CNC4=C3C=CC=C4)C)=CNC=2C=C1 OBMZMSLWNNWEJA-XNCRXQDQSA-N 0.000 description 8
- 101710176384 Peptide 1 Proteins 0.000 description 8
- 230000008859 change Effects 0.000 description 8
- 239000003795 chemical substances by application Substances 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 238000003786 synthesis reaction Methods 0.000 description 8
- QXXBUXBKXUHVQH-FMTGAZOMSA-N (2s)-2-[[(2s)-2-[[(2s,3s)-2-[[(2s)-2-[[(2s)-6-amino-2-[[(2s)-3-hydroxy-2-[[2-[[2-[[(2s)-1-[(2s)-1-[(2s)-5-oxopyrrolidine-2-carbonyl]pyrrolidine-2-carbonyl]pyrrolidine-2-carbonyl]amino]acetyl]amino]acetyl]amino]propanoyl]amino]hexanoyl]amino]-3-methylbutan Chemical compound C([C@H]1C(=O)N2CCC[C@H]2C(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=2C=CC=CC=2)C(O)=O)C(C)C)CCN1C(=O)[C@@H]1CCC(=O)N1 QXXBUXBKXUHVQH-FMTGAZOMSA-N 0.000 description 7
- 102100031521 Morphogenetic neuropeptide Human genes 0.000 description 7
- 101710105851 Morphogenetic neuropeptide Proteins 0.000 description 7
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 7
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 7
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 7
- 125000003275 alpha amino acid group Chemical group 0.000 description 7
- 150000001408 amides Chemical group 0.000 description 7
- 238000004873 anchoring Methods 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 238000002983 circular dichroism Methods 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 150000001945 cysteines Chemical class 0.000 description 7
- 230000001086 cytosolic effect Effects 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 125000000291 glutamic acid group Chemical class N[C@@H](CCC(O)=O)C(=O)* 0.000 description 7
- 150000002333 glycines Chemical class 0.000 description 7
- 125000003588 lysine group Chemical class [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 7
- 125000003729 nucleotide group Chemical group 0.000 description 7
- 230000036961 partial effect Effects 0.000 description 7
- 230000002829 reductive effect Effects 0.000 description 7
- 239000000523 sample Substances 0.000 description 7
- 230000028327 secretion Effects 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 230000003612 virological effect Effects 0.000 description 7
- 101000946843 Homo sapiens T-cell surface glycoprotein CD8 alpha chain Proteins 0.000 description 6
- 239000004472 Lysine Substances 0.000 description 6
- 102000018697 Membrane Proteins Human genes 0.000 description 6
- 108010052285 Membrane Proteins Proteins 0.000 description 6
- 238000005481 NMR spectroscopy Methods 0.000 description 6
- 108091034117 Oligonucleotide Proteins 0.000 description 6
- 102000035195 Peptidases Human genes 0.000 description 6
- 108091005804 Peptidases Proteins 0.000 description 6
- 239000004365 Protease Substances 0.000 description 6
- 102100034922 T-cell surface glycoprotein CD8 alpha chain Human genes 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 125000004429 atom Chemical group 0.000 description 6
- 230000015556 catabolic process Effects 0.000 description 6
- 210000000170 cell membrane Anatomy 0.000 description 6
- 230000013595 glycosylation Effects 0.000 description 6
- 238000006206 glycosylation reaction Methods 0.000 description 6
- 239000005556 hormone Substances 0.000 description 6
- 229940088597 hormone Drugs 0.000 description 6
- 229910052739 hydrogen Inorganic materials 0.000 description 6
- 230000031146 intracellular signal transduction Effects 0.000 description 6
- 239000003446 ligand Substances 0.000 description 6
- 238000002156 mixing Methods 0.000 description 6
- 239000000178 monomer Substances 0.000 description 6
- 238000005016 nuclear Overhauser enhanced spectroscopy Methods 0.000 description 6
- 239000002773 nucleotide Substances 0.000 description 6
- 210000004940 nucleus Anatomy 0.000 description 6
- 229920000642 polymer Polymers 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 238000001896 rotating frame Overhauser effect spectroscopy Methods 0.000 description 6
- 235000004400 serine Nutrition 0.000 description 6
- 239000002904 solvent Substances 0.000 description 6
- 238000001551 total correlation spectroscopy Methods 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 102000014914 Carrier Proteins Human genes 0.000 description 5
- 108090000695 Cytokines Proteins 0.000 description 5
- 102000004127 Cytokines Human genes 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 102100037872 Intercellular adhesion molecule 2 Human genes 0.000 description 5
- 101710148794 Intercellular adhesion molecule 2 Proteins 0.000 description 5
- 108090000787 Subtilisin Proteins 0.000 description 5
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 5
- 230000004075 alteration Effects 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 239000007864 aqueous solution Substances 0.000 description 5
- 108091008324 binding proteins Proteins 0.000 description 5
- 230000000975 bioactive effect Effects 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 238000004422 calculation algorithm Methods 0.000 description 5
- 239000003153 chemical reaction reagent Substances 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- 239000007789 gas Substances 0.000 description 5
- GVVPGTZRZFNKDS-JXMROGBWSA-N geranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-JXMROGBWSA-N 0.000 description 5
- 239000001257 hydrogen Substances 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 206010022000 influenza Diseases 0.000 description 5
- 238000000329 molecular dynamics simulation Methods 0.000 description 5
- 238000002703 mutagenesis Methods 0.000 description 5
- 231100000350 mutagenesis Toxicity 0.000 description 5
- 230000003389 potentiating effect Effects 0.000 description 5
- 238000000159 protein binding assay Methods 0.000 description 5
- 235000004252 protein component Nutrition 0.000 description 5
- 230000004850 protein–protein interaction Effects 0.000 description 5
- 102000005962 receptors Human genes 0.000 description 5
- 108020003175 receptors Proteins 0.000 description 5
- 239000011347 resin Substances 0.000 description 5
- 229920005989 resin Polymers 0.000 description 5
- 238000004007 reversed phase HPLC Methods 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 4
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- 241000209219 Hordeum Species 0.000 description 4
- 235000007340 Hordeum vulgare Nutrition 0.000 description 4
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 4
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 4
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 4
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 4
- 150000001720 carbohydrates Chemical group 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 4
- 238000006731 degradation reaction Methods 0.000 description 4
- 201000010099 disease Diseases 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 230000003246 elastolytic effect Effects 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 238000005040 ion trap Methods 0.000 description 4
- 150000002632 lipids Chemical class 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 230000002438 mitochondrial effect Effects 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 4
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 4
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 4
- 230000004224 protection Effects 0.000 description 4
- 238000001742 protein purification Methods 0.000 description 4
- 230000002797 proteolythic effect Effects 0.000 description 4
- 230000002441 reversible effect Effects 0.000 description 4
- 241001430294 unidentified retrovirus Species 0.000 description 4
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 3
- BDNKZNFMNDZQMI-UHFFFAOYSA-N 1,3-diisopropylcarbodiimide Chemical compound CC(C)N=C=NC(C)C BDNKZNFMNDZQMI-UHFFFAOYSA-N 0.000 description 3
- ASOKPJOREAFHNY-UHFFFAOYSA-N 1-Hydroxybenzotriazole Chemical compound C1=CC=C2N(O)N=NC2=C1 ASOKPJOREAFHNY-UHFFFAOYSA-N 0.000 description 3
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 3
- 102000005367 Carboxypeptidases Human genes 0.000 description 3
- 108010006303 Carboxypeptidases Proteins 0.000 description 3
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 3
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 3
- 101710154606 Hemagglutinin Proteins 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- 101710173438 Late L2 mu core protein Proteins 0.000 description 3
- 108090000189 Neuropeptides Proteins 0.000 description 3
- 102000003797 Neuropeptides Human genes 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 3
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 3
- 108010033276 Peptide Fragments Proteins 0.000 description 3
- 102000007079 Peptide Fragments Human genes 0.000 description 3
- 108010067902 Peptide Library Proteins 0.000 description 3
- 101710176177 Protein A56 Proteins 0.000 description 3
- 101710188315 Protein X Proteins 0.000 description 3
- 101710188306 Protein Y Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 239000012190 activator Substances 0.000 description 3
- 230000001588 bifunctional effect Effects 0.000 description 3
- 229920001222 biopolymer Polymers 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 150000007942 carboxylates Chemical class 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000004132 cross linking Methods 0.000 description 3
- 238000012258 culturing Methods 0.000 description 3
- 210000000805 cytoplasm Anatomy 0.000 description 3
- 238000004925 denaturation Methods 0.000 description 3
- 230000036425 denaturation Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- BGRWYRAHAFMIBJ-UHFFFAOYSA-N diisopropylcarbodiimide Natural products CC(C)NC(=O)NC(C)C BGRWYRAHAFMIBJ-UHFFFAOYSA-N 0.000 description 3
- 238000007865 diluting Methods 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 230000003292 diminished effect Effects 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000012636 effector Substances 0.000 description 3
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 3
- 238000010828 elution Methods 0.000 description 3
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 238000001415 gene therapy Methods 0.000 description 3
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 3
- 239000000185 hemagglutinin Substances 0.000 description 3
- 125000001165 hydrophobic group Chemical group 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 238000001802 infusion Methods 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 210000002824 peroxisome Anatomy 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 239000011541 reaction mixture Substances 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 125000006850 spacer group Chemical group 0.000 description 3
- 230000003595 spectral effect Effects 0.000 description 3
- 210000000130 stem cell Anatomy 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 125000000341 threoninyl group Chemical class [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- DHBXNPKRAUYBTH-UHFFFAOYSA-N 1,1-ethanedithiol Chemical compound CC(S)S DHBXNPKRAUYBTH-UHFFFAOYSA-N 0.000 description 2
- 238000005160 1H NMR spectroscopy Methods 0.000 description 2
- OINNEUNVOZHBOX-QIRCYJPOSA-K 2-trans,6-trans,10-trans-geranylgeranyl diphosphate(3-) Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP([O-])(=O)OP([O-])([O-])=O OINNEUNVOZHBOX-QIRCYJPOSA-K 0.000 description 2
- 102000004400 Aminopeptidases Human genes 0.000 description 2
- 108090000915 Aminopeptidases Proteins 0.000 description 2
- 241000203069 Archaea Species 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 2
- 108090000317 Chymotrypsin Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 2
- OINNEUNVOZHBOX-XBQSVVNOSA-N Geranylgeranyl diphosphate Natural products [P@](=O)(OP(=O)(O)O)(OC/C=C(\CC/C=C(\CC/C=C(\CC/C=C(\C)/C)/C)/C)/C)O OINNEUNVOZHBOX-XBQSVVNOSA-N 0.000 description 2
- 102100031132 Glucose-6-phosphate isomerase Human genes 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 2
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 2
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- 108090000978 Interleukin-4 Proteins 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- 101150117895 LAMP2 gene Proteins 0.000 description 2
- 102100025169 Max-binding protein MNT Human genes 0.000 description 2
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 2
- 238000012565 NMR experiment Methods 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 2
- 125000003368 amide group Chemical group 0.000 description 2
- 150000001412 amines Chemical group 0.000 description 2
- 150000003862 amino acid derivatives Chemical class 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- RDOXTESZEPMUJZ-UHFFFAOYSA-N anisole Chemical compound COC1=CC=CC=C1 RDOXTESZEPMUJZ-UHFFFAOYSA-N 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 235000009697 arginine Nutrition 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 230000030833 cell death Effects 0.000 description 2
- 230000003833 cell viability Effects 0.000 description 2
- 229960002376 chymotrypsin Drugs 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000003431 cross linking reagent Substances 0.000 description 2
- 201000010251 cutis laxa Diseases 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 238000002337 electrophoretic mobility shift assay Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000000132 electrospray ionisation Methods 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000011049 filling Methods 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 238000001476 gene delivery Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 210000002288 golgi apparatus Anatomy 0.000 description 2
- 239000003102 growth factor Substances 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 230000002045 lasting effect Effects 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 210000003712 lysosome Anatomy 0.000 description 2
- 230000001868 lysosomic effect Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 210000003470 mitochondria Anatomy 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 230000026792 palmitoylation Effects 0.000 description 2
- 239000000816 peptidomimetic Substances 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 230000035479 physiological effects, processes and functions Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000016434 protein splicing Effects 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 238000010791 quenching Methods 0.000 description 2
- 230000000171 quenching effect Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 150000003355 serines Chemical class 0.000 description 2
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 2
- 239000011550 stock solution Substances 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- HNKJADCVZUBCPG-UHFFFAOYSA-N thioanisole Chemical compound CSC1=CC=CC=C1 HNKJADCVZUBCPG-UHFFFAOYSA-N 0.000 description 2
- 238000012582 total correlation spectroscopy experiment Methods 0.000 description 2
- 108091006107 transcriptional repressors Proteins 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 238000004104 two-dimensional total correlation spectroscopy Methods 0.000 description 2
- 235000002374 tyrosine Nutrition 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- FXYPGCIGRDZWNR-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 3-[[3-(2,5-dioxopyrrolidin-1-yl)oxy-3-oxopropyl]disulfanyl]propanoate Chemical compound O=C1CCC(=O)N1OC(=O)CCSSCCC(=O)ON1C(=O)CCC1=O FXYPGCIGRDZWNR-UHFFFAOYSA-N 0.000 description 1
- YFGBQHOOROIVKG-BHDDXSALSA-N (2R)-2-[[(2R)-2-[[2-[[2-[[(2S)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-4-methylsulfanylbutanoic acid Chemical compound C([C@H](C(=O)N[C@H](CCSC)C(O)=O)NC(=O)CNC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=CC=C1 YFGBQHOOROIVKG-BHDDXSALSA-N 0.000 description 1
- QDZOEBFLNHCSSF-PFFBOGFISA-N (2S)-2-[[(2R)-2-[[(2S)-1-[(2S)-6-amino-2-[[(2S)-1-[(2R)-2-amino-5-carbamimidamidopentanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-N-[(2R)-1-[[(2S)-1-[[(2R)-1-[[(2S)-1-[[(2S)-1-amino-4-methyl-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]pentanediamide Chemical compound C([C@@H](C(=O)N[C@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(N)=O)NC(=O)[C@@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](N)CCCNC(N)=N)C1=CC=CC=C1 QDZOEBFLNHCSSF-PFFBOGFISA-N 0.000 description 1
- CXNPLSGKWMLZPZ-GIFSMMMISA-N (2r,3r,6s)-3-[[(3s)-3-amino-5-[carbamimidoyl(methyl)amino]pentanoyl]amino]-6-(4-amino-2-oxopyrimidin-1-yl)-3,6-dihydro-2h-pyran-2-carboxylic acid Chemical compound O1[C@@H](C(O)=O)[C@H](NC(=O)C[C@@H](N)CCN(C)C(N)=N)C=C[C@H]1N1C(=O)N=C(N)C=C1 CXNPLSGKWMLZPZ-GIFSMMMISA-N 0.000 description 1
- KUHSEZKIEJYEHN-BXRBKJIMSA-N (2s)-2-amino-3-hydroxypropanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.OC[C@H](N)C(O)=O KUHSEZKIEJYEHN-BXRBKJIMSA-N 0.000 description 1
- NCYCYZXNIZJOKI-IOUUIBBYSA-N 11-cis-retinal Chemical compound O=C/C=C(\C)/C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C NCYCYZXNIZJOKI-IOUUIBBYSA-N 0.000 description 1
- 125000003287 1H-imidazol-4-ylmethyl group Chemical group [H]N1C([H])=NC(C([H])([H])[*])=C1[H] 0.000 description 1
- 150000003923 2,5-pyrrolediones Chemical class 0.000 description 1
- AENZGWONVTXLRC-UHFFFAOYSA-N 2-(2,5-dioxopyrrol-1-yl)benzoic acid Chemical compound OC(=O)C1=CC=CC=C1N1C(=O)C=CC1=O AENZGWONVTXLRC-UHFFFAOYSA-N 0.000 description 1
- 238000004466 2D NOESY spectrum Methods 0.000 description 1
- 238000005084 2D-nuclear magnetic resonance Methods 0.000 description 1
- BIGBDMFRWJRLGJ-UHFFFAOYSA-N 3-benzyl-1,5-didiazoniopenta-1,4-diene-2,4-diolate Chemical compound [N-]=[N+]=CC(=O)C(C(=O)C=[N+]=[N-])CC1=CC=CC=C1 BIGBDMFRWJRLGJ-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- GVUGADOWXGKRAE-SRVKXCTJSA-N 4-[[(2s)-1-[[(2s)-1-[[(2s)-1-(4-nitroanilino)-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-4-oxobutanoic acid Chemical compound OC(=O)CCC(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)NC1=CC=C([N+]([O-])=O)C=C1 GVUGADOWXGKRAE-SRVKXCTJSA-N 0.000 description 1
- NLPWSMKACWGINL-UHFFFAOYSA-N 4-azido-2-hydroxybenzoic acid Chemical compound OC(=O)C1=CC=C(N=[N+]=[N-])C=C1O NLPWSMKACWGINL-UHFFFAOYSA-N 0.000 description 1
- FVFVNNKYKYZTJU-UHFFFAOYSA-N 6-chloro-1,3,5-triazine-2,4-diamine Chemical compound NC1=NC(N)=NC(Cl)=N1 FVFVNNKYKYZTJU-UHFFFAOYSA-N 0.000 description 1
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 description 1
- 108010027410 Adenovirus E3 Proteins Proteins 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 240000000662 Anethum graveolens Species 0.000 description 1
- 102100030346 Antigen peptide transporter 1 Human genes 0.000 description 1
- 102100030343 Antigen peptide transporter 2 Human genes 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 1
- 102100022717 Atypical chemokine receptor 1 Human genes 0.000 description 1
- 238000012935 Averaging Methods 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 101800001415 Bri23 peptide Proteins 0.000 description 1
- 101800000655 C-terminal peptide Proteins 0.000 description 1
- 102400000107 C-terminal peptide Human genes 0.000 description 1
- 102100029968 Calreticulin Human genes 0.000 description 1
- 108090000549 Calreticulin Proteins 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 102000019034 Chemokines Human genes 0.000 description 1
- 108010012236 Chemokines Proteins 0.000 description 1
- 229940122644 Chymotrypsin inhibitor Drugs 0.000 description 1
- 108091028732 Concatemer Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000700108 Ctenophora <comb jellyfish phylum> Species 0.000 description 1
- 108010060385 Cyclin B1 Proteins 0.000 description 1
- 102100039205 Cytochrome P450 3A4 Human genes 0.000 description 1
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 description 1
- 108050008072 Cytochrome c oxidase subunit IV Proteins 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 101710202200 Endolysin A Proteins 0.000 description 1
- 102000005593 Endopeptidases Human genes 0.000 description 1
- 108010059378 Endopeptidases Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000701533 Escherichia virus T4 Species 0.000 description 1
- 239000005977 Ethylene Substances 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 241000724791 Filamentous phage Species 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 102100023686 G protein-coupled receptor kinase 6 Human genes 0.000 description 1
- 108010040540 G-protein-coupled receptor kinase 6 Proteins 0.000 description 1
- 102100032340 G2/mitotic-specific cyclin-B1 Human genes 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 108010051696 Growth Hormone Proteins 0.000 description 1
- XLYOFNOQVPJJNP-ZSJDYOACSA-N Heavy water Chemical compound [2H]O[2H] XLYOFNOQVPJJNP-ZSJDYOACSA-N 0.000 description 1
- 241001622557 Hesperia Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- LYCVKHSJGDMDLM-LURJTMIESA-N His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 LYCVKHSJGDMDLM-LURJTMIESA-N 0.000 description 1
- 101000678879 Homo sapiens Atypical chemokine receptor 1 Proteins 0.000 description 1
- 101000745711 Homo sapiens Cytochrome P450 3A4 Proteins 0.000 description 1
- 101000716102 Homo sapiens T-cell surface glycoprotein CD4 Proteins 0.000 description 1
- 108010070875 Human Immunodeficiency Virus tat Gene Products Proteins 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 241000243251 Hydra Species 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Hygromycin-B Natural products OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 102000003746 Insulin Receptor Human genes 0.000 description 1
- 108010001127 Insulin Receptor Proteins 0.000 description 1
- 102100025390 Integrin beta-2 Human genes 0.000 description 1
- 102000000588 Interleukin-2 Human genes 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 102000010789 Interleukin-2 Receptors Human genes 0.000 description 1
- 108010038453 Interleukin-2 Receptors Proteins 0.000 description 1
- 229930194542 Keto Natural products 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- JTTHKOPSMAVJFE-VIFPVBQESA-N L-homophenylalanine Chemical compound OC(=O)[C@@H](N)CCC1=CC=CC=C1 JTTHKOPSMAVJFE-VIFPVBQESA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 101150048357 Lamp1 gene Proteins 0.000 description 1
- 101710128836 Large T antigen Proteins 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 108010064548 Lymphocyte Function-Associated Antigen-1 Proteins 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- 102000014944 Lysosome-Associated Membrane Glycoproteins Human genes 0.000 description 1
- 108010064171 Lysosome-Associated Membrane Glycoproteins Proteins 0.000 description 1
- 101710151321 Melanostatin Proteins 0.000 description 1
- 108010023335 Member 2 Subfamily B ATP Binding Cassette Transporter Proteins 0.000 description 1
- 102400000988 Met-enkephalin Human genes 0.000 description 1
- DBTDEFJAFBUGPP-UHFFFAOYSA-N Methanethial Chemical class S=C DBTDEFJAFBUGPP-UHFFFAOYSA-N 0.000 description 1
- 108010042237 Methionine Enkephalin Proteins 0.000 description 1
- 241000713869 Moloney murine leukemia virus Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical class ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 1
- 125000000729 N-terminal amino-acid group Chemical group 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 102000003729 Neprilysin Human genes 0.000 description 1
- 108090000028 Neprilysin Proteins 0.000 description 1
- 102400000064 Neuropeptide Y Human genes 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 108020004485 Nonsense Codon Proteins 0.000 description 1
- 102000002488 Nucleoplasmin Human genes 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 230000004989 O-glycosylation Effects 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 101710116435 Outer membrane protein Proteins 0.000 description 1
- 208000012868 Overgrowth Diseases 0.000 description 1
- 108010043958 Peptoids Proteins 0.000 description 1
- 102100029251 Phagocytosis-stimulating peptide Human genes 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 101710149951 Protein Tat Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 206010038997 Retroviral infections Diseases 0.000 description 1
- 102100040756 Rhodopsin Human genes 0.000 description 1
- 108090000820 Rhodopsin Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 102000001332 SRC Human genes 0.000 description 1
- 108060006706 SRC Proteins 0.000 description 1
- 102000013541 Shaker Superfamily of Potassium Channels Human genes 0.000 description 1
- 108010026533 Shaker Superfamily of Potassium Channels Proteins 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 102100038803 Somatotropin Human genes 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 102400000096 Substance P Human genes 0.000 description 1
- 101800003906 Substance P Proteins 0.000 description 1
- 108010056079 Subtilisins Proteins 0.000 description 1
- 102000005158 Subtilisins Human genes 0.000 description 1
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- 101150052863 THY1 gene Proteins 0.000 description 1
- 101800000849 Tachykinin-associated peptide 2 Proteins 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 241000710145 Tomato bushy stunt virus Species 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 108060008539 Transglutaminase Proteins 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- 108010084754 Tuftsin Proteins 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 239000003875 Wang resin Substances 0.000 description 1
- 241000269368 Xenopus laevis Species 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 102000019997 adhesion receptor Human genes 0.000 description 1
- 108010013985 adhesion receptor Proteins 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- PPQRONHOSHZGFQ-LMVFSUKVSA-N aldehydo-D-ribose 5-phosphate Chemical group OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PPQRONHOSHZGFQ-LMVFSUKVSA-N 0.000 description 1
- SHGAZHPCJJPHSC-YCNIQYBTSA-N all-trans-retinoic acid Chemical compound OC(=O)\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C SHGAZHPCJJPHSC-YCNIQYBTSA-N 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 210000000612 antigen-presenting cell Anatomy 0.000 description 1
- 239000003125 aqueous solvent Substances 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000008003 autocrine effect Effects 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000009141 biological interaction Effects 0.000 description 1
- CXNPLSGKWMLZPZ-UHFFFAOYSA-N blasticidin-S Natural products O1C(C(O)=O)C(NC(=O)CC(N)CCN(C)C(N)=N)C=CC1N1C(=O)N=C(N)C=C1 CXNPLSGKWMLZPZ-UHFFFAOYSA-N 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- MIOPJNTWMNEORI-UHFFFAOYSA-N camphorsulfonic acid Chemical compound C1CC2(CS(O)(=O)=O)C(=O)CC1C2(C)C MIOPJNTWMNEORI-UHFFFAOYSA-N 0.000 description 1
- 210000004413 cardiac myocyte Anatomy 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 238000010370 cell cloning Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 210000003855 cell nucleus Anatomy 0.000 description 1
- 238000003570 cell viability assay Methods 0.000 description 1
- 230000007248 cellular mechanism Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- BJDCWCLMFKKGEE-CMDXXVQNSA-N chembl252518 Chemical compound C([C@@](OO1)(C)O2)C[C@H]3[C@H](C)CC[C@@H]4[C@@]31[C@@H]2O[C@H](O)[C@@H]4C BJDCWCLMFKKGEE-CMDXXVQNSA-N 0.000 description 1
- 238000010382 chemical cross-linking Methods 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 210000001612 chondrocyte Anatomy 0.000 description 1
- 239000003541 chymotrypsin inhibitor Substances 0.000 description 1
- 108010081370 chymotrypsin inhibitor 2 Proteins 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 238000000749 co-immunoprecipitation Methods 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 239000003636 conditioned culture medium Substances 0.000 description 1
- 238000002884 conformational search Methods 0.000 description 1
- 210000001608 connective tissue cell Anatomy 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000005100 correlation spectroscopy Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 125000004093 cyano group Chemical group *C#N 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 108010035432 cytochrome C(L) Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000006240 deamidation Effects 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000022811 deglycosylation Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 229960004132 diethyl ether Drugs 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000000447 dimerizing effect Effects 0.000 description 1
- 229910001873 dinitrogen Inorganic materials 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 125000002228 disulfide group Chemical group 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000002849 elastase inhibitory effect Effects 0.000 description 1
- 238000007787 electrohydrodynamic spraying Methods 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 229940125532 enzyme inhibitor Drugs 0.000 description 1
- 210000003979 eosinophil Anatomy 0.000 description 1
- 230000002922 epistatic effect Effects 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 210000001723 extracellular space Anatomy 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000006126 farnesylation Effects 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 210000001650 focal adhesion Anatomy 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 239000013505 freshwater Substances 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 230000009395 genetic defect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000006130 geranylgeranylation Effects 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 229930182470 glycoside Natural products 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- 229930004094 glycosylphosphatidylinositol Natural products 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000000122 growth hormone Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 210000003630 histaminocyte Anatomy 0.000 description 1
- 235000014304 histidine Nutrition 0.000 description 1
- 150000002411 histidines Chemical class 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 125000002349 hydroxyamino group Chemical group [H]ON([H])[*] 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- GRRNUXAQVGOGFE-NZSRVPFOSA-N hygromycin B Chemical compound O[C@@H]1[C@@H](NC)C[C@@H](N)[C@H](O)[C@H]1O[C@H]1[C@H]2O[C@@]3([C@@H]([C@@H](O)[C@@H](O)[C@@H](C(N)CO)O3)O)O[C@H]2[C@@H](O)[C@@H](CO)O1 GRRNUXAQVGOGFE-NZSRVPFOSA-N 0.000 description 1
- 229940097277 hygromycin b Drugs 0.000 description 1
- UGQQAJOWXNCOPY-VBCJEVMVSA-N i1osj03h46 Chemical compound C([C@H]12)C[C@H]3[C@@](C4(Cl)Cl)(Cl)C(Cl)=C(Cl)[C@@]4(Cl)[C@H]3CC[C@@H]1[C@]1(Cl)C(Cl)=C(Cl)[C@@]2(Cl)C1(Cl)Cl UGQQAJOWXNCOPY-VBCJEVMVSA-N 0.000 description 1
- 108010063679 ice nucleation protein Proteins 0.000 description 1
- 150000002463 imidates Chemical class 0.000 description 1
- 125000001841 imino group Chemical group [H]N=* 0.000 description 1
- 230000014726 immortalization of host cell Effects 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000012750 in vivo screening Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 108010038415 interleukin-8 receptors Proteins 0.000 description 1
- 102000010681 interleukin-8 receptors Human genes 0.000 description 1
- 230000007728 intracellular signaling mechanism Effects 0.000 description 1
- 239000011630 iodine Substances 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- QRXWMOHMRWLFEY-UHFFFAOYSA-N isoniazide Chemical compound NNC(=O)C1=CC=NC=C1 QRXWMOHMRWLFEY-UHFFFAOYSA-N 0.000 description 1
- 230000005445 isotope effect Effects 0.000 description 1
- 210000002510 keratinocyte Anatomy 0.000 description 1
- 125000000468 ketone group Chemical group 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 230000002147 killing effect Effects 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 150000002611 lead compounds Chemical class 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 230000006674 lysosomal degradation Effects 0.000 description 1
- 230000002132 lysosomal effect Effects 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 210000002752 melanocyte Anatomy 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- UZKWTJUDCOPSNM-UHFFFAOYSA-N methoxybenzene Substances CCCCOC=C UZKWTJUDCOPSNM-UHFFFAOYSA-N 0.000 description 1
- YCXSYMVGMXQYNT-UHFFFAOYSA-N methyl 3-[(4-azidophenyl)disulfanyl]propanimidate Chemical compound COC(=N)CCSSC1=CC=C(N=[N+]=[N-])C=C1 YCXSYMVGMXQYNT-UHFFFAOYSA-N 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000007431 microscopic evaluation Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 230000025608 mitochondrion localization Effects 0.000 description 1
- 238000012900 molecular simulation Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000002433 mononuclear leukocyte Anatomy 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 208000025113 myeloid leukemia Diseases 0.000 description 1
- 210000000107 myocyte Anatomy 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 210000000633 nuclear envelope Anatomy 0.000 description 1
- 238000000655 nuclear magnetic resonance spectrum Methods 0.000 description 1
- 238000012585 nuclear overhauser effect spectroscopy experiment Methods 0.000 description 1
- URPYMXQQVHTUDU-OFGSCBOVSA-N nucleopeptide y Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 URPYMXQQVHTUDU-OFGSCBOVSA-N 0.000 description 1
- 108060005597 nucleoplasmin Proteins 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 210000002997 osteoclast Anatomy 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 230000003076 paracrine Effects 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 239000000813 peptide hormone Substances 0.000 description 1
- 108010082406 peptide permease Proteins 0.000 description 1
- 230000007030 peptide scission Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 108010083127 phage repressor proteins Proteins 0.000 description 1
- 238000011170 pharmaceutical development Methods 0.000 description 1
- 239000002831 pharmacologic agent Substances 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 238000003566 phosphorylation assay Methods 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 230000010287 polarization Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920001451 polypropylene glycol Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000011533 pre-incubation Methods 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010066381 preproinsulin Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 238000002818 protein evolution Methods 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 230000018883 protein targeting Effects 0.000 description 1
- 108010033896 proteinase inhibitor I (plants) Proteins 0.000 description 1
- 150000003212 purines Chemical class 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- FFISLTWEISOMFC-UHFFFAOYSA-N pyridin-2-yl propanedithioate Chemical compound CCC(=S)SC1=CC=CC=N1 FFISLTWEISOMFC-UHFFFAOYSA-N 0.000 description 1
- 239000010453 quartz Substances 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000012508 resin bead Substances 0.000 description 1
- 210000003660 reticulum Anatomy 0.000 description 1
- 229930002330 retinoic acid Natural products 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 238000007423 screening assay Methods 0.000 description 1
- 210000004739 secretory vesicle Anatomy 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 230000008684 selective degradation Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 230000007781 signaling event Effects 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicon dioxide Inorganic materials O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 238000002922 simulated annealing Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 159000000000 sodium salts Chemical class 0.000 description 1
- 239000002195 soluble material Substances 0.000 description 1
- ATHGHQPFGPMSJY-UHFFFAOYSA-Q spermidine(3+) Chemical compound [NH3+]CCCC[NH2+]CCC[NH3+] ATHGHQPFGPMSJY-UHFFFAOYSA-Q 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 238000005556 structure-activity relationship Methods 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- IWOKCMBOJXYDEE-UHFFFAOYSA-N sulfinylmethane Chemical class C=S=O IWOKCMBOJXYDEE-UHFFFAOYSA-N 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 108700029760 synthetic LTSP Proteins 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- NBOMNTLFRHMDEZ-UHFFFAOYSA-N thiosalicylic acid Chemical compound OC(=O)C1=CC=CC=C1S NBOMNTLFRHMDEZ-UHFFFAOYSA-N 0.000 description 1
- 238000010399 three-hybrid screening Methods 0.000 description 1
- 235000008521 threonine Nutrition 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 102000003601 transglutaminase Human genes 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 229960001727 tretinoin Drugs 0.000 description 1
- 125000002221 trityl group Chemical group [H]C1=C([H])C([H])=C([H])C([H])=C1C([*])(C1=C(C(=C(C(=C1[H])[H])[H])[H])[H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- IESDGNYHXIOKRW-LEOABGAYSA-N tuftsin Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@H](CCCNC(N)=N)C(O)=O IESDGNYHXIOKRW-LEOABGAYSA-N 0.000 description 1
- 229940035670 tuftsin Drugs 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 150000003668 tyrosines Chemical class 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 238000001086 yeast two-hybrid system Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K7/00—Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
- C07K7/04—Linear peptides containing only normal peptide links
- C07K7/06—Linear peptides containing only normal peptide links having 5 to 11 amino acids
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K1/00—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
- C07K1/04—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length on carriers
- C07K1/047—Simultaneous synthesis of different peptide species; Peptide libraries
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Zoology (AREA)
- Toxicology (AREA)
- Analytical Chemistry (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Description
WO 99/51625 PCT/US99/07374
PEPTIDES CAUSING FORMATION OF COMPACT STRUCTURES
Field of the Invention
The compositions and methods of the invention relate to dimerization peptides that self-associate and to their use with other proteins to effect the formation of compact structures.
Background of the Invention
Proteins interact with each other largely through conformationally constrained domains. Although linear peptides with freely rotating amino and carboxyl termini can have potent functions, as is known in the art, the conversion of such peptide structures into pharmacologic agents is frequently difficult. Therefore, the presentation of peptides in conformationally constrained structures can result in the generation of pharmaceuticals with high affinity for their target proteins. Constrained peptides have many valuable features compared to their linear analogs. These include: (i) enhanced stability to proteolysis [Szewczuk et al., Biochemistry 31:9132-9140 (1992)] due to the lack of unconstrained N- or C-terminal amino acid residues accessible to amino- or carboxypeptidases and a non-extended structure which diminishes endopeptidase susceptibility; (ii) a restricted conformation space that can result in a higher binding affinity for cognate binding proteins due to a reduced entropic cost of binding [Hruby, Life Sci., 31:189-199 (1982); Rizo and Gierasch, Ann. Rev. Biochem. 61:387-418 (1992)]; (iii) the geometry to mimic reverse turns, loops or other secondary structures [Rose et al., Adv. Protein Chem., 37:1-109 (1985); Stradley et al., Biopolymers 29:263-287 (1990); Rizo et al., in Molecular Conformation and Biological Interactions (P. Balaram and S. Ramaseshan, eds.), Indian Academy of Sciences Publications (Bangalore, India), p. 469-496 (1991)]; and (iv) a conformationally restricted scaffold which allows easier pharmacophore and drug development.
Thus constrained peptides can form the basis for the isolation of new ligands and receptors and subsequently for the rational design of small molecules which may be useful as drugs. The desirability of this approach was shown using cyclic peptide libraries which have been used to discover and refine potent ligands of a variety of receptors [O'Neil et al., Proteins: Structure Function and Genetics 14:509-515 (1992); Giebel et al., Biochem. 34:15430-35 (1995); Spatola and Crozet, J. Med. Chem. 39:3842-46 (1996); Koivunen et al., J. Biol. Chem. 268:20205-10 (1993); Koivunen et al., J. Cell. Biol. 124:373-380 (1994)], enzymes [McBride et al., J. Mol. Biol. 259:819-27 (1996); Eichler et al., Mol. Divers. 1:233-240 (1996)], and other proteins [Wang et al., J. Biol. Chem. 270:23239-42 (1995)].
Several constrained protein scaffolds capable of presenting a protein of interest as a conformationally restricted domain are described in the literature and include minibody structures (Bianchi et al., J. Mol. Biol. 236(2):649-59 (1994)), loops on beta-sheet turns, coiled-coil stem structures (Myszka and Chaiken, Biochemistry 33:2363-2372 (1994)), zinc-finger domains, cysteine-linked (disulfide) structures, transglutaminase linked structures, cyclic peptides, helical barrels or bundles, leucine zipper motifs (Martin et al., EMBO J. 13(22):5303-5309 (1994); O'Shea et al., Science 243:538-42 (1993)), etc.
In addition, self-aggregation has been described for regulatory peptides such as the neuropeptide head activator [as further outlined below; Bodenmuller et al., EMBO J. 5(8):1825-1829 (1986)], substance P [Poujade et al., Biochem. Biophys. Res. Commun. 114:1109-1116 (1983)], met-enkephalin [Mastropaolo et al., Biochem. Biophys. Res. Commun. 134:698-703 (1986)], and neuropeptide Y [Minakata et al., J. Biol. Chem. 264:7907-7913 (1989)].
Pertinent to the subject of this invention is a peptide derived from the neuropeptide head activator (HA) isolated from the freshwater coelenterate Hydra (Bodenmuller et al., supra). Bodenmuller et al. demonstrated that under physiological conditions the HA peptide (pEPPGGSKVILF) dimerizes to form a biologically inactive molecule.
Dimerization of the monomer form yields a stable structure, which does not dissociate into its monomeric components at concentrations as low as 10^-13 M. Further analysis of HA fragments revealed that a fragment containing only the last six amino acid residues from the carboxy terminus of the HA peptide (pSKVILF) dimerized more efficiently than HA itself. However, a fragment containing only the last 4 amino acid residues (pVILF) and a fragment derived from the amino-terminal end of HA (pEPPGGSK) did not lead to dimer formation. Most importantly, their analysis showed that both the replacement of the carboxy-terminal phenylalanine and a modification thereof (e.g., introduction of an iodine in the para position of the aromatic ring) abolished dimerization completely or decreased dimerization tendency drastically.
Aldwin et al. (US 5,491,074), referring to SKVILF as an 'association peptide', added amino acid residues to either its amino terminus or its carboxy terminus and found that some of the resulting proteins could form dimeric peptides. However, Aldwin et al. did not demonstrate or anticipate the addition of more than one 'association peptide' to one polypeptide of interest.
SUMMARY OF THE INVENTION
Peptides which have a moderate or high affinity for each other, when added as extensions to both the N- and C-terminus of a protein, can be used to help fold the protein into a compact structure.
Compared to cognate linear proteins and disulfide-cyclized proteins, this new compact structure is more stable to cellular and other proteases, and is significantly more conformationally constrained than the linear peptides. The compact structure can have other functional sequences embedded within its sequence, and is preferable to linear and less constrained peptides for intracellular and extracellular library screens, and for targeting to specific intracellular locations. It can be used, with appropriate flanking residues on each end of the varied residues in a random peptide sequence, to create structurally-biased peptide libraries. By virtue of its stability and constraints, this scaffold can prolong the activity of any embedded peptide sequences in the presence of proteases.
Peptides having the property of self-aggregating are referred to herein as dimerization peptides. The dimerization peptides of this invention comprise the sequence FLIVK (from amino-terminal to carboxy-terminal). Examples of dimerization sequences which enhance the folding of a protein of interest include, but are not limited to, FLIVK, EFLIVKS, KFVLIKS, VSIKFEL, LIVKS, EFLIVK, KFLIVK, FESIKVL, and LKSIVEF. These dimerization peptides (DP) can be used in several combinations to yield proteins of the general structure 'DP-protein' or 'DP-protein-DP', wherein 'DP' is a dimerization peptide and 'protein' comprises at least two amino acid residues. In addition, other amino acid sequences including, but not limited to, linker sequences, tag sequences, targeting sequences and stabilization sequences are generally included.
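The general 'DP-protein' and 'DP-protein-DP' structures can be illustrated with a minimal Python sketch. The function name `build_dp_fusion` and its parameters are hypothetical and serve only as illustration; the sketch simply concatenates one-letter sequences and is not a description of how the constructs of the invention are actually made.

```python
def build_dp_fusion(protein, n_dp="EFLIVKS", c_dp=None, linker=""):
    """Concatenate a dimerization peptide (DP), optional linker, and protein.

    protein : one-letter amino acid sequence of the protein of interest
    n_dp    : DP placed at the N-terminus
    c_dp    : DP placed at the C-terminus, or None for a plain 'DP-protein'
    linker  : optional linker inserted between each DP and the protein
              (hypothetical parameter, empty by default)
    """
    # The text requires 'protein' to comprise at least two residues.
    if len(protein) < 2:
        raise ValueError("protein must comprise at least two amino acid residues")
    parts = [n_dp, linker, protein]
    if c_dp is not None:
        parts += [linker, c_dp]
    return "".join(parts)


# 'DP-protein-DP' using the 18-residue test sequence from the figure legends.
fusion = build_dp_fusion("VGTIVTMEYRIDRTRSFV", n_dp="EFLIVKS", c_dp="EFLIVKS")
print(fusion)
```

Leaving `c_dp` as `None` gives the 'DP-protein' form; passing two different peptides gives a construct with distinct N- and C-terminal dimerization peptides.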
Other sequences include those with a high content of hydrophobic amino acids and 1 or 2 charged residue side chains. Generally, a sequence at each terminus of the dimerization peptide composed of 5, 6, 7 and 8 amino acids with at least 3-4 highly hydrophobic residues (taken from F, I, L, M, V, W, and Y) will function in this fashion.
In one embodiment, the invention provides a composition comprising at least a first dimerization peptide that is no more than 8 amino acids long, comprising the sequence NH2-FLIVX5-COOH, wherein X5 is selected from the group consisting of K, R, D and E.
The compositions of this invention are displayed intracellularly or extracellularly and are useful to identify binding proteins and molecules and to modulate intracellular signaling pathways. In one aspect of the invention, a library of constrained proteins is evaluated in vivo for its bioactive potential. Thus, the invention accesses molecules or targets within living cells and provides for the isolation of the constrained protein which has a phenotypic effect on this living cell. This method comprises the steps of a) introducing a library encoding constrained proteins into a plurality of cells; and b) screening the plurality of cells for an altered phenotype, conferred upon the cell by a member of the library. The methods may also include the steps of c) isolating cell(s) exhibiting an altered phenotype and d) isolating the member of the library which caused an altered phenotype.
In another aspect, the compositions of the invention are useful to identify in vitro binding proteins and other small molecules capable of binding to the constrained protein. This method comprises the steps of a) providing a constrained protein of interest; b) binding the constrained protein of interest to a solid support; c) providing a molecular library comprising a plurality of individual members; and d) providing conditions allowing the individual members to bind to the constrained protein of interest. The method may also include the step of e) isolating the bound library member.
In another aspect, the invention provides for the construction of molecular libraries comprising a plurality of constrained proteins. This library of constrained proteins is used in in vitro binding assays to identify individual members capable of binding to a protein of interest. This method comprises the steps of a) providing a protein of interest; b) binding the protein of interest to a solid support; c) providing a molecular library comprising a plurality of constrained proteins; and d) providing conditions allowing the constrained proteins to bind to the protein of interest. The method may also include the step of e) isolating the bound constrained protein.
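As a rough illustration of what a library of constrained proteins looks like at the sequence level, the following Python sketch enumerates random inserts flanked by a dimerization peptide on each side. It is an assumption-laden toy: the function name, the uniform sampling over the 20 standard residues, and the fixed seed are all hypothetical choices, and the libraries described here are encoded by nucleic acids rather than generated as strings.

```python
import random

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def random_constrained_library(size, insert_len, dp="EFLIVKS", seed=0):
    """Return `size` sequences of the form DP - random insert - DP.

    size       : number of library members to generate
    insert_len : number of randomized residues between the two DPs
    dp         : dimerization peptide used at both termini
    seed       : seed for reproducible sampling (illustrative only)
    """
    if insert_len < 2:
        raise ValueError("the randomized region should have at least two residues")
    rng = random.Random(seed)
    library = []
    for _ in range(size):
        insert = "".join(rng.choice(AMINO_ACIDS) for _ in range(insert_len))
        library.append(dp + insert + dp)
    return library


for member in random_constrained_library(size=3, insert_len=10):
    print(member)
```

Each printed member carries the same flanking peptide at both termini, so only the central region varies between library members.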
The compositions of the invention are thus useful as a scaffold for gene therapy and for potential use as a therapeutic in physiological fluids.
In an additional aspect of the invention, the constrained peptides are linked to fusion partners or are targeted to specific subcellular compartments.
The present invention also provides molecular libraries encoding constrained proteins, comprising plasmids and retroviral components and host cells comprising these molecular libraries.
BRIEF DESCRIPTION OF THE FIGURES
Figures 1A, 1B, 1C, 1D, 1E, 1F, and 1G depict schematic drawings of some embodiments of DP-protein structures. Fig. 1A. Two dimerization peptides (DP) are fused to a linear protein, which results in a DP-protein structure (shown here as DP-protein-DP) that may fold into a compact structure due to the dimerization of DP. Fig. 1B. DP-protein structures comprising a linker. Fig. 1C. DP-protein structures comprising a tag sequence (Tag). Tag 1 and Tag 2 are two different tags fused to one DP-protein, indicating that many combinations of fusing tags to the DP-protein are possible. Fig. 1D. DP-protein with linkers in between DP and P and two different tags. Fig. 1E. DP-protein, wherein a dimerization peptide added to the N-terminus of P is different from a dimerization peptide (DP2) added to the C-terminus of P. Fig. 1F. DP-protein comprising stability sequences such as MG at its N-terminus and GGPP at its C-terminus. Fig. 1G. DP-proteins, wherein multiple proteins P1, P2 and P3 are fused to dimerization peptides.
Figures 2A, 2B, and 2C depict schematic drawings of complex DP-proteins. Fig. 2A. Covalently associated double-loop structure. Due to the specific dimerization of DPhyd:DPhyd and DPLys:DPGlu, two constrained peptides are formed within one DP-protein and a double-loop structure is expected. The two loop structures are covalently linked through a flexible glycine linker. Fig. 2B. Non-covalently associated double-loop structure. Two DP-proteins, one comprising P1, the other comprising P2, are made, each resulting in a compact structure due to the dimerization of DPhyd:DPhyd. When combined, due to the specific dimerization of DPLys:DPGlu, the two constrained structures associate, yielding a double-loop structure. The two dimerization peptides DPhyd and DPLys or DPhyd and DPGlu are connected through a flexible glycine linker. Fig. 2C. Non-covalently associated double-loop structure, wherein unconstrained proteins P1 and P2 are forced into a compact structure due to the specific dimerization of DPhyd:DPhyd and DPLys:DPGlu. The dimerization peptides which associate are confined to different DP-proteins but associate with one another when the two DP-proteins are combined. Figs. 2A-C. DPhyd is a dimerization peptide comprising mostly hydrophobic amino acids; DPLys is a dimerization peptide comprising mostly lysines; DPGlu is a dimerization peptide comprising mostly glutamic acids; LP is a linker comprising prolines; LG is a linker comprising glycines; P1 and P2 are proteins, which may or may not be the same.
Figures 3A and 3B show that novel peptides form observable dimers. Fig. 3A. Dimerization of SKVILFE-amide and EFLIVKS-amide. Fig. 3B. Dimerization of EFLIVKS-amide when eluted from a C18 reversed phase column at pH ~2.5 in ca. 25% acetonitrile.
Figure 4 shows LC/MS examination of the crude synthesis products from an all-single coupled fmoc synthesis of EFLIVKS-amide, for shorter sequences which can dimerize after electrospray ionization.
Figures 5A, 5B, and 5C show proteolytically resistant structures. Fig. 5A. Elastase digestion products of the 18mer test protein sequence CGTIVTMEYRIDRTRSFC. Fig. 5B. Elastase digestion products of the 18mer test protein sequence CGTIVTMEYRIDRTRSFC with a disulfide bond between the two terminal cysteines. Fig. 5C. Elastase digestion products of EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS. Figs. 5A-C. Proteolytic fragments are monitored by reversed phase HPLC coupled to mass spectrometry detection and identified.
Figure 6. Overlay of the 45 lowest energy structures (only the peptide backbone is shown) of EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS.
DETAILED DESCRIPTION OF THE INVENTION
Cyclic or otherwise constrained peptides have many valuable features compared to their linear analogs, including enhanced stability to proteolysis and a restricted conformation space that can result in a higher binding affinity for cognate binding proteins due to a reduced entropic cost of binding. These constrained peptides can form the basis for the subsequent design of small molecules which may be useful as drugs.
Constrained peptides contained in minimized proteins may also be useful as an intermediate step in the design of agents blocking protein-protein interactions [Cunningham and Wells, Curr. Opin. Struct. Biol. 7:457-462 (1997)], incorporated herein by reference, which may offer a novel method of regulating intracellular signaling pathways. When peptides are intracellularly expressed, they may modulate intracellular signaling pathways [Souroujon and Mochly-Rosen, Nat. Biotechnol. 16(10):919-24 (1998)]. If the peptides are expressed in live mammalian cells, they may be screened for defined changes in cellular phenotype, and the resulting bioactive peptides may provide a route for the affinity isolation of their binding targets.
Accordingly, the present invention provides dimerization peptides. By "dimerization peptide", "DP" or "association peptide" or grammatical equivalents herein is meant a peptide which either self-aggregates or dimerizes or associates with a second peptide.
By "self-aggregates", or "dimerizes" or "associates" herein is meant that a peptide has an affinity for another peptide and non-covalently attaches itself to this peptide. The interaction between two molecules (e.g., two peptides) that are capable of binding to one another is usually characterized in terms of the strength with which these molecules interact, i.e., the "affinity" that the molecules have for one another.
The range of measured affinity constants, for example, for antibody-antigen binding extends from 10^5 liter mol^-1 to above 10^12 liter mol^-1 (Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1988). For comparison, the affinity of trypsin for its substrate is approximately 1.25 x 10^4 liter mol^-1 and the affinity of lambda repressor for DNA is 10^10 liter mol^-1 (Harlow and Lane, supra).
Dimerization peptides provided by this invention usually have affinities for one another in the range from about 10^5 liter mol^-1 to about 10^13 liter mol^-1, more usually from about 10^6 liter mol^-1 to about 10^13 liter mol^-1, from about 10^7 liter mol^-1 to about 10^13 liter mol^-1 being preferred, from about 10^8 liter mol^-1 to about 10^13 liter mol^-1 being more preferred, from about 10^9 liter mol^-1 to about 10^13 liter mol^-1 being mostly preferred, and from about 10^10 liter mol^-1 to about 10^13 liter mol^-1 being especially preferred. As is known to those in the art, measurement of affinity constants is affected by temperature, pH and solvent.
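To put these association constants in more familiar terms, the short Python sketch below converts an affinity constant Ka (liter mol^-1) into a dissociation constant Kd = 1/Ka (mol liter^-1) and a standard binding free energy, delta G = -RT ln(Ka). The 1 M standard state, the temperature of 298.15 K, and the function names are assumptions made for illustration; as noted above, measured constants depend on temperature, pH and solvent.

```python
import math

GAS_CONSTANT = 8.314462618  # J mol^-1 K^-1


def kd_from_ka(ka):
    """Dissociation constant (mol/liter) from an association constant (liter/mol)."""
    return 1.0 / ka


def binding_free_energy_kj(ka, temperature_k=298.15):
    """Standard free energy of association in kJ/mol, assuming a 1 M standard state."""
    return -GAS_CONSTANT * temperature_k * math.log(ka) / 1000.0


# Lower and upper ends of the affinity range stated in the text.
for ka in (1e5, 1e13):
    print(f"Ka = {ka:.0e} L/mol  Kd = {kd_from_ka(ka):.0e} M  "
          f"dG = {binding_free_energy_kj(ka):.1f} kJ/mol")
```

Under these assumptions, each tenfold increase in Ka corresponds to roughly 5.7 kJ/mol of additional binding free energy at 298.15 K.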
By "peptide" herein is meant a compound which comprises at least two covalently attached amino acids and includes proteins, polypeptides, oligopeptides and peptides. The peptide may be made up of naturally occurring amino acids and peptide bonds, or synthetic peptidomimetic structures. Thus "amino acid", or "amino acid residue", or "peptide residue", as used herein means both naturally occurring and synthetic amino acids. For example, homo-phenylalanine, citrulline and norleucine are considered amino acids for the purposes of the invention. "Amino acid" also includes imino acid residues such as proline and hydroxyproline. The side chains may be in either the (R) or the (S) configuration. In the preferred embodiment, the amino acids are in the (S) or L-configuration. If non-naturally occurring side chains are used, non-amino acid substituents may be used, for example to prevent or retard in vivo degradations.
In general, peptides of the invention, including DPs and test peptides, comprise at least about 3 amino acids in length, usually from about 3 amino acids in length to about 100 amino acids, from about 3 amino acids in length to about 50 amino acids being preferred, from about 3 amino acids in length to about amino acids being more preferred, from about 4 amino acids in length to about 10 amino acids being mostly preferred and from about 5 amino acids in length to about 9 amino acids being especially preferred; peptides of 5, 6, 7, 8, 9, and 10 amino acids are preferred. Similarly, when larger test proteins are used, these may comprise at least about 3 amino acids in length, usually from about 3 amino acids in length to about 1000 amino acids, from about 3 amino acids in length to about 600 amino acids being preferred, from about 3 amino acids in length to about 400 amino acids being more preferred, from about 3 amino acids in length to about 200 amino acids being mostly preferred and from about 3 amino acids in length to about 100 amino acids being especially preferred.
The dimerization peptides (DP) of the invention comprise the sequence NH2-X1-X2-X3-X4-X5-COOH and generally are no more than 9 amino acids long, wherein X2, X3 and X4 are generally selected from the group consisting of amino acids A, V, I, L, W, F, M and Y and X5 is generally selected from the group consisting of K, R, D and E.
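The five-position pattern just described can be expressed as a simple sequence check. The Python sketch below is one possible reading, not a definition from the specification: it leaves X1 unconstrained because the text lists residue sets only for X2 through X4 and for X5, it treats "generally" as a hard rule, and the function names are hypothetical.

```python
HYDROPHOBIC = set("AVILWFMY")  # residues allowed at X2, X3 and X4
CHARGED = set("KRDE")          # residues allowed at X5


def matches_core(core):
    """True if a 5-residue window fits X1-X2-X3-X4-X5 (X1 unconstrained here)."""
    return (
        len(core) == 5
        and all(residue in HYDROPHOBIC for residue in core[1:4])
        and core[4] in CHARGED
    )


def contains_dp_pattern(peptide, max_len=9):
    """True if the peptide is at most `max_len` residues and contains the pattern."""
    if len(peptide) > max_len:
        return False
    # Slide a 5-residue window along the peptide.
    return any(matches_core(peptide[i:i + 5]) for i in range(len(peptide) - 4))


for candidate in ("FLIVK", "EFLIVKS", "GGGGG"):
    print(candidate, contains_dp_pattern(candidate))
```

A sliding window is used so that peptides with extra flanking residues, such as EFLIVKS, are recognized through their embedded FLIVK core.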
In a preferred embodiment, the dimerization peptides (DP) comprise the sequence NH2-FLIVK-COOH.
As outlined above, other sequences include those with a high content of hydrophobic amino acids and 1 or 2 charged amino acid residues. Generally, a sequence composed of 5, 6, 7 and 8 amino acids with at least 3-4 highly hydrophobic residues (taken from A, F, I, L, M, V, W, and Y) will function in this fashion.
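The composition rule in the preceding paragraph (5 to 8 residues, at least 3 highly hydrophobic residues, and 1 or 2 charged residues) can be sketched as a filter. This Python example is a literal-minded reading under stated assumptions: the threshold of 3 is the lower end of the "3-4" range, charged residues are taken to be K, R, D and E, and the function name is hypothetical. Passing the filter says nothing about whether a sequence actually dimerizes.

```python
HIGHLY_HYDROPHOBIC = set("AFILMVWY")  # residue set named in the paragraph above
CHARGED_RESIDUES = set("KRDE")        # assumed definition of 'charged'


def fits_composition_rule(seq, min_hydrophobic=3):
    """Check length 5-8, hydrophobic content, and 1 or 2 charged residues."""
    n_hydrophobic = sum(residue in HIGHLY_HYDROPHOBIC for residue in seq)
    n_charged = sum(residue in CHARGED_RESIDUES for residue in seq)
    return (
        5 <= len(seq) <= 8
        and n_hydrophobic >= min_hydrophobic
        and 1 <= n_charged <= 2
    )


for candidate in ("FLIVK", "EFLIVKS", "KKKKK", "GGSGG"):
    print(candidate, fits_composition_rule(candidate))
```

Raising `min_hydrophobic` to 4 applies the stricter end of the stated range.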
In a preferred embodiment the dimerization sequence is NH2-XFLIVK-COOH, wherein X is either D, E, K, or R.
In another preferred embodiment the dimerization sequence is NH2-FLIVKS-COOH.
In a preferred embodiment the dimerization sequence is NH2-XFLIVKS-COOH, wherein X is either glutamic acid, aspartic acid, lysine or arginine.
In another embodiment, DP-proteins comprise sequences comprising (Lys)4-8 or (Arg)4-8 fused, as outlined in more detail below, to one terminus of a protein, and (Asp)4-8 or (Glu)4-8 fused to the other terminus of a protein. Such DP-proteins would be expected to form compact structures with the ends forming a 4-8 residue ion-paired extended array.
Particularly preferred embodiments include, but are not limited to, the sequences EFLIVKS, KFLIVKS, EEFLIVKKS, EEFLIVKKS-acid, VSIKFEL, SKVILFE, AFLIVKS, EALIVKS, EFAIVKS, EFLAVKS, EFLIAKS, EFLIVAS, EFLIVKA, EFLKVKS, SKVILFE, EFLIVES, EKLKVKS, ESLSVKS, EFLIVES, VSIKFEL, LIVKS, FESIKVL and LKSIVEF.
In a preferred embodiment, the DPs of the invention are covalently attached to a protein or peptide of interest, frequently referred to herein as "protein of interest", "peptide of interest", "test protein", or "test peptide", depending on its size.
By "protein of interest", "peptide of interest", "test protein", or "test peptide" or grammatical equivalents herein is meant a protein for which generally a function is sought or which has certain characteristics to be tested. Generally, test proteins are encoded by nucleic acids which are obtained from genomic DNA, cDNA or from random nucleic acids. These nucleic acids are expressed (as detailed below) to generate the test proteins. Smaller test proteins, usually test peptides, can also be synthesized on a peptide synthesizer. Synthesis on a peptide synthesizer allows the incorporation of synthetic analogs including, but not limited to, unnatural amino acids or peptidomimetic bonds to enhance potency and stability of the test protein or test peptide.
In a preferred embodiment, the test peptides are randomized. By "random" or "randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide consists of essentially random nucleotides and random amino acids, respectively. Generally these random test peptides are expressed from a molecular library. In a preferred embodiment, the molecular library comprises at least two different randomized nucleic acid sequences, with a plurality of different randomized nucleic acid sequences being preferred. These nucleic acid sequences are chemically synthesized, and may incorporate any nucleotide at any position. The synthetic process can be designed to generate randomized nucleic acids, to allow the formation of all or most of the possible combinations over the length of the sequence, thus forming a library of randomized nucleic acids encoding randomized candidate proteinaceous molecules (e.g., randomized candidate DP-proteins). The randomized nucleic acid sequences thus create a library of fragments, each encoding a different protein, which are ligated into suitable vectors and transformed into cells, as outlined herein.
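To give a sense of scale for "all or most of the possible combinations", the theoretical size of a fully randomized library can be computed directly. The short Python sketch below is illustrative only; the function names are hypothetical, and the calculation assumes the 20 standard amino acids and four nucleotides at every randomized position, ignoring stop codons and codon degeneracy.

```python
def peptide_diversity(length: int, alphabet_size: int = 20) -> int:
    """Number of distinct peptides of a given length (20 standard residues assumed)."""
    return alphabet_size ** length


def nucleotide_diversity(n_codons: int) -> int:
    """Number of distinct nucleic acid sequences spanning n_codons NNN triplets."""
    return 4 ** (3 * n_codons)


for n in (5, 7, 9):
    print(n, peptide_diversity(n), nucleotide_diversity(n))
```

Because several codons encode the same residue, the number of distinct nucleic acid sequences exceeds the number of distinct peptides they encode.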
In one embodiment, the library is fully randomized, with no sequence preferences or constants at any position.
In another preferred embodiment, the library is biased. That is, some positions within the sequence are either held constant, or are selected from a limited number of possibilities. For example, in a preferred embodiment, triplets of nucleotides (NNN) are randomized to encode amino acid residues within a defined class, for example, hydrophobic amino acids, hydrophilic residues, sterically biased (either small or large) residues, towards the creation of cysteines, for cross-linking, prolines for SH-3 domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to purines, etc.
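A biased library of the kind described above, in which each randomized triplet is restricted to codons for one residue class, may be sketched as follows. This Python example is a hedged illustration rather than the method of the invention: the function name biased_library, the choice of the hydrophobic class, the uniform sampling scheme, and the fixed random seed are all assumptions. The codon assignments are those of the standard genetic code.

```python
import random

# Standard-genetic-code codons for the hydrophobic residues A, V, I, L, M, F, W, Y.
HYDROPHOBIC_CODONS = {
    "A": ["GCT", "GCC", "GCA", "GCG"],
    "V": ["GTT", "GTC", "GTA", "GTG"],
    "I": ["ATT", "ATC", "ATA"],
    "L": ["CTT", "CTC", "CTA", "CTG", "TTA", "TTG"],
    "M": ["ATG"],
    "F": ["TTT", "TTC"],
    "W": ["TGG"],
    "Y": ["TAT", "TAC"],
}


def biased_library(n_members, n_codons, codon_table=HYDROPHOBIC_CODONS, seed=0):
    """Return n_members DNA strings, each n_codons triplets drawn from codon_table.

    Sampling picks a residue uniformly, then one of its codons uniformly;
    this scheme is an illustrative assumption.
    """
    rng = random.Random(seed)
    residues = sorted(codon_table)
    library = []
    for _ in range(n_members):
        codons = []
        for _ in range(n_codons):
            residue = rng.choice(residues)
            codons.append(rng.choice(codon_table[residue]))
        library.append("".join(codons))
    return library


for member in biased_library(n_members=3, n_codons=6):
    print(member)
```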
The term "random peptide library" or "random protein library" is meant herein to comprise recombinant vectors encoding random peptides (or random proteins), the random peptides (or random proteins) encoded by those recombinant vectors, recombinant vectors encoding fusion proteins comprising random peptides (or random proteins), and the fusion proteins, comprising random peptides (or random proteins), encoded by those recombinant vectors.
In a preferred embodiment, the sequence of the candidate DP-protein is used to generate derivatives of the originally isolated candidate DP-protein. For example, the sequence of the candidate DP-protein may be the basis of a second round of (biased) randomization, to generate derivative DP-proteins with increased or altered activities. Alternatively, the second round of randomization may change the affinity of the bioactive agent. Furthermore, it may be desirable to operably link the protein component of the identified DP-protein to different dimerization sequences than those used to isolate the original candidate DP-protein. This may result in a fusion protein that is more or less constrained and thus may have altered activities. It may also be desirable to "walk" around a potential binding site, in a manner similar to the mutagenesis of a binding pocket, by keeping one end of the ligand region constant and randomizing the other end to shift the binding of the peptide around.
In a preferred embodiment, the test protein comprises a wild-type or naturally occurring sequence.
Alternatively, it may be a derivative protein thereof, that is, it may contain amino acid substitutions, insertions or deletions, or combinations thereof which are not found in the originally isolated DP-protein.
These modifications are routinely performed by in vitro mutagenesis of the nucleic acid encoding the protein of interest. In vitro mutagenesis methods are well known to those in the art and are found in, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989) and Ausubel et al., Short Protocols in Molecular Biology (John Wiley & Sons, Inc., 1995).
The DPs of the invention are covalently joined to the test protein. By "covalently attached" or "covalently joined" or grammatical equivalents herein is meant that two moieties are attached by at least one bond, including sigma bonds, pi bonds, and coordination bonds. As is more fully outlined below, the DPs of the invention are covalently joined to fusion partners and/or test peptides. Covalent attachment to fusion partners and test peptides is accomplished by employing cysteine (disulfide) linkage, peptide bond linkage, a variety of bifunctional agents (cross-linking agents, such as maleimidobenzoic acid, methyldithioacetic acid, mercaptobenzoic acid, S-pyridyl dithiopropionate, etc.), or attachment via nonpeptide bonds. Examples of nonpeptide bonds include, but are not limited to, retroinverso bonds, N-methyl amine bonds, depsipeptide bonds, hydroxyamino peptide isosteres, thioamide bonds, peptoids [Simon et al., Proc. Natl. Acad. Sci. USA 89:9367-71 (1992)], double bonds, reduced peptide bonds, ethylene bonds, keto peptide bond analogs, methylene sulfoxides, and methylene sulfides [Rizo and Gierasch, Annu. Rev. Biochem. 61:387-418 (1992)].
In general, as detailed below, the DPs are joined to peptides or proteins using peptide bonds, for example by expressing nucleic acids that encode the DP and the respective peptide or protein of interest.
In a preferred embodiment, the DPs of the invention are joined to a test protein to form fusion proteins, in a wide variety of ways, as will be appreciated by those in the art. As is more fully described below, they can be joined to one or more internal positions, or preferably to either or both of the N- and C-termini. The attachment of a DP to a fusion partner results in a structure referred to herein as a DP-protein.
By "DP-protein" herein is meant a compound comprising at least one dimerization peptide covalently joined to at least one peptide. DP-proteins include candidate DP-proteins, as defined below. As will be appreciated by those in the art, when a single DP is used, the compositions and methods of the invention find use in the association of two test peptides. That is, a first DP (DP1) can be joined to a first test protein (protein1), and a second DP (DP2) can be joined to a second test protein (protein2). When two DPs are used, the compositions find use in the generation of constrained test peptides.
In a preferred embodiment, at least one DP is joined to the N-terminus of a test protein, with the attachment of two DPs being preferred. In this embodiment, when two or more DPs are joined to the test protein, the DPs may be identical in sequence or may have a different sequence. The DPs may or may not be separated by a linker sequence as further outlined below. In an embodiment wherein the same DP or two different DPs with affinity for one another are joined to the N-termini of two different test proteins, protein1 and protein2, generating, for example, DP-protein1 and DP-protein2, the two DPs associate with one another and protein1 and protein2 are brought into proximity. Due to the presence of the same DP sequence, in addition to protein1:protein2 heterodimers, protein1:protein1 homodimers and protein2:protein2 homodimers can be made.
In a preferred embodiment, at least one DP is joined to the C-terminus of a test protein, with the attachment of two DPs being preferred. As above, the DPs may be identical in sequence or may have a different sequence. The DPs may or may not be separated by a linker sequence as further outlined below. In an embodiment wherein the same DP or two different DPs with affinity for one another are joined to the C-termini of two different test proteins, protein1 and protein2, generating, for example, protein1-DP and protein2-DP, the two DPs associate with one another and protein1 and protein2 are brought into proximity. Due to the presence of the same DP sequence, in addition to protein1:protein2 heterodimers, protein1:protein1 homodimers and protein2:protein2 homodimers are formed.
In a preferred embodiment, at least one DP is joined to an internal position of a test protein, with attachment of two DPs being preferred. As above, the DPs may be identical in sequence or may have a different sequence. The DPs may or may not be separated by a linker sequence as further outlined below. When two or more DPs are joined to an internal position, the DPs may be juxtaposed, that is, inserted into the same internal position, for example, generating Nprotein_i-DP1-DP2-iprotein_C, or the DPs may be separated and joined to different internal positions, for example, generating Nprotein_i-DP1-iprotein_i-DP2-iprotein_C, wherein Nprotein_i is the amino-terminal part of the test protein, iprotein_C is the carboxy-terminal part of the test protein, and iprotein_i is an internal part of the protein, flanked by the dimerization peptides DP1 and DP2. In an embodiment wherein DP1 and DP2 are of identical sequence or have an affinity for one another, they associate and the part of the test protein enclosed by DP1 and DP2 (iprotein_i) forms a loop structure.
In a preferred embodiment, the linkage of the DP to the test protein is direct; that is, there is a direct fusion of the DP sequence with the test protein sequence.
In a preferred embodiment, the linkage of the DP to the test protein is indirect; that is, a linker or spacer is used. The term "linker", or "spacer", or "tethering sequence" or grammatical equivalents is meant herein to comprise a molecule or a group of molecules that connects two molecules. Often the inclusion of a linker serves to place the two molecules in a preferred configuration, for example, imposing a more constrained configuration on two molecules (such as when linkers comprising prolines are used) or imposing a more relaxed configuration on two molecules (that is, minimal steric hindrance; such as when linkers comprising serines and glycines are used).
In a preferred embodiment, a linker sequence is included at any position, in between DP and the protein of interest, in between two unrelated DPs, or in between two fusion partners. As outlined herein, the linker sequence can be proteinaceous or non-proteinaceous. Linker sequences between individual components of the compound may be desirable, for example, to allow the protein of interest to interact with potential targets unhindered, to constrain the protein of interest, or to allow functioning of a new property conferred upon the protein of interest (e.g., subcellular localization). For constraining a protein of interest, proline-containing linkers are particularly preferred. As is known in the art, prolines confer unique conformational constraints on a polypeptide chain. Useful proline linkers include proline-glycine polymers (including, but not limited to, (PG)n, (PPGG)n, and combinations thereof, wherein n is an integer of at least one). Preferred linkers allowing some flexibility of the polypeptide include glycine-serine polymers (including, but not limited to, (GSGGS)n and (GGGS)n, and combinations thereof, wherein n is an integer of at least one), glycine-alanine polymers, alanine-serine polymers, and other flexible linkers such as the tether for the shaker potassium channel, and a large variety of other flexible linkers, as will be appreciated by those in the art. Glycine-serine polymers are particularly preferred.
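The repeat-unit linkers named above, such as (PG)n and (GSGGS)n, lend themselves to a simple construction routine. The following Python sketch is illustrative only; the function names repeat_linker and join_with_linker and the particular values of n are hypothetical choices made for the example, not values taught by the text.

```python
def repeat_linker(unit: str, n: int) -> str:
    """Build a linker of the form (unit)n, where n is an integer of at least one."""
    if n < 1:
        raise ValueError("n must be an integer of at least one")
    return unit * n


def join_with_linker(dp: str, linker: str, protein: str) -> str:
    """Indirect linkage: DP, then linker, then the protein of interest."""
    return dp + linker + protein


constraining = repeat_linker("PG", 3)     # proline-glycine polymer, (PG)3
flexible = repeat_linker("GSGGS", 2)      # glycine-serine polymer, (GSGGS)2

print(constraining)
print(flexible)
print(join_with_linker("EFLIVKS", flexible, "STKSIPPQS"))
```

Combinations of units, as contemplated in the text, can be formed by concatenating the results of several calls.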
In a preferred embodiment, the DP-protein comprises two DPs. In this embodiment, the two DPs are used to conformationally constrict the test protein. DPs, when covalently joined at the N- and C-terminus of a protein of interest (ranging from 3 to 50 or more amino acid residues), help the protein of interest to fold into a compact structure (also referred to herein as a constrained structure) which is more proteolytically resistant than the linear protein sequence alone. Particularly preferred in this embodiment, and when screening for interacting molecules, are random test proteins.
In a preferred embodiment, a first DP (DP1) is fused to the N-terminus of a test protein, and a second DP (DP2) is fused to the C-terminus of a test protein (protein), generating, for example, DP1-Nprotein_C-DP2. In this embodiment, the first and second DP can be the same or different. When two DPs are used that can self-aggregate, the two DPs associate and impose a constrained structure upon the test protein enclosed in between the two DPs. When two different DPs (DP1 and DP2) are joined to the N-terminus and to the C-terminus of a test protein, the two different DPs nevertheless can associate and impose a constrained structure upon the test protein, provided that DP1 and DP2 have an affinity for one another.
Different DP sequences that can associate are, for example, KFLIVKS and EFLIVES.
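One way to inspect a pair of DPs such as those named above is to compare their formal net charges. The Python sketch below is a crude illustration under stated assumptions: it counts K and R as +1 and D and E as -1, ignoring terminal groups, histidine, and actual pKa values, and the suggestion that opposite net charges contribute to association is an interpretive assumption and not a criterion stated in the text. The function name net_charge is hypothetical.

```python
def net_charge(seq: str) -> int:
    """Formal net charge: +1 per K or R, -1 per D or E (termini ignored)."""
    seq = seq.upper()
    positive = sum(seq.count(residue) for residue in "KR")
    negative = sum(seq.count(residue) for residue in "DE")
    return positive - negative


for dp in ("KFLIVKS", "EFLIVES", "EFLIVKS"):
    print(dp, net_charge(dp))
```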
Particularly preferred examples of DP-proteins include, but are not limited to: (i) EFLIVKS-protein-EFLIVKS; (ii) KVLIKS-protein-EFLIVES; (iii) VSIKFEL-protein-VSIKFEL; (iv) LIVKS-protein-LIVKS; (v) EFLIVK-protein-EFLIVK; (vi) FESIKVL-protein-FESIKVL; and (vii) LKSIVEF-protein-LKSIVEF.
More specifically, DP1-protein-DP2-like compounds provided by this invention comprise:
(i) EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS, wherein the protein sequence is obtained from the barley c2chymotrypsin inhibitor [VGTIVTMEYRIDRTRSFV; Leatherbarrow and Salacinski, Biochemistry 30:10717-21 (1991)] and DP1 and DP2 are identical;
(ii) EFLIVKS-VGTIVTMEYRIDRTRSFV-SKVILFE, wherein the sequence of DP2 is the reverse sequence of DP1;
(iii) SKVILFE-VGTIVTMEYRIDRTRSFV-EFLIVKS, wherein the sequence of DP1 is the reverse of DP2;
(iv) SKVILFE-VGTIVTMEYRIDRTRSFV-SKVILFE, wherein DP1 and DP2 are identical, but are the reverse of the DP1 and DP2 shown in (i);
(v) KFLIVKS-VGTIVTMEYRIDRTRSFV-KFLIVKS, wherein DP1 and DP2 are identical;
(vi) KFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVES, wherein DP1 and DP2 are different;
(vii) EFLIVES-VGTIVTMEYRIDRTRSFV-EFLIVES, wherein DP1 and DP2 are identical;
(viii) EKLKVKS-VGTIVTMEYRIDRTRSFV-EKLKVKS, wherein DP1 and DP2 are identical;
(ix) ESLSVKS-VGTIVTMEYRIDRTRSFV-ESLSVKS, wherein DP1 and DP2 are identical;
(x) EFLKVKS-VGTIVTMEYRIDRTRSFV-EFLKVKS, wherein DP1 and DP2 are identical;
(xi) EEFLIVKKS-VGTIVTMEYRIDRTRSFV-EEFLIVKKS, wherein DP1 and DP2 are identical;
(xii) MGEFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKSGPP, wherein DP1 and DP2 are identical and DP1 comprises amino acids MG and DP2 comprises amino acids GPP for conferring increased stability;
(xiii) KKKKKKGGGGEFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS, wherein DP1 and DP2 are identical and DP1 comprises amino acids KKKKKKGGGG for conferring increased solubility;
(xiv) KKKGSGSEFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS, wherein DP1 and DP2 are identical and DP1 comprises amino acids KKKGSGS for conferring increased solubility;
(xv) EFLIVKS-STKSIPPQS-EFLIVKS, wherein the 9-mer insert represents an analog of a protease inhibitor [Gariani and Leatherbarrow, J. Peptide Res. 49:467-75 (1997)];
(xvi) MGEFLIVKS-GGGGDYKDDDDKGGGG-EFLIVKSGPP, wherein DP1 and DP2 are identical and DP1 comprises amino acids MG and DP2 comprises amino acids GPP for conferring increased stability and the protein comprises the flag epitope (DYKDDDDK) with glycine spacers;
(xvii) MGEFLIVKS-GGGGYPYDVPDYASLGGGG-EFLIVKSGPP, wherein DP1 and DP2 are identical and DP1 comprises amino acids MG and DP2 comprises amino acids GPP for conferring increased stability and the protein comprises the influenza hemagglutinin epitope tag (YPYDVPDYASL) with glycine spacers.
The dimerization sequence is underlined in all the above examples.
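The DP-protein-DP arrangements listed above can be assembled, and the stated "reverse" relationship between two DPs verified, with a few lines of code. The following Python sketch is illustrative only; the function names build_construct and is_reverse are hypothetical, and the sequences used are taken from the examples in the text.

```python
INSERT = "VGTIVTMEYRIDRTRSFV"  # 18-residue inhibitor-derived sequence from the text


def build_construct(dp1: str, protein: str, dp2: str) -> str:
    """Concatenate DP1, the enclosed protein, and DP2 (direct fusion, no linkers)."""
    return dp1 + protein + dp2


def is_reverse(dp_a: str, dp_b: str) -> bool:
    """True if dp_b read backwards gives dp_a."""
    return dp_a == dp_b[::-1]


construct = build_construct("EFLIVKS", INSERT, "SKVILFE")
print(construct)
print(is_reverse("EFLIVKS", "SKVILFE"))
print(is_reverse("KFLIVKS", "EFLIVES"))
```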
In a preferred embodiment, a first DP (DP1) is joined to the N-terminus of the test protein and a second DP (DP2) is joined to an internal position of the test protein. A structure such as DP1-Nprotein_i-DP2-iprotein_C is generated. In an embodiment wherein DP1 and DP2 are of identical sequence or have an affinity for one another, they associate and the part of the test protein enclosed by DP1 and DP2 (Nprotein_i) forms a loop.
In a preferred embodiment, a first DP (DP1) is joined to the C-terminus of the test protein and a second DP (DP2) is joined to an internal position of the test protein. A structure such as Nprotein_i-DP2-iprotein_C-DP1 is generated. In an embodiment wherein DP1 and DP2 are of identical sequence or have an affinity for one another, they associate and the part of the test protein enclosed by DP1 and DP2 (iprotein_C) forms a loop.
In a preferred embodiment, both the first DP (DP1) and the second DP (DP2) are joined to an internal position of the test protein or preferably to two different internal positions of the test protein, generating a structure such as Nprotein_i-DP1-iprotein_i-DP2-iprotein_C. In an embodiment wherein DP1 and DP2 are of identical sequence or have an affinity for one another, they associate and the part of the test protein enclosed by DP1 and DP2 (iprotein_i) forms a loop.
In a preferred embodiment, different dimerization peptides are fused to more than one protein which will be covalently associated with one another. In this embodiment, the individual dimerization peptides may also be separated by linkers inserted in between DP and a protein and/or in between individual DPs. For example, a DP fusion protein such as DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys-Lp-protein2-Lp-DPGlu (see Figure 2A), wherein DPhyd is a DP comprising mostly hydrophobic amino acid residues, DPLys is a DP comprising mostly lysine residues, DPGlu is a DP comprising mostly glutamic acid residues, Lp is a linker comprising proline residues, LG is a linker comprising glycine residues, and protein1 and protein2 are proteins which comprise different protein sequences, can be made. The above illustrated bivalent DP fusion protein will allow two constrained proteins to be covalently associated with one another within a single fusion protein, forming a 'double-loop' structure. Within such a structure, the first loop (comprising protein1) is formed by the dimerization of the first DPhyd with the second DPhyd and the second loop (comprising protein2) is formed by the dimerization of DPLys and DPGlu. The two loop structures may be separated by a flexible linker such as a glycine or serine/glycine linker as outlined above.
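The bivalent arrangement DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys-Lp-protein2-Lp-DPGlu can be written out as an ordered list of named components and concatenated into a single sequence. The Python sketch below is a hedged illustration: the component names follow the text, but every concrete sequence assigned to them (for example "FLIVK" for DPhyd, "KKKK" for DPLys, "EEEE" for DPGlu, "PG" for Lp and "GGGG" for LG) is a placeholder chosen for the example and is not a combination taught by the specification. The function name assemble is hypothetical.

```python
# Placeholder sequences; all assignments are illustrative assumptions.
PARTS = {
    "DPhyd": "FLIVK",                  # mostly hydrophobic DP
    "DPLys": "KKKK",                   # mostly lysine DP
    "DPGlu": "EEEE",                   # mostly glutamic acid DP
    "Lp": "PG",                        # linker comprising proline
    "LG": "GGGG",                      # linker comprising glycine
    "protein1": "VGTIVTMEYRIDRTRSFV",  # example insert from the text
    "protein2": "STKSIPPQS",           # example insert from the text
}

# Component order of the bivalent fusion protein described above.
ORDER = [
    "DPhyd", "Lp", "protein1", "Lp", "DPhyd",
    "LG",
    "DPLys", "Lp", "protein2", "Lp", "DPGlu",
]


def assemble(order, parts):
    """Concatenate named components, N-terminus to C-terminus."""
    return "".join(parts[name] for name in order)


fusion = assemble(ORDER, PARTS)
print(fusion)
print(len(fusion))
```

Keeping the architecture as a list of names separate from the sequences makes it straightforward to swap a linker or DP without retyping the whole construct.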
In a preferred embodiment, different dimerization peptides are fused to more than one protein which then non-covalently associate with one another. In this embodiment, the individual dimerization peptides may also be separated by linkers inserted in between DP and a protein and/or in between individual DPs. For example, the following DP fusion proteins can be made: (i) DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys and (ii) DPhyd-Lp-protein2-Lp-DPhyd-LG-DPGlu (see Figure 2B), wherein DPhyd is a DP comprising mostly hydrophobic amino acid residues, DPLys is a DP comprising mostly lysine residues, DPGlu is a DP comprising mostly glutamic acid residues, Lp is a linker comprising proline residues, LG is a linker comprising glycine residues, and protein1 and protein2 are proteins which comprise different protein sequences. In the above illustration, two individual proteins (protein1 and protein2) are each held in a compact structure, due to the association of the respective DPs. Upon mixing the two DP-fusion proteins, they form non-covalently associated dimers, due to the specific association of DPLys with DPGlu, resulting in a dimer structure which comprises two different compact proteins (protein1 and protein2). In another embodiment the protein sequences inserted in between the two DPhyds are identical: (i) DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys and (ii) DPhyd-Lp-protein1-Lp-DPhyd-LG-DPGlu, resulting in a non-covalent double loop structure comprising two juxtaposed compact structures of the same protein. It will be obvious to those in the art that a plurality of DP fusion proteins other than those illustrated herein can be made.
In a preferred embodiment, different dimerization peptides are fused to more than one protein which non-covalently associate with one another. In this embodiment, DP-proteins are generated, wherein the DPs are used to non-covalently associate two or more unconstrained proteins to form constrained structures (see Figure 2C). In this embodiment, the individual dimerization peptides may also be separated conveniently by linkers inserted in between DP and a protein. For example, the following DP fusion proteins can be made: (i) DPhyd-Lp-protein1-Lp-DPLys and (ii) DPhyd-Lp-protein2-Lp-DPGlu, wherein DPhyd is a DP comprising mostly hydrophobic amino acid residues, DPLys is a DP comprising mostly lysine residues, DPGlu is a DP comprising mostly glutamic acid residues, Lp is a linker comprising proline residues, LG is a linker comprising glycine residues, and protein1 and protein2 are proteins which comprise different protein sequences. In the above illustration, two individual proteins (protein1 and protein2) are each held in a compact structure, due to the association of the respective DPs. Upon mixing the two DP-fusion proteins, they form non-covalently associated dimers, due to the specific association of DPLys with DPGlu, resulting in a dimer structure which comprises two different compact proteins (protein1 and protein2). In another embodiment the protein sequences inserted in between the two DPhyds are identical: (i) DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys and (ii) DPhyd-Lp-protein1-Lp-DPhyd-LG-DPGlu, resulting in a non-covalent double loop structure comprising two juxtaposed compact structures of the same protein.
It will be obvious to those in the art that a plurality of DP fusion proteins other than those illustrated herein, can be made.
Other dimerizing protein sequences are known in the art or may be isolated using known screening systems, such as the yeast two-hybrid system.
In one embodiment, each of the two protein sequences (protein1 and protein2), for example, within the above DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys-Lp-protein2-Lp-DPGlu, has a specific bioactivity, which when combined in a structure as outlined above, results in a bivalent DP-fusion protein which has a greater bioactivity than each alone. For example, both compact structures may bind to the same target protein, however with low affinity. Combining both compact structures into a single bivalent DP-fusion protein as outlined above may result in much higher affinity for the target protein, and thus the single DP-fusion protein may be a more potent agonist or antagonist than each isolated DP-protein.
In another preferred embodiment, DP-fusion protein structures as outlined above, such as DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys-Lp-protein2-Lp-DPGlu, which have a bivalent binding specificity, are also useful for associating two proteins for which they have affinity. In this embodiment, the compact structure comprising protein1 has affinity to a protein X, and the compact structure comprising protein2 has affinity to protein Y. Introducing this DP-fusion protein into a cell which expresses both protein X and protein Y results in binding of the bivalent DP-fusion protein to both protein X and protein Y, which thereby are brought into close proximity.
Similarly, DP-fusion protein structures as outlined above, such as DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys-Lp-protein2-Lp-DPGlu, which have a bivalent binding specificity, are also useful for associating two cells. The cells may be identical or different. In this embodiment, the compact structure comprising protein1 has affinity to a cell surface component X displayed on a first cell. The compact structure comprising protein2 has affinity to a cell surface component Y displayed on a second cell. Co-culturing the first and second cells and providing this bivalent DP-fusion protein results in binding of the DP-fusion protein to both cell surface component X and cell surface component Y, which will force the first cell and the second cell into close proximity.
Among the most challenging aspects in gene therapy is the delivery of the gene of interest into a specific target cell, wherein a genetic defect is sought to be corrected. Several gene delivery systems are known to those in the art, including, but not limited to, naked DNA, liposome-embedded DNA, and viral systems, comprising retroviruses, adenoviruses, herpesviruses, HIV, etc. However, whatever system is employed, cell-type specific delivery remains the most critical aspect of gene therapy. In a preferred embodiment, DP-fusion protein structures as outlined above, such as DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys-Lp-protein2-Lp-DPGlu, which have a bivalent binding specificity, are also useful as tools for associating virus particles (e.g., a virus that delivers a gene of interest) with the desired target cells. In this embodiment, the compact structure comprising protein1 has affinity to a surface component X displayed on the virus and the compact structure comprising protein2 has affinity to a cell surface component Y displayed on a target cell. Co-culturing the virus and the target cells and providing this bivalent DP-fusion protein results in binding of the bivalent DP-fusion protein to both viral surface component X and target cell surface component Y, which will force the virus into close proximity with its target cell. The viral particle thus may dock to the desired target cell and fuse with the membrane, ensuring gene delivery. Suitable controls are performed such that the virus does not dock with its target cell without addition of the bivalent DP-fusion protein.
In another embodiment the protein sequences inserted in between the two DPhyds and in between DPLys and DPGlu are identical, resulting in a double loop structure comprising two juxtaposed compact structures of the same protein. This embodiment allows the dimerization of the same protein, which may be a cellular protein or an extracellular protein component. It will be obvious to those in the art that a plurality of DP fusion proteins other than those illustrated herein can be made.
The DPs or DP-proteins of the present invention may also be modified, as more fully outlined below, to form fusion proteins comprising a DP or a DP-protein and another, heterologous protein or amino acid sequence, usually referred to as a fusion partner.
The term "fusion protein" or "chimeric protein" refers to a protein composed of at least two proteins that, while typically unjoined in their native state, typically are joined by their respective amino and carboxyl termini through a peptide linkage to form a single continuous protein. It will be appreciated that the protein components can be directly joined or joined through a peptide linker/spacer.
By "fusion partner" herein is meant a sequence that is associated with DP or DP-protein and confers upon DP or DP-protein an additional function or ability. Suitable fusion partners include, but are not limited to: a) tag sequences (also referred to as rescue sequences), as defined below, which allow the purification or isolation of either the DP or DP-protein or the nucleic acids encoding them; b) targeting sequences, defined below, which allow the localization of DP or DP-protein to a subcellular or extracellular compartment; c) stability sequences, which confer stability or protection from degradation to DP or DP-protein, for example resistance to proteolytic degradation; or d) any combination of a), b), and c), as well as linker sequences as needed. It is well known to those in the art that fusion proteins preferably are generated by in vitro mutagenesis and genetic engineering, whereby the nucleic acid encoding the respective fusion protein is modified accordingly. Suitable methods can be found, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989) and Ausubel et al., Short Protocols in Molecular Biology (John Wiley & Sons, Inc., 1995).
In a preferred embodiment, the fusion partner comprises a tag polypeptide which provides an epitope to which an anti-tag antibody can selectively bind or an epitope comprising a purification sequence. The epitope tag is generally, but not required to be, placed at the amino- or carboxyl-terminus of DP or DP-protein. The presence of such epitope-tagged forms of DP or DP-protein can be detected using an antibody against the tag polypeptide. Also, the use of the tag enables the protein to be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, the chimeric molecule may comprise a fusion of DP or DP-protein with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule or to GST (glutathione S transferase).
Various tag polypeptides and their respective antibodies are well known in the art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol., 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., Mol. Cell. Biol., 5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al., Protein Eng., 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., Mol. Immunol., 33:601-8 (1996); Brizzard et al., Biotechniques 16(4):730-735 (1994); Knappik and Pluckthun, Biotechniques 17(4):754-61 (1994)], the KT3 epitope peptide [Martin et al., Science, 255:192-194 (1992)], the tubulin epitope peptide [Skinner et al., J. Biol. Chem., 266:14163-14166 (1991)], and the T7 gene 10 protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA, 87:6393-6397 (1990)]. Alternatively, for example, fusion proteins encompassing poly-his tags are efficiently purified on metal (Ni) affinity resins.
In a preferred embodiment, a tag sequence (also referred to as a rescue sequence) is used to isolate the nucleic acid encoding the DP-protein (see also below). In this embodiment the rescue sequence may be a unique oligonucleotide sequence which serves as a probe target site to allow quick and easy isolation of the nucleic acid construct (see below), via PCR, hybridization, or related techniques.
In a preferred embodiment, the fusion partner is a targeting sequence. As will be appreciated by those in the art, the localization of proteins within a cell is a simple method for increasing effective concentration and determining function. These mechanisms are thought to rely on the principle of limiting the search space for ligands, that is to say, the localization of a protein to the plasma membrane limits the search for its ligand to that limited dimensional space near the membrane as opposed to the three dimensional space of the cytoplasm. Alternatively, the concentration of a protein can also be simply increased by nature of the localization, for example, shuttling the proteins into the nucleus confines them to a smaller space thereby increasing concentration.
Thus, suitable targeting sequences include, but are not limited to, (i) sequences capable of causing binding of the respective protein to a predetermined molecule or class of molecules while retaining bioactivity of the expression product (for example by using enzyme inhibitor or substrate sequences to target a class of relevant enzymes); (ii) sequences signaling selective degradation, of itself or co-bound proteins; and (iii) signal sequences capable of constitutively localizing the candidate expression products to a predetermined cellular locale, including subcellular locations such as the Golgi apparatus, endoplasmic reticulum, nucleus, nucleoli, nuclear membrane, mitochondria, chloroplast, secretory vesicles, lysosome, and cellular membrane; and extracellular locations via a secretory signal [see, von Heijne, EXS 73:67-76 (1995); von Heijne, Subcell. Biochem. 22:1-19 (1994) and von Heijne, Curr. Opin. Cell. Biol. 2(4):604-8 (1990)]. Particularly preferred is localization to either subcellular locations or to the outside of the cell via secretion.
In a preferred embodiment, the fusion partner is a nuclear localization signal (NLS). NLSs are generally short, positively charged (basic) domains that serve to direct the entire protein in which they occur to the cell's nucleus. Numerous NLS amino acid sequences have been reported including: (i) single basic NLSs such as that of the SV40 (monkey virus) large T Antigen [Pro Lys Lys Lys Arg Lys Val; Kalderon et al., Cell 39:499-509 (1984)]; the human retinoic acid receptor-β nuclear localization signal [ARRRRP; Hamy et al., Bioconjug. Chem. 2(5):375-8 (1991)]; NFκB p50 [EEVQRKRQKL; Ghosh et al., Cell 62:1019-1029 (1990)]; NFκB p65 [EEKRKRTYE; Nolan et al., Cell 64:961-969 (1991)]; and others [see for example Boulikas, J. Cell. Biochem. 55(1):32-58 (1994)], hereby incorporated by reference; and (ii) double basic NLSs exemplified by that of the Xenopus laevis (African clawed toad) protein, nucleoplasmin [Ala Val Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Leu Asp; Dingwall et al., Cell 30:449-458 (1982) and Dingwall et al., J. Cell. Biol. 107:841-849 (1988)]. Numerous localization studies have demonstrated that NLSs incorporated in synthetic peptides or grafted onto reporter proteins not normally targeted to the cell nucleus cause these peptides and reporter proteins to be concentrated in the nucleus. See, for example, Dingwall and Laskey, Annu. Rev. Cell. Biol., 2:367-390 (1986); Bonnerot et al., Proc. Natl. Acad. Sci. USA, 84:6795-6799 (1987) and Galileo et al., Proc. Natl. Acad. Sci. USA, 87:458-462 (1990).
In a preferred embodiment, the fusion partner is a membrane anchoring signal sequence. This is particularly useful since many parasites and pathogens bind to the membrane, in addition to the fact that many intracellular events originate at the plasma membrane. Thus, membrane-bound DP-proteins are useful for both the identification of important elements in these processes as well as for the discovery of effective inhibitors or activators. The invention provides methods for presenting the DP-protein extracellularly or in the cytoplasmic space. For extracellular presentation, a membrane anchoring region is provided at the carboxyl terminus of the DP-protein. The DP-protein is exposed on the cell surface and presented to the extracellular space, such that it can bind to other surface molecules (affecting their function) or molecules present in the extracellular medium. The binding of such molecules could confer function on the cells expressing a DP-protein that binds the molecule. The cytoplasmic region could be neutral or could contain a domain that, when the extracellular DP-protein is bound by a target protein or test protein, confers a function on the cells (activation of a kinase, phosphatase, binding of other cellular components to effect function). Similarly, the DP-protein-containing region could be contained within a cytoplasmic region, and the transmembrane region and extracellular region remain constant or have a defined function.
Membrane-anchoring sequences are well known in the art and are based on the genetic geometry of mammalian transmembrane molecules. Peptides are inserted into the membrane based on a secretory signal sequence and require a hydrophobic transmembrane domain. Of course, if a transmembrane domain is placed amino-terminal to the DP-protein region, it will serve to anchor the DP-protein as an intracellular domain, which may be desirable in some embodiments. Secretory signal sequences and transmembrane domains are known for a wide variety of membrane bound proteins, and these sequences may be used accordingly, either as pairs from a particular protein or with each component being taken from a different protein, or alternatively, the sequences may be synthetic, and derived entirely from consensus as artificial delivery domains.
As will be appreciated by those in the art, membrane-anchored protein sequences, including both SS and TM, are known for a wide variety of proteins and any of these may be used. Particularly preferred membrane-anchoring sequences include, but are not limited to, those derived from CD8, ICAM-2, IL-8R, CD4 and LFA-1.
Useful sequences include sequences from: (i) class I integral membrane proteins such as IL-2 receptor beta-chain [residues 1-26 are the signal sequence, residues 241-265 are the transmembrane residues; see Hatakeyama et al., Science 244:551-556 (1989) and von Heijne and Gavel, Eur. J. Biochem. 174:671-678 (1988)] and insulin receptor beta chain [residues 1-27 are the signal sequence, residues 957-959 are the transmembrane domain and residues 960-1382 are the cytoplasmic domain; see Hatakeyama, supra, and Ebina et al., Cell 40:747-758 (1985)]; (ii) class II integral membrane proteins such as neutral endopeptidase [residues 29-51 are the transmembrane domain, residues 2-28 are the cytoplasmic domain; see Malfroy et al., Biochem. Biophys. Res. Commun. 144:59-66 (1987)]; (iii) type III proteins such as human cytochrome P450 NF25 (Hatakeyama, supra); and (iv) type IV proteins such as human P-glycoprotein (Hatakeyama, supra). Particularly preferred are CD8 and ICAM-2. For example, the signal sequences from CD8 and ICAM-2 lie at the extreme 5' end of the transcript. These sequences encode the amino acids 1-32 in the case of CD8 [MASPLTRFLSLNLLLLGESILGSGEAKPQAP; Nakauchi et al., Proc. Natl. Acad. Sci. USA 82:5126-30 (1985)] and 1-21 in the case of ICAM-2 [MSSFGYRTLTVALFTLICCPG; Staunton et al., Nature 339:61-64 (1989)]. These leader sequences deliver the construct to the membrane while the hydrophobic transmembrane domains, placed carboxy-terminal to the DP-protein region, serve to anchor the construct in the membrane. These transmembrane domains are encompassed by amino acids 145-195 from CD8 (PQRPEDCRPRGSVKGTGLDFACDIYIWAPLAGICVALLLSLIITLICYHSR; Nakauchi, supra) and 224-256 from ICAM-2 (MVIIVTVVSVLLSLFVTSVLLCFIFGQHLRQQR; Staunton, supra).
Alternatively, membrane anchoring sequences include the GPI anchor, which results in a covalent bond between the molecule and the lipid bilayer via a glycosyl-phosphatidylinositol bond, for example in DAF [PNKGSGTTSGTTRLLSGHTCFTLTGLLGTLVTMGLLT, with the bolded serine being the site of the anchor; see Homans et al., Nature 333(6170):269-72 (1988), and Moran et al., J. Biol. Chem. 266:1250-1257 (1991)]. In order to do this, the GPI sequence from Thy-1 can be inserted 3' of the variable region in place of a transmembrane sequence.
It is within the scope of this invention to display the DP-protein on membranes of viral, archaebacterial, prokaryotic and eukaryotic origin. In this embodiment, the DP-protein is fused to a membrane protein such that after insertion into the membrane, the DP-protein region will be located on the outside of the virus, archaebacteria, prokaryote or eukaryotic cell and thus be accessible for binding target molecules, when screening for binding target molecules. Prokaryotic surface display systems include, for example, functional fusions to surface proteins such as flagellin [Lu et al., Biotechnology 13(4):366-72 (1995)] and ice-nucleation protein [Jung et al., Nat. Biotechnol. 16(6):576-80 (1998)]. Other prokaryotic protein display systems are reviewed by Stahl and Uhlen, Trends Biotechnol. 15(5):185-92 (1997) and Georgiou et al., Nat. Biotechnol. 15(1):29-34 (1997). Viral display systems include, but are not limited to, (i) filamentous bacteriophages such as M13 and derivatives [for review see Felici et al., Biotechnol. Annu. Rev. 1:149-83 (1995)]; (ii) bacteriophage T4 [Jiang et al., Infect. Immun. 65(11):4770-7 (1997)]; (iii) bacteriophage lambda [Stolz et al., FEBS Lett. 440(1-2):213-7 (1998)]; (iv) tomato bushy stunt virus [Joelson et al., J. Gen. Virol. 78(Pt 6):1213-7 (1997)]; and (v) retrovirus [Buchholz et al., Nat. Biotechnol. 16(10):951-4 (1998)]. Yeast display systems, for example, employ C-terminal fusions to the Aga2p mating adhesion receptor of Saccharomyces cerevisiae [Boder and Wittrup, Nat. Biotechnol. 15(6):553-7 (1997)]. Display of proteins using any of the above listed systems or mammalian transmembrane proteins (some of which are described herein) is generally achieved by inserting the nucleic acid encoding the DP-protein (or any other protein of interest) in frame with an amino-terminal secretion signal and a C-terminal transmembrane anchoring domain (as further described below).
Similarly, myristylation sequences can serve as membrane anchoring sequences. It is known that the myristylation of c-src recruits it to the plasma membrane. This is a simple and effective method of membrane localization, given that the first 14 amino acids of the protein are solely responsible for this function: MGSSKSKPKDPSQR (see Cross et al., Mol. Cell. Biol. 4(9):1834-1842 (1984); Spencer et al., Science 262:1019-1024 (1993), both of which are hereby incorporated by reference). This motif has already been shown to be effective in the localization of reporter genes and can be used to anchor the zeta chain of the TCR. This motif is placed amino-terminal to the variable region in order to localize the fusion protein to the plasma membrane. Other modifications such as palmitoylation can be used to anchor fusion proteins in the plasma membrane; for example, palmitoylation sequences from the G protein-coupled receptor kinase GRK6 sequence [LLQRLFSRQDCCGNCSDSEEELPTRL, with the bold cysteines being palmitoylated; Stoffel et al., J. Biol. Chem. 269:27791-4 (1994)]; from rhodopsin [KQFRNCMLTSLCCGKNPLGD; Barnstable and Morabito, J. Mol. Neurosci. 5(3):207-9 (1994)]; and the p21 H-ras 1 protein [LNPPDESGPGCMSCKCVLS; Capon et al., Nature 302:33 (1983); Cadwallader et al., Mol. Cell. Biol. 14(7):4722-30 (1994)].
In a preferred embodiment, the fusion partner is a lysosomal targeting sequence, including, for example, a lysosomal degradation sequence such as Lamp-2 [KFERQ; Dice, Ann. N.Y. Acad. Sci. 674:58-64 (1992)]; or lysosomal membrane sequences from Lamp-1 [MLIPIAGFFALAGLVLIVLIAYLIGRKRSHAGYQTI, Uthayakumar et al., Cell. Mol. Biol. Res. 41:405-20 (1995)] or Lamp-2 [LVPIAVGAALAGVLILVLLAYFIGLKHHHAGYEQF, Konecki et al., Biochem. Biophys. Res. Comm. 205:1-5 (1994)], both of which show the transmembrane domains in bold and the cytoplasmic targeting signal underlined.
Alternatively, the fusion partner may be a mitochondrial localization sequence, including mitochondrial matrix sequences [yeast alcohol dehydrogenase III; MLRTSSLFTRRVQPSLFSRNILRLQST; Schatz, Eur. J. Biochem. 165:1-6 (1987)]; mitochondrial inner membrane sequences (yeast cytochrome c oxidase subunit IV; MLSLRQSIRFFKPATRTLCSSRYLL; Schatz, supra); mitochondrial intermembrane space sequences (yeast cytochrome c1; MFSMLSKRWAQRTLSKSFYSTATGAASKSGKLTQKLVTAGVAAAGITASTLLYADSLTAEAMTA; Schatz, supra) or mitochondrial outer membrane sequences (yeast 70 kD outer membrane protein; MKSFITRNKTAILATVAATGTAIGAYYYYNQLQQQQQRGKK; Schatz, supra).
The fusion partner may also be derived from endoplasmic reticulum sequences, including a sequence derived from calreticulin [KDEL; Pelham, Proc. R. Soc. Lond. B. Biol. Sci., 250:1-10 (1992)] or from adenovirus E3/19K protein [LYLSRRSFIDEKKMP; Jackson et al., EMBO J. 9:3153-62 (1990)].
Furthermore, targeting sequences also include peroxisome sequences [for example, the peroxisome matrix sequence from luciferase; SKL; Keller et al., Proc. Natl. Acad. Sci. USA 84:3264-8 (1987)]; farnesylation sequences [for example, P21 H-ras 1; LNPPDESGPGCMSCKCVLS, with the bold cysteine farnesylated; Capon, supra; Zhang et al., Biochemistry, 35(25):8166-71 (1996)]; geranylgeranylation sequences [for example, protein rab-5A; LTEPTQPTRNQCCSN, with the bold cysteines geranylgeranylated; Farnsworth, Proc. Natl. Acad. Sci. USA 91:11963-7 (1994)]; or destruction sequences [cyclin B1; RTALGDIGN; Klotzbucher et al., EMBO J. 15(12):3053-64 (1996)].
In a preferred embodiment, the targeting sequence is a secretory signal sequence capable of effecting the secretion of the DP-protein. There is a large number of known secretory signal sequences which, for example, when placed amino-terminal to the DP-protein region are cleaved from the respective fusion protein during the secretion process.
Suitable secretory signal sequences include those from IL-2 [MYRMQLLSCIALSLALVTNS; Villinger et al., J. Immunol. 155:3946-54 (1995)]; growth hormone [MATGSRTSLLLAFGLLCLPWLQEGSAFPT; Roskam and Rougeon, Nucleic Acids Res. 7:305-20 (1979)]; preproinsulin [MALWMRLLPLLALLALWGPDPAAAFVN; Bell et al., Nature 284:26-32 (1980)]; and influenza HA protein [MKAKLLVLLYAFVAGDQI; Sekikawa and Lai, Proc. Natl. Acad. Sci. USA 80:3563-71 (1983)], with cleavage between the non-underlined-underlined junction. A particularly preferred secretory signal sequence is the secretory signal sequence from the secreted cytokine IL-4, which comprises the first 24 amino acids of IL-4 as follows: MGLTSQLLPPLFFLLACAGNFVHG. Other secretory signal peptides are discussed in von Heijne, supra.
In a preferred embodiment, the fusion partner is a stability sequence which confers stability to DP or DP-protein or the nucleic acid encoding them. Thus, for example, proteins may be stabilized by the incorporation of glycines after the initiation methionine (MG or MGG), for protection of the protein from ubiquitination as per Varshavsky's N-End Rule [Bachmair et al., Science, 234:179-86 (1986); Gonda et al., J. Biol. Chem. 264:16700-12 (1989); Varshavsky, Genes Cells, 2(1):13-28 (1997)], thus conferring long half-life in the cytoplasm. Similarly, one or two prolines at the C-terminus yield peptides that are largely resistant to carboxypeptidase action. The presence of two glycines prior to the prolines both imparts flexibility and prevents structure-initiating events in the di-proline from being propagated into the candidate peptide structure. Thus, preferred stability sequences are as follows: MG(X)nGGPP, MG(X)nGPP, MGG(X)nGGPP, and MGG(X)nGPP, wherein X is any amino acid and n is an integer of at least four.
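By way of illustration only, the stability-sequence patterns of the form MG(X)nGPP can be sketched computationally. The following Python sketch is not part of the described methods; the function name `stabilized_peptide` and its parameters are hypothetical, and it merely wraps a core sequence (X)n of at least four residues with an MG or MGG amino-terminal cap and a GPP or GGPP carboxyl-terminal cap.

```python
# Illustrative sketch only; names and structure are assumptions, not from the source.
AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard one-letter codes


def stabilized_peptide(core, n_cap="MG", c_cap="GGPP"):
    """Return n_cap + core + c_cap, i.e. one of MG(X)nGGPP, MG(X)nGPP,
    MGG(X)nGGPP or MGG(X)nGPP, where core is (X)n with n >= 4."""
    if n_cap not in ("MG", "MGG"):
        raise ValueError("amino-terminal cap must be MG or MGG")
    if c_cap not in ("GPP", "GGPP"):
        raise ValueError("carboxyl-terminal cap must be GPP or GGPP")
    core = core.upper()
    if len(core) < 4:
        raise ValueError("core (X)n must contain at least four residues")
    if not set(core) <= AMINO_ACIDS:
        raise ValueError("core contains a non-standard residue code")
    return n_cap + core + c_cap


if __name__ == "__main__":
    # A seven-residue core used purely as example input.
    print(stabilized_peptide("EFLIVKS"))
    print(stabilized_peptide("ACDE", n_cap="MGG", c_cap="GPP"))
```

The length and alphabet checks simply mirror the stated constraints that X is any amino acid and n is at least four; whether a given construct is in fact stabilized would need to be determined experimentally.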
In a preferred embodiment, to increase the solubility of the DP-protein, lysines are added to the N-terminus, which may or may not comprise a glycine spacer. For example, the DP-protein K6G4-EFLIVKS-protein-EFLIVKS can be made, which has different characteristics than the DP-protein without the K6G4 sequence added (see Examples). In this embodiment, the number of lysine residues and linker sequence can be determined experimentally to ensure the resulting DP-protein has the desired characteristics.
In a preferred embodiment, combinations of fusion partners are used. Thus, for example, any number of combinations of fusion partners, targeting sequences, rescue sequences, and stability sequences may be used, with or without linker sequences. As is more fully described below, using a base vector that contains at least one cloning site for receiving random and/or biased libraries, one can cassette in nucleic acids encoding various fusion partners 5' and 3' of the nucleic acid encoding the DP-protein.
In a preferred embodiment, the DPs, DP-proteins, DPs fused to a fusion partner or DP-proteins fused to a fusion partner of the invention can be further modified.
A compound wherein at least one dimerization peptide (DP) is fused to a protein of interest for example, yielding DP-P, P-DP, DP-P-DP or similar compounds, as more fully described above, wherein DP is the dimerization peptide and P is a protein of interest, is collectively referred to as "DP-protein".
Covalent modifications of DP and DP-proteins are included within the scope of this invention.
One type of covalent modification includes reacting targeted amino acid residues with an organic derivatizing agent that is capable of reacting with selected side chains or the N- or C-terminal residues of DP or DP-protein. Derivatization with bifunctional agents is useful, for instance, for crosslinking DP or DP-protein to a water-insoluble support matrix or surface for use in the method for purifying anti-DP or anti-DP-protein antibodies or screening assays, as is more fully described below. Commonly used crosslinking agents include 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate.
Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the amino groups of lysine, arginine, and histidine side chains [Creighton, in Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group.
Another type of covalent modification of DP or DP-protein included within the scope of this invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties found in either DP or DP-protein, and/or adding one or more glycosylation sites that are not present in either DP or DP-protein.
Addition of glycosylation sites to DP or DP-protein may be accomplished by altering the amino acid sequence thereof. The alteration may be made, for example, by the addition of, or substitution by, one or more serine or threonine residues to the native sequence of DP or DP-protein (for O-linked glycosylation sites). The DP or DP-protein amino acid sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA encoding DP or DP-protein at preselected bases such that codons are generated that will translate into the desired amino acids. Methods for introducing mutations into DNA by in vitro mutagenesis are well known to those in the art and can be found, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989) and Ausubel et al., Short Protocols in Molecular Biology (John Wiley & Sons, Inc., 1995).
Another means of increasing the number of carbohydrate moieties on DP or DP-protein is by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, for example, in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., 10(4):259-306 (1981).
Removal of carbohydrate moieties present on DP or DP-protein may be accomplished chemically or enzymatically or by mutational substitution of codons encoding amino acid residues that serve as targets for glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by Sojar and Bahl, Arch. Biochem. Biophys., 259:52-57 (1987) and by Edge et al., Anal. Biochem., 118:131-137 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and exo-glycosidases as described by Thotakura and Bahl, Meth. Enzymol., 138:350-359 (1987).
Another type of covalent modification comprises linking a DP or a DP-protein to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.
As will be appreciated by those in the art, the DPs, DP-proteins, and fusion proteins of the invention can be made in a variety of ways.
In a preferred embodiment, the DPs, DP-proteins, and fusion proteins are made synthetically, as is well known in the art.
In a preferred embodiment, the DPs, DP-proteins, and fusion proteins are encoded by nucleic acids, as is well known in the art.
In a preferred embodiment, the DP-proteins, including candidate DP-proteins, are translation products of nucleic acids. The candidate DP-protein comprises a randomized test protein. That is, every candidate DP-protein has a randomized portion, as defined above, that is the basis of the screening methods outlined below. In addition to the randomized portion, the candidate DP-protein may also include a fusion partner. In this embodiment, the nucleic acids are introduced into cells, and the cells express the nucleic acids to generate DP-proteins (or candidate DP-proteins).
As outlined above, the DP-proteins are encoded by nucleic acids. A "nucleic acid", or "oligonucleotide", or a grammatical equivalent thereof herein means at least two nucleotide residues covalently linked together. A nucleic acid of the present invention will generally contain phosphodiester bonds.
Modifications of the ribose-phosphate backbone may be done to facilitate the addition of additional moieties such as labels, or to increase the stability and half-life of such molecules in physiological environments. The nucleic acids may be single stranded or double stranded, or contain portions of both double stranded and single stranded sequence. The nucleic acid may be RNA, comprising RNA, mRNA, and defined or random ribo-oligonucleotides. The nucleic acid may be DNA, comprising genomic DNA, cDNA and defined or random deoxyribo-oligonucleotides. The nucleic acid may also be a hybrid, where the nucleic acid contains any combination of deoxyribo- and ribo-nucleotides, and any combination of nucleotide bases.
The nucleic acids encode the DP-proteins and the fusion partners, if present. In addition, the nucleic acids will also generally contain extra sequences to effect translation or transcription, as necessary.
Usually, the nucleic acid encoding the DP proteins is incorporated into a suitable vector such as plasmid vectors or retroviral vectors. In a preferred embodiment, when plasmid vectors are used to express the DP-proteins, the nucleic acid is generally DNA. In another preferred embodiment, when retroviral vectors are used to express the DP-proteins, the nucleic acid is generally RNA.
In a preferred embodiment, vectors are used to express candidate DP-proteins. By "vector" herein is meant a replicon which comprises nucleic acid and can be used for the transformation of host cells. The vectors may be either self-replicating extrachromosomal vectors, referred to as "plasmids" or "plasmid vectors", or vectors which integrate into a host genome. A preferred embodiment utilizes retroviral vectors, as is more fully described below.
For non-retroviral embodiments, suitable vectors are derived from any number of known vectors, including, but not limited to, pcDNA3.1 (Invitrogen), pSI (Promega Corporation), and pBI (Clontech Laboratories, Inc.). Basically, any mammalian expression vectors with strong promoters such as CMV can be used to construct vectors expressing DP-proteins.
Generally, these expression vectors include transcriptional and translational regulatory nucleic acid operably linked to nucleic acids which are to be expressed. "Operably linked" in this context means that the transcriptional and translational regulatory nucleic acid is positioned relative to a coding sequence (e.g., encoding DP-protein) in such a manner that transcription is initiated and translation of the protein is assured. Generally, this will mean that the promoter and transcriptional initiation or start sequences are positioned 5' to the coding region. The transcriptional and translational regulatory nucleic acid will generally be appropriate to the host cell used, as will be appreciated by those in the art. Numerous types of appropriate expression vectors, and suitable regulatory sequences, are known in the art for a variety of host cells.
In general, the transcriptional and translational regulatory sequences may include, but are not limited to, promoter sequences (including CAAT box and TATA box), ribosomal binding sites (including internal ribosome entry sites (IRES)), transcriptional start and stop sequences (including mRNA polyadenylation sequence 5'-AATAAA-3'), RNA splicing sequences, translational start and stop sequences (including 5' and 3' untranslated regions, initiator codon (ATG), Kozak consensus sequence (5'-A/GNNATGG-3') and nonsense codons (UAA, UAG, UGA)), either constitutive or inducible enhancer, activator or repressor sequences (located either upstream, downstream or overlapping relative to promoter and being either cell-line dependent, tissue-specific or temporally dependent), and protein targeting signals (including signals for endoplasmic reticulum retention and extracellular secretion, signals for localization to plasma membranes, peroxisomes, nucleus, mitochondria, lysosomes, Golgi complex and focal adhesions).
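As an illustrative aside, two of the short motifs named above, the Kozak consensus sequence 5'-A/GNNATGG-3' and the polyadenylation sequence 5'-AATAAA-3', lend themselves to simple pattern searches in a DNA construct. The Python sketch below is an assumption-laden example rather than a described method: the function names are hypothetical, only the sense strand is scanned, and a match indicates only that the motif is present, not that it is functional.

```python
import re

# Kozak consensus 5'-A/GNNATGG-3' wrapped in a lookahead so that
# overlapping matches are also reported.
KOZAK = re.compile(r"(?=([AG][ACGT]{2}ATGG))")
POLYA_SIGNAL = "AATAAA"  # mRNA polyadenylation sequence, DNA alphabet


def find_kozak_sites(dna):
    """Return 0-based start positions of every Kozak consensus match."""
    return [m.start(1) for m in KOZAK.finditer(dna.upper())]


def find_polya_signals(dna):
    """Return 0-based start positions of every AATAAA occurrence."""
    dna = dna.upper()
    last = len(dna) - len(POLYA_SIGNAL)
    return [i for i in range(last + 1) if dna.startswith(POLYA_SIGNAL, i)]


if __name__ == "__main__":
    # Toy sequence invented for this example; not taken from any vector.
    toy = "GCCACCATGGCTTAAGGAATAAAGG"
    print(find_kozak_sites(toy))    # [3]
    print(find_polya_signals(toy))  # [17]
```

In the toy sequence the initiator ATG lies three bases downstream of the reported Kozak start position, consistent with the A/G at position -3 of the consensus.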
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional start and stop sequences. Promoter sequences include constitutive and inducible promoter sequences [for example, see Walther and Stein, J. Mol. Med. 74(7):379-92 (1996)]. In a preferred embodiment, the promoters are constitutive and drive the expression of the DP-protein encoding nucleic acid at a high level. The promoters may be either naturally occurring promoters, hybrid or synthetic promoters.
Hybrid promoters, which combine elements of more than one promoter, are also known in the art, and are useful in the present invention.
Particularly preferred promoters for expression in mammalian cells are CMV promoters. Preferred retroviral promoters are discussed below.
In a preferred embodiment, the promoter is associated with at least one copy of a nucleic acid encoding the DP-protein. Individual components encoding parts of the fusion protein, such as the dimerization protein, the protein of interest and one or more fusion partners can be inserted in a parental vector which comprises at least one suitable cloning site, preferably 3' to the promoter sequence. In a preferred embodiment, the fusion protein encoding nucleic acid is composed of individual components to generate a fusion protein such as DP-L-protein-L-DP or N-DP-L-protein-L-DP, wherein 'N' is a nuclear localization signal, 'DP' is a dimerization peptide, 'L' is a linker sequence and 'protein' is a protein of interest. As discussed in detail above, many possible combinations of nucleic acid components encoding individual components of the fusion protein can be constructed.
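The modular layout N-DP-L-protein-L-DP can be illustrated at the amino acid level with a short Python sketch. This is a hypothetical illustration, not a described procedure: the `Component` class and `assemble_fusion` function are invented names, the four-glycine linker and the five-residue "protein" are arbitrary placeholders, the SV40 NLS is written in one-letter code (PKKKRKV), and EFLIVKS is assumed here to stand in for a dimerization peptide because it appears in that position in the K6G4 example of this document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    label: str     # symbol used in the layout string, e.g. "DP"
    sequence: str  # one-letter amino acid sequence


def assemble_fusion(components):
    """Join components in order; return (layout string, fused sequence)."""
    layout = "-".join(c.label for c in components)
    sequence = "".join(c.sequence for c in components)
    return layout, sequence


# Example components; linker and protein are placeholders (assumptions).
N = Component("N", "PKKKRKV")          # SV40 large T antigen NLS
DP = Component("DP", "EFLIVKS")        # assumed dimerization peptide
L = Component("L", "GGGG")             # hypothetical glycine linker
PROTEIN = Component("protein", "MASTQ")  # placeholder protein of interest

if __name__ == "__main__":
    layout, seq = assemble_fusion([N, DP, L, PROTEIN, L, DP])
    print(layout)  # N-DP-L-protein-L-DP
    print(seq)
```

Keeping each component as a separate labeled unit mirrors the cassette-style construction described in the text, in which individual elements are swapped in or out of a parental vector; an actual construct would be assembled at the nucleic acid level, in frame, and would include an initiation codon.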
Generation of such vectors is performed using methods known to those in the art which are, for example, described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989) and Ausubel et al., Short Protocols in Molecular Biology (John Wiley & Sons, Inc., 1995). Pre-configured vectors are suitable to be included in kits. The end user of such vectors will have to insert the nucleic acid encoding a protein of interest or a library of proteins of interest into convenient cloning sites.
In another preferred embodiment, a rescue sequence is used to isolate the nucleic acid encoding the DP-protein. In this embodiment the rescue sequence may be a unique oligonucleotide sequence which serves as a probe target site to allow quick and easy isolation of the nucleic acid construct, via PCR, hybridization, or related techniques.
In addition, the vector may comprise additional elements such as an origin of replication, selection genes, etc., as is more fully described in Kriegler, in Gene Transfer and Expression: A Laboratory Manual, Freeman and Company, New York, (1990) and Murray, Methods in Molecular Biology, Vol 7: Gene Transfer and Expression Protocols, Humana Press (1991).
The nucleic acid encoding the protein of interest may be obtained from genomic DNA, cDNA, from defined oligonucleotides or from random nucleotides.
Usually the DP-proteins and DP-fusion proteins will be encoded by nucleic acids and are generated after transcription thereof and translation of the corresponding mRNA. In one preferred embodiment, concatemers of a nucleic acid encoding, for example, a DP fusion-peptide such as illustrated above (DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys-Lp-protein2-Lp-DPGlu) can be inserted into suitable cloning vectors (as detailed below) resulting in the generation of concatemerized DP-fusion proteins such as (DPhyd-Lp-protein1-Lp-DPhyd-LG-DPLys-Lp-protein2-Lp-DPGlu)n, wherein n is an integer of at least 2. As will be obvious to those in the art, a plurality of DP fusion protein encoding nucleic acids other than those illustrated herein, including bivalent and monovalent derivatives thereof, can be combined in suitable vectors and the corresponding DP-proteins can be made.
In one embodiment, retroviral vectors are used to express the candidate DP-proteins and the nucleic acid encoding the candidate DP-protein is generally RNA.
A particularly well suited retroviral transfection system is described in Mann et al., Cell 33:153-159 (1983); Pear et al., Proc. Natl. Acad. Sci. USA 90(18):8392-6 (1993); Kitamura et al., Proc. Natl. Acad. Sci. USA 92:9146-9150 (1995); Kinsella et al., Hum. Gene Ther. 7:1405-1413 (1996); Hofmann et al., Proc. Natl. Acad. Sci. USA 93:5185-5190 (1996); Choate et al., Hum. Gene Ther. 7:2247-53 (1996); and WO 94/19478 and PCT/US97/01019; and references cited therein, all of which are expressly incorporated by reference.
Any number of suitable retroviral vectors may be used. Preferred retroviral expression vectors include vectors based on the murine stem cell virus [MSCV; see Hawley et al., Gene Ther. 1:136-8 (1994)] and a modified MFG virus [Riviere et al., Proc. Natl. Acad. Sci. USA 92:6733-7 (1995)], and pBABE (see PCT/US97/01019, incorporated by reference). Other suitable retroviral expression vectors are derived from Moloney murine leukemia virus and include vectors such as pLNCX, pLXSN, pLAPSN; a self-inactivating expression vector, such as pSIR; a bicistronic expression vector, such as pLXIN; inducible expression vectors, such as pRevTet-On, pRevTet-Off [Clontech Laboratories; see also Coffin and Varmus, in Retroviruses (Cold Spring Harbor Laboratory Press, New York, 1996)].
As described above for other vectors, retroviral vectors may include inducible and constitutive promoters.
Constitutive promoters are preferred and include, but are not limited to, CMV, SV40, SRα, RSV, EF-1α, UbC and TK.
Generally, the retroviral expression vectors may include one or more selection genes (also referred to as selectable marker genes) under the control of internal ribosome entry sites (IRES), which allows for bicistronic operons and thus greatly facilitates the selection of cells expressing fusion constructs at uniformly high levels; and promoters driving expression of a second gene, placed in sense or anti-sense relative to the 5' LTR.
Selection genes allow the selection of transformed host cells containing the vector and, particularly in the case of mammalian cells, ensure the stability of the vector, since cells which do not contain the vector will generally die. Selection genes are well known in the art and will vary with the host cell used. By "selection gene" herein is meant any gene which encodes a gene product that either confers resistance to a selection agent or that serves as a marker allowing selection of the cell expressing this marker. Suitable selection agents include, but are not limited to, neomycin (or its analog G418), blasticidin S, histidinol D, bleomycin, puromycin, hygromycin B, and other drugs. Suitable marker genes, which can be inserted into a bicistronic transcriptional unit (see above) and subsequently allow the identification of host cells expressing a gene of interest, include, but are not limited to, self-fluorescent markers such as green fluorescent protein, enzymatic markers such as lacZ, and surface proteins such as CD8, etc.
As described for the other vectors, the retroviral vectors may comprise a variety of transcriptional and translational regulatory sequences and at least one cloning site for the subcloning of at least one recombinant DNA fragment.
The compositions of the invention are introduced into host cells to screen for bioactive agents capable of altering the phenotype of a cell which expresses a gene of interest or protein of interest. By "introduced into" or grammatical equivalents herein is meant that the nucleic acids enter the cells in a manner suitable for subsequent expression of the nucleic acid. The method of introduction is largely dictated by the targeted cell type, discussed below. Exemplary methods include CaPO4 precipitation, liposome fusion, Lipofectin®, electroporation, viral infection, etc. [see Kriegler, Gene Transfer and Expression: A Laboratory Manual (New York: Oxford University Press, 1991); Roth, Protein Expression in Animal Cells, Methods in Cell Biology Vol. 43 (San Diego: Academic Press, 1994); and Murray, Gene Transfer and Expression Protocols, Methods in Molecular Biology, Vol. 7 (Clifton: Humana Press, 1991)].
The compositions of the invention may stably integrate into the genome of the host cell (for example, when using retroviral particles), or may exist either transiently or stably in the cytoplasm (i.e., through the use of traditional plasmids, utilizing standard regulatory sequences, selection markers, etc.). As many pharmaceutically important screens require human or model mammalian cell targets, retroviral vectors capable of transfecting such targets are preferred.
As will be appreciated by those in the art, the type of cells used in the present invention can vary widely.
Basically, any cell may be used, with mammalian cells being preferred, with mouse, rat, primate and human cells being particularly preferred. As is more fully described below, a screen will be set up such that the cells exhibit a selectable phenotype in the presence of a candidate DP-protein. As is more fully described below, cell types implicated in a wide variety of disease conditions are particularly useful, so long as a suitable screen may be designed to allow the selection of cells that exhibit an altered phenotype as a consequence of the presence of a candidate DP-protein within the cell.
Accordingly, suitable cell types include, but are not limited to, tumor cells of all types (particularly melanoma, myeloid leukemia, carcinomas of the lung, breast, ovaries, colon, kidney, prostate, pancreas and testes), cardiomyocytes, endothelial cells, epithelial cells, lymphocytes (T-cell and B-cell), mast cells, eosinophils, vascular intimal cells, hepatocytes, leukocytes including mononuclear leukocytes, stem cells such as hematopoietic, neural, skin, lung, kidney, liver and myocyte stem cells (for use in screening for differentiation and de-differentiation factors), osteoclasts, chondrocytes and other connective tissue cells, keratinocytes, melanocytes, liver cells, kidney cells, and adipocytes. Suitable cells also include known research cells, including, but not limited to, Jurkat T cells, NIH 3T3 cells, CHO, Cos, etc. See the ATCC cell line catalog, hereby expressly incorporated by reference.
In one embodiment, the cells may be genetically engineered, that is, contain exogenous nucleic acid (for example, encoding a target molecule) in addition to the compositions of the invention.
Once made, the compositions of the invention find use in a number of applications. The present invention provides compositions which are useful to identify, both in vivo and in vitro, proteins capable of interacting with, binding to or modulating the activity of a second protein.
In a preferred embodiment the present invention provides methods and compositions to create, effectively introduce into cells and screen compounds that affect a signaling pathway. Little or no knowledge of the pathway is required, other than a presumed signaling event and an observable physiologic change in the target cell. The disclosed methods comprise an in vivo stratagem for accessing intracellular signaling mechanisms. The invention also provides for the isolation of the constituents of the pathway, the tools to characterize the pathway, and lead compounds for pharmaceutical development.
The present invention provides methods for the screening of compounds, referred to herein as DP-proteins, which are capable of altering the phenotype of cells comprising them. By "candidate DP-protein" herein is meant a DP-protein for which a function, an intrinsic property, or an interaction with a second protein is sought. While the "DP" component of candidate DP-proteins is generally not changed within a molecular library, the "protein" component of candidate DP-proteins is variable.
In one embodiment, a plurality of candidate DP-proteins is provided in the form of a molecular library. The term "molecular library" herein is meant to include a plurality of different DP-proteins, a plurality of isolated different nucleic acids encoding a plurality of different DP-proteins, and a plurality of different nucleic acids which encode a plurality of different DP-proteins and which are comprised by vectors. The methods of the present invention provide for the rapid in vivo screening of molecular libraries comprising large numbers of candidate DP-proteins, wherein the "protein" components of DP-proteins are encoded by a candidate nucleic acid comprising random oligonucleotides, cDNA fragments or genomic DNA.
Thus, by delivering the random oligonucleotides, cDNA fragments and genomic DNA to cells, the cellular machinery generates the candidate DP-proteins. By screening the same cells, without the need to collect or synthesize in vitro the candidate DP-protein, highly efficient screening is accomplished. Thus, the present invention provides methods for screening a plurality of candidate DP-proteins, for effectors capable of altering the phenotype of a cell.
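As an illustration of how random oligonucleotides delivered to cells give rise to candidate peptides, the following minimal Python sketch generates random inserts and translates them with the standard genetic code. The function names, the insert length and the fixed random seed are assumptions made purely for this illustration and are not taken from the specification; the sketch also shows why fully random inserts are frequently truncated by stop codons.

```python
import itertools
import random

# Standard genetic code, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA coding strand in frame 0, stopping at the first stop codon."""
    peptide = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon terminates the peptide
            break
        peptide.append(aa)
    return "".join(peptide)


def random_insert(n_codons, rng):
    """Return a random oligonucleotide of n_codons codons (hypothetical helper)."""
    return "".join(rng.choice(BASES) for _ in range(3 * n_codons))


def theoretical_diversity(n_residues):
    """Number of distinct peptides of a given length over 20 amino acids."""
    return 20 ** n_residues


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed only so that repeated runs agree
    for _ in range(3):
        insert = random_insert(10, rng)
        print(insert, translate(insert))
    print("possible 10-mers:", theoretical_diversity(10))
```

Because three of the 64 codons are stop codons, a fraction of fully random inserts yields peptides shorter than the insert length, which is one reason library size and peptide diversity are not identical.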
Signaling pathways in cells often involve an effector stimulus (e.g., chemokine, growth factor, hormone, etc.) that leads to a phenotypically describable change in cellular physiology. Despite the key role intracellular signaling pathways play in disease pathogenesis, in most cases, little is known about a signaling pathway other than the initial stimulus and the ultimate cellular response. When peptides are intracellularly expressed, they may modulate intracellular signaling pathways (Souroujon and Mochly-Rosen, Nat. Biotechnol. 16(10):919-24 (1998)) and thus may participate in protein-protein interactions.
Molecular libraries of chemical compounds or peptides were screened for effector molecules that modulate (i.e., up-regulate or down-regulate) signaling pathways. Thus constrained peptides contained in minimized proteins may also be useful in the design of agents modulating intracellular protein-protein interactions [Cunningham and Wells, Curr. Opin. Struct. Biol. 7:457-462 (1997)], which may offer a novel method of regulating intracellular signaling pathways. If the peptides are expressed in live mammalian cells, for example by using retroviral vectors, they may be screened for defined changes in cellular phenotype, and the resulting active peptides may provide a route for the affinity isolation of their binding targets.
Some form of conformationally constrained peptides may be useful and even necessary in displaying peptides for intracellular combinatorial chemistry in live mammalian cells. Unlike peptides in phage display libraries, intracellular peptides may be subject to catabolism and thus preferably these peptides should be relatively inert to cellular proteases. Although intracellular peptide catabolism has not been well characterized, the ubiquitin-proteasome system is known to be involved in the degradation of proteins [Goldberg et al, Biol. Chem. 378:131-140 (1997); Hilt and Wolf, Trends Biochem. Sci. 21:96-102 (1996)], and can act as a carboxy-octapeptidase. Further proteolysis, perhaps involving aminopeptidases, can result in the degradation of peptides to amino acids [Lee and Goldberg, Trends Cell Biol. 8:397-403 (1998)]. In antigen presenting cells, short linear peptides resulting from cytoplasmic proteolysis can be removed to the endoplasmic reticulum by the peptide transporters TAP1 and TAP2 [Belich and Trowsdale, Mol. Biol. Rep. 21:53-56 (1995)].
Developing a scaffold for the intracellular display of expressed peptides (i) which is relatively inert to proteolysis, resulting in enhanced intracellular stability and a higher steady state concentration of the expressed protein, and (ii) which is also small enough to allow access to binding sites on proteins, such as active site crevices, may be very useful. The compact nature of this scaffold should decrease the flexibility of the expressed protein and decrease the conformational entropy, effectively increasing the concentration of individual conformers. This and the increased stability to proteolysis should in turn make these scaffolds (i.e., when used as peptide libraries) more likely to contain active proteins, since the higher concentrations should allow saturation of weaker binding interactions. This benefits screening protocols to detect bioactive peptides, by allowing phenotypic selection of lower affinity peptides, and thus allowing more bioactive peptides to be detected. Such features of enhanced proteolytic stability and diminished conformational entropy may also make the more compact structure more attractive as a potential therapeutic. Addition of specific short sequences to the N- and C-terminus of the peptide may be useful for enhancing the above properties. A loop structure [Leszczynski and Rose, Science 234:849-855 (1986)] may be of particular interest, since loops are globular and compact, are common on protein surfaces, and may be frequently involved in protein function and protein-protein interactions.
In a preferred embodiment, the compositions of the invention are used to screen for candidate bioactive agents; that is, the test protein within the DP-protein (see above) is a candidate bioactive agent. The candidate DP-proteins, as part of a molecular library, are introduced into suitable host cells to screen for DP-proteins capable of altering the phenotype of the host cell harboring or expressing such a candidate DP-protein. If necessary, the cells are subjected to conditions suitable for the expression of genes encoding the candidate DP-proteins (for example, when inducible promoters are used), to produce the candidate expression products.
In a preferred embodiment, a first plurality of cells is screened. That is, the cells into which a molecular library is introduced, which provides candidate DP-proteins, are screened for an altered phenotype.
Thus, in this embodiment, the effect of the candidate DP-protein is seen in the same cells in which it is made; i.e. an autocrine effect.
By a "plurality of cells" herein is meant roughly from about 10³ cells to 10⁸ or 10⁹, with from 10⁶ to 10⁸ being preferred. This plurality of cells comprises a cellular library, wherein generally each cell within this cellular library contains a member of the molecular library, i.e. a different candidate DP-protein or a different DP-protein encoding nucleic acid, although as will be appreciated by those in the art, some cells within the cellular library may not contain a member of the molecular library, and some may contain more than one. When methods other than retroviral infection are used to introduce the candidate DP-protein into a plurality of cells, the distribution of candidate nucleic acids within the individual cell members of the cellular library may vary widely, as it is generally difficult to control the number of nucleic acids which enter a cell during electroporation, etc.
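The observation that some cells receive no library member while others receive more than one can be illustrated with a simple occupancy estimate. The sketch below assumes, purely for illustration, that the number of library members per cell follows a Poisson distribution with a chosen mean; neither the Poisson model nor the example means are stated in the specification, and the function names are hypothetical.

```python
import math


def poisson_pmf(k, mean):
    """Probability of exactly k library members in a cell under a Poisson model."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def occupancy(mean):
    """Fractions of cells carrying no, exactly one, or more than one library member."""
    p_none = poisson_pmf(0, mean)
    p_one = poisson_pmf(1, mean)
    return {
        "none": p_none,
        "one": p_one,
        "more_than_one": 1.0 - p_none - p_one,
    }


if __name__ == "__main__":
    # Example means are arbitrary values chosen for the illustration.
    for mean in (0.1, 0.5, 1.0, 3.0):
        fractions = occupancy(mean)
        print(mean, {name: round(value, 3) for name, value in fractions.items()})
```

Under this assumed model, lowering the mean number of members per cell reduces the fraction of cells carrying several members at the cost of more cells carrying none, which is the trade-off implied when the delivery method cannot be tightly controlled.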
In a preferred embodiment, the molecular library is introduced into a first plurality of cells, and the effect of the expressed candidate DP-protein is screened in a second or third plurality of cells, different from the first plurality of cells, i.e. generally a different cell type. That is, the effect of the candidate DP-protein is due to an extracellular effect on a second cell; i.e. an endocrine or paracrine effect. This is done using standard techniques. The first plurality of cells may be grown in or on one media, and the media (referred to as "conditioned media") is allowed to touch a second plurality of cells, and the effect measured.
Alternatively, there may be direct contact between the cells. Thus, "contacting" is functional contact, and includes both direct and indirect. In this embodiment, the first plurality of cells may or may not be screened.
Thus, the methods of the present invention comprise introducing a molecular library of randomized candidate nucleic acids into a plurality of cells, generating a cellular library. Each of the nucleic acids comprises a different, generally randomized, nucleotide sequence, encoding a different DP-protein. The plurality of cells is then screened, as is more fully outlined below, for a cell exhibiting an altered phenotype. The altered phenotype is due to the presence of a DP-protein.
By "altered phenotype" or "changed physiology" or other grammatical equivalents herein is meant that the phenotype of the cell is altered in some way, preferably in some detectable and/or measurable way. As will be appreciated in the art, a strength of the present invention is the wide variety of cell types and potential phenotypic changes which may be tested using the present methods. Accordingly, any phenotypic change which may be observed, detected, or measured may be the basis of the screening methods herein. Suitable phenotypic changes include, but are not limited to: gross physical changes such as changes in cell morphology, cell growth, cell viability, adhesion to substrates or other cells, and cellular density; changes in the expression of one or more RNAs, mRNAs, proteins, lipids, hormones, cytokines, or other molecules; changes in the equilibrium state (i.e., half-life) of one or more RNAs, mRNAs, proteins, lipids, hormones, cytokines, or other molecules; changes in the localization of one or more RNAs, mRNAs, proteins, lipids, hormones, cytokines, or other molecules; changes in the bioactivity or specific activity of one or more RNAs, mRNAs, proteins, lipids, hormones, cytokines, receptors, or other molecules; changes in the secretion of ions, cytokines, hormones, growth factors, proteins, or other molecules; alterations in cellular membrane potentials, polarization, integrity or transport; changes in infectivity, susceptibility, latency, adhesion, and uptake of viruses and bacterial pathogens; etc.
By "capable of altering the phenotype" or grammatical equivalents herein is meant that a candidate DP-protein can change the phenotype of the cell in some detectable and/or measurable way.
The altered phenotype may be detected in a wide variety of ways, as is described more fully below and in PCT/US97/01019, and will generally depend on and correspond to the phenotype that is being changed.
Generally, the changed phenotype is detected using, for example: microscopic analysis of cell morphology; standard cell viability assays, including both increased cell death and increased cell viability, for example, cells that are now resistant to cell death via virus, bacteria, or bacterial or synthetic toxins; standard labeling assays such as fluorometric indicator assays for the presence or level of a particular cell or molecule, including FACS or other dye staining techniques; biochemical detection of the expression of target compounds after killing the cells; monitoring changes in gene expression within a target cell, etc. In some cases, as is more fully described herein, the altered phenotype is detected in the cell in which the molecular library comprising the randomized nucleic acid or randomized proteins was introduced; in other embodiments, the altered phenotype is detected in a second cell which is responding to some molecular signal from the first cell.
In a preferred embodiment, upon its translocation into the nucleus, the DP-protein modulates gene expression, causing an increase or a decrease of expression of a target gene. In one embodiment, a transcriptional activation protein binds to the DP-protein and thus either may be inactivated or prevented from activating its target gene. In this embodiment, the DP-protein comprises a protein which has an affinity to the target transcriptional activator, for example the HIV tat protein. In another embodiment, the DP-protein may lead to an increased expression of a target gene, by virtue of comprising a protein component which has an affinity to a transcriptional repressor. Upon binding of the transcriptional repressor to the DP-protein, the repressor either may be inactivated or prevented from binding to its target gene, thus leading to a higher expression of the gene of interest.
In a preferred embodiment, once a cell with an altered phenotype is detected, the cell is isolated from the plurality of cells which do not have altered phenotypes. This may be done in any number of ways, as is known in the art, and will in some instances depend on the assay or screen. Suitable isolation techniques include, but are not limited to, FACS, lysis selection using complement, cell cloning, scanning by Fluorimager, expression of a "survival" protein, induced expression of a cell surface protein or other molecule that can be rendered fluorescent or taggable for physical isolation; expression of an enzyme that changes a non-fluorescent molecule to a fluorescent one; overgrowth against a background of no or slow growth; death of cells and isolation of DNA or other cell vitality indicator dyes, etc.
In a preferred embodiment, the candidate nucleic acid encoding the candidate DP-protein and/or the candidate DP-protein is isolated from the cell with an altered phenotype. This may be done in a number of ways. In a preferred embodiment, primers complementary to DNA regions common to the vector, or to specific components of the molecular library such as a rescue sequence, defined above, are used to "rescue" the unique random nucleic acid encoding the candidate DP-protein. Alternatively, the candidate DP-protein is isolated using a rescue sequence which is operably linked to the candidate DP-protein (as described above). Thus, for example, rescue sequences comprising epitope tags or purification sequences may be used to pull out the bioactive agent, using immunoprecipitation or affinity columns. In some instances, as is outlined below, this may also pull out the primary target molecule, if there is a sufficiently strong binding interaction between the bioactive agent and the target molecule. Alternatively, the peptide may be detected using mass spectroscopy.
Once rescued, the sequence of the candidate nucleic acid encoding the candidate DP protein and/or the sequence of the candidate DP-protein is determined. This information can then be used in a number of ways.
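The rescue step, in which primers complementary to vector regions flanking the library insert are used to recover the unique candidate sequence, can be sketched in silico as locating the sequence lying between two constant flanks in a sequencing read. The flank sequences and function names below are hypothetical placeholders, not sequences from any vector named in this document, and the sketch assumes exact, error-free matches.

```python
# Hypothetical constant vector sequences flanking the library insert.
LEFT_FLANK = "CTGACTGA"
RIGHT_FLANK = "GTCCATGG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def rescue_insert(read, left_flank=LEFT_FLANK, right_flank=RIGHT_FLANK):
    """Return the insert between the two flanks, checking both strands.

    Returns None when the flanks are not found in the expected order.
    """
    for strand in (read, reverse_complement(read)):
        start = strand.find(left_flank)
        if start == -1:
            continue
        start += len(left_flank)
        end = strand.find(right_flank, start)
        if end != -1:
            return strand[start:end]
    return None


if __name__ == "__main__":
    read = "TT" + LEFT_FLANK + "ATGGCC" + RIGHT_FLANK + "AA"
    print(rescue_insert(read))
    print(rescue_insert(reverse_complement(read)))
```

A practical pipeline would tolerate sequencing errors in the flanks; exact matching is used here only to keep the illustration short.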
Often, when genomic libraries or cDNA libraries or DNA fragments obtained therefrom are employed in the screening method outlined herein (i.e., when they are used to encode candidate DP-proteins), the nucleic acid sequence encoding the test protein is not full-length, i.e., the nucleic acid sequence does not encode the complete test protein. By "full-length" cDNA, gene, mRNA, RNA or grammatical equivalents herein is meant any nucleic acid which encodes a complete protein as it is encoded by its corresponding cellular genetic locus. In addition to the complete protein encoding sequence, a full-length cDNA, gene, mRNA or RNA may optionally contain 5' and 3' untranslated nucleic acid sequences. The complete protein may include amino acids incorporated by translation of the corresponding mRNA that may subsequently be eliminated from the native protein, e.g. secretory signal peptide sequences or sequences involved in protein splicing and protein processing. By "full-length protein" or grammatical equivalents herein is meant a protein encoded by a full-length cDNA, gene, RNA or mRNA. As appreciated by those in the art, full-length proteins may include posttranslational modifications, including, but not limited to, signal peptide cleavage, protein splicing, protein precursor processing, glycosylation, and the like. Accordingly, a "partial cDNA", "partial gene", "partial mRNA", "partial RNA" or a "partial protein" or grammatical equivalents are meant to indicate a cDNA, gene, mRNA, RNA or a protein which represents a fragment of a full-length cDNA, gene, mRNA, RNA or a protein. Accordingly, in a preferred embodiment, the determined nucleic acid sequence information of the rescued partial protein will be used to isolate the full-length coding sequence of the DP-protein. The isolation and characterization of a full-length coding sequence using partial sequence information is well known in the art.
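A rescued sequence can be given a quick first-pass check for whether it could encode a complete protein. The sketch below is only a heuristic and rests on assumptions that are not part of the specification: the input is taken to be the coding strand already trimmed to the candidate open reading frame, and "putative full-length" means nothing more than an ATG start codon, a terminal stop codon and no internal stop codon. It does not establish that the sequence matches the protein encoded by the cellular genetic locus, and the function name is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def classify_orf(cds):
    """Heuristically label a coding sequence as 'putative full-length' or 'partial'."""
    cds = cds.upper()
    # Split into complete codons, ignoring any trailing one or two bases.
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    has_start = bool(codons) and codons[0] == "ATG"
    has_terminal_stop = (
        len(cds) % 3 == 0 and bool(codons) and codons[-1] in STOP_CODONS
    )
    has_internal_stop = any(codon in STOP_CODONS for codon in codons[:-1])
    if has_start and has_terminal_stop and not has_internal_stop:
        return "putative full-length"
    return "partial"


if __name__ == "__main__":
    for seq in ("ATGGCCTAA", "GCCGCCTAA", "ATGGCC"):
        print(seq, classify_orf(seq))
```

A sequence labelled "partial" by such a check would be a candidate for the full-length isolation described above.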
In a preferred embodiment, the nucleic acid encoding the candidate DP-protein, or a nucleic acid encoding a full-length version thereof or any fragment of the full-length version, or a derivative of the candidate DP-protein (see below), is reintroduced into the host cells, to verify the originally observed altered phenotype of the cell. These cells may be the same as in the original screening experiment or different. This may be done using retroviruses, or alternatively using fusions to the HIV-1 Tat protein and analogs and related proteins, which allows very high uptake into target cells. See for example, Fawell et al., Proc. Natl. Acad. Sci. USA 91:664-8 (1994); Frankel and Pabo, Cell 55:1189-93 (1988); Savion et al., J. Biol. Chem. 256:1149-54 (1981); Derossi et al., J. Biol. Chem. 269:10444-50 (1994); and Baldin et al., EMBO J. 9:1511-7 (1990), all of which are incorporated by reference.
In a preferred embodiment, a recombinant DP-protein is generated (as outlined further below) and used to confirm the alteration of the phenotype of a target cell. This is a preferred embodiment when the alteration of a phenotype was observed in a second or third plurality of cells as described above. That is, the effect of the candidate DP-protein may be due to its secretion from a first cell, wherein it was generated, followed by its binding to a cellular receptor on the second cell (i.e., a different cell) or internalization by a different means and subsequently exerting its effect in or on this second cell. In this embodiment, the recombinant DP-protein or a derivative thereof is provided to the second cell and an alteration of phenotype is monitored.
In a preferred embodiment, the nucleic acids encoding the DP-protein or a derivative thereof (referred to herein also as protein of interest) are used to express the respective recombinant protein. A variety of expression vectors, including viral and non-viral expression vectors can be made which are useful for recombinant protein expression in a variety of systems, including, but not limited to, yeast, bacteria, archaebacteria, fungi, insect cells and animal cells, including mammalian cells.
The protein of interest may also be expressed as a fusion protein, including fusions to fusion partners, as outlined before, or fusions to other protein sequences. Recombinant proteins of interest are produced by culturing host cells into which nucleic acids encoding the protein of interest (generally as an expression vector) are introduced, under the appropriate conditions that induce or cause expression of the recombinant protein.
In a preferred embodiment, the recombinant protein is purified following expression. Numerous suitable methods for recombinant protein expression, including generation of expression vectors, generation of fusion proteins, introducing expression vectors into host cells, protein expression in host cells, and purification methods are known to those in the art and are described, for example, in the following textbooks: Ausubel et al., Short Protocols in Molecular Biology (John Wiley & Sons, Inc., 1995); O'Reilly et al., Baculovirus Expression Vectors: A Laboratory Manual (New York: Oxford University Press, 1994); Kriegler, Gene Transfer and Expression: A Laboratory Manual (New York: Oxford University Press, 1991); and Deutscher, Guide to Protein Purification, Methods in Enzymology Vol. 182 (San Diego: Academic Press, Inc., 1990).
In a preferred embodiment, either the DP-protein or the nucleic acid encoding it is used to identify target molecules, i.e. the molecules with which the DP-protein interacts. As will be appreciated by those in the art, there may be primary target molecules, to which the DP-protein binds or acts upon directly, and there may be secondary target molecules, which are part of the signaling pathway affected by the DP-protein.
In a preferred embodiment, the DP-protein is used to pull out target molecules. For example, as outlined herein, if the target molecules are proteins, the use of epitope tags or purification sequences operably linked to the DP-protein can allow the purification of primary target molecules via biochemical means [coimmunoprecipitation, affinity columns, etc.; for example, see Deutscher, Guide to Protein Purification, Methods in Enzymology Vol. 182 (San Diego: Academic Press, Inc., 1990); Harris and Angal, Protein Purification Methods: A Practical Approach (Oxford: IRL Press at Oxford University Press, 1994); Harris and Angal, Protein Purification Applications: A Practical Approach (Oxford: IRL Press at Oxford University Press, 1990)]. Alternatively, the recombinant DP-protein, when expressed in bacteria and purified, can be used as a probe against a cDNA expression library made from mRNA of the target cell type. Or, DP-proteins can be used as a "bait" protein (e.g., when a DP-protein of defined sequence is employed in a screening to identify unknown binding proteins) or as a "test" protein (e.g., when a known protein is employed as a bait and screened against a molecular library comprising candidate DP-proteins) in either yeast or mammalian two or three hybrid systems (see Fields and Song, Nature 340:245-6 (1989); Vasavada et al., Proc. Natl. Acad. Sci. USA 88:10686-90 (1991); Fearon et al., Proc. Natl. Acad. Sci. USA 89:7958-62 (1992); Dang et al., Mol. Cell. Biol. 11:954-62 (1991); Chien et al., Proc. Natl. Acad. Sci. USA 88:9578-82 (1991); Luo et al., Bio/Techniques 22(2):350-352 (1997) and U.S. Patent Nos. 5,283,173, 5,667,973, 5,468,614, 5,525,490, and 5,637,463). Such interaction cloning approaches have been very useful to isolate DNA-binding proteins and other interacting protein components. The DP-protein(s) can be combined with other pharmacologic activators to study the epistatic relationships of signal transduction pathways in question.
It is also possible to synthetically prepare labeled DP-protein or a derivative thereof and use it to screen a cDNA library expressed in bacteriophage, bacteria or eukaryotic cells for those cDNAs which bind the DP-protein or its derivative. Furthermore, it is also possible to use cDNA cloning via retroviral libraries to "complement" the effect induced by the DP-protein. In such a strategy, the DP-protein would be required to be stoichiometrically titrating away some important factor for a specific signaling pathway. If this molecule or activity is replenished by over-expression of a cDNA from within a cDNA library, then one can clone the target. Similarly, cDNAs cloned by any of the above yeast or bacteriophage systems can be reintroduced to mammalian cells in this manner to confirm that they act to complement function in the system the peptide acts upon.
Once primary target molecules have been identified and validated, secondary target molecules may be identified in the same manner, using the primary target as the "bait". In this manner, signaling pathways may be elucidated. Similarly, bioactive agents specific for secondary target molecules may also be discovered, to allow a number of bioactive agents to act on a single pathway, for example for combination therapies.
In a preferred embodiment, a molecular library of recombinant DP-proteins is used in in vitro binding assays to identify members that are capable of binding to a selected target protein, e.g., a receptor, a ligand, an enzyme, etc.
Generally, in a preferred embodiment of the methods herein, a target protein (which can be a recombinant protein or a naturally occurring protein) is non-diffusably bound to an insoluble support having isolated sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be made of any composition to which the target protein can be bound, which is readily separated from soluble material, and which is otherwise compatible with the overall method of screening. The surface of such supports may be solid or porous and of any convenient shape. Examples of suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, Teflon™, etc. Microtiter plates and arrays are especially convenient because a large number of assays can be carried out simultaneously, using small amounts of reagents and samples. The particular manner of binding of the target protein is not crucial so long as it is compatible with the reagents and overall methods of the invention, maintains the characteristics of the target protein and is nondiffusable. The target protein may be bound either directly to the insoluble support (e.g., via cross-linking) or indirectly (e.g., via antibody, other protein or nucleic acid, etc.).
Preferred methods of binding include the use of antibodies (which do not sterically block the protein-protein interaction surface for the test protein and preferably are directed against a tag polypeptide which may be incorporated into the recombinant bait protein), direct binding to "sticky" or ionic supports, chemical crosslinking, etc. Following binding of the target protein, excess unbound material is removed by washing. The sample receiving areas may then be blocked through incubation with bovine serum albumin (BSA), casein or other innocuous protein.
A molecular library comprising a plurality of recombinant DP-proteins is added to the binding assay. The binding assay is performed at any temperature which facilitates optimal binding, typically between 4°C and 40°C. Incubation periods are selected for optimal binding, but are also optimized to facilitate high-throughput screening. Typically between 0.1 and 1 hour is sufficient. Determination of the binding of DP-proteins to the target protein may be done using a wide variety of assays, including labeled in vitro protein-protein binding assays, electrophoretic mobility shift assays (EMSA), immunoassays for protein binding, functional assays (phosphorylation assays, etc.) and the like. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual (New York, Cold Spring Harbor Laboratory Press, 1988) and Ausubel et al., Short Protocols in Molecular Biology (John Wiley & Sons, Inc., 1995).
The screening methods of the present invention may be useful to screen a large number of cell types under a wide variety of conditions. Generally, the host cells are cells that are involved in disease states, and they are tested or screened under conditions that normally result in undesirable consequences on the cells. When a suitable bioactive agent is found, the undesirable effect may be reduced or eliminated.
Alternatively, normally desirable consequences may be reduced or eliminated, with an eye towards elucidating the cellular mechanisms associated with the disease state or signaling pathway. These screening methods are outlined in PCT/US97/01019, hereby incorporated by reference.
The following examples serve to more fully describe the manner of using the above-described invention, as well as to set forth the best modes contemplated for carrying out various aspects of the invention. It is understood that these examples in no way serve to limit the true scope of this invention, but rather are presented for illustrative purposes. All references cited herein are incorporated by reference in their entirety.
EXAMPLE 1 Novel peptides which form observable dimers under harsh conditions.
Upon infusion of a 3 x 10- M, pH 6.4 solution of EFLIVKS-amide into the electrospray source of a Finnigan LCQ ion trap mass spectrometer, this peptide appears to self-associate to form dimers (fig. 3A). The dimers are detected at exactly two times the monomer molecular weight in the gas phase after surviving an inlet capillary temperature of 210°C and harsh electrospray conditions, and thus would be expected to dimerize at significantly lower concentrations in aqueous solution. The peptide also forms dimers (also detected by mass spectrometry) when eluted off a C18 reversed phase column at pH ~2.5 in ca. acetonitrile (fig. 3B). Comparison of its dimerization in fig. 3A with that of the test peptide SKVILFE (which forms dimers in the range of 10^-13 M in aqueous solution; Bodenmuller et al., supra), when both are continuously infused by an electrospray interface into an ion trap mass spectrometer, suggests that both peptides dimerize to a similar extent (within a factor of 10 or so). This suggests that EFLIVKS may dimerize in aqueous solution at very low concentrations. The dimerization of EFLIVKS cannot be predicted from dimerization of SKVILFE, since reversed sequences are often used as inactive controls for bioactive peptides.
Fig. 4 shows an LC/MS examination of the crude products from an all-single-coupled Fmoc synthesis of EFLIVKS-amide, searching for shorter sequences which can dimerize after electrospray ionization.
HPLC elution was with a gradient of 99.9% water-0.1% TFA to 99.9% acetonitrile-0.1% TFA. Dimers of the following truncated sequences were detected by mass spectrometry with the percent acetonitrile in parentheses, all at pH 2.5: peak 1, LIVKS-amide monomer m/z 705.3, dimer m/z 1409.3; peak 4, EFLIVKS-amide monomer m/z 834.4, dimer m/z 1667.3; and peak 5, EFLIVK-amide monomer m/z 747.4, dimer m/z 1493.1. These results suggest that the N-terminal EF and C-terminal S can be deleted without abolishing dimerization.
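The monomer and dimer m/z values listed above can be checked for internal consistency with a short calculation. The sketch below assumes (this is not stated in the text) that each dimer is observed as a singly protonated ion, [2M+H]+, so that its m/z equals twice the neutral monomer mass plus one proton; the function name is illustrative only.

```python
PROTON = 1.00728  # mass of a proton in Da

def expected_dimer_mz(monomer_mz):
    """m/z of a singly protonated dimer [2M+H]+ given the monomer [M+H]+ m/z."""
    neutral_mass = monomer_mz - PROTON
    return 2 * neutral_mass + PROTON

# (monomer m/z, observed dimer m/z) pairs as reported in the text
reported = {
    "LIVKS-amide": (705.3, 1409.3),
    "EFLIVKS-amide": (834.4, 1667.3),
    "EFLIVK-amide": (747.4, 1493.1),
}

for name, (monomer, dimer) in reported.items():
    calc = expected_dimer_mz(monomer)
    print(f"{name}: expected {calc:.1f}, observed {dimer:.1f}, diff {dimer - calc:+.1f}")
```

Under this assumption, each reported dimer lies within 1 Da of the value computed from its monomer.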
Examination of a peptide designed to form a short beta sheet, VSIKFEL, shows that upon elution from a C18 reversed phase column with mass spectrometry detection, the dimeric form of the peptide (m/z 1667.5) is detected in addition to the monomeric form (m/z 834.5) after electrospraying into the ion trap.
This suggests that this peptide, which contains alternating hydrophilic and hydrophobic residues and thus may form a beta sheet, can also form stable dimers.
EXAMPLE 2 EFLIVKS can form compact proteolytically resistant structures when added to the N- and C-terminus of a test 18mer polypeptide.
The peptide EFLIVKS, when fused to both the N- and C-terminus of a test 18mer polypeptide, can form a compact structure of this polypeptide (referred to herein also as peptide 1). The 18mer polypeptide sequence is VGTIVTMEYRIDRTRSFV, derived from the barley c2-chymotrypsin inhibitor [Leatherbarrow and Salacinski, Biochemistry 30:10717-21 (1991)]. The analog of this peptide containing an N- and C-terminal cysteine, in both cases substituted for valine, is thought to fold into a compact structure similar to the loop present in barley chymotrypsin inhibitor-2. Such a compact structure should be a poor substrate for proteases such as elastase, and in fact has been proposed as an inhibitor of elastase, chymotrypsin, and two variants of subtilisin. This disulfide-cyclized analog has been synthesized and tested by us, and is in fact a poor protease substrate, but a substrate nonetheless, and not an inhibitor. The linear peptide CGTIVTMEYRIDRTRSFC is a good substrate for elastase, with ca. 15 peptides produced after a 3 hour incubation (fig. 5A); the proteolysis was monitored by reversed phase HPLC coupled to mass spectrometry detection and identification of the proteolytic fragments. The same peptide with a disulfide bond between the two cysteines is also a substrate for elastase, but with fewer peptide products, and with the major initial cleavage occurring after the tyrosine (fig. 5B). Fusion of the dimerizer EFLIVKS onto both the N- and C-terminus (EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS-amide) creates a more proteolytically resistant construct (fig. 5C), with little proteolysis evident after almost 3 hours, and very minor amounts of a number of different cleavage products.
EXAMPLE 3 Examination of the low energy conformers of peptide 1.
To examine the structural nature of the compact construct of peptide 1, low energy conformers were obtained by a high temperature molecular dynamics-simulated annealing protocol similar to that published by Nilges et al., Protein Engineering 2:27-38 (1988), as implemented in Discover 95. Structures were saved every 2 psec (from different trajectories lasting from 400 psec to 1400 psec at 900 K), cooled to 300 K over 5 psec, and minimized using a distance-dependent dielectric constant (varying linearly between ε = 80 at 80 Å and ε = 1 at 1 Å separation) with 200 steps of the steepest descent algorithm, then with as many steps as necessary using the conjugate gradient algorithm to give a maximum derivative of less than 0.001 kcal/Å. The resulting low energy structures were collected and compared from trajectories starting from: a) peptide 1, with the 18mer polypeptide started from its conformation present in barley chymotrypsin inhibitor 1 [McPhalen and James, Biochemistry 26:261-269 (1987)], and subsequently minimized using Discover 2.9.5, attached to the two dimerizers minimized from an extended conformation; b) structures derived from a continuation of the trajectory in a) starting from the last structure, but with the trajectory modified by the use of a different dseed (different initial velocities); c) a continuation of the trajectory in b) with a third dseed; d) a trajectory starting as in a) except with the dimerizers forced into a starting beta sheet structure; e) a trajectory starting as in a) except with the dimerizers forced into a starting right handed alpha helical conformation; and f) a trajectory starting from a fully extended peptide 1.
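A dielectric constant that varies linearly from 1 at 1 Å to 80 at 80 Å is numerically equal to the separation in Å over that range. The sketch below illustrates this and the resulting screened Coulomb term. It is only an illustration: holding the dielectric constant at its end values outside the 1-80 Å range is an assumption, the function names are hypothetical, and the actual Discover implementation may differ.

```python
COULOMB = 332.06  # kcal*Å/(mol*e^2), conversion factor for charges in units of e

def dielectric(r, r_lo=1.0, eps_lo=1.0, r_hi=80.0, eps_hi=80.0):
    """Linear interpolation of the dielectric constant between the two end points.

    Outside [r_lo, r_hi] the end values are held constant (an assumption)."""
    r_clamped = min(max(r, r_lo), r_hi)
    return eps_lo + (eps_hi - eps_lo) * (r_clamped - r_lo) / (r_hi - r_lo)

def coulomb_energy(q1, q2, r):
    """Screened Coulomb energy (kcal/mol) for charges q1, q2 separated by r (Å)."""
    return COULOMB * q1 * q2 / (dielectric(r) * r)

for r in (1.0, 5.0, 10.0, 80.0):
    print(f"r = {r:5.1f} Å  eps = {dielectric(r):5.1f}  E(+1,-1) = {coulomb_energy(1, -1, r):9.4f}")
```

Because the dielectric constant tracks the distance, the electrostatic energy falls off roughly as 1/r² instead of 1/r, which damps long-range charge interactions in the absence of explicit solvent.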
All structures within 20 kcal/mole of the lowest energy structure from trajectories a-f were collected and compared. Figure 6 shows an overlay of the 45 lowest energy structures (only the peptide backbone is shown) from all of the trajectories, after a least-squares alignment of the peptide backbones. All structures when examined individually appear compact. Examination of the backbone conformations suggests that the 18mer polypeptide folds onto the surface of the dimerizers in different ways. Space filling models suggest that the resulting low energy structures are well-packed. This suggests that for polypeptide lengths on the order of 18 residues, a library of these constructs may be a library of very small proteins or compact structures. The relatively small size of these mini-proteins should allow facile NMR structure determination and thus the establishment of structure-activity relationships. These compact low energy conformers are also consistent with the observed inertness of this construct to elastase.
EXAMPLE 4 Estimation of the affinity of the folding peptides attached to the N- and C-terminus of a polypeptide necessary to help form a compact structure.
Unlike the strict requirements for very high affinity for efficacy of peptides which bind to a second peptide sequence which is not covalently linked, the affinity requirements here for making a compact structure are less demanding. High affinity peptides may well work, but are not required. The tethering of a second copy of a homodimeric peptide at a fixed distance from the first dimerizer (separated by the polypeptide) will result in a very high local concentration of the second dimerizer. An estimate of this local concentration is derived for an 8mer polypeptide tethering the two together as follows: a linear 8mer is ca. 3 Å/residue x 8 residues, or 24 Å, long when fully extended. A rough estimate of the distance from a second copy of the dimerizer would thus be in the range of 20 Å or less, since the peptide will not be fully extended in all (or even many) conformations. A solution with a second attached copy of a dimerizer every 20 Å away will have an effective concentration of ca. 0.2 mM. Thus peptides which form homo- or heterodimers at 1/100 of this concentration, or 2 µM and below, will be 99% cyclized by such a dimerizer.
Thus any homo- or heterodimerizer with a binding constant (for itself or its dimeric partner) of 2 µM or below will be sufficient for the formation of 99% cyclized peptides.
Based on minimized structures of peptide 1, the second copy of the dimerizer may be significantly closer to the first copy than 20 Å, depending on the folded state of the polypeptide inserted between the folding peptides. If on average it is 10 Å away, its local concentration will be roughly 1.6 mM, and 99% cyclized peptides will be attained from dimerizers with self-binding constants of 16 µM or less.
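The reasoning in this example can be sketched numerically. The code below takes the effective concentrations quoted in the text at face value and assumes a simple two-state model in which the ratio of cyclized to open chains equals the effective concentration divided by the dimerization constant. It also assumes that the effective concentration scales as the inverse cube of the separation, which is what the two estimates in the text (0.2 mM at 20 Å, 1.6 mM at 10 Å) imply. Function names are illustrative.

```python
def fraction_cyclized(c_eff, kd):
    """Fraction of chains whose two dimerizers are paired.

    Two-state model (an assumption): [closed]/[open] = c_eff / kd."""
    return c_eff / (c_eff + kd)

def scale_concentration(c_ref, d_ref, d):
    """Rescale an effective concentration, assuming it varies as 1/d**3."""
    return c_ref * (d_ref / d) ** 3

c_20 = 0.2e-3                                   # mol/L at 20 Å, value from the text
c_10 = scale_concentration(c_20, 20.0, 10.0)    # 8-fold higher at 10 Å

print(f"effective concentration at 10 Å: {c_10 * 1e3:.1f} mM")
print(f"cyclized at 20 Å, Kd = 2 µM:  {fraction_cyclized(c_20, 2e-6):.1%}")
print(f"cyclized at 10 Å, Kd = 16 µM: {fraction_cyclized(c_10, 16e-6):.1%}")
```

In this model a dimerization constant 100-fold below the effective concentration gives 100/101, or about 99%, cyclized chains at either separation.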
Example 5 Synthesis of peptide constructs
The following materials were obtained from the indicated sources. Protected N-α-Fmoc amino acid derivatives were purchased from Advanced ChemTech (Louisville, KY), and all the peptide synthesis reagents, such as diisopropylcarbodiimide (DIC), N-hydroxybenzotriazole (HOBt), 2-(1H-benzotriazol-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HBTU), trifluoroacetic acid (TFA), N,N-diisopropylethylamine (DIPEA), piperidine, thioanisole, ethanedithiol and anisole, were obtained from Sigma (St. Louis, MO). Pre-loaded Fmoc-Xaa-Wang-resins and H-Pro-2-Cl-Trt-resin (to synthesize C-terminal Pro peptides) were purchased from Novabiochem (La Jolla, CA). Organic solvents such as dimethylformamide (DMF) and dichloromethane (DCM) were from Fisher Scientific (Santa Clara, CA) and were of analytical grade.
The dimerizer scaffold peptides were synthesized on an automated Symphony/Multiplex multiple peptide synthesizer from Protein Technologies Inc. (Tucson, AZ) following classical Fmoc chemistry. The durations of the coupling (1.5 h/coupling) and deprotection (3 x 20 min) steps were slightly modified from the existing default program to achieve the desired peptide in good yield. The pulsing rate of nitrogen gas used to stir the resin mixture was carefully adjusted to ensure complete mixing of the resin beads with the added reagents. Standard Fmoc-compatible side-chain protecting groups were used for the respective amino acid derivatives: tertiary-butyl (tBu) for Ser, Thr, Glu, Asp and Tyr; trityl (Trt) for Gln, His and Asn; and tertiary-butyloxycarbonyl (Boc) for Lys and Trp. Similarly, the 2,2,4,6,7-pentamethyldihydrobenzofuran (Pbf) group served as side-chain protection for Arg [Fields and Fields, Tetrahedron Lett. 34:6661 (1993)]. The coupling reactions were carried out twice with a five-fold excess of Fmoc-protected α-amino acids in a mixture (v/v) of DMF and DCM using a DIC/HOBt-mediated coupling procedure [Fields and Noble, Int. J. Pept. Protein Res. 35:161-214 (1990); Hudson, J. Org. Chem. 53:617-624 (1988)]. In some cases (coupling of Arg, His and Lys), triple coupling was carried out to ensure completion of the reaction, and the HBTU/HOBt coupling method was adopted in difficult situations [Knorr et al., Tetrahedron Lett. 30:1927-1930 (1989)]. Unreacted amino groups were capped with 50% acetic anhydride in DMF. After five amino acids had been coupled in automated mode for each sequence, the synthesis was paused, completion of the coupling reaction was checked by Kaiser's test [Kaiser et al., Anal. Biochem. 34:595-598 (1970)], and the synthesis then proceeded. At the end of the synthesis, the Fmoc group on the N-terminus was removed in 30 min with 25% piperidine in DMF, and the resin was washed extensively with DCM followed by absolute ethanol. Peptides were then cleaved from the resin in manual mode by treatment with King's cleavage cocktail (Reagent K), composed of TFA (82.5%)/phenol (5%)/thioanisole/ethanedithiol/water, for 3 h at ambient temperature, during which all the side-chain protecting groups were also removed [King et al., Int. J. Pept. Protein Res. 36:255-266 (1990)]. After the cleavage, crude peptides were precipitated using cold diethyl ether, after which the precipitate was solubilized in water/acetonitrile solution and lyophilized as described previously [Gururaja and Levine, Peptide Res. 9:283-289 (1996)].
The lyophilized crude peptide extracts were purified to homogeneity by reversed-phase high performance liquid chromatography (RP-HPLC) (Hewlett-Packard model 1100 series HPLC system having a UV variable wavelength DAD detector; San Francisco, CA, USA) using a semi-preparative Rainin (Woburn, MA) Dynamax 60 Å reversed-phase cyano column (10 x 250 mm) coupled to a guard column (10 x 50 mm). The mobile phase consisted of A: 0.1% TFA in water and B: 0.1% TFA in acetonitrile. A linear gradient of 0-40% buffer B in 40 min was employed to elute the peptide at a flow rate of 2.0 ml/min using dual wavelength detection mode at 230 and 280 nm as described previously (Gururaja and Levine, supra). Fractions containing pure peptide were pooled and lyophilized. The integrity and identity of all the purified synthetic peptides were confirmed by on-line electrospray ionization-mass spectrometry (ESI-MS), wherein the HPLC column outlet was connected directly to a Finnigan LCQ mass spectrometer (San Jose, CA, USA) equipped with the standard ESI source. Mass spectrometric data were in good agreement with the expected values. The peptide EFLIVKS-STKSIPPQS-EFLIVKS, used for NMR studies, was over 99% pure as judged by LC/MS.
Example 6 Peptide dimerization observed by mass spectrometry
Peptides were dissolved from a lyophilized white powder into water at pH 5.0, and the pH of the most concentrated stock was checked. After initial observation of peptide dimers when purifying the crude peptide, the Finnigan (San Jose, CA) LCQ mass spectrometer was tuned to optimize the signal intensity of the dimer at pH 5.0. The optimal parameters were: heated inlet capillary, 130-150°C; source voltage kV; capillary voltage, 38 V; tube lens offset, 24 V; sheath gas, 40-80 l/min; and auxiliary gas, 20 l/min. All binding measurements were made using a continuous infusion rate of 5-10 µl/min. The relative ion current of the dimeric peptides was calculated as Σ(intensity of all dimer ions) / [Σ(all dimer ions) + Σ(all monomer ions)]; sodium adduct ions were included when observed.
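The relative dimer ion current defined above is a simple ratio of summed intensities. A minimal sketch follows; the function name and the intensity values are hypothetical and serve only to show the arithmetic.

```python
def relative_dimer_current(dimer_intensities, monomer_intensities):
    """Summed dimer ion intensity divided by summed dimer plus monomer intensity."""
    dimer = sum(dimer_intensities)
    monomer = sum(monomer_intensities)
    total = dimer + monomer
    if total == 0:
        raise ValueError("no ion signal")
    return dimer / total

# Hypothetical intensities: protonated ion first, sodium adduct second
dimer_ions = [2.0e5, 0.5e5]
monomer_ions = [6.0e5, 1.5e5]

print(f"relative dimer ion current: {relative_dimer_current(dimer_ions, monomer_ions):.2f}")
```

Summing adduct ions together with the protonated ions keeps the ratio from shifting when the sodium content of a sample changes.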
To construct a scaffold with a self-associating peptide at each end, we examined variants of part of the sequence of a proposed self-associating peptide hormone, the neuropeptide head activator (Bodenmuller et al., supra). After some preliminary tests, an analog which contains a reversed sequence of part of the peptide was used for further studies. To establish the stoichiometry of binding of this analog, EFLIVKS, the self association was examined by electrospray mass spectrometry. For binding studies the mass spectrometer was tuned on the dimeric version of the peptide (see Figure 3 and data not shown). No evidence for a trimeric or tetrameric peptide association was found. When the relative ion current of the dimer was plotted against the concentration of peptide infused into the source, a curve which is well fit by a rectangular hyperbola was seen (data not shown). This saturable dimer formation yields an apparent binding constant for EFLIVKS of 7.8 µM. This experiment was repeated with different analogs of this sequence (Table 1). Replacing the N-terminal glu with lys did not significantly change the binding, suggesting that these peptides do not dimerize simply by forming reciprocal lys-glu interactions. Addition of glu and lys to make EEFLIVKKS results in an apparent 4-fold increase in self-binding affinity. Each residue was individually replaced with alanine, and dimerization was monitored. Replacement of the N-terminal glu and C-terminal ser had little effect on the apparent dimerization constant, while ala replacement of F2, L3, I4, V5 and K6 weakened binding 5.4-fold, 9.7-fold, 5.4-fold, 10-fold and over 6-fold, respectively.
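The rectangular hyperbola fit mentioned above can be illustrated with a small, dependency-free sketch. The original fitting procedure is not described, so the approach here is an assumption: for each trial Kd the best plateau follows in closed form from linear least squares, and a golden-section search over log10(Kd) minimizes the remaining squared error. The titration data are hypothetical and are generated from the model itself.

```python
import math

def hyperbola(c, y_max, kd):
    """Rectangular hyperbola: signal as a function of peptide concentration c."""
    return y_max * c / (kd + c)

def best_y_max(conc, signal, kd):
    """Least-squares plateau for a fixed Kd (the model is linear in y_max)."""
    x = [c / (kd + c) for c in conc]
    return sum(a * b for a, b in zip(x, signal)) / sum(a * a for a in x)

def sse(conc, signal, kd):
    """Sum of squared residuals at the best plateau for this Kd."""
    y_max = best_y_max(conc, signal, kd)
    return sum((s - hyperbola(c, y_max, kd)) ** 2 for c, s in zip(conc, signal))

def fit_kd(conc, signal, lo=1e-3, hi=1e3, iterations=200):
    """Golden-section search for Kd on a log10 scale between lo and hi."""
    phi = (math.sqrt(5) - 1) / 2
    a, b = math.log10(lo), math.log10(hi)
    for _ in range(iterations):
        x1 = b - phi * (b - a)
        x2 = a + phi * (b - a)
        if sse(conc, signal, 10 ** x1) < sse(conc, signal, 10 ** x2):
            b = x2   # minimum lies in [a, x2]
        else:
            a = x1   # minimum lies in [x1, b]
    kd = 10 ** ((a + b) / 2)
    return kd, best_y_max(conc, signal, kd)

# Hypothetical noise-free titration (concentrations in µM), built from the model
conc = [1, 2, 5, 10, 20, 50, 100]
signal = [hyperbola(c, 0.6, 7.8) for c in conc]

kd, y_max = fit_kd(conc, signal)
print(f"Kd(app) = {kd:.2f} µM, plateau = {y_max:.2f}")
```

With real data the residuals would not vanish, and a nonlinear least-squares routine from a numerical library would normally be used instead of this hand-written search.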
Table 1. Self-dimerization constants measured by mass spectrometry.
peptide sequence    Kd(app), µM
EFLIVKS             7.8
KFLIVKS             6.8
VSIKFEL             13
SKVILFE             12
EEFLIVKKS-acid      2.1
AFLIVKS             12
EALIVKS             42
EFAIVKS             76
EFLAVKS             42
EFLIAKS             82
EFLIVAS
EFLIVKA             9.9

Example 7 Inhibition of elastase by peptide constructs
The activity of 100 nM porcine pancreatic elastase (Sigma Chemical Co., St. Louis, MO) in 0.1 M Tris buffer, pH 7.88, at 25°C was followed by cleavage of 100 µM succinyl-ala-ala-ala-p-nitroanilide, monitored at 412 nm for 1-2 min. The assay kinetics were linear over this time. Inhibition by different peptide constructs was followed by preincubation of 10 µM peptide with elastase for 1-1.5 min, followed by addition of the substrate. Percent inhibition was calculated as [1 - {assay slope (peptide + elastase) / assay slope (elastase alone)}] x 100%. These assay conditions are identical to those used by Leatherbarrow and Salacinski, supra.
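The percent inhibition calculation described above amounts to comparing two initial-rate slopes. A minimal sketch follows; the function name and the slope values are hypothetical and chosen only to illustrate the arithmetic.

```python
def percent_inhibition(slope_with_peptide, slope_enzyme_alone):
    """Percent inhibition from absorbance-versus-time slopes of the two assays."""
    if slope_enzyme_alone == 0:
        raise ValueError("control slope must be non-zero")
    return (1.0 - slope_with_peptide / slope_enzyme_alone) * 100.0

# Hypothetical slopes in absorbance units per minute
control = 0.0500
with_peptide = 0.0469

print(f"inhibition: {percent_inhibition(with_peptide, control):.1f}%")
```

A slope slightly larger than the control gives a small negative value, which is how negative inhibition percentages arise from assay scatter.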
As a test sequence for insertion between the N- and C-terminal peptide sequence EFLIVKS used to constrain the test sequence, we chose a variant of the sequence of the 18mer Ci2b protease-inhibitory loop. This sequence has been reported (Leatherbarrow and Salacinski, supra) to be a very potent inhibitor of subtilisin, chymotrypsin and elastase. To test this, we assayed the inhibition of elastase, under conditions identical to those reported, with both the disulfide-cyclized form and the EFLIVKS-constrained analog. The results are shown in Table 2. At a concentration of 500 nM, the disulfide-cyclized peptide cyclic[CGTIVTMEYRIDRTRSFC] causes only a slight inhibition of porcine pancreatic elastase, 6.2% (n = 3). Based on its reported apparent inhibition constant of 390 pM (Leatherbarrow and Salacinski, supra), and the concentration and Km of the substrate used in the assay, an inhibition of 99.9% would be estimated assuming this putative inhibitory peptide is competitive with substrate. The same 18mer sequence, with the N- and C-terminal val of the native sequence substituted for the cysteines, was also tested with different combinations of dimerizer peptides fused to its N- and C-termini. None gave significant inhibition of elastase (Table 2).
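The expected inhibition for a competitive inhibitor can be sketched as follows. The inhibitor concentration (500 nM), inhibition constant (390 pM) and substrate concentration (100 µM) come from the text; the Km value is not given here, so the 1 mM used below is a hypothetical placeholder, and the function name is illustrative.

```python
def expected_inhibition(inhibitor, ki, substrate, km):
    """Percent inhibition for a competitive inhibitor.

    All concentrations must share the same units (mol/L here)."""
    ki_apparent = ki * (1.0 + substrate / km)
    return 100.0 * inhibitor / (inhibitor + ki_apparent)

KM_ASSUMED = 1.0e-3  # mol/L; hypothetical Km, not taken from the text

value = expected_inhibition(
    inhibitor=500e-9,   # 500 nM cyclic peptide
    ki=390e-12,         # reported apparent inhibition constant, 390 pM
    substrate=100e-6,   # 100 µM substrate
    km=KM_ASSUMED,
)
print(f"expected inhibition: {value:.1f}%")
```

Because the inhibitor concentration exceeds the inhibition constant by more than a thousandfold, the estimate stays near 99.9% for any Km well above the substrate concentration.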
Table 2. Peptide inhibition of porcine pancreatic elastase*.
assay                                       inhibition (%)   expected (%)   n
50 nM enzyme alone                          0                0
500 nM cyclic[CGTIVTMEYRIDRTRSFC]           6.2 ± 1.2        99.9**         3
10 µM EFLIVKSVGTIVTMEYRIDRTRSFVEFLIVKS      -4.3 ± 0.3                      3
10 µM EFLIVKS-18mer insert-SKVILFE          7.0 ± 1.0                       3
10 µM SKVILFE-18mer insert-EFLIVKS          2.4 ± 0.29                      3
10 µM SKVILFE-18mer insert-SKVILFE          -1.4 ± 0.14                     3
2 mM PMSF                                   100              100            3
*assay at pH 7.88, 25°C, using 100 µM succ-ala-ala-ala-p-nitroanilide as a substrate, observed at 412 nm. Each value is derived from 3-5 replicates.
**estimated as 100[inhibitor]/([inhibitor] + Ki(1 + [substrate]/Km)), assuming the peptide is a competitive inhibitor with a Ki of 390 pM.

Example 8 Elastolysis of peptide constructs
To examine the effects of elastase on different peptides, purified synthetic peptides (10 µM) were dissolved in 0.1 M Tris pH 7.88, at 25°C. Elastase was added to 100 nM. At time 0, 15 min, and 1, 2, 3 and 24 hours, an aliquot of the reaction mixture was injected onto a 0.1 x 25 cm C18 reversed phase HPLC column (Vydac Inc., Hesperia, CA). The reaction mixture was eluted using a gradient of 100% A (99.9% water, 0.1% v/v trifluoroacetic acid) for 10 min, followed by a 1%/min increasing gradient of B (99.9% acetonitrile, 0.1% trifluoroacetic acid) to 60% B, followed by a 5%/min gradient of B to 100%. Peptides were examined by direct elution from the column into the source of a Finnigan LCQ ion trap mass spectrometer. Peptides were scanned from 300-2000 amu, and identified by comparing their mass with that of different fragments of the full length peptide, or with the masses of expected elastolytic fragments in the case of the cyclic peptide, using MacBioSpec (obtained courtesy of PE-Sciex, Foster City, CA). Proteolysis of the reduced peptide CGTIVTMEYRIDRTRSFC was done in the presence of 2 mM dithiothreitol (Sigma). Cleavage products of the oxidized peptide were either directly chromatographed without reduction, or chromatographed after an aliquot was treated first with 1 mM PMSF for 1 hour and then with 30 mM DTT for 10 min.
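The elution program described above can be written as a piecewise function of time. The sketch assumes that the three segments run back to back with no delay or re-equilibration step, which the text does not state; the function name is illustrative.

```python
def percent_b(t_min):
    """Percent solvent B at time t (minutes) for the three-segment gradient."""
    if t_min <= 10:
        return 0.0                                   # 10 min hold at 100% A
    if t_min <= 70:
        return 1.0 * (t_min - 10)                    # 1%/min up to 60% B
    return min(100.0, 60.0 + 5.0 * (t_min - 70))     # 5%/min up to 100% B

for t in (0, 10, 40, 70, 74, 78):
    print(f"t = {t:2d} min: {percent_b(t):5.1f}% B")
```

Under these assumptions the gradient reaches 60% B at 70 min and 100% B at 78 min.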
To further examine the reason for the lack of expected elastase-inhibitory activity of cyclic [CGTIVTMEYRIDRTRSFC], and to examine the elastolytic stability of some of the peptide constructs in table 1, we incubated each peptide at a concentration of 10 µM with 100 nM elastase for 3 hours at pH 7.8, 25°C. Each reaction mixture was then chromatographed over a microbore C18 reversed phase column, and the peptide fragments were identified using mass spectrometry. The cyclic peptide reaction was examined either with or without subsequent reduction (data not shown). The linear peptide CGTIVTMEYRIDRTRSFC was highly susceptible to elastolysis, giving ca. 11 different identifiable peptides (Table 3). The main cleavage was after Y9, and additional cleavages were after I4, V5, T6, M7, T14 and F17. Cyclic [CGTIVTMEYRIDRTRSFC] appeared to be cleaved more slowly than its linear analog, and after 3 hours was cleaved at fewer sites, mainly after Y9, and also after M7, T14 and F17.
The Ci2b loop 18mer with EFLIVKS attached to each end was not attacked to a significant degree by elastase after 3 hours, with only cleavage after Y16 initially being observed. Most low level peaks observed in the chromatogram are mainly synthetic impurities also present in the absence of added elastase. After 24 hours, enough proteolysis occurred to assign additional elastolytic sites in this construct as after V5, M14, T21, F27 and I29. Thus cleavage occurred at many of the same residues as in the linear and cyclic peptides, but at a much reduced rate.
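Assigning an observed mass to a fragment of the parent peptide can be sketched as below. This is not the MacBioSpec procedure, only an illustration under stated assumptions: fragments are linear with free N- and C-termini, ions are singly protonated, and an arbitrary 0.5 Da tolerance is used. Cyclic, oxidized or C-terminally amidated species would need additional mass corrections.

```python
# Monoisotopic residue masses in Da
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "C": 103.00919, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "Q": 128.05858, "K": 128.09496, "E": 129.04259, "M": 131.04049,
    "H": 137.05891, "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.00728

def mh_plus(seq):
    """Monoisotopic [M+H]+ of a linear peptide with free termini."""
    return sum(MONO[aa] for aa in seq) + WATER + PROTON

def match_fragments(parent, observed, tol=0.5):
    """All contiguous fragments of parent whose [M+H]+ lies within tol of observed."""
    hits = []
    for i in range(len(parent)):
        for j in range(i + 1, len(parent) + 1):
            frag = parent[i:j]
            mass = mh_plus(frag)
            if abs(mass - observed) <= tol:
                hits.append((frag, round(mass, 2)))
    return hits

parent = "CGTIVTMEYRIDRTRSFC"
for observed in (1016.3, 1050.6, 724.2):
    print(observed, match_fragments(parent, observed))
```

Because every contiguous fragment is enumerated, more than one candidate may fall inside the tolerance window; the cleavage specificity of elastase and the presence of the complementary fragment help to choose among them.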
Table 3. Identification of elastolytic peptides of the Ci2b loop peptide and analogs.
peptide substrate: linear CGTIVTMEYRIDRTRSFC
peptide fragment*                                            observed/expected
parent peptide                                               2150.2/2151.0
CGTIVTMEY                                                    1016.3/1016.44
RIDRTRSF                                                     1050.6/1050.58
RIDRTRSFC                                                    1152.6/1153.54
CGTIVTMEYRIDRTRSF                                            2048.2/2048.0
CGTIVTMEYRIDRT                                               1658.0/1658.0
CGTIVTM                                                      724.2/724.34
EYRIDRTRSFC                                                  1444.9/1445.7
EYRIDRT                                                      952.4/952.49
VTMEYRIDRTRSF                                                1673.8/1673.84
MEYRIDRTRSFC                                                 1576.9/1576.74
MEYRIDRT                                                     1083.6/1083.53
TMEYRIDRT                                                    1184.6/1184.57

peptide substrate: cyclic [CGTIVTMEYRIDRTRSFC]
peptide fragment*                                            observed/expected
parent peptide                                               2149.8/2150.5
c[CGTIVTMEYRIDRTRSFC] + H2O (single cleavage in ring)        2168.3/2168.5
CGTIVTMEY                                                    1016.3/1016.44
RIDRTRSFC                                                    1153.3/1152.6
RIDRTRSF                                                     1050.6/1050.58
EYRIDRT                                                      952.4/952.49

peptide substrate: EFLIVKS-Ci2b insert-EFLIVKS
peptide fragment*                                            observed/expected
parent peptide                                               3775.2/3775.1
EFLIVKSVGTIVTMEY                                             1828.6/1828.98
RIDRTRSFVEFLIVKS                                             1965.1/1965.14
EFLIV or VEFLI                                               620.2/620.37
VEFLI or EFLIV (second peak; both may be present)            620.1/620.37
RSFVEF                                                       784.4/784.4
KSVGTIVTM                                                    935.4/935.52
*presented as peptide fragment/observed monoisotopic mass/expected monoisotopic mass

Example 9 Deuterium exchange experiments using constrained loop peptides
Deuterium exchange experiments were carried out by dissolving the peptide of interest in water at pH 5 and diluting the peptide 10-fold into D2O at t = 0. For the initial constructs tested, the peptide concentrations after dilution in D2O were in the range of 10 µM. For other time points, an aliquot of the peptide solution was quenched by addition of a 2.5-fold volume excess of 1:1 H2O:MeCN with 1% formic acid at 0°C or 25°C and immediately infused into the mass spectrometer. This acidic pH jump slows the rate of amide bond hydrogen exchange with solvent. For selected time points the mass derived from the first 2 min of the infusion was compared to that of later 2 min blocks to assess the significance of back-exchange, which was usually 1 proton or less. The total number of exchangeable protons was derived by 1) initially dissolving the peptide in DMSO, and diluting it directly into D2O before quenching and measurement of the new mass of the peptide several minutes later; 2) diluting a peptide dissolved in DMSO 10-fold into D2O; or 3) heating the solution of peptide diluted into D2O at 100°C for 15 min. DMSO was included since in preliminary experiments low levels added to aqueous peptides appeared to greatly accelerate proton exchange. When all three methods were used with EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS, they gave the same results. Calculation of the total protons exchanged included correction for the 10% by volume of H2O present after dilution in D2O. For peptides which were soluble in the 1 mM range, samples from the 10-fold D2O solution at pH 5 were directly infused into the mass spectrometer without quenching.
Rate constants and amplitudes for deuterium exchange were derived by fitting the time course of the gain in mass above the fully protonated form to a single exponential function.
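The single-exponential description of the exchange time course can be sketched as below. The form of the model is an assumption: the fast phase is treated as complete at all sampled times and the slow class is ignored on this time scale. The numerical inputs are the fast amplitude, intermediate amplitude and rate constant reported for the EFLIVKS-18mer construct; function names are illustrative.

```python
import math

def mass_gain(t_hr, fast, intermediate, k_intermediate):
    """Mass gain (Da) at time t in hours.

    The fast protons are taken as already exchanged and the intermediate
    class follows a single exponential; slow protons are not modeled."""
    return fast + intermediate * (1.0 - math.exp(-k_intermediate * t_hr))

def half_life(k):
    """Half-life (hours) of a first-order process with rate constant k (hr^-1)."""
    return math.log(2) / k

FAST, INTERMEDIATE, K_INT = 29.3, 16.0, 0.054  # values reported for EFLIVKS-18mer

print(f"half-life of the intermediate class: {half_life(K_INT):.1f} hr")
for t in (0, 1, 6, 24, 72):
    print(f"t = {t:2d} hr: gain = {mass_gain(t, FAST, INTERMEDIATE, K_INT):.1f} Da")
```

A rate constant of 0.054 hr^-1 corresponds to a half-life of roughly 13 hours, so time points extending over days are needed to define the intermediate amplitude.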
Relative to surface-exposed residues, the amide backbone protons of peptides and proteins will exchange more slowly with deuterated water when they are buried in the interior of a protein (and inaccessible to water) or are involved in stable hydrogen bonding [Englander et al., Protein Science 6:1101-1109 (1997)]. Mass spectrometry has been used to examine the hydrogen exchange properties of a variety of different proteins (see Chung et al., Protein Science 6:1316-24 (1997) and Smith et al., J. Mass Spectrometry 32:135-146 (1997) for recent examples), and the existence of slowly exchanging protons has been used to infer the existence of tertiary structure [McKnight et al., J. Mol. Biol. 12:126-34 (1996)]. Deuterium exchange studies here were done at pH 5 since below pH 4.5 the constrained loops do not appear to retain structure as measured by circular dichroism (vide infra). To examine the compactness of the peptide dimerizer-constrained Ci2b loop peptide, the rate and stoichiometry of deuterium incorporation upon dilution into D2O was examined. The results for a variety of different constructs are summarized in Table 4.
Table 4. Deuterium exchange rates and amplitudes for constrained Ci2b peptide insert and other peptide inserts¹.
dimerizer¹                              insert               total exchangeable protons   proton exchange amplitudes³                    exchange rate constants
EFLIVKS                                 18mer²               66.5 ± 1.4                   29.3 ± 1.5 fast; 16 intermediate; 21 slow      k(intermed) = 0.054 hr^-1
EEFLIVKKS                               18mer                70.3                         39.4 fast; 7.9 intermediate; 23 slow           k(intermed) = 0.15 hr^-1
MGEFLIVKS-insert-EFLIVKSGPP             18mer
K6G4-EFLIVKS-insert-EFLIVKS             18mer                87.7                         82.8 fast
K3GSGS-EFLIVKS-insert-EFLIVKS-GSGSK3
EFLKVKS                                 18mer                71.6                         70.1 fast
SKVILFE                                 18mer                66                           31 fast; 17 intermediate; 18 slow              k(intermed) = 0.041 hr^-1
KFLIVKS                                 18mer
EFLIVKS                                 -STKSIPPQS-          36.1 ± 1.9                   32.6 ± 3.8 fast
MGEFLIVKS-insert-EFLIVKSGPP             -G4DYKDDDDKG4-       35.9                         34.8
MGEFLIVKS-insert-EFLIVKSGPP             -G4YPYDVPDYASLG3-    40.3                         40.3
¹data is presented for peptides of the form dimerizer-insert-dimerizer; the dimerizer sequence is the same at the N- and the C-terminus except as noted
²the 18mer standard insert is the Ci2b sequence VGTIVTMEYRIDRTRSFV
³the fast phase amplitude is calculated for the fastest exchange data, lasting at most ca. 1 hour

The kinetics of deuterium exchange for the construct EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS-amide were determined (data not shown). A total of 66.5 Da was added to the time zero mass of the peptide upon complete proton exchange. In the fast phase of proton exchange, 29 protons exchanged; roughly 33 side chain, N- and C-terminal protons would be expected to be rapidly exchangeable at this pH if exposed to water. A further 16 protons exchanged at an intermediate rate, with a rate constant of 0.054 hr^-1. This left 21 protons which are presumed to exchange even more slowly than observable on this time scale. Both classes of protons exchange at a rate slower than measured for surface-exposed protons, taken from nearest neighbors identical to those found in EFLIVKS [Bai et al., Proteins: Structure, Function and Genetics 67:75-86 (1993)]. These control protons exchanged with deuterium with rate constants in the range of 6-60 hr^-1 at pH 5. Similar results were obtained with the reversed dimerizer sequence attached to the same end of the 18mer insert. With this 18mer insert, similar results were also obtained with an apparently more potent dimerizer (table 3), EEFLIVKKS, attached to each end of the insert. In this case, 39 of the 70 exchangeable protons exchanged with deuterium within an hour, 8 protons exchanged with a rate constant of 0.15 hr^-1, and the remaining 23 protons exchanged more slowly than this. The total of 31 slowly exchanging protons in this analog was somewhat less than the 37 protons in the parent peptide, suggesting some subtle changes in structure between the two constructs.
For the peptide analog with lys6-gly4 fused to the N-terminus of the parent peptide, designed to enhance the solubility of the peptide, all but ca. 5 protons exchanged within an hour. This N-terminal fusion may thus destabilize the structure or at least make it more mobile.
The side chain of the isoleucine normally at the 4th position in this peptide appeared in the low energy conformers obtained from high temperature molecular dynamics trajectories (vide infra) to be buried in the folded peptide. Thus we created a single point mutation at the 4th position in each dimerizer (creating EFLKVKS), and examined the effect of this mutation on the 18mer insert structure by deuterium exchange. If this mutation disrupted the structure in a significant way, the number of slowly exchanging protons might be diminished. When the deuterium exchange kinetics were examined, all but one proton exchanged within an hour.
We also examined the effect of insert sequences different from Ci2b on the exchange kinetics of the overall peptide when EFLIVKS or its analogs were fused to both termini of these inserts. One insert, STKSIPPQS, represented an analog of the protease inhibitor cyclic[CTKSIPPQC] [Gariani and Leatherbarrow, J. Peptide Res. 49:467-75 (1997)]. This short construct had 36 exchangeable protons when heated, 33 of which exchanged in an hour. Thus if this peptide has a folded structure, it contains only a few slowly exchanging protons. A second insert included the flag epitope tag, DYKDDDDK, flanked by four glycines on each end to increase its flexibility so as to allow binding of the epitope to antiflag antibodies. This was fused to MGEFLIVKS- at the N-terminus and -EFLIVKSGPP at the C-terminus.
This peptide had 36 exchangeable protons, 35 of which exchanged within an hour. A similar construct, also expected to be somewhat flexible due to the presence of 7 glycines, was synthesized with the influenza hemagglutinin epitope tag replacing the flag tag. All protons were exchanged for deuterium within ca. 1 hour. Thus a shorter inserted sequence, or inserts with multiple glycines at each end to allow flexibility of the insert, did not have slowly exchanging protons.
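The rate constants quoted for the exchange experiments can be turned into more intuitive quantities. The following is a minimal sketch, assuming simple first-order (single-exponential) exchange for each proton class; that kinetic model and the function names are illustrative assumptions and are not stated in the specification.

```python
import math


def fraction_unexchanged(rate_per_hr: float, hours: float) -> float:
    """Fraction of a proton class not yet exchanged after `hours`.

    Assumes first-order kinetics: f(t) = exp(-k * t).
    """
    return math.exp(-rate_per_hr * hours)


def half_life_hr(rate_per_hr: float) -> float:
    """Half-life (hours) of a first-order process with rate constant k."""
    return math.log(2) / rate_per_hr


if __name__ == "__main__":
    # Rate constants taken from the text, in hr^-1.
    classes = {
        "intermediate, EFLIVKS construct": 0.054,
        "intermediate, EEFLIVKKS construct": 0.15,
        "surface-exposed control (slow end)": 6.0,
        "surface-exposed control (fast end)": 60.0,
    }
    for name, k in classes.items():
        print(f"{name}: half-life {half_life_hr(k):.3f} h, "
              f"unexchanged after 1 h: {fraction_unexchanged(k, 1.0):.3f}")
```

Under this assumption the intermediate class of the EFLIVKS construct is still largely unexchanged after one hour, whereas the control protons are essentially fully exchanged, which is the contrast the text draws.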
Example Models of peptide constructs
None of the peptide dimerizer constructs with slowly exchanging protons examined so far has been soluble in the millimolar range; thus structure determination by NMR is not readily accomplished. To derive a working model of selected structures, which can be roughly compared to secondary structure content derived from circular dichroism, we used high temperature molecular dynamics to generate conformers [Brooks, Chem. Scripta 29A:165-169 (1989); Bruccoleri and Karplus, Biopolymers 29:1847-62 (1990); Auffinger and Wipff, J. Comput. Chem. 11:190 (1990)] and subsequent thorough minimization to energy-rank different conformers. The conformers within the lowest 5-7 kcal/mole energy band are then compared.
This method of "quenched molecular dynamics" has been applied to a tumor surface octapeptide and peptide fragments of different proteins (Brooks, supra), tuftsin and its cyclic analogs [O'Connor et al., J.
Med. Chem. 35:2870-81 (1992)], and to linear and cyclic melanotropins [Al-Obeidi et al., J. Peptide Res.
51:420-31 (1998)]. While this approach has also been applied to larger systems such as the 70 residues in hypervariable loops of antibodies [Bruccoleri and Karplus, Biopolymers 29:1847-62 (1990)] not enough conformers are generated for such large systems to provide complete conformational coverage [Dill, Biochemistry 24:1501-9 (1985)]. Thus this approach when applied to a 32mer will only allow examination of a few of the expected low energy conformers, giving only a rough idea of the overall fold. It may however allow a more significant coverage of conformation space for a cyclic 18mer or two linear 7mer peptides.
We applied this methodology first to the disulfide bond-constrained cyclic 18mer peptide, cyclic[CGTIVTMEYRIDRTRSFC], which is thought to be a sub-nM inhibitor of elastase and other proteases (Leatherbarrow and Salacinski, supra). For the cyclic peptide to be an inhibitor with similar potency to Ci2b, it presumably should have low energy conformers of similar structure and rigidity to the inhibitor loop of Ci2b. In its free form and in its complex with subtilisin Novo [McPhalen and James, Biochemistry 26:261-9 (1987)], the overall backbone of the inhibitor loop is roughly planar, with the R67 and F69 side chains filling the interior of the loop. The edge which docks into the subtilisin binding site is an irregular beta sheet, with the side chains of I56, T58, M59, and Y61 extending into solution or into subtilisin in the bound complex, and the side chain of M59 noticeably bent from the solution structure when docked with subtilisin. The structure of the native loop and the energy distribution of the minimized conformers (two trajectories, 5.4 ns, 2700 structures) of this cyclic peptide were determined (data not shown). The energy distribution is roughly Gaussian, as shown previously for cyclic melanotropin analogs (Al-Obeidi et al., supra). The low energy conformers found for the cyclic peptide mimicking the loop appear to be significantly more compact and globular than the native inhibitor loop (data not shown), and they have backbone atom root mean square deviations from the 18mer inhibitor loop of 6.22, 6.12, 5.46, 6.31, 5.72, 5.69 and 5.64 Å, respectively. Residues 3-10, which form much of the subtilisin-contact region and which surround the reactive site of the inhibitor loop (met 7-glu 8), have heavy atom root mean square deviations from the same residues in the native loop of 5.65, 6.40, 3.95, 5.54, 5.20, 5.07 and 4.59 Å, respectively. In addition, the side chains of R65, R67 and F69 are not buried inside the loop region in any of these low energy conformers.
These results are consistent with significantly different structures for the low energy conformers when compared to Ci2b's inhibitory loop. These significant structural differences are consistent with the failure to observe inhibition of elastase with this cyclic 18mer.
We next applied quenched molecular dynamics to look at low energy conformers of EFLIVKS when dimerized. These gas phase calculations may be particularly relevant to the low energy forms of peptide dimers observed by mass spectrometry. Peptide dimers were constructed by either tethering the two peptides together at approximately their centers of mass, or binding different parallel and antiparallel starting configurations (see the methods section) together in the gas phase before starting the conformation search. A total of 7 trajectories were run, covering 5250 minimized structures and 10.5 ns (data not shown) and a backbone overlay of the 14 lowest energy dimers, covering the lowest 7 kcal/mole energy bandwidth, from all of the trajectories was performed (data not shown). This was created by a least-squares superposition of all backbone atoms of one of the peptide dimerizers. Both peptides appear to adopt a turn conformation, but are not symmetric across the inter-dimer axis.
In addition, a cluster graph of the 14 lowest energy conformers was created, in which the backbone atoms of each conformer are compared to those of every other conformer by their RMSD (data not shown). Two conformers are most similar if their RMSD difference is in the 0-1 Å range. There appears to be one main family of low energy conformers, and several other unique conformations (data not shown). For all of the peptides, both the N- and C-terminus of the first peptide, which are charged in these simulations, appear to be close to the N- and C-terminus of the second peptide of the dimer. Since each peptide has two acidic and two basic groups, there are a number of different intra-dimer ion pairs which are possible. Examination of the distances for all possible inter-dimer ion pairs in all 14 low energy conformers suggests that the most stable ion pairs are a) peptide 1: N-terminus to peptide 2: glu 1: side chain carboxylate; b) peptide 1: lys 6: ε-amine to peptide 2: glu 1: side chain carboxylate; and c) peptide 2: C-terminus to peptide 1: lys 6: ε-amine. Both peptides form somewhat stable intramolecular ion pairs between their own N- and C-termini as well.
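The pairwise comparison behind such a cluster graph can be sketched as follows. This is an illustrative outline only: it assumes the conformers have already been superimposed (the superposition step is not shown), uses toy coordinates rather than data from the specification, and the function names are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Coord = Tuple[float, float, float]


def rmsd(a: Sequence[Coord], b: Sequence[Coord]) -> float:
    """Root mean square deviation between two equally sized, pre-aligned atom sets."""
    if len(a) != len(b) or not a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    total = sum((xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
                for (xa, ya, za), (xb, yb, zb) in zip(a, b))
    return math.sqrt(total / len(a))


def pairwise_rmsd(conformers: Sequence[Sequence[Coord]]) -> List[List[float]]:
    """Symmetric matrix of RMSD values between every pair of conformers."""
    n = len(conformers)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            matrix[i][j] = matrix[j][i] = rmsd(conformers[i], conformers[j])
    return matrix


def similar_pairs(matrix: List[List[float]], cutoff: float = 1.0) -> List[Tuple[int, int]]:
    """Index pairs (i < j) whose RMSD lies within the 0 to `cutoff` Å range."""
    n = len(matrix)
    return [(i, j) for i in range(n) for j in range(i + 1, n) if matrix[i][j] <= cutoff]


if __name__ == "__main__":
    # Toy two-atom "conformers"; real input would be backbone atom coordinates.
    toy = [
        [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
        [(0.0, 0.0, 0.5), (1.0, 0.0, 0.5)],
        [(0.0, 0.0, 5.0), (1.0, 0.0, 5.0)],
    ]
    print(similar_pairs(pairwise_rmsd(toy)))
```

Pairs that fall under the 1 Å cutoff would be drawn as the closest links in the graph; a family of conformers then shows up as a group of mutually linked indices.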
Quenched molecular dynamics was also used to examine low energy structures of the Ci2b 18mer test insert fused to EFLIVKS at each end (data not shown). This peptide is relatively inert to elastase, has 37 slowly exchanging protons, and shows no evidence for higher order aggregates (data not shown) when observed by mass spectrometry. A total of 6900 different structures were collected from 12.3 ns of dynamics trajectories. These structures were distributed in a Gaussian distribution (data not shown).
Two conformers were at least 7 kcal/mole lower in energy than all others (data not shown). Both conformers appear compact and globular, consistent with other experimental results above. As with the EFLIVKS dimer modeled above, each terminal EFLIVKS attached to the 18mer insert appears to form a turn, and their N- and C-termini are within 3.8-4.3 Å. However, unlike the structure of the EFLIVKS dimer, their second termini, which are now fused to the 18mer insert, do not loop back to the center of the molecule, but are instead 11.5-15 Å apart in the two conformers. This distance is significantly greater than the comparable distance in the native Ci2b structure (4.1 Å) and in the cyclic peptide low energy conformers (6.88 Å on average), suggesting that the dimerizer peptides, at least with this insert, form a "loop" with a fairly wide base. The 18mer insert also appears to contain a significant proportion of turn structure, consistent with circular dichroism measurements.
Example 11 Circular dichroism studies, NMR measurements and peptide conformation searches on peptide constructs
Circular dichroism measurement
CD spectra were recorded on an AVIV 62A DS CD spectropolarimeter (Lakewood, NJ, USA) equipped with a Peltier temperature control unit. The temperature of the instrument was maintained constantly below 20 °C using a Neslab CFT-33 refrigerated recirculator water bath. The device was periodically calibrated with the ammonium salt of (+)-10-camphorsulfonic acid according to the manufacturer's recommendations. Spectra were recorded between 250 and 195 nm at 0.2 nm intervals with a time constant of 1 s at 25 °C. Data were collected from five separate scans and averaged using an IBM PS/2 computer. A cylindrical quartz cell of path length 0.1 cm was used for the spectral range with the sample concentration of 0.02-0.05 mM as determined by amino acid analysis. Peptide stock solutions (1 mM) were made in 10 mM KPO4 buffer containing 100 mM KF at pH 7.5 except as noted. For pH titration experiments, the pH of the buffer was carefully adjusted to the desired value using either 0.1 M HCl or 0.1 M NaOH before adding the above peptide stock solution. Mean residue ellipticity (MRE) in deg·cm²·dmol⁻¹ was obtained through the equation MRE(λ) = θ(λ)/(10·l·c·n), where θ(λ) is the ellipticity in degrees at wavelength λ, l is the path length in cm, c is the concentration in M, and n is the number of residues in the peptide/protein [Schmidt, in Protein Structure: A Practical Approach, IRL Press, New York, pp251-285 (1989)]. Raw data collected from individual experiments were converted to an ASCII format and the plots were created using the Microsoft Excel software package as described previously [Gururaja and Levine, Peptide Res. 9:283-289 (1996)]. Thermal denaturation data were taken on samples containing 20 µM peptide in 10 mM KPO4 buffer containing 100 mM KF at pH [...]. The thermal denaturation was measured at 220 nm over a range of 4-98 °C with a temperature step of 2 °C, a 2 min equilibration time and a 60 s signal averaging time. Apparent Tm was calculated as the maximum of the first derivative of the CD signal at 220 nm with respect to T⁻¹. CD spectra were deconvoluted using the program Dichroprot v. 2.4, which uses the variable selection method of Johnson.
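The conversion from raw ellipticity to mean residue ellipticity can be sketched in a few lines. This is a hedged illustration: the function name and the example reading are hypothetical, and the sketch takes the ellipticity in millidegrees, the unit for which a factor of 10 in the denominator yields deg·cm²·dmol⁻¹ directly; the text itself states the ellipticity in degrees, so the unit choice here is an assumption about the intended convention.

```python
def mean_residue_ellipticity(theta_mdeg: float,
                             path_cm: float,
                             conc_molar: float,
                             n_residues: int) -> float:
    """Mean residue ellipticity in deg*cm^2*dmol^-1.

    theta_mdeg : observed ellipticity in millidegrees (assumed unit)
    path_cm    : optical path length in cm
    conc_molar : peptide concentration in mol/L
    n_residues : number of residues in the peptide
    """
    if path_cm <= 0 or conc_molar <= 0 or n_residues <= 0:
        raise ValueError("path length, concentration and residue count must be positive")
    return theta_mdeg / (10.0 * path_cm * conc_molar * n_residues)


if __name__ == "__main__":
    # Hypothetical reading of -10 mdeg for a 32-residue construct at 0.05 mM
    # in the 0.1 cm cell described in the text.
    print(mean_residue_ellipticity(-10.0, 0.1, 0.05e-3, 32))
```

Normalizing by the residue count is what allows spectra of constructs of different length, such as the 23mer and 32mer examined here, to be compared on one scale.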
NMR measurements
All deuterated solvents such as D2O (99.96% D) and DCl (99.5% D) for NMR experiments were purchased from Cambridge Isotope Laboratories (Andover, MA). Samples ([...] mM) were prepared by dissolving the synthetic peptide in 0.7 ml of H2O:D2O 90:10 or 100% D2O. The sample in water was prepared by the dissolution of HPLC purified peptide, adjusting the pH to 4.0 with HCl or DCl. All pH values were measured at room temperature; the values reported herein are apparent pH values and were not corrected for the deuterium isotope effect. TSP [3-(trimethylsilyl)propionic-2,2,3,3-d4 acid, sodium salt] was used as an internal chemical shift standard.
¹H NMR experiments were performed on a Varian Unity INOVA-500 spectrometer at 25 °C equipped with a Sun SPARCstation 5 as described previously [Naganagowda et al., J. Biomol. Struct. Dynam. 16:91-107 (1998)]. Two dimensional Double Quantum Filtered Correlated Spectroscopy (DQF-COSY) [Rance et al., Biochem. Biophys. Res. Commun. 117:479-485 (1983)], Total Correlation Spectroscopy (TOCSY) [Bax and Davis, J. Magn. Reson. 65:355-360 (1985)], Rotating frame Overhauser Enhancement Spectroscopy (ROESY) [Bothner-By et al., J. Am. Chem. Soc. 106:811-813 (1984)] and Nuclear Overhauser Enhancement Spectroscopy (NOESY) [Macura and Ernst, Mol. Phys. 41:95-117 (1980)] experiments were acquired in pure phase absorption mode with quadrature detection in the t1 dimension using the hypercomplex method [States et al., J. Magn. Reson., 48:286-292 (1982)]. The carrier was placed on the water resonance to enable irradiation of the water during the relaxation delay (1.5 to 2.5 s) and during the mixing time in NOESY experiments. For TOCSY spectra, the MLEV-17 sequence was used with a spin lock time of 50 to 85 ms. For ROESY experiments spin lock times of 200 and 250 ms were used, while for NOESY, mixing times of 200 and 300 ms were used. The ¹H NMR spectrum in H2O had a spectral width of 5400 Hz in both dimensions. In D2O solvent, after complete exchange of the amide protons, the spectrum was recorded by reducing the spectral width to 3000 Hz in both dimensions. 256 or 512 t1 increments were acquired with a size of 1024 or 2048 data points. Slowly exchanging amide protons were identified by dissolving the samples in D2O and recording 1D and TOCSY spectra immediately. For temperature coefficient measurements of the amide protons, 1D and TOCSY experiments were performed between 25 and 50 °C in steps of 5 °C. Typically 16 or 32 scans were collected for DQF-COSY and TOCSY spectra, and 64 scans for ROESY and NOESY spectra. Prior to Fourier transformation, the free induction decays (FIDs) were zero filled once in both dimensions. For processing of DQF-COSY spectra, a squared sine-bell window function shifted by 90° was used in both dimensions, whereas for the TOCSY, ROESY and NOESY spectra, the data were processed separately, using 90° and 45° shifted squared sine-bell window functions.
The distances for structure determination were deduced from NOE cross peak intensities in the 2D-NOESY spectrum obtained with 200 ms mixing time in water. Ranges of interproton distances were calculated by comparing the volume of the cross peaks and were categorized into three classifications, 1.8-2.5 Å (strong), 2.5-3.5 Å (medium), and 3.5-5.0 Å (weak), for the distance geometry calculations. The vicinal coupling constants ³JNH-CαH of each residue were taken from the NMR studies and used to estimate possible torsional angles via the Karplus relationship [Karplus, J. Chem. Phys. 30:11-15 (1959)]: ³JNH-CαH = A cos²θ − B cosθ + C, where θ = |φ − 60°|. The A, B and C constants proposed by Pardi et al., J. Mol. Biol. 180:741-751 (1984) have values of 6.4, 1.4 and 1.9, respectively. The techniques used to obtain conformational data will only be briefly summarized as these have been discussed in great detail elsewhere. Delineation of conformation from NMR is based purely on the measurement of torsional angles along the polypeptide chain using two-dimensional NMR data acquired at high magnetic field strength. Specifically, to assign protons that are coupled through bonds, TOCSY experiments are performed. Sequential assignments, for example Hα(i)-NH(i+1), are based on NOESY and ROESY experiments, in which a correlation is observed between protons in close spatial proximity, which is then an indicator of conformation.
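The Karplus relationship can be evaluated directly to see which backbone angles are compatible with a measured coupling constant. The sketch below assumes the usual form of the Pardi parameterisation, J = A cos²θ − B cosθ + C with θ = |φ − 60°| and A = 6.4, B = 1.4, C = 1.9 Hz; the function name and the example φ values (about −57° for a helix, −120° for a strand) are illustrative choices rather than values from the specification.

```python
import math


def karplus_j(phi_deg: float, a: float = 6.4, b: float = 1.4, c: float = 1.9) -> float:
    """Predicted 3J(NH-CaH) in Hz for backbone torsion phi (degrees).

    Uses J = A*cos^2(theta) - B*cos(theta) + C with theta = |phi - 60 deg|.
    """
    theta = math.radians(abs(phi_deg - 60.0))
    return a * math.cos(theta) ** 2 - b * math.cos(theta) + c


if __name__ == "__main__":
    for label, phi in (("helix-like", -57.0), ("strand-like", -120.0)):
        print(f"{label}: phi = {phi:.0f} deg, J = {karplus_j(phi):.2f} Hz")
```

With these constants a helical φ gives a coupling near 4 Hz and an extended φ gives one above 9 Hz, so intermediate values point to conformational averaging rather than a single regular structure.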
Peptide conformation searches
Low energy conformers of different peptide constructs were generated as follows. Explicit atom models of the peptide constructs were built using Insight II 95.0 (Molecular Simulations Inc., San Diego, CA) and the cff91 forcefield [Maple et al., J. Comp. Chem. 15:162-182 (1994)]. Peptides were modeled as zwitterions, with lys, arg, asp and glu fully ionized in addition to the N-terminal amine and C-terminal carboxylate. The effects of an aqueous solvent environment and counterion screening were simulated by the use of a linear distance-dependent dielectric constant. The Verlet algorithm [Verlet, Phys. Rev. 159:98-103 (1967)] with a time step of 1 fs was used to integrate the equations of motion; this was implemented as the default leapfrog algorithm of Discover 2.9.7. A 15 Å cutoff was used for nonbonded interactions. Peptide bonds were restrained to the trans conformation at high temperatures using a torsional restraint of 5 kcal/mol/rad². In the dynamics protocol, based on modifications of a program written by Mackay et al. [Mackay et al., in Prediction of Protein Structure and the Principles of Protein Conformation, (Fasman, ed.; New York, Plenum Press) pp 317-358 (1989)], the starting peptide structures were first minimized using 300 steps of steepest descent and 1000 steps, or as many steps as necessary, of conjugate gradient minimization so that the maximum energy derivative was less than 0.1 kcal/Å, to remove high energy structures created during construction of the molecule. The peptide atoms were assigned random initial velocities using the dseed variable, and the peptide was heated to 900 K over 2 ps. Individual trajectories were continued for times varying from 400 ps to 3 ns, with individual structures collected every 1-2 ps for subsequent minimization. Each saved structure was equilibrated at 900 K for 50 fs, cooled to 300 K over 5 ps, and minimized with 300 steps of the steepest descents algorithm followed by Fletcher-Reeves conjugate gradient minimization using as many steps as necessary to give a maximum energy derivative of less than 0.001 kcal/mole/Å. The minimized total energy vs. number of conformers in individual 5 kcal/mole windows was plotted for each peptide. The conformers in the lowest 5 kcal/mole window above the minimum energy [O'Connor et al., J. Med. Chem. 35:2870-81 (1992)] were selected for further analysis.
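The final selection step of this protocol, binning minimized energies into 5 kcal/mole windows and keeping the lowest window, can be sketched as below. The function names and the toy energies are hypothetical; only the window width and the selection rule come from the text.

```python
import math
from collections import Counter
from typing import Dict, List, Sequence


def window_counts(energies: Sequence[float], width: float = 5.0) -> Dict[int, int]:
    """Number of conformers per energy window, counted upward from the minimum.

    Window 0 covers [E_min, E_min + width), window 1 the next `width`, and so on.
    """
    e_min = min(energies)
    return dict(Counter(math.floor((e - e_min) / width) for e in energies))


def lowest_window(energies: Sequence[float], width: float = 5.0) -> List[int]:
    """Indices of conformers within `width` kcal/mole above the minimum energy."""
    e_min = min(energies)
    return [i for i, e in enumerate(energies) if e - e_min < width]


if __name__ == "__main__":
    # Toy minimized energies in kcal/mole, not data from the specification.
    energies = [-120.0, -118.5, -116.0, -114.9, -109.0, -101.0]
    print(window_counts(energies))
    print(lowest_window(energies))
```

The per-window counts correspond to the energy distribution plot described in the text, and the indices returned by `lowest_window` identify the conformers carried forward for structural comparison.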
Starting structures for the different peptides were obtained as follows. For dimerized EFLIVKS, extended structures were aligned in a parallel or anti-parallel fashion, with the Cγ1 of ile 4 ca. 7 Å apart, giving 4 different starting structures. Two extended structures (parallel and antiparallel) were tethered together with an energy penalty of 100 kcal/mole when the distance between the Cγ1 of ile 4 of both peptides was outside of the range of 1.5-12 Å. For the putative protease inhibitor cyclic[CGTIVTMEYRIDRTRSFC] the initial structure was a mixture of right handed alpha helix and beta sheet allowing formation of a disulfide bond between the two terminal cysteines. A second run started with a partially minimized version of the first structure.
For the peptide dimer-constrained construct EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS, several different starting structures were used. One started from the Ci2b-based structure (PDB file 2CI2) of the 18mer insert, which was derived by removing all residues from the crystal structure except for the inhibitor loop, and mutating individual residues to give the 18mer sequence reported in Leatherbarrow and Salacinski (supra). EFLIVKS in an extended conformation was fused to each end of the peptide and the resulting construct was minimized as above. A second structure started from EFLIVKS fused as a beta sheet to each end of the 18mer Ci2b insert. A third started from EFLIVKS fused as a right handed alpha helix to each end of the 18mer Ci2b insert. A fourth started from an extended conformation for the entire construct, and a fifth started from a different partially extended conformation. A sixth run started with the entire construct as a beta sheet.
Since the peptides studied here are soluble at neutral or near-neutral pH at levels well below the millimolar range needed for an NMR structure determination, we examined their solution structure using circular dichroism (CD). Circular dichroism measurements are sensitive to the secondary structure of both peptides and proteins, and have been extensively used to examine the conformation of both [Bloemendal and Johnson, Pharm. Biotechnol. 7:65-100 (1995); Woody, Methods Enzymol. 246:34-71 (1995); Greenfield, Anal. Biochem. 235:1-10 (1996)]. Here these measurements are used to examine the pH-dependence of secondary structure formation and stability, to compare the effects on insert structure of different dimerizers, to examine the effects of mutations in the dimerizers, and to look at the effects of different insert sequences on the overall structure of dimerizer-constrained loops. When these measurements are combined with measurements of proteolytic susceptibility, deuterium exchange, and the results of conformational searches, they give information on the overall structure and folding of the mini-loops examined here.
EFLIVKS-dimerized 9mer insert
The first insert examined was EFLIVKS-STKSIPPQS-EFLIVKS. The 9mer insert represents an analog of the protease inhibitor cyclic[CTKSIPPQC] (Gariani and Leatherbarrow, supra). The CD spectrum was recorded between pH 3.5 and 8.5 (data not shown). A pH-dependent transition in secondary structure was observed. At pH 3.5, a secondary structure with a strong minimum at 201 nm was seen. While this is near the expected minimum for a random coil [Greenfield and Fasman, Biochemistry 8:4108-4116 (1969)] of 195-197 nm, the shape of the spectrum is also similar to that of a type I beta turn observed in a short peptide [Perczel et al., Int. J. Peptide Protein Res. 41:223-236 (1993)].
¹H-NMR examination of low pH structure
As this CD spectrum was seen with a number of other inserts under defined conditions, and since this peptide was quite soluble at low pH, we examined this structure using NMR. The resonance assignments of the ¹H-NMR spectrum of the 9mer insert in water were made by standard sequential assignment procedures [Wuthrich, in NMR of Proteins and Nucleic Acids, New York, Wiley-Interscience, pp 166ff (1986)]. The assignments of ¹H resonances were accomplished by the combined analyses of 2D-TOCSY and 2D-NOE spectra. The 2D-TOCSY spectrum was also recorded at various temperatures (25 to 50 °C) to resolve overlapping connectivities for unambiguous assignments, and was also used to determine the temperature coefficients of the NH chemical shifts. The resonances buried under the water signal (in 90% H2O) were assigned by recording the spectra in 100% D2O. The chemical shifts of all the assigned protons are listed in Table 5. The temperature coefficients of NH chemical shifts, the ¹H/²H exchange rate of amide groups, ³JNH-CαH values, and a set of characteristic strong, medium, and weak NOE connectivities have been used as criteria to examine whether the peptide has any preferred backbone conformation in aqueous solution.
The temperature coefficients of all amide resonances are found to be 0.004 ppm (data not shown), suggesting that the backbone NH groups are exposed to the solvent and not involved in any intramolecular hydrogen bonding interactions. The fast ¹H/²H exchange rate observed for all backbone amide resonances provides further evidence that the amide groups are not involved in any intramolecular hydrogen bonding. The prevalence of strong and weak NOEs and a continuous stretch of weak and medium NOEs, in the absence of any observable dNN NOE interactions, indicates that the backbone dihedral angles are predominantly in the unfolded extended region of φ, ψ space [Rance et al., Biochem. Biophys. Res. Commun. 117:479-485 (1983); Pardi et al., J. Mol. Biol. 180:741-751 (1984)]. The ³JNH-CαH values provided in Table 5 are in the range of 6.5 to 8.4 Hz for all residues except Ser-7. For a regular β-strand, the ³JNH-CαH is expected to be 9 Hz, while for α-helix it is 4.0 Hz [Rance et al., Biochem. Biophys. Res. Commun. 117:479-485 (1983); Pardi et al., J. Mol. Biol. 180:741-751 (1984)]. The coupling constants of 6.5-8.4 Hz observed for this peptide suggest the existence of populations of unfolded non-hydrogen-bonded conformations of comparable energy with φ values exceeding the regular helical region. Collectively, the NMR data provide evidence that EFLIVKS-STKSIPPQS-EFLIVKS is unstructured in aqueous solution.
Table 5. Compilation of ¹H Chemical Shift values for EFLIVKS-STKSIPPQS-EFLIVKS at pH [...]
Xaa NH CαH ³JNH-CαH CβH CγH CδH CεH
E-1 3.870
F-2 7.941 3.954 8.13 3.005 2.805
L-3 8.060 4.193 7.80 1.347 1.347 0.722 1.347 0.670
I-4 7.926 3.978 7.92 1.678 1.380 0.713 1.025
V-5 8.116 3.946 8.22
K-6 8.342 4.190 6.54 1.680 1.270 1.525 2.827 1.590 1.270 1.525 2.827
S-7 8.237 4.300 6.17 3.710 3.710
S-8 8.338 4.369 6.66 3.775 3.715
T-9 8.080 4.180 7.23 4.060 1.048
K-10 8.182 4.208 6.68 1.690 1.280 1.525 2.839 1.590 1.280 1.525 2.839
S-11 8.090 4.320 7.58
I-12 8.114 4.320 8.02 1.700 1.344 0.771 1.030
P-13 4.196
P-14 4.266
Q-15 8.365 4.117 6.96 1.950 2.240 1.860 2.240
S-16 8.146 4.212 6.73
E-17 8.110 4.096 6.93 2.090 2.148 1.876
F-18 7.946 4.438 8.22 3.005 2.805
L-19 7.822 4.181 8.02 1.487 1.487 0.738 1.487 0.694
I-20 8.045 3.948 7.97 1.642 1.350 0.720 1.020
V-21 8.151 3.936 8.19 1.860 0.748 0.748
K-22 8.295 4.220 6.73 1.700 1.290 1.530 2.835 1.600 1.290 1.530 2.835
S-23 8.173 4.290 7.79
At pH 4.5, a different secondary structure was observed, which remained stable up to pH 8.5. The CD spectra at pH 4.5-7.5 had a much diminished band at 202 nm, indicating a loss of random coil. They also had a slight positive band at ca. 210-215 nm, and a negative band around 228-230 nm, indicating the presence of beta turns [Brahms and Brahms, J. Mol. Biol. 138:149-178 (1980)]. Since at pH 5.0 this construct has 3 or fewer slowly exchanging protons (table [...]), the peptide may be unfolded at this pH (i.e. it has no tertiary structure) but with a secondary structure containing some beta turn and significantly less random coil than at pH 3.5. Alternatively, if it is folding and has some tertiary structure, the backbone is mobile enough so that no amide protons are sequestered from solvent for a long period of time. When observing the CD spectrum at 225 nm, the structure present at pH 7.5 has a Tm of 39.6 ± 1 °C (data not shown).
EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS
A second construct examined by CD contained the Ci2b 18mer insert, EFLIVKS-VGTIVTMEYRIDRTRSFV-EFLIVKS. The pH-dependence of the CD spectrum of this peptide was determined (data not shown). Unlike that of the first peptide examined above, the CD spectrum is not as pH-dependent, and does not appear to have a major amount of random coil. The strong maximum around 210 nm and strong minimum at 225-230 nm are consistent with a significant content of beta turn structure at all pH values examined [Brahms and Brahms, J. Mol. Biol. 138:149-178 (1980)]. The smaller minimum seen at ca. 200 nm is consistent with a small percent of random coil, or the presence of a type II beta turn [Perczel et al., Int. J. Peptide Protein Res. 41:223-236 (1993)]. Using the signal at 225 nm, the peptide can be melted with temperature, with a Tm of 39.85 ± 1.6 °C.
Constructs with an N-terminal MG- and C-terminal -GPP
For peptide expression in live cells, MG- was added to the N-terminus of a number of peptide constructs, and -GPP was added to the C-terminus to block proteolysis by cellular carboxypeptidase [Vanhoof et al., FASEB J. 9:736-44 (1995)]. The CD spectra of a variety of these peptides were then compared at pH 7.5 (data not shown). Examination of the pH-dependence of the CD spectrum of MGEFLIVKS-Ci2b insert-EFLIVKSGPP was performed (data not shown) and suggests that the additional five residues cause significant changes in the CD spectrum compared to EFLIVKS-Ci2b insert-EFLIVKS. The positive band at 208 nm is no longer distinct, and the negative band at 200 nm has disappeared. The major minimum around 225 nm (characteristic of some beta turn structure) remains.
Thus the addition of these five residues appears to cause distinct conformational changes, but not unfolding of the structure.
Addition of other insert sequences also resulted in rather different CD spectra. An insert consisting of the flag epitope tag with glycine spacers, -G4DYKDDDDKG4-, designed to allow detection of expressed peptide in cells using Western blots, resulted in a CD spectrum containing a minimum at ca. 202 nm and a small minimum at ca. 220 nm (data not shown). Based on the similarity of this spectrum to that of EFLIVKS-STKSIPPQS-EFLIVKS, this peptide appears to be mainly random coil between pH 3.5 [...]. This construct does not have slowly exchanging protons, consistent with its unfolded structure. An insert consisting of the influenza hemagglutinin epitope tag with glycine spacers, -G4YPYDVPDYASLG3-, gives a CD spectrum with a minimum at 205-207 nm and a second smaller minimum at ca. 220 nm (data not shown). This may be due to a somewhat different composition of secondary structures, and could include some alpha helix (due to the minimum at 205-207 nm) as well as random coil or beta turn. Since this construct also did not have slowly exchanging protons (table [...]), the CD spectrum may reflect the presence of only secondary structure.
Other additions to the EFLIVKS sequence
The effects of mutations in the EFLIVKS sequence on the CD spectrum of the Ci2b peptide insert were determined (data not shown). The peptide EEFLIVKKS-Ci2b insert-EEFLIVKKS is of particular interest, since it has 23 slow-exchanging protons and 8 intermediate-exchanging protons (table 4) and thus may have tertiary structure, and because this dimerizer may have a somewhat higher self-affinity than EFLIVKS. It gives a CD spectrum which is similar to that of the control peptide, except that the minimum at 202 nm is missing, and the maximum at 210 nm (control peptide) is shifted closer to 207 nm.
This peptide thus appears to have beta turn structure and less random coil than the control peptide.
To increase the solubility of the structure, lysines were added to the N-terminus with a glycine spacer. For the construct K6G4-EFLIVKS-Ci2b insert-EFLIVKS, a very different CD spectrum was obtained than for the control peptide, with a broad minimum at ca. 220 nm (data not shown). This spectrum does not appear to be characteristic of any one dominant secondary structure, but can be deconvoluted to a mixture of beta sheet and beta turn, alpha helix, and the rest random coil. Since this structure has at most 5 slowly exchanging protons (table [...]), the additional residues added to the N-terminus appear to have destabilized the tertiary structure of the control peptide, while creating a different secondary structure.
Mutations in the EFLIVKS sequence
Three charge modifications of the dimerizer sequence were tested at pH 7.0. In one peptide, a single lys and glu were switched between dimerizers, giving KFLIVKS-Ci2b insert-EFLIVES. In a second peptide, the glutamate of each dimerizer was mutated to lysine, giving KFLIVKS-Ci2b insert-KFLIVKS. In a third peptide, the lys of each dimerizer was mutated to glu, giving EFLIVES-Ci2b insert-EFLIVES. Each peptide had a CD spectrum resembling that of the control peptide of EFLIVKS-Ci2b insert-EFLIVKS (data not shown). In a second set of mutations, the hydrophobic character of the dimerizer was changed. First, F2 and I4 in both EFLIVKS sequences were mutated to lysine or to serine, giving a dimerizer sequence on each terminus of the Ci2b insert of EKLKVKS or ESLSVKS. This resulted in a major change in the CD spectrum, with the appearance of a large negative band at 202-205 nm, indicating a significant increase in random coil structure, or denaturation (data not shown).
Second, only I4 was mutated to lysine in each dimerizer. This also changed the CD spectrum in a similar fashion (data not shown). This construct had at most 1-2 slowly exchanging protons, suggesting that this single change in the hydrophobic core of the EFLIVKS sequence was sufficient to disrupt the structure of the entire peptide construct.
The reference to any prior art in this specification is not, and should not be taken as, an acknowledgment or any form of suggestion that that prior art forms part of the common general knowledge in Australia.
Throughout this specification and the claims which follow, unless the context requires otherwise, the word "comprise", and variations such as "comprises" and "comprising", will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.
Claims (16)
1. A composition comprising at least a first dimerization peptide that is no more than 8 amino acids long, comprising the sequence NH2-FLIVX5-COOH and wherein X5 is selected from the group consisting of K, R, D and E.
2. A composition according to claim 1, wherein at least said first dimerization peptide comprises the sequence NH2-FLIVK-COOH.
3. A composition according to claim 1, wherein at least said first dimerization peptide comprises the sequence NH2-X0-FLIVX5-COOH wherein X0 is selected from the group consisting of K, R, D and E.
4. A composition according to claims 1, 2 or 3, further comprising a first protein, wherein at least said first dimerization peptide is covalently joined to said first protein forming a first fusion protein.
5. A composition according to claim 4, further comprising a second dimerization peptide covalently joined to said fusion protein, wherein said second dimerization peptide is no more than 8 amino acids long and comprises the sequence NH2-X1-X2-X3-X4-X5-COOH, and wherein X1, X2, X3 and X4 are selected from the group consisting of amino acids A, V, I, L, W, F, M and Y and X5 is selected from the group consisting of K, R, D and E.
6. A composition according to claim 5, wherein said first dimerization peptide is joined to the N-terminus of said first protein and said second dimerization peptide is joined to the C-terminus of said first protein.
7. A composition according to claim 5, wherein at least one of said first dimerization peptide or said second dimerization peptide is covalently joined to an internal position of said first protein.
8. A composition according to claim 4, 5, 6, or 7, wherein at least one of said first dimerization peptide or said second dimerization peptide is covalently joined to said first protein via a linker.
9. A composition according to any one of claims 1 to 8, further comprising a fusion partner.
10. A molecular library comprising a plurality of members each comprising a composition according to any one of claims 4 to 9, wherein each of said members comprises a different first protein.
11. A recombinant nucleic acid encoding the composition of any one of claims 1 to
12. An expression vector comprising the recombinant nucleic acid of claim 11.
13. An expression vector according to claim 12, further comprising a GFP gene.
14. A host cell comprising the recombinant nucleic acid of claim 11.
15. A method of producing the composition of any one of claims 1 to 9 comprising: a) providing a cell according to claim 14; and b) subjecting said cell to conditions, whereby said composition is expressed.
16. A method for screening for compositions capable of altering the phenotype of a cell, said method comprising: a) introducing a recombinant nucleic acid according to claim 11 into a plurality of cells; b) subjecting said plurality of cells to conditions whereby protein encoded by said nucleic acid is expressed; c) screening said plurality of cells for a cell exhibiting an altered phenotype.
17. A method according to claim 16, further comprising isolating said cell exhibiting an altered phenotype.
18. A method according to claim 16, further comprising isolating a nucleic acid from said cell.
19. A method according to claim 16, further comprising isolating a target molecule.
20. A composition according to any one of claims 1 to 9 substantially as hereinbefore described with reference to the drawings and/or examples.
21. A method according to any one of claims 15 to 19 substantially as hereinbefore described with reference to the drawings and/or examples.
DATED this 23rd day of May 2002
RIGEL PHARMACEUTICALS, INC.
by DAVIES COLLISON CAVE
Patent Attorneys for the Applicants
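The sequence limitations recited in claims 1, 3 and 5 can be read as simple patterns over one-letter amino acid codes. The following is a minimal sketch of such a check, offered for illustration only and not as a legal construction of the claims. The function and constant names are hypothetical, and "comprising the sequence" is assumed here to mean that the motif occurs anywhere within a peptide of at most 8 residues.

```python
import re

# Hypothetical pattern names; the residue sets are taken from the claim language.
CHARGED = "KRDE"             # allowed residues for X5 (and for the residue before FLIV in claim 3)
HYDROPHOBIC = "AVILWFMY"     # allowed residues for X1 to X4 in claim 5

CLAIM1_MOTIF = re.compile(rf"FLIV[{CHARGED}]")
CLAIM3_MOTIF = re.compile(rf"[{CHARGED}]FLIV[{CHARGED}]")
CLAIM5_MOTIF = re.compile(rf"[{HYDROPHOBIC}]{{4}}[{CHARGED}]")

MAX_LENGTH = 8  # "no more than 8 amino acids long"


def matches(motif, peptide, max_length=MAX_LENGTH):
    """True if peptide is short enough and contains the motif (assumed reading)."""
    peptide = peptide.upper()
    return len(peptide) <= max_length and motif.search(peptide) is not None


if __name__ == "__main__":
    for seq in ("EFLIVKS", "EFLIVES", "EKLKVKS", "FLIVK"):
        print(seq, matches(CLAIM1_MOTIF, seq), matches(CLAIM3_MOTIF, seq))
```

Under this reading, the control dimerizer EFLIVKS satisfies all three patterns, whereas the F2/I4 lysine variant EKLKVKS satisfies none of them because the hydrophobic FLIV run is interrupted.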
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US8044498P | 1998-04-02 | 1998-04-02 | |
US60/080444 | 1998-04-02 | ||
PCT/US1999/007374 WO1999051625A2 (en) | 1998-04-02 | 1999-04-02 | Peptides causing formation of compact structures |
Publications (2)
Publication Number | Publication Date |
---|---|
AU3469399A (en) | 1999-10-25 |
AU752168B2 (en) | 2002-09-05 |
Family
ID=22157430
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU34693/99A Ceased AU752168B2 (en) | 1998-04-02 | 1999-04-02 | Peptides causing formation of compact structures |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP1071705A2 (en) |
JP (1) | JP2002510479A (en) |
CN (1) | CN1302305A (en) |
AU (1) | AU752168B2 (en) |
CA (1) | CA2324284A1 (en) |
NZ (1) | NZ507063A (en) |
WO (1) | WO1999051625A2 (en) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1189934A2 (en) * | 1999-05-14 | 2002-03-27 | Medical Research Council | Oligomeric chaperone proteins |
IL146798A0 (en) | 1999-06-14 | 2002-07-25 | Genentech Inc | A structured peptide scaffold for displaying turn libraries on phage |
AU1351501A (en) * | 1999-10-26 | 2001-05-08 | Mitokor | Gene sequences identified by protein motif database searching |
JP4586275B2 (en) * | 2001-01-17 | 2010-11-24 | 味の素株式会社 | Method for identifying the interface of a composite |
US6914123B2 (en) | 2001-04-17 | 2005-07-05 | Genentech, Inc. | Hairpin peptides with a novel structural motif and methods relating thereto |
EP3192808A1 (en) | 2007-11-27 | 2017-07-19 | The University Of British Columbia | 14-3-3 antagonists for the prevention and treatment of arthritis |
JP5658866B2 (en) * | 2009-04-23 | 2015-01-28 | 花王株式会社 | Evaluation method |
FR2958936A1 (en) * | 2010-04-14 | 2011-10-21 | Sanofi Aventis | ROBO1-FC FUSION PROTEIN AND ITS USE IN THE TREATMENT OF TUMORS |
CA2919583C (en) * | 2013-07-31 | 2018-09-11 | Rinat Neuroscience Corp. | Engineered polypeptide conjugates |
CN114805527A (en) * | 2014-05-21 | 2022-07-29 | 哈佛大学的校长及成员们 | RAS inhibitory peptides and their uses |
CN105399802B (en) * | 2015-12-08 | 2020-04-03 | 中国农业科学院兰州兽医研究所 | Type A foot-and-mouth disease genetically engineered compound epitope protein and vaccine |
CN105777909B (en) * | 2016-03-03 | 2020-04-03 | 中国农业科学院兰州兽医研究所 | A porcine chemokine-mediated targeting complex epitope protein and vaccine for type A foot-and-mouth disease |
CN106220740A (en) * | 2016-08-18 | 2016-12-14 | 中山大学 | Soluble protein BAFF is in B cell In vitro culture and the application of amplification |
CN108866635B (en) | 2017-05-09 | 2021-11-26 | 安升(上海)医药科技有限公司 | Multispecific protein medicine and library thereof, and preparation method and application thereof |
WO2018209052A1 (en) * | 2017-05-10 | 2018-11-15 | Wellstat Immuno Therapeutics, Llc | Enveloped virus resistant to complement inactivation for the treatment of cancer |
CN108864276B (en) * | 2017-05-16 | 2023-02-03 | 上海恒润达生生物科技股份有限公司 | NY-ESO-1-targeted T cell receptor combined expression PD 1antibody variable region and application thereof |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5491074A (en) * | 1993-04-01 | 1996-02-13 | Affymax Technologies Nv | Association peptides |
EP0724593A4 (en) * | 1993-06-11 | 1997-11-26 | Smithkline Beecham Corp | Coiled-coil stem loop templates |
- 1999
- 1999-04-02 NZ NZ507063A patent/NZ507063A/en unknown
- 1999-04-02 CA CA002324284A patent/CA2324284A1/en not_active Abandoned
- 1999-04-02 EP EP99916352A patent/EP1071705A2/en not_active Withdrawn
- 1999-04-02 JP JP2000542346A patent/JP2002510479A/en active Pending
- 1999-04-02 AU AU34693/99A patent/AU752168B2/en not_active Ceased
- 1999-04-02 WO PCT/US1999/007374 patent/WO1999051625A2/en not_active Application Discontinuation
- 1999-04-02 CN CN 99806463 patent/CN1302305A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
NZ507063A (en) | 2003-11-28 |
WO1999051625A2 (en) | 1999-10-14 |
AU3469399A (en) | 1999-10-25 |
EP1071705A2 (en) | 2001-01-31 |
CA2324284A1 (en) | 1999-10-14 |
CN1302305A (en) | 2001-07-04 |
WO1999051625A9 (en) | 2001-07-05 |
JP2002510479A (en) | 2002-04-09 |
WO1999051625A3 (en) | 2000-04-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Owens et al. | MOrPH-PhD: an integrated phage display platform for the discovery of functional genetically encoded peptide macrocycles | |
AU752168B2 (en) | Peptides causing formation of compact structures | |
Simonet et al. | Structural and functional properties of a novel serine protease inhibiting peptide family in arthropods | |
RU2631931C2 (en) | Modulation of structured proteins specificity | |
Duve et al. | Isolation and identification of multiple neuropeptides of the allatostatin superfamily in the shore crab Carcinus maenas | |
US20220090054A1 (en) | Chimeric proteins | |
JP2014205669A (en) | Method of constructing and screening libraries of peptide structures | |
Merlino et al. | Functional selectivity revealed by N-methylation scanning of human urotensin II and related peptides | |
Jakubke et al. | Peptides from A to Z: a concise encyclopedia | |
Jiang et al. | Macrocyclic peptides as regulators of protein-protein interactions | |
Shu et al. | Development of mirror-image screening systems for XIAP BIR3 domain inhibitors | |
US6709814B1 (en) | Peptides causing formation of compact structures | |
Alluri et al. | Isolation and characterization of coactivator-binding peptoids from a combinatorial library | |
Jaulent et al. | Design, synthesis and analysis of novel bicyclic and bifunctional protease inhibitors | |
US6747135B1 (en) | Fluorescent dye binding peptides | |
García‐Pindado et al. | Bromotryptophans and their incorporation in cyclic and bicyclic privileged peptides | |
US20030166003A1 (en) | Structured peptide scaffold for displaying turn libraries on phage | |
Hart et al. | Potent inhibitory ligands of the GRB2 SH2 domain from recombinant peptide libraries | |
Adachi et al. | Investigation on cellular uptake and pharmacodynamics of DOCK2-inhibitory peptides conjugated with cell-penetrating peptides | |
Gauthier et al. | Peptide Synthesis: Methods and Protocols | |
WO2007021661A2 (en) | Ligands of sh3 domains | |
Serwe et al. | Complete amino acid sequence of the regulatory light chain of obliquely striated muscle myosin from earthworm, Lumbricus terrestris | |
Li et al. | Peptide substrate identification for yeast Hsp40 Ydj1 by screening the phage display library | |
WO2004020466A1 (en) | Protein tyrosine phosphatase inhibitors | |
Neukirchen et al. | Impact of the amino acid sequence on the conformation of side chain lactam‐bridged octapeptides |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FGA | Letters patent sealed or granted (standard patent) |