WO2024174680A1 - Building blocks for difficult peptide synthesis and method of making thereof - Google Patents
Building blocks for difficult peptide synthesis and method of making thereof Download PDFInfo
- Publication number
- WO2024174680A1 WO2024174680A1 PCT/CN2023/137784 CN2023137784W WO2024174680A1 WO 2024174680 A1 WO2024174680 A1 WO 2024174680A1 CN 2023137784 W CN2023137784 W CN 2023137784W WO 2024174680 A1 WO2024174680 A1 WO 2024174680A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- unsubstituted
- group
- formula
- substituted
- hydrogen
- Prior art date
Links
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 12
- 238000010647 peptide synthesis reaction Methods 0.000 title claims description 25
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 44
- 238000000034 method Methods 0.000 claims description 121
- 229910052739 hydrogen Inorganic materials 0.000 claims description 96
- 239000001257 hydrogen Substances 0.000 claims description 96
- -1 -CF3) Chemical class 0.000 claims description 85
- 150000001413 amino acids Chemical class 0.000 claims description 70
- 229940024606 amino acid Drugs 0.000 claims description 69
- 235000001014 amino acid Nutrition 0.000 claims description 67
- 125000003118 aryl group Chemical group 0.000 claims description 63
- SMQUZDBALVYZAC-UHFFFAOYSA-N salicylaldehyde Chemical class OC1=CC=CC=C1C=O SMQUZDBALVYZAC-UHFFFAOYSA-N 0.000 claims description 62
- 150000002431 hydrogen Chemical class 0.000 claims description 59
- 150000001875 compounds Chemical class 0.000 claims description 53
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 claims description 50
- 125000000623 heterocyclic group Chemical group 0.000 claims description 50
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 50
- 125000001072 heteroaryl group Chemical group 0.000 claims description 48
- 125000003545 alkoxy group Chemical group 0.000 claims description 40
- 150000002148 esters Chemical class 0.000 claims description 40
- 125000003710 aryl alkyl group Chemical group 0.000 claims description 37
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims description 37
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical group ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 claims description 36
- 125000003342 alkenyl group Chemical group 0.000 claims description 36
- 125000000217 alkyl group Chemical group 0.000 claims description 33
- 125000000304 alkynyl group Chemical group 0.000 claims description 33
- 125000003368 amide group Chemical group 0.000 claims description 32
- 229910052736 halogen Inorganic materials 0.000 claims description 28
- 150000002367 halogens Chemical class 0.000 claims description 28
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 claims description 27
- 239000002253 acid Substances 0.000 claims description 26
- 125000004093 cyano group Chemical group *C#N 0.000 claims description 26
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 claims description 25
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical group CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 claims description 24
- 125000000524 functional group Chemical group 0.000 claims description 24
- 230000004224 protection Effects 0.000 claims description 23
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 claims description 23
- 125000004122 cyclic group Chemical group 0.000 claims description 22
- 229910052760 oxygen Inorganic materials 0.000 claims description 22
- 150000003573 thiols Chemical class 0.000 claims description 22
- 239000000872 buffer Substances 0.000 claims description 20
- 229910052717 sulfur Inorganic materials 0.000 claims description 20
- 125000004178 (C1-C4) alkyl group Chemical group 0.000 claims description 19
- 125000001188 haloalkyl group Chemical group 0.000 claims description 19
- 125000004404 heteroalkyl group Chemical group 0.000 claims description 19
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 claims description 18
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 18
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 18
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 claims description 18
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 18
- 239000004473 Threonine Substances 0.000 claims description 18
- 230000003213 activating effect Effects 0.000 claims description 18
- 125000002252 acyl group Chemical group 0.000 claims description 18
- 235000018417 cysteine Nutrition 0.000 claims description 18
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 18
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 17
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 claims description 17
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 17
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 claims description 17
- 239000003153 chemical reaction reagent Substances 0.000 claims description 17
- 239000007821 HATU Substances 0.000 claims description 15
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 claims description 15
- 239000003960 organic solvent Substances 0.000 claims description 15
- 125000006727 (C1-C6) alkenyl group Chemical group 0.000 claims description 14
- 125000004169 (C1-C6) alkyl group Chemical group 0.000 claims description 14
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 14
- 239000002904 solvent Substances 0.000 claims description 14
- DFVRUHANEXOZGT-UHFFFAOYSA-N tert-butyl n-methyl-n-[2-(methylamino)ethyl]carbamate Chemical compound CNCCN(C)C(=O)OC(C)(C)C DFVRUHANEXOZGT-UHFFFAOYSA-N 0.000 claims description 14
- KXDHJXZQYSOELW-UHFFFAOYSA-N Carbamic acid Chemical group NC(O)=O KXDHJXZQYSOELW-UHFFFAOYSA-N 0.000 claims description 13
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 claims description 13
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 claims description 13
- 125000002843 carboxylic acid group Chemical group 0.000 claims description 13
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 claims description 13
- 125000001501 propionyl group Chemical group O=C([*])C([H])([H])C([H])([H])[H] 0.000 claims description 13
- LJRDOKAZOAKLDU-UDXJMMFXSA-N (2s,3s,4r,5r,6r)-5-amino-2-(aminomethyl)-6-[(2r,3s,4r,5s)-5-[(1r,2r,3s,5r,6s)-3,5-diamino-2-[(2s,3r,4r,5s,6r)-3-amino-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-6-hydroxycyclohexyl]oxy-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl]oxyoxane-3,4-diol;sulfuric ac Chemical compound OS(O)(=O)=O.N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)O[C@@H]1CO LJRDOKAZOAKLDU-UDXJMMFXSA-N 0.000 claims description 12
- 125000003903 2-propenyl group Chemical group [H]C([*])([H])C([H])=C([H])[H] 0.000 claims description 12
- 150000001732 carboxylic acid derivatives Chemical class 0.000 claims description 12
- 229960001639 penicillamine Drugs 0.000 claims description 12
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 claims description 12
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 claims description 11
- KPFBUSLHFFWMAI-HYRPPVSQSA-N [(8r,9s,10r,13s,14s,17r)-17-acetyl-6-formyl-3-methoxy-10,13-dimethyl-1,2,7,8,9,11,12,14,15,16-decahydrocyclopenta[a]phenanthren-17-yl] acetate Chemical compound C1C[C@@H]2[C@](CCC(OC)=C3)(C)C3=C(C=O)C[C@H]2[C@@H]2CC[C@](OC(C)=O)(C(C)=O)[C@]21C KPFBUSLHFFWMAI-HYRPPVSQSA-N 0.000 claims description 11
- 238000010511 deprotection reaction Methods 0.000 claims description 11
- 230000009257 reactivity Effects 0.000 claims description 11
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 11
- 150000001408 amides Chemical class 0.000 claims description 10
- 238000009833 condensation Methods 0.000 claims description 10
- 230000005494 condensation Effects 0.000 claims description 10
- 125000004435 hydrogen atom Chemical group [H]* 0.000 claims description 10
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 claims description 9
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 claims description 9
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 claims description 9
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 claims description 9
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 claims description 9
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 claims description 9
- 125000006242 amine protecting group Chemical group 0.000 claims description 9
- 150000001733 carboxylic acid esters Chemical class 0.000 claims description 9
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 claims description 9
- ASOKPJOREAFHNY-UHFFFAOYSA-N 1-Hydroxybenzotriazole Chemical compound C1=CC=C2N(O)N=NC2=C1 ASOKPJOREAFHNY-UHFFFAOYSA-N 0.000 claims description 8
- XBNGYFFABRKICK-UHFFFAOYSA-N 2,3,4,5,6-pentafluorophenol Chemical compound OC1=C(F)C(F)=C(F)C(F)=C1F XBNGYFFABRKICK-UHFFFAOYSA-N 0.000 claims description 8
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 claims description 8
- LCFXLZAXGXOXAP-QPJJXVBHSA-N ethyl (2e)-2-cyano-2-hydroxyiminoacetate Chemical compound CCOC(=O)C(=N\O)\C#N LCFXLZAXGXOXAP-QPJJXVBHSA-N 0.000 claims description 8
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 claims description 8
- IZUPBVBPLAPZRR-UHFFFAOYSA-N pentachloro-phenol Natural products OC1=C(Cl)C(Cl)=C(Cl)C(Cl)=C1Cl IZUPBVBPLAPZRR-UHFFFAOYSA-N 0.000 claims description 8
- 239000004471 Glycine Substances 0.000 claims description 7
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 7
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 7
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 7
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 7
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 7
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 7
- 125000000852 azido group Chemical group *N=[N+]=[N-] 0.000 claims description 7
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 7
- 229960000310 isoleucine Drugs 0.000 claims description 7
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 7
- 239000004474 valine Substances 0.000 claims description 7
- 125000004209 (C1-C8) alkyl group Chemical group 0.000 claims description 6
- LJCZNYWLQZZIOS-UHFFFAOYSA-N 2,2,2-trichlorethoxycarbonyl chloride Chemical group ClC(=O)OCC(Cl)(Cl)Cl LJCZNYWLQZZIOS-UHFFFAOYSA-N 0.000 claims description 6
- BPSNETAIJADFTO-UHFFFAOYSA-N 2-pyridinylacetic acid Chemical group OC(=O)CC1=CC=CC=N1 BPSNETAIJADFTO-UHFFFAOYSA-N 0.000 claims description 6
- 239000004475 Arginine Substances 0.000 claims description 6
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 6
- 125000004650 C1-C8 alkynyl group Chemical group 0.000 claims description 6
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 6
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 6
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 claims description 6
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 6
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 6
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 claims description 6
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 6
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 6
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 6
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 6
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 6
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 6
- 239000004472 Lysine Substances 0.000 claims description 6
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 claims description 6
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 6
- 230000010933 acylation Effects 0.000 claims description 6
- 238000005917 acylation reaction Methods 0.000 claims description 6
- 235000004279 alanine Nutrition 0.000 claims description 6
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 6
- 235000009582 asparagine Nutrition 0.000 claims description 6
- 229960001230 asparagine Drugs 0.000 claims description 6
- 235000003704 aspartic acid Nutrition 0.000 claims description 6
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 6
- 239000003795 chemical substances by application Substances 0.000 claims description 6
- 235000013922 glutamic acid Nutrition 0.000 claims description 6
- 239000004220 glutamic acid Substances 0.000 claims description 6
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 6
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 6
- 229930182817 methionine Natural products 0.000 claims description 6
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 6
- 239000007790 solid phase Substances 0.000 claims description 5
- YLQBMQCUIZJEEH-UHFFFAOYSA-N tetrahydrofuran Natural products C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 claims description 5
- LMDZBCPBFSXMTL-UHFFFAOYSA-N 1-Ethyl-3-(3-dimethylaminopropyl)carbodiimide Substances CCN=C=NCCCN(C)C LMDZBCPBFSXMTL-UHFFFAOYSA-N 0.000 claims description 3
- FPQQSJJWHUJYPU-UHFFFAOYSA-N 3-(dimethylamino)propyliminomethylidene-ethylazanium;chloride Chemical compound Cl.CCN=C=NCCCN(C)C FPQQSJJWHUJYPU-UHFFFAOYSA-N 0.000 claims description 3
- QOSSAOTZNIDXMA-UHFFFAOYSA-N Dicylcohexylcarbodiimide Chemical compound C1CCCCC1N=C=NC1CCCCC1 QOSSAOTZNIDXMA-UHFFFAOYSA-N 0.000 claims description 3
- RDWDVLFMPFUBDV-PXMDEAMVSA-N [(e)-(1-cyano-2-ethoxy-2-oxoethylidene)amino]oxy-tripyrrolidin-1-ylphosphanium;hexafluorophosphate Chemical compound F[P-](F)(F)(F)(F)F.C1CCCN1[P+](N1CCCC1)(O/N=C(C(=O)OCC)\C#N)N1CCCC1 RDWDVLFMPFUBDV-PXMDEAMVSA-N 0.000 claims description 3
- GPDHNZNLPKYHCN-DZOOLQPHSA-N [[(z)-(1-cyano-2-ethoxy-2-oxoethylidene)amino]oxy-morpholin-4-ylmethylidene]-dimethylazanium;hexafluorophosphate Chemical compound F[P-](F)(F)(F)(F)F.CCOC(=O)C(\C#N)=N/OC(=[N+](C)C)N1CCOCC1 GPDHNZNLPKYHCN-DZOOLQPHSA-N 0.000 claims description 3
- 125000004185 ester group Chemical group 0.000 claims description 3
- 238000002156 mixing Methods 0.000 claims description 3
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 claims description 2
- 239000007825 activation reagent Substances 0.000 claims description 2
- 229940037201 oris Drugs 0.000 claims description 2
- 238000003786 synthesis reaction Methods 0.000 abstract description 30
- 230000015572 biosynthetic process Effects 0.000 abstract description 23
- 102000004196 processed proteins & peptides Human genes 0.000 abstract description 23
- 102000004169 proteins and genes Human genes 0.000 abstract description 20
- 108090000623 proteins and genes Proteins 0.000 abstract description 20
- 108010016626 Dipeptides Proteins 0.000 abstract description 17
- 238000013341 scale-up Methods 0.000 abstract description 5
- 238000012827 research and development Methods 0.000 abstract description 2
- 238000006243 chemical reaction Methods 0.000 description 26
- 239000000047 product Substances 0.000 description 26
- 125000006239 protecting group Chemical group 0.000 description 20
- 125000001424 substituent group Chemical group 0.000 description 20
- 230000008878 coupling Effects 0.000 description 19
- 238000010168 coupling process Methods 0.000 description 19
- 238000005859 coupling reaction Methods 0.000 description 19
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 18
- 235000018102 proteins Nutrition 0.000 description 18
- 238000000746 purification Methods 0.000 description 17
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 15
- 238000004440 column chromatography Methods 0.000 description 15
- 239000011541 reaction mixture Substances 0.000 description 15
- 239000011347 resin Substances 0.000 description 12
- 229920005989 resin Polymers 0.000 description 12
- FPIRBHDGWMWJEP-UHFFFAOYSA-N 1-hydroxy-7-azabenzotriazole Chemical compound C1=CN=C2N(O)N=NC2=C1 FPIRBHDGWMWJEP-UHFFFAOYSA-N 0.000 description 11
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 11
- 239000012043 crude product Substances 0.000 description 11
- 239000000203 mixture Substances 0.000 description 11
- 125000000061 phosphanyl group Chemical group [H]P([H])* 0.000 description 11
- XYFCBTPGUUZFHI-UHFFFAOYSA-O phosphonium Chemical compound [PH4+] XYFCBTPGUUZFHI-UHFFFAOYSA-O 0.000 description 11
- 239000007787 solid Substances 0.000 description 11
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 10
- 239000001301 oxygen Substances 0.000 description 10
- 125000004432 carbon atom Chemical group C* 0.000 description 9
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 9
- 238000005406 washing Methods 0.000 description 9
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical compound ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 8
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 8
- 230000004913 activation Effects 0.000 description 8
- 125000005842 heteroatom Chemical group 0.000 description 8
- 239000000741 silica gel Substances 0.000 description 8
- 229910002027 silica gel Inorganic materials 0.000 description 8
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 7
- 125000002877 alkyl aryl group Chemical group 0.000 description 7
- 125000000753 cycloalkyl group Chemical group 0.000 description 7
- 229910052757 nitrogen Inorganic materials 0.000 description 7
- 125000001181 organosilyl group Chemical group [SiH3]* 0.000 description 7
- 238000001243 protein synthesis Methods 0.000 description 7
- 238000000926 separation method Methods 0.000 description 7
- 239000011593 sulfur Substances 0.000 description 7
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 6
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 6
- 125000004414 alkyl thio group Chemical group 0.000 description 6
- 229910052799 carbon Inorganic materials 0.000 description 6
- 238000000605 extraction Methods 0.000 description 6
- 108010074708 B7-H1 Antigen Proteins 0.000 description 5
- 102000008096 B7-H1 Antigen Human genes 0.000 description 5
- 229910004373 HOAc Inorganic materials 0.000 description 5
- 125000003277 amino group Chemical group 0.000 description 5
- 125000000392 cycloalkenyl group Chemical group 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 125000005499 phosphonyl group Chemical group 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 125000000475 sulfinyl group Chemical group [*:2]S([*:1])=O 0.000 description 5
- 230000014616 translation Effects 0.000 description 5
- WZUODJNEIXSNEU-UHFFFAOYSA-N 2-Hydroxy-4-methoxybenzaldehyde Chemical compound COC1=CC=C(C=O)C(O)=C1 WZUODJNEIXSNEU-UHFFFAOYSA-N 0.000 description 4
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 4
- 102000001327 Chemokine CCL5 Human genes 0.000 description 4
- 108010055166 Chemokine CCL5 Proteins 0.000 description 4
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 4
- 230000002776 aggregation Effects 0.000 description 4
- 238000004220 aggregation Methods 0.000 description 4
- 238000001914 filtration Methods 0.000 description 4
- CMWYAOXYQATXSI-UHFFFAOYSA-N n,n-dimethylformamide;piperidine Chemical compound CN(C)C=O.C1CCNCC1 CMWYAOXYQATXSI-UHFFFAOYSA-N 0.000 description 4
- 238000005580 one pot reaction Methods 0.000 description 4
- LFGREXWGYUGZLY-UHFFFAOYSA-N phosphoryl Chemical group [P]=O LFGREXWGYUGZLY-UHFFFAOYSA-N 0.000 description 4
- 229910052938 sodium sulfate Inorganic materials 0.000 description 4
- 235000011152 sodium sulphate Nutrition 0.000 description 4
- 238000010532 solid phase synthesis reaction Methods 0.000 description 4
- 230000002194 synthesizing effect Effects 0.000 description 4
- RIOQSEWOXXDEQQ-UHFFFAOYSA-N triphenylphosphine Chemical compound C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 RIOQSEWOXXDEQQ-UHFFFAOYSA-N 0.000 description 4
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 3
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical compound C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 3
- 125000004399 C1-C4 alkenyl group Chemical group 0.000 description 3
- 102000036770 Islet Amyloid Polypeptide Human genes 0.000 description 3
- 108010041872 Islet Amyloid Polypeptide Proteins 0.000 description 3
- 125000000415 L-cysteinyl group Chemical group O=C([*])[C@@](N([H])[H])([H])C([H])([H])S[H] 0.000 description 3
- 150000001412 amines Chemical class 0.000 description 3
- 125000004429 atom Chemical group 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 125000005678 ethenylene group Chemical group [H]C([*:1])=C([H])[*:2] 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 238000009434 installation Methods 0.000 description 3
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 3
- 239000012074 organic phase Substances 0.000 description 3
- PARWUHTVGZSQPD-UHFFFAOYSA-N phenylsilane Chemical compound [SiH3]C1=CC=CC=C1 PARWUHTVGZSQPD-UHFFFAOYSA-N 0.000 description 3
- 238000010992 reflux Methods 0.000 description 3
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 150000007970 thio esters Chemical class 0.000 description 3
- 125000002813 thiocarbonyl group Chemical group *C(*)=S 0.000 description 3
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 2
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 2
- DRCKHKZYDLJYFQ-YWIQKCBGSA-N Ile-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRCKHKZYDLJYFQ-YWIQKCBGSA-N 0.000 description 2
- 102100040035 Interferon-induced transmembrane protein 3 Human genes 0.000 description 2
- 101710087316 Interferon-induced transmembrane protein 3 Proteins 0.000 description 2
- 125000002842 L-seryl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])O[H] 0.000 description 2
- 125000000769 L-threonyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])[C@](O[H])(C([H])([H])[H])[H] 0.000 description 2
- WMFOQBRAJBCJND-UHFFFAOYSA-M Lithium hydroxide Chemical compound [Li+].[OH-] WMFOQBRAJBCJND-UHFFFAOYSA-M 0.000 description 2
- UFWIBTONFRDIAS-UHFFFAOYSA-N Naphthalene Chemical compound C1=CC=CC2=CC=CC=C21 UFWIBTONFRDIAS-UHFFFAOYSA-N 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- KDLHZDBZIXYQEI-UHFFFAOYSA-N Palladium Chemical compound [Pd] KDLHZDBZIXYQEI-UHFFFAOYSA-N 0.000 description 2
- KYQCOXFCLRTKLS-UHFFFAOYSA-N Pyrazine Chemical compound C1=CN=CC=N1 KYQCOXFCLRTKLS-UHFFFAOYSA-N 0.000 description 2
- KAESVJOAVNADME-UHFFFAOYSA-N Pyrrole Chemical compound C=1C=CNC=1 KAESVJOAVNADME-UHFFFAOYSA-N 0.000 description 2
- FFOKMZOAVHEWET-IMJSIDKUSA-N Ser-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(O)=O FFOKMZOAVHEWET-IMJSIDKUSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 2
- YTPLMLYBLZKORZ-UHFFFAOYSA-N Thiophene Chemical compound C=1C=CSC=1 YTPLMLYBLZKORZ-UHFFFAOYSA-N 0.000 description 2
- 241000767684 Thoe Species 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 125000004453 alkoxycarbonyl group Chemical group 0.000 description 2
- 150000001409 amidines Chemical class 0.000 description 2
- 125000004397 aminosulfonyl group Chemical group NS(=O)(=O)* 0.000 description 2
- MWPLVEDNUUSJAV-UHFFFAOYSA-N anthracene Chemical compound C1=CC=CC2=CC3=CC=CC=C3C=C21 MWPLVEDNUUSJAV-UHFFFAOYSA-N 0.000 description 2
- 239000012062 aqueous buffer Substances 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 125000005110 aryl thio group Chemical group 0.000 description 2
- 150000001540 azides Chemical class 0.000 description 2
- 229960001948 caffeine Drugs 0.000 description 2
- 239000011203 carbon fibre reinforced carbon Substances 0.000 description 2
- PFKFTWBEEFSNDU-UHFFFAOYSA-N carbonyldiimidazole Chemical compound C1=CN=CN1C(=O)N1C=CN=C1 PFKFTWBEEFSNDU-UHFFFAOYSA-N 0.000 description 2
- WDECIBYCCFPHNR-UHFFFAOYSA-N chrysene Chemical compound C1=CC=CC2=CC=C3C4=CC=CC=C4C=CC3=C21 WDECIBYCCFPHNR-UHFFFAOYSA-N 0.000 description 2
- VPUGDVKSAQVFFS-UHFFFAOYSA-N coronene Chemical compound C1=C(C2=C34)C=CC3=CC=C(C=C3)C4=C4C3=CC=C(C=C3)C4=C2C3=C1 VPUGDVKSAQVFFS-UHFFFAOYSA-N 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 238000006345 epimerization reaction Methods 0.000 description 2
- 238000001704 evaporation Methods 0.000 description 2
- 230000008020 evaporation Effects 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 150000002430 hydrocarbons Chemical group 0.000 description 2
- 150000002466 imines Chemical class 0.000 description 2
- 125000002462 isocyano group Chemical group *[N+]#[C-] 0.000 description 2
- AWIJRPNMLHPLNC-UHFFFAOYSA-N methanethioic s-acid Chemical compound SC=O AWIJRPNMLHPLNC-UHFFFAOYSA-N 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 2
- 150000002894 organic compounds Chemical class 0.000 description 2
- 230000006919 peptide aggregation Effects 0.000 description 2
- YNPNZTXNASCQKK-UHFFFAOYSA-N phenanthrene Chemical compound C1=CC=C2C3=CC=CC=C3C=CC2=C1 YNPNZTXNASCQKK-UHFFFAOYSA-N 0.000 description 2
- 125000000951 phenoxy group Chemical group [H]C1=C([H])C([H])=C(O*)C([H])=C1[H] 0.000 description 2
- 125000003356 phenylsulfanyl group Chemical group [*]SC1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- ACVYVLVWPXVTIT-UHFFFAOYSA-M phosphinate Chemical compound [O-][PH2]=O ACVYVLVWPXVTIT-UHFFFAOYSA-M 0.000 description 2
- UEZVMMHDMIWARA-UHFFFAOYSA-M phosphonate Chemical compound [O-]P(=O)=O UEZVMMHDMIWARA-UHFFFAOYSA-M 0.000 description 2
- 229910052698 phosphorus Inorganic materials 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- BBEAQIROQSPTKN-UHFFFAOYSA-N pyrene Chemical compound C1=CC=C2C=CC3=CC=CC4=CC=C1C2=C43 BBEAQIROQSPTKN-UHFFFAOYSA-N 0.000 description 2
- 125000004076 pyridyl group Chemical group 0.000 description 2
- 125000000714 pyrimidinyl group Chemical group 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 150000003384 small molecules Chemical class 0.000 description 2
- 230000003381 solubilizing effect Effects 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- BDHFUVZGWQCTTF-UHFFFAOYSA-M sulfonate Chemical compound [O-]S(=O)=O BDHFUVZGWQCTTF-UHFFFAOYSA-M 0.000 description 2
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 description 2
- 125000004434 sulfur atom Chemical group 0.000 description 2
- 125000001544 thienyl group Chemical group 0.000 description 2
- DUYAAUVXQSMXQP-UHFFFAOYSA-M thioacetate Chemical compound CC([S-])=O DUYAAUVXQSMXQP-UHFFFAOYSA-M 0.000 description 2
- RYYVLZVUVIJVGH-UHFFFAOYSA-N trimethylxanthine Natural products CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 2
- 125000004502 1,2,3-oxadiazolyl group Chemical group 0.000 description 1
- 125000004511 1,2,3-thiadiazolyl group Chemical group 0.000 description 1
- 125000004504 1,2,4-oxadiazolyl group Chemical group 0.000 description 1
- 125000004514 1,2,4-thiadiazolyl group Chemical group 0.000 description 1
- 125000004506 1,2,5-oxadiazolyl group Chemical group 0.000 description 1
- 125000004517 1,2,5-thiadiazolyl group Chemical group 0.000 description 1
- 125000001781 1,3,4-oxadiazolyl group Chemical group 0.000 description 1
- 125000004520 1,3,4-thiadiazolyl group Chemical group 0.000 description 1
- 125000005955 1H-indazolyl group Chemical group 0.000 description 1
- HJWBNUQVZJTJIV-UHFFFAOYSA-N 2-(2-methylpyridin-3-yl)acetic acid Chemical compound CC1=NC=CC=C1CC(O)=O HJWBNUQVZJTJIV-UHFFFAOYSA-N 0.000 description 1
- GAMYYCRTACQSBR-UHFFFAOYSA-N 4-azabenzimidazole Chemical compound C1=CC=C2NC=NC2=N1 GAMYYCRTACQSBR-UHFFFAOYSA-N 0.000 description 1
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- QWOJMRHUQHTCJG-UHFFFAOYSA-N CC([CH2-])=O Chemical group CC([CH2-])=O QWOJMRHUQHTCJG-UHFFFAOYSA-N 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 102000019034 Chemokines Human genes 0.000 description 1
- 108010012236 Chemokines Proteins 0.000 description 1
- 101710095920 Competence-stimulating peptide type 1 Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 102000003951 Erythropoietin Human genes 0.000 description 1
- 108090000394 Erythropoietin Proteins 0.000 description 1
- BDAGIHXWWSANSR-UHFFFAOYSA-M Formate Chemical compound [O-]C=O BDAGIHXWWSANSR-UHFFFAOYSA-M 0.000 description 1
- PABVKUJVLNMOJP-WHFBIAKZSA-N Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(O)=O PABVKUJVLNMOJP-WHFBIAKZSA-N 0.000 description 1
- 241000711549 Hepacivirus C Species 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- YSDQQAXHVYUZIW-QCIJIYAXSA-N Liraglutide Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCNC(=O)CC[C@H](NC(=O)CCCCCCCCCCCCCCC)C(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 YSDQQAXHVYUZIW-QCIJIYAXSA-N 0.000 description 1
- 108010019598 Liraglutide Proteins 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 101800001020 Non-structural protein 4A Proteins 0.000 description 1
- ZCQWOFVYLHDMMC-UHFFFAOYSA-N Oxazole Chemical compound C1=COC=N1 ZCQWOFVYLHDMMC-UHFFFAOYSA-N 0.000 description 1
- WYNCHZVNFNFDNH-UHFFFAOYSA-N Oxazolidine Chemical compound C1COCN1 WYNCHZVNFNFDNH-UHFFFAOYSA-N 0.000 description 1
- PCNDJXKNXGMECE-UHFFFAOYSA-N Phenazine Natural products C1=CC=CC2=NC3=CC=CC=C3N=C21 PCNDJXKNXGMECE-UHFFFAOYSA-N 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- WTKZEGDFNFYCGP-UHFFFAOYSA-N Pyrazole Chemical compound C=1C=NNC=1 WTKZEGDFNFYCGP-UHFFFAOYSA-N 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical compound C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 description 1
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 1
- 108010003205 Vasoactive Intestinal Peptide Proteins 0.000 description 1
- 102400000015 Vasoactive intestinal peptide Human genes 0.000 description 1
- BBAWTPDTGRXPDG-UHFFFAOYSA-N [1,3]thiazolo[4,5-b]pyridine Chemical compound C1=CC=C2SC=NC2=N1 BBAWTPDTGRXPDG-UHFFFAOYSA-N 0.000 description 1
- 125000002015 acyclic group Chemical group 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 150000001370 alpha-amino acid derivatives Chemical class 0.000 description 1
- 235000008206 alpha-amino acids Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003125 aqueous solvent Substances 0.000 description 1
- 238000011021 bench scale process Methods 0.000 description 1
- 125000003785 benzimidazolyl group Chemical group N1=C(NC2=C1C=CC=C2)* 0.000 description 1
- 125000004604 benzisothiazolyl group Chemical group S1N=C(C2=C1C=CC=C2)* 0.000 description 1
- 125000004603 benzisoxazolyl group Chemical group O1N=C(C2=C1C=CC=C2)* 0.000 description 1
- 125000000499 benzofuranyl group Chemical group O1C(=CC2=C1C=CC=C2)* 0.000 description 1
- 125000001164 benzothiazolyl group Chemical group S1C(=NC2=C1C=CC=C2)* 0.000 description 1
- 125000004196 benzothienyl group Chemical group S1C(=CC2=C1C=CC=C2)* 0.000 description 1
- 125000004935 benzoxazolinyl group Chemical group O1C(=NC2=C1C=CC=C2)* 0.000 description 1
- 125000004541 benzoxazolyl group Chemical group O1C(=NC2=C1C=CC=C2)* 0.000 description 1
- 125000005512 benztetrazolyl group Chemical group 0.000 description 1
- 125000000609 carbazolyl group Chemical group C1(=CC=CC=2C3=CC=CC=C3NC12)* 0.000 description 1
- 125000002837 carbocyclic group Chemical group 0.000 description 1
- 125000004623 carbolinyl group Chemical group 0.000 description 1
- 125000003016 chromanyl group Chemical group O1C(CCC2=CC=CC=C12)* 0.000 description 1
- 125000004230 chromenyl group Chemical group O1C(C=CC2=CC=CC=C12)* 0.000 description 1
- 125000000259 cinnolinyl group Chemical group N1=NC(=CC2=CC=CC=C12)* 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- VXRUJZQPKRBJKH-UHFFFAOYSA-N corannulene Chemical compound C1=CC(C2=C34)=CC=C3C=CC3=C4C4=C2C1=CC=C4C=C3 VXRUJZQPKRBJKH-UHFFFAOYSA-N 0.000 description 1
- 238000007275 deallylation reaction Methods 0.000 description 1
- 125000004856 decahydroquinolinyl group Chemical group N1(CCCC2CCCCC12)* 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000002274 desiccant Substances 0.000 description 1
- 125000000118 dimethyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 229940105423 erythropoietin Drugs 0.000 description 1
- 230000032050 esterification Effects 0.000 description 1
- 238000005886 esterification reaction Methods 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000004880 explosion Methods 0.000 description 1
- GVEPBJHOBDJJJI-UHFFFAOYSA-N fluoranthrene Natural products C1=CC(C2=CC=CC=C22)=C3C2=CC=CC3=C1 GVEPBJHOBDJJJI-UHFFFAOYSA-N 0.000 description 1
- 238000007306 functionalization reaction Methods 0.000 description 1
- 125000003838 furazanyl group Chemical group 0.000 description 1
- 125000002541 furyl group Chemical group 0.000 description 1
- BHEPBYXIRTUNPN-UHFFFAOYSA-N hydridophosphorus(.) (triplet) Chemical compound [PH] BHEPBYXIRTUNPN-UHFFFAOYSA-N 0.000 description 1
- 238000005984 hydrogenation reaction Methods 0.000 description 1
- 125000002632 imidazolidinyl group Chemical group 0.000 description 1
- 125000002636 imidazolinyl group Chemical group 0.000 description 1
- 125000002883 imidazolyl group Chemical group 0.000 description 1
- 125000004926 indolenyl group Chemical group 0.000 description 1
- 125000003387 indolinyl group Chemical group N1(CCC2=CC=CC=C12)* 0.000 description 1
- 125000003406 indolizinyl group Chemical group C=1(C=CN2C=CC=CC12)* 0.000 description 1
- 125000001041 indolyl group Chemical group 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 208000037797 influenza A Diseases 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- VBUWHHLIZKOSMS-RIWXPGAOSA-N invicorp Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=C(O)C=C1 VBUWHHLIZKOSMS-RIWXPGAOSA-N 0.000 description 1
- 125000004936 isatinoyl group Chemical group N1(C(=O)C(=O)C2=CC=CC=C12)C(=O)* 0.000 description 1
- 125000001977 isobenzofuranyl group Chemical group C=1(OC=C2C=CC=CC12)* 0.000 description 1
- 125000003384 isochromanyl group Chemical group C1(OCCC2=CC=CC=C12)* 0.000 description 1
- 125000005438 isoindazolyl group Chemical group 0.000 description 1
- 125000004594 isoindolinyl group Chemical group C1(NCC2=CC=CC=C12)* 0.000 description 1
- 125000000904 isoindolyl group Chemical group C=1(NC=C2C=CC=CC12)* 0.000 description 1
- 125000001972 isopentyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 125000002183 isoquinolinyl group Chemical group C1(=NC=CC2=CC=CC=C12)* 0.000 description 1
- 125000001786 isothiazolyl group Chemical group 0.000 description 1
- 125000000842 isoxazolyl group Chemical group 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 229960002701 liraglutide Drugs 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 125000002950 monocyclic group Chemical group 0.000 description 1
- 125000004108 n-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000001280 n-hexyl group Chemical group C(CCCCC)* 0.000 description 1
- 125000000740 n-pentyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000004123 n-propyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000004593 naphthyridinyl group Chemical group N1=C(C=CC2=CC=CN=C12)* 0.000 description 1
- 125000004433 nitrogen atom Chemical group N* 0.000 description 1
- 230000031787 nutrient reservoir activity Effects 0.000 description 1
- 125000004930 octahydroisoquinolinyl group Chemical group C1(NCCC2CCCC=C12)* 0.000 description 1
- 125000000160 oxazolidinyl group Chemical group 0.000 description 1
- QNNHQVPFZIFNFK-UHFFFAOYSA-N oxazolo[4,5-b]pyridine Chemical compound C1=CC=C2OC=NC2=N1 QNNHQVPFZIFNFK-UHFFFAOYSA-N 0.000 description 1
- 125000002971 oxazolyl group Chemical group 0.000 description 1
- 125000004095 oxindolyl group Chemical group N1(C(CC2=CC=CC=C12)=O)* 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- UQPUONNXJVWHRM-UHFFFAOYSA-N palladium;triphenylphosphane Chemical compound [Pd].C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 UQPUONNXJVWHRM-UHFFFAOYSA-N 0.000 description 1
- 238000005897 peptide coupling reaction Methods 0.000 description 1
- 125000004934 phenanthridinyl group Chemical group C1(=CC=CC2=NC=C3C=CC=CC3=C12)* 0.000 description 1
- 125000004625 phenanthrolinyl group Chemical group N1=C(C=CC2=CC=C3C=CC=NC3=C12)* 0.000 description 1
- 125000001791 phenazinyl group Chemical group C1(=CC=CC2=NC3=CC=CC=C3N=C12)* 0.000 description 1
- 125000001484 phenothiazinyl group Chemical group C1(=CC=CC=2SC3=CC=CC=C3NC12)* 0.000 description 1
- 125000004932 phenoxathinyl group Chemical group 0.000 description 1
- 125000001644 phenoxazinyl group Chemical group C1(=CC=CC=2OC3=CC=CC=C3NC12)* 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 125000004592 phthalazinyl group Chemical group C1(=NN=CC2=CC=CC=C12)* 0.000 description 1
- 125000003367 polycyclic group Chemical group 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 1
- 125000001042 pteridinyl group Chemical group N1=C(N=CC2=NC=CN=C12)* 0.000 description 1
- 125000000561 purinyl group Chemical group N1=C(N=C2N=CNC2=C1)* 0.000 description 1
- 125000003373 pyrazinyl group Chemical group 0.000 description 1
- 125000003072 pyrazolidinyl group Chemical group 0.000 description 1
- 125000002755 pyrazolinyl group Chemical group 0.000 description 1
- 125000003226 pyrazolyl group Chemical group 0.000 description 1
- PBMFSQRYOILNGV-UHFFFAOYSA-N pyridazine Chemical compound C1=CC=NN=C1 PBMFSQRYOILNGV-UHFFFAOYSA-N 0.000 description 1
- 125000002098 pyridazinyl group Chemical group 0.000 description 1
- 125000000719 pyrrolidinyl group Chemical group 0.000 description 1
- 125000001422 pyrrolinyl group Chemical group 0.000 description 1
- 125000000168 pyrrolyl group Chemical group 0.000 description 1
- 125000002294 quinazolinyl group Chemical group N1=C(N=CC2=CC=CC=C12)* 0.000 description 1
- 125000002943 quinolinyl group Chemical group N1=C(C=CC2=CC=CC=C12)* 0.000 description 1
- 125000001567 quinoxalinyl group Chemical group N1=C(C=NC2=CC=CC=C12)* 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 238000007086 side reaction Methods 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 235000014347 soups Nutrition 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000000707 stereoselective effect Effects 0.000 description 1
- 125000001973 tert-pentyl group Chemical group [H]C([H])([H])C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 125000003039 tetrahydroisoquinolinyl group Chemical group C1(NCCC2=CC=CC=C12)* 0.000 description 1
- 125000000147 tetrahydroquinolinyl group Chemical group N1(CCCC2=CC=CC=C12)* 0.000 description 1
- 150000003536 tetrazoles Chemical class 0.000 description 1
- 125000003831 tetrazolyl group Chemical group 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 125000004627 thianthrenyl group Chemical group C1(=CC=CC=2SC3=CC=CC=C3SC12)* 0.000 description 1
- 125000000335 thiazolyl group Chemical group 0.000 description 1
- 150000003566 thiocarboxylic acids Chemical class 0.000 description 1
- 229930192474 thiophene Natural products 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 150000003852 triazoles Chemical class 0.000 description 1
- 125000001834 xanthenyl group Chemical group C1=CC=CC=2OC3=CC=CC=C3C(C12)* 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/04—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing only normal peptide links
- C07K5/06—Dipeptides
- C07K5/06008—Dipeptides with the first amino acid being neutral
- C07K5/06017—Dipeptides with the first amino acid being neutral and aliphatic
- C07K5/0606—Dipeptides with the first amino acid being neutral and aliphatic the side chain containing heteroatoms not provided for by C07K5/06086 - C07K5/06139, e.g. Ser, Met, Cys, Thr
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K1/00—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
- C07K1/02—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length in solution
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K1/00—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
- C07K1/06—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length using protecting groups or activating agents
- C07K1/061—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length using protecting groups or activating agents using protecting groups
- C07K1/065—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length using protecting groups or activating agents using protecting groups for hydroxy functions, not being part of carboxy functions
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K1/00—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length
- C07K1/06—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length using protecting groups or activating agents
- C07K1/061—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length using protecting groups or activating agents using protecting groups
- C07K1/067—General methods for the preparation of peptides, i.e. processes for the organic chemical preparation of peptides or proteins of any length using protecting groups or activating agents using protecting groups for sulfur-containing functions
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/04—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing only normal peptide links
- C07K5/06—Dipeptides
- C07K5/06008—Dipeptides with the first amino acid being neutral
- C07K5/06017—Dipeptides with the first amino acid being neutral and aliphatic
- C07K5/06034—Dipeptides with the first amino acid being neutral and aliphatic the side chain containing 2 to 4 carbon atoms
- C07K5/06052—Val-amino acid
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/04—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing only normal peptide links
- C07K5/06—Dipeptides
- C07K5/06008—Dipeptides with the first amino acid being neutral
- C07K5/06017—Dipeptides with the first amino acid being neutral and aliphatic
- C07K5/0606—Dipeptides with the first amino acid being neutral and aliphatic the side chain containing heteroatoms not provided for by C07K5/06086 - C07K5/06139, e.g. Ser, Met, Cys, Thr
- C07K5/06069—Ser-amino acid
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K5/00—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof
- C07K5/04—Peptides containing up to four amino acids in a fully defined sequence; Derivatives thereof containing only normal peptide links
- C07K5/06—Dipeptides
- C07K5/06104—Dipeptides with the first amino acid being acidic
Definitions
- This invention is generally in the field of protein/peptide synthesis.
- SPPS solid phase peptide synthesis
- NBDs N, O/S-benzylidene acetal dipeptides
- the compounds can have the structure of Formula I:
- R 1 can be a protected or unprotected side chain of an amino acid
- R 2 can be an amine protecting group
- R 3 -R 6 can be independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted C 1 -C 6 alkyl, unsubsituted C 1 -C 6 alkenyl, unsubstituted aryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
- R 7 and R 8 can be indepedently hydrogen, methyl, or other protected/unprotected side chains of amino acids
- R 9 can be hydrogen or a functional group suitable for protecting and/or
- the protection group can be a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, or a substituted or unsubstituted heterocyclic ring.
- the compounds can have the structure of Formula I’:
- R 1 -R 9 and X can be as defined above, and R 10 can be hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
- a C 1 -C 4 acyl group e.g., formyl, acetyl, propionyl
- a carbonyl group e.g., alloc
- carbamate group e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate
- a L-or D-Amino acid with suitable protection on amine and optionally its sidechain functional groups can be firstly coupled with salicylaldehyde or a mono-/multiply- substituted form thereof. Then, the obtained salicylaldehyde ester reacts with serine/threonine/cysteine/penicillamine (Ser/Thr/Cys/Pen) or its ester in a suitable solvent. The obtained ester may be further derivatized to relevant acids with or without phenol protections.
- the NBDs can be obtained in high yield (at least 40%) .
- FIG. 1 shows a comparison of the synthesis of PD-L1 (121-123) using the disclosed method and conventional SPPS.
- FIG. 2 shows a comparison of the synthesis of IL-2 (125-133) using the disclosed method and conventional SPPS.
- FIG. 3 shows a comparison of the synthesis of Amylin using the disclosed method and conventional SPPS.
- FIG. 4 shows a comparison of the synthesis of Rantes using the disclosed method and conventional SPPS.
- NBD N, O/S-benzylidene acetal dipeptide
- alkyl refers to a branched or straight-chain monovalent saturated aliphatic hydrocarbon radical of one to eight carbon atoms. This term is further exemplified by radicals such as methyl, ethyl, n-propyl, isopropyl, n-butyl, s-butyl, t-butyl, n-pentyl, isopentyl, t-pentyl, n-hexyl, etc.
- alkenyl refers to a branched or straight-chain hydrocarbon group of from two to eight carbon atoms and structural formula containing at least one carbon-carbon double bond.
- alkynyl refers to a branched or straight-chain hydrocarbon group of from two to eight carbon atoms and structural formula containing at least one carbon-carbon triple bond.
- aryl refers to C 5 -C 20 -membered aromatic or fused aromatic ring systems. Examples of aromatic groups are benzene, naphthalene, anthracene, phenanthrene, chrysene, pyrene, corannulene, coronene, etc.
- cyclic group refers to a saturated or unsaturated monocyclic ring or a polycyclic ring (such as those formed from single or fused ring systems) , such as a cycloalkyl, a cycloalkenyl, or a cycloalkynyl, which have from three to ten carbon atoms, as geometric constraints permit.
- heteroalkyl refers to straight or branched chain carbon-containing alkyl radicals containing at least one heteroatom on the carbon backbone. Suitable heteroatoms include, but are not limited to, O, N, Si, P and S, wherein the nitrogen, phosphorous and sulfur atoms are optionally oxidized, and the nitrogen heteroatom is optionally quaternized.
- heteroaryl refers to C 5 -C 20 -membered aromatic or fused aromatic ring systems, in which one or more carbon atoms on one or more aromatic ring structures have been substituted with a heteroatom. Suitable heteroatoms include, but are not limited to, oxygen, sulfur, and nitrogen. Examples of heteroaryl groups pyrrole, furan, thiophene, imidazole, oxazole, thiazole, triazole, tetrazole, pyrazole, pyridine, pyrazine, pyridazine and pyrimidine, and the like.
- heteroaryl rings include, but are not limited to, benzimidazolyl, benzofuranyl, benzothiofuranyl, benzothiophenyl, benzoxazolyl, benzoxazolinyl, benzthiazolyl, benztriazolyl, benztetrazolyl, benzisoxazolyl, benzisothiazolyl, benzimidazolinyl, carbazolyl, 4aH-carbazolyl, carbolinyl, chromanyl, chromenyl, cinnolinyl, decahydroquinolinyl, 2H, 6H-1, 5, 2-dithiazinyl, furanyl, furazanyl, imidazolidinyl, imidazolinyl, imidazolyl, 1H-indazolyl, indolenyl, indolinyl, indolizinyl, indolyl, 3H-indolyl, is
- heterocyclic group refers to a cyclic group as defined above where at least one of the carbon atoms of the ring is substituted with a heteroatom, such as, nitrogen, oxygen, sulfur, or phosphorus.
- aralkyl refers to an aryl group or a heteroaryl group having an alkyl, alkynyl, or alkenyl group as defined above attached to the aromatic group, such as an aryl, a heteroaryl, a polyaryl, or a polyheteroaryl.
- An example of an aralkyl group is a benzyl group.
- alkoxyl refers to compounds represented by the formula -OR v , wherein R v includes, but is not limited to, an alkyl, an alkenyl, an alkynyl, a cycloalkyl, a heterocyclic, a cycloalkenyl, an aryl, a heteroaryl, an aralkyl, a heteroalkyl, etc.
- exemplary alkoxyl groups include methoxy, ethoxy, propyloxy, tert-butoxy and the like.
- a “lower alkoxyl” group is an alkoxyl group containing from one to six carbon atoms.
- An “ether” is two functional groups covalently linked by an oxygen as defined below.
- the substituent of an alkyl that renders that alkyl an ether is or resembles an alkoxyl, such as can be represented by one of -O-alkyl, -O-alkenyl, -O-alkynyl, -O-aralkyl, -O-aryl, -O-heteroaryl, -O-cyclic, -O-heterocyclic, etc.
- amino as used herein includes the group
- E is absent, or E is substituted or unsubstituted alkyl, substituted or unsubstituted alkenyl, substituted or unsubstituted alkynyl, substituted or unsubstituted aralkyl, substituted or unsubstituted aryl, substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, substituted or unsubstituted heterocyclyl, wherein independently of E, R x , R xi , and R xii each independently represent a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted carbonyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted al
- R represents a hydroxyl group, a substituted or unsubstituted carbonyl group, a substituted or unsubstituted aryl, a substituted or unsubstituted cycloalkyl
- quaternary amino also includes the groups where the nitrogen, R x , R xi , and R xii with the N + to which they are attached complete a heterocyclyl or heteroaryl having from 3 to 14 atoms in the ring structure. It is understood by those of ordinary skill in the art, that the E groups listed above are divalent (e.g., methylene, ethane-1, 2-diyl, ethene-1, 2-diyl, 1, 4-phenylene, cyclohexane-1, 2-diyl) .
- amide or “amido” are used interchangeably, refer to both “unsubstituted amido” and “substituted amido” and are represented by the general formula:
- E is absent, or E is a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted aralkyl, a substituted or unsubstituted aryl, substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, or a substituted or unsubstituted heterocyclyl, wherein independently of E, R and R’ each independently represent a hydrogen, a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted carbonyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubsti
- R” represents a hydroxyl group, a substituted or unsubstituted carbon
- E when E is oxygen, a carbamate is formed. It is understood by those of ordinary skill in the art, that the E groups listed above are divalent (e.g., methylene, ethane-1, 2-diyl, ethene-1, 2-diyl, 1, 4-phenylene, cyclohexane-1, 2-diyl) .
- Carbonyl, ” as used herein, is art-recognized and includes such moieties as can be represented by the general formula:
- X is a bond, or represents an oxygen or a sulfur
- R represents a hydrogen, a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted carbonyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aralkyl (e.g.
- E is absent, or E” is a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted aralkyl, a substituted or unsubsti
- R represents a hydroxyl group, a substituted or unsubstituted aryl, a substituted or unsubstituted cycloalkyl, a substituted or unsubstituted cycloalkenyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted arylalkyl) , a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted heterocyclyl, a hydroxyl, an alkoxy, a phosphonium, a phosphanyl, an amido, an amino, or - (CH 2 ) m -R”; R” represents a hydroxyl group, a substituted or unsubstituted aryl, a substituted
- substituents can be any substituents described above, e.g., halogen, azide, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, carbonyl (such as a carboxyl, alkoxycarbonyl, formyl, or an acyl) , silyl, ether, ester, thiocarbonyl (such as a thioester, a thioacetate, or a thioformate) , alkoxyl, phosphonium, phosphanyl, phosphoryl, phosphate, phosphonate, phosphinate, amino (e.g., halogen, azide, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, carbonyl (such as a carboxyl, alkoxycarbonyl, formyl, or an acyl) , silyl, ether,
- E groups listed above are divalent (e.g., methylene, ethane-1, 2-diyl, ethene-1, 2-diyl, 1, 4-phenylene, cyclohexane-1, 2-diyl) .
- X oxygen and R is defined as above, the moiety is also referred to as a carboxyl group.
- R is hydrogen
- the formula represents a “carboxylic acid. ”
- X is oxygen and R’ is hydrogen
- the formula represents a “formate. ”
- X is oxygen and R or R’ is not hydrogen, the formula represents an "ester.
- thiol are used interchangeably and are represented by -SR, where R can be a hydrogen, a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted aralkyl (e.g. a substituted or unsubstituted alkylaryl, a substituted or unsubstituted arylalkyl, etc.
- R can be a hydrogen, a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted heterocyclyl, a substituted or un
- a substituted or unsubstituted polyaryl a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted carbonyl, a phosphonium, a phosphanyl, an amido, an amino, an alkoxy, an oxo, a phosphonyl, a sulfinyl, or a silyl, described above.
- substituents can be any substituents described above, e.g., halogen, azide, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, carbonyl (such as a carboxyl, alkoxycarbonyl, formyl, or an acyl) , silyl, ether, ester, thiocarbonyl (such as a thioester, a thioacetate, or a thioformate) , alkoxyl, phosphoryl, phosphate, phosphonate, phosphinate, amino (e.g., halogen, azide, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, carbonyl (such as a carboxyl, alkoxycarbonyl, formyl, or an acyl) , silyl, ether, ester, thiocarbonyl (such as
- the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, aromatic and nonaromatic substituents of organic compounds.
- Illustrative substituents include, but are not limited to, halogens, hydroxyl groups, or any other organic groupings containing any number of carbon atoms, preferably 1-14 carbon atoms, and optionally include one or more heteroatoms such as oxygen, sulfur, or nitrogen grouping in linear, branched, or cyclic structural formats.
- substituents include a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted heterocyclic, a substituted or unsubstituted phenyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted aralkyl, a halogen, a hydroxyl, an alkoxy, a phenoxy, an aroxy, a silyl, a thiol, an alkylthio, a substituted alkylthio, a phenylthio, an arylthio, a cyano, an isocyano, a nitro,
- Heteroatoms such as nitrogen may have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valences of the heteroatoms. It is understood that “substitution” or “substituted” includes the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, i.e., a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc.
- compositions include a building block structure containing N,O/S-benzylidene acetal dipeptide (NBD) .
- NBDs also referred to herein as “compounds”
- compounds are simple and effective building blocks for syntheses of difficult peptides/proteins. Without being bound to any theories, it is believed the NBDs disclosed herein can act as a conformation-twisted dipeptide building block, which forms a pseudoproline turn to break down the interaction between peptides, and thereby can disrupt aggregation of difficult peptides/proteins on the resin (typically due to hydrophobic interaction of amino-acid side chains and hydrogen boding of amide bond) during SPPS process.
- the disclosed compounds can have the structures of Formula I:
- R 1 can be a protected or unprotected side chain of an amino acid
- R 2 can be an amino protecting group
- R 3 -R 6 can be independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted C 1 -C 6 alkyl, unsubsituted C 1 -C 6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
- R 7 and R 8 can be indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids
- R 9 can be hydrogen or
- the protection group can be a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, or a heterocyclic ring.
- R 1 is a protected side chain of an amino acid and the protection group can be an unsubstitited alkyl (e.g., unsubstitued C 1 -C 6 or C 1 -C 4 alkyl) , an unsubstituted alkenyl (e.g., unsubstitued C 1 -C 6 or C 1 -C 4 alkenyl) , an unsubstituted alkynyl (e.g., unsubstitued C 1 -C 6 or C 1 -C 4 alkynyl) , a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, an unsubstituted aryl, an unsubstituted heteroaryl, or an unsubstituted heterocyclic ring.
- an unsubstitited alkyl e.g., unsubstitued C 1 -C 6 or C 1 -C 4 al
- the in the formulae described herein indicates a carbon stereocenter (to which the wave bond attaches) that can be either S-or R-configuration.
- the acetyl carbon attaching to -C (O) OR 9 can be either S-or R-configuration.
- R 1 can be a side chain of any suitable amino acid, natural or synthetic, and the amin acid can be either in its L-or D-configuration, or in various mixtures of their isomers.
- R 1 may be a protected or unprotected side chain of any suitable alpha-amino acid, natural or synthetic.
- R 1 is a side chain of an amino acid that carries a hydroxy group
- the hydroxy group is optionally protected by a suitable hydroxy protecting group known in the art.
- the amino group is optionally protected by a suitable amino protecting group known in the art.
- the protection group can be t Bu, Boc, Thz, Acm, Trt, Cbz, Alloc, azido, Bn, Bz, Ac, or Pbf, or a combination thereof.
- R 1 can be a protected or unprotected side chain of glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine. Any one of these amino acids can be either in its L-or D-configuration.
- the protection group when R 1 is a protected side chain, can be t Bu, Boc, Thz, Acm, Trt, Cbz, Alloc, azido, Bn, Bz, Ac, or Pbf, or a combination thereof.
- R 2 can be any substituents conventionally used to hinder the reactivity of the amino group.
- suitable amino protecting groups are described in Green T., “Protective Groups in Organic Synthesis” , Chapter 7, John Wiley and Sons, Inc., 1991, 309-385.
- R 2 can be Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
- R 3 -R 6 can be independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted alkyl (such as unsubstituted C 1 -C 8 alkyl) , unsubsituted alkenyl (unsubstituted C 1 -C 8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C 1 -C 8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol.
- unsubstituted alkyl such as unsubstituted C 1 -C 8 alky
- R 3 -R 6 can be independently hydrogen, halogen, -CN, -CF 3 , -NO 2 , unsubstituted C 1 -C 6 or C 1 -C 4 alkyl, unsubstituted C 1 -C 6 or C 1 -C 4 alkenyl, unsubstituted C 1 -C 6 or C 1 -C 4 alkynyl, alkoxyl, and unsubstituted aryl.
- at least one of R 3 -R 6 is/are not hydrogen.
- one of R 3 -R 6 , two of R 3 -R 6 , three of R 3 -R 6 , or all of R 3 -R 6 is/are not hydrogen.
- R 7 and R 8 can be indepedently hydrogen or methyl. In some forms, R 7 and R 8 can be indepdently protected/unprotected side chains of amino acids.
- R 9 can be hydrogen, benzyl, allyl, or unsubstituted C 1 -C 4 alkyl. In some forms, R 9 can be hydrogen. In some forms, R 9 can be any substituents conventionally used to protect the carboxylic acid group. For example, suitable amino protecting groups are described in Green T., “Protective Groups in Organic Synthesis” , Chapter 5, John Wiley and Sons, Inc., 2006, 533-646. In some forms, R 9 can also be any substituents conventionally used to activate the carboxylic acid group. For example, suitable amino protecting groups are described in Albericio F. “Peptide Coupling Reagents, More than a Letter Soup” , Chem. Rev. 2011, 111, 6557-6602.
- R 9 can be benzyl, allyl, or unsubstituted C 1 -C 4 alkyl.
- R 9 can be an activating group, such as l-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) , N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc.
- HOAt l-hydroxy-7-azabenzotriazole
- HBt 1-hydroxybenzotriazole
- Oxyma ethyl 2-cyano-2- (hydroxyimino) acetate
- NHS N-hydroxysuccinimide
- Pfp pentafluorophenol
- an activated ester group such as l-hydroxy-7-azabenzotriazole (HOAt) ester, 1-hydroxybenzotriazole (HOBt) ester, ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) ester, N-hydroxysuccinimide (NHS) ester, pentafluorophenol (Pfp) ester, etc.
- the disclosed compounds can have the structures of Formula I':
- R 1 -R 9 and X can be in any forms as defined above, and R 10 can be hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
- a C 1 -C 4 acyl group e.g., formyl, acetyl, propionyl
- a carbonyl group e.g., alloc
- carbamate group e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate
- R 10 can be hydrogen. In some forms, R 10 can be any substituents conventionally used to hinder the reactivity of the phenol group. For example, suitable amino protecting groups are described in Green T., “Protective Groups in Organic Synthesis” , Chapter 3, John Wiley and Sons, Inc., 2006, 367-430. In some forms, R 10 can be a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
- acyl group e.g., formyl, acetyl, propionyl
- carbonyl group e.g., alloc
- carbamate group e.g., tert-butyl methyl (2- (methylamino) e
- R 9 can be hydrogen and R 10 can be hydrogen.
- R 9 can be benzyl, allyl, unsubstituted C 1 -C 4 alkyl, or an activating group, such as l-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) , N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc. and R 10 can be hydrogen.
- HOAt l-hydroxy-7-azabenzotriazole
- HBt 1-hydroxybenzotriazole
- Oxyma ethyl 2-cyano-2- (hydroxyimino) acetate
- NHS N-hydroxysuccinimide
- Pfp pentafluorophenol
- R 9 can be hydrogen and R 10 can be a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
- R 10 can be a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
- R 9 can be benzyl, allyl, unsubstituted C 1 -C 4 alkyl, or an activating group, such as l-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) , N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc.
- an activating group such as l-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) , N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc.
- R 10 can be a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
- acyl group e.g., formyl, acetyl, propionyl
- carbonyl group e.g., alloc
- carbamate group e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate
- NBDs are shown below, where ⁇ represents indicates that attachment point, and R 3 -R 10 can be in any forms as defined above.
- L-Thr-L-Thr NBD L-Asp-L-Ser NBD
- L-Glu-L-Cys NBD L-Val-L-Pen NBD
- L-Val-L-Cys NBD L-Val-L-Cys NBD
- the disclosed NBDs are structurally different from of pseudo-prolines disclosed in et al., Pseudo-Prolines as a Solubilizing, Structure-Disrupting Protection Technique in Peptide Synthesis (J. Am. Chem. Soc. 1996, 118, 39, 9218-9227) .
- the phenyl group of the disclosed NBDs provides more substitution sites for further functionalization, where in pseudo-proline, the symmetric dimethyl group was adopted without any other site for modification.
- NBDs can cover broader range of peptide sequence and they can be easily prepared from mg- scale to sub-kg scale, whose estimated cost is only around 1 ⁇ 10%price of commercial competitors.
- a L-or D-Amino acid with suitable protection on amine and optionally its sidechain functional groups can be firstly coupled with salicylaldehyde or a mono-/multiply-substituted form thereof. Then, the obtained salicylaldehyde ester reacts with serine/threonine/cysteine/penicillamine (Ser/Thr/Cys/Pen) or its ester in a suitable solvent. The obtained ester may be further derivatized to relevant acids with or without phenol protections.
- the NBDs can be obtained in high yield (at least 40%) .
- 76 exemplary NBDs (structures shown in the section above) were produced in moderate to excellent yield (i.e., 40%to 95%) .
- Variants of those 76 NBDs that include D-amino acid version, different protecting groups on amine or side chain, other carboxylic derivatives and different substitutions of aryl could be also synthesized by following the developed synthetic strategy.
- the disclosed method includes: (a) reacting a protected amino acid of Formula II:
- R 1 can be a protected or unprotected side chain of an amino acid; and (ii) R 2 can be an amino protecting group,
- R 3 -R 6 can be independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted alkyl (such as unsubstituted C 1 -C 8 alkyl) , unsubsituted alkenyl (unsubstituted C 1 -C 8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C 1 -C 8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol,
- R 7 and R 8 can be indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;
- R 9 can be hydrogen, benzyl, allyl, unsubstituted C 1 -C 4 alkyl, or a suitable activating group, such as N-hydroxysuccinimide (NHS) ;
- X can be O or S,
- R 1 -R 9 and X are as defined above for the reactants.
- Activating groups suitable for R 9 of Formula V and Formula I are typically less active than HOAt and Oxyma, such as NHS, to reduce/prevent side reactions in step (b) .
- the method disclosed herein may further include step (c) : deprotecting a protected carboxyl group of the NBDs; or step (d) acylating or activating a free acid group or a free phenol group of the NBDs; or a combination thereof.
- a deprotected NBD with free acid form is obtained, for example, a deprotected NBD of Formula VI:
- R 1 -R 8 , R 10 , and X can be in any forms as defined above.
- R 9 and R 10 are hydrogen.
- R 9 of the NBDs is hydrogen, such that this free carboxylic acid group can be activated in step (d) .
- R 10 of the NBDs is hydrogen, such that this free hydroxyl group can be acylated in step (d) .
- both R 9 and R 10 of the NBDs is hydrogen, such that the free carbocylic acid and/or the hydroxyl group can be acylated and/or activated, as needed, in step (d) .
- an acylated NBD with or without a free carboxylic acid is obtained, for example, an acylated NBD of Formula I’:
- R 1 -R 9 and X can be in any forms as defined above, and R 10 can be in any form as defined above expect for hydrogen.
- the deprotection step (c) and acylation/activation step (d) can be performed in any order, and in any combination with steps (a) and (b) as described above.
- the method includes steps (a) , (b) , and (c) , performed sequentially.
- the method includes steps (a) , (b) , and (d) , performed sequentially.
- the method includes steps (a) , (b) , (c) , and (d) , performed sequentially.
- the method includes steps (a) , (b) , (d) , and (c) , performed sequentially.
- the in the formulae described herein indicates a carbon stereocenter (to which the wave bond attaches) that can be either S-or R-configuration.
- the acetyl carbon attaching to -C (O) OR 9 can be either S-or R-configuration.
- the threonine, cysteine, or penicillamine, or an ester thereof used in step (b) can be either in its L-or D-configuration, as racemates, or in various mixtures of their isomers.
- the protected amino acid used in step (a) can be either in its L-or D-configuration, as racemates, or in various mixtures of their isomers.
- a protected amino acid of Formula II is reacted with a subsituted or unsubstituted salicylaldehyde of Formula III to obtain an intermediate comprising a salicylaldehyde ester of Formula IV:
- R 1 -R 6 can be in any forms as defined above.
- the protection group can be a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, or a heterocyclic ring.
- R 1 is a protected side chain of an amino acid and the protection group can be an unsubstitited alkyl, an unsubstituted alkenyl, an unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, an unsubstituted aryl, an unsubstituted heteroaryl, or an unsubstituted heterocyclic ring.
- the protection group can be an unsubstitited alkyl, an unsubstituted alkenyl, an unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, an unsubstituted aryl, an unsubstituted heteroaryl, or an unsubstituted heterocyclic ring.
- the protected amino acid of Formula II used in step (a) can be any natural amino acids or synthetic amino acids, which can be either in their L-or D-configuration.
- the protected amino acid used in step (a) is glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine, in its L-or D-configuration.
- the side chain of the amino acid used in step (a) may be protected using a protecting group known in the art.
- Suitable protecting groups for the side chain include, but are not limited to, t Bu, Boc, Thz, Acm, Trt, Cbz, Alloc, Azido, Bn, Bz, Ac, and Pbf.
- the terminal amino group of the amino acid used in step (a) are protected (i.e., R 2 ) .
- Any amino protecting group known in the art can be used to protect the amino group of the amino acid used in the disclosed methods.
- R 2 of the protected amino acids used in step (a) can be Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
- the salicylaldehyde of Formula III that reacts with the protected amino acids of Formula II can be an unsubstituted salicylaldehyde or a mono-or poly-substituted salicylaldehyde.
- the salicylaldehyde reacting with the protected amino acid of Formula II is substituted with one or more substituents (i.e., at least one of R 3 -R 6 , optionally two or more of R 3 -R 6 , is/are not hydrogen) .
- one of R 3 -R 6 , two of R 3 -R 6 , three of R 3 -R 6 , or all of R 3 -R 6 , of the salicylaldehyde of Formula III used in step (a) is/are not hydrogen.
- Suitable substituents of the substituted salicylaldehyde of Formula III are hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted C 1 -C 6 alkyl, unsubsituted C 1 -C 6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, and thiol.
- the substituents are independently halogen, -CN, -CF 3 , -NO 2 , unsubstituted C 1 -C 6 or C 1 -C 4 alkyl, unsubstituted C 1 -C 6 or C 1 -C 4 alkenyl, unsubstitued C 1 -C 6 or C 1 -C 4 alkynyl, alkoxyl, and unsubstituted aryl.
- the reaction between the protected amino acid of Formula II and the subsituted or unsubstituted salicylaldehyde of Formula III is performed in a suitable organic solvent, and optionally in the presence of a condensation reagent and/or a base.
- suitable organic solvents for performing the reaction in step (a) include, but are not limited to, DMF, DMSO, DMAc, NMP.
- the organic solvent for performing the reaction in step (a) is DMF.
- the condensation agent used in the reaction in step (a) is HATU.
- Suitable base for performing the reaction in step (a) include, but are not limited to, DIPEA and Triethylamine.
- the base used in the reaction in step (a) is DIPEA.
- the reaction between the protected amino acid of Formula II and the subsituted or unsubstituted salicylaldehyde of Formula III can be performed in an suitable organic solvent, such as any one of thoes described above, and optionally in the presence of a condensation reagent and/or a base, at room temperature (i.e., 20 °C to 25 °C, at 1 atm) for a time period ranging from 30 mins to 5 hours, preferably from 1 hour to 3 hours, such as about 1 hour or about 1.5 hour.
- an organic solvent such as any one of thoes described above
- a condensation reagent and/or a base at room temperature (i.e., 20 °C to 25 °C, at 1 atm) for a time period ranging from 30 mins to 5 hours, preferably from 1 hour to 3 hours, such as about 1 hour or about 1.5 hour.
- the method disclosed herein further includes a step of purifying the intermediate containing the salicylaldehyde ester of Formula IV after step (a) and prior to step (b) .
- the purification step can be performed using any suitable techniques known in the art, such as using extraction, washing, drying, column chromatography (such as by using a silica gel column) , or filtration, or a combination thereof.
- the intermdeiate containing the salicylaldehyde ester of Formula IV is purified using the following techniques: extraction with an organic solvent that is different from the organic solvent used for performing the reaction, washing with an acid for one or more times, drying with a drying agent (such as sodium sulfate) and/or evaporation, and then separation using silica gel column.
- a drying agent such as sodium sulfate
- the salicylaldehyde ester of Formula IV obtained in step (a) has a yield of at least 40%, at least 50%, or in a range from about 50%to about 95%.
- the yield of salicylaldehyde ester of Formula IV can be calculated using the formula: (The mole of the salicylaldehyde ester of Formula IV) / (The mole of the protected amino acid of Formula II) *100%. The yield may be determined either before or after purification of the intermediate.
- the salicylaldehyde ester of Formula IV is reacted with serine, threonine, cysteine, or penicillamine, or an ester thereof of Formula V to obtain the compounds of Formula I:
- R 1 -R 9 and X can be in any forms as defined above.
- R 9 can be hydrogen or any suitable functional group that can form an ester with the carboxyl group of serine, threonine, cysteine, and penicillamine, such as benzyl, allyl, and unsubstituted C 1 -C 4 alkyl.
- the reaction between the salicylaldehyde ester of Formula IV and the serine, threonine, cysteine, or penicillamine, or ester thereof of Formula V is performed in a suitable solvent.
- the solvent may be an organic solvent or water or a combination thereof, and typically contains a buffer at an amount ranging from 10 vol%to 20 vol%, such as 10 vol%to 20 vol%pyridine/HOAc (1: 1, molar ratio) .
- Suitable solvents for preforming the reaction in step (b) include, but are not limited to, dichloromethane, trifluoroacetic acid, ethyl acetate, toluene, tetrahydrofuran, 1, 4-dixoane, acetonitrile, acetone, hexane, and water, and a combination thereof.
- buffers suitable for use in the solvent for performing the reaction in step (b) include, but are not limited to, pyridine-acetic acid buffer, colidine-acetic acid buffer, picoline-acetic acid buffer, DMSO-pyridine-acetic acid buffer, DMSO-colidine-acetic acid buffer, and DMSO-picoline-acetic acid buffer.
- a preferred buffer suitable for use is pyridine-acetic acid buffer, where the molar ratio of pyridine to acetic acid ranges from 1: 9 to 9: 1, such as 1: 1.
- the reaction between the salicylaldehyde ester of Formula IV and the serine, threonine, cysteine, or penicillamine, or ester thereof of Formula V can be performed in an suitable solvent, such as any one of thoes described above, at room temperature for a time period ranging from 1 hour to 5 hours, preferably from 2 hours to 3 hours, such as about 2 hours or about 3 hours.
- the method disclosed herein further includes a step of purifying the product containing the compound of Formula I after step (b) .
- the purification step can be performed using any suitable techniques known in the art, such as any of those described above in step (a) .
- the product containing the compounds of Formula I is purified using the following techniques: extraction with a suitable solvent and separation using column chromatography or filtration, washing with cod water and cold organic solvent (such as cold ethanol and/or cold ether) for one or more times, and separation by silica gel column.
- the compound of Formula I obtained in step (b) has a yield of at least 40%or in a range from about 40%to about 95%.
- the yield of compond of Formula I can be calculated using the formula: (The mole of the compound of Formula I) / (The mole of the salicylaldehyde ester Formula IV) *100%. The yield may be determined either before or after purification of the product.
- the disclosed method further includes a step (c) : deprotecting a protected carboxyl group of the NBDs formed in step (b) or (d) as described herein.
- the disclosed method includes steps (a) , (b) , and (c) performed sequentilly.
- the disclosed method includes steps (a) , (b) , (c) , and (d) , performed sequentially.
- the disclosed method includes steps (a) , (b) , (d) , and (c) , performed sequentially.
- R 9 of the NBDs is typically not hydrogen, in particular when X of the formulae described herein is O.
- the protected carboxyl group of the NBDs can be deprotected using reagents and reactions known in the art, for example, by using reagents such as Pd (PPh 3 ) 4 and/or PhSiH 3 for deallylation; H 2 and/or Pd/C for hydrogenation of Bn protection; LiOH for sponsification of alkyl protection, etc.
- reagents such as Pd (PPh 3 ) 4 and/or PhSiH 3 for deallylation; H 2 and/or Pd/C for hydrogenation of Bn protection; LiOH for sponsification of alkyl protection, etc.
- R 1 -R 8 , R 10 , and X can be in any forms as defined above.
- the method disclosed herein further includes a step of purifying the product containing the compound of Formula VI after step (c) .
- the purification step can be performed using any suitable techniques known in the art, such as any of those described above in step (a) or (b) .
- the product containing the compounds of Formula VI is purified using the following techniques: extraction with a suitable solvent and separation using column chromatography or filtration, washing with cod water and cold organic solvent (such as cold ethanol and/or cold ether) for one or more times, and separation by silica gel column.
- the compound of Formula VI obtained in step (c) has a yield of at least 40%or in a range from about 40%to about 95%.
- the yield of compond of Formula VI can be calculated using the formula: (The mole of the compound of Formula VI) / (The mole of the compound of Formula I’) *100%. The yield may be determined either before or after purification of the product.
- the disclosed method further includes a step (d) : acylating or activating a free acid or a free phenol group of the NBDs formed in step (b) or (c) .
- a step (d) acylating or activating a free acid or a free phenol group of the NBDs formed in step (b) or (c) .
- at least one of R 9 and R 10 of the reagent, such as an NBD of Formula I or VI is hydrogen.
- R 9 of the NBDs is hydrogen, either in the NBD product of step (b) or in the deprotected NBD formed in the deprotection step (c) , such that this free carboxylic acid group can be activated in step (d) .
- R 10 of the NBDs is hydrogen in the NBD product of step (b) and/or the deprotected NBD formed in step (c) , such that this free hydroxyl group can be acylated in step (d) .
- both R 9 and R 10 of the NBDs is hydrogen, either in the NBD product of step (b) or in the deprotected NBD formed in the deprotection step (c) , such that the free carbocylic acid and/or the hydroxyl group can be acylated and/or activated, as needed, in step (d) .
- both R 9 and R 10 of the NBDs is hydrogen, either in the NBD product of step (b) or in the deprotected NBD formed in the deprotection step (c) , such that the free carbocylic acid is activated, and/or the hydroxyl group is acylated, in step (d) .
- an acylated NBD with or without a free carboxylic acid (e.g., -COOR 9 is carboxyl or ester) can be obtained, for example, an acylated NBD of Formula I’:
- R 1 -R 9 and X can be in any forms as defined above, and R 10 can be in any form as defined above expect for hydrogen.
- a free acid of the NBDs can be acylated and/or activated using reagents and reactions known in the art.
- a free acid of the NBDs (-COOH and/or -OH) can be acylated using reagents such as acetyl anhydride, carbonate, or carbamate to obtain an acyl-capped NBD of Formula I’, where R 9 and/or R 10 , such as R 10 , is an acyl group.
- a free acid of the NBDs e.g., -COOH
- the method disclosed herein further includes a step of purifying the product containing acylated and/or activated NBDs after step (d) .
- the purification step can be performed using any suitable techniques known in the art, such as any of those described above in step (a) , (b) or (c) .
- the product containing the acylated and/or activated NBDs is purified using the following techniques: extraction with a suitable solvent and separation using column chromatography or filtration, washing with cod water and cold organic solvent (such as cold ethanol and/or cold ether) for one or more times, and separation by silica gel column.
- the acylated and/or activated NBDs obtained in step (d) has a yield of at least 40%or in a range from about 40%to about 95%.
- the yield of acylated and/or activated NBDs can be calculated using the formula: (The mole of the acylated and/or activated NBDs) / (The mole of the compound of Formula VI) *100%. The yield may be determined either before or after purification of the product.
- R 1 -R 10 and X can be in any forms as defined above.
- NBD of Formula I’ having a desired structure can be obtained following steps (a) and (b) ; following steps (a) , (b) , and step (c) ; following steps (a) , (b) , and (d) ; or following steps (a) , (b) , (c) , and (d) .
- the peptide/protein synthesis methods disclosed herein are based at least on the application of NBD as a pre-formed dipeptide building block for synthesizing difficult peptides/proteins.
- Difficult sequence and “difficult peptide/protein” are used interchangeably herein.
- the concept of “difficult sequence” was introduced in the 80’s and was given distinction by Kent and co-workers for peptides that form strong inter-or intra molecular, non-covalent interactions which form insoluble peptide aggregates.
- “Difficult sequences” are peptide sequences that contain high number of amino acids possessing hydrophobic side chains, so-called ⁇ -branched amino acids, including leucine, valine, phenylalanine or isoleucine. Additionally, glycine is known to induce ⁇ -sheet packing in combination with afore mentioned amino acids.
- NBD-SPPS NBD-SPPS
- difficult sequences include membrane proteins and their functional parts, for example, amylin, BM2 proton channel, influenza A, Copper storage protein 1 CSP-1, Interferon-induced transmembrane protein 3 (IFITM3) , NS4A, cofactor protein of serine protease from Hepatitis C virus; PD-L1 (programmed death-ligand 1) , IL-2 (INTERLEUCKIN -2) , Vasoactive intestinal peptide, Erythropoietin, Liraglutide, etc.
- membrane proteins and their functional parts for example, amylin, BM2 proton channel, influenza A, Copper storage protein 1 CSP-1, Interferon-induced transmembrane protein 3 (IFITM3) , NS4A, cofactor protein of serine protease from Hepatitis C virus; PD-L1 (programmed death-ligand 1) , IL-2 (INTERLEUCKIN -2) , Vasoactive intestinal peptide, Erythropoi
- the disclosed NBDs are typically used in SPPS methods known in the art.
- Solid phase synthesis in which the C-terminal amino acid of the sequence is attached to an insoluble support followed by sequential addition of the remaining amino acids in the sequence is a preferred method for the chemical synthesis of the peptides.
- Techniques for solid phase synthesis are known to those of skill in the art and are described, for example, by Barany and Merrifield (1963) Solid-Phase Peptide Synthesis; pp. 3-284 in The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide Synthesis, Part A.; Merrifield et al. (1963) J. Am. Chem. Soc, 85: 2149-2156, and Stewart et al.
- Solid Phase Peptide Synthesis 2 nd ed. Pierce Chem. Co., Rockford, 111.
- Such methods include bench scale solid phase synthesis and automated peptide synthesis in any one of the many commercially available peptide synthesizers. Solid phase synthesis is commonly used, and various commercial synthesizers are available, such as automated synthesizers by Applied Biosystems Inc., Foster City, CA; Beckman; MultiSyntech, Bochum, Germany etc.
- Functional groups for conjugating the peptide to small molecules, label moieties, peptides, or proteins may be introduced into the molecule during chemical synthesis.
- small molecules and label moieties/reporter units may be attached during the synthetic process. Introduction of the functional groups and conjugation to other molecules minimally affects the structure and function of the NBDs.
- US 2008/0004451 describes the synthesis of dipeptides building blocks with acetonide protecting group to form a pseudoproline ring.
- the disclosed methods utilizes the ligation intermediate of Serine/Threonine Ligation (STL) or Cysteine/Penicillamine Ligation (CPL) as the proline mimic i.e., NBDs, which allows the scale-up production of peptides in a very safe manner (NBDs are safe to handle) and it is environmentally friendly, because NBDs synthesis is a mild endothermic process and does not require heating or reflux conditions.
- the disclosed NBDs are difficult to obtain by the methods disclosed in US 2008/0004451.
- the disclosed methods have advantages over the methods disclosed in Sohma, et al., Novel and efficient synthesis of difficult sequence-containing peptides through O-N intramolecular acyl migration reaction of O-acyl isopeptides, Chem. Commun (Camb) , 1: 124-5 (2004) ( (DOI: 10.1039/B312129A) , which discloses isoacyl dipeptides (which are chemically distinct from the disclosed NBDs) . Further, the synthesis of isopeptide may have a risk of epimerization since the esterification required highly active condensation reagents.
- the disclosed methods proceed via NBD; the ligation between two amino acid fragment is chemoselective and stereoselective, without the risk of epimerization.
- the disclosed NBDs are much more stable under basic condition, which is important in Fmoc-SPPS for peptide synthesis. This advantage makes the disclosed NBDs much more suitable for long peptide or protein chemical synthesis.
- R 1 is a protected or unprotected side chain of an amino acid
- R 2 is an amine protecting group
- R 3 -R 6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted C 1 -C 6 alkyl, unsubsituted C 1 -C 6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
- R 7 and R 8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids
- R 9 is hydrogen or a functional group suitable for protecting and/or activating a carboxylic acid group
- R 10 is hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and
- R 1 is a protected or unprotected side chain of glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine.
- R 1 is a protected side chain
- the protection group is selected from the group consisting of a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, and a heterocyclic ring, optionally wherein the protection group is t Bu, Boc, Thz, Acm, Trt, Cbz, Alloc, Azido, Bn, Bz, Ac, and Pbf, or a combination thereof.
- R 3 -R 6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted alkyl (such as unsubstituted C 1 -C 8 alkyl) , unsubsituted alkenyl (unsubstituted C 1 -C 8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C 1 -C 8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol, optionally wherein R 3 -R 6 are independently hydrogen, hydroxyl, halogen, -CN, hal
- R 9 is hydrogen, benzyl, allyl, unsubstituted C 1 -C 4 alkyl; or is an activated ester group, such as l-hydroxy-7-azabenzotriazole (HOAt) ester, 1- hydroxybenzotriazole (HOBt) ester, ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) ester, N-hydroxysuccinimide (NHS) ester, pentafluorophenol (Pfp) ester, etc.
- HOAt l-hydroxy-7-azabenzotriazole
- HBt 1- hydroxybenzotriazole
- NHS N-hydroxysuccinimide
- Pfp pentafluorophenol
- R 10 is hydrogen, a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
- a C 1 -C 4 acyl group e.g., formyl, acetyl, propionyl
- a carbonyl group e.g., alloc
- a carbamate group e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate
- R 1 is a protected or unprotected side chain of an amino acid
- R 2 is an amine protecting group
- R 3 -R 6 are independently hydrogen, hdyroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted C 1 -C 6 alkyl, unsubsituted C 1 -C 6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
- R 7 and R 8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids
- R 9 is hydrogen, benzyl, allyl, unsubstituted C 1 -C 4 alkyl, or a functional group suitable for protecting and/or activating a carboxylic acid group;
- R 1 and R 2 are as defined above for Formula I,
- R 3 -R 6 are as defined above for Formula I,
- R 1 is a protected or unprotected side chain of glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine.
- R 1 is a protected side chain
- the protection group is selected from the group consisting of a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, and a heterocyclic ring, optionally wherein the protection group is t Bu, Boc, Thz, Acm, Trt, Cbz, Alloc, Azido, Bn, Bz, Ac, and Pbf, or a combination thereof.
- R 2 is Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
- R 3 -R 6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted alkyl (such as unsubstituted C 1 -C 8 alkyl) , unsubsituted alkenyl (unsubstituted C 1 -C 8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C 1 -C 8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol, optionally wherein R 3 -R 6 are independently hydrogen, hydroxyl, halogen, -CN, hal
- step (a) is performed in the presence of a condensation agent, optionally wherein the condensation agent is HATU, PyBOP, HBTU, HCTU, COMU, TSTU, PyClock, PyOxim, EDCI, or DCC, or a combination thereof.
- a condensation agent optionally wherein the condensation agent is HATU, PyBOP, HBTU, HCTU, COMU, TSTU, PyClock, PyOxim, EDCI, or DCC, or a combination thereof.
- step (a) is performed in the presence of a base, optionally wherein the base is DIPEA or Triethylamine, or a combination thereof.
- step (a) is performed in an organic solvent, optionally wherein the organic solvent is DMF, DMSO, DMAc, or NMP, or a combination thereof.
- step (a) is performed at room temperature for a time period ranging from 30 mins to 5 hours, preferably from 1 hour to 3 hours.
- step (a) has a yield of at least 40%, at least 50%, or in a range from about 50%to about 95%.
- step (b) is performed in a solvent comprising a buffer, optionally wherein the solvent is dichloromethane, trifluoroacetic acid, tetrahydrofuran, toluene, ethyl acetate, 1, 4-dioxane, acetonitrile, acetone, or water, or a combination thereof.
- the solvent is dichloromethane, trifluoroacetic acid, tetrahydrofuran, toluene, ethyl acetate, 1, 4-dioxane, acetonitrile, acetone, or water, or a combination thereof.
- step (b) is performed at room temperature for a time period ranging from 1 hour to 5 hours, preferably from 2 hours to 3 hours.
- step (b) The method of any one of paragraphs 9-22, wherein the compound of Formula I obtained in step (b) has a yield of at least 40%, such as in a range from about 40%to about 95%.
- R 1 is a protected or unprotected side chain of an amino acid
- R 2 is an amine protecting group
- R 3 -R 6 are independently hydrogen, hdyroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted C 1 -C 6 alkyl, unsubsituted C 1 -C 6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
- R 7 and R 8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids
- R 10 is hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and
- a functional group suitable for hindering the reactivity of phenyl group such as a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and
- X is O or S.
- R 1 is a protected or unprotected side chain of an amino acid
- R 2 is an amine protecting group
- R 3 -R 6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF 3 ) , -NO 2 , unsubstituted C 1 -C 6 alkyl, unsubsituted C 1 -C 6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
- R 7 and R 8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids
- R 9 is hydrogen or a functional group suitable for protecting and/or activating a carboxylic acid group
- R 10 is a functional group suitable for hindering the reactivity of phenyl group, such as a C 1 -C 4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and
- step (c) is performed immediately following step (b) or immediately following step (d) , wherein the carboxyl protected NBD is the NBD of Formula I formed in step (b) or the NBD of Formula I’ formed in step (d) , and wherin R 9 of Formula I or Formula I’ is not hydrogen.
- step (d) is performed immediately following step (b) or immediately following step (c)
- the free-acid containing NBD is the NBD of Formula I formed in step (b) or the NBD of Formula VI formed in step (c) .
- Step b
- Step a & Step b
- L-Fmoc-Val-OH (3.39 g, 10.0 mmol) and HATU (3.80 g, 10.0 mmol) were dissolved in 50.0 mL DMF. Subsequently, DIPEA (2.58 g, 20.0 mmol) was added and the reaction mixture was stirred under room temperature for 1 min. Then, 4-methoxy salicylaldehyde (1.52 g, 10.0 mmol) was added and the reaction was stirred for 1.0 h.
- the first example synthesized Ac-capped L-Thr-L-Thr NBD the structure of which is shown below.
- Step b
- the Fmoc-Val-SAL ester was dissolved into 250 mL dichloromethane, and 10 mL pyridine/HOAc buffer containing 10.0 mmol L-H-Thr-Oall ester was added. The reaction mixture was stirred at room temperature for 2 h and extracted similarly as mentioned above. Later the crude product was purified by column chromatography, affording the unprotected NBD as pale solid in 91%yield.
- NBD acid was then dissolved into 250 mL dichloromethane, and Ac 2 O (2.0 eq. ) and DIPEA (4.0 eq. ) were added.
- the reaction mixture was stirred at room temperature for 3 h and extracted similarly as mentioned above. Then the crude product was purified by column chromatography, affording the desired Ac-capped NBD acid as pale solid in 90%yield.
- the first example synthesized HOAt-activated L-Thr-L-Thr NBD the structure of which is shown below.
- Step b
- the Fmoc-Val-SAL ester was dissolved into 250 mL dichloromethane, and 10 mL pyridine/HOAc buffer containing 10.0 mmol L-H-Thr-OAll ester was added. The reaction mixture was stirred at room temperature for 2 h and extracted similarly as mentioned above. Later the crude product was purified by column chromatography, affording the unprotected NBD as pale solid in 91%yield.
- NBD acid was then dissolved into 250 mL dichloromethane, and HATU (1.0 eq. ) and DIPEA (1.0 eq. ) were added.
- the reaction mixture was stirred at room temperature for 0.5 h and extracted similarly as mentioned above. Then the crude product was purified by column chromatography, affording the desired HOAt-activated NBD acid as pale solid in 78%yield.
- Step a & Step b
- L-Fmoc-Val-OH (3.39 g, 10.0 mmol) and HATU (3.80 g, 10.0 mmol) were dissolved in 50.0 mL DMF. Subsequently, DIPEA (2.58 g, 20.0 mmol) was added and the reaction mixture was stirred under room temperature for 1 min. Then, 4-methoxy salicylaldehyde (1.52 g, 10.0 mmol) was added and the reaction was stirred for 1.0 h.
- the NBD is already in acid form so that the step c could be skipped.
- the unprotected NBD was dissolved into 250 mL dichloromethane, and tert-butyl methyl (2- (methylamino) ethyl) carbamate (2.0 eq. ) activated by 1, 1'-carbonyldiimidazole (2.0 eq. ) and DIPEA (4.0 eq. ) were added.
- the reaction mixture was stirred at room temperature for 1 h and extracted similarly as mentioned above. Then the crude product was purified by column chromatography, affording the desired carbamate-capped NBD as pale solid in 82%yield.
- the first case was the synthesis of PD-L1 (121-132) , a C-terminus of PD-L1 that was highly prone to form ⁇ -sheet structure.
- the amino acids were installed by standard coupling procedure by using HATU as coupling reagent, sequentially, the resin was subjected to washing steps. Then the resin was treated with 20%piperidine/DMF to remove the Fmoc protecting group and allowed for the next coupling cycle.
- Ile-Thr NBD was installed by a one-pot activation and coupling protocol. As shown in the Figure 1, the installation of Ile-Thr NBD improved the quality of final products significantly. Furthermore, without dipeptides, serious amino acid deletion was observed, indicating the powerfulness of dipeptide in solving difficult SPPS issue.
- RANTES a natural HIV-suppressive factor containing 68 amino acids
- the amino acids were installed by standard coupling procedure by using HATU as coupling reagent, sequentially, the resin was subjected to washing steps. Then the resin was treated with 20%piperidine/DMF to remove the Fmoc protecting group and enabled the next coupling cycle.
- three corresponding NBDs were installed by a one-pot activation and coupling protocol. As shown in Figure 4, RANTES was successfully constructed when four NBDs were introduced and up to 21%isolated yield could be achieved after HPLC purification.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Life Sciences & Earth Sciences (AREA)
- Analytical Chemistry (AREA)
- Organic Low-Molecular-Weight Compounds And Preparation Thereof (AREA)
Abstract
Building block structures containing N, O/S-benzylidene acetal dipeptide (NBD) are described. The unique core structure of NBD can effectively convert those difficult peptide/proteins sequence into the easy ones, allowing smooth synthesis of those challenging targets. Further, the production of NBDs is highly robust, and it can be scale up facilely, which can facilitate the research and development of tailor-made peptides/proteins in the future.
Description
CROSS-REFERENCE TO RELATED APPLICATION
This application claims the benefit of and priority to U.S. Provisional Application No. 63/486,748 filed February 24, 2023, the entire content of which is incorporated herein by reference for all purpose in its entirety.
This invention is generally in the field of protein/peptide synthesis.
Chemical protein synthesis plays an important role in the study of fundamental biochemistry and development of macromolecule-based therapy. It allows precise modifications at the atomic level, such as mirror image proteins. The advent of solid phase peptide synthesis (SPPS) has allowed peptides or small proteins to be synthesized chemically, allowing site-specific installation of chemical probes or post-translational modification of peptides to realize their biofunctions. Following SPPS, a variety of coupling reagents, orthogonal protecting groups and Fmoc-based SPPS have been well-developed to ensure peptide synthesis as a convenient and commonly used technique. SPPS remains the gold-standard method, and it often faces challenges in synthesizing difficult peptides/proteins that tend to aggregate on the resin. There are still many peptide sequences that cannot be synthesized by direct SPPS due to their aggregation tendency during the peptide synthesis, which classifies them as difficult peptides. The current strategies to solve this problem are less cost-effective and have limited application in industrial production. To overcome the problem of difficult peptide synthesis, three methods have been developed to inhibit hydrogen-bonded interchain association, including pseudoproline dipeptides, HMB (2-hydroxy-4-methoxybenzyl) , and isoacyl dipeptides and as the backbone-protecting group, reported in 1992 (Haack & Mutter, Tetrahedron Lett.
33, 1589-1592 (1992) ) , 1995 (Johnson, T., et al., J. Chem. Soc. Chem. Commun. 369-372 (1993) ) , and 2004 (Sohma, Y., et al., Chem. Commun. 124-125 (2004) ) , respectively. These compounds can disrupt peptide aggregation during SPPS, but are not widely used in industries, as the synthesis of these building blocks are not very cost-effective.
There is a need for new and improved methods for synthesizing challenging peptides/proteins at low cost for fundamental chemical biology studies and therapeutic research & development.
Building block structures containing N, O/S-benzylidene acetal dipeptides (NBDs) are described herein. The unique core structure of NBD (also referred to herein as “compound” ) can effectively convert those difficult peptide/proteins sequences into the easier ones, allowing smooth synthesis of those challenging targets. Further, the production of NBDs is highly robust, and it can be scale up facilely, which can facilitate research and development of tailor-made peptides/proteins.
In some forms, the compounds can have the structure of Formula I:
wherien: (i) R1 can be a protected or unprotected side chain of an amino acid; (ii) R2 can be an amine protecting group; (iii) R3-R6 can be independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
(iv) R7 and R8 can be indepedently hydrogen, methyl, or other protected/unprotected side chains of amino acids; (v) R9 can be hydrogen or a functional group suitable for protecting and/or activating a carboxylic acid group, such as benzyl, allyl, unsubstituted C1-C4 alkyl, and an activating group, e.g., l-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) , N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc.; and (vi) X can be O or S.
When R1 is a protected side chain of an amino acid, the protection group can be a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, or a substituted or unsubstituted heterocyclic ring.
In some forms, the compounds can have the structure of Formula I’:
wherein: R1-R9 and X can be as defined above, and R10 can be hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
Methods of producing the compounds disclosed herein are described. Generally, the method includes two steps of reactions. Briefly, a L-or D-Amino acid with suitable protection on amine and optionally its sidechain functional groups, can be firstly coupled with salicylaldehyde or a mono-/multiply-
substituted form thereof. Then, the obtained salicylaldehyde ester reacts with serine/threonine/cysteine/penicillamine (Ser/Thr/Cys/Pen) or its ester in a suitable solvent. The obtained ester may be further derivatized to relevant acids with or without phenol protections. Using the methods disclosed herein, the NBDs can be obtained in high yield (at least 40%) .
Methods of using the NBDs for peptide/protein synthesis as a pre-formed dipeptide building block for synthesizing difficult peptides/proteins are also disclosed.
FIG. 1 shows a comparison of the synthesis of PD-L1 (121-123) using the disclosed method and conventional SPPS.
FIG. 2 shows a comparison of the synthesis of IL-2 (125-133) using the disclosed method and conventional SPPS.
FIG. 3 shows a comparison of the synthesis of Amylin using the disclosed method and conventional SPPS.
FIG. 4 shows a comparison of the synthesis of Rantes using the disclosed method and conventional SPPS.
Difficult peptides/proteins are normally prone to aggregate on the resin through hydrophobic interaction of amino-acid side chains and hydrogen bonding of amide bond, which can cause incomplete coupling or deprotection during the SPPS process. Without being bound to any theories, it is believed that the disclosed N, O/S-benzylidene acetal dipeptide (NBD) can act as a conformation-twisted dipeptide building block, which forms a pseudoproline turn to disrupt the interaction between peptides. Therefore, the aggregation can be successfully prevented, and targeted sequence can be synthesized smoothly.
I. DEFINTIONS
The term “alkyl” refers to a branched or straight-chain monovalent saturated aliphatic hydrocarbon radical of one to eight carbon atoms. This term
is further exemplified by radicals such as methyl, ethyl, n-propyl, isopropyl, n-butyl, s-butyl, t-butyl, n-pentyl, isopentyl, t-pentyl, n-hexyl, etc.
The term “alkenyl” refers to a branched or straight-chain hydrocarbon group of from two to eight carbon atoms and structural formula containing at least one carbon-carbon double bond.
The term “alkynyl” refers to a branched or straight-chain hydrocarbon group of from two to eight carbon atoms and structural formula containing at least one carbon-carbon triple bond.
The term “aryl” refers to C5-C20-membered aromatic or fused aromatic ring systems. Examples of aromatic groups are benzene, naphthalene, anthracene, phenanthrene, chrysene, pyrene, corannulene, coronene, etc.
The term “cyclic group” refers to a saturated or unsaturated monocyclic ring or a polycyclic ring (such as those formed from single or fused ring systems) , such as a cycloalkyl, a cycloalkenyl, or a cycloalkynyl, which have from three to ten carbon atoms, as geometric constraints permit.
The term “heteroalkyl” refers to straight or branched chain carbon-containing alkyl radicals containing at least one heteroatom on the carbon backbone. Suitable heteroatoms include, but are not limited to, O, N, Si, P and S, wherein the nitrogen, phosphorous and sulfur atoms are optionally oxidized, and the nitrogen heteroatom is optionally quaternized.
The term “heteroaryl” refers to C5-C20-membered aromatic or fused aromatic ring systems, in which one or more carbon atoms on one or more aromatic ring structures have been substituted with a heteroatom. Suitable heteroatoms include, but are not limited to, oxygen, sulfur, and nitrogen. Examples of heteroaryl groups pyrrole, furan, thiophene, imidazole, oxazole, thiazole, triazole, tetrazole, pyrazole, pyridine, pyrazine, pyridazine and pyrimidine, and the like. Examples of heteroaryl rings include, but are not limited to, benzimidazolyl, benzofuranyl, benzothiofuranyl, benzothiophenyl, benzoxazolyl, benzoxazolinyl, benzthiazolyl, benztriazolyl, benztetrazolyl,
benzisoxazolyl, benzisothiazolyl, benzimidazolinyl, carbazolyl, 4aH-carbazolyl, carbolinyl, chromanyl, chromenyl, cinnolinyl, decahydroquinolinyl, 2H, 6H-1, 5, 2-dithiazinyl, furanyl, furazanyl, imidazolidinyl, imidazolinyl, imidazolyl, 1H-indazolyl, indolenyl, indolinyl, indolizinyl, indolyl, 3H-indolyl, isatinoyl, isobenzofuranyl, isochromanyl, isoindazolyl, isoindolinyl, isoindolyl, isoquinolinyl, isothiazolyl, isoxazolyl, methylenedioxyphenyl, naphthyridinyl, octahydroisoquinolinyl, 1, 2, 3-oxadiazolyl, 1, 2, 4-oxadiazolyl, 1, 2, 5-oxadiazolyl, 1, 3, 4-oxadiazolyl, oxazolidinyl, oxazolyl, oxindolyl, pyrimidinyl, phenanthridinyl, phenanthrolinyl, phenazinyl, phenothiazinyl, phenoxathinyl, phenoxazinyl, phthalazinyl, pteridinyl, purinyl, pyrazinyl, pyrazolidinyl, pyrazolinyl, pyrazolyl, pyridazinyl, pyridooxazole, pyridoimidazole, pyridothiazole, pyridinyl, pyridyl, pyrimidinyl, pyrrolidinyl, pyrrolinyl, pyrrolyl, quinazolinyl, quinolinyl, quinoxalinyl, tetrahydroisoquinolinyl, tetrahydroquinolinyl, tetrazolyl, 1, 2, 3-thiadiazolyl, 1, 2, 4-thiadiazolyl, 1, 2, 5-thiadiazolyl, 1, 3, 4-thiadiazolyl, thianthrenyl, thiazolyl, thienyl, thienothiazolyl, thienooxazolyl, thienoimidazolyl, thiophenyl and xanthenyl. One or more of the rings can be substituted as defined below for “substituted heteroaryl. ”
The term “heterocyclic group” refers to a cyclic group as defined above where at least one of the carbon atoms of the ring is substituted with a heteroatom, such as, nitrogen, oxygen, sulfur, or phosphorus.
The term “aralkyl” refers to an aryl group or a heteroaryl group having an alkyl, alkynyl, or alkenyl group as defined above attached to the aromatic group, such as an aryl, a heteroaryl, a polyaryl, or a polyheteroaryl. An example of an aralkyl group is a benzyl group.
The terms “alkoxyl” refers to compounds represented by the formula -ORv, wherein Rv includes, but is not limited to, an alkyl, an alkenyl, an alkynyl, a cycloalkyl, a heterocyclic, a cycloalkenyl, an aryl, a heteroaryl, an aralkyl, a heteroalkyl, etc. Exemplary alkoxyl groups include methoxy, ethoxy, propyloxy, tert-butoxy and the like. A “lower alkoxyl” group is an alkoxyl group
containing from one to six carbon atoms. An “ether” is two functional groups covalently linked by an oxygen as defined below. Accordingly, the substituent of an alkyl that renders that alkyl an ether is or resembles an alkoxyl, such as can be represented by one of -O-alkyl, -O-alkenyl, -O-alkynyl, -O-aralkyl, -O-aryl, -O-heteroaryl, -O-cyclic, -O-heterocyclic, etc.
The term "amino" as used herein includes the group
wherein, E is absent, or E is substituted or unsubstituted alkyl, substituted or unsubstituted alkenyl, substituted or unsubstituted alkynyl, substituted or unsubstituted aralkyl, substituted or unsubstituted aryl, substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, substituted or unsubstituted heterocyclyl, wherein independently of E, Rx, Rxi, and Rxii each independently represent a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted carbonyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aralkyl (e.g. a substituted or unsubstituted alkylaryl, a substituted or unsubstituted arylalkyl) , a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted heterocyclyl, a hydroxyl, an alkoxy, a phosphonium, a phosphanyl, a phosphonyl, a sulfinyl, a silyl, a thiol, an amido, an amino, or - (CH2) m-R”’; R”’ represents a hydroxyl group, a substituted or unsubstituted carbonyl group, a substituted or unsubstituted aryl, a substituted or unsubstituted cycloalkyl, a substituted or unsubstituted cycloalkenyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aryl, a
substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, an alkoxy, a phosphonium, a phosphanyl, an amido, or an amino; and m is zero or an integer ranging from 1 to 8. The term “quaternary amino” also includes the groups where the nitrogen, Rx, Rxi, and Rxii with the N+ to which they are attached complete a heterocyclyl or heteroaryl having from 3 to 14 atoms in the ring structure. It is understood by those of ordinary skill in the art, that the E groups listed above are divalent (e.g., methylene, ethane-1, 2-diyl, ethene-1, 2-diyl, 1, 4-phenylene, cyclohexane-1, 2-diyl) .
The terms “amide” or “amido” are used interchangeably, refer to both “unsubstituted amido” and “substituted amido” and are represented by the general formula:
wherein, E is absent, or E is a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted aralkyl, a substituted or unsubstituted aryl, substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, or a substituted or unsubstituted heterocyclyl, wherein independently of E, R and R’ each independently represent a hydrogen, a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted carbonyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aralkyl (e.g. a substituted or unsubstituted alkylaryl, a substituted or unsubstituted arylalkyl) , a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted heterocyclyl, a hydroxyl, an alkoxy, a phosphonium, a phosphanyl, a
phosphonyl, a sulfinyl, a silyl, a thiol, an amido, an amino, or - (CH2) m-R”’, or R and R’ taken together with the N atom to which they are attached complete a heterocycle having from 3 to 14 atoms in the ring structure; R”’ represents a hydroxyl group, a substituted or unsubstituted carbonyl group, a substituted or unsubstituted aryl, a substituted or unsubstituted cycloalkyl, a substituted or unsubstituted cycloalkenyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, an alkoxy, a phosphonium, a phosphanyl, an amido, or an amino; and m is zero or an integer ranging from 1 to 8. In some forms, when E is oxygen, a carbamate is formed. It is understood by those of ordinary skill in the art, that the E groups listed above are divalent (e.g., methylene, ethane-1, 2-diyl, ethene-1, 2-diyl, 1, 4-phenylene, cyclohexane-1, 2-diyl) .
“Carbonyl, ” as used herein, is art-recognized and includes such moieties as can be represented by the general formula:
wherein X is a bond, or represents an oxygen or a sulfur, and R represents a hydrogen, a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted carbonyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aralkyl (e.g. a substituted or unsubstituted alkylaryl, a substituted or unsubstituted arylalkyl) , a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted heterocyclyl, a hydroxyl, an alkoxy, a phosphonium, a phosphanyl, an amido, an amino, or - (CH2) m-R” , or a pharmaceutical acceptable salt; E” is absent, or E” is a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted aralkyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a
substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted heterocyclyl; R’ represents a hydrogen, a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aralkyl (e.g. a substituted or unsubstituted alkylaryl, a substituted or unsubstituted arylalkyl) , a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted heterocyclyl, a hydroxyl, an alkoxy, a phosphonium, a phosphanyl, an amido, an amino, or - (CH2) m-R”; R” represents a hydroxyl group, a substituted or unsubstituted aryl, a substituted or unsubstituted cycloalkyl, a substituted or unsubstituted cycloalkenyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, an alkoxy, a phosphonium, a phosphanyl, an amido, or an amino; and m is zero or an integer ranging from 1 to 8. Such substituents can be any substituents described above, e.g., halogen, azide, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, carbonyl (such as a carboxyl, alkoxycarbonyl, formyl, or an acyl) , silyl, ether, ester, thiocarbonyl (such as a thioester, a thioacetate, or a thioformate) , alkoxyl, phosphonium, phosphanyl, phosphoryl, phosphate, phosphonate, phosphinate, amino (e.g. quarternized amino) , amido, amidine, imine, cyano, nitro, azido, sulfhydryl, alkylthio, sulfate, sulfonate, sulfamoyl, sulfonamido, sulfonyl, heterocyclyl, alkylaryl, haloalkyl, -CN, aryl, heteroaryl, and combinations thereof. It is understood by those of ordinary skill in the art, that the E” groups listed above are divalent (e.g., methylene, ethane-1, 2-diyl, ethene-1, 2-diyl, 1, 4-phenylene, cyclohexane-1, 2-diyl) . Where X is oxygen and R is defined as above, the moiety is also referred to as a carboxyl group. When X is oxygen and R is hydrogen, the formula represents a “carboxylic acid. ” Where X is oxygen and R’ is hydrogen,
the formula represents a “formate. ” Where X is oxygen and R or R’ is not hydrogen, the formula represents an "ester. ” In general, where the oxygen atom of the above formula is replaced by a sulfur atom, the formula represents a “thiocarbonyl” group. Where X is sulfur and R or R’ is not hydrogen, the formula represents a “thioester. ” Where X is sulfur and R is hydrogen, the formula represents a “thiocarboxylic acid. ” Where X is sulfur and R’ is hydrogen, the formula represents a “thioformate. ” Where X is a bond and R is not hydrogen, the above formula represents a “ketone. ” Where X is a bond and R is hydrogen, the above formula represents an “aldehyde. ”
The terms “thiol” are used interchangeably and are represented by -SR, where R can be a hydrogen, a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted aralkyl (e.g. a substituted or unsubstituted alkylaryl, a substituted or unsubstituted arylalkyl, etc. ) , a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted carbonyl, a phosphonium, a phosphanyl, an amido, an amino, an alkoxy, an oxo, a phosphonyl, a sulfinyl, or a silyl, described above. Such substituents can be any substituents described above, e.g., halogen, azide, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, carbonyl (such as a carboxyl, alkoxycarbonyl, formyl, or an acyl) , silyl, ether, ester, thiocarbonyl (such as a thioester, a thioacetate, or a thioformate) , alkoxyl, phosphoryl, phosphate, phosphonate, phosphinate, amino (e.g. quarternized amino) , amido, amidine, imine, cyano, nitro, azido, sulfhydryl, alkylthio, sulfate, sulfonate, sulfamoyl, sulfonamido, sulfonyl, heterocyclyl, alkylaryl, haloalkyl, CN, aryl, heteroaryl, polyaryl, polyheteroaryl, and combinations thereof.
Use of the term "about" is intended to describe values either above or below the stated value, which the term “about” modifies, to be within a range of approximately +/-10%. When the term "about" is used before a range of
numbers (i.e., about 1-5) or before a series of numbers (i.e., about 1, 2, 3, 4, etc. ) it is intended to modify both ends of the range of numbers and/or each of the numbers recited in the entire series, unless specified otherwise.
“Substituted, ” as used herein, refers to all permissible substituents of the compounds or functional groups described herein. In the broadest sense, the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, aromatic and nonaromatic substituents of organic compounds. Illustrative substituents include, but are not limited to, halogens, hydroxyl groups, or any other organic groupings containing any number of carbon atoms, preferably 1-14 carbon atoms, and optionally include one or more heteroatoms such as oxygen, sulfur, or nitrogen grouping in linear, branched, or cyclic structural formats. Representative substituents include a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted heterocyclic, a substituted or unsubstituted phenyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted aralkyl, a halogen, a hydroxyl, an alkoxy, a phenoxy, an aroxy, a silyl, a thiol, an alkylthio, a substituted alkylthio, a phenylthio, an arylthio, a cyano, an isocyano, a nitro, a substituted or unsubstituted carbonyl, a carboxyl, an amino, an amido, an oxo, a sulfinyl, a sulfonyl, a sulfonic acid, a phosphonium, a phosphanyl, a phosphoryl, a phosphonyl, an amino acid. Such a substituted or unsubstituted alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a substituted or unsubstituted heterocyclyl, a substituted or unsubstituted phenyl, a substituted or unsubstituted aryl, a substituted or unsubstituted heteroaryl, a substituted or unsubstituted polyaryl, a substituted or unsubstituted polyheteroaryl, a substituted or unsubstituted aralkyl, a halogen, a hydroxyl, an alkoxy, a phenoxy, an aroxy, a silyl, a thiol, an alkylthio, a substituted alkylthio, a phenylthio, an arylthio, a cyano, an isocyano, a nitro, a substituted or
unsubstituted carbonyl, a carboxyl, an amino, an amido, an oxo, a sulfinyl, a sulfonyl, a sulfonic acid, a phosphonium, a phosphanyl, a phosphoryl, a phosphonyl, and an amino acid can be further substituted.
Heteroatoms such as nitrogen may have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valences of the heteroatoms. It is understood that “substitution” or “substituted” includes the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, i.e., a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc.
The full names of certain abbreviations as used herein is provided in the following table:
II. COMPOSITIONS
The disclosed compositions include a building block structure containing N,O/S-benzylidene acetal dipeptide (NBD) . The disclosed NBDs (also referred to herein as “compounds” ) are simple and effective building blocks for syntheses of difficult peptides/proteins. Without being bound to any theories, it is believed the NBDs disclosed herein can act as a conformation-twisted dipeptide building block, which forms a pseudoproline turn to break down the interaction between peptides, and thereby can disrupt aggregation of difficult peptides/proteins on the resin (typically due to hydrophobic interaction of amino-acid side chains and hydrogen boding of amide bond) during SPPS process.
In some forms, the disclosed compounds can have the structures of Formula I:
wherein: (i) R1 can be a protected or unprotected side chain of an amino acid; (ii) R2 can be an amino protecting group; (iii) R3-R6 can be independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol; (iv) R7 and R8 can be indepdently hydrogen, methyl,
or other protected/unprotected side chains of amino acids; (v) R9 can be hydrogen or a functional group suitable for protecting and/or activating a carboxylic acid group, such as benzyl, allyl, unsubstituted C1-C4 alkyl, and an activating group, e.g., 1-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) ; N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc.; and (vi) X can be O or S.
When R1 is a protected side chain of an amino acid, the protection group can be a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, or a heterocyclic ring. In some forms, R1 is a protected side chain of an amino acid and the protection group can be an unsubstitited alkyl (e.g., unsubstitued C1-C6 or C1-C4 alkyl) , an unsubstituted alkenyl (e.g., unsubstitued C1-C6 or C1-C4 alkenyl) , an unsubstituted alkynyl (e.g., unsubstitued C1-C6 or C1-C4 alkynyl) , a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, an unsubstituted aryl, an unsubstituted heteroaryl, or an unsubstituted heterocyclic ring.
Thein the formulae described herein indicates a carbon stereocenter (to which the wave bond attaches) that can be either S-or R-configuration. For example, the acetyl carbon attaching to -C (O) OR9 can be either S-or R-configuration. It is also understood that R1 can be a side chain of any suitable amino acid, natural or synthetic, and the amin acid can be either in its L-or D-configuration, or in various mixtures of their isomers.
R1 may be a protected or unprotected side chain of any suitable alpha-amino acid, natural or synthetic. When R1 is a side chain of an amino acid that carries a hydroxy group, the hydroxy group is optionally protected by a suitable hydroxy protecting group known in the art. For side chains that carry additional amino groups, the amino group is optionally protected by a suitable amino
protecting group known in the art. In some forms, when R1 is a protected side chain of an amino acid, the protection group can be tBu, Boc, Thz, Acm, Trt, Cbz, Alloc, azido, Bn, Bz, Ac, or Pbf, or a combination thereof.
In some forms, R1 can be a protected or unprotected side chain of glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine. Any one of these amino acids can be either in its L-or D-configuration. In these forms, when R1 is a protected side chain, the protection group can be tBu, Boc, Thz, Acm, Trt, Cbz, Alloc, azido, Bn, Bz, Ac, or Pbf, or a combination thereof.
R2 can be any substituents conventionally used to hinder the reactivity of the amino group. For example, suitable amino protecting groups are described in Green T., “Protective Groups in Organic Synthesis” , Chapter 7, John Wiley and Sons, Inc., 1991, 309-385. In some forms, R2 can be Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
R3-R6 can be independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted alkyl (such as unsubstituted C1-C8 alkyl) , unsubsituted alkenyl (unsubstituted C1-C8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C1-C8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol. In some forms, R3-R6 can be independently hydrogen, halogen, -CN, -CF3, -NO2, unsubstituted C1-C6 or C1-C4 alkyl, unsubstituted C1-C6 or C1-C4 alkenyl, unsubstituted C1-C6 or C1-C4 alkynyl, alkoxyl, and unsubstituted aryl. In some forms, at least one of R3-R6, optionally two or more of R3-R6, is/are not hydrogen. For example, one of R3-R6, two of R3-R6, three of R3-R6, or all of R3-R6, is/are not hydrogen.
In some forms, R7 and R8 can be indepedently hydrogen or methyl. In some forms, R7 and R8 can be indepdently protected/unprotected side chains of amino acids.
In some forms, R9 can be hydrogen, benzyl, allyl, or unsubstituted C1-C4 alkyl. In some forms, R9 can be hydrogen. In some forms, R9 can be any substituents conventionally used to protect the carboxylic acid group. For example, suitable amino protecting groups are described in Green T., “Protective Groups in Organic Synthesis” , Chapter 5, John Wiley and Sons, Inc., 2006, 533-646. In some forms, R9 can also be any substituents conventionally used to activate the carboxylic acid group. For example, suitable amino protecting groups are described in Albericio F. “Peptide Coupling Reagents, More than a Letter Soup” , Chem. Rev. 2011, 111, 6557-6602. In some forms, R9 can be benzyl, allyl, or unsubstituted C1-C4 alkyl. In some forms, R9 can be an activating group, such as l-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) , N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc. In these forms, is an activated ester group, such as l-hydroxy-7-azabenzotriazole (HOAt) ester, 1-hydroxybenzotriazole (HOBt) ester, ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) ester, N-hydroxysuccinimide (NHS) ester, pentafluorophenol (Pfp) ester, etc.
In some forms, the disclosed compounds can have the structures of Formula I':
wherein R1-R9 and X can be in any forms as defined above, and R10 can be hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
In some forms, R10 can be hydrogen. In some forms, R10 can be any substituents conventionally used to hinder the reactivity of the phenol group. For example, suitable amino protecting groups are described in Green T., “Protective Groups in Organic Synthesis” , Chapter 3, John Wiley and Sons, Inc., 2006, 367-430. In some forms, R10 can be a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
In some forms, R9 can be hydrogen and R10 can be hydrogen. In some forms, R9 can be benzyl, allyl, unsubstituted C1-C4 alkyl, or an activating group, such as l-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) , N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc. and R10 can be hydrogen. In some forms, R9 can be hydrogen and R10 can be a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) . In some forms, R9 can be benzyl, allyl, unsubstituted C1-C4 alkyl, or an activating group, such as l-hydroxy-7-azabenzotriazole (HOAt) , 1-hydroxybenzotriazole (HOBt) , ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) , N-hydroxysuccinimide (NHS) , pentafluorophenol (Pfp) , etc. and R10 can be a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
Representative examples of NBDs are shown below, where ● representsindicates that attachment point, and R3-R10 can be in any forms as defined above.
Additional examples of the compounds are L-Thr-L-Thr NBD, L-Asp-L-Ser NBD, L-Glu-L-Cys NBD, L-Val-L-Pen NBD, and L-Val-L-Cys NBD shown below:
The disclosed NBDs are structurally different from of pseudo-prolines disclosed inet al., Pseudo-Prolines as a Solubilizing, Structure-Disrupting Protection Technique in Peptide Synthesis (J. Am. Chem. Soc. 1996, 118, 39, 9218-9227) . The disclosed NBDs are also structurally different from pseudo-prolines supplied by CHEM IMPEX (Woodale IL, for example, catalog Number 46591 (www. chemimpex. com/product/productinfo/fmoc-arg-pbf-ser-psi-me-me-pro-oh/36788? cid=860) . For example, compared to the oxazolidine in pseudo-proline dipeptides, the phenyl group of the disclosed NBDs provides more substitution sites for further functionalization, where in pseudo-proline, the symmetric dimethyl group was adopted without any other site for modification.
III. METHODS
Compared to exiting protein/peptide synthesis methods, NBDs can cover broader range of peptide sequence and they can be easily prepared from mg-
scale to sub-kg scale, whose estimated cost is only around 1~10%price of commercial competitors.
A. Production of NBD
Methods of producing the compounds disclosed herein are described. Generally, the method includes two steps of reactions. Briefly, a L-or D-Amino acid with suitable protection on amine and optionally its sidechain functional groups, can be firstly coupled with salicylaldehyde or a mono-/multiply-substituted form thereof. Then, the obtained salicylaldehyde ester reacts with serine/threonine/cysteine/penicillamine (Ser/Thr/Cys/Pen) or its ester in a suitable solvent. The obtained ester may be further derivatized to relevant acids with or without phenol protections. Using the methods disclosed herein, the NBDs can be obtained in high yield (at least 40%) .
Using the methods disclosed herein, 76 exemplary NBDs (structures shown in the section above) were produced in moderate to excellent yield (i.e., 40%to 95%) . Variants of those 76 NBDs that include D-amino acid version, different protecting groups on amine or side chain, other carboxylic derivatives and different substitutions of aryl could be also synthesized by following the developed synthetic strategy.
The disclosed method includes: (a) reacting a protected amino acid of Formula II:
wherein R1 can be a protected or unprotected side chain of an amino acid; and (ii) R2 can be an amino protecting group,
with a substituted or unsubstituted salicylaldehyde of Formula III:
wherein R3-R6 can be independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted alkyl (such as unsubstituted C1-C8 alkyl) , unsubsituted alkenyl (unsubstituted C1-C8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C1-C8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol,
to obtain an intermediate comprising a salicylaldehyde ester of Formula IV:
(b) reacting the salicylaldehyde ester of Formula IV with serine, threonine, cysteine, or penicillamine, or an ester thereof of Formula V:
wherein R7 and R8 can be indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids; (v) R9 can be hydrogen,
benzyl, allyl, unsubstituted C1-C4 alkyl, or a suitable activating group, such as N-hydroxysuccinimide (NHS) ; and (vi) X can be O or S,
to obtain a product comprising the compound of Formula I:
wherein R1-R9 and X are as defined above for the reactants.
Activating groups suitable for R9 of Formula V and Formula I are typically less active than HOAt and Oxyma, such as NHS, to reduce/prevent side reactions in step (b) .
The method disclosed herein may further include step (c) : deprotecting a protected carboxyl group of the NBDs; or step (d) acylating or activating a free acid group or a free phenol group of the NBDs; or a combination thereof.
In some forms, in the deprotection step (c) , a deprotected NBD with free acid form is obtained, for example, a deprotected NBD of Formula VI:
wherein R1-R8, R10, and X can be in any forms as defined above.
In some forms, in the acylation/activation step (d) , at least one of R9 and R10 is hydrogen. For example, R9 of the NBDs is hydrogen, such that this free
carboxylic acid group can be activated in step (d) . For example, R10 of the NBDs is hydrogen, such that this free hydroxyl group can be acylated in step (d) . In some forms, both R9 and R10 of the NBDs is hydrogen, such that the free carbocylic acid and/or the hydroxyl group can be acylated and/or activated, as needed, in step (d) . In some forms, in step (d) , an acylated NBD with or without a free carboxylic acid is obtained, for example, an acylated NBD of Formula I’:
wherein R1-R9 and X can be in any forms as defined above, and R10 can be in any form as defined above expect for hydrogen.
The deprotection step (c) and acylation/activation step (d) can be performed in any order, and in any combination with steps (a) and (b) as described above. For example, the method includes steps (a) , (b) , and (c) , performed sequentially. For example, the method includes steps (a) , (b) , and (d) , performed sequentially. For example, the method includes steps (a) , (b) , (c) , and (d) , performed sequentially. For example, the method includes steps (a) , (b) , (d) , and (c) , performed sequentially.
As noted above, thein the formulae described herein indicates a carbon stereocenter (to which the wave bond attaches) that can be either S-or R-configuration. For example, the acetyl carbon attaching to -C (O) OR9 can be either S-or R-configuration. It is thus understood that the threonine, cysteine, or penicillamine, or an ester thereof used in step (b) can be either in its L-or D-configuration, as racemates, or in various mixtures of their isomers. It is also
understood that the protected amino acid used in step (a) can be either in its L-or D-configuration, as racemates, or in various mixtures of their isomers.
1. Step (a) Producing Salicylaldehyde Ester
In the first step (a) of the disclosed methods, a protected amino acid of Formula II is reacted with a subsituted or unsubstituted salicylaldehyde of Formula III to obtain an intermediate comprising a salicylaldehyde ester of Formula IV:
wherein R1-R6 can be in any forms as defined above. When R1 is a protected side chain of an amino acid, the protection group can be a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, or a heterocyclic ring. In some forms, R1 is a protected side chain of an amino acid and the protection group can be an unsubstitited alkyl, an unsubstituted alkenyl, an unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, an unsubstituted aryl, an unsubstituted heteroaryl, or an unsubstituted heterocyclic ring.
The protected amino acid of Formula II used in step (a) can be any natural amino acids or synthetic amino acids, which can be either in their L-or D-configuration. Preferably, the protected amino acid used in step (a) is glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine, in its L-or D-configuration.
The side chain of the amino acid used in step (a) may be protected using a protecting group known in the art. Suitable protecting groups for the side chain include, but are not limited to, tBu, Boc, Thz, Acm, Trt, Cbz, Alloc, Azido, Bn, Bz, Ac, and Pbf.
The terminal amino group of the amino acid used in step (a) are protected (i.e., R2) . Any amino protecting group known in the art can be used to protect the amino group of the amino acid used in the disclosed methods. For example, R2 of the protected amino acids used in step (a) can be Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
The salicylaldehyde of Formula III that reacts with the protected amino acids of Formula II can be an unsubstituted salicylaldehyde or a mono-or poly-substituted salicylaldehyde. In some forms of the methods, the salicylaldehyde reacting with the protected amino acid of Formula II is substituted with one or more substituents (i.e., at least one of R3-R6, optionally two or more of R3-R6, is/are not hydrogen) . For example, one of R3-R6, two of R3-R6, three of R3-R6, or all of R3-R6, of the salicylaldehyde of Formula III used in step (a) is/are not hydrogen. Examples of suitable substituents of the substituted salicylaldehyde of Formula III are hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, and thiol. Preferably, when the salicylaldehyde reacting with the protected amino acid of Formula II is substituted with one or more substituents, the substituents are independently halogen, -CN, -CF3, -NO2, unsubstituted C1-C6 or C1-C4 alkyl, unsubstituted C1-C6 or C1-C4 alkenyl, unsubstitued C1-C6 or C1-C4 alkynyl, alkoxyl, and unsubstituted aryl.
Typically, the reaction between the protected amino acid of Formula II and the subsituted or unsubstituted salicylaldehyde of Formula III is performed
in a suitable organic solvent, and optionally in the presence of a condensation reagent and/or a base.
Examples of suitable organic solvents for performing the reaction in step (a) include, but are not limited to, DMF, DMSO, DMAc, NMP. Preferably, the organic solvent for performing the reaction in step (a) is DMF.
Examples of suitable condensation agent for performing the reaction in step (a) include, but are not limited to, HATU, PyBOP, HBTU, HCTU, COMU, TSTU, PyClock, PyOxim, EDCI, and DCC. Preferably, the condensation agent used in the reaction in step (a) is HATU.
Examples of suitable base for performing the reaction in step (a) include, but are not limited to, DIPEA and Triethylamine. Preferably, the base used in the reaction in step (a) is DIPEA.
The reaction between the protected amino acid of Formula II and the subsituted or unsubstituted salicylaldehyde of Formula III can be performed in an suitable organic solvent, such as any one of thoes described above, and optionally in the presence of a condensation reagent and/or a base, at room temperature (i.e., 20 ℃ to 25 ℃, at 1 atm) for a time period ranging from 30 mins to 5 hours, preferably from 1 hour to 3 hours, such as about 1 hour or about 1.5 hour.
Optionally, the method disclosed herein further includes a step of purifying the intermediate containing the salicylaldehyde ester of Formula IV after step (a) and prior to step (b) . The purification step can be performed using any suitable techniques known in the art, such as using extraction, washing, drying, column chromatography (such as by using a silica gel column) , or filtration, or a combination thereof. For example, the intermdeiate containing the salicylaldehyde ester of Formula IV is purified using the following techniques: extraction with an organic solvent that is different from the organic solvent used for performing the reaction, washing with an acid for one or more times, drying
with a drying agent (such as sodium sulfate) and/or evaporation, and then separation using silica gel column.
Generally, the salicylaldehyde ester of Formula IV obtained in step (a) has a yield of at least 40%, at least 50%, or in a range from about 50%to about 95%. The yield of salicylaldehyde ester of Formula IV can be calculated using the formula: (The mole of the salicylaldehyde ester of Formula IV) / (The mole of the protected amino acid of Formula II) *100%. The yield may be determined either before or after purification of the intermediate.
2. Step (b) Producing NBD
In the second step of the disclosed methods, the salicylaldehyde ester of Formula IV is reacted with serine, threonine, cysteine, or penicillamine, or an ester thereof of Formula V to obtain the compounds of Formula I:
wherein R1-R9 and X can be in any forms as defined above.
The serine, threonine, cysteine, or penicillamine, or an ester of Formula V can be either in their L-or D-configuration. Accordingly, R9 can be hydrogen or any suitable functional group that can form an ester with the carboxyl group of serine, threonine, cysteine, and penicillamine, such as benzyl, allyl, and unsubstituted C1-C4 alkyl.
Typically, the reaction between the salicylaldehyde ester of Formula IV and the serine, threonine, cysteine, or penicillamine, or ester thereof of Formula V is performed in a suitable solvent. The solvent may be an organic solvent or water or a combination thereof, and typically contains a buffer at an amount
ranging from 10 vol%to 20 vol%, such as 10 vol%to 20 vol%pyridine/HOAc (1: 1, molar ratio) .
Examples of suitable solvents for preforming the reaction in step (b) include, but are not limited to, dichloromethane, trifluoroacetic acid, ethyl acetate, toluene, tetrahydrofuran, 1, 4-dixoane, acetonitrile, acetone, hexane, and water, and a combination thereof.
Examples of buffers suitable for use in the solvent for performing the reaction in step (b) include, but are not limited to, pyridine-acetic acid buffer, colidine-acetic acid buffer, picoline-acetic acid buffer, DMSO-pyridine-acetic acid buffer, DMSO-colidine-acetic acid buffer, and DMSO-picoline-acetic acid buffer. A preferred buffer suitable for use is pyridine-acetic acid buffer, where the molar ratio of pyridine to acetic acid ranges from 1: 9 to 9: 1, such as 1: 1.
The reaction between the salicylaldehyde ester of Formula IV and the serine, threonine, cysteine, or penicillamine, or ester thereof of Formula V can be performed in an suitable solvent, such as any one of thoes described above, at room temperature for a time period ranging from 1 hour to 5 hours, preferably from 2 hours to 3 hours, such as about 2 hours or about 3 hours.
Optionally, the method disclosed herein further includes a step of purifying the product containing the compound of Formula I after step (b) . The purification step can be performed using any suitable techniques known in the art, such as any of those described above in step (a) . For example, the product containing the compounds of Formula I is purified using the following techniques: extraction with a suitable solvent and separation using column chromatography or filtration, washing with cod water and cold organic solvent (such as cold ethanol and/or cold ether) for one or more times, and separation by silica gel column.
Generally, the compound of Formula I obtained in step (b) has a yield of at least 40%or in a range from about 40%to about 95%. The yield of compond of Formula I can be calculated using the formula: (The mole of the compound of
Formula I) / (The mole of the salicylaldehyde ester Formula IV) *100%. The yield may be determined either before or after purification of the product.
3. Step (c) Deprotection of Protected Carboxyl Group
Optionally, the disclosed method further includes a step (c) : deprotecting a protected carboxyl group of the NBDs formed in step (b) or (d) as described herein. For example, the disclosed method includes steps (a) , (b) , and (c) performed sequentilly. For example, the disclosed method includes steps (a) , (b) , (c) , and (d) , performed sequentially. For example, the disclosed method includes steps (a) , (b) , (d) , and (c) , performed sequentially. When a deprotection step is included in the disclosed method, R9 of the NBDs is typically not hydrogen, in particular when X of the formulae described herein is O. The protected carboxyl group of the NBDs can be deprotected using reagents and reactions known in the art, for example, by using reagents such as Pd (PPh3) 4 and/or PhSiH3 for deallylation; H2 and/or Pd/C for hydrogenation of Bn protection; LiOH for sponsification of alkyl protection, etc. In step (c) , a deprotected NBD with free acid form is obtained, for example, a deprotected NBD of Formula VI:
wherein R1-R8, R10, and X can be in any forms as defined above.
Optionally, the method disclosed herein further includes a step of purifying the product containing the compound of Formula VI after step (c) . The purification step can be performed using any suitable techniques known in the art, such as any of those described above in step (a) or (b) . For example, the product containing the compounds of Formula VI is purified using the following
techniques: extraction with a suitable solvent and separation using column chromatography or filtration, washing with cod water and cold organic solvent (such as cold ethanol and/or cold ether) for one or more times, and separation by silica gel column.
Generally, the compound of Formula VI obtained in step (c) has a yield of at least 40%or in a range from about 40%to about 95%. The yield of compond of Formula VI can be calculated using the formula: (The mole of the compound of Formula VI) / (The mole of the compound of Formula I’) *100%. The yield may be determined either before or after purification of the product.
4. Step (d) Acylation or Activation of Acid Groups
Optionally, the disclosed method further includes a step (d) : acylating or activating a free acid or a free phenol group of the NBDs formed in step (b) or (c) . When an acylation/activation step is included in the disclosed method, at least one of R9 and R10 of the reagent, such as an NBD of Formula I or VI, is hydrogen. In some forms, R9 of the NBDs is hydrogen, either in the NBD product of step (b) or in the deprotected NBD formed in the deprotection step (c) , such that this free carboxylic acid group can be activated in step (d) . In the forms where the free carboxylic acid group is activated, exmples of the formed activated group include, but are not limited to NHS, HOAt, or Oxyma. In some forms, R10 of the NBDs is hydrogen in the NBD product of step (b) and/or the deprotected NBD formed in step (c) , such that this free hydroxyl group can be acylated in step (d) . In some forms, both R9 and R10 of the NBDs is hydrogen, either in the NBD product of step (b) or in the deprotected NBD formed in the deprotection step (c) , such that the free carbocylic acid and/or the hydroxyl group can be acylated and/or activated, as needed, in step (d) . For example, both R9 and R10 of the NBDs is hydrogen, either in the NBD product of step (b) or in the deprotected NBD formed in the deprotection step (c) , such that the free carbocylic acid is activated, and/or the hydroxyl group is acylated, in step (d) .
In some forms, in step (d) , an acylated NBD, with or without a free carboxylic acid (e.g., -COOR9 is carboxyl or ester) can be obtained, for example, an acylated NBD of Formula I’:
wherein R1-R9 and X can be in any forms as defined above, and R10 can be in any form as defined above expect for hydrogen.
A free acid of the NBDs can be acylated and/or activated using reagents and reactions known in the art. For example, a free acid of the NBDs (-COOH and/or -OH) can be acylated using reagents such as acetyl anhydride, carbonate, or carbamate to obtain an acyl-capped NBD of Formula I’, where R9 and/or R10, such as R10, is an acyl group. For example, a free acid of the NBDs (e.g., -COOH) can be activated by using reagents such as HATU, HBTU, Oxyma Pure to obtain an activated NBD of Formula I’, where R9 is an activating group.
Optionally, the method disclosed herein further includes a step of purifying the product containing acylated and/or activated NBDs after step (d) . The purification step can be performed using any suitable techniques known in the art, such as any of those described above in step (a) , (b) or (c) . For example, the product containing the acylated and/or activated NBDs is purified using the following techniques: extraction with a suitable solvent and separation using column chromatography or filtration, washing with cod water and cold organic solvent (such as cold ethanol and/or cold ether) for one or more times, and separation by silica gel column.
Generally, the acylated and/or activated NBDs obtained in step (d) has a yield of at least 40%or in a range from about 40%to about 95%. The yield of acylated and/or activated NBDs can be calculated using the formula: (The mole of the acylated and/or activated NBDs) / (The mole of the compound of Formula VI) *100%. The yield may be determined either before or after purification of the product.
An exemplary synthesis scheme of NBDs showing specific reaction conditions is shown below.
wherein R1-R10 and X can be in any forms as defined above.
Although not illustrated in the exemplary synthesis scheme, it is understood that obtaining the NBDs of Formula I’ above does not require all of the steps shown in this exemplary reaction scheme. An NBD of Formula I’ having a desired structure can be obtained following steps (a) and (b) ; following steps (a) , (b) , and step (c) ; following steps (a) , (b) , and (d) ; or following steps (a) , (b) , (c) , and (d) .
B. Peptide synthesis
The peptide/protein synthesis methods disclosed herein are based at least on the application of NBD as a pre-formed dipeptide building block for synthesizing difficult peptides/proteins.
The terms “difficult sequence” and “difficult peptide/protein” are used interchangeably herein. The concept of “difficult sequence” was introduced in
the 80’s and was given distinction by Kent and co-workers for peptides that form strong inter-or intra molecular, non-covalent interactions which form insoluble peptide aggregates. “Difficult sequences” are peptide sequences that contain high number of amino acids possessing hydrophobic side chains, so-called β-branched amino acids, including leucine, valine, phenylalanine or isoleucine. Additionally, glycine is known to induce β-sheet packing in combination with afore mentioned amino acids. These sequences tend to form β-sheet or α-helical structures within the molecule and therefore they have high aggregation potential and low solubility in aqueous or organic solvents. This results in a generally difficult handling, synthesis and purification. Peptide chains of “difficult sequences” exhibiting over 50-60 amino acids remain a challenge even when applying automated peptide synthesis protocols. With respect to SPPS “difficult sequences” are defined as peptides that are poorly solvated while attached to the solid support thus preventing complete deprotection and coupling steps. (Mueller, et al., Front Bioeng Biotechnol. 2020; 8: 162) Using the disclosed NBDs in SPPS (NBD-SPPS) , the peptide length (~50 AAs) in standard SPPS can be further extended to near 100 AAs, which was previously unobtainable in the field of peptide/protein synthesis.
Examples of difficult sequences include membrane proteins and their functional parts, for example, amylin, BM2 proton channel, influenza A, Copper storage protein 1 CSP-1, Interferon-induced transmembrane protein 3 (IFITM3) , NS4A, cofactor protein of serine protease from Hepatitis C virus; PD-L1 (programmed death-ligand 1) , IL-2 (INTERLEUCKIN -2) , Vasoactive intestinal peptide, Erythropoietin, Liraglutide, etc.
The disclosed NBDs are typically used in SPPS methods known in the art. Solid phase synthesis, in which the C-terminal amino acid of the sequence is attached to an insoluble support followed by sequential addition of the remaining amino acids in the sequence is a preferred method for the chemical synthesis of the peptides. Techniques for solid phase synthesis are known to
those of skill in the art and are described, for example, by Barany and Merrifield (1963) Solid-Phase Peptide Synthesis; pp. 3-284 in The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide Synthesis, Part A.; Merrifield et al. (1963) J. Am. Chem. Soc, 85: 2149-2156, and Stewart et al. (1984) Solid Phase Peptide Synthesis, 2nd ed. Pierce Chem. Co., Rockford, 111. Such methods include bench scale solid phase synthesis and automated peptide synthesis in any one of the many commercially available peptide synthesizers. Solid phase synthesis is commonly used, and various commercial synthesizers are available, such as automated synthesizers by Applied Biosystems Inc., Foster City, CA; Beckman; MultiSyntech, Bochum, Germany etc. Functional groups for conjugating the peptide to small molecules, label moieties, peptides, or proteins may be introduced into the molecule during chemical synthesis. In addition, small molecules and label moieties/reporter units may be attached during the synthetic process. Introduction of the functional groups and conjugation to other molecules minimally affects the structure and function of the NBDs.
The disclosed methods are different and/or superior over previously described methods. For example, US 2008/0004451 describes the synthesis of dipeptides building blocks with acetonide protecting group to form a pseudoproline ring. By contrast, the disclosed methods utilizes the ligation intermediate of Serine/Threonine Ligation (STL) or Cysteine/Penicillamine Ligation (CPL) as the proline mimic i.e., NBDs, which allows the scale-up production of peptides in a very safe manner (NBDs are safe to handle) and it is environmentally friendly, because NBDs synthesis is a mild endothermic process and does not require heating or reflux conditions. Furthermore, the disclosed NBDs are difficult to obtain by the methods disclosed in US 2008/0004451. The much less reactive salicylaldehyde cannot fully react with the dipeptide, even in reflux temperature, resulting in low yield of the desired product. Also, the similar polarity of starting material and product made the
purification very challenging. Careful column chromatography and sophisticated chemists would be required, which dramatically increased the cost and the whole process was not effective enough. In contrast, using the methods disclosed herein, the NBDs could be easily prepared from milligram to kilogram scale without the need of heating or condensation apparatuses. The high reactivity of salicylaldehyde ester rendered complete conversion of starting materials and the purification process has also been simplified. Further, extra functional groups may be attached to the benzene ring of the NBDs to further facilitate the difficult peptide/protein synthesis.
The methods disclosed herein similar provide for simple and safer peptide synthesis than contemplated in previous disclosures, for example, et al., Pseudo-Prolines as a Solubilizing, Structure-Disrupting Protection Technique in Peptide Synthesis (J. Am. Chem. Soc. 1996, 118, 39, 9218-9227) ; the production of its dipeptides required harsh conditions such as anhydrous reflux with molecular sieves, which consumed more energy and had potential risk of explosion. By contrast, production of the disclosed NBD product do not need such complicated set up. NBDs produced as described herein j require mixing all ingredients together under room temperature, air atmosphere, moisture and high yield of product up to sub-kilogram can be obtained easily.
Similarly, the disclosed methods have advantages over the methods disclosed in Sohma, et al., Novel and efficient synthesis of difficult sequence-containing peptides through O-N intramolecular acyl migration reaction of O-acyl isopeptides, Chem. Commun (Camb) , 1: 124-5 (2004) ( (DOI: 10.1039/B312129A) , which discloses isoacyl dipeptides (which are chemically distinct from the disclosed NBDs) . Further, the synthesis of isopeptide may have a risk of epimerization since the esterification required highly active condensation reagents. By contrast, the disclosed methods proceed via NBD; the ligation between two amino acid fragment is chemoselective and stereoselective, without the risk of epimerization. In contrast, with acyl isopeptides, the
disclosed NBDs are much more stable under basic condition, which is important in Fmoc-SPPS for peptide synthesis. This advantage makes the disclosed NBDs much more suitable for long peptide or protein chemical synthesis.
The disclosed compounds and methods can be further understood through the following numbered paragraphs.
1. A compound having a structure of:
wherien:
(i) R1 is a protected or unprotected side chain of an amino acid;
(ii) R2 is an amine protecting group;
(iii) R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
(iv) R7 and R8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;
(v) R9 is hydrogen or a functional group suitable for protecting and/or activating a carboxylic acid group;
(vi) R10 is hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and
(vii) X is O or S.
2. The compound of paragraph 1, wherein R1 is a protected or unprotected side chain of glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine.
3. The compound of paragraph 1 or 2, wherein when R1 is a protected side chain, wherein the protection group is selected from the group consisting of a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, and a heterocyclic ring, optionally wherein the protection group is tBu, Boc, Thz, Acm, Trt, Cbz, Alloc, Azido, Bn, Bz, Ac, and Pbf, or a combination thereof.
4. The compound of any one of paragraphs 1-3, wherein R2 is Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
5. The compound of any one of paragraphs 1-4, wherein R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted alkyl (such as unsubstituted C1-C8 alkyl) , unsubsituted alkenyl (unsubstituted C1-C8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C1-C8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol, optionally wherein R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, -CF3, -NO2, unsubstituted C1-C4 alkyl, alkoxyl, and unsubstituted aryl.
6. The compound of any one of paragraphs 1-5, wherein at least one of R3-R6, optionally two or more of R3-R6, is/are not hydrogen.
7. The compound of any one of paragraphs 1-6, wherein R9 is hydrogen, benzyl, allyl, unsubstituted C1-C4 alkyl; oris an activated ester group, such as l-hydroxy-7-azabenzotriazole (HOAt) ester, 1-
hydroxybenzotriazole (HOBt) ester, ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) ester, N-hydroxysuccinimide (NHS) ester, pentafluorophenol (Pfp) ester, etc.
8. The comound of any one of paragraphs 1-7, wherein R10 is hydrogen, a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
9. A method of making the compound of Formula I:
wherien:
(i) R1 is a protected or unprotected side chain of an amino acid;
(ii) R2 is an amine protecting group;
(iii) R3-R6 are independently hydrogen, hdyroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
(iv) R7 and R8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;
(v) R9 is hydrogen, benzyl, allyl, unsubstituted C1-C4 alkyl, or a functional group suitable for protecting and/or activating a carboxylic acid group; and
(vi) X is O or S,
wherein the method comprises:
(a) reacting a protected amino acid of Formula II:
wherein R1 and R2 are as defined above for Formula I,
with a substituted or unsubstituted salicylaldehyde of Formula III:
wherein R3-R6 are as defined above for Formula I,
to obtain an intermediate comprising a salicylaldehyde ester of Formula IV:
(b) reacting the salicylaldehyde ester of Formula IV with serine, threonine, cysteine, or penicillamine, or an ester thereof to obtain a product comprising the compound of Formula I.
10. The method of paragraph 9, wherein R1 is a protected or unprotected side chain of glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine.
11. The method of paragraph 9 or 10, wherein when R1 is a protected side chain, wherein the protection group is selected from the group consisting of a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or
carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, and a heterocyclic ring, optionally wherein the protection group is tBu, Boc, Thz, Acm, Trt, Cbz, Alloc, Azido, Bn, Bz, Ac, and Pbf, or a combination thereof..
12. The method of any one of paragraphs 9-11, wherein R2 is Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
13. The method of any one of paragraphs 9-12, wherein R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted alkyl (such as unsubstituted C1-C8 alkyl) , unsubsituted alkenyl (unsubstituted C1-C8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C1-C8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol, optionally wherein R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, -CF3, -NO2, unsubstituted C1-C4 alkyl, alkoxyl, and unsubstituted aryl.
14. The method of any one of paragraphs 9-13, wherein at least one of R3-R6, optionally two or more of R3-R6, is/are not hydrogen.
15. The method of any one of paragraphs 9-14, wherein step (a) is performed in the presence of a condensation agent, optionally wherein the condensation agent is HATU, PyBOP, HBTU, HCTU, COMU, TSTU, PyClock, PyOxim, EDCI, or DCC, or a combination thereof.
16. The method of any one of paragraphs 9-15, wherein step (a) is performed in the presence of a base, optionally wherein the base is DIPEA or Triethylamine, or a combination thereof.
17. The method of any one of paragraphs 9-16, wherein step (a) is performed in an organic solvent, optionally wherein the organic solvent is DMF, DMSO, DMAc, or NMP, or a combination thereof.
18. The method of any one of paragraphs 9-17, wherein step (a) is performed at room temperature for a time period ranging from 30 mins to 5 hours, preferably from 1 hour to 3 hours.
19. The method of any one of paragraphs 9-18, wherein the salicylaldehyde ester of Formula IV obtained in step (a) has a yield of at least 40%, at least 50%, or in a range from about 50%to about 95%.
20. The method of any one of paragraphs 9-19, wherein step (b) is performed in a solvent comprising a buffer, optionally wherein the solvent is dichloromethane, trifluoroacetic acid, tetrahydrofuran, toluene, ethyl acetate, 1, 4-dioxane, acetonitrile, acetone, or water, or a combination thereof.
21. The method of paragraph 20, wherein the buffer is pyridine-acetic acid buffer, optionally wherein the molar ratio of pyridine to acetic acid ranges from 1: 9 to 9: 1, such as 1: 1.
22. The method of any one of paragraphs 9-21, wherein step (b) is performed at room temperature for a time period ranging from 1 hour to 5 hours, preferably from 2 hours to 3 hours.
23. The method of any one of paragraphs 9-22, wherein the compound of Formula I obtained in step (b) has a yield of at least 40%, such as in a range from about 40%to about 95%.
24. The method of any one of paragraphs 9-23, further comprising purifying the intermediate to obtain the salicylaldehyde ester of Formula IV after step (a) and prior to step (b) .
25. The method of any one of paragraphs 9-24, further comprising purifying the product to obtain the compound of Formula I after step (b) .
26. The method of any one of paragraphs 9-25, further comprising:
(c) reacting a carboxyl protected NBD with a deprotection reagent to form a deprotected NBD of Formula VI:
(i) R1 is a protected or unprotected side chain of an amino acid;
(ii) R2 is an amine protecting group;
(iii) R3-R6 are independently hydrogen, hdyroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
(iv) R7 and R8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;
(v) R10 is hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and
(vi) X is O or S.
27. The method of any one of paragraphs 9-26, further comprising:
(d) reacting a free-acid or free-phenol containing NBD with an acylation and/or activation reagent to form an acylated/activated NBD of Formula I’:
wherien:
(i) R1 is a protected or unprotected side chain of an amino acid;
(ii) R2 is an amine protecting group;
(iii) R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;
(iv) R7 and R8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;
(v) R9 is hydrogen or a functional group suitable for protecting and/or activating a carboxylic acid group;
(vi) R10 is a functional group suitable for hindering the reactivity of phenyl group, such as a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and
(vii) X is O or S.
28. The method of paragraph 26 or 27, wherein step (c) is performed immediately following step (b) or immediately following step (d) , wherein the carboxyl protected NBD is the NBD of Formula I formed in step (b) or the NBD of Formula I’ formed in step (d) , and wherin R9 of Formula I or Formula I’ is not hydrogen.
29. The method of paragraph 27 or 28, wherein step (d) is performed immediately following step (b) or immediately following step (c) , and wherein the free-acid containing NBD is the NBD of Formula I formed in step (b) or the NBD of Formula VI formed in step (c) .
30. A method of using the compound of any one of paragraphs 1-8 for solid phase peptide synthesis.
31. The method of paragraph 30, comprising: mixing the compound with amino acids for forming the peptide.
The detailed synthetic procedures of exemplary NBDs are described below in the following non-limiting examples.
EXAMPLES
EXAMPLE 1. L-Thr-L-Thr NBD
The first example synthesized L-Thr-L-Thr NBD, the structure of which is shown below.
Step a:
Firstly, L-Fmoc-Thr (tBu) -OH (3.97 g, 10.0 mmol) and HATU (3.80 g, 10.0 mmol) were dissolved in 50.0 mL DMF. Subsequently, DIPEA (2.58 g, 20.0 mmol) was added and the reaction mixture was stirred under room temperature for 1 min. Then, salicylaldehyde (1.22 g, 10.0 mmol) was added and the reaction was stirred for 1.5 h. After that, the mixture was extracted with 250 mL ethyl acetate and washed with 100 mL 1M HCl (aq) and 100 mL NaHCO3 (aq) for 3 times. The obtained organic phase was dried by sodium sulfate and evaporated under reduced pressure to give the crude product as a pink oil. After purification by silica gel column, Fmoc-Thr (tBu) -SAL ester could be obtained in 95%yield, 4.77 g.
Step b:
The Fmoc-Thr (tBu) -SAL ester was dissolved into 250 mL dichloromethane, and 10 mL pyridine/HOAc buffer containing 10.0 mmol L-H-Thr-OAll ester was added. The reaction mixture was stirred at room temperature for 2 h and extracted similarly as mentioned above. Later the crude product was purified by column chromatography, affording the desired NBD as pale solid in 93%yield.
Alternatively, the above procedure could be simplified in only one purification process:
After extraction and evaporation at step a., the crude L-Fmoc-Thr (tBu) -SAL ester was directly dissolved into 250 mL dichloromethane without any extra purification. Then, 10 mL pyridine/HOAc buffer containing 10.0 mmol L-H-Thr-OAll ester was added and the mixture reacted for 2 h to afford the desired L-Thr-L-Thr NBD in 85%yield over two steps. Furthermore, this simplified protocol has been robustly scale-up to sub-kg scale in the production of NBDs.
EXAMPLE 2. L-Val-L-Cys NBD
The second example synthesized L-Val-L-Cys NBD, the structure of which is shown below.
Step a & Step b:
First, L-Fmoc-Val-OH (3.39 g, 10.0 mmol) and HATU (3.80 g, 10.0 mmol) were dissolved in 50.0 mL DMF. Subsequently, DIPEA (2.58 g, 20.0 mmol) was added and the reaction mixture was stirred under room temperature for 1 min. Then, 4-methoxy salicylaldehyde (1.52 g, 10.0 mmol) was added and the reaction was stirred for 1.0 h. After that, the mixture was directly diluted by 100 mL 6M pyridine-acetic acid aqueous buffer, and 10 mL 10%TFA aqueous
solution containing 12.0 mmol L-H-Cys-OH was added dropwise. After the reaction mixture was stirred at room temperature for 3 h, it was poured into 1-L ice-cold water. The white precipitation was filtered by suction and washed by cold water, cold ethanol and cold ether. Later the product was further purified by column chromatography, affording the desired L-Val-L-Cys NBD as white solid in 83%yield. This protocol can also be scale-up to sub-kg scale in the production of Cys/Pen-based NBDs.
EXAMPLE 3. Ac-capped L-Val-L-Thr NBD
The first example synthesized Ac-capped L-Thr-L-Thr NBD, the structure of which is shown below.
Step a:
Firstly, L-Fmoc-Val-OH (3.39 g, 10.0 mmol) and HATU (3.80 g, 10.0 mmol) were dissolved in 50.0 mL DMF. Subsequently, DIPEA (2.58 g, 20.0 mmol) was added and the reaction mixture was stirred under room temperature for 1 min. Then, salicylaldehyde (1.22 g, 10.0 mmol) was added and the reaction was stirred for 1.5 h. After that, the mixture was extracted with 250 mL ethyl acetate and washed with 100 mL 1M HCl (aq) , 100 mL NaHCO3 (aq) for 3 times. The obtained organic phase was dried by sodium sulfate and evaporated under reduced pressure to give the crude product as a pink oil. After purification by silica gel column, Fmoc-Val-SAL ester could be obtained in 95%yield, 4.51 g.
Step b:
The Fmoc-Val-SAL ester was dissolved into 250 mL dichloromethane, and 10 mL pyridine/HOAc buffer containing 10.0 mmol L-H-Thr-Oall ester was
added. The reaction mixture was stirred at room temperature for 2 h and extracted similarly as mentioned above. Later the crude product was purified by column chromatography, affording the unprotected NBD as pale solid in 91%yield.
Step c:
The unprotected NBD was dissolved into 250 mL dichloromethane, and Pd (PPh3) 4 (5 mol%) , PhSiH3 (10 eq. ) were added. The reaction mixture was stirred at room temperature for 1 h and extracted similarly as mentioned above. Then the crude product was purified by column chromatography, affording the NBD acid form as pale solid in 95%yield.
Step d:
The NBD acid was then dissolved into 250 mL dichloromethane, and Ac2O (2.0 eq. ) and DIPEA (4.0 eq. ) were added. The reaction mixture was stirred at room temperature for 3 h and extracted similarly as mentioned above. Then the crude product was purified by column chromatography, affording the desired Ac-capped NBD acid as pale solid in 90%yield.
EXAMPLE 4. HOAt-Activated L-Thr-L-Thr NBD
The first example synthesized HOAt-activated L-Thr-L-Thr NBD, the structure of which is shown below.
Step a:
Firstly, L-Fmoc-Val-OH (3.39 g, 10.0 mmol) and HATU (3.80 g, 10.0 mmol) were dissolved in 50.0 mL DMF. Subsequently, DIPEA (2.58 g, 20.0 mmol) was added and the reaction mixture was stirred under room temperature for 1 min. Then, salicylaldehyde (1.22 g, 10.0 mmol) was added and the reaction
was stirred for 1.5 h. After that, the mixture was extracted with 250 mL ethyl acetate and washed with 100 mL 1M HCl (aq) , 100 mL NaHCO3 (aq) for 3 times. The obtained organic phase was dried by sodium sulfate and evaporated under reduced pressure to give the crude product as a pink oil. After purification by silica gel column, Fmoc-Val-SAL ester could be obtained in 95%yield, 4.51 g.
Step b:
The Fmoc-Val-SAL ester was dissolved into 250 mL dichloromethane, and 10 mL pyridine/HOAc buffer containing 10.0 mmol L-H-Thr-OAll ester was added. The reaction mixture was stirred at room temperature for 2 h and extracted similarly as mentioned above. Later the crude product was purified by column chromatography, affording the unprotected NBD as pale solid in 91%yield.
Step c:
The unprotected NBD was dissolved into 250 mL dichloromethane, and Pd(PPh3) 4 (5 mol%) , PhSiH3 (10 eq. ) were added. The reaction mixture was stirred at room temperature for 1 h and extracted similarly as mentioned above. Then the crude product was purified by column chromatography, affording the NBD acid form as pale solid in 93%yield.
Step d:
The NBD acid was then dissolved into 250 mL dichloromethane, and HATU (1.0 eq. ) and DIPEA (1.0 eq. ) were added. The reaction mixture was stirred at room temperature for 0.5 h and extracted similarly as mentioned above. Then the crude product was purified by column chromatography, affording the desired HOAt-activated NBD acid as pale solid in 78%yield.
EXAMPLE 5. Carbamate-capped L-Val-L-Cys NBD
The second example synthesized carbamate-protected L-Val-L-Cys NBD, the structure of which is shown below.
Step a & Step b:
First, L-Fmoc-Val-OH (3.39 g, 10.0 mmol) and HATU (3.80 g, 10.0 mmol) were dissolved in 50.0 mL DMF. Subsequently, DIPEA (2.58 g, 20.0 mmol) was added and the reaction mixture was stirred under room temperature for 1 min. Then, 4-methoxy salicylaldehyde (1.52 g, 10.0 mmol) was added and the reaction was stirred for 1.0 h. After that, the mixture was directly diluted by 100 mL 6M pyridine-acetic acid aqueous buffer, and 10 mL 10%TFA aqueous solution containing 12.0 mmol L-H-Cys-OH was added dropwise. After the reaction mixture was stirred at room temperature for 3 h, it was poured into 1-L ice-cold water. The white precipitation was filtered by suction and washed by cold water, cold ethanol and cold ether. Later the product was further purified by column chromatography, affording the desired L-Val-L-Cys NBD as white solid in 83%yield.
Step d:
In this case, the NBD is already in acid form so that the step c could be skipped. The unprotected NBD was dissolved into 250 mL dichloromethane, and tert-butyl methyl (2- (methylamino) ethyl) carbamate (2.0 eq. ) activated by 1, 1'-carbonyldiimidazole (2.0 eq. ) and DIPEA (4.0 eq. ) were added. The reaction mixture was stirred at room temperature for 1 h and extracted similarly as mentioned above. Then the crude product was purified by column chromatography, affording the desired carbamate-capped NBD as pale solid in 82%yield.
EXAMPLE 6. Peptide Synthesis
With these NBDs in hand, we further evaluated their effect in the construction of difficult peptides by SPPS. For this purpose, several difficult or even inaccessible peptide sequences reported from literature were selected.
The first case was the synthesis of PD-L1 (121-132) , a C-terminus of PD-L1 that was highly prone to form β-sheet structure. The amino acids were installed by standard coupling procedure by using HATU as coupling reagent, sequentially, the resin was subjected to washing steps. Then the resin was treated with 20%piperidine/DMF to remove the Fmoc protecting group and allowed for the next coupling cycle. During the synthesis, Ile-Thr NBD was installed by a one-pot activation and coupling protocol. As shown in the Figure 1, the installation of Ile-Thr NBD improved the quality of final products significantly. Furthermore, without dipeptides, serious amino acid deletion was observed, indicating the powerfulness of dipeptide in solving difficult SPPS issue. In the synthesis of IL-2 (125-133) , a Ser-Cys NBD was incorporated (Figure 2) . The amino acids were installed by standard coupling procedure by using HATU as coupling reagent, sequentially, the resin was subjected to washing steps. Then the resin was treated with 20%piperidine/DMF to remove the Fmoc protecting group and allowed for the next coupling cycle. During the synthesis, Ser-Cys NBD was installed by a one-pot activation and coupling protocol. Although SPPS with/without NBD both showed similar retention peak from the analytical HPLC data of the peptide, the one without using NBD showed messy mass spectrum. By contrast, with NBD insertion the mass quality of peptide was excellent. After that, synthesis of another difficult sequence, Amylin with 37 amino acids, was performed by introducing 3 NBDs during the Fmoc-SPPS (Figure 3) . The amino acids were installed by standard coupling procedure by using HATU as coupling reagent, sequentially, the resin was subjected to washing steps. Then the resin was treated with 20%piperidine/DMF to remove the Fmoc protecting group and enabled the next coupling cycle. During the synthesis, three corresponding NBDs were installed
by a one-pot activation and coupling protocol. With the installation of NBDs, the desired product could be achieved in high quality. However, no any desired product was observed in the synthesis without using NBDs. Finally, a natural HIV-suppressive factor containing 68 amino acids, RANTES which was also known as chemokine ligand 5, was chosen as an example of challenging targets with longer sequence. The amino acids were installed by standard coupling procedure by using HATU as coupling reagent, sequentially, the resin was subjected to washing steps. Then the resin was treated with 20%piperidine/DMF to remove the Fmoc protecting group and enabled the next coupling cycle. During the synthesis, three corresponding NBDs were installed by a one-pot activation and coupling protocol. As shown in Figure 4, RANTES was successfully constructed when four NBDs were introduced and up to 21%isolated yield could be achieved after HPLC purification. At the same time, the attempt to synthesized RANTES by conventional SPPS failed and only messy result was obtained, emphasizing the key role of our NBDs on preventing peptide aggregation during SPPS. Furthermore, only polystyrene-based resin was applied in all examples above, without the need to pursue modified and expensive polymer supports for difficult peptides/proteins.
Claims (31)
- A compound having a structure of:
wherien:(i) R1 is a protected or unprotected side chain of an amino acid;(ii) R2 is an amine protecting group;(iii) R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;(iv) R7 and R8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;(v) R9 is hydrogen or a functional group suitable for protecting and/or activating a carboxylic acid group;(vi) R10 is hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and(vii) X is O or S. - The compound of claim 1, wherein R1 is a protected or unprotected side chain of glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine.
- The compound of claim 1 or 2, wherein when R1 is a protected side chain, wherein the protection group is selected from the group consisting of a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, and a heterocyclic ring, optionally wherein the protection group is tBu, Boc, Thz, Acm, Trt, Cbz, Alloc, Azido, Bn, Bz, Ac, and Pbf, or a combination thereof.
- The compound of any one of claims 1-3, wherein R2 is Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
- The compound of any one of claims 1-4, wherein R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted alkyl (such as unsubstituted C1-C8 alkyl) , unsubsituted alkenyl (unsubstituted C1-C8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C1-C8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol, optionally wherein R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, -CF3, -NO2, unsubstituted C1-C4 alkyl, alkoxyl, and unsubstituted aryl.
- The compound of any one of claims 1-5, wherein at least one of R3-R6, optionally two or more of R3-R6, is/are not hydrogen.
- The compound of any one of claims 1-6, wherein R9 is hydrogen, benzyl, allyl, unsubstituted C1-C4 alkyl; oris an activated ester group, such as l-hydroxy-7-azabenzotriazole (HOAt) ester, 1-hydroxybenzotriazole (HOBt) ester, ethyl 2-cyano-2- (hydroxyimino) acetate (Oxyma) ester, N-hydroxysuccinimide (NHS) ester, pentafluorophenol (Pfp) ester, etc.
- The comound of any one of claims 1-7, wherein R10 is hydrogen, a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) .
- A method of making the compound of Formula I:
wherien:(i) R1 is a protected or unprotected side chain of an amino acid;(ii) R2 is an amine protecting group;(iii) R3-R6 are independently hydrogen, hdyroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;(iv) R7 and R8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;(v) R9 is hydrogen, benzyl, allyl, unsubstituted C1-C4 alkyl, or a functional group suitable for protecting and/or activating a carboxylic acid group; and(vi) X is O or S,wherein the method comprises:(a) reacting a protected amino acid of Formula II:
wherein R1 and R2 are as defined above for Formula I,with a substituted or unsubstituted salicylaldehyde of Formula III:
wherein R3-R6 are as defined above for Formula I,to obtain an intermediate comprising a salicylaldehyde ester of Formula IV:
(b) reacting the salicylaldehyde ester of Formula IV with serine, threonine, cysteine, or penicillamine, or an ester thereof to obtain a product comprising the compound of Formula I. - The method of claim 9, wherein R1 is a protected or unprotected side chain of glycine, alanine, valine, leucine, isoleucine, methionine, phenylalanine, tyrosine, tryptophan, cysteine, lysine, arginine, histidine, aspartic acid, glutamic acid, asparagine, glutamine, serine, or threonine.
- The method of claim 9 or 10, wherein when R1 is a protected side chain, wherein the protection group is selected from the group consisting of a substituted or unsubstitited alkyl, a substituted or unsubstituted alkenyl, a substituted or unsubstituted alkynyl, a carbonyl (e.g., carboxylic acid or carboxylic ester) , an amide, an amino, a substitute or unsubstituted aryl, a substituted or unsubstituted heteroaryl, and a heterocyclic ring, optionally wherein the protection group is tBu, Boc, Thz, Acm, Trt, Cbz, Alloc, Azido, Bn, Bz, Ac, and Pbf, or a combination thereof.
- The method of any one of claims 9-11, wherein R2 is Fmoc, Cbz, Moz, Boc, Troc, Teoc, Alloc, or Voc.
- The method of any one of claims 9-12, wherein R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted alkyl (such as unsubstituted C1-C8 alkyl) , unsubsituted alkenyl (unsubstituted C1-C8 alkenyl) , unsubstituted alkynyl (such as unsubstituted C1-C8 alkynyl) , unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol, optionally wherein R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, -CF3, -NO2, unsubstituted C1-C4 alkyl, alkoxyl, and unsubstituted aryl.
- The method of any one of claims 9-13, wherein at least one of R3-R6, optionally two or more of R3-R6, is/are not hydrogen.
- The method of any one of claims 9-14, wherein step (a) is performed in the presence of a condensation agent, optionally wherein the condensation agent is HATU, PyBOP, HBTU, HCTU, COMU, TSTU, PyClock, PyOxim, EDCI, or DCC, or a combination thereof.
- The method of any one of claims 9-15, wherein step (a) is performed in the presence of a base, optionally wherein the base is DIPEA or Triethylamine, or a combination thereof.
- The method of any one of claims 9-16, wherein step (a) is performed in an organic solvent, optionally wherein the organic solvent is DMF, DMSO, DMAc, or NMP, or a combination thereof.
- The method of any one of claims 9-17, wherein step (a) is performed at room temperature for a time period ranging from 30 mins to 5 hours, preferably from 1 hour to 3 hours.
- The method of any one of claims 9-18, wherein the salicylaldehyde ester of Formula IV obtained in step (a) has a yield of at least 40%, at least 50%, or in a range from about 50%to about 95%.
- The method of any one of claims 9-19, wherein step (b) is performed in a solvent comprising a buffer, optionally wherein the solvent is dichloromethane, trifluoroacetic acid, tetrahydrofuran, toluene, ethyl acetate, 1, 4-dioxane, acetonitrile, acetone, or water, or a combination thereof.
- The method of claim 20, wherein the buffer is pyridine-acetic acid buffer, optionally wherein the molar ratio of pyridine to acetic acid ranges from 1: 9 to 9: 1, such as 1: 1.
- The method of any one of claims 9-21, wherein step (b) is performed at room temperature for a time period ranging from 1 hour to 5 hours, preferably from 2 hours to 3 hours.
- The method of any one of claims 9-22, wherein the compound of Formula I obtained in step (b) has a yield of at least 40%, such as in a range from about 40%to about 95%.
- The method of any one of claims 9-23, further comprising purifying the intermediate to obtain the salicylaldehyde ester of Formula IV after step (a) and prior to step (b) .
- The method of any one of claims 9-24, further comprising purifying the product to obtain the compound of Formula I after step (b) .
- The method of any one of claims 9-25, further comprising:(c) reacting a carboxyl protected NBD with a deprotection reagent to form a deprotected NBD of Formula VI:
(i) R1 is a protected or unprotected side chain of an amino acid;(ii) R2 is an amine protecting group;(iii) R3-R6 are independently hydrogen, hdyroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;(iv) R7 and R8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;(v) R10 is hydrogen or a functional group suitable for hindering the reactivity of phenyl group, such as a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2- (methylamino) ethyl) carbamate) ; and(vi) X is O or S. - The method of any one of claims 9-26, further comprising:(d) reacting a free-acid or free phenol containing NBD with an acylation and/or activation reagent to form an acylated/activated NBD of Formula I’:
wherien:(i) R1 is a protected or unprotected side chain of an amino acid;(ii) R2 is an amine protecting group;(iii) R3-R6 are independently hydrogen, hydroxyl, halogen, -CN, haloalkyl (such as -CF3) , -NO2, unsubstituted C1-C6 alkyl, unsubsituted C1-C6 alkenyl, unsubstituted aryl, unsubstituted heteroaryl, unsubstituted heteroalkyl, an unsubstituted cyclic group, an unsubstituted heterocyclic, an unsubstituted aralkyl, alkoxyl, amino, amido, carbonyl, or thiol;(iv) R7 and R8 are indepdently hydrogen, methyl, or other protected/unprotected side chains of amino acids;(v) R9 is hydrogen or a functional group suitable for protecting and/or activating a carboxylic acid group;(vi) R10 is a functional group suitable for hindering the reactivity of phenyl group, such as a C1-C4 acyl group (e.g., formyl, acetyl, propionyl) , a carbonyl group (e.g., alloc) , or a carbamate group (e.g., tert-butyl methyl (2-(methylamino) ethyl) carbamate) ; and(vii) X is O or S. - The method of claim 26 or 27, wherein step (c) is performed immediately following step (b) or immediately following step (d) , wherein the carboxyl protected NBD is the NBD of Formula I formed in step (b) or the NBD of Formula I’ formed in step (d) , and wherin R9 of Formula I or Formula I’ is not hydrogen.
- The method of claim 27 or 28, wherein step (d) is performed immediately following step (b) or immediately following step (c) , and wherein the free-acid containing NBD is the NBD of Formula I formed in step (b) or the NBD of Formula VI formed in step (c) .
- A method of using the compound of any one of claims 1-8 for solid phase peptide synthesis.
- The method of claim 30, comprising: mixing the compound with amino acids for forming the peptide.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202363486748P | 2023-02-24 | 2023-02-24 | |
US63/486,748 | 2023-02-24 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024174680A1 true WO2024174680A1 (en) | 2024-08-29 |
Family
ID=92500183
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2023/137784 WO2024174680A1 (en) | 2023-02-24 | 2023-12-11 | Building blocks for difficult peptide synthesis and method of making thereof |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024174680A1 (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102482320A (en) * | 2009-08-12 | 2012-05-30 | 李学臣 | Native chemical ligation at serine and threonine sites |
CN112851761A (en) * | 2019-11-12 | 2021-05-28 | 杭州多普源生物科技有限公司 | Method for preparing plinacandin |
-
2023
- 2023-12-11 WO PCT/CN2023/137784 patent/WO2024174680A1/en unknown
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102482320A (en) * | 2009-08-12 | 2012-05-30 | 李学臣 | Native chemical ligation at serine and threonine sites |
CN112851761A (en) * | 2019-11-12 | 2021-05-28 | 杭州多普源生物科技有限公司 | Method for preparing plinacandin |
Non-Patent Citations (5)
Title |
---|
ASADOLLAHI-BABOLI M., MANI-VARNOSFADERANI A.: "Molecular docking, molecular dynamics simulation, and QSAR model on potent thiazolidine-4-carboxylic acid inhibitors of influenza neuraminidase", MEDICINAL CHEMISTRY RESEARCH, BIRKHAEUSER, BOSTON., US, vol. 22, no. 4, 1 April 2013 (2013-04-01), US , pages 1700 - 1710, XP093203595, ISSN: 1054-2523, DOI: 10.1007/s00044-012-0175-y * |
FREDERIC ZOLLER; ANNETTE MARKERT; PHILIPPE BARTHE; ULRIKE HEBLING; ANNETTE ALTMANN; THOMAS LINDNER; WALTER MIER; UWE HABERKORN: "A Disulfide‐Constrained Miniprotein with Striking Tumor‐Binding Specificity Developed by Ribosome Display", ANGEWANDTE CHEMIE INTERNATIONAL EDITION, VERLAG CHEMIE, HOBOKEN, USA, vol. 52, no. 45, 20 September 2013 (2013-09-20), Hoboken, USA, pages 11760 - 11764, XP072077276, ISSN: 1433-7851, DOI: 10.1002/anie.201304603 * |
FREDERIC ZOLLER; THIMON SCHWAEBEL; ANNETTE MARKERT; UWE HABERKORN; WALTER MIER: "Engineering and Functionalization of the Disulfide‐Constrained Miniprotein Min‐23 as a Scaffold for Diagnostic Application", CHEMMEDCHEM COMMUNICATIONS, WILEY-VCH, DE, vol. 7, no. 2, 23 December 2011 (2011-12-23), DE , pages 237 - 247, XP072423159, ISSN: 1860-7179, DOI: 10.1002/cmdc.201100497 * |
XU YANYAN; QI CHENZE; WANG CHEN: "Mechanism, origin of diastereoselectivity and factors affecting reaction efficiency of serine/threonine ligation: A computational study", TETRAHEDRON, ELSEVIER SIENCE PUBLISHERS, AMSTERDAM, NL, vol. 76, no. 18, 17 March 2020 (2020-03-17), AMSTERDAM, NL , XP086132265, ISSN: 0040-4020, DOI: 10.1016/j.tet.2020.131143 * |
XUECHEN LI, HIU YUNG LAM, YINFENG ZHANG, CHUN KEI CHAN: "Salicylaldehyde Ester-Induced Chemoselective Peptide Ligations: Enabling Generation of Natural Peptidic Linkages at the Serine/Threonine Sites", ORGANIC LETTERS, AMERICAN CHEMICAL SOCIETY, vol. 12, no. 8, 16 April 2010 (2010-04-16), pages 1724 - 1727, XP055076642, ISSN: 15237060, DOI: 10.1021/ol1003109 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5849954A (en) | Method of peptide synthesis | |
Tamiaki et al. | A novel protecting group for constructing combinatorial peptide libraries | |
CN112236436B (en) | Method for synthesizing solution phase peptide and protection strategy thereof | |
EP0098865B1 (en) | Peptide synthesis and amino acid blocking agents | |
US20230242572A1 (en) | a-CARBONYL ALKENYL ESTER PREPARATION METHOD THEREFOR AND APPLICATION THEREOF | |
JPH01308297A (en) | Tetrapeptide | |
KR20230175279A (en) | Compositions for chemical synthesis of peptides | |
WO2024174680A1 (en) | Building blocks for difficult peptide synthesis and method of making thereof | |
JP5445456B2 (en) | Method for removing dibenzofulvene | |
Woo et al. | The use of aryl hydrazide linkers for the solid phase synthesis of chemically modified peptides | |
EP4071135A1 (en) | Etelcalcetide intermediate and method for synthesizing etelcalcetide | |
JP2888991B2 (en) | Nα-2- (4-nitrophenylsulfonyl) ethoxycarbonyl-amino acid | |
WO1990005738A1 (en) | Trialkysilyl esters of amino acids and their use in the synthesis of peptides | |
CZ282881B6 (en) | Process for preparing peptides | |
EP1100812A2 (en) | Resin derivatization method and uses thereof | |
CN105622424A (en) | Compound and preparation method and application thereof | |
Tanaka et al. | A novel oxazolidine linker for the synthesis of peptide aldehydes | |
EP0410182A2 (en) | A new technique for rapid peptide coupling | |
Rodrigues et al. | Synthesis and conformational investigation of tetrapeptide analogues of the fragment B23-B26 of insulin | |
KR100272310B1 (en) | A method for preparation over 1-deamino-8-d-arginine vasopressin acetate | |
CN110330552B (en) | Synthetic method of degarelix acetate | |
TWI778027B (en) | Manufacturing method of amino acid polymer | |
JP2006511459A (en) | Peptide synthesis method | |
KR100385096B1 (en) | Method for producing azapeptide derivatives useful for pharmaceutical synthesis by solid phase reaction with high efficiency | |
Ahmed et al. | 3-(5-NITRO-2-PYRIDON-1-YL)-1, 2-BENZOISOTHIAZOLE 1, 1-DIOXIDE (BID-NPy) AS A NEW EFFECTIVE CONDENSING REAGENT |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23923843 Country of ref document: EP Kind code of ref document: A1 |